-
Efficient Materials Informatics between Rockets and Electrons
Authors:
Adam M. Krajewski
Abstract:
The true power of computational research typically can lay in either what it accomplishes or what it enables others to accomplish. In this work, both avenues are simultaneously embraced across several distinct efforts existing at three general scales of abstractions of what a material is - atomistic, physical, and design. At each, an efficient materials informatics infrastructure is being built fr…
▽ More
The true power of computational research typically can lay in either what it accomplishes or what it enables others to accomplish. In this work, both avenues are simultaneously embraced across several distinct efforts existing at three general scales of abstractions of what a material is - atomistic, physical, and design. At each, an efficient materials informatics infrastructure is being built from the ground up based on (1) the fundamental understanding of the underlying prior knowledge, including the data, (2) deployment routes that take advantage of it, and (3) pathways to extend it in an autonomous or semi-autonomous fashion, while heavily relying on artificial intelligence (AI) to guide well-established DFT-based ab initio and CALPHAD-based thermodynamic methods.
The resulting multi-level discovery infrastructure is highly generalizable as it focuses on encoding problems to solve them easily rather than looking for an existing solution. To showcase it, this dissertation discusses the design of multi-alloy functionally graded materials (FGMs) incorporating ultra-high temperature refractory high entropy alloys (RHEAs) towards gas turbine and jet engine efficiency increase reducing CO2 emissions, as well as hypersonic vehicles. It leverages a new graph representation of underlying mathematical space using a newly developed algorithm based on combinatorics, not subject to many problems troubling the community. Underneath, property models and phase relations are learned from optimized samplings of the largest and highest quality dataset of HEA in the world, called ULTERA. At the atomistic level, a data ecosystem optimized for machine learning (ML) from over 4.5 million relaxed structures, called MPDD, is used to inform experimental observations and improve thermodynamic models by providing stability data enabled by a new efficient featurization framework.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Efficient Structure-Informed Featurization and Property Prediction of Ordered, Dilute, and Random Atomic Structures
Authors:
Adam M. Krajewski,
Jonathan W. Siegel,
Zi-Kui Liu
Abstract:
Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant c…
▽ More
Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant calculations and how their efficiency could be improved by considering (1) fundamentals of crystallographic (orbits) equivalency to optimize ordered cases and (2) representation-dependent equivalency to optimize cases of dilute, doped, and defect structures with broken symmetry. It also discusses and contrasts ways of (3) approximating random solid solutions occupying arbitrary lattices under such representations. Efficiency improvements discussed in this work were implemented within pySIPFENN or python toolset for Structure-Informed Property and Feature Engineering with Neural Networks developed by authors since 2019 and shown to increase performance from 2 to 10 times for typical inputs. Throughout this work, the authors explicitly discuss how these advances can be applied to different kinds of similar tools in the community.
△ Less
Submitted 14 June, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
MaterialsMap: A CALPHAD-Based Tool to Design Composition Pathways through feasibility map for Desired Dissimilar Materials, demonstrated with RSW Joining of Ag-Al-Cu
Authors:
Hui Sun,
Bo Pan,
Zhening Yang,
Adam M. Krajewski,
Brandon Bocklund,
Shun-Li Shang,
Jingjing Li,
Allison M. Beese,
Zi-Kui Liu
Abstract:
Assembly of dissimilar metals can be achieved by different methods, for example, casting, welding, and additive manufacturing (AM). However, undesired phases formed in liquid-phase assembling processes due to solute segregation during solidification diminish mechanical and other properties of the processed parts. In the present work, an open-source software named MaterialsMap, has been developed b…
▽ More
Assembly of dissimilar metals can be achieved by different methods, for example, casting, welding, and additive manufacturing (AM). However, undesired phases formed in liquid-phase assembling processes due to solute segregation during solidification diminish mechanical and other properties of the processed parts. In the present work, an open-source software named MaterialsMap, has been developed based on the CALculation of Phase Diagrams (CALPHAD) approach. The primary objective of MaterialsMap is to facilitate the design of an optimal composition pathway for assembling dissimilar alloys with liquid-phases based on the formation of desired and undesired phases along the pathway. In MaterialsMap, equilibrium thermodynamic calculations are used to predict equilibrium phases formed at slow cooling rate, while Scheil-Gulliver simulations are employed to predict non-equilibrium phases formed during rapid cooling. By combining these two simulations, MaterialsMap offers a thorough guide for understanding phase formation in various manufacturing processes, assisting users in making informed decisions during material selection and production. As a demonstration of this approach, a compositional pathway was designed from pure Al to pure Cu through Ag using MaterialsMap. The design was experimentally verified using resistance spot welding (RSW).
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
nimCSO: A Nim package for Compositional Space Optimization
Authors:
Adam M. Krajewski,
Arindam Debnath,
Wesley F. Reinhart,
Allison M. Beese,
Zi-Kui Liu
Abstract:
nimCSO is a high-performance tool implementing several methods for selecting components (data dimensions) in compositional datasets, which optimize the data availability and density for applications such as machine learning. Making said choice is a combinatorically hard problem for complex compositions existing in highly dimensional spaces due to the interdependency of components being present. Su…
▽ More
nimCSO is a high-performance tool implementing several methods for selecting components (data dimensions) in compositional datasets, which optimize the data availability and density for applications such as machine learning. Making said choice is a combinatorically hard problem for complex compositions existing in highly dimensional spaces due to the interdependency of components being present. Such spaces are encountered, for instance, in materials science, where datasets on Compositionally Complex Materials (CCMs) often span 20-45 chemical elements, 5-10 processing types, and several temperature regimes, for up to 60 total data dimensions.
At its core, nimCSO leverages the metaprogramming ability of the Nim language (nim-lang.org) to optimize itself at the compile time, both in terms of speed and memory handling, to the specific problem statement and dataset at hand based on a human-readable configuration file. As demonstrated in this paper, nimCSO reaches the physical limits of the hardware (L1 cache latency) and can outperform an efficient native Python implementation over 400 times in terms of speed and 50 times in terms of memory usage (not counting interpreter), while also outperforming NumPy implementation 35 and 17 times, respectively, when checking a candidate solution.
It is designed to be both (1) a user-ready tool, implementing two efficient brute-force approaches (for handling up to 25 dimensions), a custom search algorithm (for up to 40 dimensions), and a genetic algorithm (for any dimensionality), and (2) a scaffold for building even more elaborate methods in the future, including heuristics going beyond data availability. All configuration is done with a simple human-readable YAML config file and plain text data files, making it easy to modify the search method and its parameters with no knowledge of programming and only basic command line skills.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Efficient Generation of Grids and Traversal Graphs in Compositional Spaces towards Exploration and Path Planning Exemplified in Materials
Authors:
Adam M. Krajewski,
Allison M. Beese,
Wesley F. Reinhart,
Zi-Kui Liu
Abstract:
Many disciplines of science and engineering deal with problems related to compositions, ranging from chemical compositions in materials science to portfolio compositions in economics. They exist in non-Euclidean simplex spaces, causing many standard tools to be incorrect or inefficient, which is significant in combinatorically or structurally challenging spaces exemplified by Compositionally Compl…
▽ More
Many disciplines of science and engineering deal with problems related to compositions, ranging from chemical compositions in materials science to portfolio compositions in economics. They exist in non-Euclidean simplex spaces, causing many standard tools to be incorrect or inefficient, which is significant in combinatorically or structurally challenging spaces exemplified by Compositionally Complex Materials (CCMs) and Functionally Graded Materials (FGMs). Here, we explore them conceptually in terms of problem spaces and quantitatively in terms of computational feasibility.
This work implements several essential methods specific to the compositional (simplex) spaces through a high-performance open-source library nimplex. Most significantly, we derive and implement an algorithm for constructing a novel n-dimensional simplex graph data structure, which contains all discretized compositions and all possible neighbor-to-neighbor transitions as pointer arrays. Critically, no distance or neighborhood calculations are performed, instead leveraging pure combinatorics and the ordering in procedurally generated simplex grids, keeping the algorithm $\mathcal{O}(N)$, so that graphs with billions of transitions take seconds to construct on a laptop. Furthermore, we demonstrate how such graph representations can be combined to express path-planning problem spaces and to incorporate prior knowledge while keeping the problem space homogeneous. This allows for efficient deployment of existing high-performance gradient descent, graph traversal search, and other path optimization algorithms.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Developments and applications of the OPTIMADE API for materials discovery, design, and data exchange
Authors:
Matthew L. Evans,
Johan Bergsma,
Andrius Merkys,
Casper W. Andersen,
Oskar B. Andersson,
Daniel Beltrán,
Evgeny Blokhin,
Tara M. Boland,
Rubén Castañeda Balderas,
Kamal Choudhary,
Alberto Díaz Díaz,
Rodrigo Domínguez García,
Hagen Eckert,
Kristjan Eimre,
María Elena Fuentes Montero,
Adam M. Krajewski,
Jens Jørgen Mortensen,
José Manuel Nápoles Duarte,
Jacob Pietryga,
Ji Qi,
Felipe de Jesús Trejo Carrillo,
Antanas Vaitkus,
Jusong Yu,
Adam Zettel,
Pedro Baptista de Castro
, et al. (34 additional authors not shown)
Abstract:
The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 relea…
▽ More
The Open Databases Integration for Materials Design (OPTIMADE) application programming interface (API) empowers users with holistic access to a growing federation of databases, enhancing the accessibility and discoverability of materials and chemical data. Since the first release of the OPTIMADE specification (v1.0), the API has undergone significant development, leading to the upcoming v1.2 release, and has underpinned multiple scientific studies. In this work, we highlight the latest features of the API format, accompanying software tools, and provide an update on the implementation of OPTIMADE in contributing materials databases. We end by providing several use cases that demonstrate the utility of the OPTIMADE API in materials research that continue to drive its ongoing development.
△ Less
Submitted 5 April, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Comparing Forward and Inverse Design Paradigms: A Case Study on Refractory High-Entropy Alloys
Authors:
Arindam Debnath,
Lavanya Raman,
Wenjie Li,
Adam M. Krajewski,
Marcia Ahn,
Shuang Lin,
Shunli Shang,
Allison M. Beese,
Zi-Kui Liu,
Wesley F. Reinhart
Abstract:
The rapid design of advanced materials is a topic of great scientific interest. The conventional, ``forward'' paradigm of materials design involves evaluating multiple candidates to determine the best candidate that matches the target properties. However, recent advances in the field of deep learning have given rise to the possibility of an ``inverse'' design paradigm for advanced materials, where…
▽ More
The rapid design of advanced materials is a topic of great scientific interest. The conventional, ``forward'' paradigm of materials design involves evaluating multiple candidates to determine the best candidate that matches the target properties. However, recent advances in the field of deep learning have given rise to the possibility of an ``inverse'' design paradigm for advanced materials, wherein a model provided with the target properties is able to find the best candidate. Being a relatively new concept, there remains a need to systematically evaluate how these two paradigms perform in practical applications. Therefore, the objective of this study is to directly, quantitatively compare the forward and inverse design modeling paradigms. We do so by considering two case studies of refractory high-entropy alloy design with different objectives and constraints and comparing the inverse design method to other forward schemes like localized forward search, high throughput screening, and multi objective optimization.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Generative deep learning as a tool for inverse design of high-entropy refractory alloys
Authors:
Arindam Debnath,
Adam M. Krajewski,
Hui Sun,
Shuang Lin,
Marcia Ahn,
Wenjie Li,
Shanshank Priya,
Jogender Singh,
Shunli Shang,
Allison M. Beese,
Zi-Kui Liu,
Wesley F. Reinhart
Abstract:
Generative deep learning is powering a wave of new innovations in materials design. In this article, we discuss the basic operating principles of these methods and their advantages over rational design through the lens of a case study on refractory high-entropy alloys for ultra-high-temperature applications. We present our computational infrastructure and workflow for the inverse design of new all…
▽ More
Generative deep learning is powering a wave of new innovations in materials design. In this article, we discuss the basic operating principles of these methods and their advantages over rational design through the lens of a case study on refractory high-entropy alloys for ultra-high-temperature applications. We present our computational infrastructure and workflow for the inverse design of new alloys powered by these methods. Our preliminary results show that generative models can learn complex relationships in order to generate novelty on demand, making them a valuable tool for materials informatics.
△ Less
Submitted 31 August, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Extensible Structure-Informed Prediction of Formation Energy with Improved Accuracy and Usability employing Neural Networks
Authors:
Adam M. Krajewski,
Jonathan W. Siegel,
Jinchao Xu,
Zi-Kui Liu
Abstract:
In the present paper, we introduce a new neural network-based tool for the prediction of formation energies of atomic structures based on elemental and structural features of Voronoi-tessellated materials. We provide a concise overview of the connection between the machine learning and the true material-property relationship, how to improve the generalization accuracy by reducing overfitting, how…
▽ More
In the present paper, we introduce a new neural network-based tool for the prediction of formation energies of atomic structures based on elemental and structural features of Voronoi-tessellated materials. We provide a concise overview of the connection between the machine learning and the true material-property relationship, how to improve the generalization accuracy by reducing overfitting, how new data can be incorporated into the model to tune it to a specific material system, and preliminary results on using models to preform local structure relaxations.
The present work resulted in three final models optimized for (1) highest test accuracy on the Open Quantum Materials Database (OQMD), (2) performance in the discovery of new materials, and (3) performance at a low computational cost. On a test set of 21,800 compounds randomly selected from OQMD, they achieve a mean absolute error (MAE) of 28, 40, and 42 meV/atom, respectively. The second model provides better predictions in a test case of interest not present in the OQMD, while the third reduces the computational cost by a factor of 8.
We collect our results in a new open-source tool called SIPFENN (Structure-Informed Prediction of Formation Energy using Neural Networks). SIPFENN not only improves the accuracy beyond existing models but also ships in a ready-to-use form with pre-trained neural networks and a GUI interface. By virtue of this, it can be included in DFT calculations routines at nearly no cost.
△ Less
Submitted 29 December, 2021; v1 submitted 31 August, 2020;
originally announced August 2020.