Introduction

In this era of big data, we are being bombarded with huge volumes of data from a variety of sources (experiments and simulations) at a staggering velocity in practically all fields of science and engineering, and materials science is no exception. This has led to the emergence of the fourth paradigm of science, data-driven science, which builds upon the big data created by the first three paradigms of science (experiment, theory, and simulation). Advanced techniques for data-driven analytics are needed to analyze these data in ways that can help extract meaningful information and knowledge from them, and thus contribute to accelerating materials discovery and realizing the vision of the Materials Genome Initiative (MGI).[1] The fourth paradigm of science utilizes scalable machine learning (ML) and data mining techniques to extract actionable insights from such big data and inform materials design efforts at various levels. Figure 1 depicts the four paradigms of science.[2]

Figure 1

The four paradigms of science in the context of materials. Historically, science has been largely empirical or observational, which is known today as the experimental branch of science. When calculus was invented in the 17th century, it became possible to describe natural phenomena in the form of mathematical equations, marking the beginning of the second paradigm of science, model-based theoretical science. With time, these equations became larger and more complex, and it was only in the 20th century, when computers were invented, that such large and complex theoretical models (systems of equations) became solvable, enabling large-scale simulations of real-world phenomena, which is the third paradigm of science. The last two decades have seen an explosive growth in the generation of data from the first three paradigms, which has far outstripped our capacity to make sense of it. All this collected data can serve as a valuable resource for learning and augmenting the knowledge from the first three paradigms, and has led to the emergence of the fourth paradigm of science, which is (big) data-driven science (reproduced from Ref. 2 under CC-BY license).

Materials science and engineering researchers rely on experiments and simulations to try to understand the processing–structure–property–performance (PSPP) relationships,[2,3] which are far from being well-understood. In fact, almost everything in materials science depends on these PSPP relationships, where the cause–effect relationships of science go from left to right, and the goals–means relationships of engineering go from right to left. In order to discover and design new and improved materials with desired properties, we need to better understand this complex system of PSPP relationships. Figure 2 depicts these PSPP relationships of materials science and engineering.[2]

Figure 2

The PSPP relationships of materials science and engineering, where science flows from left-to-right, and engineering flows from right-to-left. Interestingly, each relationship from left to right is many-to-one. For example, many different processing routes can possibly result in the same structure, and along similar lines, it is also possible that the same property is achieved by multiple material structures. Materials informatics approaches can help decipher these relationships via fast and accurate forward models, which in turn can also help to realize the more difficult inverse models of materials discovery and design (reproduced from Ref. 2 under CC-BY license).

The scalable data-driven techniques[4–8] of the fourth paradigm of science have found numerous applications in diverse fields such as marketing and commerce,[9,10] healthcare,[11,12] climate science,[13,14] bioinformatics,[15,16] social media,[17,18] materials science,[19,20] and cosmology,[21,22] among many others. In particular, over the last few years, deep learning[23] has emerged as a game-changing technique within the arena of data-driven analytics due to its revolutionary success in several traditionally hard artificial intelligence (AI) applications. Deep learning techniques are also increasingly being used for materials informatics applications with remarkable success, which we refer to as deep materials informatics.

In this paper, we discuss some of the recent advances in deep materials informatics for exploring PSPP linkages in materials, after a brief introduction to the basics of deep learning, and its challenges and opportunities. Illustrative examples of deep materials informatics that we review in this paper include learning the chemistry of materials using only elemental composition,[24] structure-aware property prediction,[25,26] crystal structure prediction,[27] learning multi-scale homogenization[28,29] and localization[30] linkages in high-contrast composites, structure characterization[31,32] and quantification,[33,34] and microstructure reconstruction[35] and design.[36] We also discuss the future outlook and envisioned impact of deep learning in materials science before summarizing and concluding the paper.

Deep learning

Deep learning[23] refers to a family of techniques in AI and ML, and is essentially a rediscovery of neural networks that were algorithmically conceptualized back in the 1980s.[37,38] The availability of big data and big compute in recent years has allowed these networks to grow deeper (hence the name deep learning) and realize their promise as universal approximators[39] capable of learning and representing a wide variety of nonlinear functions. Deep learning has indeed emerged as a very powerful method to automate the extraction of useful information from big data, and has enabled ground-breaking advances in numerous fields, such as computer vision[40,41] and speech recognition.[42,43] In the rest of this section, we will briefly describe the unique advantages and limitations of deep learning, followed by the key components of a deep neural network, and finally a few different types of networks being used for deep materials informatics.

Deep learning: advantages and limitations

Deep learning has certain unique characteristics compared to traditional ML techniques, which are crucial for determining whether or not deep learning should be used for a given problem. These characteristics (both advantages and challenges) are depicted in Fig. 3, and described below. There are three primary advantages of deep learning compared to traditional ML methods:

Figure 3

Pros and cons of deep learning. As with any technique, there are advantages and challenges of using deep learning that need to be considered carefully for successful application.

  • Deep learning is largely feature-engineering-free: This is perhaps the biggest advantage of deep learning. It is well-known that the efficacy of a ML model depends heavily on how the data are represented for the ML algorithm to learn patterns from. For a scientific or engineering application, a good representation often entails careful feature engineering, which may require extensive domain knowledge as well as significant manual and intuitive effort to come up with the appropriate attributes. In contrast, deep learning is capable of automatically extracting the relevant features from the training data in a hierarchical fashion, thereby eliminating or at least reducing the effort for feature engineering. Not only does this save the manual effort of having to come up with attributes, but it also opens up opportunities to identify new, non-intuitive features in a truly data-driven manner that might help discover new insights.

  • Deep learning is generally more accurate with big data: Any data-driven ML model is expected to become more accurate with increasing training data, but the accuracy does saturate at some point, after which additional training data does not provide significant accuracy gains. It has been found that although for small data, traditional ML based models are more accurate than deep learning based models, they saturate much sooner, so deep learning based models are usually more accurate when big data is available. This is because of the higher learning capacity possessed by deep neural networks with multiple hidden layers.

  • Deep learning can produce faster predictions: Although training neural networks is computationally expensive, it is only a one-time cost. Once trained properly, they are capable of making very fast predictions.

The above-described advantages of deep learning clearly give it an edge over traditional ML techniques, but it nonetheless has some characteristics that make its application challenging in some cases. There are four major challenges in applying deep learning:

  • Deep learning requires big data: In many cases, the biggest limiting factor for applying deep learning is lack of sufficient training data. As discussed before, deep learning requires big data in general. Although big, curated, and labeled datasets do exist for several problems like image classification,[44] they are still a rarity in many scientific and engineering fields, such as materials science.[2]

  • Deep learning requires big compute: Training deep learning based models is compute-intensive and can take a long time with big data, even on the latest computing hardware. Parallelization of neural network training algorithms is an active area of research.[45,46]

  • Deep learning network architecture search: Since a neural network is essentially a network of interconnected neurons, there are unlimited possibilities of network architectures. Although there are some general guidelines for choosing an architecture for a given problem based on prior successful designs, there are no formal methods to identify the optimal architecture for a given task, and it is an open research problem.[34]

  • Model interpretability: Deep learning based models are generally viewed as black-box models due to being highly complex. Although researchers have tried with some success to systematically study the workings of the neural network, in general they are not as readily interpretable as some of the traditional statistical models like linear regression.[47]

Deep learning: key components and concepts

Artificial neural networks (ANNs) are inspired by biological neural networks in our brains. The fundamental computing unit of ANNs is a neuron, which takes multiple inputs, and outputs a possibly non-linear function (called the activation function) of the weighted sum of its inputs. Several activation functions are commonly used, such as sigmoid, linear, rectified linear unit (ReLU), leaky ReLU, etc. Figure 4 illustrates a fully-connected ANN, also known as a multilayer perceptron (MLP), and the ReLU activation function. A deep learning network is an ANN with two or more hidden layers. The manner in which the neurons are connected amongst themselves determines the architecture of the network. The edges or interconnections between neurons have weights, which are learned during neural network training with the goal of making the ANN output as close as possible to the ground truth, which is technically referred to as minimizing the loss function. The training process involves making a forward pass of the input data through the ANN to get predictions, calculating the errors or loss, and subsequently back-propagating them through the network to update the weights of the interconnections via gradient descent in order to try to make the outputs more accurate. A single pass of the entire training data is called an epoch, and it is repeated iteratively till the weights converge. Usually when the data are large, the forward passes are done with small subsets of the training data (called mini-batches), so an epoch would comprise multiple mini-batch iterations. The inputs of a neural network are generally normalized to have zero mean and unit standard deviation, and the same concept is sometimes applied to the inputs of hidden layers as well (called batch normalization) to improve the stability of ANNs. Another useful and interesting concept in ANNs is that of dropout, where some neurons are randomly turned off during a particular forward or backward pass. It is a regularization technique for reducing overfitting, and also turns out to be a remarkably efficient approximation to multi-model averaging.[48]

Figure 4

A fully-connected deep ANN with four inputs, one output, and five hidden layers with varying number of neurons (left). The ReLU activation function (right).
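To make the above concepts concrete, the following is a minimal, illustrative sketch (assuming Python with TensorFlow/Keras, which are not prescribed by this paper) of a small fully-connected network in the spirit of Fig. 4, trained with mini-batch gradient descent; the layer sizes, toy data, and the `make_mlp` helper are purely illustrative.

```python
# Minimal sketch (illustrative only): a fully-connected network like Fig. 4,
# trained with mini-batch gradient descent, using the concepts described above.
import numpy as np
from tensorflow.keras import layers, models

def make_mlp(n_inputs=4):
    """Fully-connected ANN with hidden layers, ReLU activations,
    batch normalization, and dropout (a regularizer that also
    approximates model averaging)."""
    model = models.Sequential([
        layers.Input(shape=(n_inputs,)),
        layers.Dense(32, activation="relu"),
        layers.BatchNormalization(),   # normalize the inputs of the hidden layer
        layers.Dropout(0.2),           # randomly turn off neurons during training
        layers.Dense(16, activation="relu"),
        layers.Dense(1),               # single regression output
    ])
    # Loss = mean squared error; weights are updated by gradient descent
    # via backpropagation of the loss gradients.
    model.compile(optimizer="adam", loss="mse")
    return model

# Toy data: 4 inputs, 1 target, roughly zero mean / unit variance.
X = np.random.randn(1000, 4).astype("float32")
y = (X[:, 0] * X[:, 1] + np.sin(X[:, 2])).astype("float32")

model = make_mlp()
# One epoch = one full pass over the training data, split into mini-batches.
model.fit(X, y, batch_size=32, epochs=10, validation_split=0.2, verbose=0)
```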

Convolutional neural networks

A convolutional neural network (CNN) is a special kind of deep learning network designed for spatial data such as images, and consists of three types of hidden layers. It captures spatially correlated features in images using convolutional layers. A convolutional layer consists of multiple kernels or filters with trainable weights or parameters. Each filter is applied across the input image as a convolving window to extract abstract features in the form of feature maps, which are used as inputs for the next layer. Pooling layers are usually used after one or more convolutional layers to reduce the size of the feature maps by subsampling them in both spatial dimensions using an aggregation function (such as max, min, or average), which also reduces the number of parameters and helps in controlling over-fitting. After several blocks of convolutional and pooling layers, the outputs are flattened to a long one-dimensional (1-D) vector, to be used as input to one or more fully connected layers to finally give the CNN prediction, which could either be a probability distribution (for classification problems) or a single numerical value (for regression problems). Two-dimensional (2-D) CNNs that work with 2-D input matrices (images) as depicted in Fig. 5 are the most common type of CNNs, but there are other variants such as 1-D and three-dimensional (3-D) CNNs that take 1-D vectors and 3-D matrices respectively as input, and graph CNNs[49,50] that can work with graphs (a collection of nodes and edges) as input.

Figure 5

A CNN with three convolution layers, two pooling layers, and three fully connected layers. It takes a 64 × 64 RGB image (i.e., three channels) as input. The first convolution layer has two filters resulting in a feature map with two channels (depicted in purple and blue). The second convolution layer has three filters, thereby producing a feature map with three channels. It is then followed by a 2 × 2 pooling layer, which reduces the dimensionality of the feature map from 64 × 64 to 32 × 32. This is followed by another convolution layer of five filters, and another pooling layer to reduce the feature map dimension to 16 × 16 (five channels). Next, the feature map is flattened to get a 1-D vector of 16 × 16 × 5 = 1280 values, which is fed into three fully connected layers of 640, 64, and one neuron(s) respectively, finally producing the output value.
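As a concrete illustration, the following sketch (assuming Keras) assembles the architecture described in the Fig. 5 caption; the 3 × 3 kernel size and "same" padding are assumptions, since the caption specifies only the number of filters, the pooling size, and the resulting feature-map dimensions.

```python
# Sketch of the CNN described in the Fig. 5 caption (kernel size and padding
# are assumptions; the caption only gives filter counts and feature-map sizes).
from tensorflow.keras import layers, models

cnn = models.Sequential([
    layers.Input(shape=(64, 64, 3)),                          # 64 x 64 RGB input
    layers.Conv2D(2, 3, padding="same", activation="relu"),   # 2 filters -> 2 channels
    layers.Conv2D(3, 3, padding="same", activation="relu"),   # 3 filters -> 3 channels
    layers.MaxPooling2D(2),                                   # 64x64 -> 32x32
    layers.Conv2D(5, 3, padding="same", activation="relu"),   # 5 filters -> 5 channels
    layers.MaxPooling2D(2),                                   # 32x32 -> 16x16
    layers.Flatten(),                                         # 16*16*5 = 1280 values
    layers.Dense(640, activation="relu"),
    layers.Dense(64, activation="relu"),
    layers.Dense(1),                                          # single output value
])
cnn.summary()
```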

Generative adversarial networks

A generative adversarial network (GAN)[51] is one of the most interesting types of deep learning network architectures to emerge in recent years, and originated in game theory. It consists of not one but two neural networks that are trained simultaneously in a competitive fashion. One of the networks is called a generator, which is essentially an inverse convolutional network, taking in a random noise vector and up-sampling it to a fake image. Then there is a discriminator, which is a standard convolutional network, taking an image as input and down-sampling it to produce a probability, to classify the input image as fake or real. Figure 6 illustrates the concept of GANs. A common analogy often used to describe GANs is that the generator can be thought of like a criminal trying to produce fake currency, and the discriminator like the police whose aim is to identify the currency as fake or real. As the two networks are trained together, they make each other progressively stronger till they achieve the Nash equilibrium.[52] It is not surprising that GANs have found numerous interesting applications in image analysis, such as high-resolution image synthesis,[53] text to image synthesis,[54] image editing,[55] blending,[56] inpainting,[57] etc., as well as in non-image domains, like music generation.[58]

Figure 6

A GAN consists of two neural networks—generator and discriminator, and with proper training, is capable of generating realistic images/data from noise.
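The adversarial training loop described above can be sketched as follows (an illustrative toy example in Keras/TensorFlow, not a production GAN); the layer sizes, the 32 × 32 image size, and the 16-dimensional noise vector are arbitrary choices for this sketch.

```python
# Toy GAN sketch: a generator up-samples a random noise vector into a fake
# image, while a discriminator classifies images as real or fake; the two
# networks are trained adversarially, one step of each per iteration.
import tensorflow as tf
from tensorflow.keras import layers, models

LATENT_DIM = 16

generator = models.Sequential([
    layers.Input(shape=(LATENT_DIM,)),
    layers.Dense(8 * 8 * 32, activation="relu"),
    layers.Reshape((8, 8, 32)),
    layers.Conv2DTranspose(16, 4, strides=2, padding="same", activation="relu"),   # 8 -> 16
    layers.Conv2DTranspose(1, 4, strides=2, padding="same", activation="sigmoid"), # 16 -> 32
])

discriminator = models.Sequential([
    layers.Input(shape=(32, 32, 1)),
    layers.Conv2D(16, 4, strides=2, padding="same", activation="relu"),
    layers.Conv2D(32, 4, strides=2, padding="same", activation="relu"),
    layers.Flatten(),
    layers.Dense(1, activation="sigmoid"),   # probability that the image is real
])

bce = tf.keras.losses.BinaryCrossentropy()
g_opt = tf.keras.optimizers.Adam(1e-4)
d_opt = tf.keras.optimizers.Adam(1e-4)

def train_step(real_images):
    noise = tf.random.normal([tf.shape(real_images)[0], LATENT_DIM])
    with tf.GradientTape() as g_tape, tf.GradientTape() as d_tape:
        fake_images = generator(noise, training=True)
        real_pred = discriminator(real_images, training=True)
        fake_pred = discriminator(fake_images, training=True)
        # Discriminator: label real as 1 and fake as 0. Generator: fool the discriminator.
        d_loss = bce(tf.ones_like(real_pred), real_pred) + bce(tf.zeros_like(fake_pred), fake_pred)
        g_loss = bce(tf.ones_like(fake_pred), fake_pred)
    d_opt.apply_gradients(zip(d_tape.gradient(d_loss, discriminator.trainable_variables),
                              discriminator.trainable_variables))
    g_opt.apply_gradients(zip(g_tape.gradient(g_loss, generator.trainable_variables),
                              generator.trainable_variables))
```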

Illustrative deep materials informatics

We now review some recent applications of deep learning in materials science for understanding PSPP relationships, both in terms of forward models and inverse models. These examples also illustrate the previously discussed unique characteristics of deep learning in the context of materials.

Learning the chemistry of materials from only elemental composition

As described before, one of the biggest advantages of deep learning is that it is feature-engineering-free and capable of working directly on raw inputs without the need for manually engineered features to incorporate domain knowledge. Jha et al.[24] recently demonstrated this on materials composition data by developing a new deep learning network called ElemNet, which takes only the elemental composition of a crystalline compound as input and predicts its formation enthalpy. They used a large simulation dataset of density functional theory (DFT) calculations from the Open Quantum Materials Database (OQMD) for building the deep learning model. The dataset consisted of 275,759 compounds and their corresponding formation enthalpies. Previous studies[59–61] on formation enthalpy prediction have relied on the use of hundreds of composition-derived features called physical attributes (such as average atomic number, average electronegativity, and so on) for constructing ML models, in a bid to provide known chemistry knowledge to the model. However, such a feature extraction step depends heavily on human intuition and domain expertise. Moreover, it may not always be possible to do this step for all problems, as the necessary domain knowledge may not be available or it may be difficult to transform it into quantitative features for ML algorithms to use. Therefore, the authors in Ref. 24 purposely did not provide any domain knowledge to the model in order to investigate how well a model can perform in such a situation. They explored fully-connected networks of different depths and found that the accuracy of the deep learning model rapidly improved with depth up to 17 layers, after which it plateaued. ElemNet, the best-performing 17-layer network, was found to outperform traditional ML algorithms, both with and without physical attributes. The Random Forest model (the best performing traditional ML technique) gave a mean absolute error (MAE) of 0.157 eV/atom using only elemental compositions as features, and 0.071 eV/atom using composition-derived physical attributes as input. In contrast, ElemNet, which only uses elemental compositions as input, was found to give a significantly lower MAE of 0.055 eV/atom. Modeling experiments with different training set sizes revealed that ElemNet performs better than the Random Forest model (even with physical attributes) for all training set sizes greater than 4000, another testament to the superior performance of deep learning models on large datasets. In terms of computation time, ElemNet took significantly longer for training (about 7 h on a GPU for a training set of ~250,000 compounds), but was much faster in terms of prediction time (9.28 s on a CPU and 0.08 s on a GPU for a test set of ~25,000 compounds). ElemNet was also evaluated with two specially designed training-test splits (withholding the Ti–O binary system and Na–Fe–Mn–O quaternary system, respectively) to find that it can also predict the phase diagrams (convex hulls) for unseen materials systems.

In order to understand why ElemNet was performing well, the authors also studied the representation learned by the network, to try to interpret the model, by analyzing the activations produced within the network at different layers for specific inputs provided to the model. It was found that ElemNet self-learns some interesting chemistry, such as element groups (element similarity) in the early layers and charge balance (element interaction) in later layers of the network, although no periodic table information was provided to the model during training. For example, the activations of the first and second layers produced by group-I elements such as Li, Na, K, Rb, and Cs were all clustered together in a straight line (in that order), when projected in a 2-D space using principal component analysis (PCA). Similarly, when binary combinations of group-I/II and group-VI/VII elements were passed through the model, it was found that the charge-balanced and unbalanced compositions tend to cluster separately in the eighth layer of ElemNet. This is consistent with other applications of deep learning, for example, on images, where the initial layers learn simple features such as edges and corners, and then use those features to learn more complex ones, such as shapes, in the next few layers, and so on. The high accuracy and speed of ElemNet allowed the authors to perform combinatorial screening on about half-a-billion compounds in the quaternary space. They found a number of systems with at least one new potential stable compound, including several new compounds that were not in the OQMD but exist in the Inorganic Crystal Structure Database (ICSD), thereby confirming their stability.
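The kind of representation analysis described above can be sketched as follows; the tiny stand-in network and five-element vocabulary below are placeholders (the real ElemNet is a 17-layer network over 86 elements), and this is not the authors' code.

```python
# Illustrative sketch of representation analysis: feed selected compositions
# through a trained network, collect one hidden layer's activations, and
# project them to 2-D with PCA.
import numpy as np
from sklearn.decomposition import PCA
from tensorflow.keras import layers, Model

ELEMENTS = ["Li", "Na", "K", "Rb", "Cs"]   # placeholder vocabulary

def composition_vector(fractions):
    """Encode a composition as a fixed-length vector of element fractions."""
    vec = np.zeros(len(ELEMENTS), dtype="float32")
    for element, frac in fractions.items():
        vec[ELEMENTS.index(element)] = frac
    return vec

# Tiny stand-in for a trained deep network.
inputs = layers.Input(shape=(len(ELEMENTS),))
h1 = layers.Dense(8, activation="relu", name="hidden1")(inputs)
h2 = layers.Dense(8, activation="relu", name="hidden2")(h1)
outputs = layers.Dense(1)(h2)
model = Model(inputs, outputs)

# Probe one hidden layer: its activations for selected inputs.
probe = Model(inputs, model.get_layer("hidden1").output)
pure_elements = np.stack([composition_vector({e: 1.0}) for e in ELEMENTS])
activations = probe.predict(pure_elements, verbose=0)

# Project the activations to 2-D with PCA; for a trained model, chemically
# similar elements are expected to cluster together.
projection = PCA(n_components=2).fit_transform(activations)
print(projection)
```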

Crystal structure aware property prediction

Although composition-based models can be quite accurate, as illustrated in the previous example, the role of structure is critical in materials, as allotropes and polymorphs can have contrasting properties despite having the same composition. Hence, it is also important to build structure-aware models for materials property prediction. There exist a number of studies that use different sets of attributes to represent structure information[62–65] for ML algorithms to build predictive models. Recently, deep learning has also been applied directly to the crystal structure, as discussed next.

Xie and Grossman[25] developed a crystal graph CNN framework to directly learn material properties from the connection of atoms in the crystal. Their approach first represents the crystal structure by a crystal graph where nodes represent the atoms in the unit cell and edges represent the bonds between the atoms, and then builds a CNN on the graph with convolutional layers, fully connected layers, and pooling layers, to automatically extract optimum representations for modeling the target properties. Their database consisted of 46,744 materials from the Materials Project[66] covering 87 elements, 7 lattice systems, and 216 space groups. A simple convolution function using a shared weight matrix for all neighbors of an atom resulted in a MAE of 0.108 eV/atom for formation energy prediction. However, it neglected the differences of interaction strength between neighbors, so they designed a new convolution function taking into account the interaction strength in the form of a learned weight matrix, which gave a much improved MAE of 0.039 eV/atom. The same framework was subsequently applied to other DFT-computed properties from the Materials Project, such as absolute energy, band gap, Fermi energy, bulk moduli, shear moduli, and Poisson ratio. Apart from impressive model accuracies obtained by deep-learning models, their framework also provided for model interpretability to some degree, by removing the fully-connected hidden layers after atom feature vector extraction and directly performing a linear pooling to predict the property. This allowed the model to learn the contribution of different local chemical environments for each atom to the target property, at the cost of a dip in accuracy [MAE of 0.130 eV/atom on 3787 test perovskites (ABX3) with the interpretable model versus 0.099 eV/atom with the full model]. The empirical rules generalized from the perovskites study were found to be consistent with known knowledge, and a combinatorial search leveraging the learned chemical insights led to the discovery of several new perovskites.
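To convey the flavor of a graph convolution on a crystal graph, here is a schematic NumPy sketch of a single update step; it is a generic neighbor-aggregation rule, not the exact convolution function of Ref. 25.

```python
# Schematic sketch of one graph-convolution update on a crystal graph.
# Each node (atom) aggregates feature vectors from its bonded neighbors,
# weighted by learned matrices, and applies a nonlinearity.
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def graph_conv(node_feats, edges, W_self, W_nbr):
    """node_feats: (n_atoms, d) array of atom feature vectors.
    edges: list of (i, j) bonds (both directions listed).
    W_self, W_nbr: learned (d, d) weight matrices."""
    agg = np.zeros_like(node_feats)
    for i, j in edges:                       # sum messages from bonded neighbors
        agg[i] += node_feats[j] @ W_nbr
    return relu(node_feats @ W_self + agg)   # update every atom's features

# Toy example: 3 atoms in a unit cell with 4-dimensional features.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
bonds = [(0, 1), (1, 0), (1, 2), (2, 1)]
x_new = graph_conv(x, bonds, rng.normal(size=(4, 4)), rng.normal(size=(4, 4)))
# After several such layers, atom features are pooled (e.g., averaged) and fed
# into fully connected layers to predict a property such as formation energy.
pooled = x_new.mean(axis=0)
```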

Of course, another way to take structure information into account is to build structure-specific models, i.e., only train on materials of a specific structure class. For example, Ye et al.[26] recently demonstrated that ANNs utilizing just two descriptors (the Pauling electronegativity and ionic radii of constituent elements) can predict DFT formation energies of C3A2D3O12 garnets and ABO3 perovskites with low MAEs of 0.007–0.010 eV/atom and 0.020–0.034 eV/atom, respectively. For mixed garnets, i.e., garnets with more than one type of species in the C, A, and D sites, the authors derived an averaging scheme to model complete cation disorder and a binary encoding scheme to account for the effect of orderings, with minimal loss in accuracy.

Crystal structure prediction

One of the grand challenges in materials science has been crystal structure prediction,[67] much like protein structure prediction in bioinformatics.[68] The problem of crystal structure prediction for a given composition can be decomposed into two primary sub-problems: generation of candidate structures, followed by subsequent evaluation of those structures to identify the most likely one(s). Typically, structure generation approaches use evolutionary algorithms with random initialization,[69,70] which are then evaluated by quantum mechanical methods.[71] Ryan et al.[27] recently presented a remarkable application of deep learning for crystal structure prediction, in particular for crystal structure evaluation. They reformulated the crystal structure prediction problem into that of predicting the likelihoods of individual atomic sites in the structure, thereby approximating the likelihood of a crystal structure to exist by the product of the likelihoods for each element to reside on a specific atomic site.
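The likelihood factorization described above can be expressed compactly as follows (an illustrative sketch, not the authors' implementation), using per-site element probabilities predicted by a model:

```python
# Sketch: the likelihood of a candidate crystal structure is approximated as
# the product over its atomic sites of the model-predicted probability that
# the assigned element occupies that site. Log-likelihoods are summed instead
# of multiplying raw probabilities, for numerical stability.
import numpy as np

def structure_log_likelihood(site_probs, assigned_elements):
    """site_probs: (n_sites, 118) per-site element probabilities.
    assigned_elements: element index assigned to each site."""
    probs = site_probs[np.arange(len(assigned_elements)), assigned_elements]
    return np.sum(np.log(probs + 1e-12))

# Toy usage: 4 sites, random per-site probability distributions over 118 elements.
rng = np.random.default_rng(0)
toy_probs = rng.dirichlet(np.ones(118), size=4)
score = structure_log_likelihood(toy_probs, assigned_elements=[8, 14, 14, 26])
# Candidate structures (e.g., generated by element substitution into known
# structure templates) can then be ranked by this score.
```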

To calculate the element likelihood for a given atomic site (the element prediction problem), the authors in Ref. 27 designed a deep neural network using training data from the ICSD and the Crystallography Open Database, with 704,334 unique crystallographic sites in 51,723 crystal structures. The input representation of atomic sites for model training consisted of multiple perspectives of normalized atomic fingerprints, to capture the local topology around each unique atomic site. This input representation provided several useful characteristics such as translational invariance, fixed dimensionality, and retention of 3-D information, and allowed the model to learn structural topologies rather than crystal structures with a specific scale. The deep neural network itself consisted of three subnetworks. First, a 42-layer convolutional variational autoencoder was used to allow the model to learn its own representation of the atomic fingerprints, which reduced the 3072-dimensional atomic fingerprints to 64-dimensional latent representations. Then, these latent representations were fed into a five-layer sigmoid classifier to predict what combinations of elements were likely to form specific structural topologies. Finally, the resulting likelihoods from the sigmoid classifier were fed into a five-layer auxiliary softmax classifier with batch normalization and 118 output neurons to predict what specific element corresponded to the input, thereby formulating the element prediction problem as a 118-class classification problem. The average error rate on the test set (20% of the entire atomic fingerprint data) was found to be 31%, which is quite impressive for a 118-class problem. Interestingly, most of the errors made by the model were found to be chemically reasonable (e.g., within blocks of 3d and 4f elements). Further, a t-SNE (t-distributed stochastic neighbor embedding, an iterative method for mapping high-dimensional data onto a 2-D or 3-D projection for visualization[72]) embedding of the sigmoid classifier weights for the elements revealed groupings similar to those in the periodic table. It is indeed quite remarkable that the deep learning model was able to learn periodic trends and chemical similarities not explicitly provided to the model while training, simply by learning from raw structural information, just like the ElemNet model[24] did from raw composition information.

Returning to the crystal structure prediction problem, the work in Ref. 27 used the set of known structure types as the starting point for generating new crystal structures, and the above-described deep learning model for crystal structure evaluation. The unique crystallographic sites in 51,723 known structures were used as structural templates for the generator to produce new crystal structures by combinatorial substitution across the series of all elements, thus leading to 623,380 binary and 2,703,834 ternary candidate crystal structures. Coupling this structure generation approach with deep learning based structure evaluation allowed the authors to perform crystal structure prediction. The performance of the structure prediction was evaluated on a holdout test set of 5845 crystal structures, and it was found that the model is able to predict the correct structure as the top-ranked candidate 27% of the time, and that the correct structure is within the top-10 predicted structures 59% of the time, which is an underestimate of its true performance due to the presence of isostructural crystal structures in the candidate list. The authors also presented a case study of the Mn–Ge and Li–Mn–Ge systems, and reported new unique chemical compositions in these systems, with corresponding predicted structure templates. The key takeaway from this work is the demonstrated potential of deep learning models to self-learn chemistry knowledge from purely geometric and topological data, and its application to the important problem of crystal structure prediction.

Multiscale homogenization and localization linkages in high-contrast composites

While the previous examples used deep learning on composition and crystal structure data, in this subsection, we look at some examples of the application of deep learning on 3-D microstructure data of two-phase composites for understanding structure–property linkages, which are often required to be understood across different length scales. In such multi-scale modeling, homogenization refers to the transfer of microstructure information from lower length scales to higher length scales, e.g., prediction of a macroscale property given the microstructure information. Localization deals with the transfer of salient microstructure information from a higher length scale to a lower length scale. For example, when a material is subject to a macroscopic loading condition (like imposed stress or strain rate), localization refers to the manner in which the load gets distributed at the microscale for a given microstructure. Both homogenization and localization linkages are modeled together by numerical approaches like microscale finite element simulations or iterative methods employing Green's functions. In recent years, materials knowledge systems (MKS),[73] which utilize calibrated Green's function-based kernels in a non-iterative series solution, have emerged as a promising approach for understanding localization relationships, as have ML-based methods that rely on feature engineering to capture the local neighborhood of the microstructure.

Yang et al.[28] present a feature-engineering-free deep learning based homogenization solution for predicting macroscale effective stiffness in two-phase composites of contrast 50. Contrast refers to the relative dissimilarity in the property of the two constituent phases; in this case, it is the ratio of the Young's moduli of the two phases. The dataset consisted of 8550 simulated 3-D microstructures of size 51 × 51 × 51, also referred to as microscale volume elements (MVEs). Recall from the PSPP relationships that structure is the cause and property is the effect. Therefore, for a given loading condition, the macroscale property (in this case effective stiffness) depends on the microstructure. The effective stiffness was calculated using micromechanical finite element simulations with periodic boundary conditions. In order to learn homogenization linkages using deep learning, the authors used 3-D CNNs to map microstructure information to effective stiffness. The best deep-learning network was identified as a 14-layer model with five convolution blocks (each consisting of a convolution layer followed by a pooling layer) stacked together, and subsequently followed by two fully connected layers. In total, this network had about 3.8 million trainable parameters. The accuracy of the deep-learning model [MAE of 1.04 GPa (3.10%)] was found to be significantly better than simple physics-based approaches such as the rule of mixtures [MAE of 15.68 GPa (46.66%)] and sophisticated physics-inspired data science approaches that utilize PCA on two-point statistics as input features for regression[74] [MAE of 2.28 GPa (6.79%)]. Another recent related work[29] on using deep learning for homogenization has demonstrated that the filters/kernels learned by the CNN during training can be interpreted as microstructure features that the model learns to be influential for improving the macroscale property of interest. This can be extremely valuable for solving the inverse problem of design exploration, thereby closing the loop and informing materials design.
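A minimal sketch of such a 3-D CNN homogenization model is shown below (assuming Keras); the number of convolution blocks, filter counts, and kernel sizes are illustrative and not those of the 14-layer network in Ref. 28.

```python
# Illustrative 3-D CNN that maps a 51x51x51 two-phase microstructure
# (voxel values 0/1) to a single effective-stiffness value.
from tensorflow.keras import layers, models

def conv_block(filters):
    """One convolution block: a 3-D convolution followed by pooling."""
    return [layers.Conv3D(filters, 3, padding="same", activation="relu"),
            layers.MaxPooling3D(2)]

homogenization_cnn = models.Sequential(
    [layers.Input(shape=(51, 51, 51, 1))]
    + conv_block(8) + conv_block(16) + conv_block(32)   # stacked conv blocks
    + [layers.Flatten(),
       layers.Dense(64, activation="relu"),
       layers.Dense(1)]                                 # effective stiffness (GPa)
)
homogenization_cnn.compile(optimizer="adam", loss="mae")
```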

Yang et al.[30] present a novel feature-engineering-free approach for localization using deep learning. They used two datasets of contrast-10 (2500 MVEs) and contrast-50 (3000 MVEs) of size 21 × 21 × 21 with varying volume fractions and periodic boundary conditions, which were split into training, validation, and test sets. Since each voxel is a data point for the localization problem, a 3-D neighborhood of size 11 × 11 × 11 around the focal voxel was used to represent it. Although 3-D CNNs could be used for this problem as well, the dataset here is almost four orders of magnitude larger, and since 3-D CNNs are much more computationally expensive, the authors in Ref. 30 designed a neat workaround to be able to use 2-D CNNs for localization. They accomplished this by treating the 3-D image of size 11 × 11 × 11 as 11 channels of a 2-D image of size 11 × 11, sliced perpendicular to the maximum principal strain direction. The best performing CNN architecture for this problem consisted of six layers, including two convolution layers and two fully-connected layers. The accuracy of the deep learning model was compared against the MKS method,[73] and two ML-based methods called the single-agent method[75] and the multi-agent method,[76] which is essentially a hierarchical application of the single-agent method. On the contrast-10 dataset, the MKS method resulted in a mean absolute strain error (MASE) of 10.86%, while the single-agent and multi-agent methods gave MASEs of 13.02% and 8.04%, respectively. In contrast, the deep learning CNN model gave a significantly lower MASE of 3.07%. For contrast-50, the MKS method gave a MASE of 26.46% as compared to just 5.71% by the deep learning model. A closer look at what the CNN learned revealed that the influence of neighboring voxels decreases with increasing neighbor level, which is consistent with known domain knowledge.
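The workaround of recasting each 3-D neighborhood as a multi-channel 2-D image can be sketched as follows (illustrative only; the axis convention and periodic wrapping below are assumptions):

```python
# Sketch: treat each 11x11x11 neighborhood of a focal voxel as an 11x11
# image with 11 channels (slices stacked along the assumed loading direction),
# so that a cheaper 2-D CNN can be used instead of a 3-D CNN.
import numpy as np

def neighborhood_to_channels(mve, center, half=5):
    """Extract an 11x11x11 cube around `center` from a periodic MVE and
    reorder it as an 11x11 image with 11 channels."""
    z, y, x = center

    def wrap(c, n):
        # Periodic boundary conditions: indices wrap around the MVE edges.
        return [(c + d) % n for d in range(-half, half + 1)]

    cube = mve[np.ix_(wrap(z, mve.shape[0]), wrap(y, mve.shape[1]), wrap(x, mve.shape[2]))]
    # Move the first spatial axis (assumed loading direction) to the channel axis.
    return np.moveaxis(cube, 0, -1)

mve = np.random.randint(0, 2, size=(21, 21, 21))     # toy two-phase microstructure
sample = neighborhood_to_channels(mve, center=(10, 10, 10))
print(sample.shape)   # (11, 11, 11): an 11x11 "image" with 11 channels
```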

Microstructure characterization and quantification

Materials characterization broadly refers to learning the structural information of a given material, and is one of the fundamental processes to further our understanding of materials.[77] Advances in materials characterization technologies at different time and length scales, such as numerous kinds of microscopy, spectroscopy, and macroscopic testing, have led to a proliferation of materials image data, which has motivated the use of deep learning to solve this inverse characterization problem.

Electron backscatter diffraction (EBSD) is one of the many materials imaging tools used to determine the crystal orientation of crystalline materials, which can be represented by the three Euler angles 〈ϕ1,Φ,ϕ2〉. The inverse structure characterization problem of determining the orientation angles given an EBSD pattern is called EBSD indexing. The commercially available method for EBSD indexing is the Hough transform based method,[78] which is quite effective in general, but susceptible to the presence of noise in the patterns. In recent years, a new method called dictionary based indexing[79] has been developed, which is essentially a nearest neighbor search approach, where the output angles correspond to the orientation angles of the closest EBSD pattern present in a pre-computed high-resolution dictionary. This method is very robust to noise, but computationally very expensive, as the input EBSD pattern needs to be compared to every pattern in the dictionary. Liu et al.[31] presented the first application of deep learning (CNNs) for indexing EBSD patterns using a dictionary of 333,227 simulated EBSD patterns (60 × 60 gray scale images), out of which 300,000 were used for training and the rest for testing. Although the CNN results were found to be more accurate than the dictionary method, this work had two significant limitations. First, not using the entire dictionary for training is suboptimal and thus would underestimate the accuracy of both the dictionary method and the CNN. Second, they created three separate models for the three Euler angles, thereby treating them as independent, which is not true, as they are actually partial representations of the same orientation. Therefore, rather than individually minimizing the difference between the three actual and predicted Euler angles, the model should really be trying to minimize the one angle between the corresponding orientations, which is technically called disorientation. Jha et al.[32] recently presented a new deep learning solution for EBSD indexing overcoming these limitations. They used two dictionaries consisting of a total of 374,852 EBSD patterns for training the model, and an independent set of 1000 simulated EBSD patterns with known orientations for testing. Here the authors optimized the mean disorientation error between the predicted and true orientations as the loss function for CNN training, which posed two challenges. First, the disorientation metric is computationally expensive, as one needs to compute 24 × 24 symmetrically equivalent orientation pairs. Second, the presence of crystal symmetries introduces degeneracies in the orientation space, resulting in discontinuities in the gradients of the disorientation loss function, thereby rendering it inappropriate for optimization using stochastic gradient descent. To overcome the above challenges, the authors designed a differentiable approximation to the original disorientation function by building a computational tensor graph in TensorFlow,[80] leveraging its auto-differentiation support. The CNN consisted of eight convolution layers, two pooling layers, and five fully connected layers, making it a 17-layer network with about 200 million parameters. In terms of accuracy, the deep learning model outperformed dictionary-based indexing by 16% (mean disorientation of 0.548° versus 0.652° for the dictionary method).
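To convey the idea of optimizing an orientation-aware loss, here is a much-simplified sketch of a differentiable misorientation loss in TensorFlow; it uses a quaternion representation rather than Euler angles and omits the 24 × 24 crystal-symmetry enumeration, so it is not the disorientation approximation of Ref. 32.

```python
# Much-simplified sketch of an orientation-aware loss: the mean rotation angle
# between predicted and true orientations, represented here as unit quaternions.
# NOTE: this ignores the 24 x 24 symmetry-equivalent orientation pairs that the
# actual disorientation metric minimizes over.
import tensorflow as tf

def misorientation_loss(q_true, q_pred):
    q_pred = tf.math.l2_normalize(q_pred, axis=-1)
    dot = tf.abs(tf.reduce_sum(q_true * q_pred, axis=-1))
    # Clip to keep acos (and its gradient) well-defined near |dot| = 1.
    angle = 2.0 * tf.acos(tf.clip_by_value(dot, 0.0, 1.0 - 1e-7))
    return tf.reduce_mean(angle)

# Usage with a Keras model that predicts 4-component quaternions:
# model.compile(optimizer="adam", loss=misorientation_loss)
```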

DeCost et al.[33] present a deep learning solution for quantitative microstructure analysis for ultrahigh carbon steel, by building models for two microstructure segmentation tasks: (i) semantic segmentation of steel micrographs into four regions (grain boundary carbide, spheroidized particle matrix, particle-free grain boundary denuded zone, and Widmanstätten cementite); and (ii) segmenting cementite particles within the spheroidized particle matrix. Unlike image classification, segmentation is a pixel-level task, and thus the CNN needs to produce a latent representation of each pixel instead of the entire image. The authors used the PixelNet[81] architecture for this purpose, where each pixel is represented by the concatenation of its representations in each convolutional layer, obtained by applying bilinear interpolation to the intermediate feature maps, yielding a hypercolumn feature vector for each pixel. Subsequently, an MLP is used to map the hypercolumn pixel features to the corresponding target, i.e., segmentation classes. They used the pretrained VGG16 network[82] (trained on the ImageNet database[44]) for the convolutional layers of PixelNet, and trained the MLP layers from scratch, with batch normalization, dropout, weight decay regularization, and data augmentation. Two kinds of loss functions were evaluated: the standard cross-entropy classification loss and the focal loss, which extends the cross-entropy classification loss function with a modulating factor and a scaling parameter to account for model confidence and class imbalance, respectively. The dataset consisted of 24 ultrahigh carbon steel micrographs at a resolution of 645 × 484. Sixfold cross validation was used to evaluate the models in terms of segmentation accuracy as well as comparison of actual and predicted distributions of particle size and denuded zone widths. The segmentation model with the focal loss function was found to be the most accurate for spheroidite and particle segmentation. However, most of the predicted particle size distributions were found to differ from those from human-annotated micrographs, as the model failed to identify small particles with radii smaller than 5 pixels, indicating the need for higher-quality input for training. This work nonetheless demonstrated the efficacy of deep learning for microstructural segmentation and quantitative analysis of complex microstructures at a high level of abstraction.
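The hypercolumn idea underlying PixelNet can be sketched as follows (assuming Keras/TensorFlow with a pretrained VGG16 backbone); the choice of tapped layers and the 224 × 224 input size are illustrative assumptions.

```python
# Schematic hypercolumn features: intermediate feature maps of a pretrained CNN
# are bilinearly upsampled to the input resolution and concatenated per pixel;
# a small MLP then classifies each pixel (MLP omitted here).
import tensorflow as tf
from tensorflow.keras.applications import VGG16
from tensorflow.keras import Model

H, W = 224, 224
backbone = VGG16(weights="imagenet", include_top=False, input_shape=(H, W, 3))
tap_names = ["block1_conv2", "block2_conv2", "block3_conv3"]   # illustrative choice
taps = Model(backbone.input, [backbone.get_layer(n).output for n in tap_names])

def hypercolumns(images):
    """Per-pixel feature vectors of shape (batch, H, W, total channels)."""
    feature_maps = taps(images)
    upsampled = [tf.image.resize(f, (H, W), method="bilinear") for f in feature_maps]
    return tf.concat(upsampled, axis=-1)

x = tf.random.uniform((1, H, W, 3))
cols = hypercolumns(x)     # these per-pixel features feed a small MLP classifier
print(cols.shape)
```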

Patton et al.[34] recently presented a 167-petaflop projected (152.5-petaflop measured; a petaflop is a unit of computing speed, equaling 10^15 floating point operations per second) deep learning system called MENNDL to automate raw electron microscopy image based atomic defect identification and analysis on a supercomputer using 4200 nodes (with six GPUs per node). It intelligently generates and evaluates millions of deep neural networks with varying architectures and hyperparameters using a scalable, parallel, asynchronous genetic algorithm augmented with a support vector machine to automatically find the best performing network, all in a matter of hours, which is much faster than a human expert can do. The resulting deep learning network also allowed the authors to create a library of defects, map chemical transformation pathways at the atomic level, including detailed transition probabilities, and explore subtle distortions in the local atomic environment around the defects of interest. Further, it also lets the computer automatically choose the best region in the sample to make a measurement or perform atomic manipulations without human supervision, which is a critical step toward enabling an autonomous (self-driving) microscope.

Microstructure reconstruction and design

Reconstruction of the structure of a disordered heterogeneous material using limited structural information about the original system remains an important problem in the modeling of heterogeneous materials.[83] Li et al.[35] developed a deep transfer learning based approach for reconstructing statistically equivalent microstructures from arbitrary material systems based on a single given microstructure. In their approach, the input microstructure with k labeled material phases is first encoded to a three-channel (RGB) representation to make it amenable to be used as an input to a pruned version of a pretrained CNN called VGG19.[82] At the same time, another randomly initialized RGB image (which would iteratively be updated to become the encoded microstructure reconstruction) is also passed through the pruned VGG19 network. The loss function to be minimized is the difference between the Gram matrices (a measure of the texture of an image)[84] of the activations of the original and reconstructed microstructures, summed over selected convolutional layers of the pruned VGG19. During neural network training, typically the gradient of the loss function with respect to the network weights is calculated to iteratively optimize the weights and refine the model. But here, the weights of the network are kept constant, and the pixel values of the randomly initialized reconstructed microstructure are the variables to be optimized. Therefore, the gradient of the loss with respect to each pixel in the reconstructed microstructure is computed via backpropagation, and is subsequently fed into a nonlinear optimization algorithm to iteratively update the pixel values of the microstructure reconstruction, till it converges. The converged reconstruction at this point is still encoded though, and thus is subsequently decoded using k-means clustering to separate the pixels into k groups, where k is the number of material phases in the original microstructure. Since this process pipeline ending with k-means clustering does not enforce retention of the original volume fraction (relative ratio of different phases), it is possible that the volume fractions are slightly different from the original microstructure, which is not desirable. Therefore, the authors employed another post-processing step using simulated annealing to switch the phase label of some of the boundary pixels in order to match the phase volume fractions in the original and reconstructed microstructures. The approach was successfully tested on a wide variety of structural morphologies (carbonate, polymer composites, sandstone, ceramics, a block copolymer, a metallic alloy, and three-phase rubber composites) and found to outperform other approaches (decision tree based synthesis,[85] Gaussian random field (GRF),[86] two-point correlation,[87] and physical descriptor[88]) in four out of the five material systems.
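The core optimization loop described above resembles neural style transfer; a simplified sketch (assuming Keras/TensorFlow, with an illustrative choice of layers and optimizer, and without the VGG-specific input preprocessing) is shown below.

```python
# Simplified sketch: the network weights stay fixed, the Gram matrices of
# selected layer activations define a texture loss, and gradients are taken
# with respect to the pixels of the reconstruction, which are then updated.
import tensorflow as tf
from tensorflow.keras.applications import VGG19
from tensorflow.keras import Model

backbone = VGG19(weights="imagenet", include_top=False)
layer_names = ["block1_conv1", "block2_conv1", "block3_conv1"]   # illustrative layers
taps = Model(backbone.input, [backbone.get_layer(n).output for n in layer_names])

def gram(features):
    """Gram matrix of a (1, H, W, C) feature map: a measure of texture."""
    f = tf.reshape(features, (-1, features.shape[-1]))
    return tf.matmul(f, f, transpose_a=True) / tf.cast(tf.shape(f)[0], tf.float32)

target = tf.random.uniform((1, 128, 128, 3))            # stands in for the encoded original
target_grams = [gram(f) for f in taps(target)]

recon = tf.Variable(tf.random.uniform((1, 128, 128, 3)))  # pixels to be optimized
opt = tf.keras.optimizers.Adam(0.02)

for step in range(200):
    with tf.GradientTape() as tape:
        recon_grams = [gram(f) for f in taps(recon)]
        loss = tf.add_n([tf.reduce_mean((g - tg) ** 2)
                         for g, tg in zip(recon_grams, target_grams)])
    opt.apply_gradients([(tape.gradient(loss, recon), recon)])
    recon.assign(tf.clip_by_value(recon, 0.0, 1.0))      # keep pixel values valid
# The converged reconstruction is then decoded (k-means over pixels) and
# post-processed (simulated annealing) to match phase volume fractions.
```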

Microstructure design is an important inverse problem for materials design. One of the key tasks for this purpose is to identify suitable microstructure representations that could be used for design. Yang and Li et al.[36] have recently developed a deep adversarial learning methodology for an end-to-end solution for low-dimensional and non-linear embedding of microstructures for microstructural materials design using GANs. GAN-based microstructure design is able to capture complex, non-linear microstructure characteristics owing to the large model capacity offered by deep learning networks, and learn the mapping between latent variables (input noise vector) and microstructures. Subsequently, the low-dimensional latent variables can serve as design variables. The GAN was trained on 5000 synthetic microstructure images of size 128 × 128 created using the GRF method. The GAN generator (discriminator) consisted of five layers with the size of the feature maps doubling (halving) along each dimension at each layer. Therefore, the 128 × 128 images are reduced by a factor of 2^5 (= 32) in each dimension, thus converting them to a 4 × 4 latent variable matrix, or a 16-dimensional latent variable vector. Once the GAN was trained, it was able to generate new microstructure images simply by randomly sampling the latent variable vector and passing it through the generator. Not only did the generated microstructure images look visually similar to the real images, but they were also confirmed to be similar in terms of the two-point correlation function and lineal-path correlation function. To evaluate the capability of the trained GAN for microstructure optimization and design, it was coupled with a Bayesian optimization approach to search for the optimal microstructure representation (in terms of the latent variable vector), along with rigorous coupled wave analysis to simulate the optical absorption performance of a given microstructure. Results indicated that the optical performance of the GAN-generated microstructures (even without Bayesian optimization) was 4.8% better than that of randomly sampled microstructures, and that of the optimized microstructure (with Bayesian optimization) was 17.2% better than that of randomly sampled microstructures, thereby verifying the effectiveness of the design optimization framework. In addition to the demonstrated capability of generating realistic microstructure images as well as microstructure optimization and design, the authors report a couple of other desirable features of the developed GAN model. These include the ability of the trained generator to generate arbitrary sized microstructures by changing the size of the latent variables (scalability), and the ability of the discriminator to be used as a pre-trained model for developing structure–property prediction models (transferability).

Future outlook and impact

Deep learning is a fast-growing field that has attracted a lot of attention, with fascinating algorithmic advances being introduced at an incredible pace. In this section, we discuss some other crucial facets of deep learning in the context of materials informatics, which are expected to shape the growing impact of data-driven approaches in materials science.

Other types of deep learning networks

In addition to the different kinds of deep learning neural networks (such as MLPs, CNNs, and GANs) we have seen so far in this paper, there are several others that are capable of analyzing other forms of data. For example, recurrent neural networks are designed to work with sequence data (also known as temporal or time-series data) of varying lengths, with the most popular applications in speech recognition[89] and natural language processing,[90] as well as some recent applications in materials informatics.[91,92] A relatively new class of deep learning, called geometric deep learning, is capable of dealing with non-Euclidean data, such as graphs with nodes and edges, where standard deep learning kernels like convolution are not well-defined. Due to its ability to work with graph data, it has found applications in quantum chemistry,[93,94] in particular for analyzing data from molecular dynamics simulations.

Transfer learning

As discussed before, deep learning generally requires big data, but transfer learning can enable the application of deep learning for problems where big data are not available, by transferring knowledge from a deep learning model built on big data for a different but related problem, to build a new model on the smaller data for the problem at hand. Transfer learning is expected to be very useful for materials informatics, since most of the materials datasets are usually small in size, compared to the big data available in some other domains such as social media, bioinformatics, cosmology, etc. It is widely used in image classification, where pre-trained deep learning models built on ImageNet[44] (a large image database of more than 14 million labeled images) like VGG[82] are used to extract key features from small image datasets and build ML models.[95]
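A minimal transfer-learning sketch of this kind of workflow (assuming Keras with a pretrained VGG16, as a generic illustration rather than a specific materials pipeline) is:

```python
# Use a VGG16 network pretrained on ImageNet as a fixed feature extractor for
# a small image dataset, and train only a small classifier head on top.
from tensorflow.keras.applications import VGG16
from tensorflow.keras import layers, models

base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False                      # keep the pretrained features fixed

model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(64, activation="relu"),
    layers.Dense(1, activation="sigmoid"),  # e.g., a binary classification task
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
# model.fit(small_X, small_y, ...)          # only the new head's weights are learned
```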

Uncertainty quantification and active learning

Uncertainty quantification (UQ) for predictive analytics is an important topic, and one that is expected to gain more and more attention in materials informatics in the coming years. It essentially means the ability to identify calibrated uncertainty estimates for the predictions made by a ML model, by trying to capture the expected variance in the response within a specified confidence interval. A commonly used methodology for UQ that has also been used in materials informatics is an ensemble data mining approach,[96] where multiple predictive models are constructed for the same task by using different techniques on the same data and/or the same technique on different subsets of the training data, and the final prediction is calculated as a function of the individual predictions, such as mean. In such cases, the uncertainty can be quantified using the standard deviation of the individual predictions, possibly combined with the model prediction error. Deep learning models provide an alternate way to do the same without having to build multiple models. Recall that dropout during neural network training randomly shuts down some neurons, thereby helping the model to become more generalizable, and it can also approximate model ensembling.[48] While making predictions from the model (testing phase), dropout again randomly drops some neurons, so the same input to the model would generate (slightly) different predictions every time the model is run with dropout. The resulting set of predictions can be used to calculate uncertainty estimates, just like with ensemble learning models or using more sophisticated methods.[97]
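A hedged sketch of this dropout-based uncertainty estimation (often called Monte Carlo dropout), assuming a Keras model that already contains dropout layers, is:

```python
# Monte Carlo dropout: keep dropout active at prediction time, run the same
# input through the model several times, and use the spread of the predictions
# as an uncertainty estimate.
import numpy as np

def predict_with_uncertainty(model, x, n_samples=50):
    """Mean prediction and its standard deviation across stochastic forward
    passes with dropout enabled (training=True)."""
    preds = np.stack([model(x, training=True).numpy() for _ in range(n_samples)])
    return preds.mean(axis=0), preds.std(axis=0)

# mean, std = predict_with_uncertainty(trained_model, X_candidates)
```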

UQ also holds the key for active learning and reinforcement learning, where the predictive model is used to recommend which new unlabeled inputs should be labeled next (based on their predicted value and associated uncertainties) in order to have the greatest improvement in the accuracy of the model, or to take a suitable action in a given situation that would maximize the reward. Interestingly, active learning has a direct application for data-driven materials discovery, where based on a given experimental or simulation materials dataset, it can recommend which experiment or simulation should be done next to further improve the predictive models. The new improved models can again be used with active learning to identify the next best experiment or simulation, and so on. In this way, it can significantly reduce the number of experiments needed to discover the optimal material with a target property of interest, thereby accelerating the inverse models of materials discovery and design.
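Building on the uncertainty sketch above, a minimal active-learning step could look like the following; the maximum-uncertainty acquisition rule is one common illustrative choice, not a prescription from the text.

```python
# Uncertainty sampling: recommend the candidate experiment/simulation whose
# prediction is most uncertain, label it, retrain, and repeat.
import numpy as np

def next_experiment(model, candidates, n_samples=50):
    mean, std = predict_with_uncertainty(model, candidates, n_samples)
    return int(np.argmax(std))     # index of the most uncertain candidate

# idx = next_experiment(trained_model, X_unlabeled)
# Run that experiment/simulation, add the new labeled point, retrain, repeat.
```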

Model interpretability

The issue of model interpretability has always been a crucial one for applications such as financial modeling, autonomous driving, and materials engineering, where the cost of a false positive can be immeasurably large. In such settings, it is critical to ensure that the model is not just quantitatively accurate but is in fact learning from the correct features and learning things that make sense, or at least not learning something known to be false. Lipton[98] presents one of the first attempts toward a comprehensive taxonomy of the desiderata and methods in interpretability research, and identifies transparency to humans and post-hoc explanations as two primary and competing notions of interpretability. The transparency aspect of interpretability relates to understanding how the model works, in terms of the training algorithm, the intuitive significance of individual model inputs and parameters, and the overall working of the trained model. There is usually a trade-off between model transparency and model complexity, and Lipton[98] suggests that no model is intrinsically interpretable; e.g., even linear models with highly engineered and complex inputs, deep decision trees, ensembles, etc. could be considered less transparent than comparatively compact neural networks with raw or lightly processed inputs. Post-hoc explanations, on the other hand, relate to understanding what the model has learned. Since deep learning models learn rich representations, they are especially amenable to post-hoc interpretability, which can be done via visualization techniques such as t-SNE[72] to visualize their latent representations in 2-D, or saliency maps[99] to identify the regions of the input that influence the output the most. Further, for a given test example, its low-dimensional latent representation (e.g., activations of the hidden layers) can be used to identify its k-nearest neighbors in the training set, so as to explain the decisions of the model by reporting other similar examples,[98] providing another way of interpreting deep learning models. Several other methods and guidelines for understanding deep learning networks are available.[100]

Potential long-term impact

Materials are fundamental building blocks of a civilization. The advancement of our society relies on the development of better, safer, more efficient, cost-effective, and environment-friendly materials. Deep materials informatics approaches have the potential to be game changing for materials scientists and industry, by assisting researchers to navigate through the practically infinite space of possible materials and identify a few most promising ones, which can then be evaluated with appropriate simulations and experiments, thereby significantly reducing costs and accelerating the discovery and deployment of advanced materials. Deep materials informatics therefore provides remarkable promise to accelerate the discovery and design of next generation materials in a cost-effective manner, and thus realize the vision of MGI. Illustrative real-world applications that could potentially be impacted by deep materials informatics include construction, automobile, clean energy, aerospace, healthcare, transportation, and so on.

Summary and conclusion

Materials informatics is a rapidly emerging field but still in its early stages, similar to what bioinformatics was about 20 years ago,[2] and this is even more true for deep materials informatics, the application of deep learning in materials science. In this paper, we discussed some of the recent advances in deep materials informatics on a variety of materials data, such as elemental composition, crystal structure, and 2-D/3-D microstructure images. The fundamental concepts in deep learning, its advantages and challenges, popular types of deep learning networks, and the future outlook and impact of deep materials informatics were also discussed. The increasing availability of materials databases and big data in general, along with groundbreaking advances in data science and deep learning approaches, offers a lot of promise to revolutionize materials property prediction, discovery, design, and deployment of next-generation materials.