Inverse Design of Materials by Machine Learning

Wang, Jia; Wang, Yingxue; Chen, Yanan

doi:10.3390/ma15051811

Open AccessReview

Inverse Design of Materials by Machine Learning

by

Jia Wang

¹,

Yingxue Wang

^2,* and

Yanan Chen

³

¹

School of Space and Environment, Beihang University, Beijing 102206, China

²

National Engineering Laboratory for Risk Perception and Prevention, Beijing 100081, China

³

School of Materials Science and Engineering, Tianjin University, Tianjin 300072, China

^*

Author to whom correspondence should be addressed.

Materials 2022, 15(5), 1811; https://doi.org/10.3390/ma15051811

Submission received: 23 December 2021 / Revised: 13 February 2022 / Accepted: 24 February 2022 / Published: 28 February 2022

(This article belongs to the Special Issue Advanced Energy Storage Materials: Preparation, Characterization and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

It is safe to say that every invention that has changed the world has depended on materials. At present, the demand for the development of materials and the invention or design of new materials is becoming more and more urgent since peoples’ current production and lifestyle needs must be changed to help mitigate the climate. Structure-property relationships are a vital paradigm in materials science. However, these relationships are often nonlinear, and the pattern is likely to change with length scales and time scales, posing a huge challenge. With the development of physics, statistics, computer science, etc., machine learning offers the opportunity to systematically find new materials. Especially by inverse design based on machine learning, one can make use of the existing knowledge without attempting mathematical inversion of the relevant integrated differential equation of the electronic structure but by using backpropagation to overcome local minimax traps and perform a fast calculation of the gradient information for a target function concerning the design variable to find the optimizations. The methodologies have been applied to various materials including polymers, photonics, inorganic materials, porous materials, 2-D materials, etc. Different types of design problems require different approaches, for which many algorithms and optimization approaches have been demonstrated in different scenarios. In this mini-review, we will not specifically sum up machine learning methodologies, but will provide a more material perspective and summarize some cut-edging studies.

Keywords:

inverse design; materials design; machine learning; polymer; photonic; inorganic materials; porous materials

1. Introduction

The revolution of materials gave name to different eras of civilization [1,2]. One of the hallmarks of industrialized society is our increasing extravagance in the use of materials. At the same time, the development of other fields enables a deeper understanding of the basis of materials for creating new materials. The enlargement of materials demand, not only in quantity but also in quality, has forced people to explore ways to use existing materials more efficiently, to seek a wide range of new substances as raw materials, to find a way to recycle the waste materials, and to create new materials for specific purposes. The guiding ideology of materials innovation has experienced four paradigms [3]. First, materials innovation relied on empirical trial and error method. Along with the development of mathematics, chemistry, and physics, it came to the second paradigm where people followed scientific laws. The invention of the computer stimulated its application in the scientific field, leading to computational chemistry with computer simulations such as the appearance of Gaussian 70, which can perform ab initio calculations, density functional theory (DFT)-based method, etc. [4,5]. Data-to-knowledge is becoming a new promising solution in materials science as its fourth paradigm by unifying the above three paradigms methodologies in the aspects of theory, experiments, and computer simulation [6]. The powerful fundamental knowledge of materials properties and advanced instruments enables the generation of “big data” and its application of data-driven techniques including data mining, cluster analysis, predictive analytics, genetic programming, visualization of materials dataset, machine learning (ML), business intelligence, learning, and intelligent optimization, etc. [7,8]. These methods have been successfully applied to materials design [9,10,11], chemical synthesis [12,13], and molecular simulations [14,15]. In particular, the emergence of contemporary artificial-intelligence methods and statistic communities [16,17] has provided an astounding new approach to material science and engineering and given birth to the discipline “materials informatics”.

The term “machine learning” was proposed by Samuel in 1959 [18]. ML, with its characteristics of low computational cost and short development cycle, combined with high-quality training data, as well as processing algorithmic methods, allowed for high through-put prediction of experiments of computations [19]. With the possibility of bypassing the solution of complex equations, ML can determine the properties, structure-property relationships, and other data directly through data analysis. For instance, applications in energy, geometry, and the curvature of the potential energy surfaces of molecules have been reported [19]. For materials design, although the multi-objective design requirements and high dimensionality of microstructure space cannot be accomplished by traditional search-based optimization with high efficiency and accuracy in a limited time scale, computational materials design involving ML can successfully provide an accurate way for the nonlinear multi-scale methods to simulate, predict, and select innovative materials [3,20]. Numerous datasets of molecules and materials and their structure-property relationships enable the “learning” and “predicting” of new materials with desirable traits [21]. Furthermore, the use of ML rational resignation is believed to be the most efficient way to replace repetitive lab labor.

Many ML models enable applications in structure determination (phase diagram determination and crystal structure prediction), performance prediction, and fingerprint (descriptor) prediction [22]. Compared to physical-based modeling tools, ML-based density functional theory, molecular dynamics, or the finite element method can offer fast high throughput screening for complex materials analysis, discovery, prediction, and design problems. The ML approaches bring the materials design with a solution to the inherent complexity of searching the vast options [23]. In the materials related ML applications, the principal components are materials descriptors (configuration, topology, fingerprint, etc.) for mapping an input space to an out space, ML algorithms for model training, and optimization process for determining promising candidates [24]. With the development in the above techniques, in recent years ML has been applied to discover new materials from different perspectives, including structure-oriented design (such as the design for polymers, where the chemical composition of the material is predicted from the demanding structure [25]); element-oriented design, where the structure of new compounds is predicted using the composition as input [26]; inverse design, where the target functionality or property is taken as input and the corresponding molecular structure is found out, which generally presented using generative inverse design networks; and drug design, where the structure-activity relationship is accurately predicted using the data from a large number of in vivo functions of small molecules [27]. Currently, the requirement for green chemistry development to mitigate climate change is increasing dramatically. This is a big topic that requires a lot of interdisciplinary, collaborative work. Under such circumstances, materials must revolutionize toward low-carbon or even carbon-free scenarios, which require modified materials or new materials with known functionalities. Moreover, novel materials with unique properties are desperately needed in clean energy technologies [28,29,30,31,32]. The inverse design-based high throughput ML method seems to be a promising area to address materials discovery and materials design. In general, ML-based inverse design uses backpropagation to overcome local minimax traps and performs a quick calculation of the gradient information for a target function concerning the design variable to find the optimizations. In this mini-review, inverse design based on ML and their cut-edging application in several important materials have been reviewed in a limited way.

2. Inverse Design

The general molecular design is a nonlinear optimization [33], in which the wave functions, energy eigenvalues, and properties are theoretically explained after the materials designed with unknown molecular structure beforehand by trying experimental optimization [34]. In the so-called direct design (Figure 1), the inputs are the ACS information such as constituent atoms, composition, and structure information database, and the outputs are the properties [35]. In the inverse method of design, one optimizes the properties by varying the wave function coefficients, which then leads to an interpretation of the molecular structure [34]. Inverse design starts from desired properties as “input” and ends in chemical space as “output”, as opposed to the direct approach that leads from the chemical space to the properties [36]. In this way, inverse design (Figure 1) indicates a process that starts with the target functionality, and then the corresponding molecular structure can be mapped to navigate the deliberate chemical application.

Different types of design problems require different approaches. Zunger emphasized three modalities of inverse design: searching artificial superstructures with target functionality, searching the space of chemical compounds for target functionality, and exploring missing compounds for target functionality [35]. The inverse design is usually processed by solving an optimization problem to map a target set of material properties to a subdomain of specific materials, which indicates lengthy calculation in high-dimensional space. To address the above, genetic algorithms (searching the space step by step) and adjoint method (mathematically reversing the equations) are usually used. For example, genetic algorithms or Bayesian framework, etc. can be used through an iterative algorithm [37]. However, inverse design for materials suffers from the extremely vast search space and the requirements for the property evaluation of each sequence [38]. Besides, the inverse design problem is inherently ill-posed or weakly conditioned; when a property or functionality is targeted, there would have a bunch of different types of materials that can satisfy the requirement, which is controversial to “optimize”. To address this problem, methods such as limiting the search space, projecting the search space to a low-dimensional space, using an annealing algorithm, etc. have been applied [39]. To navigate chemical space, three methodologies can be used for materials identification (Figure 2): (1) high-throughput virtual screening; (2) global optimization; and (3) generative models [40,41].

2.1. High Throughput Virtual Screening (HTVS)

High throughput virtual screening is a computational investigation of a large set of compounds or materials to assess their qualification for specific requirements. It is best defined by core philosophies as (1) significant timescale; (2) automated techniques; (3) data-driven discovery; and (4) computational funnels [42]. It enables a rather narrow chemical space by defining specific properties, functionalities, building blocks, or bonding rules. The resultant hypothesized candidate from the model usually can be tested by ML-based predictor or high throughput simulations such as molecular dynamics (MD), density functional theory (DFT), finite element method (FEM) etc., which can accelerate the computation process significantly through ML.

For example, Jang et al. [43] proposed a HTVS based on DFT prediction method for inorganic materials synthesis, which is the most important problem in predicting the inorganic materials structures in terms of different functional groups or fragments as in molecules. The MP database for inorganic crystal structures with DFT-calculated properties was used as model training dataset. The graph convolutional neural network (GNN) was implemented as a classifier to the model outputs crystal-likeness scores. Previous developed positive and unlabeled machine learning algorithm combined with GNN-based classifier were used to implement the decision tree. Figure 3 shows the algorithmic of the overall process. P represents a positive data set, which is the organic crystal synthesis data from MP database; U represents unlabeled data set, which is the virtual data from MP; K represent the number of positive data; and T is the number of iterations for bagging. For each iteration, a subsample in U is chosen randomly to be K. After n iterations, twenty percent of P and K are used as classifier and the rest are used as training sets for GNN binary classification model. Then, the classifier predicts that the score will be 1 or 0 based on the similarity to positive-labeled. An average score can be obtained for T times repeating, which represented the synthesizability of a given crystal structure.

Afzal et al. [44] present an HTVS calculation method based on ab initio modeling for the identification of new polyimides with exceptional refractive index values for optical or optoelectronic materials. They defined 29 building blocks as the polyimides’ core structure and made specific moieties structure constraints with respect to certain refractive index by a combination of first principles quantum chemistry calculation and data modeling for the resulting candidate to limit the screening space.

Computational HTVS has been widely used in the discovery strategy in many materials, especially in organic materials, inorganic materials, and organic drugs, each of which has different needs in terms of the number of descriptors, the size of the search space, and the level of approximation. The main problem of HTVS is the size of the library. HTVS need to go through the existing database, but when we design new materials, there is no existing database in our library. However, global optimization (GO) and generative models (GM) are quite different in, that they can capture hidden information from a structure-property-linked database for generating new structures that do not exist in the database.

2.2. Global Optimization (GO)

Global optimization is an algorithm to find an optimal solution of the target function and can be applied in the inverse design of various materials, which can help in navigating the chemical space. Bayesian optimization (BO), particle swarm optimization (PSO), genetic algorithm (GA), and stimulated annealing are most seen in materials design. They are potentially useful in multimodal search calculation in inverse problems [45]. For a multi-objective optimization, a function that can normalized the global objectives is needed. For example, we need materials with high x properties, low y properties, and moderate z properties. The optimization of an function f (x,y,z) exactly represent the above multi objectives.

BO is systematic approach to find the optimum of function f without assumption of any form of f. In this way, BO allows acceleration of difficult optimization problems (especially for materials design). In BO, the controllable parameters should be updated to reach the desired objectives. Thus, repeated experiments are needed. For example, Harper et al. [46] used BO with Gaussian processed to obtain eleven different optimal topologies for multi-functional optical materials.

PSO move the optimizers to D-dimensional search space denoted with four vectors: position, velocity, the best position corresponding to the objective function, and the best position found by any of its surroundings. For example, Khadilkar et al. [47] used particle swarm optimization combined with self-consistent-field theory to predict the bulk morphologies in multiblock polymers. In the PSO, the original optimizer agent i are described by four vectors: its position

\vec{x_{i}} = (x_{i 1}, x_{i 2}, \dots, x_{i D})

, its velocity

\vec{v_{i}} = (v_{i 1}, v_{i 2}, \dots, v_{i D})

, the post position corresponding to the objective function

\vec{p_{i}} = (p_{i 1}, p_{i 2}, \dots, p_{i D})

, and the best position found by its neighbors

\vec{n_{i}} = (n_{i 1}, n_{i 2}, \dots n_{i D})

. Thus, agent i in d dimension can be described as:

\begin{matrix} v_{i d}^{n + 1} = v_{i d}^{n} + χ c_{0} ϕ_{0} (p_{i d} - x_{i d}) + χ c_{1} ϕ_{1} (n_{i d} - x_{i d}) - (1 - χ) v_{i d} \\ x_{i d}^{n + 1} = x_{i d}^{n} + v_{i d} \end{matrix}

where ϕ₀ and ϕ₁ are independent, uniformly distributed random variables in the interval [0, 1] generated at every update, and c₀ and c₁ are acceleration coefficients. The parameter χ ∈ [0, 1] is known as the constriction factor. After PSO search, the fitness is certified by self-consistent-field theory for the target phase and candidate phases. They found the procedure is robust in polymer design using bulk information as a describer and can be broadened to targeting properties directly (for example, photonic bandgap).

GA, similar to PSO, uses a population of points or variables to propose potential solutions. It is inspired by the natural biological evolutionary process with steps of crossover, mutation, selection, and passing on the selected genes to the next generation. The structure of a simple GA is shown in Figure 4. GA is suitable for exploring large search spaces and thus can be effectively used for in materials inverse design, especially in the molecular search space. For example, Lee et al. [48] introduced a novel two phase GA method as constrained optimization for molecular inverse design while constraining the molecular structure. Self-referencing embedded strings and graph are used as descriptors for mutation and crossover, respectively, which generate valid molecular candidates and allow new molecules to be generated by random editing, but with appropriate target properties and limited structural information and without previous experience rules. In the new strategy, they first construct a population that is always valid for the existing dataset and a second stage was built to select suitable molecular descriptors to ensure the validity of the generated molecules. They showed that the model can preserve the molecular core and optimize target protein properties across generations through cannabidiol molecular optimization.

2.3. Generative Models (GM)

GM is unsupervised learning that encodes the high-dimensional materials chemical space into the continuous vector space (or latent space)with lower dimensionality, and generates new data using knowledge embedded in the vector space [36]. Thus, it is able to synthesize novel, high dimensional data samples. Several GM approaches have been used for inverse design of materials, and to the best of our knowledge, the most commonly used for various materials are recurrent neural networks (RNNs), variational autoencoders (VAEs), reinforcement learning (RL), generative adversarial networks (GANs), and hybrid architectures [49].

RNNs can generate sequences from incrementally one step at a time and predicting what comes next based on the current and past information. RNNs do not need static input data, as shown in Figure 5. Current input vector

X_{(t)}

and the past knowledge

h_{(t - 1)}

at time step t are the input vector, allowing RNNs to generate sequential data based on the learning information of the last iteration. For example, Kim et al. [50] implemented a hybrid deep encoder-decoder architecture method for discovery of organic molecules, which a deep neural network (DNN) was adopted as the encoder to identified the relationship between structural features and their material properties and RNNs were adopted and the decoder to reconstructed the recognizable molecular structures from the hidden relationship.

AE generally includes an encoder to encode molecules to a continuous vector in a lower dimension and decoder maps for the vector back to obtain the original representation (as shown in Figure 5). The encoder–decoder architecture of VAEs enable better generalizability by constraining the encoder network with a probability distribution [36]. In the inverse design of materials, with the advantages of combining neural networks and probability models, VAE enables the processing of large and complicated datasets. Moreover, continuous representation launches the gradient-based optimization models to decode arbitrary vectors and interpolate structures. For example, Ma et al. [51] described a VAE structure to metamaterial design problem. They defined three variables as input variable x (geometric pattern of metamaterial structure), output variable y (three distinct reflection spectra), and latent variable z (compressed code of the design). A probabilistic relationship between the above three variables was established by a VAEs model. Each probabilistic relationship represents different functionalities of the metamaterials. Their models showed the ability to simultaneously solve the forward and inverse problem, which is predominant compared to GAN, which requires a pre-trained simulator to guarantee the inverse process.

RL considers the generator as agent and studies how an agent interacts with an environment or task to maximize some notion of reward (properties), as shown in Figure 5. RL is a subfield of AI, which is used to solve dynamic decision problems. For example, Popova et al. [52] devised a novel computational strategy based on deep RL for generating chemical compounds with desired physical, chemical, and/or bioactivity properties de novo. They implement two deep neural networks (a generative model and a predictive model) in deep RL framework, which the generative model is used to generate chemically feasible molecules and the predictive model estimates the agent’s behavior by assigning a numerical reward (or penalty) to every generated molecule. The generative model is trained to maximize the reward.

GAN consists of a generator and a discriminator, which are trained simultaneously with conflicting objectives. The generator takes in a noise vector and outputs an image, while the discriminator takes in an image and outputs a prediction about whether the image is a sample from generator. Competition of the generator and the discriminator improves both networks while generator is trained to maximize the probability that discriminator makes a mistake, and discriminator is trained to minimize that probability. For example, Geng et al. [47] adopt a GAN in network model for inverse design of metasurfaces for dielectric materials. In the work, structure-property relationships and generated optical spectrum are simulated by GAN, and rational design prediction is made. The simulator is a pretrained fixed-weight model that takes the generated patterns as input and approximates their transmission spectra without the use of electromagnetic simulation. The distance of user-defined geometric data and the patterns from the generator was minimized by backpropagation training.

3. Application in Materials Design

3.1. Polymers

Polymeric materials are widely used in various aspects of everyday life and technological development, such as actuators, agriculture, aviation, biomedicine, biosensing devices, catalysts, chemotherapy, chitosan, electronics, fuel cell, furniture, membranes, packaging, textile, etc. due to their attractive physical, chemical and electrical properties [53]. The demand for polymers with better performance and lower carbon footprint is driving the design of new polymeric materials. Polymer dynamics and chemo-functionality determine the polymer properties, while the inverse design provides an approach to design polymers based on the desired attributes and a ML approach can make rapid predictions due to the rapid inference rate of ML-based predictive modeling [39]. However, due to the chemical, topological, and morphological complexity of polymers and various synthesis information, research is scarce and mostly computationally expensive; the related field is still in its infancy. The inverse design of polymers in both ML and deep learning methods has been well-reviewed by Sattari et al. [41] and ML for polymer design has been well summarized by Kumar et al. [54]. The data-driven algorithms for inverse design of polymers have two paths to follow in general: high throughput virtual screening and smart search algorithms [36]. These have been well-reviewed by Sattari et al. [41]. Here in this paper, some highlight inverse designs of the polymer by ML will be emphasized.

Phase behavior is a feasible target property for polymer inverse design, it is strongly influenced by polymer structures, polymer-polymers interactions, solution, etc. Based on target-phase properties, such as cloud point, polymer structure information including size, topology, composition, functionality can be derived by ML. Kumar et al. [55] developed an ML method based on particle swarm optimization for tuning of poly(2-oxazoline) cloud point with high accuracy (Figure 6). Four building blocks were identified as descriptor for polymer architecture, by which the machine learning model was trained to predict the cloud point. The model, consisting of a trained algorithm and PSO, was demonstrated by predicting 17 polymer structures with desired cloud point. Incidentally, PSO is often used in the polymer inverse design. It is a bioinspired search technique t suitable for complex systems with divergent distribution and solves the problem without centralized control in a specific individual [56]. Khadilkar et al. [57] used particle swarm optimization to predict the bulk morphologies in multiblock polymers, using separate self-consistent-field theory to ensure accurate estimation of the equilibrium structure. Their methodology was demonstrated suitable for single multiblock polymers as well as blend systems and even more block copolymers. Hiraide et al. [58] predicted the phase separation structure of polymer alloy from specific properties. They trained the framework by the convolutional neural network from previous analysis to predict the phase separation structure of a polymer alloy, subsequently applied a hybrid model consisting of a generative adversarial network and convolutional neural network. The framework they built was demonstrated as a low-cost method.

Polymer dielectrics are essential properties, especially when used in capacitive energy storage, organic photovoltaics. Diverse spectrum information and high data availability provide sufficient training models for ML techniques for polymer design (Figure 7). However, the vastness of polymer chemical and structural space could conceal some key opportunities. There are mainly two distinct steps for the above scenario: fingerprinting polymers into numerical representations and establishing a mapping between the numbers and target property [59]. Several ML algorithms are commonly used in these calculations, such as linear regression, GPR, ANN, RF, deep neural network, etc. [60] Mannodi-Kanakkithodi et al. [61] addressed the polymer dielectric design by ML-based genome approach for optimization of polymer constituent blocks, where they fingerprinted polymers into easily attainable numerical representations in prior. Their method accelerates the discovery of on-demand polymers with desired dielectric constant. Wu et al. [62] processed an algorithm based on inference and sampling with sequential Monte Carlo to target dielectric constant and bandgap. Gurnani et al. [63] proposed a graph-to-graph translation based novel ML algorithm called polyG2G to inverse design the polymer dielectrics. They trained the system with a high range of performance polymers and analyzed the subtle chemical differences between them. The difference continuously became an index from high throughput screening. Thousands of potential targets in an intractable search space with desired glass-transition temperatures, bandgap, and electron injection barriers have been found by the novel algorithm.

The self-assembly of block copolymers, which have robust application in medicine, can be designed through tuning the phase behavior to achieve exotic structures [64]. However, to achieve the inverse design of copolymers, expert knowledge and much time is needed for the selection of order parameters. Moreover, the results of simulation have nowhere to confirm as comprehensive. Patra et al. [38] used a Monte Carlo tree search to minimize the total number of evaluations in a given design cycle to copolymer compatibilizer design, which is inspired by AI gaming algorithms. They established a framework that combined the algorithm with molecular dynamics simulations, then applied it to specific polymer chain lengths to confined overall search space. The framework can also be extended to several proteins.

3.2. Photonic

Integrated photonics including materials and devices are widely applied in optical communication, biomedicine, biomedical, sensing technologies, etc. [65]. They can be accurately manipulated by changing the structure and degrees of freedom (DOF). To achieve target properties in transmittance, polarization, chirality, frequency, etc., researchers have made many efforts in the design of microscopic structures of photonics. Although it is quite understandable that the photonics performance from the knowledge of photonics structures should be predicted, inverse design of on-demand photonics is another story altogether and understandably represents a much more recent development [66]. The background and development history of inverse design in nanophotonic has been well-reviewed by Molesky et al. [67]. The methodologies of photonic design through machine learning at different degrees of freedoms are shown in Figure 8. When DOF of photonics structure is low, either a simple analytical solution or parametric sweeping can be used for the optimization. However, the simple methods suffer from low reliability. The solution space becomes larger as the DOF increases, and discriminative model can be used for the structure-property relationship. However, this approach often fails to find a particular optimal design parameters since multiple structures will produce the same response accordingly. If DOF continue to increase to thousands and more, a generative model can be used to reduce the dimensionality of the chemical space, a good optimization algorithm can be applied to locate an optimization.

The photonic inverse design is typically solved by local optimization as other physical design problems [68]. Traditional optimizations such as adjoint methods, GA, and PSO, have been applied to photonics design but with expensive computation and local minimum problems since it requires the same large amounts of simulations for each design, while ML only needs limited training for neural networks due to its ability to identify hidden correlations in the large data sets during the training phase. More importantly, once the neural networks are trained for a complex system problem, it can approximate the same computation in orders of magnitude less time owing to the ability to retrieve knowledge allows the simulations to be invested in the design tool and can be applied to each design without costly computations [69]. Besides, some approaches that available to ML models can enhance the likelihood of achieving the global minimum in the optimization problems. Thus, ML as a stand-alone technique can help the inverse design of photonics and on the other hand, photonics provides a place to solve ML problems [65]. However, inverse designs have issues such as low training efficiency when dealing with inconsistent data, and inverse problems in photonic design often generate scattering problems. Therefore, the training process and optimization methodology are important. Qu [70] et al. established an optical neural network framework based on optical scattering units by introducing the “kernel matrix”. Micrometer-level footprint allows an accelerated process for deep learning. Their framework demonstrated 97.1% accuracy but with an inefficient training process. They suggested that in situ training on the integrated photonics probably can help the framework further decrease their footprints and not sacrifice efficiency and functionality at the same time.

Topology optimization is a good computational tool that can be used for the systematic design of photonic crystals, waveguides, resonators, filters, and plasmonic, and the related logic and mathematics has been well-reviewed by Jensen et al. [71]. This is owing to the gradient descent nature of topology optimization, such as steepest descent and conjugate gradient, which can provide a reduction of constraints for an objective function [72]. Due to materials’ complex optical response and geometrical structure, the photonics design with tuning targeted topology remains a challenge. Long et al. [73] proposed an ML approach to design optical structures with the target topological states in a one-dimensional dielectric photonic crystal system. In the system, the Zak phase was descripted as state vectors and label vectors, referring to the geomatical information and reflection phase properties respectively. The neural network was trained by a tandem pipeline to establish the inverse design model. The optical structure can be acquired by applying targeted topological properties. Pilozzi et al. [74] employ a supervised ML regression to design photonic topological insulators. Aubry–Andre–Harper band structure models are used for neural networking training and a twist based on a reverse validation between the inverse problem neural network and the direct problem neural network has been introduced to ensure the only solution can be found. The method can be extensively applied to other physical systems in topological science, such as polaritonic, quantum technologies, and ultra-cold atoms, as well as 2D and 3D topological systems, quantum sources, and simulations. With the development of advanced deep learning algorithms, generative adversarial networks and autoencoder extended the possibility to joint with topology optimization to perform optimization in a complex topological system. Jiang et al. [75] demonstrated generative adversarial neural networks are effective for nanoantenna design optimization and can generate high-performance metasurfaces when coupling with topology optimization. Liu et al. [76] propose an encoding method for binary images that represent the topology of photonic structures for data generation and dimensionality reduction. The method was demonstrated and proved the ability to provide a way to generate global optimization results within limited solution space as well as enhance the accuracy of the network. Kudyshev et al. [77] used an adversarial autoencoder coupled with a metaheuristic optimization framework to assist global optimization of photonic devices with complex topologies.

3.3. Inorganic Solid-State Functional Materials

The discovery of novel inorganic functional materials is the core of many technologies’ development such as solid electrolytes for lithium-ion batteries, robust membrane for capturing carbon dioxide, halide perovskites for perovskite solar cells, etc.

For inorganic substances, molecular simulations and first-principles methods are commonly used methodologies, but they are computationally expensive for large chemical space screening. Recently, HTVS based on density functional theory (DFT) calculations have become a rather popular topic, which allows the discovery of crystals with targeted functional properties. However, the above methods focus on screening based on the existing dataset, which means that regressing the crystal or moieties representations can meet the required properties, whereas ML based on global optimization allows inverse design/discovery of new crystals with on-demand properties. This approach generally requires a structural pool of chemical compositions and their corresponding properties. Moreover, probabilistic generative models to existing materials to a continuous latent space can also lead to inverse materials design through mapping the latent space to materials properties. Indeed, there is a vital challenge in inorganic materials design. For example, a significant number of after screening hypothetical crystals are not observed in experiments, a thermodynamic model of crystals is simplified in prior which could lead to the inaccurate descriptor. Another challenge in inorganic materials synthesis design by ML is the high dimensionality of the problems. Synthesis is generally involved in many different parameters including the reactants parameters and synthesis environmental parameters, where n synthesis variables create an n dimension exploration space. Figure 9 shows a typical schematic depiction of ML workflow for inorganic materials design [78].

Many exciting developments have been well-established by Noh et al. [79]. Chen et al. [40] has reviewed the generative models for inverse design of inorganic solid material. Zunger [35] discussed the inverse design of solid-state materials with target functionalities very comprehensively. Only limited works will be mentioned in this mini-review.

HTVS, GO, GM, GAN, and support vector machine regression (SVM) are usually used for inorganic materials inverse design. Kim et al. [80] proposed a generative framework using evolutionary algorithms and quasi-random searching. The framework is inversion-free with a relative low memory requirement on the unit cell. Fractional atomic coordinates are used as crystal representations to build the crystal structures. Atomic coordinates and cell parameters are projected to the ML field by image classification and segmentation, which are used as a set of points and vectors with 3D coordinates. They demonstrated the effectiveness of the framework by asking for photoanode properties for high-throughput virtual screening with the generation of Mg–Mn–O ternary materials. Dan et al. [81] proposed the first GAN model to efficiently sample the inorganic material design space by generating hypothetical inorganic materials. The Open Quantum Materials Database, Materials Project, and ICSD databases have been used for model training of chemical compositional rules. Their application experiments showed that 2 million targeted materials were obtained with as high as 92.53% materials novelty. Rosales et al. [82] describe a HTVS to the inverse design of enantioselective catalyst candidates, substrate and ligand libraries or asymmetric catalysis was screening within hours. SVM was then used to generate a visual map of the space. Braham et al. [78] studied CsPbBr3 perovskite nanocrystal growth by SVM to initially separate regions of the design space that yield quantum-consolidated nanoplatelets from regions that yield bulk particles. Further predictions can also be made by the model, and it provides a perspective on the influence of molecular ligands on the dimensions of nanocrystals.

3.4. Porous Materials

Porous materials are widely used in catalysis, separations, sensors, electronics, architecture, biomedical, and electronics [83,84]. A rational design for porous materials with regular, accessible cages and tunnels is now being demanded. Neural networks based on ML can be applied to materials’ compositions, bandgap energy, formation energy, and gas adsorption uptakes, which is an appropriate method for porous materials such as zeolites, metal-organic framework, etc. However, it is challenging work due to the complex chemistry of these porous materials. For example, they contain various unit cells and unclear lattice parameters. Kim et al. [85] proposed an artificial to generate pure silica zeolite structures, which a generative adversarial network are used for training. Yao et al. [86] applied generative models for nano-porous l neural network crystalline reticular materials (metal-organic framework) inverse design. They demonstrated that autoencoder is a promising optimization method for metal-organic framework related predication when trained with multiple top adsorbent candidates identified for superior gas separation. Wan et al. [87] reported an ML-based inverse design of porous graphene. In their research, they build up a relationship between hole distribution and thermal conductivity reduction in monolayer graphene by machine learning method. This is then used for backpropagation to generate porous graphene with low thermal conductivity.

3.5. Other Materials

There are many other materials have been designed through inverse design approach based on the ML method. Thermoelectric materials represent highly efficient solid state energy conversion and play a role in both primary power generation and energy conservation. The design of it drawing many attentions and the ML-based method can provide a rational design method. The machine learning approaches for thermoelectric materials have been well reviewed by Wang et al. [88] and Gomez et al. [89]. Here, some other materials related research are lists in Table 1 as below.

4. Challenges and Opportunities

Inverse design navigates to material innovation by taking the targeted functionality or property as input to obtain an output of structural material information. It is a promising strategy to accelerate the discovery of materials and shorten the time for technology development, whose direct design requires much more time on trial-error experiments. Traditionally, inverse problem are generally solved by mathematically inverting the Schrödinger equation. However, it is usually not practical to find the inversion of this equation due to mathematical restrictions, the complex physical system of the materials design, and a scalable approach that leverages the talent and efforts of the entire materials community. Data driven techniques provide a different way for inverse problem, which requires no mathematical inversion of any equation but to manipulate a large set of direct approach calculation to find the relationship between the properties/functionalities and molecule structures. ML as a component tool for data driven inverse design is rapidly developing. The ML-based approaches can quickly map between the fingerprinted input and the target properties by using backpropagation to overcome local minimax traps and performs a quick calculation of the gradient information for a target function with respect to the design variable to find the optimizations. It can produce logical framing of chemical space, better exploration of chemical space within required regions, and optimization methods. ML-based approaches are highly available for multi-objective design requirements and the high dimensionality of microstructure space, which cannot be achieved by traditional statistical methodologies. However, there are many challenges. One of the most vital challenges in inverse design, or rather in all data-driven materials design, is the close and iterative interaction between theories and experiments. How to realize the predictions and how to produce predicted materials must be considered. Building an invertible and invariant generative model is quite a challenge due to the lack of an explicit approach for the permutation and combination of different conditions without exploring the entire design space. Another important challenge is to develop an experimental feedback loop which can enhance the reliability of the decisions from the artificial intelligent. As seen, the integration of ML as a new pillar of knowledge in materials will simulate a related application throne, while the application scenario also provides a place to solve ML problems, such as photonics, different catalysis, ultrafast nanomaterials, 2-D materials, etc. [23,24,95,96,97,98].

Author Contributions

J.W. contributed to the conception and manuscript writing of this review. Y.W. contributed significantly to the manuscript preparation, revise, and valid confirmation. Y.C. helped with constructive discussion of this review. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Sass, S.L. The Substance of Civilization: Materials and Human History from the Stone Age to the Age of Silicon; Arcade Publishing: New York, NY, USA, 1998. [Google Scholar]
Headrick, D.R. When Information Came of Age: Technologies of Knowledge in the Age of Reason and Revolution, 1700–1850; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
Agrawal, A.; Choudhary, A. Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science. APL Mater. 2016, 4, 053208. [Google Scholar] [CrossRef] [Green Version]
Pople, J.A. Quantum Chemical Models (Nobel Lecture). Angew. Chem. Int. Ed. 1999, 38, 1894–1902. [Google Scholar] [CrossRef]
Garrity, K.F.; Bennett, J.W.; Rabe, K.M.; Vanderbilt, D. Pseudopotentials for high-throughput DFT calculations. Comput. Mater. Sci. 2014, 81, 446–452. [Google Scholar] [CrossRef] [Green Version]
Hobart, M.E.; Schiffman, Z.S. Information Ages: Literacy, Numeracy, and the Computer Revolution; JHU Press: Baltimore, MD, USA, 2000. [Google Scholar]
Mosavi, A.; Vaezipour, A. Reactive Search Optimization; Application to Multiobjective Optimization Problems. Appl. Math. 2012, 3, 1572–1582. [Google Scholar] [CrossRef] [Green Version]
Rajan, K. Informatics for Materials Science and Engineering: Data-Driven Discovery for Accelerated Experimentation and Application; Butterworth-Heinemann: Oxford, UK, 2013. [Google Scholar]
Mosavi, A.; Rabczuk, T.; Varkonyi-Koczy, A.R. Reviewing the novel machine learning tools for materials design. In Recent Advances in Technology Research and Education; Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2018; pp. 50–58. [Google Scholar]
Goh, G.B.; Hodas, N.O.; Vishnu, A. Deep learning for computational chemistry. J. Comput. Chem. 2017, 38, 1291–1307. [Google Scholar] [CrossRef] [Green Version]
Dam, H.C.; Pham, T.L.; Ho, T.B.; Nguyen, A.T.; Nguyen, V.C. Data mining for materials design: A computational study of single molecule magnet. J. Chem. Phys. 2014, 140, 044101. [Google Scholar] [CrossRef] [PubMed]
Coley, C.W.; Green, W.H.; Jensen, K.F. Machine Learning in Computer-Aided Synthesis Planning. Acc. Chem. Res. 2018, 51, 1281–1289. [Google Scholar] [CrossRef] [PubMed]
Kim, E.; Huang, K.; Jegelka, S.; Olivetti, E. Virtual screening of inorganic materials synthesis parameters with deep learning. npj Comput. Mater. 2017, 3, 53. [Google Scholar] [CrossRef]
Mardt, A.; Pasquali, L.; Wu, H.; Noe, F. VAMPnets for deep learning of molecular kinetics. Nat. Commun. 2018, 9, 5. [Google Scholar] [CrossRef]
Chen, C.; Lu, Z.; Ciucci, F. Data mining of molecular dynamics data reveals Li diffusion characteristics in garnet Li₇La₃Zr₂O₁₂. Sci. Rep. 2017, 7, 40769. [Google Scholar] [CrossRef] [Green Version]
Gopnik, A. Making AI More Human. Sci. Am. 2017, 316, 60–65. [Google Scholar] [CrossRef]
Jordan, M.I.; Mitchell, T.M. Machine learning: Trends, perspectives, and prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef] [PubMed]
Provost, F.; Kohavi, R. Glossary of terms. J. Mach. Learn. 1998, 30, 271–274. [Google Scholar] [CrossRef]
Wei, J.; Chu, X.; Sun, X.Y.; Xu, K.; Deng, H.X.; Chen, J.; Wei, Z.; Lei, M. Machine learning in materials science. InfoMat 2019, 1, 338–358. [Google Scholar] [CrossRef]
Fischer, C.C.; Tibbetts, K.J.; Morgan, D.; Ceder, G. Predicting crystal structure by merging data mining with quantum mechanics. Nat. Mater. 2006, 5, 641–646. [Google Scholar] [CrossRef]
Takahashi, K.; Tanaka, Y. Material synthesis and design from first principle calculations and machine learning. Comput. Mater. Sci. 2016, 112, 364–367. [Google Scholar] [CrossRef]
Liu, Y.; Niu, C.; Wang, Z.; Gan, Y.; Zhu, Y.; Sun, S.; Shen, T. Machine learning in materials genome initiative: A review. J. Mater. Sci. Technol. 2020, 57, 113–122. [Google Scholar] [CrossRef]
Moosavi, S.M.; Jablonka, K.M.; Smit, B. The Role of Machine Learning in the Understanding and Design of Materials. J. Am. Chem. Soc. 2020, 142, 20273–20287. [Google Scholar] [CrossRef] [PubMed]
Ward, L.; Agrawal, A.; Choudhary, A.; Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. npj Comput. Mater. 2016, 2, 16028. [Google Scholar] [CrossRef] [Green Version]
Kim, C.; Batra, R.; Chen, L.; Tran, H.; Ramprasad, R. Polymer design using genetic algorithm and machine learning. Comput. Mater. Sci. 2021, 186, 110067. [Google Scholar] [CrossRef]
Seko, A.; Hayashi, H.; Nakayama, K.; Takahashi, A.; Tanaka, I. Representation of compounds for machine-learning prediction of physical properties. Phys. Rev. B 2017, 95, 144110. [Google Scholar] [CrossRef] [Green Version]
Burbidge, R.; Trotter, M.; Buxton, B.; Holden, S. Drug design by machine learning: Support vector machines for pharmaceutical data analysis. Comput. Chem. 2001, 26, 5–14. [Google Scholar] [CrossRef]
Linares, N.; Silvestre-Albero, A.M.; Serrano, E.; Silvestre-Albero, J.; García-Martínez, J. Mesoporous materials for clean energy technologies. Chem. Soc. Rev. 2014, 43, 7681–7717. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Liu, S.; Shen, Y.; Zhang, Y.; Cui, B.; Xi, S.; Zhang, J.; Xu, L.; Zhu, S.; Chen, Y.; Deng, Y.; et al. Extreme Environmental Thermal Shock Induced Dislocation-Rich Pt Nanoparticles Boosting Hydrogen Evolution Reaction. Adv. Mater. 2021, 34, 2106973. [Google Scholar] [CrossRef]
Liu, C.; Zhou, W.; Zhang, J.; Chen, Z.; Liu, S.; Zhang, Y.; Yang, J.; Xu, L.; Hu, W.; Chen, Y.; et al. Air-Assisted Transient Synthesis of Metastable Nickel Oxide Boosting Alkaline Fuel Oxidation Reaction. Adv. Energy Mater. 2020, 10, 10. [Google Scholar] [CrossRef]
Liu, S.; Hu, Z.; Wu, Y.; Zhang, J.; Zhang, Y.; Cui, B.; Liu, C.; Hu, S.; Zhao, N.; Han, X.; et al. Dislocation-Strained IrNi Alloy Nanoparticles Driven by Thermal Shock for the Hydrogen Evolution Reaction. Adv. Mater. 2020, 32, e2006034. [Google Scholar] [CrossRef]
Wu, H.; Lu, Q.; Zhang, J.; Wang, J.; Han, X.; Zhao, N.; Hu, W.; Li, J.; Chen, Y.; Deng, Y. Thermal Shock-Activated Spontaneous Growing of Nanosheets for Overall Water Splitting. Nanomicro. Lett. 2020, 12, 162. [Google Scholar] [CrossRef]
Press, W.H.; Flannery, B.P.; Teukolsky, S.A.; Vettering, W.T. Numerical Recipes in C: The Art of Scientific Computing; Cambridge University Press: Cambridge, UK, 2002. [Google Scholar]
Kuhn, C.; Beratan, D.N. Inverse Strategies for Molecular Design. J. Phys. Chem. 1996, 100, 10595–10599. [Google Scholar] [CrossRef]
Zunger, A. Inverse design in search of materials with target functionalities. Nat. Rev. Chem. 2018, 2, 0121. [Google Scholar] [CrossRef]
Sanchez-Lengeling, B.; Aspuru-Guzik, A. Inverse molecular design using machine learning: Generative models for matter engineering. Science 2018, 361, 360–365. [Google Scholar] [CrossRef] [Green Version]
Peurifoy, J.; Shen, Y.; Jing, L.; Yang, Y.; Cano-Renteria, F.; DeLacy, B.G.; Joannopoulos, J.D.; Tegmark, M.; Soljacic, M. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 2018, 4, eaar4206. [Google Scholar] [CrossRef] [Green Version]
Patra, T.K.; Loeffler, T.D.; Sankaranarayanan, S. Accelerating copolymer inverse design using monte carlo tree search. Nanoscale 2020, 12, 23653–23662. [Google Scholar] [CrossRef] [PubMed]
Wu, S.; Yamada, H.; Hayashi, Y.; Zamengo, M.; Yoshida, R. Potentials and challenges of polymer informatics: Exploiting machine learning for polymer design. arXiv 2020, arXiv:2010.07683. [Google Scholar]
Chen, L.; Zhang, W.; Nie, Z.; Li, S.; Pan, F. Generative models for inverse design of inorganic solid materials. J. Mater. Inform. 2021, 1, 4. [Google Scholar] [CrossRef]
Sattari, K.; Xie, Y.; Lin, J. Data-driven algorithms for inverse design of polymers. Soft Matter 2021, 17, 7607–7622. [Google Scholar] [CrossRef] [PubMed]
Pyzer-Knapp, E.O.; Suh, C.; Gómez-Bombarelli, R.; Aguilera-Iparraguirre, J.; Aspuru-Guzik, A. What is high-throughput virtual screening? A perspective from organic materials discovery. Annu. Rev. Mater. Res. 2015, 45, 195–216. [Google Scholar] [CrossRef] [Green Version]
Jang, J.; Gu, G.H.; Noh, J.; Kim, J.; Jung, Y. Structure-Based Synthesizability Prediction of Crystals Using Partially Supervised Learning. J. Am. Chem. Soc. 2020, 142, 18836–18843. [Google Scholar] [CrossRef] [PubMed]
Afzal, M.A.F.; Haghighatlari, M.; Ganesh, S.P.; Cheng, C.; Hachmann, J. Accelerated Discovery of High-Refractive-Index Polyimides via First-Principles Molecular Modeling, Virtual High-Throughput Screening, and Data Mining. J. Phys. Chem. C 2019, 123, 14610–14618. [Google Scholar] [CrossRef]
Scales, J.A.; Smith, M.L.; Fischer, T.L. Global optimization methods for multimodal inverse problems. J. Comput. Phys. 1992, 103, 258–268. [Google Scholar] [CrossRef]
Harper, E.; Mills, M. Bayesian Optimization of Neural Networks for the Inverse Design of All-Dielectric Metasurfaces; SPIE: Bellingham, WA, USA, 2020; Volume 11469. [Google Scholar]
Geng, Y.; van Anders, G.; Glotzer, S.C. Predicting colloidal crystals from shapes via inverse design and machine learning. arXiv 2018, arXiv:1801.06219. [Google Scholar]
Lee, Y.; Choi, G.; Yoon, M.; Kim, C. Genetic Algorithm for Constrained Molecular Inverse Design. arXiv 2021, arXiv:2112.03518. [Google Scholar]
Odena, A. Semi-supervised learning with generative adversarial networks. arXiv 2016, arXiv:1606.01583. [Google Scholar]
Kim, K.; Kang, S.; Yoo, J.; Kwon, Y.; Nam, Y.; Lee, D.; Kim, I.; Choi, Y.-S.; Jung, Y.; Kim, S.; et al. Deep-learning-based inverse design model for intelligent discovery of organic molecules. npj Comput. Mater. 2018, 4, 4. [Google Scholar] [CrossRef] [Green Version]
Ma, W.; Cheng, F.; Xu, Y.; Wen, Q.; Liu, Y. Probabilistic Representation and Inverse Design of Metamaterials Based on a Deep Generative Model with Semi-Supervised Learning Strategy. Adv. Mater. 2019, 31, e1901111. [Google Scholar] [CrossRef] [Green Version]
Popova, M.; Isayev, O.; Tropsha, A. Deep reinforcement learning for de novo drug design. Sci. Adv. 2018, 4, eaap7885. [Google Scholar] [CrossRef] [Green Version]
Mishra, M. Encyclopedia of Polymer Applications, 3 Volume Set; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
Kumar, J.N.; Li, Q.; Jun, Y. Challenges and opportunities of polymer design with machine learning and high throughput experimentation. MRS Commun. 2019, 9, 537–544. [Google Scholar] [CrossRef] [Green Version]
Kumar, J.N.; Li, Q.; Tang, K.Y.T.; Buonassisi, T.; Gonzalez-Oyarce, A.L.; Ye, J. Machine learning enables polymer cloud-point engineering via inverse design. npj Comput. Mater. 2019, 5, 537–544. [Google Scholar] [CrossRef] [Green Version]
Nápoles, G.; Grau, I.; Bello, R. Constricted Particle Swarm Optimization based algorithm for global optimization. Polibits 2012, 5–11. [Google Scholar] [CrossRef]
Khadilkar, M.R.; Paradiso, S.; Delaney, K.T.; Fredrickson, G.H. Inverse Design of Bulk Morphologies in Multiblock Polymers Using Particle Swarm Optimization. Macromolecules 2017, 50, 6702–6709. [Google Scholar] [CrossRef]
Hiraide, K.; Hirayama, K.; Endo, K.; Muramatsu, M. Application of deep learning to inverse design of phase separation structure in polymer alloy. Comput. Mater. Sci. 2021, 190, 110278. [Google Scholar] [CrossRef]
Ramprasad, R.; Batra, R.; Pilania, G.; Mannodi-Kanakkithodi, A.; Kim, C. Machine learning in materials informatics: Recent applications and prospects. npj Comput. Mater. 2017, 3, 54. [Google Scholar] [CrossRef]
Zhu, M.X.; Deng, T.; Dong, L.; Chen, J.M.; Dang, Z.M. Review of machine learning-driven design of polymer-based dielectrics. IET Nanodielectr. 2021, 1–15. [Google Scholar] [CrossRef]
Mannodi-Kanakkithodi, A.; Pilania, G.; Huan, T.D.; Lookman, T.; Ramprasad, R. Machine Learning Strategy for Accelerated Design of Polymer Dielectrics. Sci. Rep. 2016, 6, 20952. [Google Scholar] [CrossRef] [Green Version]
Wu, S.; Lambard, G.; Liu, C.; Yamada, H.; Yoshida, R. iQSPR in XenonPy: A Bayesian Molecular Design Algorithm. Mol. Inf. 2020, 39, e1900107. [Google Scholar] [CrossRef] [Green Version]
Gurnani, R.; Kamal, D.; Tran, H.; Sahu, H.; Scharm, K.; Ashraf, U.; Ramprasad, R. polyG2G: A Novel Machine Learning Algorithm Applied to the Generative Design of Polymer Dielectrics. Chem. Mater. 2021, 33, 7008–7016. [Google Scholar] [CrossRef]
Li, C.; Li, Q.; Kaneti, Y.V.; Hou, D.; Yamauchi, Y.; Mai, Y. Self-assembly of block copolymers towards mesoporous materials for energy storage and conversion systems. Chem. Soc. Rev. 2020, 49, 4681–4736. [Google Scholar] [CrossRef] [PubMed]
Liu, Z.; Zhu, D.; Raju, L.; Cai, W. Tackling Photonic Inverse Design with Machine Learning. Adv. Sci. 2021, 8, 2002923. [Google Scholar] [CrossRef]
Bendsoe, M.P.; Sigmund, O. Topology Optimization: Theory, Methods, and Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
Molesky, S.; Lin, Z.; Piggott, A.Y.; Jin, W.; Vucković, J.; Rodriguez, A.W. Inverse design in nanophotonics. Nat. Photonics 2018, 12, 659–670. [Google Scholar] [CrossRef] [Green Version]
Angeris, G.; Vučković, J.; Boyd, S.P. Computational Bounds for Photonic Design. ACS Photonics 2019, 6, 1232–1239. [Google Scholar] [CrossRef] [Green Version]
Liu, D.; Tan, Y.; Khoram, E.; Yu, Z. Training Deep Neural Networks for the Inverse Design of Nanophotonic Structures. ACS Photonics 2018, 5, 1365–1369. [Google Scholar] [CrossRef] [Green Version]
Qu, Y.; Zhu, H.; Shen, Y.; Zhang, J.; Tao, C.; Ghosh, P.; Qiu, M. Inverse design of an integrated-nanophotonics optical neural network. Sci. Bull. 2020, 65, 1177–1183. [Google Scholar] [CrossRef]
Jensen, J.S.; Sigmund, O. Topology optimization for nano-photonics. Laser Photonics Rev. 2011, 5, 308–321. [Google Scholar] [CrossRef]
Liu, J.; Gaynor, A.T.; Chen, S.; Kang, Z.; Suresh, K.; Takezawa, A.; Li, L.; Kato, J.; Tang, J.; Wang, C.C.L.; et al. Current and future trends in topology optimization for additive manufacturing. Struct. Multidiscip. Optim. 2018, 57, 2457–2483. [Google Scholar] [CrossRef] [Green Version]
Long, Y.; Ren, J.; Li, Y.; Chen, H. Inverse design of photonic topological state via machine learning. Appl. Phys. Lett. 2019, 114, 181105. [Google Scholar] [CrossRef]
Pilozzi, L.; Farrelly, F.A.; Marcucci, G.; Conti, C. Machine learning inverse problem for topological photonics. Commun. Phys. 2018, 1, 57. [Google Scholar] [CrossRef]
Jiang, J.; Sell, D.; Hoyer, S.; Hickey, J.; Yang, J.; Fan, J.A. Free-form diffractive metagrating design based on generative adversarial networks. ACS Nano 2019, 13, 8872–8878. [Google Scholar] [CrossRef] [Green Version]
Liu, Z.; Zhu, Z.; Cai, W. Topological encoding method for data-driven photonics inverse design. Opt. Express 2020, 28, 4825–4835. [Google Scholar] [CrossRef] [Green Version]
Kudyshev, Z.A.; Kildishev, A.V.; Shalaev, V.M.; Boltasseva, A. Machine learning–assisted global optimization of photonic devices. Nanophotonics 2020, 10, 371–383. [Google Scholar] [CrossRef]
Braham, E.J.; Davidson, R.D.; Al-Hashimi, M.; Arroyave, R.; Banerjee, S. Navigating the design space of inorganic materials synthesis using statistical methods and machine learning. Dalton Trans. 2020, 49, 11480–11488. [Google Scholar] [CrossRef]
Noh, J.; Gu, G.H.; Kim, S.; Jung, Y. Machine-enabled inverse design of inorganic solid materials: Promises and challenges. Chem. Sci. 2020, 11, 4871–4881. [Google Scholar] [CrossRef] [Green Version]
Kim, S.; Noh, J.; Gu, G.H.; Aspuru-Guzik, A.; Jung, Y. Generative Adversarial Networks for Crystal Structure Prediction. ACS Cent Sci. 2020, 6, 1412–1420. [Google Scholar] [CrossRef] [PubMed]
Dan, Y.; Zhao, Y.; Li, X.; Li, S.; Hu, M.; Hu, J. Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials. npj Comput. Mater. 2020, 6, 84. [Google Scholar] [CrossRef]
Rosales, A.R.; Wahlers, J.; Limé, E.; Meadows, R.E.; Leslie, K.W.; Savin, R.; Bell, F.; Hansen, E.; Helquist, P.; Munday, R.H.; et al. Rapid virtual screening of enantioselective catalysts using CatVS. Nat. Catal. 2018, 2, 41–45. [Google Scholar] [CrossRef]
Qin, J.; Chen, Q.; Yang, C.; Huang, Y. Research process on property and application of metal porous materials. J. Alloys Compd. 2016, 654, 39–44. [Google Scholar] [CrossRef]
Ferey, G. Materials science. The simplicity of complexity–rational design of giant pores. Science 2001, 291, 994–995. [Google Scholar] [CrossRef]
Kim, B.; Lee, S.; Kim, J. Inverse design of porous materials using artificial neural networks. Sci. Adv. 2020, 6, eaax9324. [Google Scholar] [CrossRef] [Green Version]
Yao, Z.; Sánchez-Lengeling, B.; Bobbitt, N.S.; Bucior, B.J.; Kumar, S.G.H.; Collins, S.P.; Burns, T.; Woo, T.K.; Farha, O.K.; Snurr, R.Q.; et al. Inverse design of nanoporous crystalline reticular materials with deep generative models. Nat. Mach. Intell. 2021, 3, 76–86. [Google Scholar] [CrossRef]
Wan, J.; Jiang, J.-W.; Park, H.S. Machine learning-based design of porous graphene with low thermal conductivity. Carbon 2020, 157, 262–269. [Google Scholar] [CrossRef]
Wang, T.; Zhang, C.; Snoussi, H.; Zhang, G. Machine Learning Approaches for Thermoelectric Materials Research. Adv. Funct. Mater. 2019, 30, 1906041. [Google Scholar] [CrossRef]
Recatala-Gomez, J.; Suwardi, A.; Nandhakumar, I.; Abutaha, A.; Hippalgaonkar, K. Toward Accelerated Thermoelectric Materials and Process Discovery. ACS Appl. Energy Mater. 2020, 3, 2240–2257. [Google Scholar] [CrossRef]
Zheng, B.; Yang, J.; Liang, B.; Cheng, J.-C. Inverse design of acoustic metamaterials based on machine learning using a Gauss–Bayesian model. J. Appl. Phys. 2020, 128, 134902. [Google Scholar] [CrossRef]
Ismail, M.S.; Moghavvemi, M.; Mahlia, T.M.I. Characterization of PV panel and global optimization of its model parameters using genetic algorithm. Energy Convers. Manag. 2013, 73, 10–25. [Google Scholar] [CrossRef]
Jadrich, R.B.; Lindquist, B.A.; Truskett, T.M. Probabilistic inverse design for self-assembling materials. J. Chem. Phys. 2017, 146, 184103. [Google Scholar] [CrossRef]
Forte, A.E.; Hanakata, P.Z.; Jin, L.; Zari, E.; Zareei, A.; Fernandes, M.C.; Sumner, L.; Alvarez, J.; Bertoldi, K. Inverse Design of Inflatable Soft Membranes Through Machine Learning. Adv. Funct. Mater. 2022, 2111610. [Google Scholar] [CrossRef]
Lininger, A.; Hinczewski, M.; Strangi, G. General Inverse Design of Layered Thin-Film Materials with Convolutional Neural Networks. ACS Photonics 2021, 8, 3641–3650. [Google Scholar] [CrossRef]
Jiang, R.; Da, Y.; Han, X.; Chen, Y.; Deng, Y.; Hu, W. Ultrafast Synthesis for Functional Nanomaterials. Cell Rep. Phys. Sci. 2021, 2, 100302. [Google Scholar] [CrossRef]
Dou, S.; Xu, J.; Cui, X.; Liu, W.; Zhang, Z.; Deng, Y.; Hu, W.; Chen, Y. High-Temperature Shock Enabled Nanomanufacturing for Energy-Related Applications. Adv. Energy Mater. 2020, 10, 10. [Google Scholar] [CrossRef]
Genty, G.; Salmela, L.; Dudley, J.M.; Brunner, D.; Kokhanovskiy, A.; Kobtsev, S.; Turitsyn, S.K. Machine learning and applications in ultrafast photonics. Nat. Photonics 2021, 15, 91–101. [Google Scholar] [CrossRef]
Kitchin, J.R. Machine learning in catalysis. Nat. Catal. 2018, 1, 230–232. [Google Scholar] [CrossRef]

Figure 1. Schematic of the different approaches toward molecular design.

Figure 2. Strategies for inverse design of materials.

Figure 3. Schematic diagram of positive and unlabeled learning, reprinted with the permission from [43].

Figure 4. Structure of a simple genetic algorithm.

Figure 5. DL-based algorithms for GMs. (Reprinted with the permission from [41]).

Figure 6. Study framework of polymer cloud-point engineering via machine learning inverse design, Reprinted with the permission from [55].

Figure 7. Inverse design methods for polymer-based dielectrics. (a) Enumeration method. (b) Active learning algorithm. (c,d) Genetic algorithm method used to design polymers with high glass transition temperature and large bandgap. (e) Inverse design method based on particle swarm optimization. x* refers to polymer design and (y*) refers to desired cloud-point. (f) Variational autoencoder (VAE). (g) VAE used to discover polymers with high Tg and bandgap. (h) Generative adversarial networks. Reprinted with the permission from [60].

Figure 8. Methodologies of photonic design through machine learning at different degrees of freedoms (DOFs), reprinted with the permission from [64].

Figure 9. Schematic depiction of an example of a machine-learning workflow for the iterative exploration and exploitation of a synthetic design space for inorganic materials, reprinted with the permission from [78].

Table 1. Other advanced materials inverse design by machine learning.

Materials/Molecules	Methodology	Target	Reference
Acoustic metamaterials	Gauss-Bayesian model	Specific functionalities	[90]
Photovoltaic	GA using developed MATLAB code	Voltage-current relation of the PV module.	[91]
Organic molecules	RNN	Relation between molecular structures and their material properties	[49]
Self-assembling materials	statistical mechanics based approach	Complex microstructures	[92]
Soft membranes	Neural network	3D shapes starting from 2D planar composite membranes	[93]
Thin-film materials	Neural networks	Relationships between the metamaterial structure and corresponding ellipsometric and reflectance/transmittance spectra	[94]
Colloidal crystals	Alchemical Monte Carlo simulation	Geometric shape structure	[52]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, J.; Wang, Y.; Chen, Y. Inverse Design of Materials by Machine Learning. Materials 2022, 15, 1811. https://doi.org/10.3390/ma15051811

AMA Style

Wang J, Wang Y, Chen Y. Inverse Design of Materials by Machine Learning. Materials. 2022; 15(5):1811. https://doi.org/10.3390/ma15051811

Chicago/Turabian Style

Wang, Jia, Yingxue Wang, and Yanan Chen. 2022. "Inverse Design of Materials by Machine Learning" Materials 15, no. 5: 1811. https://doi.org/10.3390/ma15051811

APA Style

Wang, J., Wang, Y., & Chen, Y. (2022). Inverse Design of Materials by Machine Learning. Materials, 15(5), 1811. https://doi.org/10.3390/ma15051811

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Inverse Design of Materials by Machine Learning

Abstract

1. Introduction

2. Inverse Design

2.1. High Throughput Virtual Screening (HTVS)

2.2. Global Optimization (GO)

2.3. Generative Models (GM)

3. Application in Materials Design

3.1. Polymers

3.2. Photonic

3.3. Inorganic Solid-State Functional Materials

3.4. Porous Materials

3.5. Other Materials

4. Challenges and Opportunities

Author Contributions

Funding

Informed Consent Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI