Next Article in Journal
An Optimized Strain-Compensated Arrhenius Constitutive Model of GH4169 Superalloy Based on Hot Compression
Next Article in Special Issue
Inverse Method to Determine Parameters for Time-Dependent and Cyclic Plastic Material Behavior from Instrumented Indentation Tests
Previous Article in Journal
A Comparative Evaluation of Powder Characteristics of Recycled Material from Bronze Grinding Chips for Additive Manufacturing
Previous Article in Special Issue
Interpretation of Frequency Effect for High-Strength Steels with Three Different Strength Levels via Crystal Plasticity Finite Element Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Crystal Plasticity Parameter Optimization in Cyclically Deformed Electrodeposited Copper—A Machine Learning Approach

by
Karol Frydrych
1,2,
Maciej Tomczak
1 and
Stefanos Papanikolaou
1,*
1
NOMATEN Centre of Excellence, National Centre for Nuclear Research, Sołtana 7, 05-400 Otwock, Poland
2
Institute of Fundamental Technological Research, Polish Academy of Sciences, Pawińskiego 5b, 02-106 Warsaw, Poland
*
Author to whom correspondence should be addressed.
Materials 2024, 17(14), 3397; https://doi.org/10.3390/ma17143397
Submission received: 29 May 2024 / Revised: 3 July 2024 / Accepted: 6 July 2024 / Published: 9 July 2024

Abstract

:
This paper describes an application of a machine learning approach for parameter optimization. The method is demonstrated for the elasto-viscoplastic model with both isotropic and kinematic hardening. It is shown that the proposed method based on long short-term memory networks allowed a reasonable agreement of stress–strain curves to be obtained for cyclic deformation in a low-cycle fatigue regime. The main advantage of the proposed approach over traditional optimization schemes lies in the possibility of obtaining parameters for a new material without the necessity of conducting any further optimizations. As the power and robustness of the developed method was demonstrated for very challenging problems (cyclic deformation, crystal plasticity, self-consistent model and isotropic and kinematic hardening), it is directly applicable to other experiments and models.

1. Introduction

Constitutive models enable the prediction of the behavior of materials subjected to external loadings from small [1] to large [2] length scales. In some straightforward problems, the constitutive models are simple, and their parameters are relatively easy to establish using standard experiments. For example, establishment of the Young’s modulus of an isotropic, linearly elastic material is unambiguous using just the result of a uniaxial tension test. However, in more complex situations, the constitutive models tend to use multiple non-linear equations and many parameters in order to capture the material’s behavior. Such a situation is present, e.g., in crystal plasticity, which is the topic of this paper.
The crystal plasticity (CP) theory in its most standard form accounts for the dislocation glide on strictly defined crystallographic planes. Therefore, the information about orientations of crystallites in a polycrystalline microstructure of a given material has to be provided. Besides taking into account these microstructural details, the hardening equations of crystal plasticity are typically similar to those of conventional plasticity and typically account for isotropic and kinematic hardening. Various optimization algorithms such as gradient optimization [3], the Newton–Raphson algorithm [4], the Levenberg–Marquardt method [5], Bayesian optimization [6], particle swarm optimization [7] and evolutionary algorithms [8,9,10,11,12,13,14,15,16,17,18,19] are used in order to establish the correct set of material parameters (see the introduction in [19] for a thorough discussion). However, all of them share a basic disadvantage: one has to repeat the optimization in the case of a new material or new test result.
In order to overcome this limitation, Wang et al. [20] developed a neural-network-based methodology for parameter optimization. The idea behind the approach is performing the training of a neural network (NN) once and then building the database of predicted outputs using the trained NN. The created database is used to predict the set of optimal parameters based on the provided input. Both the neural network and the database can be stored, and thus the main advantage of the proposed approach is that one does not need to perform a new optimization to obtain the parameters of a different material. A similar idea was presented in [21]. The main difference with regard to [20] is that machine learning (ML) was used only to establish reasonable parameter bounds, and then fine-tuning was performed using a genetic algorithm. Yet another approach to parameter optimization using ML was presented in [22], where the parameters of the model were found by training the NN to solve the inverse problem. This approach is clearly different from both [20,21] and it was demonstrated to work well for a model containing two parameters. Determining whether such an approach can work also for models with more parameters deserves further study, but it has to be stressed here that solving the inverse problem in the case of a highly non-linear problem such as the one presented here can be problematic.
In this article, we propose to perform the optimization for the case of the elasto-viscoplastic self-consistent (EVPSC) crystal plasticity model [14]. The investigated case is that of electrodeposited copper subjected to cyclic elasto-plastic deformation. We demonstrate that the proposed method is robust enough in order to provide correct predictions even in the highly complex problem as the one considered.
We build on the idea of using machine learning for the optimization of constitutive model parameters as described in [20]. However, there are important differences with regard to op. cit. First of all, we studied the case of a two-scale elasto-viscoplastic model, while the single-scale viscoelastic model was used in [20]. Second, we applied the LSTM approach directly to cyclic plasticity model data, while deep neural networks applied to data preprocessed using singular value decomposition technique were used in op. cit. Finally, single loading–unloading was considered in [20], while up to 50 cycles were simulated in the current study.
The article is structured as follows. After this introductory section, we describe the methodology (Section 2). This includes the description of the EVPSC model (Section 2.1) that was already thoroughly described in [14,23] and machine learning methodology (Section 2.2). These are followed by the optimization results using two approaches for feature extraction from stress–strain curves (Section 3). The discussion (Section 4) is related to the applicability of the proposed method to other non-obvious tasks. The paper ends with conclusions (Section 5).

2. Methodology

This section contains two subsections. The first one (Section 2.1) covers the description of the constitutive model together with the corresponding micromechanical scheme, which was used to provide input data for training the machine learning framework. The second one (Section 2.2) describes the machine learning approach applied for the purpose of parameter optimization in this paper.

2.1. The EVPSC Model

The sequential elasto-viscoplastic self-consistent (SEVPSC) code was recently developed and validated for the case of electrodeposited copper films [14] and is based on the sequential linearization scheme by [23]. (The code was written in FORTRAN and will be shared upon reasonable request. Researchers willing to use the code should contact Karol Frydrych.) The scheme coincides with the additive tangent approach [24] for the case of the Mori–Tanaka scheme. The results for the case of the self-consistent case were presented in [14,25]. Details of the self-consistent formulation can be found in the aforementioned papers. Nevertheless, some equations essential for the understanding of the present paper are provided in the following.
The strain rate, stress rate and stress have to fulfill the conditions stating that the average response over the polycrystal is consistent with the corresponding macroscopic values:
ε ¯ ˙ = ε ˙ g , σ ¯ ˙ = σ ˙ g , σ ¯ = σ g .
In order to account for the interaction of the grain level and overall quantities, the additive tangent interaction law proposed by [26] or [27] is adopted:
ε ˙ g ε ¯ ˙ = M ¯ e · ( σ ˙ g σ ¯ ˙ ) M ¯ v · ( σ g σ ¯ ) ,
where ε ¯ ˙ , σ ¯ ˙ and σ ¯ denote the overall strain rate, stress rate and stress. In Equation (2), M ¯ v and M ¯ e are the fourth-order inverse Hill tensors for a purely viscous problem and a purely elastic problem, respectively.
The parameters to be established with the proposed ML methodology concern the isotropic and kinematic hardening at the level of individual grain. In the following, the formulation of the single grain (subscript g) constitutive model is provided. The total strain rate is calculated by adding elastic ε ˙ g e and viscoplastic strain rates ε ˙ g v (the current version of the model is formulated in the small-strain theory):
ε ˙ g = ε ˙ g e + ε ˙ g v
In order to account for the elastic part, the anisotropic Hooke’s law is used:
σ ˙ g = A g e · ε ˙ g e or ε ˙ g e = M g e · σ ˙ g ,
where A g e is the fourth-order elastic stiffness tensor. The associated compliance tensor is M g e = ( A g e ) 1 .
We use the rate-dependent CP formulation for a constitutive description at the single grain level. The plastic strain rate tensor at this level is calculated as a sum of shears on slip systems:
ε ˙ g v = r = 1 N γ ˙ r , g P r , g , where P r , g = 1 2 ( m r , g n r , g + n r , g m r , g )
where γ ˙ r , g is the slip rate, and { m r , g , n r , g } are the slip direction and the normal slip plane unit, respectively. Superscripts r and g denote a given slip systems and grain, respectively. P r , g is the symmetric Schmid tensor. The resolved shear stress (RSS) is a projection of the Cauchy stress tensor on the given slip system:
τ r , g = P r , g : σ g .
The slip rate depends on the RSS as well as on the slip system backstress χ r , g and the critical resolved shear stress (CRSS) τ c r , g [28]:
γ ˙ r , g = γ ˙ 0 τ r , g χ r , g τ c r , g n sign ( τ r , g χ r , g ) .
Note that in Equation (7), both isotropic and kinematic hardening are included. The other quantities are the reference slip rate γ ˙ 0 and the inverse of the strain rate sensitivity n. The tangent viscous compliance tensor is equal to a derivative of viscoplastic strain rate over the stress and thus equals:
M g v = n γ ˙ 0 r = 1 N τ r , g χ r , g τ c r , g n 1 sign ( τ r , g χ r , g ) τ c r , g P r , g P r , g .
The evolution of the CRSS and backstress are governed by isotropic and kinematic hardening laws, respectively. The linear–exponential isotropic hardening evolution appears as follows:
τ c r , g = τ c 0 + h 1 γ g + τ sat τ c 0 1 e h b γ g ,
where τ c 0 , τ sat , h 1 , h b are material parameters of the isotropic hardening and γ g = 0 t γ ˙ g d t is the accumulated shear in the grain. The Ohno–Wang kinematic hardening law [29] reads:
χ ˙ r , g = h k γ ˙ r , g h kb γ ˙ r , g χ r , g h kb h k h m ,
where h k , h kb and h m denote the kinematic hardening parameters. We have thus 4 isotropic hardening and 3 kinematic hardening parameters, which yields in total 7 hardening parameters subjected to optimization; cf. [14].

2.2. Machine Learning

Long short-term memory (LSTM) networks are a type of recurrent neural networks (RNNs) [30]. RNNs themselves are a type of artificial neural networks (ANNs) that are specifically designed for processing sequential data. The key difference of RNNs, as compared to traditional feedforward neural networks, is presence of connections that allow outputs to affect subsequent inputs of the same node in the network. The influence of history-related information on the current outputs can be thus considered. Therefore, RNNs were applied in areas where both order and context are of importance, e.g., language modeling and speech recognition [31,32]. Contrary to linear elasticity, models of plasticity are well known to be history- and path-dependent [33,34,35,36,37,38,39,40] and thus are naturally suitable for treatment with RNNs [41]. This is even more important in the case of cyclic loading, which is considered here [42]. Capturing time dependency is possible in RNNs through the hidden states that are updated at each time step when a new input is processed. The possible deficiencies of RNNs include vanishing or exploding gradients. Such problems were partially mitigated in more advanced variants such as LSTM (applied here) and Gated Recurrent Units (GRUs). These improvements brought the RNNs to a new level, enabling them to be a key tool in sequence modeling.
LSTM model architecture was selected because of its proven performance in other sequential prediction tasks, such as speech recognition and low parameter count when compared to fully connected networks, although as outlined in [43], there is no clear answer as to whether LSTM or GRUs are better at solving a given task. As compared to traditional RNNs, LSTM networks have a more complex architecture (cf. Figure 1). In particular, the LSTM cell contains several gates, such as the forget gate f, input gate i, cell gate g and output gate o. The presence of gates allows for the flow of information to be regulated; that is, the gates control which information is added to or removed from the cell state, enabling the network to maintain and access long-term dependencies more efficiently. Our implementation is a unidirectional LSTM as defined in PyTorch 2.0 [44] with a hidden state vector length of 2048. All three curves from the dataset are generated simultaneously by the network. The output is a tensor whose shape depends on the number of predicted values per cycle and number of cycles. Note that the term tensor in this subsection does not match the definition of tensor from the previous subsection and refers only to an understanding of this term in the machine learning jargon.
Two ways of evaluating the fitness of a given parameter set were tested. The first was exactly the same as in [14], i.e., the stress values at the points where loading changes its direction were compared; cf. Figure 2. This approach will be from now on termed App1. In this case, the shape of the output tensor is 2-by-100 (2 values in 2 points for each of the 50 cycles). In the second approach (App2), the stress–strain curves for each example were replaced with 3 simpler curves: stress amplitude A, mean stress σ ¯ and the slope of each cycle α in order to simplify the prediction task; cf. Figure 3. In this second case, the shape of the output tensor is 3-by-50. The input to the network is the same vector of seven parameters repeated 200 or 150 times (depending on the approach) to match the desired output sequence length.
Both input and output data were normalized to improve training stability by eliminating large differences in the scale of data. The model was trained for 300 epochs with a batch size of 16 and an initial learning rate of 1 × 10−3. The learning rate was decayed for each batch with a cosine annealing schedule. Warm restarts were applied after the first 50 epochs and after doubling the previous learning rate decay period after that until the end of training [46]. A weight decay of 1e-5 was used for regularization. The Adam optimization algorithm was applied to train the network with mean squared error as a loss function. The architecture of the network is shown in Figure 4.

3. Results

The method was applied as follows. First, the database of SEVPSC code results was created by generating 36370 loading–unloading curves. Each result was produced by running the SEVPSC code for the same loading scheme but with a different set of parameters. The parameters were generated by selecting every possible combination of parameters from a list of values. The list for each parameter was generated by selecting five equally spaced values from a set range (cf. Table 1). Then, part of the database was used for training the LSTM network, while the remaining part was moved to the test set (the ratio of training set size to the whole set size was equal to 60%). The MSE obtained in both approaches is shown in Figure 5.
The next step was thus using the trained LSTM network for parameter optimization. The approach consists of using the trained LSTM to generate predictions in points lying in the parameter space that are more densely distributed than the set used for training. For this, 35.8 million results were generated. Then, for the reference case (obtained by running the SEVPSC code with random parameters), the closest result in the LSTM-generated database is searched by calculating the mean squared error (MSE) between reference case curves and generated examples and selecting the generated example with the lowest MSE. We have tested the method for 30 different parameter sets. The six representative parameter sets obtained using this technique are shown in Table 2. The corresponding stress–strain curves obtained using the optimized parameters compared against the reference ones are shown in Figure 6. The entire group of parameter sets is presented in the Supplementary Material. We divided the results into six categories:
1.
Very good or reasonable agreement of SS curves obtained using parameters optimized in both approaches—Figure 6a and Supplementary Figure S1;
2.
Disagreement in the first cycle and reasonable agreement of SS curves obtained using parameters optimized in both approaches—Figure 6b and Supplementary Figure S2;
3.
Reasonable agreement of SS curves obtained using parameters optimized in App 1 (lack of convergence for App 2 parameters)—Figure 6c and Supplementary Figure S3;
4.
Reasonable agreement of SS curves obtained using parameters optimized in App 2 (lack of convergence for App 1 parameters)—Figure 6d and Supplementary Figure S4;
5.
Striking disagreement or lack of convergence—Figure 6e and Supplementary Figure S5;
6.
Lack of convergence for the optimized parameters in both approaches—Figure 6f and Supplementary Figure S6.
Out of 30 cases examined, 13 cases show very good or reasonable agreement in both approaches (category 1). Three cases show reasonable agreement in late cycles but disagreement in the first cycle (category 2). It can be thus stated that more than half of the cases the method worked correctly with both approaches. In four cases, App 1 provided the correct prediction, while App 2 did not converge (category 3). The reverse situation (category 4) was present in another four cases. Therefore, in 24 cases (80%), at least one approach provided a reasonably correct set of parameters. Concerning category 5, in one case, both approaches provided parameter sets, leading to striking disagreement in SS curves (Figure 6e and Figure S5b). In another case (cf. Supplementary Figure S5a), there was striking disagreement for App 2 and a lack of convergence for App 1. Finally, in four cases, neither approach provided a parameter set that can converge in actual SEVPSC simulation (category 6).
Let us now take a closer look on the obtained results. In terms of the approaches, it seems that both of them are balanced, and we cannot argue that either App1 or App2 is better. Cases where the predicted SS curve agrees with the actual one (Category 1, cf. Figure 6a and S1) prove the potential of the method. The fact that almost half of cases show only slight deviations in the shape of the curve in the first cycle is actually slightly surprising as the information about the shape of the first cycle was absent in both methods. It is thus easy to understand why three other cases (Category 2, cf. Figure 6b and Figure S2) agree with the reference curve in the cyclic deformation regime while having some deviation in the first cycle. Moreover, these three cases can be treated as correct predictions since the LSTM had no information about the shape of the initial loading curve. In the rest of the cases (Categories 3–6), there was either agreement or a lack of convergence, except for the curve shown in Figure 6e. Cases where the simulations with optimized parameters fail to converge are not useful but also harmless. Thus, only 1 case out of 30 led to a misleading prediction.

4. Discussion

The trained network was used for parameter identification. But it could be also used as a surrogate model to obtain the same overall information in a far shorter time. Such an approach was already presented in a number of papers; e.g., in [47], the surrogate ML model was trained using data from discrete dislocation dynamics. The network was able to predict the stress–strain curve based on the initial dislocation configuration. On the other hand, Deshpande, Lengiewicz and Bordas [48] trained their deep learning (DL) network based on large deformation finite element method (FEM) simulations. Such an approach made it possible to predict the deformed shape based on the initial specimen geometry and applied boundary conditions.
Many surrogate models were also constructed in the framework of crystal plasticity (CP). A surrogate model for the viscoplastic self-consistent (VPSC) model was built in [49]. The researchers developed both forward and backward ML configurations. The first made it possible to predict the final crystallographic texture and stress–strain curve based on the initial texture and hardening parameters. The second enabled the prediction of the initial texture and hardening parameters based on the final texture and stress–strain curve. A neural network allowing the prediction of the stress–strain curve and texture but this time trained using the crystal plasticity finite element method (CPFEM) was presented in [50]. CPFEM-trained ML architecture was used to predict cyclic stress values in low-cycle-fatigue experiments in [51]. An interesting contribution was presented in [52], where the ML was trained on an elastic–viscoplastic fast Fourier transform (EVPFFT) model, and the prediction concerned spatially resolved crystallographic orientations. A novelty in [53] was the application of CP-trained ML to non-monotonic strain paths. In [54], it was demonstrated that using an encoder–decoder deep learning framework, it is possible to regenerate stress–strain curves for a material subjected to several loading–unloading cycles.
We hereby stress that even though our approach of using the ML architecture for parameter optimization could be used as a surrogate model, it is currently clearly different from the aforementioned approaches. In our case, the main purpose of training the ML network was to find parameters for the physical model (EVPSC), which still serves as a base of all the simulations provided. Moreover, constructing a surrogate model that estimates only the stress–strain points at loading direction changes (App1) or features of each cycle (App2) would be pointless. Even if the surrogate model is able to predict the full stress–strain curve (and our trained ML framework is not), it would still reduce the wealth of information that can be attained from the EVPSC model. As pointed out in [14], the main advantages of the EVPSC model are its capabilities to account not only for the global response, but also to give insight into details of the plastic deformation occurring at every grain. Such information would be lost in the case of constructing the surrogate model that returns only the average quantities.

5. Conclusions

This paper demonstrates the possibility of using a machine learning framework based on LSTM to efficiently optimize material parameters in a highly complex situation of modeling the cyclic deformation of copper films using the elasto-viscoplastic self-consistent model with isotropic and kinematic hardening. One can conclude that the method is able to provide a reasonably accurate estimate of the parameter set that in most cases provides the correct cyclic stress–strain.
A significant advantage of the proposed machine-learning-based method over optimization algorithms is that in the case of a different material subjected to the same loading conditions, one does not need to perform any other simulations to obtain the optimal parameters. One could argue that a similar result could be obtained using only a sufficient database. Using such a database instead of the machine learning approach, however, has two disadvantages. First, in order to achieve an accuracy comparable to the one attained here, one should perform about 10 thousand times more simulations (because the number of simulations used for training was 3670 and the number of data points generated by trained ML framework was 35.8 million). Second, in order to use the database for future purposes, one would have to store the whole database of results (which would be 35.8 million times 977 kB = 35 TB), while in our case it is enough to store the trained model (which would be just 50.8 MB) and when necessary regenerate the database with an arbitrarily large or small number of example inputs to achieve a desired database size.
In this paper, the LSTM-based architecture was applied, as some advantages of LSTM over more traditional recurrent units were demonstrated [43]. However, op. cit. also stated that the results of comparison between LSTM and GRUs are not conclusive and shall depend on the application. Moreover, new architectures specifically designed for mechanics problems were proposed; cf., e.g., the minimal state cells (MSCs) [55]. In the future, it will be very interesting to perform detailed studies where the efficiency of various architectures, like LSTM, GRUs, and MSCs (and possibly others) is compared. This task is, however, surely outside the scope of the present contribution.
Of course, the developed approach is not able to provide the material parameters based on different loading conditions than were used as an input. Therefore, in such a case, one would have to repeat the optimization. The method was demonstrated for a highly challenging problem of cyclic deformation simulated with the micromechanical model accounting for both isotropic and kinematic hardening. It is expected that the approach will be directly applicable to other cases. In particular, one can naturally apply the developed methodology to a case of studying low and high cycle fatigue with plasticity and damage models; cf., e.g., [56].

Supplementary Materials

The following supporting information can be downloaded at: Available online: https://www.mdpi.com/article/10.3390/ma17143397/s1, Figure S1: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 1: Very good or reasonable agreement of SS curves obtained using parameters optimized in both approaches); Figure S2: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 2: Disagreement in the first cycle and reasonable agreement of SS curves in further cycles obtained using parameters optimized in both approaches; Figure S3: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 3: Reasonable agreement of SS curves obtained using parameters optimized in App 1 (lack of convergence for App 2 parameters); Figure S4: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 4: Reasonable agreement of SS curves obtained using parameters optimized in App 2 (lack of convergence for App 1 parameters); Figure S5: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 5: Striking disagreement or lack of convergence; Figure S6: The results of using neural networks for optimization: stress-strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbours—Category 6: Lack of convergence for the optimized parameters in both approaches; Table S1: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 1: Very good or reasonable agreement of SS curves obtained using parameters optimized in both approaches); Table S2: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 2: Disagreement in the first cycle and reasonable agreement of SS curves in further cycles obtained using parameters optimized in both approaches; Table S3: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 3: Reasonable agreement of SS curves obtained using parameters optimized in App 1 (lack of convergence for App 2 parameters); Table S4: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 4: Reasonable agreement of SS curves obtained using parameters optimized in App 2 (lack of convergence for App 1 parameters); Table S5: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 5: Striking disagreement or lack of convergence; Table S6: Reference parameters vs parameters that made LSTM generate closest matching curves—Category 6: Lack of convergence for the optimized parameters in both approaches.

Author Contributions

Conceptualization, S.P.; Methodology, K.F., M.T. and S.P.; Software, M.T.; Validation, K.F. and M.T.; Formal analysis, K.F. and M.T.; Investigation, K.F. and S.P.; Resources, S.P.; Data curation, M.T.; Writing—original draft, K.F.; Writing—review & editing, M.T. and S.P.; Supervision, S.P.; Project administration, S.P.; Funding acquisition, S.P. All authors have read and agreed to the published version of the manuscript.

Funding

The publication was created within the framework of the project of the Minister of Science and Higher Education “Support for the activities of Centres of Excellence established in Poland under Horizon 2020” under contract no. MEiN/2023/DIR/3795.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhao, Y.; Chen, X.; Park, C.; Fay, C.C.; Stupkiewicz, S.; Ke, C. Mechanical deformations of boron nitride nanotubes in crossed junctions. J. Appl. Phys. 2014, 115, 164305. [Google Scholar] [CrossRef]
  2. Kacprzak, G.; Zbiciak, A.; Józefiak, K.; Nowak, P.; Frydrych, M. One-Dimensional Computational Model of Gyttja Clay for Settlement Prediction. Sustainability 2023, 15, 1759. [Google Scholar] [CrossRef]
  3. Ganesan, S.; Yaghoobi, M.; Githens, A.; Chen, Z.; Daly, S.; Allison, J.E.; Sundararaghavan, V. The effects of heat treatment on the response of WE43 Mg alloy: Crystal plasticity finite element simulation and SEM-DIC experiment. Int. J. Plast. 2021, 137, 102917. [Google Scholar] [CrossRef]
  4. Guery, A.; Hild, F.; Latourte, F.; Roux, S. Identification of crystal plasticity parameters using DIC measurements and weighted FEMU. Mech. Mater. 2016, 100, 55–71. [Google Scholar] [CrossRef]
  5. Cruzado, A.; LLorca, J.; Segurado, J. Modeling cyclic deformation of inconel 718 superalloy by means of crystal plasticity and computational homogenization. Int. J. Solids Struct. 2017, 122, 148–161. [Google Scholar] [CrossRef]
  6. Kuhn, J.; Spitz, J.; Sonnweber-Ribic, P.; Schneider, M.; Böhlke, T. Identifying material parameters in crystal plasticity by Bayesian optimization. Optim. Eng. 2021, 23, 1489–1523. [Google Scholar] [CrossRef]
  7. Hu, L.; Jiang, S.Y.; Zhang, Y.Q.; Zhu, X.M.; Sun, D. Texture evolution and inhomogeneous deformation of polycrystalline Cu based on crystal plasticity finite element method and particle swarm optimization algorithm. J. Cent. South Univ. 2017, 24, 2747–2756. [Google Scholar] [CrossRef]
  8. Skippon, T.; Mareau, C.; Daymond, M.R. On the determination of single-crystal plasticity parameters by diffraction: Optimization of a polycrystalline plasticity model using a genetic algorithm. J. Appl. Crystallogr. 2012, 45, 627–643. [Google Scholar] [CrossRef]
  9. Acar, P.; Ramazani, A.; Sundararaghavan, V. Crystal plasticity modeling and experimental validation with an orientation distribution function for ti-7al alloy. Metals 2017, 7, 459. [Google Scholar] [CrossRef]
  10. Cauvin, L.; Raghavan, B.; Bouvier, S.; Wang, X.; Meraghni, F. Multi-scale investigation of highly anisotropic zinc alloys using crystal plasticity and inverse analysis. Mater. Sci. Eng. A 2018, 729, 106–118. [Google Scholar] [CrossRef]
  11. Kapoor, K.; Sangid, M.D. Initializing type-2 residual stresses in crystal plasticity finite element simulations utilizing high-energy diffraction microscopy data. Mater. Sci. Eng. A 2018, 729, 53–63. [Google Scholar] [CrossRef]
  12. Sedighiani, K.; Diehl, M.; Traka, K.; Roters, F.; Sietsma, J.; Raabe, D. An efficient and robust approach to determine material parameters of crystal plasticity constitutive laws from macro-scale stress–strain curves. Int. J. Plast. 2020, 134, 102779. [Google Scholar] [CrossRef]
  13. Frydrych, K.; Maj, M.; Urbański, L.; Kowalczyk-Gajewska, K. Twinning-induced anisotropy of mechanical response of AZ31B extruded rods. Mater. Sci. Eng. A 2020, 771, 138610. [Google Scholar] [CrossRef]
  14. Girard, G.; Frydrych, K.; Kowalczyk-Gajewska, K.; Martiny, M.; Mercier, S. Cyclic response of electrodeposited copper films. Experiments versus elastic-viscoplastic mean-field approach predictions. Mech. Mater. 2021, 153, 1–17. [Google Scholar] [CrossRef]
  15. Savage, D.J.; Feng, Z.; Knezevic, M. Identification of crystal plasticity model parameters by multi-objective optimization integrating microstructural evolution and mechanical data. Comput. Methods Appl. Mech. Eng. 2021, 379, 113747. [Google Scholar] [CrossRef]
  16. Frydrych, K.; Kowalczyk-Gajewska, K.; Libura, T.; Kowalewski, Z.; Maj, M. On the role of slip, twinning and detwinning in magnesium alloy AZ31b sheet. Mater. Sci. Eng. A 2021, 813, 141152. [Google Scholar] [CrossRef]
  17. Frydrych, K.; Jarzebska, A.; Virupakshi, S.; Kowalczyk-Gajewska, K.; Bieda, M.; Chulist, R.; Skorupska, M.; Schell, N.; Sztwiertnia, K. Texture-Based Optimization of Crystal Plasticity Parameters: Application to Zinc and Its Alloy. Metall. Mater. Trans. A 2021, 52, 3257–3273. [Google Scholar] [CrossRef]
  18. Frydrych, K. Texture evolution of magnesium alloy AZ31B subjected to severe plastic deformation. Eng. Trans. 2021, 69, 337–352. [Google Scholar]
  19. Frydrych, K.; Papanikolaou, S. Unambiguous Identification of Crystal Plasticity Parameters from Spherical Indentation. Crystals 2022, 12, 1341. [Google Scholar] [CrossRef]
  20. Wang, J.; Zhu, B.; Hui, C.Y.; Zehnder, A.T. Determination of material parameters in constitutive models using adaptive neural network machine learning. J. Mech. Phys. Solids 2023, 177, 105324. [Google Scholar] [CrossRef]
  21. Schulte, R.; Karca, C.; Ostwald, R.; Menzel, A. Machine learning-assisted parameter identification for constitutive models based on concatenated loading path sequences. Eur. J. Mech.-A/Solids 2023, 98, 104854. [Google Scholar] [CrossRef]
  22. Pogorelko, V.; Mayer, A.; Fomin, E.; Fedorov, E. Examination of machine learning method for identification of material model parameters. Int. J. Mech. Sci. 2024, 265, 108912. [Google Scholar] [CrossRef]
  23. Kowalczyk-Gajewska, K.; Petryk, H. Sequential linearization method for viscous/elastic heterogeneous materials. Eur. J. Mech. Solids/A 2011, 30, 650–664. [Google Scholar] [CrossRef]
  24. Mercier, S.; Molinari, A. Homogenization of elastic-viscoplastic heterogeneous materials: Self-consistent and Mori-Tanaka schemes. Int. J. Plast. 2009, 25, 1024–1048. [Google Scholar] [CrossRef]
  25. Kowalczyk-Gajewska, K. Micromechanical Modelling of Metals and Alloys of High Specific Strength. IFTR Reports 1/2011. 2011, pp. 1–299. Available online: https://rcin.org.pl/Content/32190/PDF/WA727_9322_56416-1-2011_Micromechanical.pdf (accessed on 5 July 2024).
  26. Molinari, A.; Ahzi, S.; Kouddane, R. On the self-consistent modelling of elastic-plastic behavior of polycrystals. Mech. Mater. 1997, 26, 43–62. [Google Scholar] [CrossRef]
  27. Molinari, A. Averaging Models for heterogeneous viscoplastic and elastic viscoplastic materials. J. Eng. Mater. Technol. 2002, 124, 62–70. [Google Scholar] [CrossRef]
  28. Hennessey, C.; Castelluccio, G.M.; McDowell, D.L. Sensitivity of polycrystal plasticity to slip system kinematic hardening laws for Al 7075-T6. Mater. Sci. Eng. A 2017, 687, 241–248. [Google Scholar] [CrossRef]
  29. Ohno, N.; Wang, J.D. Kinematic hardening rules with critical state of dynamic recovery, part I: Formulation and basic features for ratchetting behavior. Int. J. Plast. 1993, 9, 375–390. [Google Scholar] [CrossRef]
  30. Sak, H.; Senior, A.; Beaufays, F. Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition. arXiv 2014, arXiv:1402.1128. [Google Scholar]
  31. Yao, K.; Zweig, G.; Hwang, M.Y.; Shi, Y.; Yu, D. Recurrent neural networks for language understanding. In Proceedings of the Interspeech 2013, Lyon, France, 25–29 August 2013; pp. 2524–2528. [Google Scholar]
  32. Shewalkar, A.; Nyavanandi, D.; Ludwig, S.A. Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU. J. Artif. Intell. Soft Comput. Res. 2019, 9, 235–245. [Google Scholar] [CrossRef]
  33. El-Danaf, E.; Kalidindi, S.R.; Doherty, R.D. Influence of deformation path on the strain hardening behavior and microstructure evolution in low SFE fcc metals. Int. J. Plast. 2001, 17, 1245–1265. [Google Scholar] [CrossRef]
  34. Peeters, B.; Seefeldt, M.; Teodosiu, C.; Kalidindi, S.; Van Houtte, P.; Aernoudt, E. Work-hardening/softening behaviour of b.c.c. polycrystals during changing strain paths: I. An integrated model based on substructure and texture evolution, and its prediction of the stress-strain behaviour of an IF steel during two-stage strain paths. Acta Mater. 2001, 49, 1607–1619. [Google Scholar] [CrossRef]
  35. Beyerlein, I.J.; Tomè, C.N. Modeling transients in the mechanical response of copper due to strain path changes. Int. J. Plast. 2007, 23, 640–664. [Google Scholar] [CrossRef]
  36. Petryk, H.; Stupkiewicz, S. Modelling of microstructure evolution on complex paths of large plastic deformation. Int. J. Mat. Res. 2014, 103, 271–277. [Google Scholar] [CrossRef]
  37. Frydrych, K.; Kowalczyk-Gajewska, K.; Stupkiewicz, S. Modelling of microstructure evolution in hcp polycrystals on non-proportional strain paths. In Proceedings of the 39th Solid Mechanics Conference, Zakopane, Poland, 1–5 September 2014. [Google Scholar]
  38. Kowalczyk-Gajewska, K.; Stupkiewicz, S.; Frydrych, K.; Petryk, H. Modelling of Texture Evolution and Grain Refinement on Complex SPD Paths. IOP Conf. Ser. Mater. Sci. Eng. 2014, 63, 012040. [Google Scholar] [CrossRef]
  39. Hama, T.; Nagao, H.; Kobuki, A.; Fujimoto, H.; Takuda, H. Work-hardening and twinning behaviors in a commercially pure titanium sheet under various loading paths. Mater. Sci. Eng. A 2015, 620, 390–398. [Google Scholar] [CrossRef]
  40. Kowalczyk-Gajewska, K.; Sztwiertnia, K.; Kawałko, J.; Wierzbanowski, K.; Wronski, K. Frydrych, K.; Stupkiewicz, S.; Petryk, H. Texture evolution in titanium on complex deformation paths: Experiment and modelling. Mater. Sci. Eng. A 2015, 637, 251–263. [Google Scholar] [CrossRef]
  41. Heidenreich, J.N.; Mohr, D. Recurrent neural network plasticity models: Unveiling their common core through multi-task learning. Comput. Methods Appl. Mech. Eng. 2024, 426, 116991. [Google Scholar] [CrossRef]
  42. Wu, L.; Kilingar, N.G.; Noels, L.; Nguyen, V.D. A recurrent neural network-accelerated multi-scale model for elasto-plastic heterogeneous materials subjected to random cyclic and non-proportional loading paths. Comput. Methods Appl. Mech. Eng. 2020, 369, 113234. [Google Scholar] [CrossRef]
  43. Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
  44. PyTorch Contributors. PyTorch LSTM Description. Available online: https://pytorch.org/docs/stable/generated/torch.nn.LSTM.html#torch.nn.LSTM (accessed on 6 November 2023).
  45. Ley, A.; Bormann, H.; Casper, M. Intercomparing LSTM and RNN to a conceptual hydrological model for a low-land river with a focus on the flow duration curve. Water 2023, 15, 505. [Google Scholar] [CrossRef]
  46. Loshchilov, I.; Hutter, F. Sgdr: Stochastic gradient descent with warm restarts. arXiv 2016, arXiv:1608.03983. [Google Scholar]
  47. Salmenjoki, H.; Alava, M.; Laurson, L. Machine learning plastic deformation of crystals. Nat. Commun. 2018, 9, 1–7. [Google Scholar] [CrossRef] [PubMed]
  48. Deshpande, S.; Lengiewicz, J.; Bordas, S.P. Probabilistic deep learning for real-time large deformation simulations. Comput. Methods Appl. Mech. Eng. 2022, 398, 115307. [Google Scholar] [CrossRef]
  49. Yuan, M.; Paradiso, S.; Meredig, B.; Niezgoda, S. Machine learning–based reduce order crystal plasticity modeling for ICME applications. Integr. Mater. Manuf. Innov. 2018, 7, 214–230. [Google Scholar] [CrossRef]
  50. Ali, U.; Muhammad, W.; Brahme, A.; Skiba, O.; Inal, K. Application of artificial neural networks in micromechanics for polycrystalline metals. Int. J. Plast. 2019, 120, 205–219. [Google Scholar] [CrossRef]
  51. Miyazawa, Y.; Briffod, F.; Shiraiwa, T.; Enoki, M. Prediction of cyclic stress–strain property of steels by crystal plasticity simulations and machine learning. Materials 2019, 12, 3668. [Google Scholar] [CrossRef]
  52. Pandey, A.; Pokharel, R. Machine learning based surrogate modeling approach for mapping crystal deformation in three dimensions. Scr. Mater. 2021, 193, 1–5. [Google Scholar] [CrossRef]
  53. Ibragimova, O.; Brahme, A.; Muhammad, W.; Levesque, J.; Inal, K. A new ANN based crystal plasticity model for FCC materials and its application to non-monotonic strain paths. Int. J. Plast. 2021, 144, 103059. [Google Scholar] [CrossRef]
  54. Li, Q.J.; Cinbiz, M.N.; Zhang, Y.; He, Q.; Beausoleil II, G.; Li, J. Robust deep learning framework for constitutive relations modeling. Acta Mater. 2023, 254, 118959. [Google Scholar] [CrossRef]
  55. Bonatti, C.; Mohr, D. One for all: Universal material model based on minimal state-space neural networks. Sci. Adv. 2021, 7, eabf3658. [Google Scholar] [CrossRef] [PubMed]
  56. Rutecka, A.; Kowalewski, Z.; Pietrzak, K.; Dietrich, L.; Makowska, K.; Woźniak, J.; Kostecki, M.; Bochniak, W.; Olszyna, A. Damage development of Al/SiC metal matrix composite under fatigue, creep and monotonic loading conditions. Procedia Eng. 2011, 10, 1420–1425. [Google Scholar] [CrossRef]
Figure 1. Scheme showing difference between the architecture of the typical (a) recurrent neural network (RNN) and (b) long short-term memory (LSTM) network. The figure was reprinted from [45] based on the Creative Commons Attribution (CC BY) license.
Figure 1. Scheme showing difference between the architecture of the typical (a) recurrent neural network (RNN) and (b) long short-term memory (LSTM) network. The figure was reprinted from [45] based on the Creative Commons Attribution (CC BY) license.
Materials 17 03397 g001
Figure 2. Scheme showing locations of difference evaluations according to Approach 1.
Figure 2. Scheme showing locations of difference evaluations according to Approach 1.
Materials 17 03397 g002
Figure 3. Scheme showing difference evaluations according to Approach 2: stress amplitude A, mean stress σ ¯ , and slope of each cycle tan ( α ) were used.
Figure 3. Scheme showing difference evaluations according to Approach 2: stress amplitude A, mean stress σ ¯ , and slope of each cycle tan ( α ) were used.
Materials 17 03397 g003
Figure 4. Scheme showing the architecture of the network used in the present contribution. The input x is a set of 7 hardening parameters; outputs y i are data describing the shape of stress–strain curves.
Figure 4. Scheme showing the architecture of the network used in the present contribution. The input x is a set of 7 hardening parameters; outputs y i are data describing the shape of stress–strain curves.
Materials 17 03397 g004
Figure 5. Mean squared error as a function of cycles in (a) approach 1 and (b) approach 2.
Figure 5. Mean squared error as a function of cycles in (a) approach 1 and (b) approach 2.
Materials 17 03397 g005
Figure 6. The results of using neural networks for optimization: stress–strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbors. Only the first and last cycle are shown for the sake of clarity. The selected examples refer to categories: (a) 1: very good or reasonable agreement of SS curves obtained using parameters optimized in both approaches, (b) 2: disagreement in the first cycle and reasonable agreement of SS curves obtained using parameters optimized in both approaches, (c) 3: reasonable agreement of SS curves obtained using parameters optimized in App 1 (lack of convergence for App 2 parameters), (d) 4: reasonable agreement of SS curves obtained using parameters optimized in App 2 (lack of convergence for App 1 parameters), (e) 5: striking disagreement or lack of convergence, (f) 6: lack of convergence for the optimized parameters in both approaches.
Figure 6. The results of using neural networks for optimization: stress–strain curves obtained using the SEVPSC code for the arbitrary parameter set and its closest neighbors. Only the first and last cycle are shown for the sake of clarity. The selected examples refer to categories: (a) 1: very good or reasonable agreement of SS curves obtained using parameters optimized in both approaches, (b) 2: disagreement in the first cycle and reasonable agreement of SS curves obtained using parameters optimized in both approaches, (c) 3: reasonable agreement of SS curves obtained using parameters optimized in App 1 (lack of convergence for App 2 parameters), (d) 4: reasonable agreement of SS curves obtained using parameters optimized in App 2 (lack of convergence for App 1 parameters), (e) 5: striking disagreement or lack of convergence, (f) 6: lack of convergence for the optimized parameters in both approaches.
Materials 17 03397 g006aMaterials 17 03397 g006b
Table 1. Ranges of the hardening parameters serving to generate the database.
Table 1. Ranges of the hardening parameters serving to generate the database.
τ c 0 τ sat h 1 h b h k h kb h m
Min10.010.00.00.00.00.00.0
Max80.0120.05.0120.01.0 · 10 5 1000.010.0
Table 2. Reference parameters vs. parameters that made LSTM generate closest matching curves.
Table 2. Reference parameters vs. parameters that made LSTM generate closest matching curves.
τ c 0 τ sat h 1 h b h k h kb h m
Category 1
Reference1.00 × 10 1 6.50 × 10 1 4.174.00 × 10 1 1.00 × 10 5 5.00 × 10 2 3.33
App 12.91 × 10 1 9.00 × 10 1 4.091.09 × 10 1 1.00 × 10 5 4.55 × 10 2 2.73
App 21.64 × 10 1 8.00 × 10 1 1.362.18 × 10 1 1.00 × 10 5 4.55 × 10 2 2.73
Category 2
Reference1.00 × 10 1 8.33 × 10 1 8.33 × 10 1 1.20 × 10 2 3.33 × 10 4 1.00 × 10 3 0.00
App 16.73 × 10 1 1.00 × 10 2 5.001.09 × 10 1 9.09 × 10 3 5.45 × 10 2 9.09 × 10 1
App 26.73 × 10 1 1.20 × 10 2 4.55 × 10 1 1.09 × 10 1 9.09 × 10 3 8.18 × 10 2 3.64
Category 3
Reference1.00 × 10 1 1.00 × 10 1 1.671.20 × 10 2 3.33 × 10 4 8.33 × 10 2 8.33
App 11.00 × 10 1 1.00 × 10 1 5.002.18 × 10 1 2.73 × 10 4 8.18 × 10 2 7.27
App 21.64 × 10 1 1.00 × 10 1 5.002.18 × 10 1 2.73 × 10 4 7.27 × 10 2 3.64
Category 4
Reference1.00 × 10 1 2.83 × 10 1 5.008.00 × 10 1 1.00 × 10 5 5.00 × 10 2 3.33
App 14.18 × 10 1 2.00 × 10 1 1.822.18 × 10 1 1.00 × 10 5 4.55 × 10 2 2.73
App 22.91 × 10 1 1.20 × 10 2 1.360.001.00 × 10 5 4.55 × 10 2 2.73
Category 5
Reference1.00 × 10 1 1.00 × 10 1 5.001.20 × 10 2 3.33 × 10 4 6.67 × 10 2 0.00
App 11.00 × 10 1 1.00 × 10 1 5.002.18 × 10 1 2.73 × 10 4 9.09 × 10 2 8.18
App 222.7330.005.0032.730.000.000.00
Category 6
Reference10.0083.331.6760.000.001000.000.00
App 135.4590.002.7343.640.001000.004.55
App 222.7390.000.9154.550.00727.270.00
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frydrych, K.; Tomczak, M.; Papanikolaou, S. Crystal Plasticity Parameter Optimization in Cyclically Deformed Electrodeposited Copper—A Machine Learning Approach. Materials 2024, 17, 3397. https://doi.org/10.3390/ma17143397

AMA Style

Frydrych K, Tomczak M, Papanikolaou S. Crystal Plasticity Parameter Optimization in Cyclically Deformed Electrodeposited Copper—A Machine Learning Approach. Materials. 2024; 17(14):3397. https://doi.org/10.3390/ma17143397

Chicago/Turabian Style

Frydrych, Karol, Maciej Tomczak, and Stefanos Papanikolaou. 2024. "Crystal Plasticity Parameter Optimization in Cyclically Deformed Electrodeposited Copper—A Machine Learning Approach" Materials 17, no. 14: 3397. https://doi.org/10.3390/ma17143397

APA Style

Frydrych, K., Tomczak, M., & Papanikolaou, S. (2024). Crystal Plasticity Parameter Optimization in Cyclically Deformed Electrodeposited Copper—A Machine Learning Approach. Materials, 17(14), 3397. https://doi.org/10.3390/ma17143397

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop