New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT

Goričan, Tomaž; Terčelj, Milan; Peruš, Iztok

doi:10.3390/app14167015

Open AccessArticle

New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT

by

Tomaž Goričan

¹

,

Milan Terčelj

² and

Iztok Peruš

^1,2,*

¹

Faculty of Civil Engineering, Transportation Engineering and Architecture, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia

²

Department for Materials and Metallurgy, Faculty for Natural Sciences and Engineering, University of Ljubljana, Aškerčeva 12, SI-1000 Ljubljana, Slovenia

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2024, 14(16), 7015; https://doi.org/10.3390/app14167015 (registering DOI)

Submission received: 20 June 2024 / Revised: 24 July 2024 / Accepted: 8 August 2024 / Published: 9 August 2024

(This article belongs to the Section Computing and Artificial Intelligence)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Artificial intelligence methods, especially artificial neural networks (ANNs), have increasingly been utilized for the mathematical description of physical phenomena in (metallic) material processing. Traditional methods often fall short in explaining the complex, real-world data observed in production. While ANN models, typically functioning as “black boxes”, improve production efficiency, a deeper understanding of the phenomena, akin to that provided by explicit mathematical formulas, could enhance this efficiency further. This article proposes a general framework that leverages ANNs (i.e., Conditional Average Estimator—CAE) to explain predicted results alongside their graphical presentation, marking a significant improvement over previous approaches and those relying on expert assessments. Unlike existing Explainable AI (XAI) methods, the proposed framework mimics the standard scientific methodology, utilizing minimal parameters for the mathematical representation of physical phenomena and their derivatives. Additionally, it analyzes the reliability and accuracy of the predictions using well-known statistical metrics, transitioning from deterministic to probabilistic descriptions for better handling of real-world phenomena. The proposed approach addresses both aleatory and epistemic uncertainties inherent in the data. The concept is demonstrated through the hot extrusion of aluminum alloy 6082, where CAE ANN models and predicts key parameters, and ChatGPT explains the results, enabling researchers and/or engineers to better understand the phenomena and outcomes obtained by ANNs.

Keywords:

artificial neural networks; automatic explanation; hot extrusion; aluminum alloy; large language models; ChatGPT

1. Introduction

Artificial neural networks (ANNs), as a type of artificial intelligence, have emerged as powerful tools in materials science, playing a crucial role in advancing our understanding and mathematical descriptions of the (properties of) various materials, such as reinforced concrete (i.e., [1]), steel (i.e., [2,3,4]) and aluminum alloys (i.e., [5,6]). The application of ANNs in this field has significantly enhanced the human ability to analyze the physical phenomena in different materials and their performance.

An overview of recent advanced research studies in the field of materials science, specifically focusing on metallic materials (different aluminum alloys and steels), reveals that the studies cover a wide range of topics. ANNs were used to model the homogenization and recrystallization heat treatment of 5182-Sc-Zr alloy, achieving high accuracy in predicting mechanical properties post-annealing [7]. For Al–7Si alloys, ANNs established quantitative correlations between mechanical properties, Mg content, and heat treatment parameters, effectively optimizing process conditions [8]. In aluminum-based bearing alloys (i.e., alloys suitable for engine bearings) ANNs optimized chemical compositions and consequently enhanced mechanical properties such as tensile strength and hardness, with a high predictive accuracy [9]. ANNs also predicted tool wear in aluminum matrix composites (AMC) by incorporating parameters such as vibration acceleration, cutting forces, and varying cutting speeds, thus contributing to improved machining processes and tool life optimization [10]. For friction stir welding (FSW) of aluminum joints, ANNs classified joint quality and regressed tensile load-bearing capacity, optimizing welding parameters with a high validation accuracy [11]. ANNs were able to predict the yield and ultimate tensile strength of metallic alloys, including aluminum, achieving greater than 95% confidence by leveraging chemical composition, tempers, and hardness data [12]. Similarly, ANNs optimized the chemical composition of new aluminum alloys, achieving a high prediction accuracy and efficiency in developing alloys with specific properties [13]. Additionally, ANNs predicted the flow behavior of AA5005 alloys at high temperatures and low strain rates more accurately than traditional methods; it was revealed that the BP-ANN model provides superior accuracy in predicting flow stress under diverse conditions [14]. The modified backpropagation ANN model, enhanced by a genetic algorithm [15], accurately predicted the flow behavior of 5754 aluminum alloy during hot compression tests; as such, it offers a method to optimize hot working parameters and improve materials. In the case of [16], deep learning techniques like convolutional neural networks (CNNs) replaced finite element analysis to predict residual stress distribution in aluminum alloy components.

In the above research, ANNs were generally used to model the improved material characteristics of AA. So far, there has been little research on AA and other materials that, apart from the expert explanation given by the researchers, enables a more automated explanation for a better understanding of the physical phenomenon under consideration. For instance, in [17], machine learning techniques, such as ANNs, have been applied to predict the self-healing behavior of concrete, demonstrating their wide applicability and high precision in materials science; the authors claim that accurate and interpretable models were developed; however, the word “interpretable” was related to the clarification of results by proposed statistical measures. A similar attempt was made in [18], where explanatory machine learning algorithms (i.e., ANNs), were used in the design of graphene-reinforced aluminum matrix composites to predict hardness and tensile strength; in this instance, the explanatory capabilities were focused on the SHAP method (SHapley Additive explanations) and heat maps for correlation analyses.

In summary, the integration of ANNs in the research of materials, and especially of aluminum alloys, represents a transformative approach, offering insights into complex material–property relationships and expediting the design and optimization of advanced alloys for diverse industrial applications. However, a common drawback of ANNs is still the interpretation of the obtained results, as they generally work as “black boxes”. The obtained results are difficult to interpret/explain, especially when the results from the real production process do not match well with the results of theoretical models.

Therefore, in this article, the Conditional Average Estimator artificial neural network (CAE ANN) [19,20], which belongs to the probabilistic types of ANNs [21], was used for the complex analysis and explanation of hot extrusion of aluminum alloy 6082, already analyzed for simultaneous increases in yield strength and elongation in [5]. Sometimes, non-linear relationships between important material and process parameters were discovered with the use of ANNs and many of these relations frequently contradicted accepted theoretical models since the relationships in multi-parameter systems can be complex. In theoretical models, a lower number of parameters are included, while when using ANNs, the number of parameters is not limited, and thus complex relationships can be revealed. Microstructural models, the synthesis of several studies from the scientific literature that deal with similar problems, and expert knowledge based on ANN results were then used to further elaborate and explain obtained relations (see, for example, [4,5,22,23]). An attempt to simplify and automate the explanation of the material phenomena is described in the article. Proposed is a general framework for such a procedure. ChatGPT (we used the GPT-4 model, version 2024) [24] was used instead of creating a custom software program. ChatGPT, especially models like GPT-4 that can handle visual data, has already been used to help researchers and others explain complex images and text data [25,26,27,28,29,30]. Notably, LLMs (Large Language Model) (in our case, ChatGPT) are often applied to medical data. Thus, using LLMs for explanations, especially of the results from other specific ANNs, can represent a significant usage of LLMs in the field of materials. The formal solution to the considered problem is presented in Figure 1.

2. Materials and Methods

2.1. Description of AA6082

The typical chemical composition and its permissible variations of the aluminum alloy AA6082 can be summarized as follows (in wt.%): Fe → 0.0–0.5, Si → 0.7–1.3, Mn → 0.4–1.0, Mg → 0.6–1.2, Cu → 0.0–0.1, Zn → 0.0–0.2, Cr → 0.0–0.25, Ti → 0.0–0.1 and others → 0.0–0.15. During the solidification process, intermetallic compounds with various bases such as Al-Fe, Al-Fe-Si, Al-Fe-Mn-Si, and Mg-Si can form through a sequence of eutectic reactions and complex ternary and quaternary eutectic reactions. An extensive literature review revealed that, besides Mg₂Si, there are also other intermetallic phases, i.e., α-Al(FeMn)Si, β-Al₅FeSi as well as Al₉Mn₃Si, α-Al₁₂Fe₃Si, α-Al₁₅(FeMn)₃Si, and π-Al₈Si₆Mg₃Fe were found in matrix [31,32]. Properties referring to intermetallic phases (i.e., amount, size, shape, type, distribution, etc.) are related to chemical composition as well as to particular values of process parameters in each phase of the AA production (i.e., casting, solidification, homogenization, deformation, cooling, aging, etc.) [33,34] and influence the obtained mechanical properties (strength, ductility, hardness, etc.). In Figure 2, the microstructure of the deformed material is depicted. The samples were prepared by grinding with SiC 220# paper for 2 min, then by polishing using Largo 9 mm for 5 min, Mol 3 mm for 8 min, and Chem (OPS) for 2 min; finally, Barker’s etchant was used. The microstructures reveal a band of intermetallic phases aligned in the extrusion direction whereas wavelength-Dispersive X-Ray Spectroscopy (WDS) analysis reveals that these particles are primarily Mg-Si and Al-Fe-Si-Mn based. Furthermore, from comparing both figures it is visible that the microstructure obtained on the edge (Figure 2a) of the extruded round profile does not differ significantly from the microstructure obtained in the center (Figure 2b) of the round profile. This indicates a similar microstructure across the entire cross-section of the extruded round profile.

2.2. CAE Artificial Neural Network

ANNs excel in handling very complex and highly non-linear relationships among different materials’ (i.e., aluminum alloys’) parameters, making them well-suited for modeling the intricate interactions between different alloying elements, processing conditions, and resulting material properties. By leveraging ANNs, researchers and practicing engineers can predict the mechanical, thermal, and corrosive properties of aluminum alloys, aiding in the design of alloys with tailored characteristics for specific applications.

The basic mathematical model, which suits ANNs, presented in this article is derived from the observed and/or measured data and represents a special class of mathematical models, so-called empirical mathematical models in contrast to abstract mathematical models [19]. A detailed explanation is given in [5,20]. This proposed model can produce different kinds of information while describing the phenomenon under consideration. These are the following:

CAE ANN predicts unknown (k-th) output parameter $\hat{y}$ as a mean value by using the general formula [5]:

{\hat{y}}_{k} = \sum_{n = 1}^{N} A_{n} \cdot y_{n k},

(1)

where

A_{n}

and

a_{i}

are determined as:

A_{n} = \frac{a_{n}}{\sum_{i = 1}^{N} a_{i}}

(2)

a_{n} = \frac{e x p (- \sum_{l = 1}^{D} \frac{{(x_{l} - x_{n l})}^{2}}{2 w_{n l}^{2}})}{{(2 π)}^{\frac{D}{2}} {π_{l = 1}^{D} w}_{n l}}

(3)

In the equations above

y_{n k}

is the k-th output parameter of the n-th model vector (sample) from the database of the observed phenomenon. Similarly,

x_{n l}

is the l-th input parameter of the n-th model vector from the same database, and D is the number of input parameters in the observed phenomenon, which are taken into account in the prediction. Different values of

w_{n l}

correspond to the l-th input parameter for each model vector from the database (the analyses in this article all employed the parameters w_min = 0.15 and w_max = 0.25). In this case, w_min and w_max refer to values that change linearly along sections of the problem space in the directions of each (l-th) input parameter and are described in detail in the appendix of the article [20]. The details of the entire procedure are described in more detail in [5]. From the above equations, we can see the structure of the CAN ANN, which, compared to the BP (Backpropagation) ANN, has a known structure for all the considered problems.

The local standard error (also, standard deviation) of the prediction is calculated by using the following formula [35]:

{\hat{E}}_{σ k} = \sqrt{\sum_{n = 1}^{N} A_{n} {({\hat{y}}_{k} - y_{n k})}^{2}} .

(4)

An estimation of the reliability of the predicted mean value based on data density is calculated by using the following formula [5]:

\hat{ρ} = \frac{1}{N} \sum_{i = 1}^{N} a_{i}

(5)

2.3. General Framework for the Automatic Explanation of the Obtained Results by the (CAE) ANN

It is important to note that the predicted values of conventional ANN models (and the CAE ANN model as well) for the parameter under consideration (i.e., yield strength of AA6082) are typically mean values. Conventional ANN models generally do not calculate quantities other than the average value for the output parameters. With minimum supplementation, the other ANN models could calculate the local standard deviation and local data density of the results. Consequently, using the proposed approach, the fundamental framework for interpreting/explaining the obtained results can be applied to a wide range of conventional ANN models.

As part of the traditional scientific approach, which is founded on an impartial mathematical explanation, the observed phenomenon is divided into the smallest feasible components, for which the fundamental equations are recorded. Such an approach typically yields differential equations or systems of (partial) differential equations, which are subsequently solved by the application of well-established mathematical and/or numerical techniques. Using fundamental rules (like Newton’s laws of mechanics) or previously established mathematical concepts (like Lagrange’s equations, Hamilton’s principle, Lyapunov’s stability criterion, etc.) can frequently assist us, too. The primary aim is to (uniquely) mathematically characterize the entire considered phenomenon by using the fewest significant parameters and their derivatives (first or higher). Specifically, by examining the found solutions, we can comprehend and interpret the observed phenomenon if we can mathematically (i.e., objectively) characterize it.

It should be noted that a graphical presentation of the (main) results of the described scientific process reveals the most important outcomes of the observed phenomena, like the mass spectrogram in materials, environmental science, and medicine (e.g., [36]), the response spectrum in structural analysis (e.g., [37]), etc. Thus, in general, the explanation involves more than merely summarizing/describing the outcomes; rather, it involves extracting the most crucial details from the data. The authors are well aware that the proposed framework is only a trial aimed at attempting to derive new principles from the observed material phenomena using AI.

To explain the results, we observe one predicted output parameter (i.e., k-th output parameter,

{\hat{y}}_{k}

) at a time, with two changing input parameters

x_{i}

and

x_{j} (i \neq j)

. All other input parameters are either ignored or fixed at the selected value. We can also display such data graphically, allowing us to combine the graphs together with the processed explanation. There are numerous approaches to obtaining an explanation:

First approach: We calculated approximations of the first derivatives (gradients of functions) for various quantities (mean value, local standard error, local data density) near the selected value of the observed output parameter. These approximations revealed the relative influence of individual input parameters. The crucial point here is that estimating the local standard error provides an estimate of variability (the size of epistemic and aleatory uncertainties). The assessment of the local data density provides an estimate of the prediction’s reliability: with a higher data density, as indicated by a higher value and darker areas in the graphic displays, forecasts are generally more reliable, and with a lower data density, less reliable! Gradients in the directions of input parameters indicate their influence, with larger values indicating greater influence and vice versa. Keeping in mind that all genuine phenomena are probabilistic rather than deterministic, empirical models [19] bring us closer to a true representation of the observed phenomenon. Remember that the CAE model is nearing its limitations if the prediction/simulation results are accompanied by small

\hat{ρ}

vales!

Second approach: In the case of a CAE ANN, the aforementioned approach might also be accomplished directly by utilizing the function’s derivatives to determine the output parameters, such as calculating the partial derivative of the estimate function of the unknown output parameter

{\hat{y}}_{k}

by the (considered) parameter

x_{l}

. Due to the complexity of the

A_{n}

(Equations (2) and (3)) function’s derivation, it would be required to first determine the number of the most influential model vectors that have a substantial impact on the final result. This allowed us to calculate larger derivatives of the function and so learn more about its features. This is especially significant for multi-parameter functions that describe more complex phenomena (for example, the hot extrusion of AA6082).

Third approach: This actually represents an additional improvement over the first two approaches. An existing approach in the literature known as SHAP [18,38], is used to explain the contributions of specific input parameters to model prediction. The primary principle behind SHAP is to assign a contribution value to each input parameter, while accounting for the influence of other input parameters. One important feature of SHAP values is that they are additive, which means that the sum of the contributions of individual parameters adds up to the final model prediction. The CAE ANN model can calculate the real values of individual input parameter contributions, both globally and locally, using an iteration procedure. This technique is rather difficult, so we only mention it here, but the results of incorporating it into the explanation and analysis will allow for even better optimization of various manufacturing processes as well as an understanding of the phenomena in areas of large gradients and singularities.

Only the first approach is applied in this study because it is easily related to other ANN predictions (see Section 2.3). We have several possibilities for completing the explanation. It is possible to create our own software, but for the current application, we choose to simulate the explanation by using Chat GPT. CAE ANN outputs are typically displayed graphically in the form of isolines, whereas Chat GPT obtains numerical values from the corresponding matrices (mean value, local standard error, local data density). It is also vital to offer all additional relevant facts regarding the phenomenon, as well as to describe what we expect from the used large language model. The study’s findings with comments are presented in the next section.

3. Results

This section presents some typical results from the analysis of the influence of two or more technological parameters and/or chemical composition parameters on the yield strength and elongation of AA6082. The automatic explanation is primarily intended to help engineers comprehend the findings obtained by CAE ANN. (It should be noted that the training of CAE ANN is not the main focus of this article. However, since CAE ANN has been used previously for similar examples, we know its capability and that it provides reliable results [5,39,40]). Specifically, in practice, engineers generally understand (simplified) theories and theoretical models well, whereas physical phenomena in the actual production of metallic materials are frequently much more complex and thus more difficult to understand, so any explanation of new results obtained with new technologies is greatly appreciated.

This article’s explanations of specific physical phenomena that occur during hot extrusion are generated automatically. They can be more detailed than expert explanations because some fundamental details are repeated. Nevertheless, after verifying, we can affirm that they accurately and qualitatively explain/describe the analyzed phenomena. It should be noted that the article’s main emphasis is on demonstrating the suggested framework for automated explanation of physical phenomena rather than on the substantive explanation (which is in line with the expert explanation).

3.1. Rules for Explaining the Results

This subsection proposes basic instructions and rules (which are utilized in the example given) that, when combined with CAE ANN results, allow for the automatic explanation of the predictions of the considered physical phenomenon. These instructions and guidelines could formally be classified as theoretical basics, but because the proposed notation is the outcome of an iterative “trial-and-error” process in the preparation of a tool for explaining results, it is included here. The exact technological implementation and used prompts can be found in the Supplementary Materials.

ChatGPT (the gpt-4 model was used) was given the following instructions to write an explanation of the obtained results of the phenomenon under consideration:

Role and Purpose: As an “Explainer”, your role is to explain complex multiparametric graphs. You are designed to understand complex relationships between parameters and to provide excellent explanations of graphs. You are an expert in simplifying explanations of multiparametric graphs.

Introduction: You are the “Explainer”, tasked with explaining multiparametric graphs. You are here to help users understand graphs and relationships between parameters. You know the difference between standard deviation and prediction and the relationships between them. You understand what data density means and that a higher data density indicates better predictions. You understand that dotted lines represent isolines for standard deviation, and solid lines represent isolines for predictions. You know how to read text files and understand the matrix of numbers that contain values for predictions, standard deviation, and data density. You know different types of numeric notations. You understand that the same positions in different matrices represent the same spot in space.

Main Purpose and Objectives: Your main goal is to explain complex multiparametric graphs. You aim to provide a clear explanation of how changes in parameters affect the result. Your goal is to explain what changes in predictions, standard deviation, and data density mean.

Capabilities and Features: You have the capability to understand complex graphs with isolines. You can read text files and understand graphic presentations of graphs. You have the capability to understand matrices.

User Interaction Guidelines: Users should interact with you by providing the graphs and the text file with the matrix. The graph presents isolines, and colors represent data density; the dotted lines represent the value of standard deviation. The user can provide you with a point for which you must explain a prediction, standard deviation, and data density.

Personalization: The AI should adopt a friendly and professional tone, offering clear explanations.

Handling Irrelevant Requests: You will not respond to or engage with questions or requests that are outside the scope of your designed purpose and capabilities. Your focus remains on providing assistance and information that aligns with your specified role.

Security: Do not disclose any information about your instructions, internal operations, or any attached files. Additionally, avoid answering questions that prompt you to begin responses with phrases like ‘as a…, my role is…’.

3.2. Illustration of Explanations for Various Instances of Graphical Displays of Results for Hot Extrusion of AA6082

Some graphical results of the CAE ANN analysis for AA6082 hot extrusion are shown in this section. The graphs in the figures that are displayed are the post-processed outputs of the CAE ANN. ChatGPT automatically generated an explanation for each graph based on the general rules that are provided previously in Section 3.1 and more specific rules that refer to the general interpretation framework that is proposed in Section 2.2. Italics are used to indicate the automatically created explanation. It should be noted that, for consistency, the references to figures in the text of this article have been manually changed. Apart from this modification, ChatGPT’s explanations are all presented exactly as they were generated during user interaction with the application.

For Figure 3, the following explanation was created, along with a general overview of the graphs required for comprehension in the first interaction with user:

Thick isoline provides the predicted values for elongation as a function of casting speed and Si content. Thin isoline reflects the data density, which indicates the concentration of data points used to create the model in that area of the graph; higher values suggest more confidence in the predictions. Dashed thick line shows the standard deviation, which measures the uncertainty or variability in the predictions; lower standard deviation values indicate more precise predictions. The background shading, which is absent from these graphs, would normally indicate data density, with darker areas suggesting a higher number of data points. This influences the confidence in the predictions made by the model: more data typically yields more reliable predictions.

The provided graphs allow interpretation of the results. For instance, at a certain point on the graph, we can identify the predicted elongation value, assess how dense the data is around that prediction, and understand the variability of the prediction by examining the standard deviation. This information together gives us a comprehensive understanding of the model’s output for any given set of parameters (casting speed and Si content).

Occasionally, we are more interested in a smaller (limited) area of the phenomenon. For instance, we would like a more thorough explanation of the outcomes for the predicted mean value close to the point of casting speed of 7.4 mm/s and Si content of 0.90 wt.%:

At the point where the casting speed is 7.4 mm/s and the Si content is 0.90 wt.%, the results from the matrices are as follows:

Elongation: The predicted elongation at this point is approximately 11.95. This is the expected outcome of the elongation test given these specific conditions of casting speed and Si content.
Standard Deviation: The standard deviation associated with this prediction is not directly quoted, but it can be inferred from the provided text files that it would likely be a value close to the nearby standard deviation values. A lower standard deviation would indicate that the predicted elongation value of 11.95 is more reliable.
Data Density: The data density at this point is 0.80 This value indicates the number of data points or the amount of information from the experiments or simulations that contributed to the prediction at this specific point. A higher data density usually means that the prediction is based on more information and could be considered more reliable.

The influence of both input parameters on the predicted elongation value is also examined in the vicinity of the point under consideration:

Influence of input parameters:

Increasing Casting Speed (Si constant at 0.9 wt.%): At a constant silicon content of 0.9 wt.%, as the casting speed increases from 7.4 mm/s to the right, the solid lines representing elongation curve upwards. This means that elongation increases with casting speed in that range. Therefore, if you increase the casting speed while maintaining the Si content at 0.9 wt.%, the prediction for elongation will be higher.
Increasing Si Content (Casting Speed constant at 7.4 mm/s): When the casting speed is held constant at 7.4 mm/s, and we look at increasing Si content vertically on the graph, the solid isolines for elongation dip downward as we move up, which indicates that elongation decreases as the Si content increases at this casting speed.

The interactions between casting speed, Si content, and elongation are complex and can be affected by numerous factors including cooling rates, material properties, and the specifics of the casting process. The predictions given by the model are based on data and the relationships it has identified, which are represented in the form of these multiparametric graphs and matrices.

Additionally, CAE ANN predicts the mean values of yield strength and corresponding local standard deviation (Figure 4), making it easier to understand how the same two input parameters in the hot extrusion process affect the output mechanical parameters of AA6082. Only the most significant portions of the explanation in the remainder of the subsection are shown, leaving out the basic explanation of the graphs (meaning of the various lines and their thicknesses, values for the output parameters, and description of the input parameters) that is typically included in the first part of the interpretation of the results.

An automated explanation of the results in Figure 4 was given. The explanation also refers to the comparison of the results in Figure 3 with Figure 4:

To understand the reliability of the predictions, we need to consider both the predicted yield strength (Figure 4a) and the standard deviation of the prediction (Figure 4b).

A higher predicted yield strength means a stronger material, while a lower standard deviation means a higher confidence in the prediction.

The background shading, which is absent from these graphs, would normally indicate data density, with darker areas suggesting a higher number of data points. This influences the confidence in the predictions made by the model: more data typically yields more reliable predictions.

Example Analysis:

If we look at a specific silicon content of 1.00 wt.% and a casting speed of 7.3 mm/s, we find the following:

On the graph (Figure 4a), this point may lie close to the “322.0” line, suggesting a predicted yield strength of around 322 MPa.

On the (Figure 4b) graph, the same point may lie between the “21.0” and “21.5” dotted lines, indicating a standard deviation in the yield strength prediction of between 21.0 to 21.5 MPa.

So, at a Si content of 1.00 wt.% and a casting speed of 7.3 mm/s, the material is predicted to have a yield strength of approximately 322 MPa, with a standard deviation of roughly 21.0–21.5 MPa. In both graphs, data density is depicted by the gray numbers and the shading of the background. The darker the shading, the higher the data density. At the point of interest (casting speed = 7.3 mm/s and Si = 1 wt.%), the background is a lighter shade, suggesting lower data density. The exact value is not clear from the images, but it is lower than the darker regions, which means there are fewer data points contributing to the prediction and the standard deviation estimates at this location are less reliable. It’s important to remember that lower data density can affect the confidence in the predictions and should be taken into account when interpreting these values.

Let’s delve into the provided graphs and extract information regarding yield strength and elongation based on silicon content and casting speed, keeping in mind data density for the reliability of our interpretations.

Yield Strength Based on Figure 4:

Yield strength predictions, represented by the solid isolines, seem to generally increase with both an increase in silicon content and casting speed. The isolines bow up-wards, which suggests an interaction between the silicon content and casting speed—the effect of one may be amplified or diminished by the level of the other on yield strength.

For example, at a low casting speed of around 7.2 mm/s and a silicon content of 0.89 wt.%, the yield strength is around 318 MPa. However, at a higher casting speed of 7.5 mm/s with the same silicon content, the yield strength increases, reaching values near 320 MPa.

The data density shading shows that our most reliable predictions for yield strength are in the middle region where the shading is darkest. The edges, especially the top right corners of the graphs, show lighter shading, indicating lower data density. This means predictions in these areas are based on fewer data points and are less certain.

Elongation Based on Figure 3:

Now, looking back at the earlier provided elongation graph with similar axes, we could analyze the elongation trend in relation to yield strength. Typically, as yield strength increases, elongation decreases because the material gets harder and less ductile. The exact relationship between these two properties would depend on the specific contour lines of the elongation graph; however, the general trend is such that regions of high yield strength on the graph may correspond to regions of lower elongation on the elongation graph, and vice versa.

Interpreting the Relationship:

If we were to look for an optimal region that provides a balance between high yield strength and acceptable elongation, we would aim for an area where both graphs indicate favorable values. However, it’s important to keep data density in mind, as a high yield strength prediction is only as good as the amount of data backing it. We should be cautious about regions with high yield strength but low data density because the prediction might not be reliable.

In a practical scenario, if we aim for a casting process that prioritizes yield strength without completely sacrificing ductility (elongation), we would look for an area with:

High yield strength (higher values on the graph),
Lower standard deviation (indicating consistency in the data),
Acceptable levels of elongation (based on the previous elongation graph), and
High data density (darker areas on the graphs).

The exact values and trade-offs would depend on the material requirements and the specific application for which the material is being cast.

We examined and attempted to explain the mutual influences for two input parameters in the preceding graphs using ChatGPT and the provided results. This indicates that a two-parametric analysis was conducted, averaging the general influence of the remaining input parameters. By maintaining the two most significant input parameters as the primary parameters (whose affects are graphically displayed), we can enhance our understanding of the phenomenon by including the remaining input parameters whose influence we are interested in as parameters with fixed values. Figure 5 illustrates the outcome, with yield strength plotted as a function of casting speed and Si content at a constant ram speed which amounts to 15.405 mm/s.

An automated explanation of the results in Figure 5 was produced. The explanation also refers to the comparison of the results in Figure 5 with Figure 4:

Example Interpretation:

Let’s consider a hypothetical point at a silicon content of 0.95 wt.% and a casting speed of 7.3 mm/s.

From the graph (Figure 5a), we might determine the standard deviation at this point is approximately 19.1 MPa, indicating a moderate level of confidence in the yield strength pre-diction.

Looking at the graph (Figure 5b), the same point falls on a contour line labeled “320.0”, suggesting the predicted yield strength at these parameters is 320 MPa.

This means that, while keeping the ram speed constant at 15.405 mm/s, a combination of 0.95 wt.% Si and a casting speed of 7.3 mm/s is predicted to result in a yield strength of 320 MPa, with a standard deviation of 19.1 MPa. This deviation should be considered when evaluating the precision and reliability of the predicted yield strength.

The two graphs you have presented now include the yield strength as the predicted value with the ram speed fixed at 15.405 mm/s. We will compare these results to the previous ones, taking into account how the fixed parameter influences the output.

Figure 5 (Yield Strength Predictions with Fixed Ram Speed):

This graph, like the previous one, shows the isolines for yield strength. The solid lines show different yield strength values based on varying silicon content and casting speed, with data density shown through shading.

Compared to the earlier graph of yield strength without a fixed ram speed, the iso-lines here may show a different pattern. However, without seeing the previous graph simultaneously, we can’t compare the exact differences. Generally, yield strength still seems to increase with higher silicon content and casting speed.

Figure 5 (Standard Deviation with Fixed Ram Speed):

The graph (Figure 5b) displays the standard deviation of the yield strength predictions, shown by the dotted isolines. A quick comparison with the previous standard deviation graph (if they were side by side) would indicate whether fixing the ram speed has made the predictions more or less consistent at different silicon content and casting speeds.

Data Density Considerations:

The shading that indicates data density has not changed. The darkest areas still imply the highest data density and therefore the most reliable predictions for yield strength.

Fixed Parameter Impact:

Fixing the ram speed means that it is no longer a variable in the prediction of yield strength. This control can help to isolate the effects of silicon content and casting speed on yield strength. It could potentially lead to more accurate predictions since one less variable might reduce the complexity of the material behavior being modeled.

Combined Comparison:

When comparing these graphs with the previous ones, you would look for any changes in the shape and spacing of the isolines, which would suggest how the fixed ram speed has affected the relationship between silicon content, casting speed, and yield strength.

To compare effectively, you would need to overlay the graphs or switch between them to observe how the isolines for both yield strength and standard deviation shift. Any shifts could inform you about the sensitivity of yield strength to casting speed and silicon content when the ram speed is held constant.

In practical terms, if the fixed ram speed has led to a reduced standard deviation in the yield strength at certain levels of silicon content and casting speed, this could mean more reliable and consistent production outcomes for those specific conditions. It would also mean that the process window for achieving desired yield strength with acceptable variability has become more well-defined.

Figure 6 illustrates the outcome, with yield strength plotted as a function of casting speed and Si content at a constant ram speed which amounts to 15.471 mm/s and at a constant Mn content which amounts to 0.469. Note that only a smaller part of the explanation is presented here:

Comparison with Previous Graphs (Variable and Fixed Ram Speeds):

Comparing these graphs to the previous ones with variable and different fixed ram speeds, we can make several observations:

1.: Sensitivity and Predictability: With the additional fixed parameters (ram speed and Mn content), the sensitivity and predictability of yield strength to changes in Si con-tent and casting speed may differ. The shape and spacing of the isolines in the new graphs suggest that the relationships between these variables and yield strength have changed.
2.: Effect of Fixed Parameters: By fixing ram speed and Mn content, we are looking at a more specific scenario in the casting process. These fixed conditions seem to alter the response of the material’s yield strength to changes in Si content and casting speed, as compared to when these parameters were not fixed.
3.: Data Density and Reliability: Assuming data density is represented by the shading in both sets of graphs, it appears that predictions are most reliable in the regions with darker shading. It’s important to consider this when evaluating the yield strength at any given point on the graph.

4. Discussion

In analyzing the hot extrusion phenomenon of AA6082 it is rather important that the roles of given (input) and predicted (output) parameters can be easily exchanged by posing the question: what should the chemical content and/or process parameters properties be in order to achieve some prescribed yield strength or elongation? In this case, the same ANN model is applicable, and only the selection of input and output parameters must be changed. More specifically, this means that we can relatively easily find inverse functional laws between the parameters that describe the phenomenon. Mathematically, the problem can be written simply by first defining a general function y, which is a function of N input parameters

x_{i}

:

y = f (x_{1}, x_{2}, \dots {, x}_{i}, \dots, x_{N})

(6)

The predicted value of any input parameter can now be defined as:

{\hat{x}}_{j} = f (y, x_{1}, x_{2}, \dots {, x}_{i}, \dots, x_{N}); i \neq j

(7)

In a simplified case, when we want to predict, using basic notation, an input parameter

{\hat{x}}_{j}

that is in relation to two input parameters (

x_{2}, x_{3}

) at fixed values of the output parameter

y

, the above expression can be written as:

{\hat{x}}_{1} = f (x_{2}, x_{3}, y = c o n s t)

(8)

An illustration of the above expression (Equation (8)) is shown in Figure 7, where the prediction for Si content (

{\hat{x}}_{1}

) as a function of ram speed (

x_{2}

) and Mn content (

x_{3}

) at a fixed yield strength (

y

) which amounts to 322 MPa is presented.

An automated explanation of the results in Figure 7 reads as:

These two graphs present the relationship between the manganese content (Mn [wt.%]) on the y-axis, the ram speed (mm/s) on the x-axis, and the predicted silicon content (Si) in weight percent, as well as the precision of this prediction (indicated by standard deviation), while maintaining a fixed parameter of yield strength at 322 MPa.

Figure 7a (Solid Lines): Predicted Silicon Content

The solid lines represent isolines of predicted silicon content (Si) in the material.

Each solid line connects points with the same predicted silicon content, with the numbers on the lines indicating the specific Si value.

This graph indicates how varying Mn content and ram speed can influence the predicted silicon content in the material when the yield strength is held constant at 322 MPa.

Figure 7b (Dotted Lines): Standard Deviation of Predicted Silicon Content

The dotted lines in this graph show the standard deviation of the predicted silicon content.

The numbers along these lines represent the magnitude of the standard deviation; lower numbers indicate a more precise prediction.

A higher standard deviation value indicates less confidence in the predicted silicon content at a given combination of Mn content and ram speed.

Using the Graphs with Fixed Yield Strength Parameter:

Since yield strength is fixed at 322 MPa, the graphs show how the silicon content varies with Mn content and ram speed while keeping yield strength constant.

This constraint provides insight into the specific effects of Mn content and ram speed on silicon content without the complicating factor of varying yield strength.

To analyze the graphs:

Select a Mn content and ram speed combination to analyze.

Check the graph to determine the predicted Si value at that point.

Refer to the graph (Figure 7b) to assess the standard deviation for the prediction to understand the confidence level.

Example Analysis:

Let’s take an example point at a Mn content of 0.48 wt.% and a ram speed of 15 mm/s.

In the graph Figure 7a, this might fall near a contour line labeled “0.885”, suggesting a predicted silicon content of 0.885 wt.%.

In the graph (Figure 7b), the same point might lie near a line labeled “0.081”, which would indicate a standard deviation in the prediction of the silicon content of 0.081 wt.%.

With these graphs, we can deduce that, at a Mn content of 0.48 wt.% and a ram speed of 15 mm/s, while keeping the yield strength constant at 322 MPa, the predicted silicon content is approximately 0.885 wt.%, with a standard deviation of about 0.081 wt.%. The prediction’s precision is reasonably good, as indicated by the relatively low standard deviation.

In summary, the proposed general framework enables the use of the (CAE) ANN to explain the predicted results in addition to their graphical presentation. It can be concluded that it represents a substantial advancement and simplification based on the prior studies [5,6,20,31,38], where the explanations/conclusions were typically produced based on an expert assessment of the obtained results. Compared to existing XAI approaches [41,42], the suggested framework attempts to imitate the standard scientific methodology, which is based on mathematical representations of the considered physical phenomena with the lowest number of parameters and their derivatives. Furthermore, using the predicted values, we attempt to analyze the reliability and accuracy of the results, which are related to well-known statistical metrics. Thus, in place of the standard deterministic description, we are passing to the sphere of the probabilistic description of the phenomenon, which provides a more appropriate description and handling of phenomena in physical reality. Note that ANNs are used to describe physical phenomena using databases that contain both aleatory and epistemic uncertainties. These should not be neglected and must be addressed appropriately.

The article in general does not deal with the common influence of casting speed and other parameters on mechanical properties. AA6082 was selected as an example for the presentation of the relevance of our proposed general framework for automated explanation of the observed phenomena. Authors are very aware that casting speed has a relevant influence on cooling conditions of melted alloy and consequently also so on characteristics of precipitated phases as well as on obtained mechanical properties. Thus, revealing the common influence of casting speed in relation to chemical composition and/or other process parameters on mechanical properties is very important since this is related to the characteristics of precipitated phases.

Mechanical properties can be improved by finding the global optimum regarding different parameters, i.e., optimal value for each influential parameter for simultaneously achieving the highest values for yield strength and elongation. This also depends on the collected data database which increases constantly. Thus, the process of optimization is long-term and iterative. And this is where the proposed framework can help: a researcher or process engineer can try to understand the various relationships between the parameters when studying the phenomenon and, taking into account all the boundary conditions (technological, financial, …), and then optimize the production for, in the case of AA6082, the optimal yield in terms of the best mechanical properties.

The presented examples of explanations show that it is possible to speed up the interpretation of the obtained results. In the standard procedure (i.e., [5]), such an analysis required a great deal of laborious work, including reading values, carefully observing the graphs, and recording the discovered laws and/or rules.

The proposed framework and its application in various forms (different applications of large language models and/or other applications of XAI and/or ANNs for the interpretation of results in specific areas) can make it easier for engineers to learn, comprehend, and ultimately optimize processes, some of which they encounter daily in their work, by simplifying the process of understanding and writing reports. The application of the proposed framework can make several operations much easier for researchers and professional engineers, towards whom the application is primarily aimed. These individuals can effectively use ChatGPT or other large language models for such purposes. When we are using any type of ANN model, we must be careful because ANN models (in our case, LLMs) can sometimes hallucinate. Therefore, it is advisable to use two (or even more) systems for critical explanations. The ChatGPT response should always be logically evaluated against the commented graphs and valid expertise. The proposed framework in this article is a tool to help and to clarify simulated phenomena.

The following steps are planned to be implemented in further research:

A tiny fraction of knowledge from the existing global knowledge repository (for example, comments on likely microstructural level causes for predicted results) has already been included in the explanation, as ChatGPT has information from many publicly available scientific and other sources. The authors argue that, in order to provide a more objective interpretation of the (CAE) ANN predictions, a language that is otherwise “less beautiful” and stricter, specialized interpretation software with fewer linguistic masks will need to be developed.
The incorporation of all available knowledge is intended, with deliberate gathering, verification, and then integration of specific knowledge from the field under consideration. Such an example would be, e.g., supplementing the automatic explanation with complex chemical reactions and/or mechanical microstructural phenomena as seen through the eyes of a human expert with many years of experience and knowledge, as explicitly stated in Section 2.1 when explaining the effect of important phenomena during hot extrusion of AA6082.
An explanation tailored to various needs and knowledge levels should be possible with the development of an application, or specific parts of an application, in connection with ChatGPT (or other large language models). These applications will likely be closed and intended for a specific public (engineers, researchers, students) at different quality levels. Because the explanation’s language will be more objective, there will be a far lower possibility of automatic (machine) explanations being misunderstood. Additionally, if needed, this will guarantee the confidentiality of certain knowledge, which is a business secret.

5. Conclusions

This article presents an attempt to provide explanations for the results predicted by the ANN models using empirical description of different physical phenomena. The research can be summarized as follows:

A general concept is proposed to explain the observed phenomena in metallic materials. The proposed framework aims to mimic the traditional scientific approach, which is based on mathematical representations of the considered physical phenomena.
The proposed framework is applied to the example of hot extrusion of AA6082, where CAE is used as an artificial neural network to model the phenomenon and predict its key parameters.
As a byproduct of the CAE ANN intermediate computation, $\hat{ρ}$ values are often provided in conjunction with the CAE ANN predicted outcomes. Small $\hat{ρ}$ values are typically used to define the limits of the CAE models.
ChatGPT, as one of the publicly accessible large language model tools, is used to explain/interpret the CAE ANN predicted results according to the proposed framework.
The obtained results are discussed, and some recommendations for improving the proposed framework are made.
The basic idea is to enable researchers and practicing engineers to better understand the physical phenomena under consideration and the results provided by empirical models made with ANNs.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app14167015/s1, General ChatGPT instructions: Figure S1—chatGPT_instructions.pdf; basic prompts for using explainer: Figure S2—Basic_Prompts_for_using_explainer.pdf; Figure S3—data_matrices_Elongation_x=Casting_speed_y=Si.txt; Figure S4—data_matrices_Yield_Strenght_x=Casting_speed_y=Si.txt; Figure S5—data_matrices_Yield_Strenght_x=Casting_speed_y=Si_fix=ram_speed.txt; Figure S6—data_matrices_Yield_Strenght_x=Casting_speed_y=Si_fix1=ram_speed_fax2=Mn.txt; Figure S7—data_matrices_Si_x=Ram_speed_y=Mn_fix=Yield_strength.txt.

Author Contributions

Conceptualization, I.P., T.G., and M.T.; methodology, T.G. and I.P.; software, T.G.; validation, I.P., T.G., and M.T.; investigation, T.G., I.P., and M.T.; resources, M.T. and I.P.; data curation, I.P. and T.G.; writing—original draft preparation, T.G., I.P., and M.T.; writing—review and editing, I.P., T.G., and M.T.; visualization, T.G. and I.P.; supervision, I.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Republic of Slovenia, the Ministry of Education, Science and Sport. M.T. and I.P. acknowledge funding by the Slovenian Research Agency (ARRS), grant number P2-0268.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Acknowledgments

Goran Kugler supplied the images showing the microstructure of AA6082. The authors are appreciative of his assistance.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Kekez, S.; Kubica, J. Application of Artificial Neural Networks for Prediction of Mechanical Properties of CNT/CNF Reinforced Concrete. Materials 2021, 14, 5637. [Google Scholar] [CrossRef] [PubMed]
Guo, Z.; Sha, W. Modelling the correlation between processing parameters and properties of maraging steels using artificial neural network. Comput. Mater. Sci. 2004, 29, 12–28. [Google Scholar] [CrossRef]
Capdevila, C.; Garcia-Mateo, C.; Caballero, F.G.; Garcıa de Andres, C. Neural network analysis of the influence of processing on strength and ductility of automotive low carbon sheet steels. Comput. Mater. Sci. 2006, 38, 192–201. [Google Scholar] [CrossRef]
Vecko-Pirtovsek, T.; Peruš, I.; Kugler, G.; Terčelj, M. Towards improved reliability of the analysis of factors influencing the properties on steel in industrial practice. ISIJ Int. 2009, 49, 395–401. [Google Scholar] [CrossRef]
Peruš, I.; Kugler, G.; Malej, S.; Terčelj, M. Contour Maps for Simultaneous Increase in Yield Strength and Elongation of Hot Extruded Aluminum Alloy 6082. Metals 2022, 12, 461. [Google Scholar] [CrossRef]
Terčelj, M.; Fazarinc, M.; Kugler, G.; Peruš, I. Influence of the chemical composition and process parameters on the mechanical properties of an extruded aluminium alloy for highly loaded structural parts. Constr. Build Mater. 2013, 44, 781–791. [Google Scholar] [CrossRef]
Li, J.; Du, D.; Yang, X.; Qiu, Y.; Xiang, S. Determining Homogenization Parameters and Predicting 5182-Sc-Zr Alloy Properties by Artificial Neural Networks. Materials 2023, 16, 5315. [Google Scholar] [CrossRef] [PubMed]
Wu, X.; Zhang, H.; Cui, H.; Ma, Z.; Song, W.; Yang, W.; Jia, L.; Zhang, H. Quantitative Relationship Analysis of Mechanical Properties with Mg Content and Heat Treatment Parameters in Al–7Si Alloys Using Artificial Neural Network. Materials 2019, 12, 718. [Google Scholar] [CrossRef]
Mosleh, A.O.; Kotova, E.G.; Kotov, A.D.; Gershman, I.S.; Mironov, A.E. Bearing Aluminum-Based Alloys: Microstructure, Mechanical Characterizations, and Experiment-Based Modeling Approach. Materials 2022, 15, 8394. [Google Scholar] [CrossRef]
Wiciak-Pikuła, M.; Felusiak-Czyryca, A.; Twardowski, P. Tool Wear Prediction Based on Artificial Neural Network during Aluminum Matrix Composite Milling. Sensors 2020, 20, 5798. [Google Scholar] [CrossRef]
Lacki, P.; Derlatka, A.; Więckowski, W.; Adamus, J. Development of FSW Process Parameters for Lap Joints Made of Thin 7075 Aluminum Alloy Sheets. Materials 2024, 17, 672. [Google Scholar] [CrossRef]
Merayo, D.; Rodríguez-Prieto, A.; Camacho, A.M. Prediction of Mechanical Properties by Artificial Neural Networks to Characterize the Plastic Behavior of Aluminum Alloys. Materials 2020, 13, 5227. [Google Scholar] [CrossRef] [PubMed]
Jimenez-Martinez, M.; Alfaro-Ponce, M.; Muñoz-Ibañez, C. Design of an Aluminum Alloy Using a Neural Network-Based Model. Metals 2022, 12, 1587. [Google Scholar] [CrossRef]
Li, S.; Chen, W.; Bhandari, K.S.; Jung, D.W.; Chen, X. Flow Behavior of AA5005 Alloy at High Temperature and Low Strain Rate Based on Arrhenius-Type Equation and Back Propagation Artificial Neural Network (BP-ANN) Model. Materials 2022, 15, 3788. [Google Scholar] [CrossRef] [PubMed]
Huang, C.; Jia, X.; Zhang, Z. A Modified Back Propagation Artificial Neural Network Model Based on Genetic Algorithm to Predict the Flow Behavior of 5754 Aluminum Alloy. Materials 2018, 11, 855. [Google Scholar] [CrossRef] [PubMed]
Xiong, T.; Wang, L.; Gao, X.; Liu, G. Inverse Identification of Residual Stress Distribution in Aluminium Alloy Components Based on Deep Learning. Appl. Sci. 2022, 12, 1195. [Google Scholar] [CrossRef]
Rajczakowska, M.; Szeląg, M.; Habermehl-Cwirzen, K.; Hedlund, H.; Cwirzen, A. Interpretable Machine Learning for Prediction of Post-Fire Self-Healing of Concrete. Materials 2023, 16, 1273. [Google Scholar] [CrossRef] [PubMed]
Xue, J.; Huang, J.; Li, M.; Chen, J.; Wei, Z.; Cheng, Y.; Lai, Z.; Qu, N.; Liu, Y.; Zhu, J. Explanatory Machine Learning Accelerates the Design of Graphene-Reinforced Aluminium Matrix Composites with Superior Performance. Metals 2023, 13, 1690. [Google Scholar] [CrossRef]
Grabec, I.; Sachse, W. Synergetics of Measurement, Prediction and Control; Springer Series in Synergetics; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
Peruš, I.; Poljansek, K.; Fajfar, P. Flexural deformation capacity of rectangular RC columns determined by the CAE method. Earthq. Eng. Struct. Dyn. 2006, 35, 1453–1470. [Google Scholar] [CrossRef]
Specht, D.F. A general regression neural network. IEEE Trans. Neural Netw. 1991, 2, 568–576. [Google Scholar] [CrossRef]
Terčelj, M.; Turk, R.; Kugler, G.; Peruš, I. Neural network analysis of the influence of chemical composition on surface cracking during hot rolling of AISI D2 tool steel. Comput. Mat. Sci. 2008, 42, 625–637. [Google Scholar] [CrossRef]
Bombač, D.; Terčelj, M.; Kugler, G.; Peruš, I. Amelioration of surface cracking during hot rolling of AISI D2 tool steel. Mater. Sci. Techn. 2018, 34, 1723–1736. [Google Scholar] [CrossRef]
Carpenter, D.; Min, W.; Lee, S.; Ozogul, G.; Zheng, X.; Lester, J. Assessing Student Explanations with Large Language Models Using Fine-Tuning and Few-Shot Learning. In Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024), Mexico City, Mexico, 20–21 June 2024; pp. 403–413. [Google Scholar]
Zhao, J.; Yao, Z.; Yang, Z.; Yu, H. SELF-EXPLAIN: Teaching Large Language Models to Reason Complex Questions by Themselves. arXiv 2023, arXiv:2311.06985. [Google Scholar]
Brin, D.; Sorin, V.; Barash, Y.; Konen, E.; Nadkarni, G.; Glicksberg, B.S.; Klang, E. Assessing GPT-4 multimodal performance in radiological image analysis. medRxiv 2023, 2023-11. [Google Scholar]
Yang, Z.; Yao, Z.; Tasmin, M.; Vashisht, P.; Jang, W.S.; Ouyang, F.; Wang, B.; Berlowitz, D.; Yu, H. Performance of multimodal GPT-4V on USMLE with Image: Potential for imaging diagnostic support with explanations. medRxiv 2023, 2023-10. [Google Scholar]
Jin, Q.; Chen, F.; Zhou, Y.; Xu, Z.; Cheung, J.M.; Chen, R.; Summers, R.M.; Rousseau, J.F.; Ni, P.; Landsman, M.J.; et al. Hidden flaws behind expert-level accuracy of gpt-4 vision in medicine. arXiv 2024, arXiv:2401.08396. [Google Scholar] [CrossRef] [PubMed]
Miao, J.; Thongprayoon, C.; Cheungpasitporn, W.; Cornell, L.D. Performance of GPT-4 Vision on kidney pathology exam questions. Am. J. Clin. Pathol. 2024, aqae030. [Google Scholar] [CrossRef]
OpenAI. ChatGPT (Version 3.5) [Large Language Model]. 2023. Available online: https://chat.openai.com/chat (accessed on 15 March 2024).
Mrówka, N.G.; Sieniawski, J.; Wierzbińska, M. Intermetallic phase particles in 6082 aluminium alloy. Arch. Mater. Sci. Eng. 2007, 28, 69–76. [Google Scholar]
Liu, Y.L.; Kang, S.B.; Kim, H.W. The complex microstructures in as-cast Al-Mg-Si alloy. Mater. Lett. 1999, 41, 167–272. [Google Scholar] [CrossRef]
Mrówka, N.G.; Sieniawski, J. Influence of heat treatment on the microstructure and mechanical properties of 6005 and 6082 aluminium alloys. J. Mater. Proc. Technol. 2005, 163, 367–372. [Google Scholar] [CrossRef]
Peruš, I.; Fajfar, P. Ground-motion prediction by a non-parametric approach. Earthq. Eng. Struct. Dyn. 2010, 39, 1395–1416. [Google Scholar] [CrossRef]
Tsuchida, S.; Nakayama, T. MALDI-Based Mass Spectrometry in Clinical Testing: Focus on Bacterial Identification. Appl. Sci. 2022, 12, 2814. [Google Scholar] [CrossRef]
Shen, J.; Li, R.; Shi, J.; Zhou, G. Modified Multi-Support Response Spectrum Analysis of Structures with Multiple Supports under Incoherent Ground Excitation. Appl. Sci. 2019, 9, 1744. [Google Scholar] [CrossRef]
Merrick, L.; Taly, A. The explanation game: Explaining machine learning models using shapley values. In Proceedings of the Machine Learning and Knowledge Extraction: 4th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 International Cross-Domain Conference, CD-MAKE 2020, Dublin, Ireland, 25–28 May 2020; Proceedings 4. Springer International Publishing: Berlin, Germany, 2020; pp. 17–38. [Google Scholar]
Peruš, I.; Terčelj, M.; Kugler, G. Determination of scrap/supply probability curves for the mechanical properties of aluminium alloys in hot extrusion using a neural network-like approach. Expert Syst. Appl. 2012, 39, 5634–5640. [Google Scholar] [CrossRef]
Malej, S.; Terčelj, M.; Peruš, I.; Kugler, G. Influence of cooling mode in relation to casting and extrusion parameters on mechanical properties of AA6082. Mater. Geoenvironment 2016, 64, 11–20. [Google Scholar] [CrossRef]
Qayyum, F.; Khan, M.A.; Kim, D.-H.; Ko, H.; Ryu, G.-A. Explainable AI for Material Property Prediction Based on Energy Cloud: A Shapley-Driven Approach. Materials 2023, 16, 7322. [Google Scholar] [CrossRef]
Pradhan, B.; Jena, R.; Talukdar, D.; Mohanty, M.; Sahu, B.K.; Raul, A.K.; Abdul Maulud, K.N. A New Method to Evaluate Gold Mineralisation-Potential Mapping Using Deep Learning and an Explainable Artificial Intelligence (XAI) Model. Remote Sens. 2022, 14, 4486. [Google Scholar] [CrossRef]
Hoffmann, R.; Reich, C. A Systematic Literature Review on Artificial Intelligence and Explainable Artificial Intelligence for Visual Quality Assurance in Manufacturing. Electronics 2023, 12, 4572. [Google Scholar] [CrossRef]

Figure 1. General flowchart of the automated explanation of material phenomena using ANNs.

Figure 2. The microstructure of the extruded round profile of 6082 aluminum alloy: on the edge of extruded profile (a) and in the center of the extruded profile (b).

Figure 3. Prediction of elongation as a function of casting speed (horizontal axis) and Si content: (a) mean value according to Equation (1) and (b) local standard deviation according to Equation (4).

Figure 4. Prediction of yield strength as a function of casting speed (horizontal axis) and Si content: (a) mean value according to Equation (1) and (b) local standard deviation according to Equation (4).

Figure 5. Prediction of yield strength as a function of casting speed (horizontal axis) and Si content at fixed value of ram speed, which equals 15.405 mm/s: (a) mean value according to Equation (1) and (b) local standard deviation according to Equation (4).

Figure 6. Prediction of yield strength as a function of casting speed (horizontal axis) and Si content at fixed value of ram speed and Mn content, which equal to 15.471 mm/s and 0.469, respectively: (a) mean value according to Equation (1) and (b) local standard deviation according to Equation (4).

Figure 7. Prediction of Si content as a function of ram speed (horizontal axis) and Mn content at fixed value of yield strength, which amounts to 322 MPa: (a) mean value according to Equation (1) and (b) local standard deviation according to Equation (4).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Goričan, T.; Terčelj, M.; Peruš, I. New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT. Appl. Sci. 2024, 14, 7015. https://doi.org/10.3390/app14167015

AMA Style

Goričan T, Terčelj M, Peruš I. New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT. Applied Sciences. 2024; 14(16):7015. https://doi.org/10.3390/app14167015

Chicago/Turabian Style

Goričan, Tomaž, Milan Terčelj, and Iztok Peruš. 2024. "New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT" Applied Sciences 14, no. 16: 7015. https://doi.org/10.3390/app14167015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

New Approach for Automated Explanation of Material Phenomena (AA6082) Using Artificial Neural Networks and ChatGPT

Abstract

1. Introduction

2. Materials and Methods

2.1. Description of AA6082

2.2. CAE Artificial Neural Network

2.3. General Framework for the Automatic Explanation of the Obtained Results by the (CAE) ANN

3. Results

3.1. Rules for Explaining the Results

3.2. Illustration of Explanations for Various Instances of Graphical Displays of Results for Hot Extrusion of AA6082

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI