1. Introduction
Manufacturing processes, as a rule, are controlled according to several technological parameters, the combined influence of which determines the resulting product quality. Values of these parameters can individually exert opposite (conflicting) effects on each other, so they have to be optimized [
1,
2]. Solving such problems can be considered as a ‘classical’ design of experiment (DoE), and has attracted considerable attention from many researchers [
3]. An example is turning, when it is necessary to simultaneously take into account the spindle speed, the feed rate, and a number of other factors [
4]. In additive manufacturing by the fused filament fabrication/fused deposition modeling (FFF/FDM) method, both extruder and bed temperatures, the head speed, the material feed rate, and some additional parameters have different effects on the structure and properties of the 3D-printed products [
5].
Since conducting a complete multifactorial experiment is not always possible (or rational), various optimization methods may be applied, for instance, the Taguchi [
6] or Box–Behnken [
7] techniques. Recently, artificial neural networks (ANNs) have increasingly begun to be used for solving such problems, especially for approximation or classification [
8,
9]. ANNs are characterized by high efficiency when a large (experimental) data sample is available [
10]. However, the reliability of the prediction decreases (or it cannot be considered reliable at all) with limited data sets [
11,
12,
13]. At the same time, numerous ANNs have been developed to date, so their correct selection and learning is a challenge under such conditions [
14].
Ultrasonic welding (USW) of laminated polymer composites has been implemented in many high-tech industries (primarily aerospace) [
15,
16]. To form reliable welds, numerous USW parameters have to be optimized [
17,
18], including ultrasonic (US) frequency, amplitude of sonotrode vibration, clamping pressure, USW duration, duration of clamping after USW, etc. These parameters are the input data for the USW process [
19], while its efficiency can be controlled by one output factor, namely the USW joint thinning, taking into account the need to insert an energy director (ED) between joined plates (adherends) [
20].
It should be noted that USW can be used not only for joining laminates for structural components, but for the fabrication of laminates as well [
21,
22,
23,
24,
25,
26,
27]. In such cases, their structure is formed due to processes developing at several interfaces, with the unilateral input of mechanical energy converted into frictional heating [
28]. Respectively, the number of output parameters increases, since some physical, mechanical, dimensional, and structural characteristics have to be considered [
29,
30,
31]. Their complete assessment is a rather long procedure, also requiring thorough statistical justification. The solution to this problem fully correlates with another production route for the formation of prepregs or laminates via automatic fiber placement assisted with heating via a laser beam [
32,
33,
34,
35,
36,
37].
A previous paper by the authors [
38] was devoted to the optimization of the US-consolidation (USC) parameters for the formation of USC lap joints of polyetheretherketone (PEEK) adherends, a carbon fiber (CF) fabric prepreg impregnated with polyetherimide (PEI), and two PEEK EDs [
39]. Impregnation of the CF fabric with the polymer, characterized by a low melting point and a melt flow index (MFI) greater than that for PEEK, determined the specific development of the structure formation process. In particular, molten PEI was squeezed out of the prepreg during USC, damaging the reinforcing CF fabric. Based on this experience, only PEI was utilized in this study, so PEI adherends were joined using the CF fabric impregnated with a PEI-based solution as well. A film of the low-melting TECAPEI (PEI-based copolymer) was inserted as an ED in the development of a USC procedure that firstly enables it to melt, ensuring the formation of lap joints with minimum possible damage to the CF fabric-based prepreg [
30,
40]. Hereinafter, these USC lap joints are designated as the ‘PEI adherend/Prepreg (CF-PEI fabric)/PEI adherend’ samples.
The aim of this study was to optimize the USC parameters by ANN simulation, providing the required functional characteristics of lap joints with a minimum number of full-scale experiments. To achieve this goal, it was firstly necessary to simulate the USC process as a ‘black box’ with many inputs and outputs. Then, two ANNs were trained using an ultra-small sample, which did not provide acceptable predictive accuracy for the applied simulation methods. It was proposed to implement a well-known approach, which consisted in artificially increasing the learning sample by including additional data synthesized according to the knowledge and experience of experts [
41].
This paper is structured as follows.
Section 1 provides an overview of the implementation of ANNs for optimization purposes, and generally reveals the ideology of their use to solve the problems highlighted in this study.
Section 2 presents the sequence of ANN simulation of the USC process with an analysis of both a priori and a posteriori knowledge, as well as the obtained results, while
Section 3 is devoted to their verification.
Section 4 discusses the prospects for application of the developed approach using an example of laminated composites formed by layer-by-layer USC processing of the prepregs based on the CF fabric impregnated with the PEI solution. All the above results are summarized in conclusions.
A brief overview of the application of machine learning methods using small experimental data samples. As the authors have shown previously [
42], the issue of finding an optimal combination of USC parameters cannot be considered as an optimization problem, since the conditions for ensuring the optimality of the functional characteristics of such lap joints are represented by a system of inequalities. This formulation is determined not only by the inconsistency of the requirements (formulation of criteria) for their individual properties, but also by the specifics of the problem. Practical interest is not in the specific values of the USC parameters, but their ranges. So, a solution to this problem should be approached in two stages: (i) approximation of a vector quantity (characteristics of a lap joint) in the multidimensional space of the USC parameters, and (ii) search for such a range of values of the parameter vector within which all the inequalities of the optimality condition are satisfied. In the general case, such areas contain an infinite number of solutions, so calling all of them ‘optimal’ (in the classical sense) is incorrect. More precisely, they should be referred to as areas of ‘suboptimal’ parameters (SOPs), within which the optimal solution is located.
The first stage of approximation can be carried out using (i) linear interpolation algorithms based on triangulation, (ii) inverse distance weighting or polynomial methods, (iii) basis function approach, (iv) Kriging interpolation, (v) piecewise linear function, and (vi) component-wise splines [
43,
44,
45,
46,
47,
48]. Each of these methods has both advantages and drawbacks. Nevertheless, only ANN simulation is recommended for universal approximation of a vector quantity in a multidimensional space, taking into account the relationship of the vector components and the significant non-linearity of the observed patterns [
49].
The second stage of searching and constructing SOP areas based on the results of ANN simulation can be carried out using well-known methods of cluster analysis and image processing. The issue of the implementation of one of these is outside the scope of this study since it deserves a separate investigation.
One of the key challenges in ANN simulation is the selection of the ANN type and architecture. For processing static data, feedforward neural networks (FFNNs), radial basis function networks (RBFNN)s, and their modifications are most often used [
50,
51]. The numbers of layers and neurons in them are determined not only by the complexity of an approximated dependence but also by the learning sample size [
52]. For small samples, the complexity of the applied model is typically neglected. In the presence of a large number of factors, reducing the dimension of the problem is achieved by highlighting one or two of the most significant factors, while the rest are not used for simulation. In [
53], it was proposed to compensate for the simplification of ANNs by including additional parameters to the significant factors obtained using known output/input relationships. The complexity of ANNs depends on the numbers of their layers and neurons, as well as the justified relationships. In some cases, ANNs are divided into two or more interrelated (but simple) types for clarity. A special approach to the development of ANNs that considers the known physical laws of simulated processes is described in [
54,
55]. In that case, the ANN architecture was designed according to those laws and includes a hybrid physical–statistical learning method that explicitly embeds the solution of partial differential equations into the loss function of the so-called physics-informed neuron networks (PINNs).
Great attention is paid to numerous techniques for training ANNs, such as (i) learning algorithms for solving direct problems (for example, based on the finite element method), (ii) genetic algorithms, (iii) support vector machines [
51,
56], etc.
ANNs are characterized by the so-called ‘curse of dimensionality’: as the dimensions of their input vectors rise, the complexity of ANNs increases exponentially [
49]. As a result, a learning sample has to be enhanced. For example, the following procedures should be implemented in one of the most common classes of problems related to classification and image recognition [
57,
58]: (i) generation of surrogate data, (ii) interpolation of experimental data, (iii) algorithms for shifting, permutation, reflection, and rotation of data to achieve system invariance, or (iv) randomization procedures to increase noise immunity.
Due to both the high labor intensity and the cost of full-scale research in materials science, the number of experiments performed is typically negligible [
12,
59,
60]. In these cases, the authors mean by the ‘ultra-small sample’ concept such experimental data arrays that are sufficient to draw a linear or quasi-linear relationship, but significantly less than are necessary to formulate an adequate non-linear one. For example, the number of experiments can vary from nine (for the Taguchi method) to twenty-seven (for the fractional full factorial design) when designing an investigation with three factors and their levels. In such cases, the results of hundreds of experiments are required to train ANNs, but thousands are required for deep learning. Therefore, ANN simulation is characterized by high errors in the approximation region for ultra-small samples, no matter what type of ANN and training method are implemented. This is especially true outside the range of experimentally determined values. Such a phenomenon is referred to by many researchers as ‘the poor ability of ANNs to solve extrapolation problems’ [
61]. It would seem an obvious and correct conclusion that using ultra-small learning samples requires abandoning the implementation of ANNs; however, they may be suitable for solving numerous applied problems that do not require high accuracy of simulation results, i.e., when a quality solution is enough.
It should be noted that for ANN simulation of objects and processes that are not characterized by periodicity or high correlation of values in parameter spaces, the task of extrapolating the small experimental data samples cannot be solved correctly without additional information. Therefore, one of the possible ways to minimize errors in the approximation and extrapolation regions is to add a priori known data for the boundary, limit, or special parameter values (
Figure 1). It is more correct to designate the latter a priori knowledge rather than data, because they are not obtained experimentally but are rather based on the experience of experts. A priori knowledge can be considered in a broad sense, since it includes both theoretical premises and previously comprehended knowledge about the simulated objects or processes. It is not always possible to formulate such knowledge, so the development of ANNs should be carried out in stages, assessing the adequacy of developed models and the accuracy of obtained results, as well as adding a posteriori knowledge, if necessary.
3. Verification of the Results of ANN Simulation
Verification of the results of the ANN simulation was carried out using USC parameters (
t = 510 ms,
τ = 9000 ms,
P = 1.85 atm) from the extrapolation region, for which the predicted values (i) varied most significantly, (ii) could be implemented using the available USW facility, and (iii) were located in the direction of the greatest expected errors of the ANN simulation.
Table 7 presents the predicted values of the functional characteristics of the USC lap joints, obtained using the predicted USC parameters:
- -
the RBFNNs were trained with only the experimental data (the ‘RBFNN’ row in
Figure 8b) and with both a priori and a posteriori knowledge in addition to the experimental data (the ‘RBFNN (+ knowledge)’ row in
Figure 9c);
- -
for the FFNNs, due to the ambiguity of their training, the predicted values were calculated from a variety of the results (examples shown in
Figure 8c,d and
Figure 9d,e) after training with only the experimental data (the ‘FFNN’ row) and with both a priori and a posteriori knowledge in addition to the experimental data (the ‘FFNN (+ knowledge)’ row).
Table 7 also includes the experimentally measured values of the functional characteristics of the lap joint, obtained using the optimal USC parameters.
Figure 10 shows an optical image of its cross-section. The USC lap joint thinning was 330 ± 10 µm, while the top ED was noticeably (but not uniformly) thinned (
δED top = 80 ± 60 µm). At the same time, the bottom ED thickness was changed to a much lesser extent (
δED bottom = 170 ± 70 µm). The “CF fabric layer” thickness was 280 ± 40 µm, which was slightly greater than the initial prepreg value (considering possible scattering). Samples of such USC lap joints fractured at rather high stress levels of 62.60 ± 3.13 MPa and values of elongation at the break of 4.22 ± 0.21%.
An analysis of the results presented in
Table 7 showed that the RBFNN model trained with both additional a priori and a posteriori knowledge significantly increased its accuracy, so the predicted and experimentally measured values were closer to each other. This fact confirmed the possibility of using the RBFNN models for prediction by extrapolation using ultra-small samples after artificial expansion of their sizes. Respectively, such models should be considered as the most promising ones.
On the other hand, the FFNN models, both with additional a priori and a posterior knowledge and without them, were characterized by average predicted values close to the real ones. However, the scatter of the predicted values with additional knowledge in the learning samples increased significantly (
Table 7), bringing into question the applicability of this approach. Additional research is required to determine the reasons for these variations.
A comparison of the predicted and real values (
Table 6) showed that mechanical properties were achieved that exceeded the experimental data obtained above (mode 6). In this case, the
δED top and
δED+CF parameters were outside the acceptable ranges according to
Table 1. Nevertheless, these values reflected the optimal USC parameters, enabling improvement of the mechanical properties (falling within the range specified in
Table 1), a situation which was not obtained using modes 1–9. On this basis, it was decided to change the SOP areas according to
Table 8.
The results of ANN simulation using the updated models trained with both a priori and a posteriori knowledge in addition to the experimental data (
Table 5, modes 1–9;
Table 7, the optimal USC parameters) are summarized in
Figure 11.
4. Discussion
As noted above, most researchers of USW/USC procedures implemented for joining composites based on thermoplastic binders have focused on laminates. The reason for this includes both their high strength properties and the practical relevance of the obtained results. Similar, ANNs have been applied for solving such problems [
12]. However, the key advantage of USC procedures is their short duration, expanding the application areas. For example, data on USW patterns concerning particulate composites based on thermoplastic matrices have been reported [
66,
67,
68], in addition to which the results presented above could be adapted to develop such procedures for composites fabricated from polymer blends or hybrid polymer mixtures [
69,
70,
71]. In this way, ANN simulation performed to optimize the USW parameters for obtaining USC ‘PEI adherend/Prepreg (CF-PEI fabric)/PEI adherend’ lap joints enabled an understanding of their complex mutual influence on the functional characteristics.
By analogy with the approach implemented for manufacturing laminates from sequential layers of both thermoplastic and CF fabric [
72], a similar material was fabricated from PEI/CF prepregs and EDs in this study. To achieve this goal, the optimal USC parameters were applied, which were determined through ANN simulation. The method for manufacturing the PEI-impregnated PEEK-based prepreg based on the CF fabric (Toray Cetex TC1200, Toray Industries, Japan) was described in a previous paper by the authors [
38].
Figure 12 shows cross-sections of the laminates made from (a) PEI-impregnated and (b) commercially available PEEK-based prepregs. The USC parameters justified above were used (clamping pressure of 1.85 atm, USW duration of 510 ms, clamping duration after USW of 9000 ms). The thickness of the PEI prepreg was 250 ± 20 µm, while that of the PEEK-based one was 170 ± 20 µm (PEEK prepregs were US-consolidated without EDs). The number of prepreg layers was five, while it was four for the EDs (for the PEI prepregs only). The USC procedures were carried out using the layer-by-layer method. The total number of passes of the USC instrument (sonotrode) was four. Under such conditions, it was possible to form non-porous USC laminates (joints) with satisfactory quality (minimal damage to the components). Thereby, the correctness of the optimal USW parameters predicted by ANN simulation was verified experimentally. This approach can also be implemented to repair damaged regions of other composites based on thermoplastic binders.
The research areas in which USC procedures have been implemented to form fiber-reinforced composites based on thermoplastic binders were already mentioned above:
- -
continuous ultrasonic impregnation and consolidation of thermoplastic matrix composites;
- -
ultrasonic-assisted consolidation of commingled thermoplastic/glass fiber rovings;
- -
consolidation of composite pipes by in situ ultrasonic welding (thermoplastic matrix composite tape);
- -
ultrasonic vibration-assisted automated fiber placement;
- -
automated fiber placement and tape laying (thermoplastic composite prepreg);
- -
filament winding and automated fiber placement with in situ consolidation.
All of these were characterized by the use of different USC parameters. Moreover, the sizes of the experimental data samples were very limited. Respectively, the authors believe that the approach developed in this study, based on ANN simulation using ultra-small samples, is of undoubted practical interest and can be applied to solve related problems, including the automated tape placement [
73,
74].