1. Introduction
Computer-aided engineering resources like the finite element method (FEM) represent a fundamental tool for modern engineering. Simulations stand at the base of all engineering calculations, improvements, designs, etc.
Since the initial development of the FEM for the analysis of structural components, three main element types have been created: beam, shell, and volume, each utilized for the analysis of specific problems in the balance between the complexity of the modeling process, the required computational resources, and the level of detail or precision of the results [
1].
Volume-type elements are capable of reproducing in detail the geometrical characteristics of any structure; theoretically, they should be the most utilized elements. Nevertheless, they have the highest modeling complexity and computational requirements, so they are generally utilized for detailed model analysis.
Shell-type elements are utilized to model any structure that can be generated upon a midfiber with a small thickness in relation to the other dimensions. They represent the most utilized type of elements for modelling sheet plate structures. The geometrical simplification arising from the mid-fiber-based modelling process involves an accuracy limitation with respect to volume-type elements, but the model is simpler and requires less computational power.
Beam-type elements represent the most simplified modeling procedure generated by a simple line between two points to model tubular structures. Models based on this type of element represent the most attractive alternative for real industrial applications involving the modeling of complex tubular structures such as buses’ and coaches’ upper frames, bridges, cranes, etc. As an example,
Figure 1 shows a bus’s upper body structure, where a significant number of interconnected tubular profiles can be noticed. Nevertheless, FEM models based on this type of element incur a series of limitations related to the inability to account for geometrical details, as will be further detailed below.
To address the modelling process of beam-type elements and the simplification in relation to shell element types for tubular structures,
Figure 2 presents a comparison of a cantilever beam with a total of six divisions used for nodal evaluations (S1 to S6). In the case of the beam elements (top figure), nodes are defined by their coordinates and the beam element is a line between each of the two points; the beam cross section is then set as a property of the model. In the case of the shell element (bottom figure), the section needs to be obtained by extruding the midfibers, resulting in four thin rectangles conforming to the cantilever beam in which the thickness is later defined as an element property. When these procedures get extrapolated for complex tubular structures, the modeling process with shell-type elements requires the generation of each individual profile and the creation of beam connections, thus escalating the complexity and time involved in having a complete model with respect to beam-type elements.
The main drawback of beam-element-type modeling is represented in
Figure 3, where the same beam T-junction model is obtained for both T1 and T2 real junctions, despite the fact that they have significant topological differences at the joint level with consequent behavioral differences between both junctions. In fact, all geometry-related characteristics of the junctions are reduced to a common node regardless its complexity.
One direct consequence is that the real stiffness of the junction cannot be accurately predicted, developing error responses that add up for every junction of the model. This effect can completely bias the global response analysis of complex structures.
This aspect was also analyzed in [
2], in which the authors concluded that the beam-type elements “were simply not designed to take into account and reproduce the behavior of real joints, since all the elements that reach the same joint are joined in a single infinitely rigid node”.
Given the potential resources and time saving when modelling tubular structures with beam-type elements, there is an industrial interest in developing alternative beam models to improve their accuracy without increasing the modelling complexity.
Extensive research efforts have been undertaken to improve the accuracy of beam-type element models, largely focusing on adjusting the stiffness behavior at the joint level to more accurately reflect reality [
3,
4,
5,
6]. One of the primary approaches involves introducing elastic elements to modify the stiffness behavior of the joints.
In their research, the author of [
3] proposed an alternative beam model in which a total of six elastic elements were introduced at the joint level (
Figure 4); the author presented an alternative beam model along with a complete methodology for the evaluation of the elastic element stiffness values by performing an iterative comparative analysis with shell, volume, and real T-junctions.
This model significantly improved the behavioral predictions, depending on the correct stiffness estimation values of the elastic elements for each junction type.
In order to achieve realistic and feasible industrial development for this methodology, various studies have estimated these stiffness values using a variety of methods such as structural FEM comparative analysis [
3], experimental and FEM modal analysis [
7], kriging regression models [
8], or neural networks [
2]. These methods have all resulted in satisfactory improvements in accuracy for T-junction models. Despite this, these approaches lack generalization since their application is limited to T-junctions.
In [
9], the authors performed a validation of the alternative beam model of [
3] on a complex tubular structure. Although an improvement in the accuracy was demonstrated with respect to ordinary beam-element-type models, limitations were encountered related to the fact that, when analyzing real structures, it is not always obvious to classify T-junctions as T1 or T2 (
Figure 5), and that in many cases, more complex junctions simply do not fit in a six-elastic-element alternative beam model. Even for simple tubular structures, like the one presented in
Figure 6, the model proposed by [
3] would not be applicable, as none of the corner junctions could be characterized as either T1 or T2 junctions.
Despite the research in the field, no improved general beam model along with a methodology for its application is provided in the literature, and therefore, an alternative beam-type element with more generalized application to different types of junctions in complex structures is still needed to provide a comprehensive solution for modeling such structures accurately with beam-type elements.
In pursuit of this goal, the authors propose in the present work a generalized alternative beam model consisting of six elastic elements at the extremities (
Figure 7) capable of improving the behavior of any modeled structure if the estimation of the stiffness values is correct.
In this manner, for a simple tubular structure (as shown in
Figure 7), the implementation of the alternative beam model would result in a design where the joint level always has a common node connected with elastic elements. The individual connections to each beam converging at the joint, facilitated through the elastic elements, provide enough flexibility to adapt the behavior of the structure. However, estimating the stiffness values presents a significant challenge that could undermine the benefits of having a more accurate model.
Typically, the stiffness characteristics of junctions at the joint level are quite complex and have a direct and significant impact on the overall behavior of the structure. The proposed model captures all beam interactions at the joint level through the introduction of six elastic elements, as shown in
Figure 8. This generalized approach can be adapted to any joint type, making it applicable to a wide variety of structures.
The principal challenge with the proposed model arises from the complexity involved in estimating the stiffness values in a practical manner. For instance, in the case of a simple tubular cube, it would be necessary to estimate a total of 18 × 8 = 144 stiffness values.
The authors believe that the proposed general alternative beam model should allow for the easy and quick estimation of the elastic element values. If not, the benefits of having more realistic beam models could be negated by the extensive effort required to estimate the correct stiffness values.
Drawing from the experience in the T-junction model of [
3], the estimation of stiffness values could be performed using iterative simulations, comparative modal analysis, mathematical regressions, or artificial neural networks (ANNs) [
10]. Among these, the first two methods involve extensive, joint-specific test plans, making them impractical for a general application to any type of joint. The third option requires constructing large regression models, which also depend heavily on joint construction characteristics.
In this context, the use of ANNs, already explored by the authors in [
2], presents a flexible and fast alternative for estimating the elastic elements, thereby avoiding the need for tests or joint-specific analysis.
ANNs have been used by numerous researchers to estimate results or improve FEMs with satisfactory outcomes. Initially, ref. [
11] proposed a FEM-based neural network for boundary problems. For instance, ref. [
12] proposed the use of ANNs to predict unknown constitutive laws for composite materials, while ref. [
13] utilized neural networks to estimate accurate inductor modeling and design. Another example is [
14], where ANNs were used to evaluate buckling in reinforced steel columns. All these studies demonstrated the potential of ANNs to enhance FEM models.
In this context, the purpose of this work is to assess the capability of ANNs to properly predict the stiffness of the elastic elements of the generalized alternative beam-type element over the cube structure presented in
Figure 7. It details intriguing findings about generalizing the use of elastic elements to enhance the accuracy of beam-type element models, based on artificial neural networks. The extensive analysis presented herein provides a valuable reference to be used as a starting point for similar works involving FEM structural beam models.
2. Methodology
The methodology presented explores the potential of ANNs to estimate the stiffness values of the generalized alternative beam model of the structural cube shown in 0. A total of 100,000 FEM simulations were performed, randomly modifying the stiffness values of each elastic element. Nodal displacement and stiffness values were used as inputs and outputs, respectively, for training the ANN.
In a linear FEM model like the one studied, there is a direct relationship between stiffness and displacements. Initially, it was expected that the training of the ANN would not pose significant difficulty, considering the extensive capabilities and complex applications of these computational tools. However, unexpected challenges arose, predominantly related to the interplay of stiffness values among the different joints. Generally, the better the model was trained, the higher the tendency for overfitting and convergence towards average values.
2.1. FEM Model
The model used for training the ANN was a FEM representation of a steel cube, with each side measuring one meter, modeled using the proposed alternative beam model (
Figure 9). Circular beam sections of 10 mm were used. At each corner, a total of four nodes were positioned: one “floating” common node connected to three other nodes at the extremities of the converging beam elements.
On the simulated model, the bottom common nodes were embedded and three forces of 100 N were applied at node 5 in all the X, Y, and Z directions.
In contrast to a regular beam cube, the alternative beam model cube introduced 8 × 6 = 48 additional degrees of freedom (DOF), which did not significantly increase computational effort. However, the deformations of the cube depended on a total of 18 elastic elements per junction, leading to 144 stiffness values in total.
Preliminary analysis revealed that when the stiffness was set to a value higher than 1 × 107 N/m, the results for the regular beam (without elastic elements) and the alternative beam were almost identical in terms of nodal displacements and von Mises stress values.
The dimensional characteristics of the model were kept unchanged during all simulations; in fact, all four nodes at each junction were located at the same coordinates, with the elastic elements not having a dimension and only acting as a mathematical condition (
Figure 9).
The underlying FEM formulation for the analyzed problem based on the linear relation between stiffness and displacements is as follows:
where:
{F}—Force vector (force loads applied at the nodes);
[K]—Stiffness matrix (includes the beam and elastic elements);
{u}—Displacement vector (force loads applied at the nodes).
Since the correlation between stiffness and displacements was linear, and the number of parameters to be estimated was relatively small, the simulation time was very low so a total of 100,000 FEM simulations were performed, modifying the 144 stiffness values randomly between 1 × 104 and 1 × 106 (N/m) for training the ANN.
For each simulation, the displacements of all 8 corner floating nodes were recorded in all directions (3 displacements + 3 rotations), leading to a total of 48 output values, from which the ones corresponding to the embedded nodes are null.
Ansys
® R2022, software was used for the simulations. The models consisted of regular beams with the Thimoshenko formulation (Beam188 element type) and linear elastic elements (Combin14 element type) at the corners (
Figure 10).
The stiffness of each elastic element is defined through the real constant value of the corresponding Combin14 elements.
Figure 11 presents the distribution of the real constants and the node numbers of the structure. It is noted that each corner is associated with 18 real constant numbers, corresponding to six elastic elements for each converging beam.
2.2. Initial ANN Models
The methodology carried out to train the ANN with the aim of predicting the stiffness values (output) for a set of nodal displacement is presented in the block diagram of
Figure 12. First, the stiffness values and the directional displacements and rotations of the corner nodes (1–8) were recorded for each of the 1 × 10
5 simulations carried out. The process of updating stiffness values, simulating, and storing the data was automatized with a customized APDL script to increase time effectiveness.
The displacement values were subtracted from the reference displacements obtained for a regular beam model in order to isolate the effects of the spring elements.
The resulting displacement values were then fed into different networks, always utilizing 80% of the data to train and 20% of the data to test the network and a total number of epochs of 500.
The rest of the principal characteristics of the network were varied iteratively in order to obtain the most accurate ANN configuration possible (
Figure 13):
- -
Hidden layers: 1 to 5;
- -
Hidden layer nodes: 1 to 1000;
- -
Learning rate: 0.01 to 0.001;
- -
Activation function: Relu Sigmoid, Linear;
- -
Optimizers: gradient descent optimizer (SGD), ADAM optimizer, and RSMProp optimizer.
Python with Tensor Flow and Keras was employed for the simulations of the ANN. As will be explained below, some modifications of the output layer were performed in the network based on the findings of the detailed analysis of converging elastic elements as described in the next section, but the studied ANN parameter ranges were kept unchanged.
From Equation (1), we find that the values of the elastic elements and the displacements are directly related. The elastic elements are integrated in the so-called stiffness matrix of the FEM model:
Furthermore, if the input forces {
F} are kept unmodified among the simulations, the relation between the stiffness values of the elastic elements and the resulting displacements will have an inverse linear relation.
On the basis of this straightforward relation between the input and output values, the initial hypothesis was that it should not be a big challenge for an ANN to show satisfactory results, as was especially the case for significant sampling data.
2.3. FEM Theoretical Aspects
The importance of the stiffness matrix in the response of the FEM model was noticeable, and in some way, a properly trained ANN should “learn” how it is constructed in order to predict the resulting displacements. On this basis, a detailed analysis of the stiffness matrix construction is carried out in this section for the specific case where elastic elements are combined with ordinary beam-type elements.
Ultimately, the stiffness matrix represents a fundamental element which determines how every node of every model will behave in every degree of freedom, completely defining the model response. In the most generic case of a two-node elastic element (
Figure 14), each with 6 degrees of freedom (DOF), the interaction between each DOF would be given by the stiffness values corresponding to that particular DOF, and Equation (1) would be developed as Equation (4).
From this equation, it can be seen that a nodal displacement in the “X” direction will be conditioned by the force applied in that direction and the “kx” stiffness value, evidencing the importance of the stiffness matrix in the characterization of FEM models.
In a similar way, for a generic two-node beam element with 6 DOF per node, as shown in 0, Equation (1) would be developed as Equation (5)
Figure 15.
- -
Iy, and Iz are the moments of inertia of the beam element along each axis;
- -
A and L are the cross section and length of the beam element, respectively;
- -
E and G are the elastic and shear modulus of the beam material;
- -
ϕy, and ϕz are material and geometric constants noted as: .
The terms contained in the stiffness matrix of the beam are composed by the geometrical and material properties of the beam, for example, the term EA/L represents the stiffness under axial load (traction–compression) and is given by the section area of the profile (A), the material’s Young’s modulus (E), and the length of the beam (L).
Extending the analysis to an element composed of a beam and spring element, such as the one shown in
Figure 16, three nodes, with 6 DOF per node, were obtained and Equation (1) would be developed as Equation (6).
where
k1x,
k1y,
k1z,
k1rx,
k1ry, and
k1rz are the 6 stiffness values of the elastic element.
In this case, the stiffness matrix has 3 nodes × 6 DOF per node = 18 × 18 elements. The green region corresponds to node 1 and depends on a spring behavior, the orange region corresponds to node 3 and depends on a beam behavior, and finally, the blue region corresponds to the midnode that depends on both the beam and spring having its stiffness values added. For example, for node 2, an axial load will have a stiffness value of k1x + AE/L.
The proposed alternative beam element would have a total of 4 nodes with 6 DOF, leading to a stiffness matrix of 24 × 24 with two midnodes with mixed influences of beam and spring elements. For the sake of simplicity, the complete stiffness matrix of this element is not included in the document.
In the FEM simulations of the cube structure, the material and geometry characteristics were kept unmodified. Therefore, the terms in the stiffness matrix corresponding to the beam element did not change, and the only modifications that took place were as a consequence of the 144 stiffness values that were randomly modified, as explained before.
Based on this development, the corner floating nodes of the cube structure would be equivalent to node 1 in Equation (5) for each of the three concurrent elements. Therefore, the stiffness of the elastic elements of each element would be added, leading to an equivalent value that will characterize the displacement of the floating node.
2.4. Final ANN Configuration Based on the Stiffness Behavior Evaluation
Based on the analysis presented above, it is possible to reduce the number of input parameters that feed the ANN by using the equivalent stiffness values determined in Equation (7). In this equation, subindex “i” refers to the ith floating node (1 to 8), and subindexes 1 to 3 are given indistinctly to each of the three nodes concurring in the corner.
Given that if these equivalent stiffnesses are equal, the resulting displacements obtained on the FEM model will also be equal, by implementing the equivalent stiffness values, we avoid feeding the ANN with different input combinations that lead to the same results.
The resulting ANN schema would be as shown in
Figure 17; the initial 144 output values are composed of 14 equivalent output values. The rest of the configurations of the ANN were explored as described in
Section 2.2.
Tensor Flow Keras ANN
Once we completed the 1 × 10
5 simulations, numerous ANN configurations of ANN were tested within the parameters’ ranges listed in
Section 2.2. Although most of them showed very similar results, the three that led to the most relevant results are summarized in
Table 1 and will be presented in more detail in the
Section 3.
Due to the large number of data analyzed, the computational requirements were significant for some ANN configurations. This issue was solved by utilizing Google Colab, which provides high computational resources at reduced costs, determining the necessity of programming the ANN with Tensor Flow.
4. Conclusions
This paper presents a novel, generally improved beam model, marking a significant advancement in the field of structural modeling. This model is characterized by 4 nodes, 12 elastic elements, and 1 beam, in stark contrast to the conventional beam elements that comprise only 2 nodes and 1 element. This innovation enhances the adaptability of modeled structures at the joint level and necessitates a novel methodology for the precise estimation of elastic elements at the joint level.
The model is based on the addition of elastic elements to a joint of concurrent beams, making it capable of characterizing its stiffness behavior irrespective of the joint type. The main challenge in developing this model was determining the stiffness values of the newly introduced elastic elements, for which we explored the capability of artificial neural networks (ANNs) in this paper.
The case study is a simple cube-shaped structure with a total of eight corner junctions in which elastic elements stiffness are randomly modified and the resulting displacement is recorded to be used as input data for training the ANN.
A wide variety of ANN configurations were explored, obtained by modifying the number of hidden layers and neurons, the learning rate, the activation function, and the optimizer. The following conclusions were obtained:
- -
In general terms, unexpectedly, the different ANNs evaluated led to similar unsatisfactory results, with a clear tendency to center the predictions on the average of the correct values, with slight variations in the dispersion depending on the ANN configuration.
- -
Further analysis performed revealed that this behavior is noted in the ANN when the input and output data samples are random (i.e., with no correlation between them). This lack of correspondence was clearly observed between most of the displacements (input) and stiffness values of the elastic elements (output) on the calculated structure. Apparently, in such a situation the best prediction that the ANNs achieve to minimize the errors is to center on the average of the correct outputs.
- -
The limitations of using ANNs for general predictions of FEM models’ responses were evidenced based on a few nodal data, where crossed effects might not be characterizable and only pseudorandom responses can be obtained.
- -
Alternatives focused on an increased representation of the stiffness matrix in the ANN training data (i.e., more nodal displacement results) might be explored, although the authors cannot provide a guarantee of success, and would certainly suppose a significant increase in the ANN complexity and consequent computational costs.
In any case, the findings provided in the present work represent a contribution in itself to the state of the art of the scientific knowledge regarding the connection between artificial neural networks and the finite element method, and the information is valuable to be employed in continuing engineering and research projects on this field.