1. Introduction
As infrastructure continues to age, structural health monitoring (SHM) techniques are being developed [
1] to observe structural components and entire systems over time, recording their responses. These responses are then compared against established benchmarks to determine if the structure is under stress, which would require a detailed inspection first, followed by necessary repairs [
2]. With the vast amount of data collected over time, recent advancements in hardware have made it feasible to use wireless sensors that can transmit signals to a remote processing unit, where the structural integrity is assessed. To manage the extensive data flow over time, sensors can be equipped with software that analyzes the monitored structure. This initial evaluation can filter out routine responses caused by environmental factors, allowing only significant data to be sent to the remote processing unit [
3]. Over time, machine learning techniques such as neural networks have been brought into the picture to help evaluate the recorded data. These are being incorporated into existing SHM systems and are further enriched with explainable artificial intelligence (XAI) capabilities [
4], which can suggest reasons for the divergence of the structural response from prescribed norms and advise as to why a specific repair protocol should be followed. Finally, structural control mechanisms can be implemented, ranging from passive to active systems, whereby the latter case requires continuous monitoring for its activation by tracking vibrations before they become harmful to the structure’s integrity [
5].
When focusing on pylons, recent literature highlights advancements in vibration control for wind turbine towers, which include passive, active, and semi-active methods, with the latter two requiring monitoring of the structural response. These methods are crucial for vibration mitigation, as they address a wide range of loads, including wind, waves, currents, and ground motions [
6]. They are also used in a variety of engineering disciplines. In mechanical engineering, for example, machine learning techniques have been applied for real-time condition monitoring of rotating shafts, emphasizing the importance of accurately identifying damage through vibration measurements for predictive maintenance [
7]. Continuous monitoring of wind turbines has also shown variability in their natural frequencies and damping ratios, allowing for enhanced tracking of modal properties through operational modal analysis [
8]. Automated procedures are nowadays developed to define reference modal properties for modal tracking, facilitating the detection of problematic resonances [
9].
Pylons can be viewed as various types of beams, which are essentially unidimensional structural elements that experience tension, compression, bending, and twisting forces. The use of beams is widespread in civil engineering but is also found in mechanical engineering (torsion bars, stringers, etc.) and in aeronautical engineering (turbine blades, struts, etc.). A few parameters enter the design of such structural elements, starting with the type and shape of the cross-section (hollow, non-uniform, etc.), type of construction (uniform, segmented, etc.), type of material (e.g., steel, concrete, composites), their support conditions and the presence of attachments (e.g., secondary systems). Beams subjected to transient loads are of particular interest, as they can be accurately modeled using the elastic waveguide approach [
10]. Depending on the specific beam and type of load applied, the three fundamental modes of mechanical behavior may become interrelated. Additionally, the Bernoulli–Euler beam theory may require modifications to account for the rotational inertia of the cross-section (Rayleigh theory) when the load frequencies are high and to incorporate shear deformation (Timoshenko theory) if the beam is not slender. Furthermore, material damping within the beam must be considered, often modeled for waveguides using the Kelvin–Voigt viscoelasticity model, which results in complex-valued material constants [
11].
Finally, it is essential to define the impact of lumped mass attachments on pylons. They are primarily equipment serving various purposes, e.g., lights, transmitters, cameras, etc. However, they may also play the role of a passive or semi-active structural control device [
12]. A recent approach [
13] explores the influence of time-varying mass attachments on flexible pylons undergoing ground-induced vibrations placed within the context of active control. An analytical solution is derived for a lumped mass at the top of a cantilevered pylon experiencing longitudinal vibrations, highlighting the complications in the modal analysis due to time-dependent system matrices. Results indicate the frequency range where a varying mass optimally modifies the pylon’s kinematic response. This methodology can also be extended to include transverse and rotational vibrations.
In addition, the attached mass may, in turn, be connected to a spring and/or to a damper. For instance, ref. [
14] examined a nonlinear pendulum with such an absorber attached to its mass. Generally, a flexural beam with an attached mass can be considered a coupled system consisting of a primary linear oscillator and a nonlinear secondary system. Significant research has been conducted in this area, particularly when the secondary system is treated as a nonlinear energy sink [
15]. Naturally, new solution methods are required, such as multiple-scale expansions that simplify the problem. The first approximation is a low-dimension vibro-impact system with a force and a damper, which can be solved analytically. However, these analyses are most appropriate when the mass of the attached system is small relative to that of the primary system.
Damage in beams due to the development of Mode I (open/closed) fatigue cracks leads to noticeable changes in the vibration frequencies of the structural system. Specifically, cracks in the web or connections of steel and reinforced concrete (R/C) beams—such as those used in girders, bridge decks, and pylons—significantly affect their vibratory response, weaken the structure, and can eventually lead to local failure [
16]. A fundamental mechanical model for simulating cracking in Bernoulli–Euler beams involves introducing discontinuities at the suspected crack locations [
17]. There are four primary types of discontinuities caused by cracks, which are modeled by incorporating the Heaviside function into the displacement, slope, bending moment, or shear force diagrams of the beam. The most common approach is modeling slope discontinuities, which introduce Dirac delta function representations in the bending moment and shear force diagrams. An engineering approach to modeling cracks, therefore, involves inserting springs at the crack location, with the type of spring depending on the specific discontinuity expected to occur [
18]. Additionally, the Dirac delta function and its first derivative can be treated as forcing functions on the right-hand side of the Bernoulli–Euler equation, maintaining the self-adjoint property of the differential operator [
19] during solution.
In this work, we focus on examining the structural integrity of pylons with variable cross-sections that are subjected to environmentally induced vibrations. Four scenarios are investigated, namely (a) the intact pylon; (b) the intact pylon with an attached point mass at the top that might serve a variety of uses, e.g., lighting equipment; (c) damage in the form of a crack at the base when recordings are available in the same direction as the crack itself; and (d) the same case as before but with recordings available in the direction perpendicular to the crack. Obviously, the first two scenarios involve a linear response of the pylon, while the remaining two scenarios address nonlinear mechanical behavior in the pylon, the reason being localized plastic flow in the vicinity of the crack’s tips plus friction across facing crack surfaces as the pylon vibrates. An engineering approximation for modelling these types of problems is using spring elements, while the current trend is to base the overall assessment of the mechanical response exclusively on data-driven methods [
20]. More specifically, data-driven methods are used when the governing equations of the problem at hand are not known. From the response to a given input, optimization is carried out by assuming a matrix function and sequentially adjusting its coefficients until the error between what is measured and what is assumed is minimized. The data drive part comes in assuming the form of this matrix function. This restricts the choices to ones that are most probably correct. For instance, in structural dynamics, this matrix function should be symmetric and positive definite. This corresponds to the eigenvalue problem for structural dynamics and makes the entire procedure for developing a CNN more efficient.
Typical data-driven methods rely on a first assessment of the eigenvalue-eigenvector pairs of the structure in question under different conditions. In this notion, any deviations observed between these pairs can be classified in terms of their causal factors. Data-driven methods are capable of extracting eigenvectors even if the mechanical response of the structure in question is nonlinear, essentially by piecewise linearization of the eigenvalue problem. Since our experimental data used as input contains nonlinearities, it was essential to use this type of method. Therefore, eigenvector extraction will be accomplished by the use of
autoencoders [
21]. See
Figure 1 for a schematic of their architecture. Every
autoencoder comprises two mappings,
and
, where
pertain to the generalized beam coordinates, the eigenfunctions, and the transverse displacements, respectively, with overdots indicating time derivatives. From the architecture shown in
Figure 1, the weights of the decoder, that is, the mapping
, are the generalized coordinates of the corresponding eigenfunction.
2. Industrial Pylon Case Study
Any CNN development pre-supposes a linear
activation function plus a
cost function interposed between the
input and the
reconstruction segments, see [
22]. By modelling the flexible pylon shown in
Figure 2 as a continuous cantilevered beam, its transverse acceleration
response can be written in terms of the spatially-dependent eigenfunctions
and the time-dependent, generalized coordinates
as
. In most cases, a small number
of low-frequency eigenvalue pairs is required for acceptable accuracy in modelling the structure in question for external vibrations. Thus, for
, the extraction of the eigenvectors using the aforementioned architecture and their verification through comparison with the Principal Component Analysis (PCA) requires the adoption of a different
cost function, separate from the standard one [
22]. However, if the transient records are produced using only one eigenvector, then the use of the standard
cost function yields the same eigenvectors as those derived from the use of the PCA. Note that for a continuous mass representation of the pylon, we use eigenfunctions, while in the case of experimental measurements, these eigenfunctions actually become eigenvectors as they are measured at discrete points along the height of the pylon.
In reference to
Figure 2, the elasticity modulus and the mass density of the pylon were experimentally determined, followed by the quantification of equivalent springs to model the loss of fixity at the pylon’s base. The cases pertaining to the first two pylon scenarios can be handled within the linear material response range. References [
23,
24,
25] give many details on the placement of the mass at the top of the pylon and the insertion of base springs, plus their effect on the pylon’s dynamic response. The ratio of the top mass to the total pylon mass
is non-negligible, i.e., it is sufficient to alter the eigenvalues of the stand-alone pylon.
4. Industrial Pylon Testing
A comprehensive experimental project was conducted to study the mechanical behavior of industrial pylons used for lighting highways [
26]. Fatigue considerations, dynamic response, cyclic loading, and base conditions are all examined for structural health monitoring (SHM) purposes, including the development of CNN, which is reported here.
Two
tall metallic pylons with a variable, ring-type cross-section provided by the manufacturer were tested in the AUTh Laboratory for Experimental Mechanics under fixed-base conditions, with one damaged after cyclic testing and the other remaining intact, as shown in
Figure 4. Next,
Figure 5 is a schematic of the testing setup, while
Table 1 lists the mechanical properties and the geometry of the tested pylons. The effect of an external force, as applied to the attached top mass on the pylon’s vibrations is described in
Appendix A.
An important design consideration was fatigue, as these pylons are exposed year-round to wind pressure and rain. Corrosion resistance is addressed by a zinc coating of the external surface, which provides negligible additional strength to the pylon. Fatigue testing showed that after cycles of low frequency and low amplitude loading, the pylon developed hairline cracks at its base, whose presence cannot be easily detected by dynamic tests. These cracks, however, act as stress concentrators with the potential to compromise the structural integrity of the pylon. More specifically, impact hammer tests, which followed before and after the cyclic testing, showed no measurable change in the dynamic properties of the cantilevered pylon. Next, a digital image correlation (DIC) system was implemented on an intact pylon as a means of tracing the development of the kinematic field from a series of time frames depicting the displacement evolution over the front surface of the pylon. These DIC studies, conducted for cyclic loads, were in excellent agreement with the conventional method of tracing the displacement vector field at a fixed location on the pylon. Overall, good agreement was achieved between the experimentally obtained results and the analytical/numerical predictions.
Table 2 lists some of the numerically obtained results derived from the use of the analytical models previously developed that pertain to the recovery of the eigenfrequencies of the pylon under the four scenarios discussed earlier on. Also,
Figure 6 shows the close agreement between the computed and the experimentally measured first flexural eigenfunction
of the tapered, cantilevered pylon versus normalized pylon height
. In sum, the experiments carried out with the two pylons were used to verify the analytical-numerical models developed for pylon vibrations under intact conditions, base damage and base fixity. Therefore, the analytical models will be used from now on to produce data for the CNN that is being developed in this work.
Data Generation from Testing under Dynamic Conditions
Dynamic testing, as realized by imparting an impact loading to the pylon with an impact hammer, was the most relevant to CNN development. More specifically,
Table 3 gives a description of the wireless acceleration sensors and their accompanying data aggregator, while
Figure 7 depicts the placement of six acceleration sensors along the height of the pylon along with the measurements carried out for the four scenarios previously described. Furthermore,
Table 4 depicts the drift in the first eigenvalue of the originally intact pylon as the crack develops after the cyclic testing is completed and followed by the impact test. We observe that this drift is negligible and does not allow for crack detection. Because of the low-frequency testing, measurements did not show any contribution from the higher eigenfrequencies to the total dynamic response of the pylon.
As far as the extraction of the corresponding eigenvectors from the measured response of the pylon is concerned, it was accomplished by the use of
autoencoders following the architecture outlined in
Figure 1.
In training the neural network, the
adam algorithm was used [
30] with a degree of learning
, while 50
epochs were required for stabilization of the
loss function. All time histories were normalized such that their maximum value over the entire time length was equal to unity. Following training, the CNN was used to extract the eigenfunctions
listed in
Table 5, as well as the generalized coordinate
for each case, see
Figure 8.
5. CNN Realization
Following the classification of damage in four categories (scenarios), it remains difficult to identify each of them from measurements involving the first eigenvalue–vector pair, which is the only possibility given the low-frequency vibration environment realized in the testing. An earlier neural network implementation by the authors [
27] involved a statistical analysis using Spearman correlation coefficients. This method quantifies the influence of varying soil stiffness, mass ratios, and tapering on the pylon’s response while excluding considerations of interior damage. This latte type of nonlinearity prompted the development of a special-purpose CNN capable of distinguishing among four damage categories. A necessary first step in this process is the preparation of the experimentally obtained data. More specifically, the data used come from an acceleration sensor
a805 placed at the top of the pylon, see
Figure 7. The time histories are converted into spectrograms [
31], which are used as input to the CNN.
For each of the four scenarios, we use the first of the time histories to set up both a training and a validation set. Following the development of CNN, we use the remaining s record snippets to create test sets that would check the CNN’s performance. Note that the record duration for the first scenario is , that of the second scenario is , and finally, that of the last two scenarios is .
5.1. Training and Validation Data Sets
As mentioned, training sets were developed using the first
of each experimentally obtained time history record. Specifically, a total of four sets containing 100 time histories were prepared (see
Figure 9), with each set corresponding to a damage scenario. These were used for the CNN training and validation, while at the same time, 281 time histories were reserved for testing the performance of the CNN. These last sets were broken down as follows: (a) 11 time histories for the undamaged state, (b) 16 time histories for the undamaged pylon with an attached top mass, (c) 127 time histories for the damaged pylon with a base crack and with recordings available in the N–S direction only, and (d) similarly with 127 time histories coming from the E–W direction.
Next is the development of a total of 100 spectrograms of
duration, with each spectrogram overlapping the next one by
. In their development, a Hanning window [
31] with an element size of 256 and an element overlap of 250 was used.
Figure 10 depicts four typical spectrograms, each pertaining to one of the four damage scenarios. Specifically, a comparison of cases (a) and (b) indicates that in the latter case, the addition of a top mass with a mass ratio of
renders the pylon more flexible. This is manifested by the fact that the indigo-colored horizontal line in
Figure 10b, which indicates the time variation of the first eigenfrequency, is lower than the corresponding one in
Figure 10a, attesting to a more flexible structural system. Also, the remaining faint indigo-colored horizontal lines in the spectrograms correspond to some parasitic eigenfrequencies brought about by some amount of flexibility in the base connection of the pylon during the experiments. Next, the blue-colored vertical lines in the spectrograms pertain to the instant when an impact was delivered to the pylon through the use of the impact hammer, which excites a wide range of frequencies, and those are registered in the response. Finally, the green-colored areas correspond to a free-vibration environment. In this case, the last two spectrograms have identical lines, as they both derive from the same impact.
5.2. CNN Architecture
The CNN architecture, shown in
Figure 11, consists of three convolution layers. The first layer comprises 32 filters (or kernels) size 6 × 6, the second layer comprises 64 filters size 3 × 3, while the third one has 128 filters, also of size 3 × 3 each. Past the end of each filter is a
max pooling operator, whose purpose is to reduce the size of
feature maps without compromising the integrity of the information contained there. Next, a
dense layer comprising 128 neurons is placed to correspond to the
feature maps with an optimal vector size of
4 ×
1, where 4 is the number of classes. Finally, the
dropout technique is used to handle problems with over-fitting. More specifically, during the training period, this technique ignores a cluster of neurons, forcing the network to seek a plurality of characteristics instead of the ones currently available.
The
cost function used in the CNN development is the
sparse categorical cross-entropy, while the computation of weights materializes through the use of the
adam algorithm with a learning rate of
a = 0.001. The total development session of the CNN lasted 100
epochs, with 80% of the 400 spectrograms used for training and the remaining 20% for validating; see
Figure 12 for details. Following training, the
confusion matrix shown in
Table 6 lists the results of the validation session, indicating a high level of accuracy in the results. Note that the CNN validation was accomplished with spectrograms that were not used in the training session.
The CNN was used to evaluate damage from the remaining spectrograms that were reserved for a true test of CNN capabilities, and the results are shown in the confusion matrix in
Table 7 and
Table 8. Since we have leakage because of the presence of non-diagonal terms in the
confusion matrix, a careful assessment of the CNN performance is warranted. We first start with each damage scenario (or class) and compute the following variables:
True Positives (TP),
False Negatives (FN), and
False Positives (FP). We observe that class (a) receives the lowest precision. Specifically, even if the input is unbalanced, the
confusion matrix indicates that if the CNN performs 100 evaluations and classifies them as belonging to scenario (a), then 58.8% (10/17) are correct, and 41.2% (7/17) are misclassified as class (d) meaning that there is crack damage in direction
N–S, but this damage does not show along direction
W–E and the pylon appears to be undamaged. This has important implications, meaning that when monitoring pylons of circular cylindrical geometry, sensors must record data along two perpendicular directions. Notice that cracks are linear elements, and when section bending occurs, the crack might be inactive if it is subjected to compression, which results in closing the two crack surfaces. This will mask the presence of the crack unless measurements are taken in the directions where the crack opens up. Should this happen, the CNN interpretation of the measurements will be reliable, and a high percentage will appear in both the precision and recall indices, which will provide a high degree of confidence for the practicing engineer’s purposes.
6. Conclusions and Discussion
In SHM, detection of damage remains difficult (unless it is already visible) despite the generation of extensive data, which has to be processed nowadays by artificial intelligence (AI) techniques. Specifically, in this work, a CNN was developed to determine damage in an industrial pylon by juxtaposing two basic states, the undamaged and the damaged ones. Dynamic testing was conducted by using an impact hammer, while time recordings were available from a number of wireless acceleration sensors. Each state further contained two sub-cases, the former having the stand-alone case and the pylon with an attached heavy mass, while the latter dealt with cracking at the base of the pylon following extensive fatigue testing, but with recordings available along the crack and perpendicular to the crack directions.
The basic conclusion of this experimental-numerical work is that damage is difficult to detect at its early stages, and sometimes, the damaged state cannot be identified, and CNN confuses it with undamaged states. This requires measurements taken in the vicinity of the suspected damage area and in the principal directions of the mechanical response. This is because damage may be detected in the transverse vibrations only, or perhaps in the axial vibrations or even in the torsional response.
It was not possible to compare results with similar situations where the response of the pylon is nonlinear and requires data-driven models, despite the fact that the literature on pylons pertaining to wind turbine design is vast, with the focus being on passive and active control mechanisms. This way, the continuous operation of the wind turbine is better secured. For less important categories of pylons, e.g., lighting posts, SHM consists of visual inspections. However, given the proliferation of such categories of pylons, for which vast numbers are produced, it is important to have SHM strategies validated through benchmark tests in place.
In closing, the pylon’s vibrations came about from random impact hammer hits along the pylon’s surface and at different time instances. It is interesting to speculate if the confusion matrices developed for these cases would exhibit similar accuracy with the one presented herein if the cause of vibration was either base motion (e.g., earthquake or machine vibrations) or top motion (e.g., the presence of a rotor).