1. Introduction
The frequency range of a terahertz wave is 0.1~10 THz between infrared and micro-wave, with its low energy, strong penetration, spectral information, and other characteristics. It has a wide range of applications in 6G communication [
1], food safety [
2,
3], and biomedicine [
4,
5,
6]. Frequency Selective Surfaces (FSSs), a two-dimensional planar structure of metamaterials, can regulate terahertz amplitude, phase, polarization, and other covariates flexibly and diversely, which provides a brand new platform for the design of miniaturized, high-performance terahertz micro- and nano-optical devices [
7,
8,
9]. The design of FSSs is faced with difficulties including numerous unit structures, parameter complexity, and time-consuming optimization. A simple, efficient, and unified method to complete the design of various FSSs is urgently needed. It is very important for FSSs to move from simulation to engineering.
To address the challenge exposed by the diversity of unit cell structures, researchers have adopted a discretization approach for the periodic structural elements of Frequency Selective Surfaces (FSSs). By integrating full-wave simulation methods with optimization algorithms such as genetic algorithms (GAs) and particle swarm optimization (PSO), intelligent FSS design has been achieved [
10,
11,
12,
13], effectively eliminating reliance on empirical expertise. However, this methodology exhibits a notable drawback: the iterative optimization process necessitates repeated calculations of FSS spectral characteristics, resulting in substantial computational time and resource consumption.
With the rapid advancement of artificial intelligence, researchers have begun to explore the use of neural networks as alternatives to traditional analytical methods, numerical techniques, and full-wave simulation software [
14,
15,
16,
17,
18,
19]. Leveraging the framework and principles of deep learning, once the neural network is trained, the spectral characteristics of Frequency Selective Surfaces (FSSs) can be directly obtained by inputting parameters such as discrete encodings of the FSS structure [
20]. This approach has demonstrated an efficiency improvement of several orders of magnitude in the simulation process compared to conventional methods. In recent years, scholars both domestically and internationally have paid significant attention to research in this area [
21,
22,
23]. In 2019, Qiu Tianshuo and colleagues from the Chinese Air Force Engineering Research Institute employed deep learning techniques to accurately predict discrete coding metasurface structures and design a tri-band absorber [
24]. Their study utilized a training set comprising 70,000 data samples and a validation set of 10,000 data samples. Similarly, Y. Li and collaborators from the Harbin Institute of Technology implemented a multifunctional transmit array metasurface design capable of frequency reuse and polarization reuse by integrating neural networks with genetic algorithms. Their approach involved a training set of 36,000 data samples and a validation set of 4000 data samples [
25].
Deep learning-based FSS design methods, while significantly enhancing efficiency during the design phase, still require substantial time and resources for data collection. Consequently, scholars have begun exploring transfer learning to reduce data requirements and improve data utilization efficiency [
26,
27]. Dong Xu [
17] realized the transfer learning of FPN from rectangular meta-atoms to elliptical meta-atoms, which was used to predict real part spectra and imaginary part spectra in the range of 100~300 THz. Compared with the training process of the two neural networks, the former without transfer learning had a data size of 27,000, and the latter with transfer learning had a data size of 4500, and the training time was reduced from 4 h to 20 min, and the test error was reduced from 0.00076 to 0.00067. Ruichao Zhu et al. [
18] used Goog-LeNet-Inception-V3 to predict the phase of a 28 × 8 meta-atom with an accuracy of about 90%, realizing transfer learning from the field of image processing to the field of metasurface prediction. Zhixiang Fan et al. [
19] achieved transfer learning from cheap and simple source metasurface data (5 × 5) to intensive and complex data (20 × 20) by an encoder–decoder network. This transfer can speed up the metasurface design process and save target data. Previous studies have achieved the efficient design of FSSs within a defined frequency range, which is typically limited by the dataset. If an FSS needs to be designed in other frequency ranges, it becomes necessary to collect data and retrain the neural network again by the usual approach. However, collecting data is the most time-consuming and resource-intensive step, and how to improve data utilization efficiency and save computing resources remains the difficulty in developing FSS design methods based on deep learning.
In this paper, the influence of FSS parameters on its spectral characteristics is analyzed in detail. The theoretical basis for transferable learning in the FSS design method, combining deep learning with intelligent algorithms, is clarified. More convolutional neural networks are implemented for transfer learning from terahertz to millimeter-wave bands. This research provides a reliable reference for the cross-fusion of deep learning and FSSs. FSS design has been further developed in an efficient and universal way.
2. FSS Structure Design and Spectral Characterization Analysis
The FSS simulation structure is shown in
Figure 1.
Figure 1a shows the three-dimensional simulation schematic of the FSS with the following features: the substrate was of high-resistance silicon, in the terahertz band its refractive index was
, the absorption can be ignored [
28], and the thickness
of the substrate was 5 μm. The external medium was air and the absorption can be ignored [
28]. The surface of the dielectric substrate was covered with a thin layer of gold, and its conductivity was
, and its thickness was
. The cycle length of the FSS was
.
The corresponding coding unit structure is shown in
Figure 1b. The period unit is discretized into small squares with a side lengths of
, in which the number “1” denotes the thin gold layer and the number “0” denotes the substrate. Therefore, the FSS was composed of 16 by 16 “0/1” matrices with one-to-one correspondence.
Considering the polarization stability of FSSs, all FSSs were set as rotationally symmetric structures.
Figure 1b demonstrates the four symmetry axes of the FSS, which reduced the degrees of freedom of the FSSs from 2
16×16 to 2
36. It reduced the difficulty of FSS optimization design greatly.
Following the completion of the FSS symmetric structure model, the simulation was conducted utilizing COMSOL. The boundary condition was designated as the Floquet periodic condition, with a simulation frequency range of 0.5–3 THz and a step size of 0.02 THz, consequently yielding a total of 126 frequency points. The electromagnetic properties of the FSS were characterized in this study by means of the total transmittance, total reflectance, and total absorptance, in order to investigate its transmission and reflection characteristics. The spectra of the “ring-shaped” encoded FSS under TE and TM conditions are shown in
Figure 2a,b, respectively. It can be observed that the two spectra are nearly identical. A similar conclusion can be drawn from the calculation results of other randomly encoded FSSs that satisfy rotational symmetry, thereby verifying the polarization stability of the rotationally symmetric structure.
The dataset serves as the cornerstone of deep learning, where both the quantity and quality of data directly influence the training efficacy of neural networks. The dataset in this study comprised two components: the binary “0/1” matrices corresponding to the Frequency Selective Surface (FSS) configurations and their associated spectral responses. To optimize data generation efficiency, 26,000 unique pairwise-distinct “0/1” matrices were randomly generated. An automated workflow was implemented using MATLAB (Version R2021b) to control COMSOL (Version 6.0) Multiphysics, sequentially constructing FSS models for each matrix, performing electromagnetic simulations, and exporting the computed spectral responses to MATLAB for post-processing and archival storage.
Due to the large amount of data required, and in order to save computation time, “transition boundary conditions” were used instead of thin metal layers to avoid mesh partitioning. Additionally, mesh partitioning settings were optimized to minimize the number of elements and accelerate the computation speed.
Table 1 provides a comparative analysis of two meshing strategies in terms of element count and computational time. The physics-controlled meshing, an automated COMSOL algorithm that determines element size based on the highest frequency in the operational band, delivered higher numerical accuracy. In contrast, the user-defined meshing allowed researchers to manually specify element dimensions tailored to specific simulation requirements.
Figure 3 compares the computational results of the same frequency-selective surface (FSS) using two meshing strategies, demonstrating close agreement between the two approaches. In
Figure 3, the dashed line represents the spectral curve obtained with the default physics-controlled meshing, while the solid line corresponds to the user-defined meshing, achieving a 56.7% reduction in computational time. The maximum and mean absolute errors (MAE) for key performance metrics were quantified as follows: (1) transmittance: maximum error = 0.0445, MAE = 0.0193; (2) reflectance: maximum error = 0.2425, MAE = 0.0291; and (3) absorptance: maximum error = 0.1265, MAE = 0.0173.
The elevated reflectance error at 1.86 THz was attributed to sharp fluctuations in transmittance near the FSS resonance frequency, where field gradients challenge mesh resolution. Despite this localized discrepancy, the low MAE for reflectance (0.0291) confirmed that the slight resonance frequency shift caused by the reduced mesh density negligibly impacted the overall spectral characteristics of the FSS. This validates the applicability of the user-defined meshing strategy for large-scale dataset generation while significantly accelerating simulations.
Further optimization was achieved by replacing the default MUMPS solver with the PARDISO solver, explicitly designed for parallel computing. This adjustment reduced the per-simulation computation time to 2 min and 22 s, enhancing workflow efficiency for the 26,000-sample dataset.
The FSS parameters were mainly categorized into two types: geometrical parameters and material parameters. Geometrical parameters include the period length
D of the FSS, the thickness
h of the substrate, and the thickness
h0 of the metal film layer. Material parameters include the metal conductivity and the refractive index of the substrate. For the convenience of later discussion, all FSSs in this paper used the encoding as shown in
Figure 1b, unless otherwise stated.
Figure 4 demonstrates the spectral properties of different metals of the same FSS surface model.
Figure 4a–d are gold, silver, brass, and aluminum, respectively, whose conductivities are shown in
Table 2. It can be found that the spectral properties were almost identical. This phenomenon is determined by the interaction between light and metal. The calculation of the skin depth
is as follows:
where
is skinning depth of the metal,
is the frequency of electromagnetic wave,
is magnetic permeability of the metal, for non-magnetic media take
, and
is conductivity of the metal.
Table 2 gives the common types of metal conductivity, the simplified skinning depth formula, and its skinning depth at 1 THz. It can be seen that the skinning depth at 1 THz of each type of metal was less than 200 nm. When the thickness of the metal was 200 nm, the effect of different metal types on the spectral characteristics of the FSS was relatively small and could be ignored.
Figure 5 shows the spectral characteristics of the FSS when the surface coating was changed to graphite. Due to the difference of two orders of magnitude between the electrical conductivity of graphite and that of the metal, the skinning depth reached 1600 nm at the incident electromagnetic wave frequency of 1 THz, which is much larger than that of the coating thickness of 200 nm, resulting in an obvious change in the spectral curve.
Figure 6 shows that spectral properties of different thicknesses of the same substrate with the same FSS model. The thickness of the substrate in
Figure 6a is
, and the thickness of the substrate in
Figure 6b is
. The metal type of both models was Au and the type of substrate was high-resistance silicon. They are compared with
Figure 2a. The high transmissivity resonance points of the different substrate thicknesses are shown in
Table 3. The trend is that as the substrate thickness increases, the resonant frequency of FSS shifts towards lower frequencies and the overall trend is similar, but the transmittance decreases overall.
The resonant frequency of the FSS shifted when a substrate of infinite thickness was loaded unilaterally, which is given as follows:
In the above analysis, either the increase in the relative permittivity of the substrate or the increase in the thickness of the substrate was equivalent to the increase in the equivalent permittivity , which led to the shift of the resonant frequency of the FSS to the lower frequency. The simulation results are in good agreement with the theoretical results.
As a subwavelength structure, the period length of FSS should be comparable to the wavelength corresponding to the highest frequency of the incident electromagnetic wave. When the wavelength was much smaller than the period length, the incident electromagnetic wave exhibited a geometrical optics-like property. When the wavelength was longer than the period length, it could not control electromagnetic waves due to the drastic changes in its transmittance spectrum.
Figure 6 shows the comparison results of the spectral properties of the FSS for the same FSS model by expanding its period length by a factor of 100 accordingly when the maximum frequency of the incident electromagnetic wave was reduced by a factor of 100. That is, when the corresponding wavelength was kept equal to it, the comparison results of the frequency spectrum characteristics of FSS. The comparison of the parameters of the cross-band FSS model is shown in
Table 4 below.
Comparing
Figure 7a with
Figure 7c, it can be found that the spectral curves of the two are very different due to the differences in the substrate thickness and relative dielectric constant. However, if the effect of substrate was excluded and the substrate thickness of both FSS models was set to 0, the spectral curves of the two would trend as-shown in
Figure 7b with
Figure 7d, respectively. It can be seen that although their spectral curves were not exactly the same, the spectral curves of the two had a similarity in the trend. A similar conclusion can be obtained by changing the coding of the FSS. This means that FSSs with a certain geometry can have similar spectral characteristics in different electromagnetic wave frequency ranges by scaling their geometrical size.
According to the above analysis, the influence law between the parameters of FSS and its spectral characteristics can be summarized as follows:
When the metal thickness of the FSS was greater than its skin depth, the spectral characteristics of FSS with different metal types and thicknesses were similar;
Different thicknesses of substrates affected their equivalent dielectric constants, resulting in a shift in the resonant frequency of the FSS;
When the period length of FSS was scaled at the same time and the highest frequency of incident electromagnetic wave was the same multiple, the spectrum curves of FSS before and after scaling were similar for pure metal FSS with the same shape.
3. Cross-Band Transfer Learning for FSS
The conclusions drawn in the previous section can be used to realize cross-band transfer learning for FSS. The millimeter-band FSS model determined by
Table 4 was adopted, i.e., the width of the small square grid encoded by the FSS was enlarged by 100 times from 5 um to 0.5 mm. The number of small squares in each cycle unit was kept 16 × 16. The corresponding frequency of the incident electromagnetic wave was narrowed by 100 times from 0.5~3 THz to 5~30 GHz. Its corresponding wavelength was enlarged by 100 times, keeping the cycle length comparable to that of the FSS. In addition, the substrate high-resistance silicon was changed to FR-4 commonly used in the microwave band, which had a relative dielectric constant of 4.3 and a substrate thickness of 1 mm.
Using a combination of MATLAB and COMSOL simulations, 26,000 sets of data in the THz band and 6000 sets were collected in the microwave band. The structure of the convolutional neural network is shown in
Figure 8. The input “image” was a 16 × 16 “0/1” matrix. The output was a 1 × 126 predicted spectral curve. The structure was mainly composed of two parts. They were the “convolution-pooling” part, which was used to extract image features, and the “fully connected” part, which was used to output the spectral curve. The loss function MSE is [
29]:
where
is the CNN prediction result,
is the COMSOL simulation result, and
N is the number of frequency points.
The convolutional neural network in the THz band trained the neural network from random initialization, while the CNN in the millimeter band trained the neural network using transfer learning on the basis of the former. According to the previous analysis, the same FSS coding matrix had similar spectral properties in different electromagnetic wave bands when certain conditions were satisfied. When transfer learning is used to train the neural network, the convolutional part used to extract the coding features should be frozen and the fully connected part should be retrained. The other training details were exactly the same for both.
During the training process of the neural network, the Adam adaptive gradient de-scent algorithm was used to update the weights. The initial learning rate was set to 0.001. The exponential decay was used to adjust the learning rate, i.e., the learning rate decreased to the original 0.5 every 5 rounds:
where
LearnRate is the learning rate corresponding to each round of training and
epoch is the number of rounds of training. A total of 80% of the dataset was the training set and 20% was the test set. The training process is shown in
Figure 9.
Figure 9a shows the training process of the THz-band CNN. After 20 rounds of training, the error of the test set tends to be stabilized. Total training time was 18 min.
Figure 9b is the training process of the millimeter-band CNN. Due to the reduction of the data volume, the total training time was only 1.5 min, much lower than the former.
After the training was completed, two neural networks were validated using their respective test sets and the neural network outputs were compared with the COMSOL simulation results. Their objective evaluation results are shown in
Table 5.
is the mean absolute error,
is the root mean square error, and
values range from 0 to 1, which is used to describe the degree of fitting between them. The formulas are as follows [
30]:
where
is the CNN prediction result,
is the COMSOL simulation result,
is the average value of the simulation result, and
N is the number of frequency points.
The training process is shown in
Figure 10. It can be observed that due to the reduced amount of data, the training results quickly reached stability. After training, the objective evaluation metrics were as follows: MAE was 0.0719, RMSE was 0.1267, and R-squared (R
2) was 0.8145. Compared to the objective evaluation metrics from training from scratch (MAE of 0.0592, RMSE of 0.1005, and R
2 of 0.6692), both MAE and RMSE increased, indicating that the errors had grown and the performance of the neural network had worsened. However, R
2 had increased, indicating improved fit, and thus the neural network’s performance appeared better, showing a contradiction. This is because the data used in millimeter-wave transfer learning were a subset of terahertz-wave data, which enhanced the prediction results. In a comprehensive analysis, the performance of the neural network trained with transfer learning should be weaker than that of the network trained from scratch, but transfer learning only used 23.1% of the former’s data, and both methods showed comparable performance. This makes transfer learning an important and meaningful neural network training approach.
After the training of the transfer learning neural network was completed, some of the results were randomly selected for comparison. The comparison results are shown in
Figure 11. It was found that both the training set and the test set had a high degree of fit, which is comparable to the effect of the neural network trained from scratch by the THz CNN.
After completing the training of the transfer learning neural network, the convolutional neural network and the genetic algorithm can be jointly used to complete the design of various types of typical FSSs in microwave bands. It is worth mentioning that this paper only takes microwave bands as an example to show that this method of transfer learning is feasible in cross-band FSS design. Advantages of concise and efficient deep learning can be further brought out due to the reduction in the amount of original data from 26,000 to 6000 groups, which is a reduction of 76.9%. The method can be applied to other bands where necessary through the same steps to expand the method from solving a specific problem to solving a certain class of problems, greatly expanding the application range of the method.