Article

Fast Near-Field Frequency-Diverse Computational Imaging Based on End-to-End Deep-Learning Network

1. Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei 230601, China
2. State Key Laboratory of Millimeter Waves, Southeast University, Nanjing 210096, China
3. East China Research Institute of Electronic Engineering, Hefei 230031, China
4. School of Electronics and Communication Engineering, Guangzhou University, Guangzhou 510006, China
* Author to whom correspondence should be addressed.
Sensors 2022, 22(24), 9771; https://doi.org/10.3390/s22249771
Submission received: 2 November 2022 / Revised: 1 December 2022 / Accepted: 12 December 2022 / Published: 13 December 2022
(This article belongs to the Section Sensing and Imaging)

Abstract: The ability to sculpt complex reference waves and probe diverse radiation field patterns has facilitated the rise of metasurface antennas, yet a compromise remains between the required wide operating band and the non-overlapping character of the radiation field patterns. Specifically, the current computational image formation process, whether with a classic matched filter or with other sparsity-driven algorithms, inevitably faces the challenges of a relatively confined scene information sampling ratio and high computational complexity. In this paper, we marry the concept of a deep convolutional neural network with the computational imaging literature. Compared with the current matched filter and compressed sensing reconstruction techniques, our proposal can handle a relatively high correlation of measurement modes and a low scene sampling ratio. With the delicately trained reconstruction network, both point-size objects and more complicated targets can be quickly and accurately reconstructed. In addition, the otherwise unavoidable heavy computational burden and the essential large operating frequency band can be effectively mitigated. Simulated experiments with measured radiation field data verify the effectiveness of the proposed method.

1. Introduction

With the capability to sculpt complex radiative wavefronts and couple energy from the reference wave into a desired radiation pattern, metasurface antennas have made tremendous progress in both antenna design and computational imaging (CI) applications [1,2,3]. In general, the overall imaging performance of a metasurface antenna computational imaging (MSACI) system is closely tied to the optimized design of the frontend antenna in combination with a robust backend computational imaging algorithm. Since the first proof-of-concept study was demonstrated by Hunt et al. [4], numerous innovative metasurface architectures have been reported, including metamaterial apertures, Mills–Cross cavities, metallic leaky cavities, etc. The fundamental design of these metasurface antennas has evolved from periodic uniform metamaterial aperture configurations to chaotic slot arrangements. In the context of designing metasurface antennas for CI, the key factors that influence CI performance include the radiating elements, the feeding structure, and the reconstruction algorithm. The intricate interplay of these factors in the current CI framework gives rise to inherent inconsistencies between maximizing spatial frequency sampling and maintaining the necessary signal-to-noise ratio, i.e., measurement quantity and quality, radiation losses, and the loaded Q-factor of the radiating element. Towards the primary goal of metasurface antenna CI, the trade-off among the abovementioned inconsistencies needs to be properly balanced in practical imaging applications.
Frequency-diverse imaging is a computational imaging technique that leverages frequency-diverse antennas to capture and reconstruct scene information. A frequency-diverse antenna is a type of metasurface antenna that radiates field patterns varying strongly as a function of the driving frequency. Over a given bandwidth, the fields of a frequency-diverse antenna vary spatially in a quasi-random (or quasi-orthogonal) manner across the operating frequency band. The key feature of frequency-diverse imaging is that the scene information is encoded onto these quasi-random field patterns, with the data acquired by means of a simple frequency sweep. Data acquisition is therefore performed in an all-electronic manner (no mechanical scanning apparatus is required) and without the need for phase-shifting circuits.
In essence, the crux of MSACI lies in generating a sequence of spatially diverse field patterns to multiplex scene information and transfer the burden of image formation from hardware to software, yielding a plethora of work dedicated to the metasurface design topic [5]. Moreover, in most current MSACI demonstrations, including monostatic and multistatic configurations, the built measurement matrix (transfer function) is typically ill-conditioned and not generally amenable to a straightforward matrix inversion. Scene reflectivities are then retrieved using computational techniques, mainly least-squares solvers or compressive reconstruction techniques. These methods either directly adopt a matched filter (MF) approach or take the MF result as an initial estimate and use an iterative optimization method to refine it [6]. In this way, the burden of imaging is shifted from the antenna frontend to the software backend, allowing constraints on the physical layer to be relaxed while maintaining high-fidelity and near real-time imaging. Specifically, the matched filter (MF) algorithm in [1] is more frequently used in modern computational imaging systems when the imaging accuracy satisfies the requirements, owing to its low computational complexity and high efficiency. The sparse Bayesian learning (SBL) reconstruction algorithm in [7] is a sparsity-based reconstruction algorithm that obtains imaging results closer to the real scene when the scene-echo signal-to-noise ratio is low. The primary challenge of these compressive sensing (CS) [8] reconstruction techniques is that the success of the reconstruction process depends heavily on the randomness as well as the condition number of the measurement matrix. Furthermore, for imaging problems of extremely large dimensionality, the computational cost drastically increases given the memory and storage required for matrix inversion and iteration.
Moreover, compressive reconstruction techniques with an ill-conditioned measurement matrix still require a prohibitively high signal-to-noise ratio to maintain acceptable imaging performance.
Note that deep learning (DL)-based strategies have now been successfully applied in a wide variety of imaging applications, including magnetic resonance imaging, X-ray computed tomography [9], computational optical imaging [10,11,12], and some cases of compressive meta-imagers [13,14]. Specifically, it has been found that they can typically outperform conventional image formation techniques in terms of image quality and computational speed [15,16,17,18,19,20,21]. More importantly, the intractably large datasets and ill-conditioned measurement matrices that are extremely troublesome for CS-based approaches can still be properly handled with a DL-based method [22]. For example, to address the challenging domain of inverse scattering imaging problems, a cascaded complex U-net model was presented in [23]. The CCU-net cascaded a Phase Retrieval Net (PRNet) and an Image Reconstruction Net (IRNet) in order to recover the phase and amplitude of the scattered field from the observed modulus of the total field and to reconstruct the target, respectively. Numerical simulations showed that the suggested method could successfully recover the scattered field from both simulated and real-world examples without any prior knowledge.
Provided prior knowledge about the nature of the scene to be imaged is available, these works have shown that it is possible to limit the number of sequential captures necessary for image reconstruction compared to the use of purely random patterns. Recent work in [24] took this idea even further by directly integrating a model of the physical layer into an artificial neural network in order to jointly learn optimal measurement and processing strategies based on prior knowledge of the scene, task, and measurement constraints. Since this "learned sensing" strategy enables one to minimize the acquisition of task-irrelevant information, it is highly task-specific and requires a supervised learning technique. In contrast to these prior works [23,25,26], the approach we propose here applies a deep network to the sensing matrix rather than to a specific expected scene. Consequently, our approach is not scene dependent and does not require sequential measurements relying on active reconfigurable antennas. We propose a system-dependent but scene-independent method relying on a frequency sweep to generate a succession of random illumination patterns that interrogate the scene to be imaged; via the optimized deep neural reconstruction network, the dimensionality of the measurement matrix can be limited, and thereby the computational complexity of the image reconstruction can be reduced. Moreover, from the perspective of imaging system implementation, the large required operating frequency band and the design burden on the current metasurface antenna frontend can be eased to some extent.
Lowering the computational complexity of the frequency-diverse CI reconstruction problem and relieving the design pressure on the metasurface antenna frontend are at the heart of the present paper: we bring the concepts of convolutional neural networks (CNN) [27] into a microwave computational imaging [28,29] framework, using a carefully designed lightweight deep neural network to reduce the computational complexity of the imaging problem for real-time operation. For the delicate design of a fully connected layer and several convolutional layers, an adaptive momentum estimation (Adam) optimizer [30] with an adjusted learning rate, relatively low memory usage, and fast convergence speed is used to update the network weight parameters. The widely used sparse-target MNIST [27] dataset and the complicated-target FMNIST [31] dataset are used to train the network. The trained reconstruction network is then able to directly retrieve scene reflectivities with high fidelity and handle the prohibitively large dimensionality of the measurement matrix. Extensive imaging simulations with measured radiation field data are conducted to verify the effectiveness and robustness of our MSACI-Net method. Compared with the current matched filter and compressed sensing reconstruction techniques, our proposal can handle a relatively high correlation of measurement modes and a low scene sampling ratio. Since the antenna measurement modes are frequency-dependent, this superior image reconstruction capability reduces the need for a large operating bandwidth, and, more importantly, the antenna radiation efficiency can be enhanced in the antenna design process to some extent so as to maintain a relatively high SNR level, which is crucial for frequency-diverse computational imaging.

2. Forward Mathematical Model

Before constructing a suitable mathematical model, we first analyze the working mechanism of the near-field computational imaging system based on metasurface antennas. The basic concept of metasurface antenna imaging is to construct a subwavelength resonant aperture structure to regulate the polarization characteristics of electric and magnetic dipoles, producing different polarization characteristics by designing subwavelength basic resonant units with different geometric structures. As the driving frequency changes, the radiation field excited from different subsets of resonators changes. Objects within the scene scatter the incident fields, producing the backscattered components detected by the waveguide probe at the transmitting antenna plane.
To a large extent, the concept of MSACI belongs to the inverse scattering problem. The frequency measurements collected by the receiving probe are related to the scene reflectivity through the measurement matrix (transfer function), which is the product of the electric fields from the transmitting antenna and the receiving probe at each location in the scene. Figure 1 shows the schematic diagram of a metasurface antenna imaging system with single transmitter and single receiver [32]. The total fields that propagate into the OEWG have the following proportionality:
$g(f) \propto \int_V \mathbf{E}_{\mathrm{TX}}(\mathbf{r}; f) \cdot \mathbf{E}_{\mathrm{RX}}(\mathbf{r}; f)\, \sigma(\mathbf{r})\, \mathrm{d}^3\mathbf{r}$   (1)
where $\mathbf{E}_{\mathrm{TX}}(\mathbf{r})$ and $\mathbf{E}_{\mathrm{RX}}(\mathbf{r})$ denote the transmitted and received fields, respectively, and $\sigma$ represents the scene target reflectivities. Since the system is both diffraction and bandwidth limited, (1) can be written as a more general and concise matrix equation:
$\mathbf{g} = \mathbf{H}\boldsymbol{\sigma} + \mathbf{n}$   (2)
where $\mathbf{g} \in \mathbb{C}^{M \times 1}$ is the received measurement vector collected by the low-gain OEWG, $\boldsymbol{\sigma} \in \mathbb{C}^{N \times 1}$ denotes the unknown reflectivity vector of the discretized scene space, $\mathbf{n} \in \mathbb{C}^{M \times 1}$ is an additive noise term included for generality, and $\mathbf{H} \in \mathbb{C}^{M \times N}$ is the measurement matrix constructed from the transmitting and scattering fields.
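As a concrete illustration, the discretized forward model in (2) can be sketched in a few lines of numpy. The dimensions below ($M = 400$ measurement modes, $N = 61 \times 61$ scene voxels) mirror the experimental setup described later in the paper; the random complex matrix is only a stand-in for the measured product of transmit and receive fields, not the system's actual $\mathbf{H}$.

```python
import numpy as np

# Sketch of the discretized forward model g = H @ sigma + n from Eq. (2).
# M = 400 measurement modes, N = 61 * 61 scene voxels (illustrative only;
# the real H is built from the measured transmit/receive field product).
M, N = 400, 61 * 61
rng = np.random.default_rng(0)

H = (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))) / np.sqrt(2 * M)
sigma = rng.random(N)                                   # reflectivities in [0, 1)
noise = 1e-3 * (rng.standard_normal(M) + 1j * rng.standard_normal(M))
g = H @ sigma + noise                                   # simulated measurement vector

print(g.shape)  # (400,)
```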
In the vast majority of Fresnel-zone MSACI systems, to characterize the transmitting aperture, the near field of the aperture is raster scanned with a reference antenna, and the measured fields are then propagated to arbitrary locations in the scene using equivalent surface principles and the corresponding Green’s functions. To reconstruct the scene information, the matrix equation described in (2) needs to be solved for σ . In classic matrix theory, various matrix inversion techniques have been proposed for both well- and ill-conditioned H .
In the simplest implementation, assuming additive white Gaussian noise, an MF reconstruction approximately solves (2) as $\hat{\boldsymbol{\sigma}}_{\mathrm{est}} = \mathbf{H}^{\dagger}\mathbf{g}$, where $\dagger$ denotes the conjugate transpose operator. Alternatively, given the compressed-sensing measurements $\mathbf{g}$, the measurement matrix $\mathbf{H}$, and the original signal $\boldsymbol{\sigma}$, the original signal can be recovered from the measurements by a neural network:
$\hat{\boldsymbol{\sigma}} = F_w(\mathbf{g})$   (3)
$\theta = \arg\min \|\boldsymbol{\sigma} - \hat{\boldsymbol{\sigma}}\|^2$   (4)
where $F(\cdot)$ is the simplified neural network function, $\hat{\boldsymbol{\sigma}}$ is the predicted value of the neural network output, $\theta$ collects the parameters to be optimized, and $w$ denotes the parameter weights; the gradient descent method is used to minimize the root mean square loss function to train the network. Given its ease of implementation, the matched filter approach is commonly applied in current MSACI with arbitrary field patterns. Note that when $\mathbf{H}$ is extremely underdetermined ($M \ll N$), the MF estimate of the under-sampled scene can be further refined with more sophisticated reconstruction algorithms. In the following section, the CNN-based MSACI principle and network model are introduced in detail.
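As a baseline for the comparisons later in the paper, the MF estimate $\hat{\boldsymbol{\sigma}}_{\mathrm{est}} = \mathbf{H}^{\dagger}\mathbf{g}$ can be sketched in a few lines; the unitary toy matrix is an illustrative assumption under which the MF inverts the measurement exactly (in the real ill-conditioned case it only yields an initial estimate).

```python
import numpy as np

def matched_filter(H, g):
    # MF estimate: sigma_hat = H^dagger g, with dagger the conjugate transpose.
    return H.conj().T @ g

# Toy check under an assumed orthogonal H, for which MF inverts the model exactly.
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.standard_normal((16, 16)))   # orthogonal stand-in for H
sigma = rng.random(16)
sigma_hat = matched_filter(Q, Q @ sigma)
print(np.allclose(sigma_hat, sigma))  # True
```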

3. CNN-Based Computational Imaging

3.1. Imaging System Architecture

Figure 1 shows the structure of a near-field MSACI system architecture built around a convolutional neural network, named MSACI-Net. The linear mapping network in this section adopts a fully connected layer containing $N$ neurons. When the measured values are used as the network input, the output is a $k_x \times k_y$ feature map, where $k_x$ and $k_y$ represent the numbers of horizontal and vertical near-field scan points. The collected measurements $\mathbf{g}$ and the MF estimates $\hat{\boldsymbol{\sigma}}_{\mathrm{est}}$ constitute the training dataset for the network.
The convolutional layers are connected after the fully connected layer. In our demonstration, the size of the feature maps generated by all layers is consistent with the number of voxels to be imaged. The first and third convolutional layers use a 5 × 5 convolution kernel to reduce the number of network parameters, with 30 and 60 channels, respectively. The second and last layers of MSACI-Net use a 3 × 3 convolution kernel, the last one generating the feature map that forms the output of the network, with 30 and 1 channels, respectively. Each convolutional layer performs a zero-padding operation to ensure that the size of the feature maps output by each layer remains the same.
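Under the assumption that the network is built in Keras (the framework named in Section 4), the layer sequence described above can be sketched as follows. The function name `build_msaci_net` and the exact argument choices are illustrative, not the authors' released code.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_msaci_net(m_measurements=400, kx=61, ky=61):
    # Fully connected linear mapping stage, then four 'same'-padded
    # convolutional layers (5x5/30, 3x3/30, 5x5/60, 3x3/1) as described above.
    inp = layers.Input(shape=(m_measurements,))
    x = layers.Dense(kx * ky, activation="relu")(inp)   # linear mapping network
    x = layers.Reshape((kx, ky, 1))(x)
    x = layers.Conv2D(30, 5, padding="same", activation="relu")(x)  # layer 1
    x = layers.Conv2D(30, 3, padding="same", activation="relu")(x)  # layer 2
    x = layers.Conv2D(60, 5, padding="same", activation="relu")(x)  # layer 3
    out = layers.Conv2D(1, 3, padding="same", activation="relu")(x) # layer 4
    return models.Model(inp, out)

model = build_msaci_net()
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3), loss="mse")
print(model.output_shape)  # (None, 61, 61, 1)
```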

3.2. Imaging Algorithm

The Adam algorithm is currently the most commonly used gradient optimization algorithm; it handles sparse gradients and stationary objectives well and, in particular, adaptively calculates learning rates for the various parameters. Based on the Adam algorithm and combined with our actual imaging problem, Algorithm 1 presents the optimized update process for our training data in MSACI-Net.
Algorithm 1 MSACI-Net training optimization algorithm
Input: $\mathbf{H}$: measurement matrix; $f(\cdot)$: stochastic objective function with parameters $\theta$; $N$ training pairs $\{(\mathbf{g}_i, \boldsymbol{\sigma}_i)\}_{i=1}^{N}$; iteration counter $t = 0$; exponential decay rates of the moment estimates $\rho_1 = 0.9$, $\rho_2 = 0.999$; numerical stability constant $\varepsilon = 10^{-8}$; step size $\delta = 10^{-3}$; initial parameter vector $\theta_0$.
1: Iterate via a gradient descent scheme:
2: Randomly initialize the weight parameters $w$ and bias parameters $b$; initialize the first moment vector $s \leftarrow 0$ and the second moment vector $r \leftarrow 0$;
3: while $\theta$ has not converged do
4:  Compute the gradient $g = \frac{1}{N} \nabla_{\theta} \sum_{i}^{N} L(f(\boldsymbol{\sigma}_i; \theta), \mathbf{g}_i)$; $t \leftarrow t + 1$;
5:  Update the biased moment estimates $s \leftarrow \rho_1 s + (1 - \rho_1) g$ and $r \leftarrow \rho_2 r + (1 - \rho_2) g^2$; apply the bias corrections $\hat{s} = s / (1 - \rho_1^t)$ and $\hat{r} = r / (1 - \rho_2^t)$;
6:  Update the parameters $\theta \leftarrow \theta - \delta\, \hat{s} / (\sqrt{\hat{r}} + \varepsilon)$;
7: end while
Output: resulting parameters $\theta$
The training process makes use of forward propagation and backpropagation to optimize the weights. The initial reconstruction result of the target serves as the input to forward propagation and is then passed through each convolutional layer. In forward propagation, the output of the $l$th layer is $g^l = f(u^l)$, where $f(\cdot)$ represents the activation function. All layers use ReLU as the activation function, which can be expressed as:
$\mathrm{ReLU}(u) = \begin{cases} u, & u \geq 0 \\ 0, & u < 0 \end{cases}$   (5)
This function takes 0 when the input is less than 0 and takes the input value when the input is greater than or equal to 0.
Backpropagation is the process of correcting the weight and bias coefficients of the convolutional neural network through the output objective function. The specific correction process is as follows. The predicted output of the convolutional neural network and the real target in the training set are denoted as $\hat{\boldsymbol{\sigma}}$ and $\boldsymbol{\sigma}$, respectively. The cost function is:
$E = \frac{1}{2}(\boldsymbol{\sigma} - \hat{\boldsymbol{\sigma}})^2$   (6)
The weight and bias coefficients in each layer are adjusted inversely according to the output error of each sample. This process calculates the partial derivative of the cost function with respect to each weight in the convolutional neural network. The sensitivity of a convolutional layer is defined as $\delta = \frac{\partial E}{\partial u}$. The sensitivity of a non-input $l$th layer is:
$\delta^l = (w^{l+1})^T \delta^{l+1} \odot f'(u^l)$   (7)
The sensitivity of the output layer L is:
$\delta^L = f'(u^L) \odot (\hat{\boldsymbol{\sigma}} - \boldsymbol{\sigma})$
where, given $\frac{\partial u}{\partial b} = 1$, we have $\frac{\partial E}{\partial b} = \frac{\partial E}{\partial u} \frac{\partial u}{\partial b} = \frac{\partial E}{\partial u} = \delta$. Therefore, the partial derivative with respect to the bias coefficient is equal to the sensitivity of the corresponding convolutional layer.
Similarly, the partial derivative of the weight coefficient of the l-th layer is
$\frac{\partial E}{\partial w^l} = \sigma^{l-1} (\delta^l)^T$
Finally, the weights are updated iteratively according to the Adam algorithm and Equations (5)–(7). The specific update methods are shown in (8) and (9).
$w^l_{\mathrm{new}} = w^l - \eta_1 \cdot \frac{\partial E}{\partial w^l} - \eta_2 w^l$   (8)
$b^l_{\mathrm{new}} = b^l - \eta_1 \cdot \frac{\partial E}{\partial b^l} - \eta_2 b^l$   (9)
where η 1 and η 2 are the gradient descent coefficient and the learning rate, respectively.
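The Adam iteration of Algorithm 1 can be sketched as a self-contained function, using the constants stated there ($\rho_1 = 0.9$, $\rho_2 = 0.999$, $\varepsilon = 10^{-8}$, $\delta = 10^{-3}$); the quadratic toy objective below is purely for demonstration and is not part of the paper.

```python
import numpy as np

def adam_step(theta, grad, s, r, t, delta=1e-3, rho1=0.9, rho2=0.999, eps=1e-8):
    # One Adam iteration with the constants from Algorithm 1.
    s = rho1 * s + (1 - rho1) * grad          # biased first-moment estimate
    r = rho2 * r + (1 - rho2) * grad**2       # biased second-moment estimate
    s_hat = s / (1 - rho1**t)                 # bias corrections
    r_hat = r / (1 - rho2**t)
    theta = theta - delta * s_hat / (np.sqrt(r_hat) + eps)
    return theta, s, r

# Toy demonstration: minimize f(theta) = theta^2, whose gradient is 2 * theta.
theta, s, r = np.array([1.0]), np.zeros(1), np.zeros(1)
for t in range(1, 2001):
    theta, s, r = adam_step(theta, 2 * theta, s, r, t)
print(abs(theta[0]) < 0.1)  # True: the iterate converges toward 0
```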
In addition, MSE and PSNR are selected to evaluate reconstruction accuracy and image quality. The MSE is computed as:
$\mathrm{MSE} = \frac{1}{m^2} \sum_{i=1}^{m} \sum_{j=1}^{m} (\sigma(i,j) - \hat{\sigma}(i,j))^2$   (10)
where $m \times m$ is the image size in pixels, and $\sigma(i,j)$ and $\hat{\sigma}(i,j)$ respectively represent the pixel values of the original image and the reconstructed image at $(i,j)$. PSNR conveys essentially the same information as MSE: the higher the PSNR value, the smaller the difference between the two images. In the subsequent dataset construction, we normalize the scene target images so that all scattering coefficients lie between 0 and 1. Therefore, the PSNR formula is given as:
$\mathrm{PSNR} = 10 \log_{10} \frac{1 \times 1}{\mathrm{MSE}}$   (11)
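Both metrics can be implemented directly for images normalized to $[0, 1]$ (peak value 1, as in Eq. (11)); this is a straightforward sketch, not the authors' evaluation code.

```python
import numpy as np

def mse(sigma, sigma_hat):
    # Per-pixel mean squared error over an m x m image, Eq. (10).
    return np.mean((sigma - sigma_hat) ** 2)

def psnr(sigma, sigma_hat):
    # PSNR for images normalized to [0, 1], i.e., peak value 1, Eq. (11).
    return 10.0 * np.log10(1.0 / mse(sigma, sigma_hat))

original = np.zeros((61, 61))
reconstructed = np.full((61, 61), 0.1)          # uniform error of 0.1
print(round(psnr(original, reconstructed), 1))  # 20.0 (MSE = 0.01)
```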
Furthermore, Algorithm 2 details the near-field computational imaging algorithm for the metasurface antenna based on the convolutional neural network, which principally incorporates the following steps. First, according to the given metasurface antenna system parameters and the classical image datasets MNIST and Fashion-MNIST, a certain number of training, test, and validation sets are generated. Then, the basic parameters of the convolutional neural network are set, such as the number of fully connected layers (three), the number of convolutional layers (four), the size of the convolution kernel in each layer, and the number of channels of each convolutional layer. An appropriate gradient optimization training algorithm is selected (Algorithm 1 in this paper), along with its learning rate, gradient descent coefficient, training period, and the number of samples participating in each training step. After completing these preparations, we utilize Algorithm 1 to train the network. The weight coefficients of the convolutional layers are continuously updated during forward propagation and backpropagation until the end of the training period. At that point, the loss function of the system tends to converge, and the trained network model is saved to facilitate subsequent testing with measured data.
Algorithm 2 MSACI-Net reconstruction algorithm
Input: training dataset $(\mathbf{g}_{\mathrm{tr}}, \boldsymbol{\sigma}_{\mathrm{tr}})$; test dataset $\mathbf{g}_t$; weight parameters $\theta$; maximum iteration number $T$;
1: Randomly initialize the weight parameters $W$ of all layers of the MSACI-Net network;
2: for $T$ iterations do
3:  Input batches $\mathbf{g}_{\mathrm{tr}}$ and $\boldsymbol{\sigma}_{\mathrm{tr}}$ of the training set to MSACI-Net as the input and output of the network, respectively.
4:  Use Equation (6) as the loss function, and use Algorithm 1 above to train the weights of the network.
5:  After $T$ cycles, preserve the trained network model.
6:  Input the echo data $\mathbf{g}_t$ of the test set, call the trained network model for testing, and retrieve the reconstructed target $\boldsymbol{\sigma}_t$.
7: end for
Output: the reconstructed result $\boldsymbol{\sigma}_t$.

4. Imaging Simulations with Measured Fields Data

4.1. Measured Fields Data

In this section, we verify the validity of the proposed MSACI-Net method with measured radiation field data. For the purpose of demonstrating near-field CI capability, a two-dimensional parallel-plate waveguide metasurface antenna with a waveguide-slot feeding mechanism is designed and fabricated. The frequency-diverse radiation fields (measurement matrix) are measured through near-field scanning, with the scanning plane 0.5 m away from the antenna platform. The measured measurement matrix is then used to perform image reconstruction experiments. To ensure that an ample backscattered signal is collected from all possible directions and at all frequencies, an open-ended waveguide (OEWG) probe is used as the receiving antenna, forming a panel-to-probe configuration. The panel size of the antenna is 250 × 250 mm², the dielectric constant is 3.66, and the loss tangent is 0.003. The thickness of the substrate between the copper ground plane and the conductive copper metamaterial holes is 0.5 mm, the upper conductor of the waveguide adopts 125 × 125 cELC metamaterial resonators, and the Q-factor of each resonator is between 50 and 60.
The system parameters of the antenna are shown in Table 1, and the imaging experiment is based on the simulated metasurface antenna radiation field pattern data and the imaging scene. The operating bandwidth of the antenna is 33∼37 GHz, the frequency sampling interval is 10 MHz, and the radiation pattern at each frequency point is sampled along the two-dimensional spherical coordinates of elevation and azimuth. The field of view (FOV) spans −60∼60° in elevation with a sampling interval of 2°, and −60∼60° in azimuth with a sampling interval of 2°, so the dimension of the original pattern matrix T is 400 × (61 × 61). For the waveguide-slot coupling feeding scheme, the return loss and radiation efficiency are shown in Figure 2 and Figure 3.
Original targets containing both sparse and extended targets are employed to qualitatively evaluate the imaging ability of the measurement matrix for the scene; images containing point scattering targets, with the same dimension as the measurement matrix, are used as the original images. It should be emphasized that the measurement matrix here is formed from the pattern data, i.e., the field radiated by the metamaterial aperture antenna, rather than from fields measured in the actual imaging space. Figure 4a–d show the metasurface antenna feeding mechanism and the radiation field scanning plane.

4.2. Data Preparation

The received echo vector in the imaging process can be regarded as a mapping of the scene reflectivity function. This mapping is nonlinear, given the complex scattering of electromagnetic waves. We aim to use a CNN in this deep learning imaging framework to iteratively learn the nonlinear mapping of this physical model. The training samples of the network are obtained directly from the original antenna pattern. Specifically, the original measurement values are used as the input, and the output is the target image of the scene. Compared with traditional methods, reconstructing the target image with a deep neural network can provide a more realistic and accurate representation of the physical model.
In the simulation experiment, we choose the classic datasets MNIST and FMNIST to build our target dataset. Both datasets contain 70,000 images, of which 60,000 form the training set and 10,000 form the test set. According to the needs of the experiment, we resize the original images in the dataset from 28 × 28 to 61 × 61 and regard each as an image composed of multiple scattering points with scattering coefficients taking random values between (0, 1), serving as the original target scene at the network output. The input is the echo measurement of the target scene obtained through the metasurface antenna pattern. The samples used in the experiment are formed through this process, with each sample containing a measurement vector and an original image. The datasets are respectively named MSACI-MNIST and MSACI-FMNIST, and they are split into a training set (70%), a validation set (20%), and a test set (10%). The mean square error (MSE) is used as the loss function, and the Adam optimization algorithm is used for optimization. The learning rate is 0.001, and the batch size is 128. The model is trained for 100 cycles. After each training cycle, validation data are randomly selected to generate a verification set to monitor network performance. This stage is very time-consuming. When the network training is completed, we save the network model and input the scene echo signal into the network to obtain the imaging result of the target in real time. Final predictions are evaluated on the test set.
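The sample-generation step described above can be sketched as follows. The nearest-neighbour resize is an illustrative stand-in (the paper states only that batch preprocessing was done in MATLAB, so the actual interpolation may differ), and the random `H` and `fake_mnist` arrays are placeholders for the measured pattern matrix and the real dataset.

```python
import numpy as np

def prepare_samples(images, H):
    # Resize 28x28 images to 61x61 (nearest-neighbour, illustrative only)
    # and project each target through the measurement matrix to form
    # (echo measurement, target image) training pairs per Eq. (2).
    idx = np.arange(61) * 28 // 61               # nearest source-pixel index
    pairs = []
    for img in images:
        target = img[np.ix_(idx, idx)].astype(np.float64) / 255.0
        g = H @ target.ravel()                   # echo measurement vector
        pairs.append((g, target))
    return pairs

rng = np.random.default_rng(1)
H = rng.standard_normal((400, 61 * 61)) / 61.0   # stand-in measurement matrix
fake_mnist = rng.integers(0, 256, size=(5, 28, 28))
samples = prepare_samples(fake_mnist, H)
print(len(samples), samples[0][0].shape)  # 5 (400,)
```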
The model is implemented in the Python programming environment using the Keras deep learning framework with the TensorFlow backend. The model is trained on a computer with an NVIDIA 3070 Ti GPU and CUDA version 11.0. The batch processing of the dataset is completed on the MATLAB platform. In addition, all reconstruction results are obtained on a computer equipped with an Intel(R) Core(TM) i7-10700 CPU. Without the GPU, training our imaging network model for 100 epochs on the constructed dataset takes 1.5 h; the same number of training epochs takes around 20 min with the GPU.

4.3. Numerical Tests

Our MSACI-Net algorithm based on the CNN framework is trained on the ordinary sparse dataset MSACI-MNIST and the extended-target dataset MSACI-FMNIST. After completing the training, we conduct numerical experiments to evaluate the performance of our algorithm. To assess its effectiveness, we run an experiment with different scene targets when the number of measurement modes is 400. The scene images come from the test sets of the MSACI-MNIST and MSACI-FMNIST datasets. We select 12 target images from the two test sets to conduct the experiments. The original sparsities of these 12 targets are 0.6312, 0.5954, 0.6721, 0.6856, 0.6423, 0.6229, 0.5117, 0.4996, 0.6463, 0.4786, 0.5116, and 0.4523, respectively. It should be emphasized that noise-free scenes are considered throughout the imaging simulation process to focus on the implementation of the proposed algorithm; the test results are shown in Figure 5. The experimental results show that our MSACI-Net can reconstruct not only ordinary sparse targets but also extended targets under the conditions of a fixed number of measurement modes and scene compression ratio. Moreover, the proposed algorithm obtains reconstructed images of high quality.
In our proposed MSACI-Net, the Adam algorithm is chosen to train the gradients, with the learning rate set to 1 × 10⁻³, which controls the weight update rate well and allows training to converge to better performance. The number of learning cycles is 100 epochs, considering that more training iterations make the network training more mature and ultimately yield better imaging performance.
To further evaluate the performance of the proposed algorithm, we set the experimental conditions under different compression ratios of 0.1, 0.05, and 0.025 separately. We also use the traditional MF method, an iterative sparse Bayesian learning (SBL) algorithm and the U-Net method [23] for the two sets of targets and conduct comparative experiments with the proposed algorithm under the same conditions. The experimental results of the two groups are shown in Figure 5 and Figure 6.
The experimental results show that our MSACI-Net can reconstruct the target image clearly when the scene information sampling ratio is 0.1. Our algorithm can effectively reconstruct the basic shape of the scene target as the scene information sampling ratio decreases, even at a sampling ratio of 0.025. In contrast, the MF and SBL algorithms no longer work properly when the scene information sampling ratio is 0.1: the reconstructed target image is unacceptable, and the original shape cannot be distinguished. In Figure 6 and Figure 7, although the U-Net reconstruction results show the approximate shape, they fail to capture the detailed features specific to the target. Moreover, the original target contour features cannot be precisely reconstructed in the SBL imaging results; the SBL algorithm is unable to operate at extremely low compression ratios, and its imaging outcomes show essentially no details of the original target. Overall, the proposed algorithm is more robust and effective than traditional algorithms under low scene compression ratios and highly coherent measurement modes. The quality of the reconstructed target image is also high, meeting the challenges to imaging algorithm performance posed by complex scene conditions.
We employ MSE and PSNR to quantitatively evaluate the accuracy of the imaging reconstruction and the quality of the imaging results. The MSEs of the different methods on MSACI-MNIST and MSACI-FMNIST under different information compression ratios are shown in Figure 8a and Figure 9a, and the corresponding PSNRs in Figure 8b and Figure 9b. As can be seen, the MSE of the proposed algorithm gradually decreases as the scene information sampling ratio increases, and is numerically lower than that of the MF and SBL algorithms. Compared with the commonly applied CS method, MSACI-Net shows preferable imaging capability, given that the end-to-end neural network can adaptively reduce the imaging error; its reconstructions show a significant PSNR improvement at the same scene information sampling ratio.
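The two metrics can be computed directly from the reference and reconstructed images. A small self-contained sketch; the normalization of images to [0, 1] (and hence the peak value of 1) is our assumption:

```python
import numpy as np

def mse(ref, rec):
    """Mean squared error between reference and reconstructed images."""
    return float(np.mean((np.asarray(ref) - np.asarray(rec)) ** 2))

def psnr(ref, rec, peak=1.0):
    """Peak signal-to-noise ratio in dB, assuming images scaled to [0, peak]."""
    e = mse(ref, rec)
    return float("inf") if e == 0 else 10.0 * np.log10(peak ** 2 / e)

# A uniform error of 0.1 on a [0, 1] image gives MSE = 0.01 and PSNR = 20 dB.
ref = np.ones((28, 28))
rec = ref - 0.1
```

Lower MSE and higher PSNR both indicate a reconstruction closer to the ground-truth scene.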
We also record the time each algorithm spends reconstructing the scene target. Since the convolutional-neural-network method is easy to parallelize during imaging reconstruction, we record the reconstruction time of MSACI-Net on both CPU and GPU. Table 2 shows the average running time over the 100-epoch test process for each algorithm on the different targets. The proposed algorithm clearly takes less time than the MF and SBL algorithms; it is therefore superior in imaging efficiency and can even achieve near-real-time imaging reconstruction.
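A runtime comparison like the one in Table 2 can be produced by averaging wall-clock time over repeated reconstruction runs. A minimal sketch; the stand-in reconstruction callable is hypothetical:

```python
import time

def average_runtime(reconstruct, n_runs=100):
    """Average wall-clock time of one reconstruction over n_runs repetitions."""
    start = time.perf_counter()
    for _ in range(n_runs):
        reconstruct()
    return (time.perf_counter() - start) / n_runs

# Example with a cheap stand-in for a single reconstruction step.
t_avg = average_runtime(lambda: sum(i * i for i in range(1000)), n_runs=10)
```

`time.perf_counter` is preferable to `time.time` here because it is monotonic and has the highest available resolution for short intervals.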

5. Discussion

Compared with the generally applied MF method, our MSACI-Net algorithm shows preferable imaging capability under noise-free conditions, given that the end-to-end neural network can adaptively reduce the imaging error. To further test the noise resistance and robustness of the proposed algorithm, we add additive noise when generating the dataset, with the scene information sampling ratio in the MSACI-MNIST and MSACI-FMNIST datasets fixed at 0.1. To study the impact of the noise level on network performance, we consider three noisy cases, namely SNR = 0 dB, SNR = 5 dB, and SNR = 10 dB, after 100 rounds of testing, and conduct qualitative and quantitative tests on the two datasets. The imaging reconstruction results are shown in Figure 10. They show that MSACI-Net can reconstruct not only ordinary sparse targets but also extended targets under different SNRs with a fixed scene compression ratio. Moreover, the proposed algorithm still obtains high-quality reconstructions and exhibits relatively strong noise resistance and good robustness at a scene information sampling ratio of 0.1.
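Additive white Gaussian noise at a prescribed SNR can be injected into the simulated echoes as follows. This is a hedged NumPy sketch; the complex-valued echo model and the seeding are our assumptions, not details taken from the paper:

```python
import numpy as np

def add_awgn(signal, snr_db, rng=None):
    """Add complex white Gaussian noise so the echo has the target SNR in dB."""
    if rng is None:
        rng = np.random.default_rng(0)
    p_signal = np.mean(np.abs(signal) ** 2)
    p_noise = p_signal / 10.0 ** (snr_db / 10.0)  # SNR = 10*log10(Ps/Pn)
    noise = np.sqrt(p_noise / 2.0) * (rng.standard_normal(signal.shape)
                                      + 1j * rng.standard_normal(signal.shape))
    return signal + noise

# Check the empirical SNR of a long noisy echo against the 10 dB target.
rng = np.random.default_rng(1)
g = rng.standard_normal(100000) + 1j * rng.standard_normal(100000)  # clean echo
g_noisy = add_awgn(g, snr_db=10.0)
snr_est = 10.0 * np.log10(np.mean(np.abs(g) ** 2)
                          / np.mean(np.abs(g_noisy - g) ** 2))
```

Sweeping `snr_db` over 0, 5, and 10 dB reproduces the three noisy cases used in the robustness tests.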
Quantitatively, the MSEs of the different methods under different SNRs are calculated, and the results are plotted in Figure 11a,b. They show that our network withstands testing with or without noise in the training data, with negligible impact on reconstruction accuracy. Overall, the proposed algorithm compensates for the shortcomings of near-field computational imaging under insufficient frontend antenna hardware design, showing excellent effectiveness and robustness and indicating the promising potential of the MSACI tool.

6. Conclusions

In this paper, a CNN-based method for near-field metamaterial-aperture CI is demonstrated. Compared with current MF and CS reconstruction techniques, our carefully trained deep network handles both sparse and complicated scene targets at relatively low scene information sampling ratios and SNR levels, narrowing the required operation frequency band and alleviating the design burden of the metasurface antenna frontend. The trained network weight parameters can readily be reused on radiation-field data obtained with other current types of metasurface antennas. Moreover, the proposed CNN-based reconstruction approach could facilitate longer-range imaging, since the randomness among radiation field patterns can be sacrificed to a certain extent to meet antenna radiation-efficiency requirements. In future work, we intend to improve the network performance, speed up the imaging rate, improve the imaging quality, and verify our method with training datasets and echo-signal experiments. We have also begun to extend the proposed method to far-field computational imaging and near-field three-dimensional imaging, and to verify the network's imaging performance on measured data.

Author Contributions

Conceptualization, Z.W.; methodology, F.Z.; software, F.Z.; validation, F.Z., M.Z. and S.H.; formal analysis, X.P.; investigation, Z.W. and F.Z.; resources, X.P. and L.Y.; data curation, M.Z. and S.H.; writing—original draft preparation, Z.W.; writing—review and editing, F.Z., M.Z., S.H., X.P., W.C. and L.Y.; visualization, Z.W.; supervision, L.Y. and W.C.; project administration, M.Z.; funding acquisition, L.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 62201007, U21A20457, and 62071003), the China Postdoctoral Science Foundation (No. 2020M681992), the Foundation of the Anhui Educational Committee (No. KJ2020A0026), and the Anhui Province University Collaborative Innovation Project (GXXT-2021-028).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MSACI: Metasurface Antenna Computational Imaging
CS: Compressed Sensing
SBL: Sparse Bayesian Learning
MF: Matched Filter
CNN: Convolutional Neural Network
PSNR: Peak Signal-to-Noise Ratio
MSE: Mean Squared Error
SNR: Signal-to-Noise Ratio
MSACI-MNIST: Metasurface Antenna Computational Imaging MNIST
MSACI-FMNIST: Metasurface Antenna Computational Imaging Fashion-MNIST
MSACI-Net: Metasurface Antenna Computational Imaging Network
ADAM: Adaptive Momentum Estimation

References

  1. Imani, M.F.; Gollub, J.N.; Yurduseven, O.; Diebold, A.V.; Boyarsky, M.; Fromenteze, T.; Pulido-Mancera, M.; Sleasman, T.; Smith, D.R. Review of Metasurface Antennas for Computational Microwave Imaging. IEEE Trans. Antennas Propag. 2020, 68, 1860–1875. [Google Scholar] [CrossRef] [Green Version]
  2. Wu, Z.; Zhang, L.; Liu, H.; Kou, N. Range Decoupling Algorithm for Accelerating Metamaterial Apertures-Based Computational Imaging. IEEE Sens. J. 2018, 18, 3619–3631. [Google Scholar] [CrossRef]
  3. Mait, J.; Euliss, G.; Athale, R. Computational imaging. Adv. Opt. Photon. 2018, 10, 409–483. [Google Scholar] [CrossRef]
  4. Hunt, J.; Driscoll, T.; Mrozack, A.; Lipworth, G.; Reynolds, M.; Brady, D.; Smith, D. Metamaterial apertures for computational imaging. Science 2013, 339, 310–313. [Google Scholar] [CrossRef] [PubMed]
  5. Fromentèze, T.; Yurduseven, O.; Del Hougne, P.; Smith, D.R. Lowering latency and processing burden in computational imaging through dimensionality reduction of the sensing matrix. Sci. Rep. 2021, 11, 3545. [Google Scholar] [CrossRef] [PubMed]
  6. Cheng, Q.; Alomainy, A.; Hao, Y. Near-field millimeter-wave phased array imaging with compressive sensing. IEEE Access 2017, 5, 18975–18986. [Google Scholar] [CrossRef]
  7. Luo, Z.; Cheng, Y.; Cao, K.; Qin, Y.; Wang, H. Microwave computational imaging in frequency domain with reprogrammable metasurface. J. Electron. Imaging 2018, 27, 063019. [Google Scholar] [CrossRef]
  8. Donoho, D. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
  9. Jin, K.; McCann, M.; Froustey, E.; Unser, M. Deep convolutional neural network for inverse problems in imaging. IEEE Trans. Image Process. 2017, 26, 4509–4522. [Google Scholar] [CrossRef] [Green Version]
  10. Sinha, A.; Lee, J.; Li, S.; Barbastathis, G. Lensless computational imaging through deep learning. Optica 2017, 4, 1117–1125. [Google Scholar] [CrossRef]
  11. Kamilov, U.S.; Papadopoulos, I.N.; Shoreh, M.H.; Goy, A.; Vonesch, C.; Unser, M.; Psaltis, D. Learning approach to optical tomography. Optica 2015, 2, 517–522. [Google Scholar] [CrossRef] [Green Version]
  12. Barbastathis, G.; Ozcan, A.; Situ, G. On the use of deep learning for computational imaging. Optica 2019, 6, 921–943. [Google Scholar] [CrossRef]
  13. Li, L.; Ruan, H.; Liu, C.; Li, Y.; Shuang, Y.; Alù, A.; Qiu, C.-W.; Cui, T.J. Machine-learning reprogrammable metasurface imager. Nat. Commun. 2019, 10, 1082. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Li, L.; Shuang, Y.; Ma, Q.; Li, H.; Zhao, H.; Wei, M.; Liu, C.; Hao, C.; Qiu, C.-W.; Cui, T.J. Intelligent metasurface imager and recognizer. Light Sci. Appl. 2019, 8, 97. [Google Scholar] [CrossRef] [Green Version]
  15. Gan, F.; Luo, C.; Liu, X.; Wang, H.; Peng, L. Fast Terahertz Coded-Aperture Imaging Based on Convolutional Neural Network. Appl. Sci. 2020, 10, 2661. [Google Scholar] [CrossRef] [Green Version]
  16. Gao, J.; Deng, B.; Qin, Y.; Wang, H.; Li, X. Enhanced Radar Imaging Using a Complex-Valued Convolutional Neural Network. IEEE Geosci. Remote Sens. Lett. 2018, 16, 35–39. [Google Scholar] [CrossRef] [Green Version]
  17. Marashdeh, Q.; Warsito, W.; Fan, L.; Teixeira, F. Nonlinear forward problem solution for electrical capacitance tomography using feed-forward neural network. IEEE Sens. J. 2006, 6, 441–449. [Google Scholar] [CrossRef] [Green Version]
  18. Li, L.; Wang, L.; Teixeira, F. Performance Analysis and Dynamic Evolution of Deep Convolutional Neural Network for Electromagnetic Inverse Scattering. IEEE Antennas Wirel. Propag. Lett. 2019, 18, 2259–2263. [Google Scholar] [CrossRef] [Green Version]
  19. Li, L.; Wang, L.; Teixeira, F.; Liu, C.; Nehorai, A.; Cui, T. DeepNIS: Deep Neural Network for Nonlinear Electromagnetic Inverse Scattering. IEEE Trans. Antennas Propag. 2019, 67, 1819–1825. [Google Scholar] [CrossRef] [Green Version]
  20. Yu, Y.; Rashidi, M.; Samali, B.; Mohammadi, M.; Nguyen, T.N.; Zhou, X. Crack detection of concrete structures using deep convolutional neural networks optimized by enhanced chicken swarm algorithm. Struct. Health Monit. 2022, 21, 2244–2263. [Google Scholar] [CrossRef]
  21. Yu, Y.; Wang, C.; Gu, X.; Li, J. A novel deep learning-based method for damage identification of smart building structures. Struct. Health Monit. 2019, 10, 142–149. [Google Scholar] [CrossRef] [Green Version]
  22. Cheng, Q.; Ihalage, A.; Liu, Y.; Hao, Y. Compressive Sensing Radar Imaging with Convolutional Neural Networks. IEEE Access 2020, 8, 212917–212926. [Google Scholar] [CrossRef]
  23. Luo, F.; Wang, J.; Zeng, J.; Zhang, L.; Zhang, B.; Xu, K.; Luo, X. Cascaded Complex U-Net Model to Solve Inverse Scattering Problems With Phaseless-Data in the Complex Domain. IEEE Trans. Antennas Propag. 2022, 70, 6160–6170. [Google Scholar] [CrossRef]
  24. del Hougne, P.; Imani, M.; Diebold, A.; Horstmeyer, R.; Smith, D. Learned Integrated Sensing Pipeline: Reconfigurable Metasurface Transceivers as Trainable Physical Layer in an Artificial Neural Network. Adv. Sci. 2019, 7, 1901913. [Google Scholar]
  25. Hyder, R.; Shah, V.; Hegde, C.; Asif, M. Alternating Phase Projected Gradient Descent with Generative Priors for Solving Compressive Phase Retrieval. In Proceedings of the ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 7705–7709. [Google Scholar]
  26. Shamshad, F.; Ahmed, A. Compressed Sensing-Based Robust Phase Retrieval via Deep Generative Priors. IEEE Sens. J. 2020, 21, 2286–2298. [Google Scholar] [CrossRef]
  27. Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
  28. Yurduseven, O.; Fromenteze, T.; Decroze, C.; Fusco, V. Frequency-Diverse Computational Automotive Radar Technique for Debris Detection. IEEE Sens. J. 2020, 20, 13167–13177. [Google Scholar] [CrossRef]
  29. Sharma, R.; Yurduseven, O.; Deka, B.; Fusco, V. Hardware Enabled Acceleration of Near-Field Coded Aperture Radar Physical Model for Millimetre-Wave Computational Imaging. Prog. Electromagn. Res. B 2021, 90, 91–108. [Google Scholar] [CrossRef]
  30. Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the 2015 International Conference on Learning Representations(ICLR 2015), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
  31. Xiao, H.; Rasul, K.; Vollgraf, R. Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv 2017, arXiv:1708.07747. [Google Scholar]
  32. Wu, Z.; Zhang, L.; Liu, H.; Kou, N. Enhancing Microwave Metamaterial Aperture Radar Imaging Performance with Rotation Synthesis. IEEE Sens. J. 2016, 16, 8035–8043. [Google Scholar] [CrossRef]
Figure 1. MSACI-Net system structure diagram.
Figure 2. Antenna S-parameters [magnitude in dB].
Figure 3. Antenna radiation efficiency [magnitude].
Figure 4. (a) Waveguide slot feeding mechanism; (b) antenna prototype; (c) radiation fields scanning plane; (d) measurement matrix characterization.
Figure 5. Reconstruction results from MSACI-Net with different scene targets.
Figure 6. MSACI-MNIST: Reconstruction results in four imaging algorithms with different scene information sampling ratios.
Figure 7. MSACI-FMNIST: Reconstruction results of four imaging algorithms with different scene information sampling ratios.
Figure 8. MSACI-MNIST: Imaging results with different scene information sampling ratio: (a) MSE performance; (b) PSNR performance.
Figure 9. MSACI-FMNIST: Imaging results with different scene information sampling ratio: (a) MSE performance; (b) PSNR performance.
Figure 10. Reconstruction results from MSACI-Net with different SNR.
Figure 11. Imaging results with different SNRs: (a) MSE performance; (b) PSNR performance.
Table 1. Main system parameters of metasurface antenna.
Parameters | Values
Operation bandwidth | 33∼37 GHz
Antenna panel size | 250 × 250 mm²
Number of resonance units | 125 × 125
Frequency sampling interval | 10 MHz
Field of view (Azimuth) | −60∼60°
Field of view (Elevation) | −60∼60°
Azimuth sampling interval | 2°
Elevation sampling interval | 2°
Dimensions of T | 400 × 3721
Table 2. Imaging runtime.
Methods | Values
MF | 2.1756 s
SBL | 3.5184 s
U-Net | 0.4014 s
MSACI-Net (CPU) | 0.2785 s
MSACI-Net (GPU) | 0.0584 s
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wu, Z.; Zhao, F.; Zhang, M.; Huan, S.; Pan, X.; Chen, W.; Yang, L. Fast Near-Field Frequency-Diverse Computational Imaging Based on End-to-End Deep-Learning Network. Sensors 2022, 22, 9771. https://doi.org/10.3390/s22249771
