1. Introduction
Ultrasound imaging has been widely used in clinical disease diagnosis and treatment evaluation due to its non-invasive, real-time, and cost-effective characteristics. Tele-ultrasound imaging is also an important part of telemedicine imaging. Telemedicine has evolved in the last few years to improve healthcare system not only for patient outcomes, but also for education purpose [
1]. The obstetric tele-ultrasound has been proved feasible with a bandwidth of 2 Mbit/s video link [
2]. Transmission of real-time ultrasound video to a remote iPhone with a WiFi connection and a 3G connection were both accomplished. However, the frame rate was reduced to 1.1 frames/s from 11.9 frames/s of the original transmitted signal [
3]. In order to obtain better image quality and higher frame rate in telemedicine imaging, the amount of data can be reduced, in addition to increasing the network bandwidth.
B-mode ultrasound imaging is currently the most widely used clinical ultrasound diagnostic methods. Traditional B-mode imaging has a frame rate of 30–40 frames/s, which can meet the requirements of observing the static physiological structure of tissues, such as imaging of the liver, kidney, and abdomen. With the development of more medical ultrasound technologies, such as shear wave elastography [
4], cardiac imaging, ultrasound localization microscopy (ULM) [
5], etc., higher requirements are placed on the frame rate of ultrasound imaging. Methods to augment the frame rate of ultrasound imaging include multi-line-transmit/receive method, synthetic aperture emission imaging and wide beam emission imaging [
6]. Wide beam imaging is a novel transmission method of ultrasound, which includes plane waves and divergent waves. A single plane wave transmission of ultrasound can cover the entire imaging area, and then the full aperture receives and records the echo signal in order to produce an ultrasound image [
7]. At present, plane wave imaging can obtain the entire image through a single full-aperture transmit-receive, which can increase the frame rate to more than 10,000 frames/s [
8]. It has been applied to shear wave elastography [
9], brain function imaging [
10], and ultrasound localization microscopy [
5].
Compared with the focused wave scanning transmission method, the plane wave transmission method greatly increases the frame rate. However, the signal-to-noise ratio, the image resolution and the contrast of the echo signal are deteriorated because the transmitting beam is not focused. The imaging results of a plane wave emitted once have the problems of poor resolution and low signal-to-noise ratio. In practical applications, multi-angle compound imaging methods are often used [
9]. The multi-angle compound imaging mode improves imaging quality with the increase of the angle, but the frame rate will linearly decrease in this way as well. In order to retain the advantages of plane wave imaging’s high frame rate as much as possible, different beamforming algorithms are applied to plane wave imaging to improve image quality without reducing the frame rate.
Currently, the commonly used beamforming methods are delay-and-sum (DAS) and adaptive beamforming algorithm based on minimum variance (MV) [
11]. The DAS algorithm is the most basic beamforming algorithm, which applies the fixed weights to the echo signal after delay processing to reduce the signal side lobes. However, the weight of the window function used in the traditional amplitude apodization process is usually a set of fixed parameters preset according to the depth. Therefore, the sidelobe attenuation is at a certain degree, and the width of the main lobe will also increase, that is, the resolution will not improve [
12]. Compared with DAS, the adaptive beamforming uses the signal received by the transducer to adaptively calculate the weighting value added to each element. Because this weighting value is dynamic, it has better resolution and anti-interference ability [
13]. However, the MV method of beamforming involves lots of matrix operation which augment the computation complexity, due to the calculation of correlation matrix. Roya et al. proposed a compressive sensing-based approach combined with the combination of the adaptive minimum variance (MV) algorithm to reduce computational burden [
14]. Deep learning is an emerging method for beamforming [
15]. Different neural networks are used to improve image quality or to augment imaging speed in beamforming [
16,
17]. However, this algorithm often requires a lot of data for training or has a high computational complexity, which is not suitable for high -frequency imaging in practical settings. Compared with algorithms based on MV, DAS requires a relatively small amount memory in the temporal domain because there is no matrix computation. Lu [
18] introduced the plane wave imaging process to the Fourier domain, which provided a new direction for beamforming. Compared with beamforming methods in the temporal domain, beamforming in the Fourier domain might augment computational efficiency by applying fast Fourier transform (FFT) [
19]. Lu [
18] introduced the mapping method to coordinate the echo signal from temporal spectrum to frequency-wavenumber space, which achieved to beamform in wavenumber-space. Bernard et al. [
20] proposed a novel remapping method related to the steered angle in reception. After that, Garcia et al. [
21] proposed a beamforming process in the Fourier domain based on the exploding reflector model (ERM), which is widely used in seismology [
22]. Compared with Lu’s method, this model assumes that the echo waves of the backscatters only propagate in the direction of the sensor, so that the principle of Stolt’s migration is different from the model proposed by Lu. This model is closer to the reality, thus improving imaging quality. Then Chen et al. [
23] modified the structure of the algorithm by aligning coordinates before the axial inverse fast Fourier transform (IFFT) and the lateral IFFT, which enables direct coherent compounding in the Fourier domain. Based on f-k migration, a circular statistics vector (CSV) was used as a weighted factor to achieve better image quality [
24].
High imaging frame rate will lead to the problem of large data load and memory storage. Moreover, a large amount of data also reduces the transfer speed which limits transmission speed in telemedicine imaging. Compressed Sensing (CS) is a new sampling theory, which allows the signal to be sampled under the Nyquist sampling rate [
25]. This theory is based on the signal’s sparsity, which means the original signal can be randomly sampled to obtain discrete samples. There is only a small amount of non-zero information in this discrete sample. The sample can be reconstructed to the original signal by a recovery algorithm with high quality. Compressed sensing has been widely used in data acquisition [
26], radar [
27] and the communication field. Wang et al. [
28] and Szasz et al. [
29] considered the transmitted ultrasonic signal as an impulse to form a sparse measurement matrix with a large number of zero-elements to reduce memory usage. Then, wavelet sparsity was introduced to the recovery algorithm to reconstruct the image. Chernyakova et al. [
30,
31] proposed a CS-based Fourier beamforming with a Xampling scheme for undersampling. Besson et al. [
32] firstly joint compressed sensing with beamforming in the wavenumber domain. Although compressed sensing allows the signal to be sampled at a sampling rate lower than the Nyquist sampling rate, the measurement matrix occupies lots of memory as well. Reducing transducer elements is an effective method in DAS to reduce the data load with a small measurement matrix [
33].
In this paper, we propose a new framework based on Stolt’s f-k and compressed sensing theory, which is a compressive beamformer in the wavenumber domain, to obtain a smaller amount of data load during signal transmission and better image quality in telemedicine. We introduce compressed sensing to the frequency-wavenumber domain, which reduce data load for transmission and memory storage, as well as maintaining the image quality and stability compared with that in the temporal domain.
This paper is stated as follows:
Section 2 introduced the modified Stolt’s f-k method and compressed sensing theory in the wavenumber domain at first. Then, beamforming in the frequency-wavenumber domain and compressed sensing are combined to assemble the novel framework: compressive beamformer in the wavenumber domain. In
Section 3, simulated data and experimental acquisitions are used in the new framework to compare with DAS and compressive beamformer in the temporal domain. In
Section 4, the influence of compounding angles on image qualities of different beamformers are discussed. Moreover, memory occupation, computational complexity and telemedicine imaging speed are discussed.
Section 5 provides a conclusion of the novel compressive wavenumber domain beamforming framework.
2. Materials and Methods
2.1. Plane Wave Stolt’s f-k Method
Stolt’s f-k beamforming method was first proposed based on the exploding reflector model (ERM), and was widely used in seismic imaging and other fields. Garcia et al. [
21] applied Stolt’s f-k method to ultrasonic plane wave imaging. In seismology, the ERM model assumes that all sound sources in the underground imaging area generate sound waves at the same time, and the sound waves only propagate upward. In order to apply the Stolt’s f-k method to ultrasonic plane wave imaging, Garcia et al. [
21] first demonstrated that the ERM model can be transformed into plane waves at different angles. Under the premise of the same echo signal, if the real scatterer position in the plane wave is obtained, the corresponding virtual scatterer position in the ERM model can be calculated according to the conversion method. On this basis, Garcia et al. [
21] used Stolt’s f-k method to process plane wave signals at various angles, and obtained the virtual scatterer position under the ERM model. Then the real scatterer position can be calculated by the migration principle. Coherent compound imaging can be implemented by correcting delay and aligning spectrum’s coordinates of different angles.
Shown as
Figure 1, for the plane wave emitted with an angle of
, the longitudinal position of the inclined echo signal due to the emission angle will be distorted. Before using the Stolt’s f-k method, the position of received RF data should be corrected at first. We assume that
is the position of a scatterer, then the travel time of echo signal is:
The signal travels from emitter to scatterer and back to the receiver, depending on the transducer position
. To be compatible with the ERM model, the virtual exploding source is positioned at
with a propagation speed of
.
is a one-way speed from scatterer to the receiver. Then the following ERM travel time is:
By equalizing
and
, the relationships of the sound speed and coordinates between real situation and that in ERM model are obtained by the coefficients as below:
where
are defined as follows:
In the equation,
relates the virtual propagation speed
and real speed
.
represents the scaling ratio between the virtual axial coordinate
and the real axial depth
.
refers to the shift operation between virtual lateral coordinate
and real lateral position
. The relationship between virtual arguments and actual arguments are shown as follow:
Assume that a plane wave is emitted at the angle of
, and the RF data of the echo signal is
. If the plane wave is backscattered in receiving, the signal received at the probe whose coordinate is
can be present as
. Fourier transform is applied to
over time
to obtain the spectrum
, where
is the temporal frequency. Because of the time delay
in the temporal domain, the Fourier transform should be followed by a phase shift
in the frequency domain, as follows:
Then the phase-shifted spectrum is Fourier transformed in the lateral dimension
where
refers to the lateral spatial wavenumber on the transducer surface. Chen et al. [
23] modified the process of beamforming by applying the phase shift once before lateral FFT, instead of applying another phase shift after migration.
The core of the spectrum migration is to estimate initial spectrum from the obtained spectrum as the boundary condition. is the Fourier transform over actual coordinate . This procedure can be divided into two steps. The first is mapping to , where refers to Fourier transform over virtual source . The second step is to shift to .
To map
to
, Garcia et al. [
21] provided the solution as
where
is the axial spatial wavenumber perpendicular to the transducer
and
are related to
by
Then
should be transformed to
for Fourier transform of the real coordinates.
is the Fourier transform over actual coordinates. Then the final image can be obtained by applying 2-D Fourier inverse transform over
is the final image data after beamforming. This procedure augments computational speed because it replaces time-delay calculation with FFT in the Fourier domain. This model assumes that the reflected sound wave of the scatterers only propagates in the direction of the sensor, which makes the principle of spatial spectrum coordinate migration different from the model proposed by Lu [
18]. In reflective ultrasound equipment, the performance of the scatterers as a reflection model is more suitable to the actual situation than the pure scattering model, so this method improves the imaging quality.
2.2. Proposed Compressed Sensing Process in the Wavenumber Domain
The transducer with
elements emits a plane wave signal, which equals the number of lateral numbers. The raw data of echo signal is
in the temporal domain. The echo signal is transferred to
in the Fourier domain by 2D-dimensioan FFT. The ultrasonic echo signal in the wavenumber domain can express sparsity by a transfer matrix based on directivity vector. To avoid large transfer matrix, we compress the data load by
. According to compressed sensing theory, the transfer matrix can be established by the number of transducer elements and size of the transfer matrix. To ensure that the projection vector is sparse, which means there are only few elements non-zero, the number of the rows of the transfer matrix should be less than columns. The projection coefficient vector is obtained by projecting the echo signal on the transfer matrix. Then the echo signal of all transducer elements can be expressed as follows
where
is the transfer matrix.
is the projection coefficient vector where echo signal
project on transfer matrix
. There are only few non-zero elements in the vector
, namely
is sparse. The transfer matrix
can be constructed as follows to make the echo signal in the wavenumber domain
sparse.
Divide the space
into
parts on average to obtain
and establish directivity vector
as follow
where
is the kerf between elements of the transducer.
is wavelength of the plane wave. Directivity vector can be regarded as the sparse base to construct transfer matrix. Then the transfer matrix is
In order to compress signal
, we would like to obtain
by constructing measurement matrix and re-weight minimum-focal underdetermined system solver algorithm (RM-FOCUSS) [
34]. Compressed sampling is not a direct measurement of
, but a design of a
dimensional sampling matrix
, which is uncorrelated to the transfer matrix
. The projection vector
of
on
is
Sampling matrix represents the compressed sampling method of the echo signal, which is composed of sampling bases. Each sampling base is an -dimension vector, such as . The -th row means that the output of all the transducer elements is projected onto the sampling base, corresponding to a compressed sampling point. The sampling matrix has rows in total, which means that only compressed sampling points are needed. transducer elements can be selected for sampling from the transducer elements of the original array.
The matrix
in [
15] is an
matrix, which is called the measurement matrix. The measurement matrix is used to sample the observations so that the original signal can be reconstructed. Theoretical studies [
35] have shown that when the measurement matrix
satisfies the restricted isometry property (RIP) condition, the projection coefficient vector
can be solved, and the original signal
can be reconstructed accurately by the compressed sampling vector
. This property ensures that the original space has a one-to-one mapping relationship to the sparse space, which requires that the sampling matrix randomly selected from the observation matrix must be non-singular. The measurement matrix can measure the signal to obtain the measurement vector, and then use the reconstruction algorithm to reconstruct the full signal from the measured value. When designing the measurement matrix, it is required that the measured value will not affect the information of the original signal during the sparse expression of the signal, so as to ensure that the signal can be accurately reconstructed. The theory proves that the Gaussian random sampling matrix and any fixed transformation matrix can make
satisfy the RIP condition with a high probability [
36], and the Gaussian random matrix can be easily obtained, so this paper adopts the Gaussian random measurement matrix.
The process to design a sampling matrix
is as
Figure 2 [
33]:
Define
as the output of the compressed sampling array elements. The compressed sampling vector is
After obtaining the compressed sample vector
, the re-weight minimum-focal underdetermined system solver algorithm (RM-FOCUSS) [
34] is used to estimate the projection coefficient vector
. Then the original signal
can be reconstructed by [
12].
For
plane wave emitted from transducer, the cost function
is established as follow in order to obtain the sparse signal
when
is closer to 0, the
expressed by
is more sparse. The Lagrange operator is defined as
where
is Lagrange operator vector,
.
can be obtained by minimum Lagrange operator
. The solution of RM-FOCUSS algorithm was proposed in [
34] as follow
where
is related to the sparsity of echo signal.
is a parameter reflecting the noise. We could eliminate part of the noise by adjusting
. After the RM-FOCUSS algorithm, we obtain the sparse signal
. This sparse signal is the compressed solution after compressed sensing because of lots of zero elements. Finally, apply [
12] to obtain the full-array signal. This signal is no longer the original signal but with some amelioration and modification, whether all the information can be recovered and whether the noise can be eliminated depending on the performance of this compressed sensing process.
The beamforming based on compressed sensing can become a method of sparsity by reducing the number of transducer elements without reducing the image performance. It uses the sparseness of the ultrasonic signal to randomly extract part of the signals from the transducer, restores the original signal with a restoration algorithm, and then performs beamforming. In beamforming based on the compressed sensing, the transfer matrix, the sampling matrix and the sparsity solution algorithm all influence whether the reconstructed signal can retain all the image information.
2.3. Wavenumber Domain Compressive Beamforming
To improve the compressed rate without sacrificing image quality, we propose the framework to combine compressed sensing with beamforming in the frequency-wavenumber domain. According to the characteristic of ultrasonic signal, the energy of the echo signal concentrates on low frequency. Assume that
represents the
-th line of the RF data of echo signal. To transform the echo signal to the frequency-wavenumber domain, the spectrum can be expressed as follows
After transforming the echo signal into the frequency-wavenumber domain, the spectrum of signal parallel to the transducer direction become symmetric because of the characteristic of FFT. We choose the 32nd sampling line and the 10th line of the echo signal parallel to the transducer as the example, and the spectrum is shown in
Figure 3. We can see that the information of the signal is repeated twice, when the length of the signal in the wavenumber domain is the same as that in the temporal domain. Thus, we can select the first half of the data in the sampling direction for beamforming, which make the amount of data be reduced to 50%. Moreover, the signal in the frequency-wavenumber domain becomes more centralized with a rapid decay. With these two features of the signal in the wavenumber domain, compressing echo signal in the wavenumber domain will lose less information than that in the temporal domain. Thus, for the same compressing rate, beamforming with compressed sensing in the wavenumber domain will obtain better image quality.
In order to improve the image quality further, the multi-angle compound imaging method is also suitable for this framework. In this way, we introduce the general case of the algorithm, in which the plane waves are emitted at different angles. The process of this frequency-wavenumber compressive beamforming method can be summarized as follows:
Apply 2-dimensional FFT to the RF raw data to transform the echo signal into the frequency-wavenumber domain as .
Select the first half of the data in the sampling direction for beamforming.
For plane waves emitted at different angles, the emission delay caused by angles should be removed by [
1] at first.
Compress the echo signal in the Fourier domain by the sampling matrix
and obtain the sparse solution
by RM-FOCUSS algorithm [
19]. The ameliorative signal can be obtained by [
12].
The Stolt’s migration described by the procedure from [
8,
9,
10,
11] is applied with the ERM velocity
as defined in [
5].
Restore the data in sampling direction before inverse FFT is applied to the data after migration to transform the echo signal back to the temporal domain for imaging.
This new framework is proposed to reduce data load and memory by applying the compressive sensing on the lateral wavenumber, as well as to guarantee the quality and stability of the imaging signal after compressed sensing. The pseudo-code for tilted plane wave is shown in
Figure 4. The compression rate in this frame work is discussed in
Section 3 by applying this algorithm to both simulated data and in vivo experiments. Part of the information of the original echo signal will usually be lost after compressing, so the sampling matrix and the reconstruction algorithm have higher requirements to reconstruct high quality image. Therefore, the algorithm and the reconstruction algorithm can be further explored in these two aspects to find a better compressing method.
2.4. Simulation Settings
Echo data were simulated by Filed II to evaluate the frame work we proposed based on the quality of the images it generates, and to compare it with DAS method and compressive beamformer in the temporal domain. Lateral resolution and contrast were the metrics to assess the image quality in this part. The metrics could be better calculated and analyzed in this part, because noise and other experimental interference could be ignored easily with the simulated data.
The numerical phantom was generated by Field II, including backscattering points and circular regions to calculate the lateral resolution and contrast. The two circular hypoechoic areas with the diameters of 5 mm and 6 mm in the phantom represented the anechoic cyst, and the two high-brightness circular areas with the same diameter simulated high-density parts in ultrasound imaging. There was a column of backscattering points arranged longitudinally and a row of scatterers arranged horizontally in the phantom. The longitudinal scattering points were used to analyze the lateral resolution of different depths.
The specifications of the research linear array transducer manufactured by Verasonics (L11-4v, Verasonics Inc., Redmond, WA, USA) were used to simulate the echo data. This is a linear array transducer with 128 elements, whose central frequency is 6.25 MHz. The pitch of transducer L11-4 is 0.3 mm and the width of each element is 0.27 mm. The transmitted signal was sampled at a rate of 50 MHz. There were 6066 samples recorded for each scanline and the image depth was 92.4 mm. The data processing was conducted using a MATLAB program (Mathworks Inc., Natick, MA, USA).
2.5. In Vivo Experiment
Organs have complicated structures and various influences on echo signal. We applied different beamformers on data obtained from the brain scan and the heart scan of the rat to analyze the performance of different beamformers. The process of the transmit and receive (T/R) for radio frequency (RF) signals was conducted using a Vantage 64 LE system (Verasonics Inc., Redmond, WA, USA), and the data processing was conducted using a MATLAB program (Mathworks Inc., Natick, MA, USA). We demonstrated the beamformers experimentally on RF data acquired linear array transducer called Vermon (Vermon Inc., Tours, France) with 128 elements. The pitch of Vermon is 0.11 mm. The central frequency is 18.8 MHz and the sampling frequency is 62.5 MHz. Vermon is a transducer specially designed for animals. The in vivo rat experiment was performed according to the Suzhou Institute of Biomedical Engineering and Technology (Chinese Academy of Sciences) Institutional Animal Ethics Committee (SibetAEC) protocol (2021-B22). We used female rats of SPF degree which were cultivated by 8 weeks.