1. Introduction
Photoacoustic imaging (PAI) is an emerging biomedical imaging method which integrates the principles of optical and ultrasonic imaging [
1]. It combines the high-resolution characteristic of optical imaging with the deep-penetration capability of ultrasonic imaging [
2]. PAI has garnered significant attention and become a research hotspot in the field of biomedical imaging. It has great application potential in many fields such as tumor detection [
3,
4,
5], vascular imaging [
6,
7,
8], skin disease analysis [
9], and drug delivery monitoring [
10]. The principles of PAI are based on the photoacoustic effect [
11]. When tissue absorbs light energy, the temperature of the tissues increases. This results in transient thermal expansion and the contraction of the tissues. The thermal expansion and contraction propagate outward in the form of ultrasonic waves, which are known as photoacoustic signals (PA signals) [
12]. Photoacoustic signals are detected by ultrasonic transducers and subsequently processed through various reconstruction algorithms, enabling the generation of images that reflect the spatial distribution of optical absorption [
13].
The delay and sum (DAS) algorithm is one of the most commonly used beamforming algorithms in photoacoustic image reconstruction [
14,
15]. Known for its simple principles and ease of implementation, DAS is frequently used in real-time imaging systems. However, its simplicity also results in a higher level of side lobes and more pronounced artifacts in the reconstructed images. In recent years, a great deal of research has been conducted to address these issues, leading to the development of several improved algorithms. These enhancements have shown promising results in reducing side lobes and artifacts, contributing to the ongoing evolution of PAI techniques. For instance, Matron and colleagues proposed the Delay Multiply and Sum (DMAS) algorithm, which improves the reconstructed image in terms of contrast and resolution [
16,
17]. Moein Mozaffarzadeh and colleagues introduced a coherence factor (CF), which integrates the coherence factor (CF) and modified coherence factor (MCF) into the traditional DAS beamforming technique, known as DAS-CF [
18,
19,
20]. This combined approach is applied in PAI to mitigate the impact of unwanted off-axis signals on beamforming. Subsequently, the team proposed an improved version of the DMAS algorithm, known as the double-stage DMAS algorithm, which enhances image resolution and side-lobe levels [
21,
22]. Compared to the DMAS algorithm, the side-lobe level is reduced by approximately 10 dB. Several other optimization algorithms, which are derived from the DMAS method, have also contributed to improving the quality of the reconstructed images to some extent [
23,
24]. There are also some other methods that have made significant contributions to improving the quality of the reconstructed images [
25]. Although the aforementioned algorithms have improved the imaging quality, their effectiveness in suppressing the aliasing issue between adjacent microstructures remains limited. However, these methods, like the DAS method, analyze the photoacoustic signal as a single-valued signal. In reality, the acoustic pressure signal is a short pulse lasting for a certain period, composed of both positive and negative amplitudes, forming an N-shaped signal. The duration of this N-shaped signal is related to the size of the source [
26]. For a particular pixel within the imaging area, the complete PA signal for that point over the entire imaging time can be obtained through multiple DAS calculations. This signal should also be an N-shaped signal. Furthermore, the N-shaped signals acquired from different pixel points exhibit variations in amplitude and in the timing of their peak values. Thus, the real source points in the imaging region and the pixels with artifacts can be distinguished.
Ma Xiang and colleagues proposed the Multiple Delay and Sum with Enveloping (multi-DASE) algorithm, which aims to suppress side lobes and artifacts [
27]. This method not only computes the initial acoustic pressure signal but also computes the complete signal for each point. The complete signal is then enveloped. By comparing the differences between the envelope signals, the method distinguishes between signal sources and artifacts. This method has shown strong performance in suppressing artifacts. However, when imaging a slightly larger signal source, the interior of the signal source is also suppressed, leading to a reconstructed image that exhibits missing internal structures, thereby causing significant distortion in the final reconstructed result. Furthermore, the larger the volume of the source, the more pronounced this phenomenon becomes.
By analyzing this problem, we find that the original N-shaped signal contains abrupt changes. Directly obtaining the envelope signal through the Hilbert transform results in overshoot. Overshoot makes it difficult to distinguish between some pixels inside the real source and the pixels in the artifact regions, resulting in the suppression of both. This is determined by the properties of the Hilbert transformation. When the signal contains an abrupt change, the Hilbert transform produces the Gibbs phenomenon. Specifically, the Hilbert transform involves convolution of the signal using a kernel function of , which is singular and particularly sensitive to abrupt points, resulting in overshoot. The simplest and most effective way to solve this problem is the smooth preprocessing of the original signal. We chose to use the window function to smooth the original signal. This paper builds upon the recovery of the complete signal, and a window function is multiplied before taking the envelope to weaken the abrupt changes in the N-shaped signal, thereby suppressing the overshoot in the envelope signal obtained via the Hilbert transform. The simulation of a simple circular source model and an irregularly shaped source model has verified that this method can effectively avoid distortions caused by overshooting when directly using envelope signals for imaging.
3. Results
We performed numerical simulations using the k-WAVE toolbox (version 1.4) to demonstrate the reconstruction results using the DAS algorithm and multi-DASE algorithm, respectively [
29]. The simulation was based on a homogeneous propagation medium with the sound speed set to 1500 m/s. The computing area is a square area with borders (21.6 × 21.6 mm
2) as shown in
Figure 6a. In total, 108 transducers are uniformly placed at the top of the area, and the circular sound source is placed in the middle of the area with a radius of 1.5 mm. Firstly, the DAS algorithm is used for reconstruction, and there are two obvious cambered artifacts in the reconstruction results, as shown in
Figure 6b. In order to better illustrate the superiority of the multi-DASE method, we also used the DMAS method to reconstruct the source, and the reconstruction result is shown in
Figure 6c. Although the artifacts have been effectively suppressed, it is evident that the reconstructed images exhibit discontinuities, and the reconstruction results still demonstrate a certain degree of shape distortion. When applying Equation (11) for reconstruction, the value of
k needs to be determined first. The suppression effect corresponding to the smaller
k is not obvious, while the larger
k may lead to excessive suppression, which results in unsatisfactory reconstruction results. To ensure the scientific validity of selecting the
k value, a quantitative criterion was established: when
, introducing
k should enable the suppression factor to reach at least
. Based on this criterion, and after conducting numerical calculations and experimental validations, the optimal parameter value of
k = 3 was ultimately determined. The reconstruction result is shown in
Figure 6d. From this result, it can be seen that, compared with the first two methods, artifacts are effectively suppressed, and the shape of the reconstructed result is well preserved. However, the internal structure of the source is also suppressed, leading to a darker appearance in the reconstructed region and causing a distinct “gap” to appear.
Since different k values correspond to different degrees of suppression, we attempt to use different values of
k for reconstruction. This was carried out to investigate whether the selection of k values affected the internal reconstruction results. To make the results more clear, we chose a circular sound source with a radius of 1.5 mm for verification.
Figure 7 shows reconstructed images using Equation (11) with different
k values, respectively. When k = 0, the reconstruction result of multi-DASE, compared to the reconstruction result of DAS, eliminates the negative components, resulting in more complete boundaries. However, in terms of image quality, there is no significant improvement. Since it does not suppress artifacts, the reconstruction results of multi-DASE still exhibit strong artifacts, as shown in
Figure 7a. As the value of
k increases, artifacts are increasingly suppressed, with larger
k values leading to the stronger suppression of artifacts. We observed that when the value of
k is relatively small, the problem of missing the internal structure of the reconstructed source also exists, as shown in
Figure 7b. As
k increases, the problem of the missing internal structure of the reconstructed source becomes more and more obvious, as shown in
Figure 7c,d. Through this experiment, it can be determined that the problem of the missing internal structure of the reconstructed source is not directly determined by the value of
k.
We restored the complete signal at the pixel point at the center of the circular sound source shown in
Figure 6, and calculated its envelope. The result is shown in
Figure 8b. The envelope signal observed is not a single peak but instead exhibits a bimodal distribution, which is what we refer to as signal overshoot. The envelope value at
t = 0 corresponds precisely to the “concavity” between the two peaks. For comparison, we also calculated the complete signal and envelope signal at the center of a circular sound source with a radius of 0.5 mm, as shown in
Figure 8a. It is evident that when the volume of the signal source is larger, the “concavity” between the two peaks in the envelope signal becomes more pronounced. If we directly suppress the envelope signal using Equation (10), the envelope value at
t = 0 will naturally be subject to suppression. The final result is that the source’s internal region is also suppressed. And, as mentioned earlier, the larger the signal source volume, the more pronounced the suppression of its internal region.
The recovered N-shaped PA signal for each pixel can be understood as returning the data received by transducer to the pixel with a time delay. In short, when the sensor receives the PA signal, it records data only at the moment the PA signal arrives at the sensor, and records 0 at all other times, so the signal indicates a sudden change at the moment the sensor starts receiving it and at the moment it stops. Conversely, when we want to obtain the complete signal of a certain pixel over the entire imaging time, the data returned by the sensor will also suddenly change at the moment the signal reaches that pixel point and at the moment it ends, and will be 0 at all other times. In this way, if we directly calculate the envelope of an N-shaped signal containing abrupt changes, it will cause overshoot in the envelope signal, manifesting as the two peaks we observe. This phenomenon is particularly evident when the source volume is large. Therefore, it needs to be smoothed before we calculate its envelope.
The amplitude of the Gaussian function will affect the final value. Considering that the purpose of adding the window function is to smooth the signal as much as possible without changing the amplitude of the envelope signal, in our experiment, the amplitude of the Gaussian function is set to 1 (a = 1). The standard deviation of the Gaussian function determines the width of the Gaussian curve. A wider Gaussian curve results in insufficient smoothing, while a narrower curve leads to over-smoothing. To achieve optimal smoothing, we aim to match the width of the Gaussian curve with that of the complete signal. In our experiment, as all calculations involve discrete data, the width of the Gaussian window can be adaptively adjusted based on the width of the obtained complete signal. For simplicity, we set the standard deviation to 1 (c = 1). Additionally, the improper selection of the position of the symmetry axis may cause the window function to fail to play a smoothing role. The symmetry axis is set to the middle position of the double peaks of the envelope signal with overshoot. Subsequently, we perform a computer simulation of a circular source with a radius of 1.5 mm.
Figure 9 shows the envelope signal obtained directly and the envelope signal obtained by multiplying the window function. It is obvious that, after the window processing, the envelope signal obtained by the Hilbert transform will not overshoot even if it is not a completely smooth signal. The ultimate purpose of obtaining the envelope signal is to determine a suitable inhibitory factor, so even if the amplitude of the envelope signal obtained after multiplied by the window function is greatly changed compared with the previous, it will not have much impact on the final imaging results. In order not to alter the amplitude, we still choose to use the data corresponding to the moment
of the original envelope signal as the initial imaging data. Then, the final output result of the method we propose is as follows:
the definition of the inhibitory factor is as follows:
where
refers to the envelope signal obtained after combining the window function, the calculation method and formula of
are the same as those in Equation (8). The value of
k is set at 3 based on the established criteria discussed earlier.
Therefore, when imaging again, the reconstruction result will not have the distortion problem caused by overshoot. To facilitate a visual comparison, we have integrated the reconstruction results obtained using four different methods into one figure, as shown in
Figure 10.
The DAS algorithm, which merely calculates the initial values of signals during the imaging process, leads to prominent artifacts in the reconstructed images. Additionally, there exists a certain degree of deviation in the shape of the reconstructed sources as shown in
Figure 10a. The results obtained by using the DMAS method for reconstruction have reduced artifacts, but the reconstructed images have obvious discontinuities and distortions in shape, which is shown in
Figure 10b. Compared with DAS, the multi-DASE algorithm shows significant improvement and effectively reduces the presence of artifacts. However, it has the problem of internal structure omission caused by the overshoot of the envelope signal, as shown in
Figure 10c. The method we proposed not only ensures that the artifacts are suppressed, but also retains the internal information of the source more comprehensively, as shown in
Figure 10d.
We calculated the Structural Similarity Index (SSIM) of the reconstructed images obtained using the DAS algorithm, the multi-DASE algorithm, and our proposed method to quantify the superiority of our proposed method. The SSIM has a good correlation with human subjective visual perception and can intuitively reflect the effectiveness of the reconstructed images through the data. Before calculating the SSIM, we need to normalize the reconstructed images obtained by different methods. The data are presented in
Table 1.
The table above presents the SSIM values calculated by comparing the reconstruction results from different methods with the initial sound pressure field. The DAS algorithm simply sums the data for imaging, and the reconstructed image has a lot of artifacts, and there are negative values in the reconstructed data, so its SSIM value is very low, it is just 0.0187. The DMAS method can suppress artifacts and the SSIM value has been improved to 0.5486. However, due to the problems of slice and shape distortion, there is still much room for improvement. Although the DASE algorithm suppresses the artifacts to a large extent, its source is also suppressed, so its SSIM value is improved compared with the former two methods, reaching 0.7518, but there is still a lot of room for improvement. The reconstructed image obtained by our method can suppress the artifacts and distinguish the source at the same time, so it is most similar to the original sound source field, and the SSIM index reaches 0.8660.
We extracted the lateral profile at y = 10.8 mm and the vertical profile at x = 10.8 mm, respectively, of the reconstructed image using the DASE algorithm and the method proposed in this paper.
Figure 11a,c show the lateral profile and vertical profile at the center of the circle source of the reconstruction imaging using multi-DASE, and they show that there is clearly inhibition within the source.
Figure 11b,d are the lateral profile and vertical profile at the center of the circle source of the reconstruction imaging using the method proposed in this paper, and show that there are no obvious faults in the reconstructed images processed by our method. The signal distribution within the sound source is more uniform, no abnormal suppression phenomenon occurs, the overall signal strength remains good, and there is no obvious signal loss. Although this section focuses solely on presenting the reconstruction results of a sound source with a radius of 1.5 mm using various methods, as previously discussed, our approach can adaptively adjust the Gaussian window size based on the sound source dimensions. So our method is applicable to sound sources of different sizes.
To verify the robustness of our method under different noise levels, we added random Gaussian white noise of different levels for simulation.
Figure 12 shows the reconstructed images at different noise levels.
Figure 12a shows the reconstruction results obtained using different methods when the SNR ratio is 34 dB. The noise level is relatively low, and the reconstruction result is not very different from that in the noise-free case.
Figure 12b shows the reconstruction results obtained using different methods when the SNR ratio is 20 dB. It can be seen that the reconstructed images obtained by the four methods are all affected by noise, resulting in a decrease in reconstruction quality.
Figure 12c shows the reconstructed images obtained using different methods when the signal-to-noise ratio is 17 dB. Due to the high noise level, the reconstruction results obtained by various methods have all been affected. To quantitatively compare the robustness of different methods in dealing with noise, we calculated their respective SSIM values for comparison, and the results are shown in
Table 2.
The data shown in
Table 2 can demonstrate that, under different noise levels, the quality of the reconstructed images obtained by our proposed method remains the highest. Additionally, as the noise level increases, the SSIM of the reconstruction results of the DMAS method decreases by 0.1548, that of the DASE method decreases by 0.1255, and that of our method decreases by 0.0442, verifying the robustness of our method. After a simple analysis, we concluded that the key characteristic of our proposed method lies in its ability to smooth the initial signal, which can eliminate the influence of noise to a certain extent and thus has high robustness.
We also conducted simulations for multiple sources. The distribution of multiple sources is shown in
Figure 13a. When there are multiple sound sources within the imaging area, the complete photoacoustic signal obtained through DASE is no longer a distinct N-shaped signal, as shown in
Figure 13b. Throughout the whole imaging period, the signal shape appears as an irregular and fluctuating one. Considering that the signals occurring at times far from
can be regarded as interference signals, we eliminate this part of the signal and only process the portion deemed as useful signals, as shown in
Figure 13c.
Figure 14 shows the comparison results of the reconstructions of multiple sources using different methods.
Figure 14a is the reconstruction result obtained by using the DAS method, which exhibits strong artifacts.
Figure 14b shows the reconstruction result obtained by the DMAS method. It can be seen that the artifacts in the image have been suppressed to a certain extent, but there are some relatively obvious layers in the reconstruction result, which makes the reconstruction quality not very high.
Figure 14c presents the reconstruction using the multi-DASE. Compared with the previous two methods, the artifacts have been well suppressed. However, at the same time, the internal structure of the source has also been suppressed, resulting in the loss of internal structural information.
Figure 14d is the reconstruction result using our proposed method. It can be observed that our method not only preserves the artifact suppression capability of the multi-DASE method but also addresses the issue of missing internal structural information in the reconstructed source, thereby substantially enhancing the quality of the reconstructed image. We also calculated the SSIM for the reconstruction results of the irregular source to quantify the differences between images reconstructed using different methods, as presented in
Table 3. The SSIM value for the DAS method is only 0.0106. In comparison, the DMAS method demonstrates a stronger ability to suppress artifacts, increasing the SSIM to 0.3424. Nevertheless, there remains significant room for improvement. The multi-DASE method exhibits superior performance in artifact suppression, with the SSIM significantly improving to 0.7881. Our proposed method not only preserves the advantages of the multi-DASE method but also addresses the issue of partial information loss observed in the multi-DASE method. Consequently, the SSIM for our proposed method reaches 0.9783, indicating that the reconstruction results achieved using our method represent the optimal outcomes currently available.
We also applied the method proposed in this paper to image point-like sources, and the results are shown in
Figure 15. In the same imaging area, four point sources are vertically distributed at x = 10.8 mm in the center of the imaging area. The vertical distance between the first point source from the top is 6.8 mm, and the distance between each subsequent point source is 3 mm. The results show that our method can not only deal with the distortion problem that may occur when imaging large-volume sources, but also, like many existing methods, can still exhibit excellent imaging performance when imaging small-volume sources.