1. Introduction
Photoacoustic imaging (PAI) is a fast-developing, non-invasive biomedical optics imaging technique [
1]. As a hybrid imaging method, PAI processes images using short-pulsed laser-induced ultrasound for detection, uniquely combining the rich optical contrast and low acoustic scattering in soft tissue. PAI breaks the optical diffusion limit of pure optical imaging and can provide a high resolution to an imaging depth ratio of about 1:200 within an imaging depth of several centimeters.
Because the near-infrared (NIR) laser is primarily employed in PAI, and mainly absorbed by the hemoglobin in the blood vessels, PAI features the high contrast label-free imaging of blood vessels, which dominates the photoacoustic signal in the biological tissues [
2]. Notably, as intensive irregular angiogenesis is a typical mark of cancer [
3], the photoacoustic intensity in the cancerous regions can be several times higher than the normal tissues, and PAI is thus one of the most promising methods for cancer detection [
4,
5]. So far, PAI has been widely employed in the pre-clinical or clinical diagnosis of various cancers, such as breast cancer, cervical cancer, ovarian cancer, prostatic cancer, and colorectal cancer [
6].
In earlier studies, PAI calculated the mean optical absorption to identify tumors. However, this method is easily influenced by inhomogeneous optical illumination [
7,
8]. As PAI develops, different photoacoustic microscopy (PAM) implements are being invented to provide high-resolution images of blood vessel networks for tumor detection. Prehensive parameters of the irregular angiogenesis can be obtained from these images, such as the density, mean length, mean curvature, and mean diameter of the blood vessels, which helps in the accurate diagnosis of cancers.
PAM can be generally categorized into optical-resolution-based PAM (OR-PAM) and acoustic-resolution-based PAM (AR-APM) [
9]. Although OR-APM has a much higher lateral resolution, it can only penetrate about 1 mm into the biological tissue. Comparatively, AR-PAM is more suitable for deep tumor and blood vessel imaging. To increase the lateral resolution of AR-PAM, there is a trend to employ focused ultrasound transducers with high central frequency and numerical aperture [
10]. In conventional B-mode reconstruction methods of AR-PAM, the optimal lateral resolution is only achieved within the focal region of the transducer. Out of this region, the lateral resolution deteriorates quickly, and the image quality degrades as a consequence [
11]. To restore the lateral resolution that is out of focus in AR-PAM, various back-projection (BP)-based reconstruction methods have been proposed, significantly improving the image quality of blood vessels [
12,
13,
14,
15,
16,
17,
18].
Nevertheless, these different implementations of BP give varied image effects, yet little effort has been made to compare these methods. This study aims to set up a framework for generating photoacoustic signals of complicated objects in AR-APM. Then we compare the reconstructed images of complex blood vessel networks by different BP methods and investigate the influence of the scanning parameters and the noise level. With these efforts, this work may guide the design and image reconstruction of AR-PAM for cancer detection.
2. Methods
2.1. The Generation of Photoacoustic Data
In AR-PAM, it is assumed that the pulsed laser gives homogenous illumination to the biological tissue, and the optical attenuation is not generally considered in the image reconstruction. A focused transducer with high central frequency and high NA scans over the surface of the biological tissue, which detects the ultrasound pulses generated by the thermal expansion due to the absorption of the pulsed laser, as seen in
Figure 1.
This assumes a focused transducer scanned over
N positions, with a step size of Δ
x. The detected photoacoustic pressure at an arbitrary point can be expressed with the following Fresnel integral over the transducer surface
S [
19]:
Here Γ is the Grüneisen coefficient, a dimensionless constant representing heat conversion efficiency to pressure. The media is assumed to be acoustically homogenous with an acoustic velocity of
v. We take that the optical absorption distribution
H(r’
) in the imaging domain Ω can be expressed as a total of
NPs evenly distributed point sources. The transducer surface is also described as a total of
NPd evenly distributed point detectors. Then, considering the transducer’s acoustic–electrical response function
he(t), the integral in Equation (1) is changed to the following convolution function for calculating the received photoacoustic signal of the transducer
S(t):
Here hpf(t) is the system’s point transfer function, Ri donates the i-th point detector’s position, rj donates the j-th point source’s position, and A(rj) donates the intensity of the point source. Here the constants are omitted, and the finite nature of the function he(t) (both values and derivations of this function at infinity are zeros) is utilized to calculate the convolution. We only need the system’s point transfer function hpf (t) to calculate the acoustic data if we have the point sources’ positions and intensities. In our simulations, a Gaussian-enveloped cosine function was assumed for this point transfer function, but it can also be experimentally measured with a point source.
2.2. The GPU Parallel Scheme of Data Generation
Because the number of
NPd is about several hundred, and
NPs can be easily more than one million, the calculation cost of Equation (2) can be prohibitively expensive for CPUs. Thus, we parallelized the summation with GPU to speed up the data generation in a Matlab environment [
20] in which a mexCuda function is called to calculate each A-line.
As seen in the flow chart in
Figure 2a, after the parameter definition and initialization, input matrixes such as the transducer’s shape and position, target point positions, and the system’s point transfer function were changed into gpuArray, and stored in the device memory in the GPU card, as seen in
Figure 2b. The mexCuda function (indicated with gray dotted lines) is programmed in C language, which defines interfaces between the C language and the Matlab language and calculates Equation (2). The mexCuda function is divided into several kernels (indicated with black dotted lines) for the A-line calculation, and each kernel deals with the parallel calculation of several thousand point targets—this kind of design helps to avoid possible kernel computation timeouts.
Each block calculates one point target of the blood vessel network in each kernel, which is calculated in the stream multiprocessor (SM). Each thread in the block deals with each point detector of the transducer, calculated in the stream processor (SP). In each block, the first thread judge whether the transducer can receive the point target. Then the threads copy the system’s point transfer function into the shared memory, calculate the distance from the point target to the point detectors, and compute the corresponding time-delay summing. The summing results in each block were stored in the shared memory. Using the shared memory can significantly improve the data transfer bandwidth and helps to improve the calculation speed. The temporary results for each point target are summed back to the A-line in the device memory, which is finally gathered by the Matlab program. Results show that the mexCuda-based forward data generation is about 300 to 500 times faster than the Matlab-based version.
2.3. The BP-Based Reconstruction Methods
The conventional B-mode photoacoustic image reconstruction is performed by simply back-aligning the acquired A-lines according to the flight time of ultrasound signals so that the resulting pixel value at position (
x,
y) is taken as the signal intensity of the A-line collected at
x at time
y/v [
1,
12]:
Here x donates the lateral position, and y donates the depth position, which is usually set relevant to the center of the detector surface. However, the lateral resolution in the out-of-focus regions is significantly reduced because it assumes that the acoustic beam travels along a thin line, only valid in the transducer’s focal zone.
Therefore, many BP-based algorithms have been proposed which account for the transducer’s detector geometry in a simple, robust, and quick way to restore the elongated lateral resolution. The synthetic-aperture-focusing-technique (SAFT) method assumes the focus of the transducer as a virtual point detector (VPD), then applies time delay relative to the position of VPDs in a circle, and sums the delayed signals with the transducer’s spatial impulse response (SIR) for intensity compensation, which is [
12,
16,
21]:
Here the time delay Δ
t and the
SIR are functions of the relative lateral position of the summed A-line
x−
xi and the depth position of the pixel
y. The synthesized aperture size of the circle is equivalent to the NA of the transducer. This Equation can be multiplied with a coherent factor (CF) to further improve the resolution by suppressing the side lobes, which is the SAFT-CF method [
13,
16,
21]:
Here the
N is the number of synthesized A-lines, and the range of the CF weighting factor is from 0 to 1. It is noted that the summation in SAFT is performed within the NA of the transducer. To enable the summing out of the NA for code implementation simplicity, many other algorithms have been proposed recently, and the spatial–temporal response (STR)-based back-projection method (STR-BP) is one such method, which is [
19]:
where
N is the total number of transducer positions and Δ
tSTR and
wSTR are the time delay and weighting coefficient acquired with the STR of the transducer. The improved back-projection (IBP) method is another commonly used method that accounts for the geometry of the transducer detection surface, which can also be performed out of the NA of the transducer [
17]. It assumes that the transducer detection surface is composed of a collection of point detectors and is given as:
Here the time delay Δt is calculated as the flight time from the pixel to the point detector. The signal from the whole transducer approximates the signal of all the point detectors.
2.4. Experiment Design
This research performed numerical simulations for point targets and complicated blood vessel networks. A focused transducer with 20 MHz central frequency and 80% bandwidth was employed in these simulations. The aperture size of the transducer was 6 mm, its focal length was 10 mm, and the focus was set at a depth of 6.5 mm. There were 301 transducer positions evenly distributed within a scanning range of 15 mm in the lateral direction. The origin of the coordinates was set at the center of the top boundary of the image domain, and the transducer detection surface was 3.5 mm above the image domain. The above are the default simulation parameters unless otherwise noted. The simulated data was applied with Hilbert transform, and the module of the complex pixel value was taken as the reconstruction results. Photoacoustic images reconstructed with the conventional B-mode method, the SAFT method, the SAFT-CF method, the STR-BP method, and the IBP method are compared. A total of 11 points were evenly distributed along the depth direction in the point target simulations, with a 1 mm interval. For resolution comparison, we take the full width at half-maximum (FWHM) of the lateral profile of the point target as its lateral resolution. For the stability test, an ensemble-based method was applied to calculate the signal-to-noise ratio (SNR) at the point target positions for each algorithm, with 5% Gaussian white noise added [
22]. A complex digital blood vessel network was employed in the blood vessel network imaging. We studied the setting parameter of point density required in the simulation and compared the reconstructed images with these algorithms. We compared the results reconstructed with different numbers of A-lines. In the case of a small number of A-lines, we interpolated the data for more input numbers of A-lines and compared the results before the data interpolation. We also compared the reconstruction results under different noise levels to see the improvement in peak signal-to-noise ratio (PSNR) using the STR-BP method compared with the conventional B-mode method.
4. Discussion
High-quality vascular imaging is crucial to the accurate diagnosis of cancers by AR-PAM. Compared with in vivo experiments, numerical simulation provides a low-cost and convenient means to characterize AR-PAM performance for vascular imaging. Through the in-depth simulation studies, we can find ways to optimize the scanning and image reconstruction of AR-PAM. On the other hand, there are debates about applying the SAFT-based methods in AR-PAM. Some researchers indicated that the SAFT-based techniques are only suitable for point-target imaging [
14,
18]. Some researchers think that although the CF coefficients can further improve the lateral resolution, they also induce noise amplification [
18]. Some researchers have declared that the imaging results with these SAFT-based methods for targets near the transducer focus are not as good as those reconstructed by the conventional B-mode method [
15,
16]. These remaining issues require intensive and systematic simulation studies.
In this work, we set up a framework for generating photoacoustic signals of complicated objects and explored the potential and adaptability of different variations of BP-based methods in imaging complex vascular networks. In the point target simulation, we found that various BP-based advantages in lateral resolution, SNR, and signal intensity consistency. For example, the SAFT-CF method has the highest lateral resolution (about 0.1 mm, compared with about 0.15 mm for the other three BP-based methods). Still, its SNR is generally 2–3 dB lower because the nonlinear CF coefficient is sensitive to noise. The IBP method has the best signal intensity consistency because it inherently considers the acoustic field of the transducer by dividing the transducer aperture into small point detectors, but it significantly increases its computation cost.
In current studies, the SAFT-CF method is mainly employed to improve the lateral resolution in the out-of-focus regions. However, its advantage is primarily demonstrated with point targets such as the cross-section of blood vessels. The blood vessel simulation in this work revealed that the proposed four BP methods improved the image quality in the off-focal regions. It is noted that three out of the four methods showed slightly different results since they are all based on the modeling of the acoustic field of the transducer, except for the SAFT-CF method, which is more effective for point targets. Although CF coefficients help to improve the lateral resolution of point targets (as seen in
Figure 3b), they are sensitive to data noise (as seen in
Figure 3c) and degrade the reconstructed image quality of the vascular networks (as seen in
Figure 5d). Thus, the SAFT-CF method is not recommended in continuous blood vessel imaging. Therefore, we suggest choosing the STR-BP method because it can balance computational cost, SNR, and consistency of target intensity.
In actual experiments, we speculate which of the three reconstruction methods (the SAFT, STR-BP, and IBP) gives better results is only decided by which method models the acoustic field better. Because there are too many factors in the actual experiments, we must probably preprocess the data carefully and try each method before concluding. Although OR-PAM is frequently employed for skin vascular imaging, it cannot fully utilize the high-penetration advantage of PAI. Comparatively, AR-PAM with high-frequency and high NA is more preferred for vascular imaging due to the extended depth. The simulation shows that with the 20 MHz focused transducer, a lateral resolution of about 0.15 mm and an axial resolution of about 0.05 mm can be achieved. Results also show that the BP-based method can withstand a noise level higher than 25%. The noise level and sensitivity of the system drop fast with the increase of the penetration depth due to the strong optical scattering. Considering the spatial restraints of the AR-PAM system and the penetration depth of the commonly used 532 nm high-repetition pulsed laser, we expect that all the arteries and veins in the lateral direction within a depth of 5 mm can be visualized based on our experiences. AR-PAM gives better images of the blood vessels in the depth range of 1 to 5 mm. It is difficult to see the individual arterioles and venules due to the small diameters (generally smaller than 30 μm) [
23]. There have already been many studies of imaging the deep-seated subcutaneous blood vessels with AR-PAM [
8,
10,
24,
25].
Another important conclusion we have drawn is that data interpolation does not improve image quality. Thus, we infer that the image quality degradation is not due to the back-projection artifacts but may be due to the lost information due to the small detector numbers. Another finding is that the image degradation exists both around the transducer focus and in the out-of-focus regions, as seen in
Figure 6d,h, but is most notable around the transducer focus. This is because the sound fields of different A-lines overlap in the off-focal regions, so the information can be complementary.
As a preliminary simulation study of AR-PAM in vascular imaging, some shortcomings need to be overcome in the studies that follow. Firstly, only the data generation was paralleled with GPU in this work. In future results, we will also parallel the image reconstruction. Secondly, the vascular network mainly consists of line targets. We will use images containing point and line targets for simulation in future works. Thirdly, as the vascular networks are in 3D, we will develop 3D simulation and reconstruction frameworks in the future. Lastly, we will perform a more detailed and in-depth analysis of the reconstructed vascular images.
5. Conclusions
This study built a GPU-based framework for simulating AR-PAM signals for imaging vascular networks. On this basis, we compared the reconstruction results by different implementations of BP-based methods. Results confirmed that compared with the conventional B-mode image reconstruction method, the BP algorithms above improve image reconstruction for both point and continuous-line-like complex blood vessels outside of the focal regions. In particular, the SAFT-CF algorithm is more effective for point targets. Comparatively, the STR-BP has higher adaptability and can balance the computational cost, SNR, and consistency of target intensity well. The STR-BP method can improve the PSNR by about 5 to 8 dB compared with the conventional B-mode method, and it helps the image withstand a noise level of up to 25%. Results also showed that a sufficient small scanning step size is required to ensure the image quality, and simply interpolating the data for small detector intervals does not improve the quality of the images. The proposed simulation framework and the findings will guide the design of AR-PAM systems and the choice of appropriate image reconstruction methods.