Next Article in Journal
Organic Pollution in Surface Waters from the Fuglebekken Basin in Svalbard, Norwegian Arctic
Previous Article in Journal
A Semantic Sensor Web for Environmental Decision Support Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension †

1
Department of Communication Engineering, Fujian Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, Xiamen 361005, China
2
Department of Electronic Science, Fujian Key Laboratory of Plasma and Magnetic Resonance, Xiamen 361005, China
3
School of Software, Shanghai Jiao Tong University, Shanghai 200240, China
*
Author to whom correspondence should be addressed.
A one page abstract of this work was presented at the 18th Scientific Meeting of the International Society for Magnetic Resonance in Medicine, Stockholm, Sweden, 1–7 May 2010; p. 3371.
Sensors 2011, 11(9), 8888-8909; https://doi.org/10.3390/s110908888
Submission received: 31 July 2011 / Revised: 31 August 2011 / Accepted: 5 September 2011 / Published: 15 September 2011
(This article belongs to the Section Chemical Sensors)

Abstract

: Reducing the acquisition time for two-dimensional nuclear magnetic resonance (2D NMR) spectra is important. One way to achieve this goal is reducing the acquired data. In this paper, within the framework of compressed sensing, we proposed to undersample the data in the indirect dimension for a type of self-sparse 2D NMR spectra, that is, only a few meaningful spectral peaks occupy partial locations, while the rest of locations have very small or even no peaks. The spectrum is reconstructed by enforcing its sparsity in an identity matrix domain with p (p = 0.5) norm optimization algorithm. Both theoretical analysis and simulation results show that the proposed method can reduce the reconstruction errors compared with the wavelet-based 1 norm optimization.

Graphical Abstract

1. Introduction

Nuclear magnetic resonance (NMR) spectroscopy is widely utilized to analyze the structures of chemicals and proteins. Multidimensional NMR spectra can provide more information than one-dimensional (1D) NMR spectra. The acquisition time for a conventional two-dimensional (2D) NMR spectrum is mostly determined by the number of t1 increments in the indirect dimension. One possible way is to reduce the acquisition time is to reduce the number of t1 increments. However, this will result in aliasing of the spectrum in the indirect dimension [1,2], because the sampling rate is lower than the requirement of the Nyquist sampling rule.

Researchers have been seeking ways to suppress the aliasing from the aspects of sampling and reconstruction. Radial sampling presents relatively small leakage artifacts [3] and Poisson disk sampling is observed to provide a large low-artifact area in the signal vicinity [4]. The maximum sampling time for multi-dimensional NMR experiments was analyzed by Vosegaard and co-workers [5]. Besides the sampling patterns, some reconstruction algorithms have been employed to improve spectral quality, including maximum entropy [6,7], iterative CLEAN algorithm [8] and Bayesian reconstruction [9]. The sparse sampling was incorporated with intermolecular multiple-quantum coherences for high-resolution 2D NMR spectra in inhomogeneous fields [10].

Recently compressed sensing (CS) theory [11,12], for reconstructing signals from fewer numbers of measurements than the number that the Nyquist sampling rule requires has attracted lots of attention in medical imaging [13], single pixel imaging [14], and computer vision [15], etc. Under the assumption that the acquired data is sparse or compressible in a certain sparsifying transform domain, CS can successfully recover the original signal from a small number of linear projections with little or no loss of information. The choice of sparsifying transform is important in the CS. The sparsfying transform should be maximally incoherent with the measurement operator. Intuitively, the target signal should be sparsely represented in the transform domain, e.g., wavelet transform domain, and this spare representation should be spread out in the encoding scheme. Iddo introduced CS to reconstruct a 2D NMR spectrum from partial random measurements of its time domain signal under the assumption that the spectrum is sparse in the wavelet domain [16].

In this paper, we focus on the reconstruction of self-sparse NMR spectra, that is, a few meaningful spectral peaks occupy partial locations while the rest locations have very small or even no meaningful peaks. NMR spectra includes regions where no signals arise because of the discrete nature of chemical groups [17]. The reason we pay attention to self-sparse NMR spectra is that many NMR spectra of chemical substances fall in this type [3,10,16,17]. Based on the concept of sparsity and coherence in CS, we demonstrate that a wavelet transform is not necessary to sparsify the self-sparse NMR spectra or even worsens the reconstruction. We propose to reconstruct the NMR spectrum by enforcing its sparsity in an identity matrix domain with a p (p = 0.5) norm optimization algorithm. Simulation results show that the proposed method can reduce the reconstruction errors compared with the wavelet-based 1 norm optimization.

Recently, Kazimierczuk and Orekhov [18] and Holland et al. [19] independently proposed to use CS in proton NMR and showed promising results in reducing acquired data. A combination of spatially encoding the indirect domain information and CS was proposed by Shrot and Frydman [20]. The spectra were considered to be sparse themselves [1820], differing from the sparse representation using wavelets [16]. However, no comparison on the reconstructed spectra with and without wavelet transform was given and no theoretical analysis was presented. In this paper, we will analyze the performance of wavelet transform in the CS-NMR basing on the sparsity and coherence properties and simulated results.

The remainder of this paper is organized as follows. In Section 2, the reason to undersample the indirect dimension is given by calculating the acquisition time for a 2D NMR spectrum. In Section 3, the two key factors of CS, sparsity and coherence, are briefly summarized and their values are estimated for 2D spectra, followed by the proposed reconstruction method. In Section 4, reconstruction of self-sparse NMR spectra is simulated to show the shortcomings of the wavelet and the advantage of the identity matrix. The improvement of utilizing the p norm is also demonstrated. Finally, discussions and conclusions are given in Section 5.

2. Undersampling in the Indirect Dimension of 2D NMR

In NMR spectroscopy, a typical sampled noiseless time domain signal can be described as a sum of exponentially decaying sinusoids:

y k = j = 1 J ( A j e i ϕ j ) e k Δ t τ j e 2 π ik Δ t ω j
where J is the number of sinusoids, Aj, ∅j, τj and ωj are the amplitude, phase in radians, decay time and frequency, respectively, of the jth sinusoid [21]. Δt is the sampling interval and k (k = 0, 1, …, K) is an integer to denote the kth sample point. Such a signal will give rise to a spectrum that is the sum of Lorentizian peaks centered at different frequencies ωj [21], where j corresponds to jth type of nuclear spins. A conventional 1D single pulse NMR experiment enforces an excitation pulse on a sample followed immediately by data acquisition. The signal eventually decays due to relaxation [22], thus it is called free induction decay (FID). Fourier transform (FT) is applied on the FID to obtain a frequency domain spectrum. Figure 1 shows the simulated FID signal and the corresponding 1D NMR spectrum obtained from FT.

The typical experimental time for a 1D NMR spectrum usually takes several seconds, thus it is not time consuming. However, for a 2D NMR spectrum, the time domain signal is generated based on two time variables t1 and t2. As shown in Figure 2, one scan of 2D NMR spectrum contains three steps: first, the sample is excited by one or more pulses in the preparation period. These pulses result in the evolution of magnetization with time t1; then, the sample is further excited in the mixing period; finally, an FID signal is recorded as a function of t2. Usually, t1 is set as t1 = Δt1, 2Δt1, ..., n1Δt1, N1Δt1 (The increment Δt1 is usually at the order of milliseconds). The number of t1 increments (N1) is determined by:

N 1 = SW 1 Δ f 1
where SW 1 = 1 Δ t 1 is the desired spectral width and Δ f 1 = 1 N 1 Δ t 1 is the corresponding spectral resolution. The typical N1 is from 50 to 500 [22]. Given a fixed t1 = n1Δt1, one scan is performed and the FID signal is recorded and stored along the direct dimension. After the scan, the nuclear spins are allowed to return to their equilibrium states before the next scan for t1 = (n1 + 1)Δt1 [22].

Finally, 2D FT is performed on the 2D FID data. If the time for performing all the pulses in one scan is tp, the total scanning time for a 2D NMR spectrum will be:

T N 1 = n 1 = 1 N 1 ( d 1 + n 1 Δ t 1 + t m + t 2 + t p ) = N 1 ( d 1 + ( 1 + N 1 ) Δ t 1 2 + t m + t 2 + t p )

In order to obtain a good resolution in the indirection dimension, N1 is usually several tens or hundreds or even more. This will cause the total scanning time for a 2D NMR spectrum to be tens of minutes or even several hours [2226].

In this paper, we aim to reduce the scan number for the t1 dimension. Rather than using the uniform increment in the indirect dimension (t1 = Δt1, 2Δt1, ..., n1Δt1, N1Δt1), we randomly choose unduplicated Q numbers from nq ∈ {1, 2, ..., N1}, and let t1 = nqΔt1. Let:

ρ = Q N 1
be the sampling rate in this paper, the total time to scan a 2D NMR spectrum is approximately:
T Q = Q N 1 T N 1 = ρ T N 1

The approximation is made by ignoring the total evolution time ∑nq∈ {1,2,...,N1},q = 1,2,...,QnqΔt1 since this value is only in the order of seconds. Compared to the time to acquire a 2D spectrum with fully sampled FIDs in the indirection dimension, undersampling the FIDs in the indirect dimension can greatly reduce the acquisition time for a 2D NMR spectrum if ρ is small enough. Figure 3 shows an example where we randomly undersample the indirect dimension with sampling rate ρ = 5/11 = 0.45. It means we save nearly half of the acquisition time of the conventional scheme.

However, this undersampling will result in aliasing artifacts [1,6]. It would be of great value if we can minimize these artifacts and reconstruct the full 2D NMR spectrum from the limited data. Here we explore the undersampling and reconstruction methods under the framework of CS.

3. Reconstruction of 2D Self-Sparse NMR Spectra with Compressed Sensing

3.1. Basic Concepts in Compressed Sensing

The CS proposed by Candès et al. [11] and Donoho [12] is a new theory to do undersampling and reconstruct the signal of interest from limited physically acquired data. They build a theoretical foundation that one can exactly or approximately recover signals from highly incomplete measurements. The two basic tenets to guarantee the performance of CS are sparsity and incoherence.

(a) Sparsity. For the signal x ∈ RN and a basis dictionary Ψ ∈ RS × N (e.g., identity matrix, FT, discrete cosine transform or wavelet transform matrix), the sparsity is often interpreted as:

S = α 0 = Ψ x 0 N
where ‖ α0 denotes the 0 norm that counts the nonzero entries in α, and S is the number of nonzero entries. If x is sparse without transformation (namely sparse in identity matrix I ∈ RN × N), it is called self-sparse since other complicated sparsifying transform, e.g., wavelet transform, is not required.

Candès et al. [11] and Donoho [12] proved that it is possible to recover the original signal x from O(NlogS) measurements. This means the required number of measurements is proportional to the number of nonzero entries in the basis Ψ. The smaller the S is, the less the number of measurements is required.

(b) Incoherence. When a signal x is sampled by a sensing matrix ΦM × N, the measurements y ∈ RM of x is:

f = Φ x

The coherence is defined as [27,28]:

μ ( Φ , Ψ ) = max k , j | ϕ k , ψ j |
where ∅k is the kth rows of Φ and Ψj is the jth column of Ψ. The coherence measures the largest correlation between any row of Φ and column of Ψ. The less the coherence between Φ and Ψ is, the smaller the μ is. The value range of μ is [ 1 , N ]. The minimal coherence μ = 1 occurs when Φ and Ψ is a time-frequency pair [29]. CS requires the coherence to be as small as possible, which means each measurement vector ∅k must be ‘spread out’ in the Ψ domain [28].

If the signal x satisfies [30]:

S = α 0 < 1 2 ( 1 + 1 μ ( Φ , Ψ ) )
it can be perfectly recovered by solving:
α ^ = min α α 0 ,       s . t . y = Φ Ψ α
where ‖ α0 denotes the 0 norm that counts the nonzero entries in α.

The recovered signal is:

x ^ = Ψ α ^

Equation (9) implies that if the coherence between Φ and Ψ is small, more non-zeros can be allowed in the sparse representation α. CS suggests Φ to be random enough to guarantee its incoherence with any Ψ. This is also observed that random sampling in time domain can improve the quality of reconstructed spectra [31].

However, 0 norm is known to be intractable and sensitive to noise [11,12], and 1 norm convex optimization is commonly used in CS to recover x by solving:

α ^ = min α α 1 ,       s . t . y = Φ Ψ α

The accuracy of CS reconstruction using Equation (12) can be guaranteed if ΦΨ satisfies the appropriate restricted isometry properties [32]. A restricted isometry constant σs [32] defined as the smallest number such that:

( 1 σ S ) α 2 2 Φ Ψ α ( 1 + σ S ) α 2 2
holds for all vectors that have at most S nonzero entries. If σ 2 S < 2 1, the solution to the 1 norm problem is that of the 0 problem [32].

The number of measurements M should satisfy:

M C μ 2 ( Φ , Ψ ) S log   N
so that the signal x can be exactly recovered from measurements y in overwhelming majority of cases [28]. Equation (14) implies that the number of measurements is proportional to the number of nonzero entries S in α and the square of coherence μ. If both S and μ are small, the required number of measurements M could be small. This means that one can perform fewer measurements to save acquisition time while reconstruct original signal x very well.

Iddo [16] applied CS to remove the aliasing artifacts from incompletely acquired FID data by enforcing the sparsity of 2D NMR spectra in wavelet domain according to:

α ^ = min α α 1 ,     s . t . y Θ F T Ψ T α σ
where y is the measurements in time domain, Θ is a random sampling operator defining the FIDs acquired in the indirect dimension, FT denotes the inverse 2D FT, and ΨT is the inverse 2D wavelet transform. According to Equation (11), the recovered spectrum is x̂ = ΨTα̂.

In this paper, we focus on the reconstruction of self-sparse NMR spectra in which significant peaks take up partial locations of the full NMR spectra while the remaining locations have very small or even no peaks. Ideally, if the number of sinusoids J in Equation (1) is very small, and the meaningful peaks are narrow enough relative to the whole 2D frequency coverage, the spectra can be considered to be sparse since the number of non-zeros for the spectra is much smaller than the number of spectrum points in the 2D NMR spectra.

The sparsifying transform and the coherence between Ψ and Φ = ΘFT play important roles in the CS, as we have discussed. In the following sections, we will demonstrate that wavelet is not necessary to sparsify or even worsens the self-sparse NMR spectra based on the concept of sparsity and coherence. We will then reconstruct the NMR spectrum by enforcing its sparsity in an identity matrix domain with p (p = 0.5) norm optimization algorithm.

To represent the NMR spectra in conventional way [47,17], the X and Y coordinate axes are shown with unit of parts per million (ppm) [21] defined as:

δ = ω ω ref ω 0 × 10 6
where δ is the chemical shift of a peak with frequency ω, ωref is the frequency of a reference peak and ω0 is the spectrometer carrier frequency.

3.2. Sparsity of Self-Sparse NMR Spectra

Figure 4(a) shows a 2D 1H-1H correlation spectroscopy (COSY) spectrum where most of the peaks fill partial and very limited regions of the full spectrum. This leads to the sparsity of spectrum because the number of non zeros in the 2D spectrum is much smaller than the number of spectrum points. This phenomenon is also observed by Yoh Matsuki et al. [17].

To test the sparsity of NMR spectra, we can measure the decay of coefficients in a sparsifying transform domain and evaluate the approximation error by retaining the k-term largest coefficients, because the reconstruction error is proportional to the power law decay kr, where r is a constant implying the sparsity of signal [29]. Rapid decay of coefficients implies that one can use less non-zero coefficients to approximate a NMR spectrum. If we directly measure the decay of signal without complicated sparsifying transform, e.g., wavelets, it means measure the self-sparsity of signal. Mathematical saying is measuring its sparsity in the identity matrix.

As shown in Figure 4(b), both the spectra and its wavelet coefficients can achieve rapid decay. By retaining 3% largest magnitude coefficients, the spectra can be reconstructed well in Figure 4(c,d). However, the spectrum is sparser than its representation in the wavelet domain. This is demonstrated by the faster decay of spectrum than that of its wavelet coefficients in Figure 4(b). By retaining the 1% largest magnitude coefficients, the wavelet fails to represent some peaks while the spectrum itself can represent these peaks, as marked by the arrows in Figure 4(e,f).

For a 2D 1H-13C COSY spectrum, the spectrum decays faster than its wavelet coefficients (Figure 5(b)). This implies that the identity matrix can provide a sparser representation of spectra than a wavelet does. Peaks are lost or distorted by using the wavelet transform to represent the spectrum (Figure 5(e)), but the spectrum is represented very well with the identity matrix (Figure 5(f)). This phenomenon is consistent with the observation on the 2D 1H-1H COSY spectrum discussed above.

As a result, this spectrum is self-sparse, which means spectrum is sparse in the identity matrix. Thus, according to Equations (9) and (14), it is better to use an identity matrix than to use a wavelet to reconstruct the self-sparse spectra from undersampled FIDs since the wavelet cannot provide a sparser representation of the spectrum. In fact, Stern et al. [33] proposed to do iterative soft thresholding on the spectrum directly, not on wavelet coefficients, to recover one dimensional NMR spectra from the truncated FIDs. Although the sparsity of NMR spectra is not explicitly expressed in that work [33], the recovered spectrum is obtained from minimizing 1 norm of spectrum, which implies enforcing the sparsity of the spectrum. The problem of their method is that truncation violates the random sampling scheme in CS and results in strong Gibbs ringing which is hard to suppress [29]. What is more, truncating the 1D FID is not necessary to save the time to scan a spectrum since scanning a 1D NMR spectrum is fast and only takes on the order of seconds.

3.3. Coherence Property of Wavelet-Based and Identity Matrix-Based CS-NMR Spectra

Besides the sparsity of signal, another key factor for CS is the coherence between Φ and Ψ According to Equations (9) and (14), fewer measurements are required for signal sampling system Φ if it is less coherent with Ψ and the signal has same sparsity for different Ψ.

Pioneering work on CS has pointed out that the coherence of a time-frequency pair is μ(Φ, I) = μ(ΘFT, I) = 1 [28]. Thus, we only need to compute the coherence between undersampled Fourier operator Φ and wavelet basis ΨT.

The undersampling of Θ in the indirect dimension is carried out by choosing some of the FID points in this dimension. To make this undersampling intuitive, a binary mask which has the same size of 2D FID is shown as the undersampling pattern in Figure 6(a). If the value of mask at location (i, j) is equal to 1 shown as a white pixel, the FID at location (i, j) is acquired.

To avoid the influence of randomness on the coherence calculation, Θ is randomly generated 10 times and the coherence is averaged for each sampling rate. Figure 6(b) shows that the coherence between wavelet and undersampled Fourier operator Φ is larger than the coherence between identity matrix and Φ. So, from the aspect of coherence, it is also better to choose the identity matrix for self-sparse NMR spectra.

3.4. Reconstruction of Self-Sparse NMR Spectra with ℓp Norm Minimization

In this paper, we propose to reconstruct the self-sparse 2D NMR spectra with identity matrix I as follows:

x ^ = min x x 1 , s . t . y = Φ x
where Φ = ΘFT.

To further improve the reconstruction, a p (0 < p < 1) norm is incorporated which has been demonstrated to give better reconstruction of MR images with fewer measurements than 1 norm does [3437]:

x ^ = min x x p p , s . t . y = Φ x
where x p p = n = 1 N | x n | p and xn is the nth entry of vector x. For the function f(x) = |x|p, with p → 0, f(x) gets closer to the 0 norm of x, as shown in Figure 7.

Theoretically, the required number of measurements [38] by enforcing the sparsity with a p (0 < p < 1) norm is:

M C 1 ( p ) K + p C 2 ( p ) K log ( N / K )
where C1 and C2 are determined explicitly and bounded in p and the recommend p is 0.5 [34].

In this paper, the p norm minimization is solved via the p-shrinkage operator [39] with continuation algorithm [40] because of its fast computation. This algorithm is abbreviated as PSOCA and summarized in Algorithm 1.

Algorithm 1. Self-sparse NMR spectra reconstruction with undersampled data using PSOCA.
Algorithm 1. Self-sparse NMR spectra reconstruction with undersampled data using PSOCA.
Initialization:
Input the sampled FID data y, set the regularization parameter λ =108 and tolerance of inner loop η = 5 × 10−3.
Initialize x = T y, xlast = x, β = 26, and α = 0.
Main:
  While β ≤ 216
    Inner loop:
    1. Given x,
      For j = 1 to J, solve Equation (20), the solution is α;
    2. Given α,
      solve Equation (22), the solution is x;
    3. If ‖Δx‖ = ‖xlastx‖ > η, xlastx, go to step 1;
        Otherwise, go to step 4;
    Outer loop:
    4. x, β ← 2β, go to step 1.
  End While
Output: x̂

For a given continuation parameter β, PSOCA is implemented to solve two sub-problems:

(1) p-shrinkage operator

α j = S ɛ p ( x j ) = max { x j ɛ | x j | p 1 , 0 } x j | x j |
where ɛ = β 1 p 2 and β is a parameter to be updated in the continuation scheme, xj and αj are the jth entry of column vectors x and α, respectively.

(2) solve the linear equation:

min x β 2 α x 2 2 + λ 2 y Φ x 2 2
which can be simplified to:
( β I + λ P ) F T x = β F T α + λ Θ T y
where the term P = ΘTΘ is a diagonal matrix consisting of ones and zeros. The diagonal entries of P correspond to the location of FID data and the entry value is 1 if a corresponding FID data point is sampled, otherwise the entry value is 0. Equation (22) can be solved fast since only a discrete Fourier transform and entry-wise division are required.

4. Simulation Results and Analysis

In this section, we will show the advantages of the proposed method in two aspects: (1) identity matrix as the sparsifying transform is compared with wavelet transform; (2) p norm minimization is compared with 1 norm minimization. The recommended value of p is 0.5 for stability from empirical experiments [34]. The notation 0.5 is short for p with p = 0.5. The typical 1 norm minimization algorithms compared in this paper include iterative soft thresholding (IST) algorithm [16,4143], alternating and continuation algorithm (ACA) [40]. The ACA is just p = 1 in PSOCA.

Because regions of small spectrum values usually contain no peaks for practical analysis, we set magnitude smaller than a constant T to be zero according to:

x T   ( j ) = { x   ( j ) , x   ( j ) T 0 , x   ( j ) < T
where x denotes the absolute value of spectra and xT denotes the absolute value of post processed NMR spectra. For evaluation, T is set to two values. First, T is set to zero, which means a spectrum with small absolute values, possibly noise, are not suppressed. Second, T is set to the lowest value of contour when plotting the 2D spectrum. This is reasonable because peaks with absolute values smaller than T are not seen in the contour plot.

Suppose denotes the reconstructed spectrum from undersampled FID, relative 2 norm error (RLNE) is defined to measure the reconstruction error as:

RLNE  = x ^ T x ˜ T 2 x ^ T 2
where is the reconstructed spectrum from fully sampled FID and 0 ≤ , T ≤ 1. RLNE evaluates the normalized error presented in the reconstructed spectrum from undersampled FID. The lower the RLNE is, the better the reconstructed spectrum is consistent to the fully sampled spectrum.

4.1. Reconstruction of the spectra

The improvement by using the proposed method is verified from the less crowed 1H-1H COSY spectrum and more crowded 1H-13C COSY spectrum. The sampling patterns of the two spectra are shown in Figure 8.

Figure 9(c–h) show the reconstructed 1H-1H COSY spectra corresponding to the sampling pattern in Figure 9(a) with a sampling rate of 0.20. With the 1 norm minimization, all the peaks are recovered successfully by using identity matrix (Figure 9(d,f)), while some peaks are lost by using wavelets (Figure 9(c,e)).

Since the contours for the marked peaks look faint, we also plot the 1D slices along the indirect dimension in Figure 10. The height of one peak in the wavelet-based reconstruction in Figure 10(a,b) are much lower than those in the fully sampled spectrum, leading to the peak lost in the contour plots in Figure 9(c,e).

Furthermore, the nonlinear operation on wavelet coefficients induces the artifacts labeled in Figure 9(c,e). This phenomenon is also observed in the 1D slices shown in Figure 10(a,b), where wavelet reconstruction generates illusive peaks. With the 0.5 norm minimization, the errors caused from wavelet and identity matrix reconstruction are reduced, as shown in Table 1. One can still observe the reduced peak height and artifacts in wavelet-based reconstruction, but identity matrix performs very well (Figure 10(d)). The advantage of 0.5 norm over 1 norm is obvious in the crowded 1H-13C COSY spectra, as will be shown in the following discussion.

Figure 11 shows the reconstructed 1H-13C COSY spectra corresponding to the sampling pattern in Figure 8(b) with a sampling rate of 0.25. Some peaks are obviously lost in the reconstructed spectra using wavelets with both 1 norm and 0.5 norm minimization (Figure 11(c,e,g)). These lost peaks are found in the identity matrix-based reconstruction spectra (Figure 11(d,f,h)). With the 0.5 norm minimization, the intensities of the peaks marked with arrow in Figure 11(h) are more consistent to the fully sampled spectra in Figure 11(b) than those in the reconstructed spectra with the 1 norm minimization (Figure 11(d,f)). The smallest reconstruction error is achieved with the proposed identity matrix-based 0.5 norm minimization method (Table 2).

All above simulation results demonstrate that wavelet-based reconstruction obviously induces the loss of some peaks in the crowded 1H-13C COSY spectrum and loss of some weak peaks in the less crowded 1H-1H COSY spectrum. The wavelet may even worsen the reconstructed spectra. Thus, it is not a good choice to use wavelets for the self-sparse spectra discussed in this paper.

4.2. Discussion on the Computation

Our simulation is run on a dual core 2.2 GHz CPU laptop with 3 GB RAM. The computational time for the algorithms using wavelet is two times that using the identity matrix, as shown in Table 3.

In the simulation, with the gradual increase of continuation parameter β, the previous solution was used as a ‘warm start’ for the next alternating optimization in the PSOCA. For a given β, with the increase of iterations in inner loop, the difference between reconstructed spectra decreases (see Figure 12(a)), so does the error between the reconstructed spectrum and the fully sampled spectrum (see Figure 12(b)). The reconstruction error decreases when β becomes large in the outer loop. The computational time of 0.5 norm minimization in PSOCA is nearly four times as that of 1 norm minimization, as shown in Table 3.

5. Conclusions and Future Work

Random sampling in the indirect dimension is introduced to reconstruct 2D self-sparse NMR spectra within the CS framework. Based on the assumption of sparsity of NMR spectra, one may remove the aliasing by penalizing the 1 norm on the coefficients of the sparse representation of NMR spectra. Considering the sparsity and the coherence property, we demonstrate that wavelet transform may reduce the peak height and result in loss of peaks. Thus, a wavelet is not necessary and even worsens the reconstruction of self-sparse NMR spectra. With the p (p = 0.5) norm minimization, the quality of reconstructed spectra can be further improved.

However, how to define the meaningless peaks depends on applications and a qualitative analysis of self-sparse NMR spectra is needed in order to satisfy the requirement of CS. By defining regularity of ideal Lorentizian peaks with aspect to typical vanishing moment wavelet basis, it is possible to give a boundary for the approximation error of Lorentizian peaks in wavelet representation. Thus, one may quantify the sparsity of spectra composed of ideal Lorentizian peaks using wavelets. Another way is to set up a database and analyze the sparsity of the meaningful peaks based on the prior knowledge of chemists. Since the peak height may be reduced in the wavelet-based reconstruction and this reduction depends on the crowd of peaks, it is expected to give a quantitative analysis on the effect of using/skipping wavelet transform by setting up a simulated spectrum or spectrum from real chemical substance, in which the crowd of peaks and the fixed relative height of peaks are pre-defined in the spectrum. Besides, based on the coherence property in CS, the analysis of the performance of different random sampling schemes, e.g., Poisson disk sampling, may lead to further reduction of sampling rate and reconstruction error. Extension of the proposed method on higher dimensional NMR spectra is worth investigating.

Acknowledgments

This work was partially supported by the NNSF of China under Grant 10974164, and the Research Fund for the Doctoral Program of Higher Education of China under Grant 200803840019. Xiaobo Qu and Di Guo would like to acknowledge the fellowship of Postgraduates’ Oversea Study Program for Building High-Level Universities from the China Scholarship Council. The authors also thank the reviewers for their thorough review and highly appreciate the comments and suggestions, which significantly contributed to improving the quality of this article.

References

  1. Bretthorst, GL. Nonuniform sampling: Bandwidth and aliasing. Concept Magn. Reson. A 2008, 32A, 417–435. [Google Scholar]
  2. Maciejewski, MW; Qui, HZ; Rujan, I; Mobli, M; Hoch, JC. Nonuniform sampling and spectral aliasing. J. Magn. Reson 2009, 199, 88–93. [Google Scholar]
  3. Kazimierczuk, K; Kozminski, W; Zhukov, I. Two-dimensional fourier transform of arbitrarily sampled NMR data sets. J. Magn. Reson 2006, 179, 323–328. [Google Scholar]
  4. Kazimierczuk, K; Zawadzka, A; Kozminski, W. Optimization of random time domain sampling in multidimensional NMR. J. Magn. Reson 2008, 192, 123–130. [Google Scholar]
  5. Vosegaard, T; Nielsen, NC. Defining the sampling space in multidimensional NMR experiments: What should the maximum sampling time be? J. Magn. Reson 2009, 199, 146–158. [Google Scholar]
  6. Mobli, M; Hoch, JC. Maximum entropy spectral reconstruction of nonuniformly sampled data. Concept Magn. Reson. A 2008, 32A, 436–448. [Google Scholar]
  7. Jee, JG. Real-time acquisition of three dimensional NMR spectra by non-uniform sampling and maximum entropy processing. Bull. Korean Chem. Soc 2008, 29, 2017–2022. [Google Scholar]
  8. Coggins, BE; Zhou, P. High resolution 4-D spectroscopy with sparse concentric shell sampling and FFT-CLEAN. J. Biomol. NMR 2008, 42, 225–239. [Google Scholar]
  9. Yoon, JW; Godsill, SJ. Bayesian inference for multidimensional NMR image reconstruction. Proceedings of the European Signal Processing Conference (EUSIPCO), Florence, Italy, 4–8 September 2006.
  10. Lin, MJ; Huang, YQ; Chen, X; Cai, SH; Chen, Z. High-resolution 2D NMR spectra in inhomogeneous fields based on intermolecular multiple-quantum coherences with efficient acquisition schemes. J. Magn. Reson 2011, 208, 87–94. [Google Scholar]
  11. Candes, EJ; Romberg, J; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inform. Theory 2006, 52, 489–509. [Google Scholar]
  12. Donoho, DL. Compressed sensing. IEEE Trans. Inform. Theory 2006, 52, 1289–1306. [Google Scholar]
  13. Lustig, M; Donoho, D; Pauly, JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magn. Reson. Med 2007, 58, 1182–1195. [Google Scholar]
  14. Duarte, MF; Davenport, MA; Takhar, D; Laska, JN; Sun, T; Kelly, KF; Baraniuk, RG. Single-pixel imaging via compressive sampling. IEEE Signal Proc. Mag 2008, 25, 83–91. [Google Scholar]
  15. Wright, J; Yang, AY; Ganesh, A; Sastry, SS; Ma, Y. Robust face recognition via sparse representation. IEEE Trans. Pattern Anal 2009, 31, 210–227. [Google Scholar]
  16. Drori, I. Fast l1 minimization by iterative thresholding for multidimensional NMR spectroscopy. EURASIP J Adv Sig Proc 2007. [Google Scholar] [CrossRef]
  17. Matsuki, Y; Eddy, MT; Herzfeld, J. Spectroscopy by integration of frequency and time domain information for fast acquisition of high-resolution dark spectra. J. Am. Chem. Soc 2009, 131, 4648–4656. [Google Scholar]
  18. Kazimierczuk, K; Orekhov, VY. Accelerated NMR spectroscopy by using compressed sensing. Angew. Chem. Int. Ed 2011, 50, 5556–5559. [Google Scholar]
  19. Holland, DJ; Bostock, MJ; Gladden, LF; Nietlispach, D. Fast multidimensional NMR spectroscopy using compressed sensing. Angew. Chem. Int. Ed 2011, 50, 6548–6551. [Google Scholar]
  20. Shrot, Y; Frydman, L. Compressed sensing and the reconstruction of ultrafast 2D NMR data: Principles and biomolecular applications. J. Magn. Reson 2011, 209, 352–358. [Google Scholar]
  21. Hoch, JC; Stern, AS. NMR Data Processing; Wiley-Liss: New York, NY, USA, 1996; p. 38. [Google Scholar]
  22. Keeler, J. Understanding NMR Spectroscopy; Wiley: New York, NY, USA, 2005; Chapter 7,; pp. 1–30. [Google Scholar]
  23. Aue, WP; Bartholdi, E; Ernst, RR. 2-Dimensional spectroscopy: Application to nuclear magnetic-resonance. J. Chem. Phys 1976, 64, 2229–2246. [Google Scholar]
  24. Ernst, RR; Bodenhausen, G; Wokaun, A. Principles of Nuclear Magnetic Resonance in One and Two dimensions; Oxford University Press: New York, NY, USA, 1990. [Google Scholar]
  25. Frydman, L; Scherf, T; Lupulescu, A. The acquisition of multidimensional NMR spectra within a single scan. Proc. Natl. Acad. Sci. USA 2002, 99, 15858–15862. [Google Scholar]
  26. De Graaf, RA. In Vivo NMR Spectroscopy Principles and Techniques, 3rd ed; John Wiley & Sons: Hoboken, NJ, USA, 2007; pp. 389–444. [Google Scholar]
  27. Donoho, DL; Huo, XM. Uncertainty principles and ideal atomic decomposition. IEEE Trans. Inform. Theory 2001, 47, 2845–2862. [Google Scholar]
  28. Candes, E; Romberg, J. Sparsity and incoherence in compressive sampling. Inverse Probl 2007, 23, 969–985. [Google Scholar]
  29. Candès, EJ; Romberg, J. Practical signal recovery from random projections. Proceedings of the Wavelet Applications in Signal and Image Processing XI, San Diego, CA, USA, 31 July–4 August 2005; p. 5914.
  30. Elad, M. Optimized projections for compressed sensing. IEEE Trans. Signal Process 2007, 55, 5695–5702. [Google Scholar]
  31. Hoch, JC; Maciejewski, MW; Filipovic, B. Randomization improves sparse sampling in multidimensional NMR. J. Magn. Reson 2008, 193, 317–320. [Google Scholar]
  32. Candes, EJ. The restricted isometry property and its implications for compressed sensing. Compt. Rendus Math 2008, 346, 589–592. [Google Scholar]
  33. Stern, AS; Donoho, DL; Hoch, JC. NMR data processing using iterative thresholding and minimum l1-norm reconstruction. J. Magn. Reson 2007, 188, 295–300. [Google Scholar]
  34. Chartrand, R. Exact reconstruction of sparse signals via nonconvex minimization. IEEE Signal Proc. Lett 2007, 14, 707–710. [Google Scholar]
  35. Trzasko, J; Manduca, A. Highly undersampled magnetic resonance image reconstruction via homotopic l0-minimization. IEEE Trans. Med. Imaging 2009, 28, 106–121. [Google Scholar]
  36. Qu, X; Cao, X; Guo, D; Hu, C; Chen, Z. Compressed sensing MRI with combined sparsifying transforms and smoothed l0 norm minimization. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP’10, Dallas, TX, USA, 14–19 March 2010; pp. 626–629.
  37. Majumdar, A; Ward, R. Under-determined non-cartesian MR reconstruction with non-convex sparsity promoting analysis prior. Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI’10, Beijing, China, 20–24 September 2010; pp. 513–520.
  38. Chartrand, R; Staneva, V. Restricted isometry properties and nonconvex compressive sensing. Inverse Probl 2008, 24, 1–14. [Google Scholar]
  39. Chartrand, R. Fast algorithms for nonconvex compressive sensing: MRI reconstruction from very few data. Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro—ISBI’09, Boston, MA, USA, 28 June–1 July 2009; pp. 262–265.
  40. Yang, JF; Zhang, Y; Yin, WT. A fast alternating direction method for TV L1-L2 signal reconstruction from partial fourier data. IEEE J. Sel. Top. Signal Process 2010, 4, 288–297. [Google Scholar]
  41. Qu, XB; Zhang, WR; Guo, D; Cai, CB; Cai, SH; Chen, Z. Iterative thresholding compressed sensing MRI based on contourlet transform. Inverse Probl. Sci. En 2010, 18, 737–758. [Google Scholar]
  42. Guo, D; Qu, XB; Huang, LF; Yao, Y. Sparsity-based spatial interpolation in wireless sensor networks. Sensors 2011, 11, 2385–2407. [Google Scholar]
  43. Zibulevsky, M; Elad, M. L1-L2 optimization in signal and image processing. IEEE Signal Proc. Mag 2010, 27, 76–88. [Google Scholar]
Figure 1. Simulated FID data in time domain (a) and its corresponding 1D NMR spectrum (b). Note: the FID is simulated according to Equation (1) with J = 2, A1 = 0.5, A2 = 1, Δt = 0.01 s, τ1 = τ2 = 800, ∅1 = ∅2 = 0, and ω1 = 70 Hz, ω2 = 20 Hz.
Figure 1. Simulated FID data in time domain (a) and its corresponding 1D NMR spectrum (b). Note: the FID is simulated according to Equation (1) with J = 2, A1 = 0.5, A2 = 1, Δt = 0.01 s, τ1 = τ2 = 800, ∅1 = ∅2 = 0, and ω1 = 70 Hz, ω2 = 20 Hz.
Sensors 11 08888f1 1024
Figure 2. General scheme for 2D NMR spectra.
Figure 2. General scheme for 2D NMR spectra.
Sensors 11 08888f2 1024
Figure 3. An example of random undersampling in the indirect dimension. The symbol ⇐ denotes the acquired FIDs.
Figure 3. An example of random undersampling in the indirect dimension. The symbol ⇐ denotes the acquired FIDs.
Sensors 11 08888f3 1024
Figure 4. Sparsity of a 1H-1H COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 3% and 1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 3% and 1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).
Figure 4. Sparsity of a 1H-1H COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 3% and 1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 3% and 1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).
Sensors 11 08888f4 1024
Figure 5. Sparsity of a 1H-13C COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 1% and 0.1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 1% and 0.1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).
Figure 5. Sparsity of a 1H-13C COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 1% and 0.1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 1% and 0.1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).
Sensors 11 08888f5a 1024Sensors 11 08888f5b 1024
Figure 6. Coherence of wavelet and FT. (a) One sampling pattern in the indirect dimension with sampling rate ρ = 0.30 (fully sampled points in the indirect dimension is N1 = 64); (b) coherences for different sampling rates. The symmlet wavelet with four decomposition levels and eight vanishing moments is chosen as a typical wavelet for test, which is also the typical wavelet in [16]. Error bar stands for the standard deviation when repeating 10 times at each sampling rate.
Figure 6. Coherence of wavelet and FT. (a) One sampling pattern in the indirect dimension with sampling rate ρ = 0.30 (fully sampled points in the indirect dimension is N1 = 64); (b) coherences for different sampling rates. The symmlet wavelet with four decomposition levels and eight vanishing moments is chosen as a typical wavelet for test, which is also the typical wavelet in [16]. Error bar stands for the standard deviation when repeating 10 times at each sampling rate.
Sensors 11 08888f6 1024
Figure 7. The value of f(x) = |x|p versus the value of p.
Figure 7. The value of f(x) = |x|p versus the value of p.
Sensors 11 08888f7 1024
Figure 8. Sampling pattern used in simulation. (a) Cartesian sampling pattern with sampling rate 0.20 for the 2D 1H-1H COSY spectrum (N1 = 256 points) in Figure 4(a); and (b) Cartesian sampling pattern with sampling rate 0.25 for the 2D 1H-13C COSY spectrum (N1 = 128 points) in Figure 5(a).
Figure 8. Sampling pattern used in simulation. (a) Cartesian sampling pattern with sampling rate 0.20 for the 2D 1H-1H COSY spectrum (N1 = 256 points) in Figure 4(a); and (b) Cartesian sampling pattern with sampling rate 0.25 for the 2D 1H-13C COSY spectrum (N1 = 128 points) in Figure 5(a).
Sensors 11 08888f8 1024
Figure 9. CS reconstruction of a 2D 1H-1H COSY spectrum using wavelet and identity matrix. (a,b) reconstructed spectra using fully sampled FID and undersampled FID with zero filling, respectively; (c,d) reconstructed spectra using wavelets and identity matrix with IST-based 1 norm, respectively; (e,f) reconstructed spectra using wavelets and identity matrix with PSOCA-based 1 norm, respectively; (g,h) reconstructed spectra using wavelets and identity matrix with PSOCA-based p norm, respectively.
Figure 9. CS reconstruction of a 2D 1H-1H COSY spectrum using wavelet and identity matrix. (a,b) reconstructed spectra using fully sampled FID and undersampled FID with zero filling, respectively; (c,d) reconstructed spectra using wavelets and identity matrix with IST-based 1 norm, respectively; (e,f) reconstructed spectra using wavelets and identity matrix with PSOCA-based 1 norm, respectively; (g,h) reconstructed spectra using wavelets and identity matrix with PSOCA-based p norm, respectively.
Sensors 11 08888f9a 1024Sensors 11 08888f9b 1024
Figure 10. 1D slices along the indirect dimension for the chemical shift of 8.2 ppm (ac) or 7.2 ppm (d) in the direct dimension. (a) Spectra reconstructed with IST-based 1 norm; (b) spectra reconstructed with PSOCA-based 1 norm; (c) spectra reconstructed with PSOCA-based 0.5 norm; (d) spectra reconstructed with PSOCA-based 0.5 norm.
Figure 10. 1D slices along the indirect dimension for the chemical shift of 8.2 ppm (ac) or 7.2 ppm (d) in the direct dimension. (a) Spectra reconstructed with IST-based 1 norm; (b) spectra reconstructed with PSOCA-based 1 norm; (c) spectra reconstructed with PSOCA-based 0.5 norm; (d) spectra reconstructed with PSOCA-based 0.5 norm.
Sensors 11 08888f10a 1024Sensors 11 08888f10b 1024
Figure 11. CS reconstruction of a 2D 1H-13C COSY spectrum using wavelet and identity matrix. (a,b) spectra reconstructed using fully sampled FID (N1 = 128 points) and undersampled FID with zero filling, respectively; (c,d) spectra reconstructed using wavelets and identity matrix with IST-based 1 norm, respectively; (e,f) spectra reconstructed using wavelets and identity matrix with PSOCA-based 1 norm, respectively; (g,h) spectra reconstructed using wavelets and identity matrix with PSOCA-based 0.5 norm, respectively.
Figure 11. CS reconstruction of a 2D 1H-13C COSY spectrum using wavelet and identity matrix. (a,b) spectra reconstructed using fully sampled FID (N1 = 128 points) and undersampled FID with zero filling, respectively; (c,d) spectra reconstructed using wavelets and identity matrix with IST-based 1 norm, respectively; (e,f) spectra reconstructed using wavelets and identity matrix with PSOCA-based 1 norm, respectively; (g,h) spectra reconstructed using wavelets and identity matrix with PSOCA-based 0.5 norm, respectively.
Sensors 11 08888f11a 1024Sensors 11 08888f11b 1024
Figure 12. Numerical performance of PSOCA. (a) The 2 norm of difference between reconstructed spectra in the current and previous iteration when β = 212 in inner loop; (b) the reconstruction error RLNE of the reconstructed spectra when β = 212 in inner loop; and (c) the reconstruction error RLNE versus the iterations in outer loop in PSOCA.
Figure 12. Numerical performance of PSOCA. (a) The 2 norm of difference between reconstructed spectra in the current and previous iteration when β = 212 in inner loop; (b) the reconstruction error RLNE of the reconstructed spectra when β = 212 in inner loop; and (c) the reconstruction error RLNE versus the iterations in outer loop in PSOCA.
Sensors 11 08888f12 1024
Table 1. Reconstruction error of a 1H-1H COSY spectrum.
Table 1. Reconstruction error of a 1H-1H COSY spectrum.
MethodsZero-fillingIST 1PSOCA 1PSOCA 0.5
WaveletRLNE (T = 0)2.0540.4150.3930.430
RLNE (T = 0.1)0.0590.0120.0100.007
Identity matrixRLNE (T = 0)2.0540.2820.2730.245
RLNE (T = 0.1)0.0590.0100.0070.022
Table 2. Reconstruction error of a 1H-13C COSY spectrum.
Table 2. Reconstruction error of a 1H-13C COSY spectrum.
MethodsZero-fillingIST 1PSOCA 1PSOCA 0.5
WaveletRLNE (T = 0)1.6870.5470.5330.541
RLNE (T = 0.1)0.0980.0440.0420.042
Identity matrixRLNE (T = 0)1.6870.4220.4050.343
RLNE (T = 0.1)0.0980.0330.0310.027
Table 3. Running time for reconstruction of a NMR spectrum (unit: second).
Table 3. Running time for reconstruction of a NMR spectrum (unit: second).
MethodsZero-fillingIST 1PSOCA 1PSOCA 0.5

1H-1H1H-13C1H-1H1H-13C1H-1H1H-13C1H-1H1H-13C
Wavelet0.10.111.156.88.570.429.1221.2
Identity matrix0.10.15.927.55.731.816.0105.6

Share and Cite

MDPI and ACS Style

Qu, X.; Guo, D.; Cao, X.; Cai, S.; Chen, Z. Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension. Sensors 2011, 11, 8888-8909. https://doi.org/10.3390/s110908888

AMA Style

Qu X, Guo D, Cao X, Cai S, Chen Z. Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension. Sensors. 2011; 11(9):8888-8909. https://doi.org/10.3390/s110908888

Chicago/Turabian Style

Qu, Xiaobo, Di Guo, Xue Cao, Shuhui Cai, and Zhong Chen. 2011. "Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension" Sensors 11, no. 9: 8888-8909. https://doi.org/10.3390/s110908888

Article Metrics

Back to TopTop