1. Introduction
Vibrational spectroscopy is well established as a powerful and widely used technique for the study of molecular structure and dynamics in the fields of chemistry and physics [
1]. It uses molecular bond vibration as a probe to identify the molecule and their structure [
2]. In recent years, development of short pulse ultrafast laser makes it possible to observe the vibrational motion in real time [
3]. This unique feature facilitate to reveal chemical and biological processes even in femtosecond (fs) timescale [
4]. Therefore, it is natural to think that vibrational spectroscopy can be a potential diagnosis method in medical science [
5].
Vibrational spectroscopy uses infrared light to excite molecular bonds and measure absorption and transmission of light due to the vibrations of the bonds. Each chemical bond has an unique vibrational energy [
6,
7]. Due to this unique vibrational energy, each molecule will have an unique spectral features. This unique spectral feature is called fingerprint of the molecule. The molecular fingerprinting is used to identify a particular molecule from a mixture, therefore, it is very important for biological and medical sample analysis.
Vibrational spectroscopy is not only efficient to identify molecules from mixture, but also it has two advantages over other existing techniques for biological sample analysis. (a) The reagent-free analysis of biofluids takes advantage of the fact that a multitude of analytes can be quantified simultaneously. (b) Since the infrared beam only make changes on the vibrational level, molecules remain intact. Therefore, there is no chance of damage of tissue or constituent molecules in biological sample [
8]. However, due to high absorption of infrared radiation by water, penetration depth in the tissue is very small, which limits the use of this technique for medical diagnosis.
It has been already proved that small volatile organic components (VOC) escaping from body by means of breath, body odor, etc., reflect the internal chemistry of the body [
9]. They are not only characteristic of a person, but also specific to different kinds of diseases. VOCs, related to diseases could serve as biomarkers for clinical diagnostics and disease monitoring. Among many, breath is a source of large number of VOCs and easy to collect for diagnosis. A significant effort has been devoted for the analysis of the VOCs, using Mass spectroscopy, electric nose, vibrational spectroscopy, etc., however, each technique has their inherent shortcoming. Breath contain a large amount of water vapor, which is the main obstacle for the vibrational spectroscopy. Due to the lack of appropriate water removal technique, vibrational spectroscopy until recently not a popular technique for breath analysis.
The main advantage of breath analysis using infrared spectroscopy over the tissue or liquid phase biological sample is, water can be removed from the sample without changing the constitute molecules. A highly efficient water removal technique has been developed and reported recently [
10]. In combination with water removal technique, Fourier transform infrared (FTIR) spectroscopy already proved its potential to breath analysis [
11]. In this article, it has been shown, how FTIR spectroscopy is used for analysis of breath samples.
2. Results and Discussions
For a large number of samples, it is a common practice that medical data are typically analysed statistically. Most commonly used statistical methods are principal component analysis (PCA), analysis of variance (ANOVA), random forest, etc. In all these cases, data are analysed blindly, therefore, any measurement errors or noise, affect the statistical data analysis, which leads to misinterpretation of the data. We propose component analysis method, where each of the spectral component is matched with molecular fingerprint, in case of match, spectral component is considered for analysis. Since only identified spectral features are consider for analysis, there is no chance of misinterpretation of the data. The strength of the spectral peak is used to determine the state of the body [
11]. A detail analysis of the body state is out of scope of this article, however, the method of component analysis is explained below in detail.
A typical water suppressed breath spectra of a healthy volunteer is presented in
Figure 1. A large number of spectral features are observed in the spectra. In general without water suppression most of them are buried under water spectra except carbon dioxide (CO
2). It is produced in many biological processes in our body, however, it is not informative enough, because, it is difficult, even impossible to quantify the amount is produced by each individual processes. On the other hand, many small VOCs in breath, are produced in a single biological processes, specially in case of disease. The main task of breath analysis is to identify those particular VOCs and finding out their origin in the body.
The spectral range (2800–3200 cm
−1) in general populated with C-H stretch vibrational absorptions [
12]. The CH bond is the characteristics of the biological molecules, therefore all biological molecules yield their CH absorption peaks in this spectral region, and appear as very congested spectral feature. Therefore, practically it is impossible to identify individual molecule with their fingerprints in the CH stretch vibrational region. However, a distinguish spectral feature around 3100 cm
−1 is identified as
R branch of Methane. Very strong
Q branch is observed at 3020 cm
−1 but
P branch of methane is buried under the broad C-H absorption spectra of other molecules.
Many prominent spectral features are observed in between 900–1800 cm−1. The spectral features in this spectral region are unique to individual molecules and relatively less congested, which makes it easier to identify the molecules using molecular fingerprint. For example, double peaks at around 900 cm−1 is an unique feature for Isoprene. Double peaks at 893 cm−1 and 906 cm−1, indicate the presence of Isoprene in the breath sample. The peak at 1220 cm−1 is a clear signature of Acetone and identified unambiguously. However, the peak at 1375 cm−1, which also a fingerprint of Acetone, is much higher in strength than it should be. Therefore, this spectral feature must be formed due to absorption of more than one molecule. We have identified the second molecule, Acetaldehyde. A very strong absorption is observed at 1750 cm−1 due to the absorption of the C=O of acetone, acetaldehyde and some small molecule with amide band.
The identified molecules, mentioned above, are present in breath of healthy individual with relatively high concentration, between few hundred ppb (parts-per-billion) to few ppm (parts-per-million). However, many more molecules are present in breath with much lower concentration, even down to ppt (parts-per-trillion) level. In the current state, limits of our detection system is about 50 ppb. Therefore, there is a chance to find out more molecules in the spectra. With a proper magnification of the spectra, many more molecules can be found on the pedestal of the above mentioned peaks. For example, a fine structure is visible (see inset of
Figure 1(a)) at the left pedestal of CO
2, by magnifying the spectra ten times. This fine structure is identified as carbon monoxide, which in general produce in the body due to the oxidative stress and tissue injury. Similarly using higher magnification and careful investigation many more molecules can be identified. So far we have identified 14 VOCs, however many more distinct spectral features are on the way for identification.
Using above mentioned component analysis, a pilot study has been performed with 14 cerebral palsy volunteers and 14 healthy volunteers. Two strong distinguish spectral features have been observed at 1189 cm
−1 (
I) and 1203 cm
−1 (
II), between two group of volunteers. Both the spectral features are with similar spectral width of 10 cm
−1. The signatures found for 100% (I) and 86% (II) of the CP patients and present for 92% (I) and 83% (II) of healthy individuals, are in average higher in concentration for the former cohort by approximately a factor of two, without any individual exception. The strength of the peak at 1189 cm
−1 for two groups of volunteers are presented at inset of
Figure 1(b). By performing a statistical analysis based on above mentioned two spectral feature we have found better than 90% accuracy in distinguishing the individuals in the groups.