Next Article in Journal
A Strategic Bargaining Game for a Spectrum Sharing Scheme in Cognitive Radio-Based Heterogeneous Wireless Sensor Networks
Previous Article in Journal
Mn-Doped CaBi4Ti4O15/Pb(Zr,Ti)O3 Ultrasonic Transducers for Continuous Monitoring at Elevated Temperatures
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier

1
School of Instrumentation Science and Opto-Electronics Engineering, Precision Opto-Mechatronics Technology Key Laboratory of Education Ministry, Beihang University, Xueyuan Road No. 37, Haidian District, Beijing 100191, China
2
Department of General Surgery, First Hospital of Xi’an Jiaotong University, Xi’an 710061, China
3
Cancer Imaging Unit—Integrative Oncology Department, BC Cancer Agency Research Centre, Vancouver, BC V5Z 1L3, Canada
*
Author to whom correspondence should be addressed.
Sensors 2017, 17(12), 2739; https://doi.org/10.3390/s17122739
Submission received: 2 August 2017 / Revised: 31 October 2017 / Accepted: 2 November 2017 / Published: 27 November 2017
(This article belongs to the Section Biosensors)

Abstract

:
Combining Fourier transform infrared spectroscopy (FTIR) with endoscopy, it is expected that noninvasive, rapid detection of colorectal cancer can be performed in vivo in the future. In this study, Fourier transform infrared spectra were collected from 88 endoscopic biopsy colorectal tissue samples (41 colitis and 47 cancers). A new method, viz., entropy weight local-hyperplane k-nearest-neighbor (EWHK), which is an improved version of K-local hyperplane distance nearest-neighbor (HKNN), is proposed for tissue classification. In order to avoid limiting high dimensions and small values of the nearest neighbor, the new EWHK method calculates feature weights based on information entropy. The average results of the random classification showed that the EWHK classifier for differentiating cancer from colitis samples produced a sensitivity of 81.38% and a specificity of 92.69%.

1. Introduction

Every year, the number of cancer-caused deaths rises [1]. Among all types of cancer, colorectal cancer is the third most common cause of cancer death worldwide, with an annual incidence of approximately one million cases and 600,000 deaths. The high mortality rate is partially attributed to the fact that established clinical procedures lack reliability and sensitivity for finding cancer at early stages [2,3]. Thus, the importance of early diagnosis in preventing and treating cancer mandates development of an accurate, fast, convenient, and inexpensive diagnostic tool for early detection [4].
Substantial modifications in cancer cells at the molecular level occur prior to morphological changes could be observed in tissues. Therefore, molecular spectroscopes are promising tools to detect cancer-related chemical changes at an early stage [1]. In particular, Fourier transform infrared spectroscopy (FTIR), a popular tool in modern analytical chemistry labs, provides rich information about the bio-molecules that act as building blocks in tissues and cells [5,6,7,8]. Existing clinical diagnosis requires taking biopsy via endoscope, which causes pain and requires lengthy pathological exams. In addition, surgical resection can lead to taking biopsies from non-cancerous tissues due to a number of factors, and there is always a possibility that malignant cells could go into blood stream during such invasive procedure. Therefore, being able to diagnose colorectal cancer in vivo or ex vivo could largely overcome the limitations of existing procedures, providing accurate and rapid determination of proper operative treatment. Combining an attenuated total reflectance (ATR) fiber probe-coupled FTIR spectrometer with an endoscope, a simple, rapid, and noninvasive method to detect human cancer tissues directly with minimal sample preparation may achieve results comparable to the gene expression-based method [9,10,11,12,13]. The current work was carried out on ex vivo tissues using an ATR-FTIR probe.
In recent years, the use of FTIR to diagnose various cancers, such as lung, breast, gastric, liver, and colorectal cancer, has been reported [14,15,16,17,18,19,20,21,22,23]. Chemometric methods, such as support vector machine (SVM) [21], K-nearest neighbor classifier (KNN) [22], and K-local hyperplane distance nearest neighbor (HKNN), enable efficient information extraction and classification model calibration [24,25]. These afore mentioned reports indicate that FTIR spectroscopy along with an effective chemometric classifier could be a useful tool for screening a variety of human tumors. Until now, few studies have been developed for diagnosis and discrimination of colorectal cancers and colitis using FTIR spectroscopy [26]. Most research efforts focus on enabling a high-accuracy and high-sensitivity algorithm for cancer diagnosis. In this study, we combine preprocessing techniques and a novel classification method for analyzing FT-IR spectral data and achieve high accuracy in diagnosing colorectal cancer tissues.

2. Materials and Methods

2.1. Tissue Specimens

All colorectal cancer and colitis tissues were provided by the Medical Division of the First Hospital of Xi’an Jiaotong University, China. Informed consent was obtained from each patient prior to the study, and clinical diagnosis was confirmed by histopathology. A total of 88 tissue samples from 42 female and 46 male patients, were obtained. The average age was 53.7 years old with the oldest being 76 years and the youngest age being 21 years. One fresh endoscopic biopsy of 1–3 mm in diameter was obtained from each patient. According to the pathological exam results, the samples consisted of 41 cases of colitis and 47 cases of cancer.

2.2. Instrumentation and FTIR Data Collection

A WQF-500 FTIR spectrometer linked with a modified attenuated total reflectance (ATR) fiber probe (Beijing No. 2 optical instrument factory, Beijing, China) was used to acquire spectra. The FTIR spectrometer was equipped with a liquid-nitrogen-cooled mercury cadmium telluride (MCT) detector. Specimens were frozen and transported to the laboratory. Before experiment, frozen specimens were thawed at room temperature for approximately 3–5 min. Then, a background spectrum was acquired first. The ATR probe was placed at a 90° angle on the tissue specimen surface for spectrum acquisition. To achieve an acceptable signal-to-noise ratio at a resolution of 4 cm−1, 32 scans were recorded with wavenumbers ranging from 1000 cm−1 to 4000 cm−1. The procedure took approximately 1–2 min. After sample spectra were recorded, samples were stored in liquid nitrogen and sent for the histological examination as reference for spectral analysis.

2.3. Spectra Preprocessing Method

Two preprocessing methods, viz., smoothing and standard normal variate (SNV) [27,28,29], were performed on the FTIR spectra. First, the Savitsky–Golay algorithm with a window width of 5 points was applied to each spectrum to reduce random noise in the data. Then, all available spectra were normalized by the SNV method to remove multiplication interference, slope variation, and scatter effects generated by particles of the sample.
For spectrum x i j of sample i at wavenumber j , SNV standardization is defined as follows:
x i j , S N V = x i j x ¯ i j = 1 n ( x i j x ¯ i ) 2 ( n 1 ) 1 / 2
where x ¯ i denotes the average spectrum of sample i , n denotes the number of wavelengths, and ( n 1 ) denotes the degree of freedom.

2.4. Entropy Weight Local-Hyperplane K-Nearest Neighbor Method

A novel classification method called entropy weight local-hyperplane k-nearest neighbor (EWHK) is proposed for discrimination between colorectal cancer and colitis. For the EWHK method, which is an upgrade of K-local hyperplane distance nearest neighbor (HKNN) algorithm [24,25], feature weights of training sets based on the information entropy are objectively considered to measure the importance of each single feature and to avoid the bias in high dimensions and the limit in small values of the nearest neighbor. On the other hand, HKNN treats every variable as an equally relevant component for classification. Therefore, the class labels of unknown samples are calculated according to the feature weights, the Euclidean distance, and the local hyperplane.
Suppose that training set X = ( x 1 , , x m ) T consists of m training instances with L classes. Each training instance consists of n input features x i = ( x i 1 , , x i n ) T with known class label y i = c , for i = 1 , , m and c = 1 , , L . The class label of a query with input vector q = ( q 1 , , q n ) T . The three stages in the proposed method were as follows: prototype selection, local hyperplane construction, and query classification.
Firstly, the feature weight is estimated objectively based on the concept of information entropy to figure out the entropy weight according to the variance of every variable. Low information entropy resulted in high feature weight, which corresponds to a feature with better class separation capability. The entropy weight w j is calculated according to the following formula:
z i j = x i j i = 1 m x i j , β = 1 ln ( m ) H j = β i = 1 m z i j ln ( z i j ) w j = 1 H j n j = 1 n H j , j = 1 , , n
where z i j denotes the normalized j th component of sample i in the training set; β denotes the regularization parameter; H j denotes the information entropy of the j th feature of the sample. Hence, new weighted Euclidean distance metric D between x i and q is defined as follows:
D ( x i , q ) = j = 1 n w j ( x i j q j ) 2 .
Then, a local hyperplane of class c is constructed for the given query q according to the distance metric D and the number k of nearest neighbors of class c . Formally, the formula is as follows:
L H c ( q ) = { s | s = i = 1 k α i V . i + m c } m c = 1 k i = 1 k p c i V . i = p c i m c α = ( α 1 , , α k ) T
where p i is the i th nearest neighbor of class c ; α is solved by minimizing the distance between q and L H c ( q ) using regularization. Thus, the calculated minimum distance is as follows:
J c ( q ) = min α j = 1 n w j ( V j . α + m c j q j ) 2 + λ α T α     = min α ( s q ) T W ( s q ) + λ α T α W = d i a g ( w 1 , , w n )
where λ is the regularization parameter. J c ( q ) is minimized, and the equation ( U T V + λ I k ) α = U T ( q m c ) , where U T = V T W is used to calculate α , is derived. Finally, the class of the query q is assigned as follows: c l a s s ( q ) = arg   min c   J c ( q ) .

3. Results and Discussion

3.1. Preprocessing of FTIR Spectra

In the process of measurement, the obtained spectra contain not only useful information regarding the molecular structure and the components of the measured samples, but also the noises, such as the high-frequency random noise, the baseline drift, and the stray light. This additional noise needs to be eliminated; otherwise, they will affect the discrimination result.
Prior to classification analysis, data preprocessing is necessary to improve performance of the classification model. Savitzky–Golay (SG) smoothing reduces random noise, and SNV was applied to remove unwanted background variances to some extent. There is no bio-molecules absorbance peak in the 1800–2800 cm−1 region, the majority of peaks are in the 1000–1800 cm1 region and in the 2800–3800 cm1 region. Thus, the SNV method was separately used from 1000 to 1800 cm−1 and from 2800 to 3800 cm−1.The FTIR spectra of colitis and cancerous tissues before and after preprocessing are shown in Figure 1. After performing background correction and normalization, useful information of all spectra (such as at the wavenumber near 1743 cm−1, 2858cm−1, and 2924 cm−1) were marked as shown in Figure 2. The quality of FTIR spectra was greatly improved after data preprocessing.

3.2. Analysis of FTIR Spectra

A total of 88 spectra were obtained by FTIR spectroscopy within the spectral region between 1000 and 4000 cm−1. Because there were no bio-molecule absorbance peaks in the 1800–2800 cm1 region, the SNV method was separately used from 1000 to 1800 cm1 and from 2800 to 3800 cm1 after smoothing.
Figure 2 shows the spectra of colitis and colorectal cancer biopsies after preprocessing, where the band assignments of major absorption in the FTIR spectra of colorectal tissue are marked. The major peaks are similar for the spectra of colitis and colorectal cancer. However, the differences including peak shape and relative intensity can be observed. These results are reasonable because significant changes occurred in both the structure and composition of the main bio-molecules, which constitute the cell such as DNA, water, protein, and lipids, between cancerous and colitis tissues.
As is shown in Figure 2, the spectral profile of cancerous tissues indicates the presence of fewer lipids and more proteins compared with colitis tissues. The peak intensity of the C=O band assigning to the lipids (near 1743 cm−1) and the peak intensity of C-H stretching vibration bands relating to the lipids (near 2958 cm−1, 2924 cm−1, and 2858 cm−1) decrease and even disappear in the spectra of malignant tissues, making it essential to consume fat in the malignant tissue to meet the nutritional and energy requirements in carcinoma development. The spectral profile of cancerous tissues indicates the presence of proteins at wavenumbers ~1643 cm−1 and ~1550cm−1, which belong to amide I band and amide II band of the protein, respectively. The relative intensity near I 1550 / I 1643 decreases more for the spectra of cancerous tissues than for those in colitis biopsies because of the changes in the proportion of proteins during tumor formation. The intensity of the ~1460 cm−1 peak is weaker than that of the ~1400 cm−1 peak in the spectra of the cancerous samples, while the peak at ~1460 cm−1 is stronger than or equal to that of ~1400 cm−1 in the spectra of colitis samples. Cancerous tissue contains greater amounts of nucleic acids, collagen, and certain amino acids compared to the colitis ones. In colitis tissues, the peak at ~1240 cm−1 is weaker, and the band near 1310 cm−1 becomes weak and sometimes disappears. The absorption peak ~1080 cm−1 assigned to nucleic acid is obviously weaker in the spectra of colitis samples than that in the spectra of cancerous tissues. The peak at ~1160 cm−1 assigned to carbohydrate decreases noticeably in the spectra of the cancerous samples. Thus, the characteristics mentioned above between cancerous and colitis tissues provide the basis for spectroscopic diagnosis.
Specific assignments of individual peaks can be found in Table 1.

3.3. Classification Analysis

After spectra preprocessing, 88 spectra data (41 colitis and 47 cancers) were analyzed to identify their class labels. The total 88 spectra were divided into two data sets. The 44 FTIR spectra (21 colitis and 23 cancers) after preprocessing were randomly selected as the training set. The other 44 FTIR spectra (20 colitis and 24 cancers) after preprocessing were randomly selected as the test set. Both EWHK and traditional classification models were built by the training set and validated by the test set. Traditional classification models include SVM and HKNN in this paper. These procedures were repeated five times. The five predicted results were averaged. Table 2 shows the classification results of colorectal tissues with entropy weight local-hyperplane k-nearest neighbor (EWHK). Table 3 shows the average of the five results using the three different methods.
The experiment results are summarized in Table 2 and Table 3. The classification results of colorectal tissue samples with EWHK (Table 2) showed that, among the 88 cases of colorectal tissue samples, only three colitis samples and nine cancer samples are misclassified. In Table 3, EWHK achieved a high accuracy, viz., 85.91%. In addition, other statistics results of detection of colorectal biopsies by FTIR spectroscopy with EWHK and traditional classification models are shown in Table 3. For colorectal cancer diagnosis with EWHK, sensitivity is 81.38%, specificity is 92.69%, predictive value of a positive test is 92.68%, and the predictive value of a negative test is 80.85%. In comparison, statistical analysis results with HKNN were worse than those with EWHK in diagnosing colorectal cancer tissues, achieving 66.46% sensitivity and 79.77% accuracy. The SVM works worse than HKNN. SVM can perform well with large-scale data. However, the choices of the parameters for the kernel are complex and unstable. The HKNN works well only for small values of the nearest-neighbor. However, the accuracy decreases as values of the nearest-neighbor increase. The FTIR spectra can be classified accurately with EWHK because it considers the influence of feature weight according to the information entropy of every variable. In conclusion, the results indicate that the EWHK has better capability in identifying colorectal cancer from colitis.

4. Conclusions

This study shows that it is feasible to classify colitis and cancers using FTIR spectroscopy and chemometrics. FTIR fiber-optic ATR spectroscopy is a powerful tool to detect changes at the molecular level and can rapidly capture small changes in molecular compositions and structures. Therefore, it has the potential to be further developed into noninvasive, in vivo, and real-time detection tools of cancerous tissues before a surgical operation is required. Data pre-processing such as smoothing and SNV greatly improved the signal-to-noise ratio for the FTIR spectra of colorectal tissues, and the EWHK classifier achieved a classification accuracy of 85.91%. The reason that EWHK performs well is because feature weights are calculated according to the information entropy of every variable. The proposed preprocessing and classification method using FTIR spectroscopy is effective and practical for in vivo colorectal cancer or other malignant tissue diagnosis and will be pursued in future studies.

Acknowledgments

The authors gratefully acknowledge the Medical Division of the First Hospital of Xi’an Jiaotong University for providing the tissue specimens and Beijing No. 2 Optical Instrument Factory (Beijing, China) for excellent technical assistance. This work was financially supported by the National Natural Science Foundation of China (Grant No. 61575015).

Author Contributions

Qingbo Li proposed the algorithm, conceived and designed the experiments; Can Hao processed the spectral data, analyzed the spectral data and wrote the manuscript; Xue Kang was responsible for data preprocessing; Jialin Zhang performed the experiments; Xuejun Sun provided the samples and the pathological results; Wenbo Wang contributed to literature search and modified the English articles; Haishan Zeng completed final draft.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, H.; Lin, Z.; Tan, C. Cancer Discrimination Using Fourier Transform Near-Infrared Spectroscopy with Chemometric Models. J. Chem. 2015, 2015, 1–8. [Google Scholar] [CrossRef]
  2. Li, S.X.; Chen, G.; Zhang, Y.J.; Guo, Z.Y.; Liu, Z.M.; Xu, J.F.; Li, X.Q.; Lin, L. Identification and characterization of colorectal cancer using Raman spectroscopy and feature selection techniques. Opt. Express 2014, 22, 25895–25908. [Google Scholar] [CrossRef] [PubMed]
  3. Tatarkovič, M.; Miškovičová, M.; Šťovíčková, L.; Synytsya, A.; Petruželka, L.; Setnička, V. The potential of chiroptical and vibrational spectroscopy of blood plasma for the discrimination between colon cancer patients and the control group. Analyst 2015, 140, 2287–2293. [Google Scholar] [CrossRef] [PubMed]
  4. Rogler, G. Chronic ulcerative colitis and colorectal cancer. Cancer Lett. 2014, 345, 235–241. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Jawaid, S.; Talpur, F.N.; Sherazi, S.T.H.; Nizamani, S.M.; Khaskheli, A.A. Rapid detection of melamine adulteration in dairy milk by SB-ATR-Fourier transform infrared spectroscopy. Food Chem. 2013, 141, 3066–3071. [Google Scholar] [CrossRef] [PubMed]
  6. Zhang, L.; Zhang, L.M.; Li, Y.; Liu, B.P.; Wang, X.P.; Wang, J.D. Application and improvement of partial-least-squares in Fourier transform infrared spectroscopy. Spectrosc. Spect. Anal. 2005, 25, 1610–1613. [Google Scholar]
  7. Lasch, P.; Haensch, W.; Naumann, D.; Diem, M. Imaging of colorectal adenocarcinoma using FT-IR microspectroscopy and cluster analysis. Biochim. Biophys. Acta 2004, 1688, 176–186. [Google Scholar] [CrossRef] [PubMed]
  8. Piva, J.A.D.A.C.; Silva, J.L.R.; Raniero, L.J.; Lima, C.S.P.; Arisawa, E.A.L.; Oliveira, C.D.; Canevari, R.D.A.; Ferreira, J.; Martin, A.A. Biochemical imaging of normal, adenoma, and colorectal adenocarcinoma tissues by fourier transform infrared spectroscopy (FTIR) and morphological correlation by histopathological analysis: Preliminary results. Rev. Bras. Eng. Biomed. 2015, 31, 10–18. [Google Scholar] [CrossRef]
  9. Chonanant, C.; Jearanaikoon, N.; Leelayuwat, C.; Limpaiboon, T.; Tobin, M.J.; Jearanaikoon, P.; Heraud, P. Characterisation of chondrogenic differentiation of human mesenchymal stem cells using synchrotron FTIR microspectroscopy. Analyst 2011, 136, 2542–2551. [Google Scholar] [CrossRef] [PubMed]
  10. Gao, X.X.; Yao, H.W.; Du, J.K.; Zhao, M.X.; Qi, J.; Li, H.Z.; Pan, Q.H.; Xu, Y.Z.; Wu, J.G. Study on Malignant and Normal Rectum Tissues Using IR and H-1 and P-31 NMR Spectroscopy. Spectrosc. Spect. Anal. 2009, 29, 969–973. [Google Scholar]
  11. Walsh, M.J.; German, M.J.; Singh, M.; Pollock, H.M.; Hammiche, A.; Kyrgiou, M.; Stringfellow, H.F.; Paraskevaidis, E.; Martin-Hirsch, P.L.; Martin, F.L. IR microspectroscopy: Potential applications in cervical cancer screening. Cancer Lett. 2007, 246, 1–11. [Google Scholar] [CrossRef] [PubMed]
  12. Wu, J.G.; Xu, Y.Z.; Sun, C.W.; Soloway, R.D.; Xu, D.F.; Wu, Q.G.; Sun, K.H.; Weng, S.F.; Xu, G.X. Distinguishing malignant from normal oral tissues using FTIR fiber-optic techniques. Biopolymers 2001, 62, 185–192. [Google Scholar] [CrossRef] [PubMed]
  13. Yao, H.W.; Liu, Y.Q.; Fu, W.; Shi, X.Y.; Zhang, Y.F.; Xu, Y.Z. Initial Research on Fourier Transform Infrared Spectroscopy for the Diagnosis of Colon Neoplasms. Spectrosc. Spect. Anal. 2011, 31, 297–301. [Google Scholar]
  14. Du, J.K.; Shi, J.S.; Sun, X.J.; Wang, J.S.; Xu, Y.Z.; Wu, J.G.; Zhang, Y.F.; Weng, S.F. Fourier transform infrared spectroscopy of gallbladder carcinoma cell line. HBPD Int. 2009, 8, 75–78. [Google Scholar] [PubMed]
  15. Bai, Y.K.; Yu, L.W.; Zhang, L.; Fu, J.; Leng, H.; Yang, X.J.; Ma, J.Q.; Li, X.J.; Li, X.J.; Zhu, Q.; et al. Research on Application of Fourier Transform Infrared Spectrometry in the Diagnosis of Lymph Node Metastasis in Gastric Cancer. Spectrosc. Spect. Anal. 2015, 35, 599–602. [Google Scholar]
  16. Tian, P.R.; Zhang, W.T.; Zhao, H.M.; Lei, Y.T.; Cui, L.; Wang, W.; Li, Q.B.; Zhu, Q.; Zhang, Y.F.; Xu, Z. Intraoperative diagnosis of benign and malignant breast tissues by fourier transform infrared spectroscopy and support vector machine classification. Int. J. Clin. Exp. Med. 2015, 8, 972–981. [Google Scholar] [PubMed]
  17. Dong, L.; Sun, X.J.; Chao, Z.; Zhang, S.Y.; Zheng, J.B.; Gurung, R.; Du, J.K.; Shi, J.S.; Xu, Y.Z.; Zhang, Y.F.; et al. Evaluation of FTIR spectroscopy as diagnostic tool for colorectal cancer using spectral analysis. Spectrochim. Acta A Mol. Biomol. Spectrosc. 2014, 122, 288–294. [Google Scholar] [CrossRef] [PubMed]
  18. Nallala, J.; Diebold, M.D.; Gobinet, C.; Bouché, O.; Sockalingum, G.D.; Piot, O.; Manfait, M. Infrared spectral histopathology for cancer diagnosis: A novel approach for automated pattern recognition of colon adenocarcinoma. Analyst 2014, 139, 4005–4015. [Google Scholar] [CrossRef] [PubMed]
  19. Yao, H.W.; Shi, X.Y.; Zhang, Y.F. The Use of FTIR-ATR Spectrometry for Evaluation of Surgical Resection Margin in Colorectal Cancer: A Pilot Study of 56 Samples. J. Spectrosc. 2014, 2014, 1–4. [Google Scholar] [CrossRef]
  20. Gao, Y.F.; Huo, X.W.; Dong, L.; Sun, X.J.; Sai, H.; Wei, G.B.; Xu, Y.Z.; Zhang, Y.F.; Wu, J.G. Fourier transform infrared microspectroscopy monitoring of 5-fluorouracil-induced apoptosis in SW620 colon cancer cells. Mol. Med. Rep. 2015, 11, 2585–2591. [Google Scholar] [CrossRef] [PubMed]
  21. Sattlecker, M.; Baker, R.; Stone, N.; Bessant, C. Support vector machine ensembles for breast cancer type prediction from mid-FTIR micro-calcification spectra. Chemom. Intell. Lab. 2011, 107, 363–370. [Google Scholar] [CrossRef]
  22. Li, X.; Li, Q.B.; Xu, Y.Z.; Zhang, G.J.; Wu, J.G.; Yang, L.M.; Ling, X.F.; Zhou, X.S.; Wang, J.S. Application of KNN method to cancer diagnosis using Fourier-transform infrared spectroscopy. Spectrosc. Spect. Anal. 2007, 27, 439–443. [Google Scholar]
  23. Li, Q.B.; Xu, Z.; Zhang, N.W.; Zhang, L.; Wang, F.; Yang, L.M.; Wang, J.S.; Zhou, S.; Zhang, Y.F.; Zhou, X.S.; et al. In vivo and in situ detection of colorectalcancer using Fouriertransform infrared spectroscopy. World J. Gastroenterol. 2005, 11, 327–330. [Google Scholar] [CrossRef] [PubMed]
  24. Okun, O.G. K-Local Hyperplane Distance Nearest Neighbor Algorithm and Protein Fold Recognition. Pattern Recognit. Image Anal. 2007, 17, 621–630. [Google Scholar] [CrossRef]
  25. Xu, J.; Yang, J.; Lai, Z.H. K-Local hyperplane distance nearest neighbor classifier oriented local discriminant analysis. Inform. Sci. 2013, 232, 11–26. [Google Scholar] [CrossRef]
  26. Li, X.; Li, Q.B.; Zhang, G.J.; Xu, Y.Z.; Sun, X.J.; Shi, J.S.; Zhang, Y.F.; Wu, J.G. Identification of colitis and cancer in colon biopsies by Fourier transform infrared spectroscopy and Chemometrics. Sci. World J. 2012, 2012, 936149. [Google Scholar] [CrossRef] [PubMed]
  27. Barnes, R.J.; Dhanoa, M.S.; Lister, S.J. Standard Normal Variate Transformation and De-trending of Near-Infrared Diffuse Reflectance Spectra. Appl. Spectrosc. 1989, 43, 772–777. [Google Scholar] [CrossRef]
  28. Wu, W.; Guo, Q.; Jouan-Rimbaud, D.; Massart, D.L. Using contrasts as data pretreatment method in pattern recognition of multivariate data. Chemom. Intell. Lab. 1999, 45, 39–53. [Google Scholar] [CrossRef]
  29. Bi, Y.M.; Yuan, K.L.; Xiao, W.Q.; Wu, J.Z.; Shi, C.Y.; Xia, J.; Chu, G.H.; Zhang, G.X.; Zhou, G.J. A local pre-processing method for near-infrared spectra, combined with spectral segmentation and standard normal variate transformation. Anal. Chim. Acta 2016, 909, 30–40. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The Fourier transform infrared spectra (FTIR) of colitis and cancerous tissues. (a) Original spectra of colitis tissues; (b) original spectra of cancerous tissues; (c) preprocessed spectra of colitis tissues using smoothing and standard normal variate (SNV); (d) preprocessed spectra of cancerous tissues using smoothing and SNV.
Figure 1. The Fourier transform infrared spectra (FTIR) of colitis and cancerous tissues. (a) Original spectra of colitis tissues; (b) original spectra of cancerous tissues; (c) preprocessed spectra of colitis tissues using smoothing and standard normal variate (SNV); (d) preprocessed spectra of cancerous tissues using smoothing and SNV.
Sensors 17 02739 g001aSensors 17 02739 g001b
Figure 2. The typical FTIR spectra of colorectal biopsies after preprocessing.
Figure 2. The typical FTIR spectra of colorectal biopsies after preprocessing.
Sensors 17 02739 g002
Table 1. Peak positions and assignments of FTIR bands in colon tissues.
Table 1. Peak positions and assignments of FTIR bands in colon tissues.
Peak Positions (cm−1)Major Assignment
1080Stretching vibration(DNA, RNA)
1160Carbohydrate
1240Asymmetric stretching vibration(RNA)
1310C-H deformation vibration & Amide III (protein)
1550amide II (protein)
1643amide I (protein)
1743C=O stretching vibration(lipids)
2858, 2924, 2958C-H stretching vibration (lipids)
3300N-H stretching vibration( protein), O-H stretching vibration(water)
Table 2. The average of predict results of colorectal tissues with entropy weight local-hyperplane k-nearest neighbor (EWHK).
Table 2. The average of predict results of colorectal tissues with entropy weight local-hyperplane k-nearest neighbor (EWHK).
Histologic ExaminationThe Predicted Results of Fourier Transform Infrared Spectroscopy (FTIR) Spectra
CancerColitis
Cancer389
Colitis338
Table 3. Comparison of the average of statistical analysis results with several classification models.
Table 3. Comparison of the average of statistical analysis results with several classification models.
MethodSensitivity (%)Specificity (%)PPV 1 (%)NPV 2 (%)Accuracy (%)
EWHK81.3892.6992.6880.8585.91
HKNN66.4696.1995.2570.4379.77
SVM72.4668.7070.3071.8770.45
Note: 1 Positive predictive value; 2 Negative predictive value.

Share and Cite

MDPI and ACS Style

Li, Q.; Hao, C.; Kang, X.; Zhang, J.; Sun, X.; Wang, W.; Zeng, H. Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier. Sensors 2017, 17, 2739. https://doi.org/10.3390/s17122739

AMA Style

Li Q, Hao C, Kang X, Zhang J, Sun X, Wang W, Zeng H. Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier. Sensors. 2017; 17(12):2739. https://doi.org/10.3390/s17122739

Chicago/Turabian Style

Li, Qingbo, Can Hao, Xue Kang, Jialin Zhang, Xuejun Sun, Wenbo Wang, and Haishan Zeng. 2017. "Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier" Sensors 17, no. 12: 2739. https://doi.org/10.3390/s17122739

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop