Next Article in Journal
A Comprehensive Review of Silymarin Extraction and Liposomal Encapsulation Techniques for Potential Applications in Food
Previous Article in Journal
Research on Automatic Generation of Park Road Network Based on Skeleton Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Technique Using Confocal Raman Spectroscopy Coupled with PLS-DA to Identify the Types of Sugar in Three Tropical Fruits

by
César R. Balcázar-Zumaeta
1,*,
Jorge L. Maicelo-Quintana
2,
Geidy Salón-Llanos
1,
Miguel Barrena
3,
Lucas D. Muñoz-Astecker
1,
Ilse S. Cayo-Colca
2,
Llisela Torrejón-Valqui
1 and
Efraín M. Castro-Alayo
1
1
Instituto de Investigación, Innovación y Desarrollo para el Sector Agrario y Agroindustrial (IIDAA), Facultad de Ingeniería y Ciencias Agrarias, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru
2
Facultad de Ingeniería Zootecnista, Agronegocios y Biotecnología, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru
3
Facultad de Ingeniería Civil y Ambiental, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas, Calle Higos Urco 342-350-356, Chachapoyas 01000, Peru
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(18), 8476; https://doi.org/10.3390/app14188476
Submission received: 24 July 2024 / Revised: 17 September 2024 / Accepted: 18 September 2024 / Published: 20 September 2024
(This article belongs to the Section Food Science and Technology)

Abstract

:
Tropical fruits such as cherimoya, soursop, and pineapple share sugars (glucose, fructose, and sucrose) in common but may differ in the content of other phytochemicals. In the present work, confocal Raman spectroscopy and partial least squares discriminant analysis (PLS-DA) were used to establish a classification model among the three fruits and to evaluate the effect of pre-processing methods on the model’s performance. The Raman spectra showed that glucose was present in the fruits in the 800–900 cm−1 band and the 1100–1200 cm−1 band. While sucrose was present in the bands of 1131.22 cm−1, 1134.44 cm−1, and 1133.37 cm−1 in the three fruits, fructose was present in the bands of 1464.22 cm−1, 1467.44 cm−1, and 1464.22 cm−1 in cherimoya, soursop, and pineapple. The accuracy of the PLS-DA model varied according to the pre-processing methods used. The Savitzky–Golay first derivative method produced a model with 98.69–100% and 100% precision on the training and prediction data, respectively.

1. Introduction

Tropical fruits contain many biologically active compounds such as polysaccharides, polyphenols, vitamins, dietary fiber, and carotenoids [1]. Cherimoya (Annona cherimola) fruit is rich in fermentable carbohydrates, especially glucose and fructose [2]. This species is probably native to the area between Ecuador and Peru and has been cultivated since 1200 BC during the Inca Empire period [3]. Similarly, soursop (Anonna muricata L.) is native to tropical America and Africa [4]; its pulp constitutes more than 80% of the fruit and is composed mainly of water, non-reducing sugars, and carbohydrates [5]. Pineapple (Ananas comosus L. Merr) is one of the most popular fruits in the world, widely distributed in tropical regions, and highly dependent on its attractive aroma and sweet taste [6]. Pineapple is an economically important crop in tropical and subtropical regions, and its sweetness determines its quality [7]. Moreover, its soluble sugars such as sucrose, glucose, and fructose are responsible for the sweet taste of fresh-cut pineapple [6]. However, they also have other phytochemical components that can cause differences among these three fruits.
Traditionally, the most commonly used techniques for quantifying sugars in food are liquid and gas chromatography, nuclear magnetic resonance, infrared spectroscopy, spectrophotometry, polarization measurement, and hyperspectral imaging [8,9], which are limited by their invasiveness, intensive use of resources, and negative environmental impact [10,11]. Therefore, in recent years, there has been an increasing need to explore non-destructive and in situ methods that have been reported to work as well as high-performance liquid chromatography [12]. One such non-destructive method is Raman spectroscopy (RS), that enables to detect the internal quality of fruits and vegetables without destroying them [11,12,13]. In addition, the use of RS is increasing, especially in food safety. This technology makes it possible to detect peak signals by exposing the sample to a laser light [14]. This new technology is based on the detection of frequency changes and variations in the intensity of scattered light when a sample interacts with a laser light source [13], obtaining more selective spectra because it provides narrower spectral bands with abundant and well-resolved chemical information that is easy to interpret [12,13].
Andersen et al. [15] mention that RS has become a robust method to assess the content of individual sugars; for example, previous studies have been reported on the application of RS in the quantification of sugars from aqueous solutions, allowing the discrimination of fructose, glucose, maltose, and sucrose with laser excitation at 785 nm [9,12]. This technique is promising because it does not require pretreatment of fruit samples such as berries [9,15]. In addition, RS provides spectroscopic fingerprints that provide information related to specific chemical bonds or functional groups that are useful for analyzing samples qualitatively and quantitatively [11], which is used with chemometric methods to discriminate foods according to their composition [16,17]. In addition, the combination of RS and chemometric methods, was used for the identification of agricultural product varieties in grains, edible oils, and honey [18]. Also, they were used for the discrimination of foreign fats and oils in milk cream, and yogurt [19]; for the rapid quantification of starch in agricultural products [20]; and for the differentiation of conventional and transgenic cotton seed genotypes [21].
Regarding chemometric tools for food discrimination based on spectral data [16,22], principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) are considered robust linear classification methods [23]. PCA is an unsupervised method that reduces the data to find similarities between samples [24]. On the other hand, PLS-DA is a supervised method that classifies samples into predefined categories by establishing an internal relationship between spectra (X-block) and a classifier (y-variable) [24,25]. These two methods, known for their speed and effectiveness in providing clear sample separation, can predict a sample class by a model-based regression [22,26]. Therefore, in this research, the effect of data pre-processing methods on model performance was evaluated, and the applicability of RS combined with PCA and PLS-DA to identify the types of sugar from three tropical fruits was studied.

2. Materials and Methods

2.1. Reagents

Glucose, fructose, and sucrose standards (≥99.5% purity) were bought from Sigma-Aldrich (St. Louis, MO, USA).

2.2. Fruit Sampling

Four kilograms each of soursop (Anonna muricata L.), cherimoya (Annona cherimola), and pineapple (Ananas comosus L. Merr) were obtained from a local market in the province of Bagua, in the Amazonas region of Peru (5° 38′ 21″ S, 78° 31′ 54″ W, 408 m.a.s.l.).
The samples were taken to the Laboratorio de Investigación en Ingeniería de los Alimentos y Poscosecha of the UNTRM, where five fruits (without bruises or marks) of each variety were selected. Then, the fruits were manually separated from the peels, and the pulp alone was placed in 50 mL Falcon tubes and stored in deep freezing conditions (Eppendorf, Premium 0410, Hamburg, Germany). The samples were then lyophilized (Labconco, 710402010 model, Kansas City, MO, USA) at 0.008 bar and −84°C for 18 h [27].

2.3. Raman Spectroscopy (RS)

A Horiba Raman spectrometer system (Horiba Scientific, XploRA plus, Montpellier, France) equipped with a 532 nm and 785 nm laser and an Olympus BX 41 microscope was used in this study [28]. The spectrometer was equipped with a 1200 line/mm grating during the spectral recording, and a 532 nm laser was focused on the sample on the microscope stage through a 50 LWD (long working distance) objective (Olympus, Tokyo, Japan). The Raman scattering signals were detected by a charge-coupled device (CCD) detector, with a detection range from 200 to 1700 cm−1 in extended mode. The measurement was performed with an integration time of 5 s, with 10 spectral accumulations and 50% filter laser power. The spectral data were collected using the LabSpec 6.7.1 (Horiba, France) software (Supplementary Material S1). According to Huang et al. [29], the spectral resolution was 3 cm−1; and the calibration was checked by 520.47 cm−1 line of silicon [10]. Following Kolašinac et al. [10], five replicates of each fruit variety were used in this study to overcome sample inhomogeneity. Ten spectra were recorded for each replicate (50 spectra for each fruit variety), resulting in 150 spectra. The Raman peaks were assigned according to the published literature.

2.4. Chemometric Analysis

Raman spectral data were processed according to Ditta et al. and Gersony et al. [30,31]. The data were then smoothed, and a baseline correction, vector normalization, and substrate removal were performed using the Solo+MIA software 8.1 (Eigenvector, Research, Inc. Wenatchee, WA, USA). Dimensionality reduction of spectral intensities was performed using principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA).
PCA was performed to form clusters per type of fruit. Hotelling’s T2 and Q residual statistics were used to identify and remove the anomalous spectra that were considered outliers [25]. Hotelling’s T2 and Q residual statistics were calculated at 95% confidence. To perform the partial least squares discriminant analysis (PLS-DA), all data were divided into the calibration and test sets (75% and 25% of the samples, respectively) using the Kennard–Stone approach [10]. The highest values of sensitivity (SE), specificity (SP), and precision (P) were used to evaluate the goodness of the models based on the training and prediction data sets. Precision (P) indicates what fraction of predictions are positive as a positive class; sensitivity (SE), or true positive rate, refers to the percentage of samples of a given class that the model correctly recognizes as belonging to that class; specificity (SP), or true negative rate, refers to samples not belonging to a given class that the model correctly rejects [32].

3. Results and Discussion

3.1. Characterization of the Raman Spectra of Fruits

Diverse peaks were identified in these fruits with different intensities (Figure 1). The core of the pineapple contains significantly higher amounts of fructose and glucose [33]. Meanwhile, the soursop contains carbohydrates, proteins, folate, calcium, phosphorus, iron, and vitamin C, and [34,35,36] reported that the spectrum of sucrose presents the characteristic bands of α−glucose (847 cm−1) and β−fructose (868 cm−1). In this work, glucose was present with the highest intensity in cherimoya and pineapple, at 855.15 cm−1 and 851.93 cm−1, respectively, and with the lowest intensity in soursop (854.08 cm−1). Schulz and Baranska [36] identified other peaks related to sugars in carrots (1462, 1126, and 840 cm−1).
The assignments of molecule vibration associated with the sugars in tropical fruits were made according to the published literature (Table 1). Glucose is one of the most common hexoses with aldehydes in its structure [37], containing a carbonyl group (C-O stretch) [38]. This sugar was found at 1131.22 cm−1, 1134.44 cm−1, and 1133.37 cm−1 in cherimoya, soursop, and pineapple, respectively. Fructose, also known as ‘fruit sugar’, occurs in fruits and plants in a free state and as a building block of sucrose [37]. The Raman bands at 1472–1454 cm−1, corresponding to CH2 scissor vibrations [39], contain peaks at 1464.22 cm−1, 1467.44 cm−1, and 1464.22 cm−1 in cherimoya, soursop, and pineapple, respectively; nevertheless, the peak corresponding to fructose’s β(CCO) vibration was absent in soursop. Although RS allowed the identification of peaks associated with sugars in the three tropical fruits, it was not possible to quantify them. Therefore, it is also suggested that other techniques be used to complement the study. One technique that could be useful to validate the results is gas chromatography–mass spectrometry (GC-MS) analysis and profiling with the HS-SPME technique, since it is an effective tool due to its ability to recover and characterize compounds as mentioned by Mihaylova et al. [40].

3.2. Multivariate Analysis

Before performing any chemometric analysis, the raw spectra needed to be corrected because they contained fluorescence, as in Figure 2a. For this purpose, the spectra underwent a baseline correction with a seventh-order polynomial fitting [41,42], resulting in the spectra of Figure 2b, where the characteristic peaks of the, soursop, and pineapple chemical composition are distinguished. Raw spectra are available in the Supplementary Data file.

3.2.1. Principal Component Analysis

PCA is a chemometric method that allows us to explore patterns [43] and is almost always the first analysis performed on a multivariate data set [44]. The acquired spectral data set may be disturbed by random noise during acquisition; therefore, pre-processing methods are necessary to improve the PCA results. Pre-processing methods can reduce the random noise and improve the spectral characteristics of interest [45]. Figure 3 shows the results of the four pre-processing methods applied to the cherimoya, soursop, and pineapple spectral data sets. The mean centering method (Figure 3a) produced the least visible changes in the corrected spectra compared to the Savitzky–Golay first (Figure 3b), second (Figure 3c) and third (Figure 3d) derivative method. These effects can be observed in the intensities and positions of the peaks.
The changes in the model performance indicators are presented to highlight the effect of the pre-processing methods on the model [45]. Table 2 reports the root mean square error for cross-validation (RMSECV), which is an internal validation error estimated according to De Almeida et al. [46]. Table 2 shows that when the SG first derivative pre-processing method was used, the PCA model was composed of four principal components that captured 92.59% of the total data variance and 7.41% of the Q residuals (Table 3). Mata et al. [21] used a plot of Hotelling’s T2 vs. Q residuals to eliminate four spectra identified as outliers for the discrimination between transgenic and conventional cotton seeds. Figure 4 was obtained after removing two outliers and shows that cherimoya was separated from the other fruits using the four pre-processing methods. However, none became an outlier, since it was at most 95%.
The mean centering pre-processing method produced a three-component PCA model with a captured variance of 76.05%, which is the lowest captured variance among the models (Table 2). In the score plot, a better sample separation can be observed due to the formation of three well-defined clusters (Figure 5). Cherimoya and soursop, which belong to the Annonaceae family [47], form clusters closer to each other because they share many Raman peaks (including sugars), such that their chemical compositions are similar, unlike pineapple, which belongs to another family (Bromeliaceae) and has a different chemical composition. The separation of the clusters may be caused by the differences in the fruits’ phytochemical compound content, such as polyphenols, vitamins, fiber, and carotenoids. The loading plot of the first principal component produced by the four pre-processing methods (Figure 5) shows more pronounced peaks in the 1000–1200 cm−1 band, where the characteristic glucose peak (common to the three tropical fruits) is found.

3.2.2. Partial Least Square Discriminant Analysis

Since the discriminative power of a classification model is unknown, it is necessary to analyze it using different pre-processing methods [48]. Table 4 shows the results of the PLS-DA model applied to tropical fruits under the four pre-processing methods studied (all results are in Table 5). The mean centering pre-processing method and the first derivative of the Savitzky–Golay (SG) pre-processing method show the best prediction results, with a precision of 100% in all tropical fruits. On the training data set, the precision for mean centering and the first derivative of the SG pre-processing method was 93.06–100% and 98.68–100%, respectively. With the second and third derivatives of the SG pre-processing method, the precision was 80–100% and 84.44–100% on the prediction and training data sets, respectively. These values are higher than those reported by Kolašinac et al. [10], whose precision was 19.35–100% on the training data for classifying five varieties of Balkan paprika. These results varied because of the pre-processing method applied. Moreover, in the study of meat carcass classification using Raman spectroscopy, Logan et al. [32] found that the best PLS-DA model correctly classified 94% of grass-fed versus grain-fed carcasses. According to Table 4, the first derivative of the SG pre-processing method produced the best results in the PLS-DA classification model for the training data set, as it correctly classifies the samples in 98.68–100% of cases (P), the samples belonging to its class in 97.43–100% of cases (SE), and the samples not belonging to its class in 98.66–100% of cases (SP). All indicators for the prediction data set were 100%.
We also analyzed the classification power of the PLS-DA model through the receiver operating characteristic (ROC) curves (Figure 6). The ROC curves of cherimoya, soursop, and pineapple were located closer to the upper left corner of the ROC space, which indicates that the first derivative of the SG pre-processing method produces a PLS-DA model with optimal performance, since it can correctly classify the tropical fruits in the classes to which they belong. In all three fruits, the value of the area under the curve (AUC) was one. These results confirm the ROC curves.

4. Conclusions

Sucrose was detected in the three fruits in a spectral region between 1131 and 1134 cm−1. Glucose was detected with 855.15 cm−1 (cherimoya) and 851.93 cm−1 (pineapple) peaks. Finally, fructose appeared with peaks between 1464 and 1467 cm−1 in the three fruits. These results show the high sensitivity of RS for detecting sugars. Therefore, RS is an emerging technology that can provide robustness to data analysis obtained from conventional technologies such as chromatography or spectrophotometry. However, access to chromatography equipment is expensive and not sustainable.
Likewise, RS complemented by chemometric tools such as PCA and PLS-DA, as used in this study, made it possible to establish a high-precision classification model to discriminate among three fruits, thus being a method that can be used to discriminate fruits based on spectral fingerprints using non-destructive technology. Finally, since the fruits have sugar types in common, it is assumed that the discriminatory efficiency of PLS-DA is based not only on the sugar type but also on the content of polyphenols, vitamins, dietary fiber, carotenoids, and other chemical compounds. However, we recommend further studies to corroborate these findings.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/app14188476/s1, S1: Raw spectrum data file.

Author Contributions

Conceptualization, E.M.C.-A. and G.S.-L.; methodology, E.M.C.-A. and L.T.-V.; software, C.R.B.-Z. and E.M.C.-A.; validation, E.M.C.-A., I.S.C.-C. and M.B.; formal analysis, G.S.-L.; investigation, E.M.C.-A. and I.S.C.-C.; resources, J.L.M.-Q.; data curation, E.M.C.-A.; writing—original draft preparation, E.M.C.-A. and C.R.B.-Z.; writing—review and editing, I.S.C.-C. and C.R.B.-Z.; visualization, M.B. and J.L.M.-Q.; supervision, E.M.C.-A. and J.L.M.-Q.; project administration, L.D.M.-A.; funding acquisition, E.M.C.-A., G.S.-L. and C.R.B.-Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Consejo Nacional de Ciencia y Tecnología e Innovación Tecnológica–Concytec of the Peruvian Government (Project: Crystallization and polymorphism of cocoa butter in beans of different geographical origins and its impact on the quality of fine chocolates) grant number Nº PE501077920-2022—PROCIENCIA. And the APC was funded by the Vicerrectorado de Investigación, Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas.

Institutional Review Board Statement

No applicable.

Informed Consent Statement

No applicable.

Data Availability Statement

All the Supplementary Data can be found in this article.

Acknowledgments

The authors thank Project CUI N° 2343049—Creación de los Servicios de Investigación en Ingeniería de Alimentos y Poscosecha of the Universidad Nacional Toribio Rodríguez de Mendoza de Amazonas.

Conflicts of Interest

All authors declare no conflicts of interest.

References

  1. Mohan, K.; Muralisankar, T.; Uthayakumar, V.; Chandirasekar, R.; Revathi, N.; Ramu Ganesan, A.; Velmurugan, K.; Sathishkumar, P.; Jayakumar, R.; Seedevi, P. Trends in the Extraction, Purification, Characterisation and Biological Activities of Polysaccharides from Tropical and Sub-Tropical Fruits—A Comprehensive Review. Carbohydr. Polym. 2020, 238, 116185. [Google Scholar] [CrossRef] [PubMed]
  2. Isas, A.S.; Mariotti Celis, M.S.; Pérez Correa, J.R.; Fuentes, E.; Rodríguez, L.; Palomo, I.; Mozzi, F.; Van Nieuwenhove, C. Functional Fermented Cherimoya (Annona Cherimola Mill.) Juice Using Autochthonous Lactic Acid Bacteria. Food Res. Int. 2020, 138, 109729. [Google Scholar] [CrossRef] [PubMed]
  3. Perrone, A.; Yousefi, S.; Salami, A.; Papini, A.; Martinelli, F. Botanical, Genetic, Phytochemical and Pharmaceutical Aspects of Annona Cherimola Mill. Sci. Hortic. 2022, 296, 110896. [Google Scholar] [CrossRef]
  4. Monarres-Cuevas, O.; Alia-Tejacal, I.; Pérez-Arias, G.A.; López-Martínez, V.; Juarez-Lopez, P.; Valle-Guadarrama, S.; Rodríguez-Verastegui, L.L. Physical, Chemical, and Physiological Characterization of Chilling Injury in Soursop Fruit (Anonna Muricata L.). Postharvest Biol. Technol. 2022, 193, 112052. [Google Scholar] [CrossRef]
  5. Berumen-Varela, G.; Hernández-Oñate, M.A.; Tiznado-Hernández, M.E. Utilization of Biotechnological Tools in Soursop (Annona Muricata L.). Sci. Hort. 2019, 245, 269–273. [Google Scholar] [CrossRef]
  6. Zhou, D.; Liu, Q.; Wu, C.; Li, T.; Tu, K. Characterization of Soluble Sugars, Glycosidically Bound and Free Volatiles in Fresh-Cut Pineapple Stored at Different Temperature. Food Biosci. 2021, 43, 101329. [Google Scholar] [CrossRef]
  7. Dorey, E.; Fournier, P.; Léchaudel, M.; Tixier, P. Modeling Sugar Content of Pineapple under Agro-Climatic Conditions on Reunion Island. Eur. J. Agron. 2016, 73, 64–72. [Google Scholar] [CrossRef]
  8. Batsoulis, A.N.; Siatis, N.G.; Kimbaris, A.C.; Alissandrakis, E.K.; Pappas, C.S.; Tarantilis, P.A.; Harizanis, P.C.; Polissiou, M.G. FT-Raman Spectroscopic Simultaneous Determination of Fructose and Glucose in Honey. J. Agric. Food Chem. 2005, 53, 207–210. [Google Scholar] [CrossRef]
  9. González-Viveros, N.; Gómez-Gil, P.; Castro-Ramos, J.; Cerecedo-Núñez, H.H. On the Estimation of Sugars Concentrations Using Raman Spectroscopy and Artificial Neural Networks. Food Chem. 2021, 352, 129375. [Google Scholar] [CrossRef]
  10. Kolašinac, S.; Pećinar, I.; Danojević, D.; Stevanović, Z.D. Raman Spectroscopy Coupled with Chemometric Modeling Approaches for Authentication of Different Paprika Varieties at Physiological Maturity. LWT 2022, 162, 113402. [Google Scholar] [CrossRef]
  11. Wang, K.; Li, Z.; Li, J.; Lin, H. Raman Spectroscopic Techniques for Nondestructive Analysis of Agri-Foods: A State-of-the-Art Review. Trends Food Sci. Technol. 2021, 118, 490–504. [Google Scholar] [CrossRef]
  12. Monago-Maraña, O.; Afseth, N.K.; Knutsen, S.H.; Wubshet, S.G.; Wold, J.P. Quantification of Soluble Solids and Individual Sugars in Apples by Raman Spectroscopy: A Feasibility Study. Postharvest Biol. Technol. 2021, 180, 111620. [Google Scholar] [CrossRef]
  13. Xu, S.; Huang, X.; Lu, H. Advancements and Applications of Raman Spectroscopy in Rapid Quality and Safety Detection of Fruits and Vegetables. Horticulturae 2023, 9, 843. [Google Scholar] [CrossRef]
  14. Wu, L.; Tang, X.; Wu, T.; Zeng, W.; Zhu, X.; Hu, B.; Zhang, S. A Review on Current Progress of Raman-Based Techniques in Food Safety: From Normal Raman Spectroscopy to SESORS. Food Res. Int. 2023, 169, 112944. [Google Scholar] [CrossRef] [PubMed]
  15. Andersen, P.V.; Afseth, N.K.; Aaby, K.; Gaarder, M.Ø.; Remberg, S.F.; Wold, J.P. Prediction of Chemical and Sensory Properties in Strawberries Using Raman Spectroscopy. Postharvest Biol. Technol. 2023, 201, 112370. [Google Scholar] [CrossRef]
  16. Dehelean, A.; Cristea, G.; Feher, I.; Hategan, A.R.; Magdas, D.A. Differentiation of Transylvanian Fruit Distillates Using Supervised Statistical Tools Based on Isotopic and Elemental Fingerprint. J. Sci. Food Agric. 2023, 103, 1454–1463. [Google Scholar] [CrossRef]
  17. Fernández, J.A.; Abbas, O.; Dardenne, P.; Baeten, V. Discrimination of Corsican Honey by FT-Raman Spectroscopy and Chemometrics. Biotechnol. Agron. Soc. Environ. 2011, 15, 75–83. [Google Scholar]
  18. Li, W.; Tan, F.; Cui, J.; Ma, B. Fast Identification of Soybean Varieties Using Raman Spectroscopy. Vib. Spectrosc. 2022, 123, 103447. [Google Scholar] [CrossRef]
  19. Yazgan Karacaglar, N.N.; Bulat, T.; Boyaci, I.H.; Topcu, A. Raman Spectroscopy Coupled with Chemometric Methods for the Discrimination of Foreign Fats and Oils in Cream and Yogurt. J. Food Drug Anal. 2019, 27, 101–110. [Google Scholar] [CrossRef]
  20. Nakajima, S.; Kuroki, S.; Ikehata, A. Selective Detection of Starch in Banana Fruit with Raman Spectroscopy. Food Chem. 2023, 401, 134166. [Google Scholar] [CrossRef]
  21. da Mata, M.M.D.; Rocha, P.D.; Farias, I.K.T.D.; Silva, J.L.B.D.; Medeiros:, E.P.; Silva, C.S.; Simões, S.D.S. Distinguishing Cotton Seed Genotypes by Means of Vibrational Spectroscopic Methods (NIR and Raman) and Chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 266, 120399. [Google Scholar] [CrossRef] [PubMed]
  22. Alamar, P.D.; Caramês, E.T.S.; Poppi, R.J.; Pallone, J.A.L. Detection of Fruit Pulp Adulteration Using Multivariate Analysis: Comparison of NIR, MIR and Data Fusion Performance. Food Anal. Methods 2020, 13, 1357–1365. [Google Scholar] [CrossRef]
  23. Sun, M.; Jiang, H.; Yuan, W.; Jin, S.; Zhou, H.; Zhou, Y.; Zhang, C. Discrimination of Maturity of Camellia Oleifera Fruit On-Site Based on Generative Adversarial Network and Hyperspectral Imaging Technique. Food Meas. 2024, 18, 10–25. [Google Scholar] [CrossRef]
  24. Balcázar-Zumaeta, C.R.; Castro-Alayo, E.M.; Cayo-Colca, I.S.; Idrogo-Vásquez, G.; Muñoz-Astecker, L.D. Metabolomics during the Spontaneous Fermentation in Cocoa (Theobroma Cacao L.): An Exploraty Review. Food Res. Int. 2023, 163, 112190. [Google Scholar] [CrossRef]
  25. Kniese, J.; Race, A.M.; Schmidt, H. Classification of Cereal Flour Species Using Raman Spectroscopy in Combination with Spectra Quality Control and Multivariate Statistical Analysis. J. Cereal Sci. 2021, 101, 103299. [Google Scholar] [CrossRef]
  26. Deng, H.; He, R.; Long, M.; Li, Y.; Zheng, Y.; Lin, L.; Liang, D.; Zhang, X.; Liao, M.; Lv, X.; et al. Comparison of the Fruit Volatile Profiles of Five Muscadine Grape Cultivars (Vitis Rotundifolia Michx.) Using HS-SPME-GC/MS Combined With Multivariate Statistical Analysis. Front. Plant Sci. 2021, 12, 728891. [Google Scholar] [CrossRef]
  27. Balcázar-Zumaeta, C.R.; Pajuelo-Muñoz, A.J.; Trigoso-Rojas, D.F.; Iliquin-Chavez, A.F.; Fernández-Romero, E.; Yoplac, I.; Muñoz-Astecker, L.D.; Rodríguez-Hamamura, N.; Maza Mejía, I.M.; Cayo-Colca, I.S.; et al. Reduction in the Cocoa Spontaneous and Starter Culture Fermentation Time Based on the Antioxidant Profile Characterization. Foods 2023, 12, 3291. [Google Scholar] [CrossRef]
  28. Castro-Alayo, E.M.; Torrejón-Valqui, L.; Cayo-Colca, I.S.; Cárdenas-Toro, F.P. Evaluation of the Miscibility of Novel Cocoa Butter Equivalents by Raman Mapping and Multivariate Curve Resolution–Alternating Least Squares. Foods 2021, 10, 3101. [Google Scholar] [CrossRef]
  29. Huang, Y.; Karashima, T.; Yamamoto, M.; Hamaguchi, H. Molecular-level Pursuit of Yeast Mitosis by Time- and Space-resolved Raman Spectroscopy. J Raman Spectrosc. 2003, 34, 1–3. [Google Scholar] [CrossRef]
  30. Ditta, A.; Nawaz, H.; Mahmood, T.; Majeed, M.I.; Tahir, M.; Rashid, N.; Muddassar, M.; Al-Saadi, A.A.; Byrne, H.J. Principal Components Analysis of Raman Spectral Data for Screening of Hepatitis C Infection. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 221, 117173. [Google Scholar] [CrossRef]
  31. Gersony, J.T.; McClelland, A.; Holbrook, N.M. Raman Spectroscopy Reveals High Phloem Sugar Content in Leaves of Canopy Red Oak Trees. New Phytol. 2021, 232, 418–424. [Google Scholar] [CrossRef] [PubMed]
  32. Logan, B.G.; Hopkins, D.L.; Schmidtke, L.M.; Fowler, S.M. Assessing Chemometric Models Developed Using Raman Spectroscopy and Fatty Acid Data for Northern and Southern Australian Beef Production Systems. Meat Sci. 2022, 187, 108753. [Google Scholar] [CrossRef] [PubMed]
  33. Banerjee, S.; Ranganathan, V.; Patti, A.; Arora, A. Valorisation of Pineapple Wastes for Food and Therapeutic Applications. Trends Food Sci. Technol. 2018, 82, 60–70. [Google Scholar] [CrossRef]
  34. Agu, K.C.; Okolie, P.N. Proximate Composition, Phytochemical Analysis, and in Vitro Antioxidant Potentials of Extracts of Annona Muricata (Soursop). Food Sci. Nutr. 2017, 5, 1029–1036. [Google Scholar] [CrossRef]
  35. Leivas, C.; Cantu-Jungles, T.; Barbosa, B.; De Paula Werner, M.; Iacomini, M.; Cordeiro, L.M.C. Investigation of the Chemical Structure and Analgesic and Anti-Inflammatory Properties of Polysaccharides That Constitute the Dietary Fibers of Soursop (Annona Muricata) Fruit. Food Res. Int. 2023, 166, 112588. [Google Scholar] [CrossRef]
  36. Schulz, H.; Baranska, M. Identification and Quantification of Valuable Plant Substances by IR and Raman Spectroscopy. Vib. Spectrosc. 2007, 43, 13–25. [Google Scholar] [CrossRef]
  37. Wiercigroch, E.; Szafraniec, E.; Czamara, K.; Pacia, M.Z.; Majzner, K.; Kochan, K.; Kaczor, A.; Baranska, M.; Malek, K. Raman and Infrared Spectroscopy of Carbohydrates: A Review. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2017, 185, 317–335. [Google Scholar] [CrossRef]
  38. Zheng, Z.; Yao, X.; Zhang, S.; Fang, L.; Jiang, Z. In-Situ Raman Study of α-D-Glucose under Different Pressure and Temperature. J. Mol. Struct. 2023, 1274, 134539. [Google Scholar] [CrossRef]
  39. Yao, X.; Fu, C.; Zhang, S.; Cheng, L.; Jiang, Z. Structure Investigation of β-D-Fructose Crystal under High Pressure: Raman Scattering, IR Absorption, and Synchrotron X-Ray Diffraction. J. Mol. Struct. 2020, 1220, 128746. [Google Scholar] [CrossRef]
  40. Mihaylova, D.; Popova, A.; Dincheva, I.; Pandova, S. HS-SPME-GC–MS Profiling of Volatile Organic Compounds and Polar and Lipid Metabolites of the “Stendesto” Plum–Apricot Kernel with Reference to Its Parents. Horticulturae 2024, 10, 257. [Google Scholar] [CrossRef]
  41. Guo, Y.; Jin, W.; Wang, W.; He, Y.; Qiu, S. Baseline Correction for Raman Spectra Using a Spectral Estimation-Based Asymmetrically Reweighted Penalized Least Squares Method. Appl. Opt. AO 2023, 62, 4766–4776. [Google Scholar] [CrossRef] [PubMed]
  42. Hu, H.; Bai, J.; Xia, G.; Zhang, W.; Ma, Y. Improved Baseline Correction Method Based on Polynomial Fitting for Raman Spectroscopy. Photonic Sens 2018, 8, 332–340. [Google Scholar] [CrossRef]
  43. Brereton, R.G. Applied Chemometrics for Scientists, 1st ed.; Wiley: Hoboken, NJ, USA, 2007; ISBN 978-0-470-01686-2. [Google Scholar]
  44. Esbensen, K.H.; Geladi, P. 2.13-Principal Component Analysis: Concept, Geometrical Interpretation, Mathematical Background, Algorithms, History, Practice. In Comprehensive Chemometrics; Elsevier: Oxford, UK, 2009; pp. 211–226. ISBN 978-0-444-52701-1. [Google Scholar]
  45. Roger, J.-M.; Boulet, J.-C.; Zeaiter, M.; Rutledge, D.N. 3. 01-Pre-Processing Methods. In Comprehensive Chemometrics; Elsevier: Oxford, UK, 2020; pp. 1–75. ISBN 978-0-444-64166-3. [Google Scholar]
  46. De Almeida, M.R.; Correa, D.N.; Rocha, W.F.C.; Scafi, F.J.O.; Poppi, R.J. Discrimination between Authentic and Counterfeit Banknotes Using Raman Spectroscopy and PLS-DA with Uncertainty Estimation. Microchem. J. 2013, 109, 170–177. [Google Scholar] [CrossRef]
  47. Padmanabhan, P.; Paliyath, G. Annonaceous Fruits. In Encyclopedia of Food and Health; Caballero, B., Finglas, P.M., Toldrá, F., Eds.; Academic Press: Oxford, UK, 2016; pp. 169–173. ISBN 978-0-12-384953-3. [Google Scholar]
  48. Devos, O.; Downey, G.; Duponchel, L. Simultaneous Data Pre-Processing and SVM Classification Model Selection Based on a Parallel Genetic Algorithm Applied to Spectroscopic Data of Olive Oils. Food Chem. 2014, 148, 124–130. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Raman spectra of glucose, fructose, and sucrose in cherimoya, soursop, and pineapple.
Figure 1. Raman spectra of glucose, fructose, and sucrose in cherimoya, soursop, and pineapple.
Applsci 14 08476 g001
Figure 2. Chemometric analysis of raw (a) and corrected (b) Raman spectra of cherimoya, soursop, and pineapple.
Figure 2. Chemometric analysis of raw (a) and corrected (b) Raman spectra of cherimoya, soursop, and pineapple.
Applsci 14 08476 g002aApplsci 14 08476 g002b
Figure 3. Spectrum pre-processed with (a) mean centering, (b) the first derivative of the Savitzky–Golay method, (c) the second derivative of the Savitzky–Golay method, and (d) the third derivative of the Savitzky–Golay method.
Figure 3. Spectrum pre-processed with (a) mean centering, (b) the first derivative of the Savitzky–Golay method, (c) the second derivative of the Savitzky–Golay method, and (d) the third derivative of the Savitzky–Golay method.
Applsci 14 08476 g003
Figure 4. Q vs Hotelling’s T2 in the PCA model. Both parameters are below 95%. Pre-processing methods: (a) mean centering method, (b) first derivative of the Savitzky–Golay method, (c) second derivative of the Savitzky–Golay method, and (d) third derivative of the Savitzky–Golay method.
Figure 4. Q vs Hotelling’s T2 in the PCA model. Both parameters are below 95%. Pre-processing methods: (a) mean centering method, (b) first derivative of the Savitzky–Golay method, (c) second derivative of the Savitzky–Golay method, and (d) third derivative of the Savitzky–Golay method.
Applsci 14 08476 g004
Figure 5. PCA scores and loadings with different pre-processing methods: (a,b) mean centering, (c,d) first derivative of Savitzky–Golay, (e,f) second derivative of Savitzky–Golay, and (g,h) third derivative of Savitzky–Golay method.
Figure 5. PCA scores and loadings with different pre-processing methods: (a,b) mean centering, (c,d) first derivative of Savitzky–Golay, (e,f) second derivative of Savitzky–Golay, and (g,h) third derivative of Savitzky–Golay method.
Applsci 14 08476 g005aApplsci 14 08476 g005b
Figure 6. ROC curves for the first derivative of the Savitzky–Golay pre-processing method.
Figure 6. ROC curves for the first derivative of the Savitzky–Golay pre-processing method.
Applsci 14 08476 g006
Table 1. Assignments of Raman peaks for sugar standards and cherimoya, soursop, and pineapple.
Table 1. Assignments of Raman peaks for sugar standards and cherimoya, soursop, and pineapple.
Assignment 1LiteratureFructoseGlucoseSucroseCherimoyaSoursopPineapple
δ (CH2)1460 [37]1473.931471.741472.811464.221467.441464.22
ρ (CH2)1346 [37]1356.801359.731355.731355.731352.501353.58
ν (CC), ν (CO), β (COH)1142 [37] 1133.371133.371133.371131.221134.441133.37
ν (CC), ν (CO)1074 [37] 1080.731081.811080.731078.581085.031085.03
CH, COH bending924 [38]932.49929.27933.57928.20928.20918.53
ν (CH)867 [38] 859.45853.00860.52855.15854.08851.93
β (CCO)596 [39]594.12 595.20601.64 595.12
β (CCO), β (CCC)527 [39] 536.12 537.19536.12535.04535.04
β (CCC)465 [39] 456.63455.55457.70453.40455.55455.55
β (CCC),β (CCO),β (OCO)424 [38]416.88415.81416.88417.95
1 ν–stretching, β–in-plane bending.
Table 2. Effect of four data pre-processing methods on the PCA model for cherimoya, soursop, and pineapple.
Table 2. Effect of four data pre-processing methods on the PCA model for cherimoya, soursop, and pineapple.
Pre-Processing MethodPCRMSECV% Variance Captured TotalQ ResidualHotelling’s T2
Mean centering 3254.1876.0523.9576.05
First derivative of Savitzky–Golay444.0692.597.4192.59
Second derivative of Savitzky–Golay525.8691.108.9091.10
Third derivative of Savitzky–Golay539.3281.0618.9481.06
Table 3. Results of the PCA model with different pre-processing methods.
Table 3. Results of the PCA model with different pre-processing methods.
Pre-Processing MethodsRMSECRMSECVPrincipal Component NumberEigenvalue of Cov (X)% Variance Captured This PC% Variance Captured Total
Mean centering180.47254.1811.29 × 10739.2939.29
26.36× 10619.3458.64
35.72 × 10617.4276.05
First derivative of Savitzky–Golay31.3044.0612.17 × 10668.5268.52
23.81 × 10512.0280.54
32.75 × 1058.6689.20
41.08 × 1053.3992.59
Secondderivative ofSavitzky–Golay19.2025.8615.47 × 10554.9654.96
21.90 × 10519.0674.01
38.61 × 1048.6582.67
44.62 × 1044.6587.31
53.77 × 1043.7991.10
Thirdderivative ofSavitzky–Golay29.1539.3115.12 × 10547.5747.57
21.72 × 10515.9663.53
38.88 × 1048.2471.77
46.25 × 1045.8077.57
53.75 × 1043.4881.06
Table 4. Classification results of the PLS-DA model.
Table 4. Classification results of the PLS-DA model.
Pre-Processing MethodSweet Fruit Training Data SetPrediction Data Set
P 1 (%)SE 2 (%)SP 3 (%)P (%)SE (%)SP (%)
Mean centeringCherimoya 93.0693.8797.98100.00100.00100.00
Soursop 97.0297.9596.97100.00100.00100.00
Pineapple 100.0098.00100.00100.00100.00100.00
First derivative of Savitzky–GolayCherimoya 100.0097.43100.00100.00100.00100.00
Soursop 98.68100.0098.66100.00100.00100.00
Pineapple 100.00100.00100.00100.00100.00100.00
Second derivative of Savitzky–GolayCherimoya 96.6854.8398.11100.0012.50100.00
Soursop 84.4492.0083.0580.00100.0075.00
Pineapple 100.0089.28100.00100.00100.00100.00
Third derivative of Savitzky–GolayCherimoya 98.1371.7998.61100.0020.00100.00
Soursop 87.9497.2286.6780.5969.2383.33
Pineapple 98.68100.0098.67100.00100.00100.00
1 Precision (P), 2 Sensitivity (SE) or true positive rate, 3 Specificity (SP) or true negative rate.
Table 5. Result of the PLS-DA model with different pre-processing methods.
Table 5. Result of the PLS-DA model with different pre-processing methods.
Pre-Processing MethodModeled ClassCherimoyaSoursopPineapple
Mean centeringSensitivity (Cal)0.9591.0001.000
Specificity (Cal)0.9290.9491.000
Sensitivity (CV)0.9391.0001.000
Specificity (CV)0.9090.9491.000
Class. Err (Cal)0.05576170.02525250
Class. Err (CV)0.07606680.02525250
RMSEC0.314260.2733950.217022
RMSECV0.3336920.2895130.229203
Bias1.66533× 10−161.66533× 10−161.11022 × 10−16
CV Bias0.00127205−0.001849870.000577824
R2 Cal0.5540660.6624990.78946
R2 CV0.5012050.6229380.766066
First derivative of Savitzky–GolaySensitivity (Cal)0.9741.0001.000
Specificity (Cal)0.9720.9871.000
Sensitivity (CV)0.8970.9721.000
Specificity (CV)0.9580.9871.000
Class. Err (Cal)0.02670940.006666670
Class. Err (CV)0.07211540.02055560
RMSEC0.262070.1970140.198749
RMSECV0.3112810.2529410.209272
Bias−0.009154750.004086620.00506812
CV Bias0.00775709−0.01457670.00681962
R2 Cal0.6990080.8229520.819861
R2 CV0.5952650.7176180.800737
Secondderivative ofSavitzky–GolaySensitivity (Cal)0.5481.0000.964
Specificity (Cal)0.9060.8311.000
Sensitivity (CV)0.5480.9600.929
Specificity (CV)0.8870.8311.000
Class. Err (Cal)0.2729760.08474580.0178571
Class. Err (CV)0.282410.1047460.0357143
RMSEC0.3976850.3433630.242586
RMSECV0.4008280.3510060.244279
Bias0.0964188−0.1315570.0351384
CV Bias0.0958097−0.134770.0389599
R2 Cal0.3663970.5382070.740784
R2 CV0.3564270.5181020.740114
Thirdderivative ofSavitzky–GolaySensitivity (Cal)0.8210.9441.000
Specificity (Cal)0.8890.8130.987
Sensitivity (CV)0.8210.9171.000
Specificity (CV)0.8750.8130.973
Class. Err (Cal)0.1452990.1211110.00666667
Class. Err (CV)0.1522440.1350.0133333
RMSEC0.3489220.321590.208805
RMSECV0.3618740.3363870.234444
Bias−0.005847020.0004217020.00542532
CV Bias−0.00528552−0.002001150.00728667
R2 Cal0.4659470.5280590.801175
R2 CV0.4259250.4861490.749947
Cal: calibration; CV: cross-validation; RMSEC: root mean square error calibration, RMSECV: root mean square error calibration of cross-validation.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Balcázar-Zumaeta, C.R.; Maicelo-Quintana, J.L.; Salón-Llanos, G.; Barrena, M.; Muñoz-Astecker, L.D.; Cayo-Colca, I.S.; Torrejón-Valqui, L.; Castro-Alayo, E.M. A Novel Technique Using Confocal Raman Spectroscopy Coupled with PLS-DA to Identify the Types of Sugar in Three Tropical Fruits. Appl. Sci. 2024, 14, 8476. https://doi.org/10.3390/app14188476

AMA Style

Balcázar-Zumaeta CR, Maicelo-Quintana JL, Salón-Llanos G, Barrena M, Muñoz-Astecker LD, Cayo-Colca IS, Torrejón-Valqui L, Castro-Alayo EM. A Novel Technique Using Confocal Raman Spectroscopy Coupled with PLS-DA to Identify the Types of Sugar in Three Tropical Fruits. Applied Sciences. 2024; 14(18):8476. https://doi.org/10.3390/app14188476

Chicago/Turabian Style

Balcázar-Zumaeta, César R., Jorge L. Maicelo-Quintana, Geidy Salón-Llanos, Miguel Barrena, Lucas D. Muñoz-Astecker, Ilse S. Cayo-Colca, Llisela Torrejón-Valqui, and Efraín M. Castro-Alayo. 2024. "A Novel Technique Using Confocal Raman Spectroscopy Coupled with PLS-DA to Identify the Types of Sugar in Three Tropical Fruits" Applied Sciences 14, no. 18: 8476. https://doi.org/10.3390/app14188476

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop