Next Article in Journal
Probiotic Potential of a Folate-Producing Strain Latilactobacillus sakei LZ217 and Its Modulation Effects on Human Gut Microbiota
Next Article in Special Issue
HOG-SVM Impurity Detection Method for Chinese Liquor (Baijiu) Based on Adaptive GMM Fusion Frame Difference
Previous Article in Journal
Quality of Protein Isolates and Hydrolysates from Baltic Herring (Clupea harengus membras) and Roach (Rutilus rutilus) Produced by pH-Shift Processes and Enzymatic Hydrolysis
Previous Article in Special Issue
Online Detection of Watercore Apples by Vis/NIR Full-Transmittance Spectroscopy Coupled with ANOVA Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nondestructive Prediction of Isoflavones and Oligosaccharides in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopic Techniques

1
Department of Biosystems Machinery Engineering, College of Agricultural and Life Science, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Korea
2
Department of Agricultural and Biosystems Engineering, Faculty of Agricultural Technology, Gadjah Mada University, Yogyakarta 55281, Indonesia
3
Department of Herbal Medicine Resource, Dogye Campus, Kangwon National University, Samcheok 25949, Korea
4
Department of Gene Engineering, National Institute of Agricultural Sciences, Rural Development Administration, Jeonju 54874, Korea
5
Environmental Microbial and Food Safety Laboratory, Agricultural Research Service, United States Department of Agriculture, Powder Mill Road, BARC-East, Bldg 303, Beltsville, MD 20705, USA
6
Department of Smart Agriculture Systems, College of Agricultural and Life Science, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Korea
*
Author to whom correspondence should be addressed.
Foods 2022, 11(2), 232; https://doi.org/10.3390/foods11020232
Submission received: 11 December 2021 / Revised: 10 January 2022 / Accepted: 14 January 2022 / Published: 16 January 2022
(This article belongs to the Special Issue Nondestructive Optical Sensing for Food Quality and Safety Inspection)

Abstract

:
The demand for rapid and nondestructive methods to determine chemical components in food and agricultural products is proliferating due to being beneficial for screening food quality. This research investigates the feasibility of Fourier transform near-infrared (FT-NIR) and Fourier transform infrared spectroscopy (FT-IR) to predict total as well as an individual type of isoflavones and oligosaccharides using intact soybean samples. A partial least square regression method was performed to develop models based on the spectral data of 310 soybean samples, which were synchronized to the reference values evaluated using a conventional assay. Furthermore, the obtained models were tested using soybean varieties not initially involved in the model construction. As a result, the best prediction models of FT-NIR were allowed to predict total isoflavones and oligosaccharides using intact seeds with acceptable performance (R2p: 0.80 and 0.72), which were slightly better than the model obtained based on FT-IR data (R2p: 0.73 and 0.70). The results also demonstrate the possibility of using FT-NIR to predict individual types of evaluated components, denoted by acceptable performance values of prediction model (R2p) of over 0.70. In addition, the result of the testing model proved the model’s performance by obtaining a similar R2 and error to the calibration model.

1. Introduction

The global market related to functional food has been increasing considerably in the last decade. In Asia, the market is predicted to continue to grow due to the high population [1]. In terms of functional foods, soybeans are one of the most interesting products to be evaluated due to the complex chemical compound inside. Soybeans are one of the world’s most protein sources and are the primary protein supply in animal feed. It is also reported as a leading share in the global oilseed output and contributes to 60% of the world’s meal production. Wilson in Roger-Boerma [2] reported that protein and oil content in soybean ranges from 341 to 568 g/kg (34.1–56.8%) and 83 to 279 g/kg (8.3–27.9%) of total seed weight, respectively. The remaining macronutrient in soybean is carbohydrates (including dietary fiber and soluble carbohydrates), which account for about 35% of a moisture-free soybean [3].
The composition of soybean carbohydrates was already described in detail by Lilienthal et al. [4] On average, mature soybeans contain 15% oligosaccharides, and about 50% of them (6–7%) are sucrose. The raffinose families (stachyose, raffinose, and verbascose) are available in 4–5%, 1–2%, and below 1%, respectively. It has been a long time since soybean oligosaccharides were considered to cause flatulence [5], but they have also been approved as a prebiotic material to improve the quality of microflora in the human intestine [6]. Due to their economic value to the food and feed industries, efforts to recover these components from defatted soybean meal have been carried out. For example, the sweetener industry has applied new technology to make a commercialized high-oligosaccharides syrup from soybean. Moreover, the attractive effect of galactooligosaccharides in soybean has also arrested the attention of researchers to perform an in-depth evaluation for future commercialization.
Despite containing interested oligosaccharides components, soybean also contains isoflavones that benefit human health. The chemical structure of isoflavones, which have a phenolic structure similar to estrogen, allows them to bind to estrogen receptors and exhibit estrogenic activity [7,8]. Moreover, isoflavones and dietary food containing high isoflavones concentrations have been reported as possible anticarcinogens. In addition, several works and reviews reported the relationship between isoflavones and the relief of chronic diseases in humans such as prostate cancer [9], obesity and diabetes [10,11], osteoporosis [12], and the incidence of cardiovascular disease [13]. These studies have opened doors to increase the consumption of traditional soy-based food (soy milk, tofu, natto, tempeh, etc.) and modern food with soy-based ingredients. Furthermore, commercial isolate soy isoflavones are available in the market as pills, concentrates, and extracts.
The increased demand for bioactive compounds encourages the breeding industry to produce enriched new lines of seeds to fulfill consumer demands. Therefore, seed selection will play a key role in providing high-quality seed related to the targeted bioactive components. On the other hand, controlling the quality of enriched bioactive compound products before reaching the consumer’s hand is also crucial to avoid excessive producer claims. From this point of view, a precise and appropriate technique to evaluate the content of the specified biological components is necessary.
Modern laboratories in chemistry have approached many precise measurement methods to determine the chemical composition inside the sample, including high-performance liquid chromatography (HPLC), mass spectrometry (MS), and capillary electrophoresis (CE). These methods have proved their preciseness and reproducibility in determining the macro- and micro-chemical components of soybeans [14,15,16,17,18,19]. However, those methods cannot address environmental issues related to the high amount of chemical waste, as well as being highly cost- and labor-intensive. In addition, these methods destroy the sample during the preparation process prior to analysis, which meets the limitation of the breeding company in the seed-selection process. On the other hand, the determination of seed quality is a meaningful consideration for producers to achieve successful production [20]. Hence, the development of nondestructive techniques is demanding.
One such nondestructive approach combines vibrational spectroscopic techniques with chemometric analysis, successfully applied in food and agricultural products’ qualitative and quantitative determination. For example, MIR spectroscopy has been used to predict sugar in apple juice [21] and detect hairtail surimi in mixed surimi [22]. On the other hand, NIR has also been successfully used to determine the quality parameter of maize including protein, carbohydrate, oil, and ash [23]. Furthermore, Masithoh et al. [24] compare the effectiveness of NIR and MIR to determine protein and glucose content in the tuber and root flour. In the case of soybean, NIR and MIR have been used to determine soybean important components, including protein, moisture, lipids, and ash [25,26], and micro components such as saponins, fatty acids, flavonoids, and isoflavonoids [27]. In addition, the two most common anthocyanins in soybeans have also been successfully predicted based on the spectra of NIR [28]. However, all of the previous research for soybean collected spectral data using a powder sample type.
An investigation of soybean chemical components based on the data obtained from seed samples has also been carried out. Kovalenko et al. [29] evaluated amino acids content in soybean based on NIR spectral data of the seeds. Moreover, three different sample types (single-bean, whole bean, and powder) have been investigated to determine their effects on the success of predicting protein and amino acids in soybeans [30], while FT-NIR and FT-IR (in MIR region) spectra from intact seed have also been successfully used to predict anthocyanin in soybean [31]. The reports reveal that using a single bean to obtain NIR and MIR spectral data is a promising strategy for nondestructive chemical component evaluation of soybeans. However, soybean protein and amino acid content are relatively high (about 40% dw), and anthocyanin is only found on the bean surface.
Our concern in this study was soybean isoflavones and oligosaccharides, which are available in low concentrations and distributed in the whole bean part. Therefore, the effectiveness of FT-NIR and FT-IR was investigated to predict the above-mentioned targeted components based on the spectral data of a single bean. In addition, a more detailed investigation of the individual type of each microcomponent has also been carried out and presented side-by-side with the results of powder samples.

2. Materials and Methods

2.1. Sample Preparation

In this research, 310 soybean samples were used to develop models to predict the examined soybean components. These samples were supplied by Rural Development Administration (RDA), South Korea, and among them, seventy samples had black color coats. The bean size was varied based on the sample ranging from 2.90 to 7.90 mm. In addition, the bean size within a single sample was also varied, which possibly affected the chemical content among different seeds [32]. Therefore, the seed’s chemical composition variability will affect the spectral data measured by every instrument. On the other hand, the evaluation of chemical components was constrained by the minimum weight requirement in the extraction process prior to the HPLC process. Hence, in this study, 30 g of soybean seeds were collected from each examined sample, and among them, twenty-one beans were selected randomly for the spectral data acquisition using Fourier transform near-infrared (FT-NIR) and Fourier transform infrared (FT-IR) systems. Therefore, there were 6510 (21 bean × 310 samples) spectral data collected for every instrument, which were then divided into calibration and validation datasets about 70% and 30%, respectively. After being measured using both instruments, all 30 g of beans were ground, obtaining powder that passed through the sieve designation mesh no 60, with a sieve opening at 250 microns. The spectral data for the powder samples were then collected using both instruments in the same instrumental setup. Finally, the powder samples were sent for chemical analysis to obtain reference values for further analysis.
The additional 65 soybean samples containing yellow soybeans (50 samples) and black soybean (15 samples), which were not initially involved in the model construction and obtained from the same institution, were used for testing the resulting model. Twenty-one beans were also selected randomly from each sample and collected their spectral data using both instruments.

2.2. Spectral Data Acquisition

2.2.1. FT-NIR Spectroscopy

Laboratory FT-NIR spectrometer Antaris II FT-NIR analyzer (Thermo Scientific Co. Waltham, MA, USA) was performed to acquire the diffuse reflectance spectral data of soybean seed in range of waveband between 10,000 and 4000 cm−1 (1000–2500 nm). A single bean was placed in the sample holder and scanned 32 times to obtain average spectral data in the specified waveband at 4 cm−1 intervals. The interference of environmental light during the measurement process was blocked by covering the sample holder with a black lid (Figure 1A). Similar techniques and instrumental settings were applied to collect spectral data of powder samples.

2.2.2. FT-IR Spectroscopy

The spectra of soybean seeds and powder samples were collected using an FT-IR spectrometer Nicolet 6700 (Thermo Scientific Co., Waltham, MA, USA) equipped with an attenuated total reflectance (ATR) sampling mode. The absorbance spectra of each seed were collected in the MIR wavelength region ranging from 4000 to 400 cm−1 (2500–25,000 nm) at the interval of 4 cm−1 spectral resolution. During the measurement, soybean seed was placed on the surface of the diamond crystal and clamped using a pointed tip (Figure 1B), while the background was collected by scanning an empty plate. As a result, the average spectrum of 32 successive scans of each seed was obtained for further analysis. In this study, the FT-IR with an ATR sampling model was selected due to only requiring a very small quantity of sample [33], allowing the analysis of the sample in its natural state. Moreover, the FT-IR ATR also enables us to measure directly onto a solid-state sample surface by pressing the sample towards an ATR crystal with no sample preparation.

2.3. Reference Value Investigation

2.3.1. Chemicals

Pure water (18 MΩ cm−1) was obtained from a Milli-Q water purification system (Millipore, Bedford, MA, USA). HPLC analytical grade water, methanol, ethanol, acetonitrile, and acetic acid were obtained from J.T. Baker Co (Phillipsburg, NJ, USA). Oligosaccharide standards, as stachyose trihydrate, raffinose, and sucrose, were obtained from Sigma Chemical Co. (St. Louis, MO, USA). All isoflavone standards, 6″-O-malonyl daidzin, 6″-O-malonyl glycitin, 6″-O-malonyl genistin, 6″-O-acetyl daidzin, 6″-O-acetyl glycitin, 6″-O-acetyl genistin, daidzin, glycitin, genistin, daidzein, glycitein, and genistein were purchased from Carbosynth company (Compton, Berkshire, UK). All other laboratory chemicals used in this study were reagent grade.

2.3.2. Isoflavones Determination

The isoflavones determination was carried out based on the procedure used by previous researchers [34] with some modifications. One gram of soybean sample was extracted with 70% EtOH (40 mL) at room temperature for 24 h. The 70% EtOH extract was filtered with Whatman No. 6 (Whatman Inc., Maidstone, UK) filter paper. The 20 mL of 70% EtOH extract was evaporated under 40 °C and dissolved in 70% EtOH (5 mL). Before analysis, all samples were filtered through a 0.45 µm membrane filter (Whatman Inc., Maidstone, UK)
An Agilent 1260 series HPLC system (Wilmington, DE, USA) equipped with a photodiode array detector operating at 260 nm was used for analysis. Separations were performed on a Venusil C18 column (5 µm; 250 × 4.6 mm i.d., Agela Technologies, Torrance, CA, USA) protected with a Nova-Pak C18 guard insert column (Waters, Milford, MA, USA). The column temperature was maintained at 30 °C with the mobile phases consisting of solvent (A): 0.1% (v/v) acetic acid in the water, and solvent (B): 0.1% (v/v) acetic acid in acetonitrile.
The gradient elution profile was as follows: 0 min, 85% A, 0–60 min, 85–70% A; 60–65 min, 70–60% A; 66 min, 85% A; 66–75 min, 85% A. The flow rate was 1.0 mL/min, and the injection volume was 20 µL. The isoflavone contents were calculated by comparing HPLC peak areas with external standard calibration curves. The linear standard calibration curves (R2 = 0.999) were generated by injecting 0–20 µg of isoflavone standards in 1 mL of 80% methanol.

2.3.3. Oligosaccharides Determination

In brief, soybean powder (1.0 g) was extracted with 70% EtOH (10 mL) at room temperature for 24 h and then centrifuged at 5000× g for 20 min. Next, the supernatant was filtered with a Sep-Pak C18 solid-phase extraction cartridge (Waters, Milford, MA, USA), and the remaining residue was then dissolved in water. Finally, the diluted extract (20 µL) was injected into an HPLC system.
Oligosaccharides were analyzed using an Agilent 1200 series (Wilmington, DE, USA) HPLC system with Sedex 75 ELSD (Evaporative Light Scattering Detector, Sedere, Alfortville, France). The diluted extract (20 µL) was injected into an HPLC system equipped with a Shodex HILICpak VG-50 4E (4.6 × 250 mm, 5 μm, Showa Denko Co., Tokyo, Japan) column. The column temperature was 30 °C, and the mobile phase (flow rate 1.5 mL/min) was a linear gradient prepared from acetonitrile (A) and water (B). The gradient program was (time,% A): 0–20 min, 80–50%; 21–27 min, 20%; 28 min, 80%; 28–38 min, 80%. The temperature for the flow shift tub in ELSD was 45 °C. The flow rate of N2 was 1.0 bar. The contents of oligosaccharides were calculated by comparing the HPLC peak areas with the external standard calibration curves. The standard calibration curves (R2 = 0.999) were generated by injecting 0–300 mg of three oligosaccharide standards in 1 mL of water.

2.4. Spectral Data Preprocessing and Multivariate Analysis

All spectral data collected using FT-NIR and FT-IR were subjected to several preprocessing methods, including normalization (mean, range, and max), standard normal variate (SNV); Multiplicative Scatter Correction (MSC); and Savitzky-Golay (SG) first and second derivative methods. This process aims to remove noise generated during the measurement due to instrumental or environmental effects [35,36]. The obtained preprocessed spectral data were then divided into calibration and validation datasets by 70% and 30%, respectively.
Afterward, a multivariate analytical model of partial least square regression (PLS-R) was selected to develop models to predict the content of the targeted components. The general equation for PLS-R is expressed as follows:
Y = X × B + E
where X is an m × n matrix that holds the spectra values of the sample, B is the regression coefficient, and E is the error term. In this study, for the construction of the PLS-R model, spectral data of soybean seeds and powder were arranged in a matrix X, while the Y matrix contained the reference values obtained from chemical analysis. The values X and Y are decomposed into latent variables (LVs) to establish a linear connection between the response and predictor variables. The optimum number of LVs was selected based on the standard procedure documented by Varmuza and Filzmozer [37] Section 4.2.
X = T P T + E x
Y = U Q T + E y
T and U are score matrices, whereas P and Q are loading matrices. Ex and Ey are the error matrix of X and Y, respectively. Statistical methods, i.e., coefficient determination (R2) and standard error (S.E.) for both calibration and validation datasets, were used to select the best-obtained model [38]. The R2 is the proportion of the variance in the dependent variable that is predictable from the independent variables and can be found with Equation (4). Meanwhile, the standard error of the regression represents the average distance that the observed values fall from the regression line, which can be calculated using Equation (5).
R 2 = i ( y ^ i y ¯ ) 2 i ( y i y ¯ ) 2
S E = i ( y ^ i y i ) 2 n 1 2
where y ^ i represents predicted value, y i represents the reference value and y ¯ is the mean of the reference values. The number of samples is symbolized with n. All data analysis processes were conducted using MATLAB software (The Math Works, Natick, MA, USA, R2019a). The workflow for model development was summarized and presented as Figure 2 (developing model).

2.5. Model Testing

To evaluate the applicability of the obtained model, the spectral data of 1365 soybean seeds belonging to 65 soybean varieties not initially involved in the model construction were collected using both instruments (FT-IR and FT-NIR). Then, the prediction values of the targeted components were calculated by multiplying the obtained regression coefficient with the seed spectral data in the new data set. On the other hand, after acquiring the spectral data, the soybean seeds were ground and sent for chemical analysis to examine the content of the targeted components. The coefficient determination (R2) and the standard error (SE) were then calculated to evaluate the model performance by using Equations (4) and (5), respectively. The summaries of the steps for model testing was presented in Figure 2 (testing model).

3. Results and Discussion

3.1. Spectral Data Interpretation

The characteristics of the FT-NIR and FT-IR spectra for the soybean (Figure 3) represent some functional groups associated with the main components of the soybean, such as protein, carbohydrates, lipid, and moisture. The NIR spectra arise due to the vibration energy change between the fundamental (ground) state to the higher-order level or the combination bands. For the soybean sample, the near spectra can be highlighted as follows. The bands range from 5200 to 5100 cm−1, corresponding to the O–H vibration’s first overtone or combination mode, while the band between 5600 and 5000 cm−1 can be assigned to the vibration of C–H functional groups (first overtone and HC=CH form) related to the fatty acid component. Protein, the main soybean component, can be associated with the band between 5000 and 4500 cm−1, where the N–H and C=O stretching were detected [26]. The absorption bands ranging from 5720 to 6030 are related to the C–H first overtone region that correlated with the carbohydrates [39].
The FT-IR spectra (Figure 3B), consistent with the vibration in the primary energy level and the first excited vibration, can be basically separated into two waveband ranges. The fingerprint region (1200–900 cm−1) is the first range where the stretching of C–O, C–C, and C–O–C can possibly be identified. The stretching vibration of N–H that characterize protein arises around 1650 cm−1, while the asymmetric and symmetric CH2 and CH3 closely related to fatty acid were found at the band range of 3040 to 2850 cm−1 [40]. Figure 2 also revealed a straight line in the band range between 2600 to 2100 cm−1, indicating that no information can be obtained from this band range. Thus, this range of band was excluded for model development.

3.2. Reference Values Analysis

The result of the chemical data analysis for isoflavones and oligosaccharides was presented in Table 1.
Isoflavones are the essential flavonoid in soybean and have many health benefits, especially their ability to delay the menopausal period for women. In total, twelve types of isoflavones that belong to the three main types of isoflavones (daidzein, genistein, and glycitein) can be evaluated in the examined samples. However, the aglycone form (daidzein, genistein, glycitein) and two acetyl glucoside forms (acetyl genistin, acetyl glycitin) were only available at a low concentration level (below 50 µg/g) and had a narrow range of concentration. Therefore, the prediction model in this study was only developed for seven types of isoflavones and total isoflavones content, which was calculated by summing the content of individual isoflavone types detected in every soybean sample. The statistical data of the isoflavones, including the number of samples used for model development, are presented in Table 1. Overall, the concentration range obtained in this study coincided with the result presented by Berhow et al. [41]
Oligosaccharides were the next soybean chemical content observed in this study. Sucrose was the major oligosaccharide evaluated in the examined soybean samples and comprised about 50–55% of the total oligosaccharides in every sample. The galacto-oligosaccharides from the raffinose family (stachyose and raffinose) were observed around 36 to 40%, while other types of oligosaccharides (verbascose, maltose glucose, and fructose) only can be found in trace amounts in some examined soybean samples. Overall, this result was similar to the result reported by Hollung et al. [42] when evaluating carbohydrate content in Brazilian soybean. Therefore, due to the limited data of some individual oligosaccharide types, in this study, the prediction model was only developed for total oligosaccharides and three major types of observed oligosaccharides in the examined soybean samples.

3.3. Prediction Model of Soybean Chemical Components

3.3.1. Isoflavones Model

The chemical composition of each bean within a single sample may differ due to different maturity stages and seed sizes. Thus, the spectral data of 21 seeds in a single sample were averaged, resulting in one spectrum representing one soybean sample. On the other hand, the result from the chemical analysis revealed that the soybean biochemical composition differs among the variety, which affects the number of spectral data used to develop a model for individual isoflavones types. The detail of the data can be seen in Table 1.
Our result, presented in Table 2, shows that it is possible to predict total isoflavones, as well as individual isoflavones in soybeans. We obtained the determination value for the calibration model (R2c) over 0.8 and an error of about 0.32 mg/g or equal to 14% of the total isoflavones content in the examined sample. Even though the R2c values for individual isoflavones were acceptable (over 0.7), the error was relatively high (over 15%), affecting the model confidence. The obtained models developed based on seed spectral data of FT-IR had a lower performance than the FT-NIR model, which is indicated by lower determination coefficients values (R2c, R2p) and higher error values (SEC, SEP). Meanwhile, the powder model of FT-IR showed comparable performance to FT-NIR. The FT-IR sample holder only allows measuring samples on a tiny area, which allows the possibility of generating inconsistent spectral data, particularly for inhomogeneous samples. Though a total of seven data preprocessing methods were used in this study, only the preprocessing methods that attained the highest performance are presented in Table 2 for brevity.
Previous studies related to the application of NIR and MIR to predict isoflavones in soybean were confined by the sample number and the concentration range of isoflavones in the examined sample. Therefore, multiple linear regression analyses have been applied to NIR spectral data to predict the content of isoflavones in whole and powder soybeans using only 48 samples [43]. In addition, by using the cross-validation method, the previous study obtained an acceptable model, denoted by R2 over 0.74 and 0.94, for individual isoflavones and total isoflavones for powder samples, respectively. The results also revealed an excellent model based on the spectra of whole soybean seeds (group of soybean) to predict total isoflavones (R2: 0.90). However, it failed to develop models for individual forms of isoflavones based on the spectra of the whole bean. Wang et al. [44] developed a model to predict isoflavone content in kudzu using NIRs based on 88 samples data and presented a consistent model. Berhow et al. [41] reported an excellent calibration model to predict total isoflavones with an R2 over 0.90 using more than 700 spectral data of ground soybean samples. For all mentioned reports, no report presented an acceptable model to predict the individual form of isoflavones in an intact seed using NIR or MIR. Therefore, our study obtained an acceptable model for predicting total isoflavones as well as individual types of isoflavones using a large amount of data using NIR and MIR spectra of a single seed.
In terms of PLS-R, one of the essential stages of applying NIRs and MIRs for developing a prediction model is identifying the specific waveband, which contributes significantly to the model performance. In this study, several bands in the NIR region can be identified as influential bands for isoflavones prediction (Figure 4A), which was also reported by previous research. The bands around 5600 cm−1 (1780 nm) and 6600 cm−1 (1515 nm) were reported by Zhang et al. [27] to have significant contributions to flavonoid and isoflavonoid prediction. These signals correspond to the stretching vibration’s first overtone of C–H. Another band that is considered influential is around 8860 cm−1 (1120 nm), which can be associated with the second overtone vibration of C-H from the aromatic ring structure [45].
Figure 4B locates a specific band of FT-IR spectral data for isoflavone detection. All of the marked bands corresponded to the unique structure of isoflavones, which contain two aromatic rings. The bands around 1185 cm−1 (8430 nm) and 2000 cm−1 (5000 nm) were assigned to C–H’s bending vibration and the combination band of aromatic structure. The last significant band was around 1600 cm−1 (6250 nm), corresponding to the unique aromatic ring bonding C=C–C [46].

3.3.2. Oligosaccharides Model

The oligosaccharides prediction models based on FT-NIR and FT-IR spectral data were presented in Table 3. The result revealed that the FT-NIR was a promising method to predict total oligosaccharides and three main types of short-chain carbohydrates in soybeans using two different sample types: seeds and powder. Furthermore, the statistical parameters obtained based on the FT-NIR spectroscopy technique showed a good agreement between the prediction values and the reference values of the powder sample, indicated by the R2c values over 0.75 and low error (below 1%) for all of the evaluated chemicals. The seed-based model also exhibited an acceptable result, proven by low standard error values for both SEC and SEP (lower than 1%), representing a minimum difference between prediction and reference values. In addition, the results of the FT-NIR method were slightly better than those of FT-IR, denoted by higher R2p values and lower standard error values (SEP) for both sample types.
Carbohydrate is a macro component and one of the essential nutrients for humans. Thus, many previous research studies evaluated the robustness of NIR and MIR to predict this component in agriculture and food products. The effectiveness of NIR to predict carbohydrates in foxtail millet has been reported by Chen et al. [47] The full cross-validation method using 82 samples was applied to create a model and resulted in an excellent performance (R2) over 0.9 and low RMSE (below 0.8%). Ferreira et al. [39] used NIR spectral data of 82 varieties of Brazilian soybeans to predict the concentration of dietary fiber and obtained a high-performance model (R2) of 0.80 with an RMSEP of 0.86%. In addition, the sucrose content in soybean has also been successfully predicted by Choung [48]. However, most research was constrained by a limited number of samples, and the spectral data were acquired based on the powder sample, which is more homogenous. This current study developed models to predict total oligosaccharides as well as three main short-chain carbohydrates in soybean based on the seed spectral data. In addition, more reference values were also used to produce a more precise prediction model. Even though the obtained model revealed a lower performance, it could demonstrate the possibility of predicting oligosaccharides in soybean using an intact seed sample.
The highly correlated bands of evaluated components can be identified based on the regression coefficients. Figure 5A presents the NIR beta coefficients to predict soybean oligosaccharides, revealing three important peaks. The peak around 4380 cm−1 (2280 nm) refer to the C–H stretch and CH2 deformation combination belonging to starch, while the peak around 5450 cm−1 (1820 nm) was assigned to the stretching of O–H or C–H bond’s second overtone that possibly comes from the cellulose structure. The last pointed band, around 7570 cm−1 (1300 nm), was closely related to the vibration of the C–H second overtone/combination from the CH2 structure [45]. Masithoh et al. [24] reported that bands at 4264 and 4380 cm−1 were influential bands for glucose. Other research reported by Fereira et al. [39] pointed out that the band at 5400 contributed to the model development based on NIR spectra for soybean dietary fiber. Oligosaccharides are part of carbohydrates; thus, their fundamental chemical structure is similar to monosaccharides and polysaccharides. Hence, the influential bands identified in this study coincide with the previous studies that evaluated carbohydrates components.
The regression coefficients to develop an oligosaccharides model using FT-IR spectroscopy are shown in Figure 5B. The most significant bands to determine oligosaccharides can be identified at 985, 1430, 1700, and 3280 cm−1. The bands between 900 and 1100 can be associated with the stretching and bending vibration of C–C and C–O bonds. The bands around 1700 cm−1 represent the crystal water in raffinose pentahydrate, while the bands that arise at 3260 can be assigned to the O–H stretching vibration [49].

3.4. Testing Model

Testing the model was a crucial part of ensuring the model’s performance and evaluating the obtained model’s applicability. This study developed a prediction model for isoflavones and oligosaccharides for total content as well as individual types value. Since the general concept of spectroscopy relies on the chemical bond vibration of the molecular structure of a compound, thus the biological compounds with similar chemical structures result in identical vibrational frequencies under the IR spectral range [50]. Hence, the influential bands obtained from the model analysis could not clearly be confirmed to originate from a specific individual type of component. In the same manner, the prediction result can be derived from specific kinds or mixed types (total) of examined components. Previous research by Schoonjans et al. [51] reported that compounds with similar chemical structures have been classified in the close class by using the hierarchical upgma-clustering method based on the IR spectral data.
In our study, individual types of examined components have the same chemical structure backbone. Hence, the influential bands obtained from the model analysis could not clearly be confirmed to originate from a specific type of component. From this point of view, testing the model for total components is more reasonable than individual types ones. Furthermore, the evaluation of isoflavones and oligosaccharides for the commercial application was only carried out for the total component instead of individual types. In addition, this research also emphasized predicting soybean components using an intact seed sample. Hence, the testing procedure was only performed to the seed-based model of the total content of evaluated components.
On the other hand, the results obtained from the chemical analysis showed that four soybean varieties contained isoflavones lower than 0.50 mg/g, and one contained over 5.0 mg/g. Therefore, to maintain the concentration range within the range of the model, we excluded those data. The contents of the targeted components and the number of samples used for the testing procedure are presented in Table 4.
Statistical analysis of testing results showed a promising FT-NIR-technique-based model, presenting a similar R2 and error to the calibration model (Table 5). Meanwhile, the FT-IR technique presented R2 lower than the calibration model. The distribution of the prediction values and their correlation with the reference values for the FT-NIR technique can be seen in Figure 6.
Overall, the FT-NIR demonstrated better results than FT-IR on predicting isoflavones and oligosaccharides in soybean seeds. NIR radiation can penetrate the sample deeper than IR, which is beneficial for evaluating an agricultural product that is mostly categorized as an inhomogeneous sample. According to William and Noris [52], a model with Rp2 values between 0.66 to 0.82 can be categorized as a model that can be used for sample screening.

4. Conclusions

Using an intact seed sample, FT-NIR and FT-IR spectroscopic techniques were investigated to determine the concentration of isoflavones and oligosaccharides in soybeans. In addition, the possibility of predicting the different types of targeted components was also investigated to explore the potential of these techniques to measure microcomponents of agricultural products. In total, 6510 seeds were selected randomly from 310 soybean samples, and their spectral data were collected using both instruments to develop a calibration and validation model. The testing model was carried out using 1365 seeds that belong to 65 soybean varieties, which were not involved in the model construction. The result showed that FT-NIR spectroscopy combined with the PLSR was a promising method predicting total isoflavones and oligosaccharides using intact soybeans seed, presenting a performance prediction model (R2p) of 0.80 and 0.72, respectively. The results of the testing model also demonstrated good performances, which were close to the calibration model. Meanwhile, FT-IR spectroscopy also shows an acceptable result, even though the performance (R2p: 0.73 and 0.70) was lower than the FT-NIR technique. The results of this fundamental study can be used as basic knowledge to develop a seed-sorting machine based on chemical components.

Author Contributions

Data curation, H.Z.A. and S.S.T.; formal analysis, H.Z.A.; investigation, H.Z.A.; methodology, H.Z.A., S.S.T. and R.E.M.; software, H.Z.A.; validation, M.-G.C., K.-H.K., W.-H.L. and B.-K.C.; writing-original draft, H.Z.A.; conceptualization, M.S.K., I.B., W.-H.L. and B.-K.C.; writing—review and editing, R.E.M. and B.-K.C.; supervision, B.-K.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was carried out with the support of “Cooperative Research Program for Agriculture Science and Technology Development (Project No. PJ0159182021)” Rural Development Administration, Republic of Korea.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Farid, M.; Kodama, K.; Arato, T.; Okazaki, T.; Oda, T.; Ikeda, H.; Sengoku, S. Comparative Study of Functional Food Regulations in Japan and Globally. Glob. J. Health Sci. 2019, 11, 132. [Google Scholar] [CrossRef]
  2. Roger Boerma, H.; Specht, J.E. Soybeans: Improvement, Production, and Uses; Shibles, R.M., Harper, J.E., Wilson, R.F., Shoemaker, R.C., Eds.; Agronomy Monographs; American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America: Madison, WI, USA, 2004; ISBN 9780891182665.
  3. Liu, K. Soybeans: Chemistry, Technology, and Utilization; Springer: Cham, Switzerland, 1999; ISBN 0834212994. [Google Scholar]
  4. Karr-Lilienthal, L.; Kadzere, C.; Grieshop, C.; Fahey, G. Chemical and nutritional properties of soybean carbohydrates as related to nonruminants: A review. Livest. Prod. Sci. 2005, 97, 1–12. [Google Scholar] [CrossRef]
  5. Borejszo, Z.; Khan, K. Reduction of Flatulence-Causing Sugars by High Temperature Extrusion of Pinto Bean High Starch Fractions. J. Food Sci. 1992, 57, 771–777. [Google Scholar] [CrossRef]
  6. Bakker-Zierikzee, A.M.; Alles, M.S.; Knol, J.; Kok, F.J.; Tolboom, J.J.M.; Bindels, J.G. Effects of infant formula containing a mixture of galacto- and fructo-oligosaccharides or viable Bifidobacterium animalison the intestinal microflora during the first 4 months of life. Br. J. Nutr. 2005, 94, 783–790. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Sakai, T.; Kogiso, M. Soy isoflavones and immunity. J. Med. Investig. 2008, 55, 167–173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Cooke, P.S.; Selvaraj, V.; Yellayi, S. Genistein, Estrogen Receptors, and the Acquired Immune Response. J. Nutr. 2006, 136, 704–708. [Google Scholar] [CrossRef] [Green Version]
  9. Zuniga, K.; Clinton, S.K.; Erdman, J.W. The Interactions of Dietary Tomato Powder and Soy Germ on Prostate Carcinogenesis in the TRAMP Model. Cancer Prev. Res. 2013, 6, 548–557. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Velasquez, M.T.; Bhathena, S.J. Role of Dietary Soy Protein in Obesity. Int. J. Med. Sci. 2007, 4, 72–82. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. Zimmermann, C.; Cederroth, C.R.; Bourgoin, L.; Foti, M.; Nef, S. Prevention of Diabetes in db/db Mice by Dietary Soy Is Independent of Isoflavone Levels. Endocrinol. 2012, 153, 5200–5211. [Google Scholar] [CrossRef] [Green Version]
  12. Wei, P.; Liu, M.; Chen, Y.; Chen, D.-C. Systematic review of soy isoflavone supplements on osteoporosis in women. Asian Pac. J. Trop. Med. 2012, 5, 243–248. [Google Scholar] [CrossRef] [Green Version]
  13. Zhang, X.; Gao, Y.-T.; Yang, G.; Li, H.; Cai, Q.; Xiang, Y.-B.; Ji, B.-T.; Franke, A.A.; Zheng, W.; Shu, X.-O. Urinary isoflavonoids and risk of coronary heart disease. Int. J. Epidemiol. 2012, 41, 1367–1375. [Google Scholar] [CrossRef] [Green Version]
  14. Kim, G.-H.; Hwang, Y.-S.; Ahn, K.-G.; Kim, G.-P.; Kim, M.-J.; Hong, S.-B.; Moon, J.-K.; Choung, M.-G. Determination of Soluble Carbohydrates in Soybean Seeds Using High Performance Liquid Chromatography with Evaporative Light Scattering Detection. J. Korean Soc. Food Sci. Nutr. 2014, 43, 1062–1067. [Google Scholar] [CrossRef]
  15. Aguiar, C.L.; Haddad, R.; Eberlin, M.N.; Carrão-Panizzi, M.C.; Tsai, S.M.; Park, Y.K. Thermal behavior of malonylglucoside isoflavones in soybean flour analyzed by RPHPLC/DAD and eletrospray ionization mass spectrometry. LWT Food Sci. Technol. 2012, 48, 114–119. [Google Scholar] [CrossRef]
  16. Bustamante-Rangel, M.; Delgado-Zamarreño, M.; Pérez-Martín, L.; Carabias-Martínez, R. QuEChERS method for the extraction of isoflavones from soy-based foods before determination by capillary electrophoresis-electrospray ionization-mass spectrometry. Microchem. J. 2013, 108, 203–209. [Google Scholar] [CrossRef]
  17. Wang, Y.; Liu, Y.; Xiao, C.; Liu, L.; Hao, M.; Wang, J.; Liu, X. Simultaneous Determination of 15 Phenolic Constituents of Chinese Black Rice Wine by HPLC-MS/MS with SPE. J. Food Sci. 2014, 79, C1100–C1105. [Google Scholar] [CrossRef]
  18. Hao, J.; Zhu, H.; Zhang, Z.; Yang, S.; Li, H. Identification of anthocyanins in black rice (Oryza sativa L.) by UPLC/Q-TOF-MS and their in vitro and in vivo antioxidant activities. J. Cereal Sci. 2015, 64, 92–99. [Google Scholar] [CrossRef]
  19. Cho, C.; Jung, Y.S.; Nam, T.G.; Rha, C.; Ko, M.; Jang, D.; Kim, H.; Kim, D. pH-adjusted solvent extraction and reversed-phase HPLC quantification of isoflavones from soybean (Glycine max (L.) Merr.). J. Food Sci. 2020, 85, 673–681. [Google Scholar] [CrossRef]
  20. Yasmin, J.; Ahmed, M.R.; Lohumi, S.; Wakholi, C.; Kim, M.S.; Cho, B.-K. Classification Method for Viability Screening of Naturally Aged Watermelon Seeds Using FT-NIR Spectroscopy. Sensors 2019, 19, 1190. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Mureşan, A.E.; Muste, S.; Vlaic, R.A.; Bobiş, O.; Mureşan, C.; Socaciu, C.; Mureşan, V. HPLC Determination and FT-MIR Prediction of Sugars from Juices of Different Apple Cultivars during Fruits Development. Not. Bot. Horti Agrobot. Cluj-Napoca 2015, 43, 222–228. [Google Scholar] [CrossRef] [Green Version]
  22. You, Z.-H.; Liu, Z.-H.; Gong, C.-Y.; Yang, X.-L.; Cheng, F. Applying Attenuated Total Reflection-Mid-Infrared (ATR-MIR) Spectroscopy to Detect Hairtail Surimi in Mixed Surimi and Their Surimi Products. Guang Pu Xue Yu Guang Pu Fen Xi 2015, 35, 2930–2939. [Google Scholar]
  23. Determination of quality parameters in maize grain by NIR reflectance spectroscopy. Tarım Bilim. Derg. 2012, 18, 31–42. [CrossRef]
  24. Masithoh, R.E.; Amanah, H.Z.; Yoon, W.-S.; Joshi, R.; Cho, B.-K. Determination of protein and glucose of tuber and root flours using NIR and MIR spectroscopy. Infrared Phys. Technol. 2021, 113, 103577. [Google Scholar] [CrossRef]
  25. Brás, L.P.; Bernardino, S.A.; Lopes, J.; Menezes, J.C. Multiblock PLS as an approach to compare and combine NIR and MIR spectra in calibrations of soybean flour. Chemom. Intell. Lab. Syst. 2005, 75, 91–99. [Google Scholar] [CrossRef]
  26. Ferreira, D.; Galão, O.; Pallone, J.; Poppi, R. Comparison and application of near-infrared (NIR) and mid-infrared (MIR) spectroscopy for determination of quality parameters in soybean samples. Food Control. 2014, 35, 227–232. [Google Scholar] [CrossRef]
  27. Zhang, G.; Li, P.; Zhang, W.; Zhao, J. Analysis of multiple soybean phytonutrients by near-infrared reflectance spectroscopy. Anal. Bioanal. Chem. 2017, 409, 3515–3525. [Google Scholar] [CrossRef]
  28. Kim, Y.; Ahn, H.; Lee, E.; Kim, H. Development of Prediction Model by NIRS for Anthocyanin Contents in Black Colored Soybean. Korean J. Crop Sci. 2008, 53, 15–20. [Google Scholar]
  29. Kovalenko, I.V.; Rippke, G.R.; Hurburgh, C.R. Determination of Amino Acid Composition of Soybeans (Glycine max) by Near-Infrared Spectroscopy. J. Agric. Food Chem. 2006, 54, 3485–3491. [Google Scholar] [CrossRef] [Green Version]
  30. Lee, H.-S.; Kim, J.-B.; Lee, Y.-Y.; Lee, S.-Y.; Gwag, J.-G.; Baek, H.-J.; Kim, C.-K.; Yoon, M.-S. Estimating the Important Components in Three Different Sample Types of Soybean by Near Infrared Reflectance Spectroscopy. Korean J. Crop. Sci. 2011, 56, 88–93. [Google Scholar] [CrossRef]
  31. Amanah, H.Z.; Joshi, R.; Masithoh, R.E.; Choung, M.-G.; Kim, K.-H.; Kim, G.; Cho, B.-K. Nondestructive measurement of anthocyanin in intact soybean seed using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) spectroscopy. Infrared Phys. Technol. 2020, 111, 103477. [Google Scholar] [CrossRef]
  32. Choi, Y.-M.; Yoon, H.; Lee, S.; Ko, H.-C.; Shin, M.-J.; Lee, M.C.; Hur, O.S.; Ro, N.Y.; Desta, K.T. Isoflavones, anthocyanins, phenolic content, and antioxidant activities of black soybeans (Glycine max (L.) Merrill) as affected by seed weight. Sci. Rep. 2020, 10, 1–13. [Google Scholar] [CrossRef]
  33. Gorla, G.; Mestres, M.; Boqué, R.; Riu, J.; Spanu, D.; Giussani, B. ATR-MIR spectroscopy to predict commercial milk major components: A comparison between a handheld and a benchtop instrument. Chemom. Intell. Lab. Syst. 2020, 200, 103995. [Google Scholar] [CrossRef]
  34. Romani, A.; Vignolini, P.; Galardi, C.; Aroldi, C.; Vazzana, A.C.; Heimler, D. Polyphenolic Content in Different Plant Parts of Soy Cultivars Grown under Natural Conditions. J. Agric. Food Chem. 2003, 51, 5301–5306. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Osborne, B.G.; Fearn, T.; Hindle, P.H. Practical NIR spectroscopy with applications in food and beverage analysis. Longman Sci. Tech. 1993, 4, 12. [Google Scholar]
  36. Candolfi, A.; De Maesschalck, R.; Jouan-Rimbaud, D.; Hailey, P.; Massart, D. The influence of data pre-processing in the pattern recognition of excipients near-infrared spectra. J. Pharm. Biomed. Anal. 1999, 21, 115–132. [Google Scholar] [CrossRef]
  37. Varmuza, K.; Filzmoser, P. Introduction to Multivariate Statistical Analysis in Chemometrics; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar] [CrossRef] [Green Version]
  38. Masithoh, R.E.; Lohumi, S.; Yoon, W.-S.; Amanah, H.Z.; Cho, B.-K. Development of multi-product calibration models of various root and tuber powders by fourier transform near infra-red (FT-NIR) spectroscopy for the quantification of polysaccharide contents. Heliyon 2020, 6, e05099. [Google Scholar] [CrossRef] [PubMed]
  39. Ferreira, D.S.; Poppi, R.J.; Pallone, J.A.L. Evaluation of dietary fiber of Brazilian soybean (Glycine max) using near-infrared spectroscopy and chemometrics. J. Cereal Sci. 2015, 64, 43–47. [Google Scholar] [CrossRef]
  40. Lohumi, S.; Lee, S.; Cho, B.-K. Optimal variable selection for Fourier transform infrared spectroscopic analysis of starch-adulterated garlic powder. Sensors Actuators B Chem. 2015, 216, 622–628. [Google Scholar] [CrossRef]
  41. Berhow, M.A.; Singh, M.; Bowman, M.J.; Price, N.P.J.; Vaughn, S.F.; Liu, S.X. Quantitative NIR determination of isoflavone and saponin content of ground soybeans. Food Chem. 2020, 317, 126373. [Google Scholar] [CrossRef]
  42. Hollung, K.; Øverland, M.; Hrustić, M.; Sekulić, P.; Miladinović, J.; Martens, H.; Narum, B.; Sahlstrøm, S.; Sørensen, M.; Storebakken, T.; et al. Evaluation of Nonstarch Polysaccharides and Oligosaccharide Content of Different Soybean Varieties (Glycine max) by Near-Infrared Spectroscopy and Proteomics. J. Agric. Food Chem. 2005, 53, 9112–9121. [Google Scholar] [CrossRef]
  43. Sato, T.; Equchi, K.; Hatano, T.; Nishiba, Y. Use of Near-infrared Reflectance Spectroscopy for the Estimation of the Isoflavone Contents of Soybean Seeds. Plant Prod. Sci. 2008, 11, 481–486. [Google Scholar] [CrossRef]
  44. Wang, P.; Zhang, H.; Yang, H.; Nie, L.; Zang, H. Rapid determination of major bioactive isoflavonoid compounds during the extraction process of kudzu (Pueraria lobata) by near-infrared transmission spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2015, 137, 1403–1408. [Google Scholar] [CrossRef]
  45. Aenugu, H.P.R.; Sathis Kumar, D.; Srisudharson, N.P.; Ghosh, S.S.; Banji, D. Near infra red spectroscopy–An overview. Int. J. Chem Tech Res. 2011, 3, 825–836. [Google Scholar]
  46. Coates, J. Encyclopedia of Analytical Chemistry; Meyers, R.A., Ed.; John Wiley & Sons Ltd.: Chichester, UK, 2000; pp. 10815–10837. [Google Scholar]
  47. Chen, J.; Ren, X.; Zhang, Q.; Diao, X.; Shen, Q. Determination of protein, total carbohydrates and crude fat contents of foxtail millet using effective wavelengths in NIR spectroscopy. J. Cereal Sci. 2013, 58, 241–247. [Google Scholar] [CrossRef]
  48. Choung, M.-G. Determination of Sucrose Content in Soybean Using Near-infrared Reflectance Spectroscopy. J. Korean Soc. Appl. Biol. Chem. 2010, 53, 478–484. [Google Scholar] [CrossRef]
  49. Wiercigroch, E.; Szafraniec, E.; Czamara, K.; Pacia, M.Z.; Majzner, K.; Kochan, K.; Kaczor, A.; Baranska, M.; Malek, K. Raman and infrared spectroscopy of carbohydrates: A review. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2017, 185, 317–335. [Google Scholar] [CrossRef] [PubMed]
  50. Justino, C.I.L.; Duarte, K.; Freitas, A.C. Analysis of Marine Samples in Search of Bioactive Compounds; Elsevier: Amsterdam, The Netherlands, 2014; Volume 65, ISBN 9780444633590. [Google Scholar]
  51. Schoonjans, V.; Questier, F.; Guo, Q.; Van der Heyden, Y.; Massart, D. Assessing molecular similarity/diversity of chemical structures by FT-IR spectroscopy. J. Pharm. Biomed. Anal. 2001, 24, 613–627. [Google Scholar] [CrossRef]
  52. Williams, P.; Norris, K. Near-Infrared Technology in the Agricultural and Food Industries, 2nd ed.; Amer Assn of Cereal Chemists: Washington, DC, USA, 2001; ISBN 1891127241. [Google Scholar]
Figure 1. The photograph of the bean seed spectral data acquisition of FT-NIR (A) and FT-IR (B).
Figure 1. The photograph of the bean seed spectral data acquisition of FT-NIR (A) and FT-IR (B).
Foods 11 00232 g001
Figure 2. The experimental workflow to develop a model to predict isoflavones and oligosaccharides using FT-NIR and FT-IR spectroscopic techniques.
Figure 2. The experimental workflow to develop a model to predict isoflavones and oligosaccharides using FT-NIR and FT-IR spectroscopic techniques.
Foods 11 00232 g002
Figure 3. The typical pattern of raw spectra of soybean acquired using FT-NIR (A) and FT-IR (B).
Figure 3. The typical pattern of raw spectra of soybean acquired using FT-NIR (A) and FT-IR (B).
Foods 11 00232 g003
Figure 4. The regression coefficient of the PLSR model for isoflavones prediction using FT-NIR (A) and FT-IR (B).
Figure 4. The regression coefficient of the PLSR model for isoflavones prediction using FT-NIR (A) and FT-IR (B).
Foods 11 00232 g004
Figure 5. The beta coefficients curve for oligosaccharides prediction in soybean: (A) FT-NIR, (B) FT-IR.
Figure 5. The beta coefficients curve for oligosaccharides prediction in soybean: (A) FT-NIR, (B) FT-IR.
Foods 11 00232 g005
Figure 6. The distribution of the prediction data correlated with reference for the testing procedure using FT-NIR technique: (A) total isoflavones; (B) total oligosaccharides.
Figure 6. The distribution of the prediction data correlated with reference for the testing procedure using FT-NIR technique: (A) total isoflavones; (B) total oligosaccharides.
Foods 11 00232 g006
Table 1. The statistical data of targeted soybean components used for reference value of the prediction model.
Table 1. The statistical data of targeted soybean components used for reference value of the prediction model.
ComponentsNumber of SamplesMean ± SDMaxMin
Isoflavones (mg/g)
Daidzin 2890.129 ± 0.0570.3170.018
Genistin 2890.136 ± 0.0430.2500.039
Glycitin 2650.042 ± 0.0180.1120.012
6-O-Malonyl daidzin3100.665 ± 0.2711.4030.110
6-O-Malonyl genistin3101.080 ± 0.3752.0280.200
6-O-Malonyl glycitin2800.147 ± 0.0610.3760.005
Acetyl daidzin2700.065 ± 0.0210.1230.020
Total isoflavones 3102.320 ± 0.774.3390.728
Oligosaccharides (%)
Sucrose3106.001 ± 1.2108.2572.889
Stachyose3102.885 ± 0.5174.0671.781
Raffinose3101.124 ± 0.1331.4560.806
Total oligosaccharides31011.131 ± 1.64514.6436.879
Table 2. The PLSR statistical model for predicting total isoflavones and individual forms of isoflavones in soybeans using FT-NIR and FT-IR techniques.
Table 2. The PLSR statistical model for predicting total isoflavones and individual forms of isoflavones in soybeans using FT-NIR and FT-IR techniques.
Components (Preprocessing Method)FT-NIRFT-IR
R2cSECR2pSEPLVsR2cSECR2pSEPLVs
Seed
Daidzin (MN/MN)0.740.030.720.03250.720.030.700.0325
Genistin (RD/RD)0.730.020.700.03250.700.020.670.0325
Glycitin (MN/MN)0.780.010.760.01250.730.010.720.0125
6-O-Malonyl daidzin (MN/MN)0.770.140.750.12250.720.150.700.2025
6-O-Malonyl genistin (MN/MN)0.790.170.770.18250.710.190.700.2925
6-O-Malonyl glycitin (MN/MN)0.750.030.710.03250.700.030.700.0325
Acetyl daidzin (SNV/SNV)0.760.010.730.01230.710.020.680.0222
Total isoflavones (MN/MN)0.800.320.800.30250.740.290.730.3025
Powder
Daidzin (RD/MN)0.770.030.750.03220.790.020.780.0222
Genistin (RD/MN)0.810.020.760.03230.840.020.730.0222
Glycitin (MN/MN)0.760.010.740.01240.740.010.700.0122
6-O-Malonyl daidzin (MN/MN)0.830.110.730.13220.830.110.740.1522
6-O-Malonyl genistin (MN/MN)0.830.160.770.17220.880.140.770.1923
6-O-Malonyl glycitin (MN/MN)0.730.030.720.03240.780.030.740.0324
Acetyl daidzin (SNV/SNV)0.770.010.750.01240.770.010.760.0124
Total isoflavones (MN/MN)0.920.210.840.33250.920.210.840.3325
R2c: coefficient determination of calibration; SEC: standard error of calibration; R2p: coefficient determination of prediction; SEP: standard error prediction; RD: raw data; MN; mean normalization; SNV: standard normal variate.
Table 3. The PLSR statistical model for predicting total oligosaccharides and short-chain carbohydrates in soybean using FT-NIR and FT-IR techniques.
Table 3. The PLSR statistical model for predicting total oligosaccharides and short-chain carbohydrates in soybean using FT-NIR and FT-IR techniques.
Components (Preprocessing Method)FT-NIRFT-IR
R2cSECR2pSEPLVsR2cSECR2pSEPLVs
Seed
Sucrose (RD/MN)0.720.710.700.75190.720.670.710.6819
Stachyose (RD/SNV)0.700.280.660.29190.670.300.660.3319
Raffinose (SNV/SNV)0.720.060.700.07200.680.070.660.0820
Total soluuble Carb (SNV/MN)0.720.800.700.82180.700.880.700.9518
Powder
Sucrose (RD/MN)0.830.550.750.67190.730.620.740.6419
Stachyose (RD/SNV)0.770.240.700.28190.660.290.670.3019
Raffinose (SNV/SNV)0.770.060.720.06200.730.070.720.0720
Total soluuble Carb (SNV/MN)0.780.740.750.80180.730.870.720.8418
R2c: coefficient determination of calibration; SEC: standard error of calibration; R2p: coefficient determination of prediction; SEP: standard error prediction; MN: mean normalization; RD: raw data; SNV: standard normal variate.
Table 4. Statistical data of the targeted components for testing model.
Table 4. Statistical data of the targeted components for testing model.
Components (Unit)Number of VarietiesMean ± SDMaxMin
Total isoflvones (mg/g)602.799 ± 0.7364.1760.946
Total oligosaccharides (%)6511.317 ± 0.93813.4199.135
SD: standard deviation.
Table 5. The statistical result of the testing model for predicting isoflavones and oligosaccharides in soybean seeds.
Table 5. The statistical result of the testing model for predicting isoflavones and oligosaccharides in soybean seeds.
ComponentsN SeedsN VarietiesR2SE
FT-NIR
Total Isoflavones1260600.800.31
Total oligosaccharides1365650.710.64
FT-IR
Total Isoflavones1260600.710.35
Total oligosaccharides1365650.680.81
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Amanah, H.Z.; Tunny, S.S.; Masithoh, R.E.; Choung, M.-G.; Kim, K.-H.; Kim, M.S.; Baek, I.; Lee, W.-H.; Cho, B.-K. Nondestructive Prediction of Isoflavones and Oligosaccharides in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopic Techniques. Foods 2022, 11, 232. https://doi.org/10.3390/foods11020232

AMA Style

Amanah HZ, Tunny SS, Masithoh RE, Choung M-G, Kim K-H, Kim MS, Baek I, Lee W-H, Cho B-K. Nondestructive Prediction of Isoflavones and Oligosaccharides in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopic Techniques. Foods. 2022; 11(2):232. https://doi.org/10.3390/foods11020232

Chicago/Turabian Style

Amanah, Hanim Z., Salma Sultana Tunny, Rudiati Evi Masithoh, Myoung-Gun Choung, Kyung-Hwan Kim, Moon S. Kim, Insuck Baek, Wang-Hee Lee, and Byoung-Kwan Cho. 2022. "Nondestructive Prediction of Isoflavones and Oligosaccharides in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopic Techniques" Foods 11, no. 2: 232. https://doi.org/10.3390/foods11020232

APA Style

Amanah, H. Z., Tunny, S. S., Masithoh, R. E., Choung, M. -G., Kim, K. -H., Kim, M. S., Baek, I., Lee, W. -H., & Cho, B. -K. (2022). Nondestructive Prediction of Isoflavones and Oligosaccharides in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopic Techniques. Foods, 11(2), 232. https://doi.org/10.3390/foods11020232

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop