Next Article in Journal
Non-Destructive Inspection of Physicochemical Indicators of Lettuce at Rosette Stage Based on Visible/Near-Infrared Spectroscopy
Previous Article in Journal
Flower Species Ingredient Verification Using Orthogonal Molecular Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nutritional Quality Assessment of Miscellaneous Cassava Tubers Using Principal Component Analysis and Cluster Analysis

1
Department of Mechanical Engineering, Guangxi Normal University, Guilin 541004, China
2
College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou 310058, China
3
Agricultural Engineering Research Institute, Agricultural Research Center, Giza 12619, Egypt
4
Faculty of Urban and Regional Planning, Cairo University, Giza 12613, Egypt
5
College of Engineering, South China Agricultural University, Guangzhou 510642, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Foods 2024, 13(12), 1861; https://doi.org/10.3390/foods13121861
Submission received: 1 May 2024 / Revised: 6 June 2024 / Accepted: 12 June 2024 / Published: 13 June 2024
(This article belongs to the Section Food Nutrition)

Abstract

:
Cassava is a staple crop in developing countries because its starchy roots provide essential dietary carbohydrates. The aim of this research was to conduct a comprehensive inquiry and scientific evaluation of the nutritional value of cassava tubers. Eight nutritional characteristics were examined in native and imported cassava variants: starch, reduced sugar, anthocyanins, protein, dietary fiber, quinic acid, vitamin C, and dry matter content. Principal component analysis (PCA) was conducted to minimize the dimensionality of the nutritional markers. A scientific assessment technique was developed to calculate a composite score for the various cassava samples. Analysis of the data revealed noticeable variance among the samples’ nutritional indicators, suggesting varying degrees of association. Starch had a substantial positive link with lower sugar, protein, and dry matter content (p < 0.01). Anthocyanins and quinic acid interacted favorably (p < 0.05), and a positive link between protein and dry matter content was observed (p < 0.05); however, protein and dietary fiber interacted negatively (p < 0.05). The contribution rate of the top three PCA factors was over 76%, demonstrating that these factors incorporated the primary information acquired from the eight original nutritional indices, while maintaining excellent representativeness and impartiality. The experimental results showed a preliminary nutritional grade for 22 cassava tuber samples. The top five types were Guangxi Muci, Gui Cassava 4, Glutinous Rice Cassava, Huifeng 60, and Dongguan Hongwei. In the cluster analysis, the levels of similarity between the data showed that the 22 types of cassava tubers could be grouped into five categories, each with their own set of nutrients. This study promotes the directed breeding of cassava species and offers a theoretical foundation for creating and using various cassava varieties. Furthermore, this work lays the groundwork for a systematic and dependable technique for the quality assessment, comprehensive evaluation, and reasonable classification of cassava species and similar crops.

1. Introduction

Cassava (Manihot esculenta Crantz) is a sturdy shrub that typically grows to heights of 2–5 m. It has substantial roots that contain high amounts of carbohydrates and fiber. Propagation involves stem cuttings, specifically 13–17 cm long seed stems. After trenching, seed stems are planted horizontally or obliquely [1,2]. Beyond their rich biodiversity, cassava tuber stems and leaves can be transformed into many products, and more than 2000 variants, such as starch, alcohol, and organic chemicals, can be derived from them. Additionally, processing allows for its widespread use across the food and medicine industries, among other light industries. Cassava, a renewable biomass energy crop, is a raw material for fuel ethanol production. Breeders and molecular biologists have recently introduced high-sugar and high-starch cassava varieties adapted for post-harvest storage. However, these varieties exhibit differential nutritional composition changes. Such differences are conditioned by specific geographical influences, including the regional climate, soil fertility, and fertilization levels in growing regions [3]. Conducting a quality evaluation to select superior varieties in China’s predominant cassava-cultivating region is crucial to the cassava industry’s progress [4]. While evaluating cassava traits accurately with a single index analysis proves challenging, a comprehensive evaluation would enable a more objective and precise appraisal of cassava varieties, effectively bypassing the limitations of a single index.
PCA is an effective and popular dimension reduction and linear transformation method for distilling multiple variables into a concise set of integrated variables [5,6]. These composite indicators encapsulate most of the information present in the original variables, while remaining mutually exclusive. This process allows for the reduction of complex factors into several principal components, thus simplifying the issue and facilitating the acquisition of more scientifically reliable and efficient data. In addition, cluster analysis identifies statistical measures that gauge the degree of similarity between samples or indicators. Utilizing these statistical measures, samples with significant similarities are aggregated into classes [7]. PCA has been used for quality analysis across various agriproducts, such as potatoes, rice, and wheat. Goncalves et al. employed PCA for data dimension reduction and hierarchical clustering (HCPC) to form clusters [8]. The effectiveness of these clusters was evaluated based on coffee yield variance, thereby informing the cultivation of novel soil management methods for coffee production. Similarly, Ephzibah et al. designed a model utilizing PCA and multi-label fuzzy classifier techniques to help farmers diagnose disease categories in rice crops, enabling timely interventions [9].
Mengistu et al. employed PCA and cluster analysis to evaluate the genetic diversity of wheat [10]. Their findings were invaluable for designing and implementing new wheat breeding programs. Seid et al. leveraged PCA and clustering to select genetically diverse parents, ascertain genetic diversity and factors impacting potato genotypes, and develop high-yield, superior-quality potatoes [11]. He et al. assessed potato germplasm adaptability in Guangxi, China, using a comparative analysis integrating PCA and comprehensive systematic cluster evaluation methods [12]. Their research identified seven exceptional potato germplasms suitable for widespread application in Guangxi. Xu et al. used PCA and cluster analysis to evaluate nutrition quality across diverse rice varieties, dividing all quality indicators into five classifications [13]. Their research provided a scientific foundation for nutritional quality assessment and initiated novel concepts for the production of new functional rice products. Yu et al. selected 23 mature fruits from Huangpi germplasm resources, measured 11 species, and determined representative Huangpi quality evaluation indicators [14]. Cluster analysis was used to divide 23 Huangpi germplasms into four categories based on their quality scores. Many scholars have used PCA and cluster analysis to classify and evaluate various food crops and cassava. These studies based on PCA and cluster analysis, as well as mating techniques to develop varieties more suitable for local cultivation, are valuable for local breeding programs. Such techniques ensure higher nutritional values and economic benefits. Therefore, PCA and cluster analysis have substantial potential for applications in cassava tuber quality research. In this study, 8 key indicators were selected from 22 cassava varieties, and a comprehensive quality evaluation and classification of various cassava tuber varieties were conducted using PCA and cluster analysis.

2. Materials and Methodological Framework

2.1. Devices and Materials

Various precise instruments were employed in this study, including a precision electronic balance, a high-speed centrifuge, and a liquid chromatography–mass spectrometer. Electronic balance scales and dryers were also used at various stages of the research methodology. Regarding the test materials, this research encompassed 22 distinctive varieties of cassava roots (Guilin Academy of Agricultural Sciences, Guilin, China). The varieties were as follows: C1115, SC9, Nanzhi 199, GR 4, SC6068, Dongguan Hongwei, Glutinous Rice Cassava, Qiongzhong 1, Guangdong 1, Colombia 8H, Switzerland D32, Hainan Hongxin, Xinglong 1, Huifeng 60, Guangxi Muci, Huixian Baipi, Rongyang 1, Swiss P11, Gui Cassava 4, Shatian Bread, TMS60444, and BRA900. Each variety has its unique characteristics and nutritional profile, contributing to the variety and robustness of the quality evaluation.

2.2. Determination of Indicators

To determine the composition, ten uniform-sized roots were randomly selected from each variety, and a wide range of nutrients and substances, including starch, reduced sugar, anthocyanins, protein, dietary fiber, quinic acid, vitamin C, and dry matter content, were assessed using precise techniques. Regarding nutritional value, the starch content in cassava tubers is quite high. Starch and reducing sugars are the major energy sources, while anthocyanins are powerful antioxidants. Cassava also contains a certain amount of protein and can be used as a source of complementary protein. Dietary fiber is much less abundant in cassava, but cassava can still promote gastrointestinal peristalsis and reduce the time that feces stays in the body. Quinacid and vitamin C are powerful anti-inflammatory antioxidants, and vitamin C is an important substance involved in the synthesis of collagen. The dry matter contains the main nutrients of cassava, such as protein, starch, and fiber, and this content directly affects the cassava yield.
The measurement method was as follows: First, the method of determining starch in foods was followed as outlined by the national standard GB 5009.9-2016 (2016) [15]. The starch was hydrolyzed into disaccharides with amylase, followed by hydrolysis into monosaccharides after removing fats and soluble sugars. The reduced sugar measurement was then converted into starch content. Second, reduced sugar was measured using the 3,5-dinitrosalicylic acid method [16]. Reduced sugar was oxidized into sugar acid and other products under alkaline conditions. 3,5-Dinitro salicylic acid was transformed into red–brown-colored 3-amino-5-nitro salicylic acid. The substance’s absorbance was measured at 540 nm, and the proportion of reduced sugar was determined using colorimetry. Third, anthocyanins were measured using the extraction methods of Zhang Cong et al. and Wu Peng et al. [17,18]. The samples for the test were prepared with the following conditions: pH 2, 80% ethanol, and a materials-to-liquid ratio of 1:10 g/mL. Then, a portion was extracted for 40 min under 300 W ultrasonic power conditions and centrifuged at 8000 r/min, and 1 mL of its supernatant was analyzed. Fourth, protein was measured in accordance with the national standard GB 5009.5-2016 (2016) [15]. Protein in cassava was decomposed under catalytic heating conditions, and the resulting ammonia was combined with sulfuric acid to form ammonium sulfate. After alkaline distillation, the ammonia was released and absorbed with boric acid. The solution was titrated with a sulfuric or hydrochloric acid standard. The nitrogen content was computed based on the acid’s consumption and then multiplied by a conversion coefficient to determine the protein content, as shown in Equation (1).
X = ( V 1 V 2 ) × c × 0.0140 m × V 3 / 100 × F × 100
where X is the protein content of the sample (g/100 g), V1 is the volume of standard titrant of sulfuric or hydrochloric acid consumed by the test solution (mL), V2 is the volume of sulfuric acid or hydrochloric acid standard titrant consumed by reagent blanks (mL), c is the concentration of standard titration solution of sulfuric or hydrochloric acid (mol/L), 0.0140 is the mass of the nitrogen equivalent to 1.0 mL of standard titration solution of sulfuric or hydrochloric acid (g), V3 is the volume of digestive solution aspirated (mL), F is the nitrogen-to-protein conversion factor, and 100 is the conversion coefficient.
Fifth, dietary fiber was determined according to the national standard GB 5009.88-2014 (2014) [19]. A dry sample was digested with α-amylase, protease, and glucosidase to remove protein and starch. Post-enzymatic hydrolysis, the solution was precipitated and filtered with ethanol, and the residue was washed with ethanol and acetone. The material was weighed upon drying to determine the total dietary fiber residue. Sixth, quinic acid was measured using the methods of Yao Gaifang et al. and Liu Wanjun et al. [20,21]. The method involved grinding 2 g of the pulp in 10 mL of deionized water, placing the sample in a water bath at 37 °C for 30 min, ultrasonic extraction for 15 min, and then centrifugation at 4 °C and 12,000 g for 15 min. In the HPLC analysis, 1 mL of supernatant was passed through a C18 SPE column and a 0.45 m Sep-Pak microporous membrane to determine the quinic acid content. Seventh, vitamin C was measured using 2,6-dichlorindiindiophol titration [22,23]. To create a homogenate, 5.0 g of cassava root was ground with 2% oxalic acid and poured into a 50 mL volumetric flask. After being rinsed several times with 2% oxalic acid and centrifuged at 9000 r/min for 10 min, 7 mL of the supernatant was titrated with a standard solution of 2,6-dichlorophinophentil until a 30 s endpoint. The consumed volume was then recorded, and the vitamin C content was calculated. Eighth, the dry matter content was determined using a standard drying method.

2.3. Analytical Methods

The analytical flow of this research is illustrated in Figure 1. The study included sourcing common local and imported cassava varieties to serve as test materials. A total of 22 tubers from 22 cassava varieties and eight indicators of nutritional value were selected for planting and nutrient content determination. Then, utilizing the PCA method, we calculated the differences and varying degrees of correlation among the nutritional value indicators, leading to comprehensive scoring. Ultimately, cluster analysis was employed through inter-group linkages to cluster the eight nutritional value indicators of cassava tubers, effectively serving as characteristic identifiers. Following this, we designed and evaluated a genealogical map. The data were statistically analyzed using Origin 2022 and SPSS 26.0 software. This analysis revealed a significant difference between the various groups, followed by PCA and cluster analysis. A scientific and quantitative evaluation system was established by analyzing and simplifying the nutritional value index. This system allowed for comprehensive scoring of each cassava sample.

3. Results and Analysis

3.1. Comparative Analysis

Based on the observations made in Table 1 and Table 2, the 22 cassava nutrients displayed varying traits. The anthocyanin content showed the most significant variation, with a coefficient of variation of 121.98%. This was followed by the reduced sugar content, with a coefficient of variation of 106.79%. Next were dietary fiber, vitamin C, dry matter content, quinic acid, and protein. The starch content had the slightest variation, with a relatively low coefficient of variation of 3.10%. Note that these findings display specific differences compared to previous studies [4]. This is likely due to variations in the cassava varieties used, differences in field management practices, and nuances in the ecological environment present during the experiment.

3.2. Correlation Analysis

Our findings indicated significant differences in nutrient indices among the samples, as shown in Figure 2. Various degrees of correlation were also noted among these nutrient indices. A significant positive correlation was observed between starch content and reduced sugar, protein, and dry matter content (p < 0.01). Reducing sugars correlated significantly with protein and dry matter content (p < 0.05). Moreover, a positive correlation was observed between the contents of anthocyanins and quinic acid (p < 0.05). Protein was positively correlated with dry matter content (p < 0.05) but negatively correlated with dietary fiber (p < 0.05). Finally, vitamin C exhibited a positive correlation with dry matter content (p < 0.05) but a negative correlation with dietary fiber (p < 0.05).

3.3. Principal Component Analysis

3.3.1. Standardized Data Processing

Given the varying data scales, the magnitude of each indicator differed significantly. These indicators posed a challenge, as higher numbers may amplify their respective indicators’ roles and diminish in influence at lower numerical levels. This process would not be conducive to a comprehensive evaluation analysis using Euclidean distances [24]. To tackle this issue and achieve a more balanced overall evaluation, it was necessary to normalize the nutrient composition values of the 22 assessed cassava tubers. A nonlinear normalization method was utilized in this study to standardize these values into a non-dimensional form. The outcomes of this process are displayed in Table 3 and Figure 3.

3.3.2. Appropriability Test

The Kaiser–Meyer–Olkin test (KMO) and Bartlett’s test of sphericity are commonly employed in statistics to evaluate whether data are suitable for factor analysis and correlation analysis. The KMO test is used to assess the adequacy of a sample and the correlation between variables. If the KMO value is close to 1, the correlation between the variables is robust, the sample is sufficient, and the factor analysis method is suitable. The Bartlett sphericity test is used to evaluate independence between variables. A value less than 0.05 indicates a correlation between variables. In this study, the KMO measure of sampling adequacy had a value of 0.703, which exceeded the advisable threshold of 0.7. Coupled with the fact that the significance derived from Bartlett’s test of sphericity was less than 0.05, these results emphasized a robust correlation among the variables, making them suitable for PCA. Therefore, PCA could be effectively used to evaluate and interpret these data and identify the principal components or features that explained the most variance in the dataset.

3.3.3. Selection of PCA

We moved forward with PCA and calculated the variance and cumulative contribution rates following the dimensionless standardization of nutrient components in the 22 cassava tubers [25]. In Table 4, the first principal component had an eigenvalue of 3.354 and contributed to 41.921% of the variance. The second principal component had an eigenvalue of 1.550, accounting for 19.369% of the variance. The third principal component, with an eigenvalue of 1.188, contributed 14.852% of the variance. By adding these rates, we found that the cumulative variance contribution rate of the first through third principal components exceeded 76.1%. As per Figure 4, there was an inflection point at the third principal component. Beyond this point, the eigenvalues fell below 1, and the trend line flattens. This suggests that the first three principal components could adequately represent the characteristic information of the eight traits across the 22 cassava tuber samples. Thus, these first three principal components could be selected as a comprehensive evaluation index for assessing the nutritional value of cassava tubers.

3.3.4. Comprehensive Evaluation

A PCA load matrix reflects each trait’s relative magnitude and the direction of its respective contribution to the principal components. In other words, it illustrates how each trait impacts the principal components, as shown in Table 5. A principal component load map, shown in Figure 5, was constructed to examine further the relationships between each principal component and the indices. These relationships are shown in diagrams for the first and second, second and third, and first and third pairs of principal component loads. In the analysis, A3, A8, A7, and A2 showed more significance in the direction of the first principal component. Similarly, A1 and A5 had significant relevance to the second principal component. A6 and A4 were noteworthy for their alignment with the third principal component direction. These observations further underlined the unique influence of different traits on the individual principal components, providing a more nuanced understanding of their contribution to the comprehensive nutritional evaluation of cassava tubers.
According to the analysis, the first principal component of cassava tubers represents traits associated with reduced sugar, anthocyanins, vitamin C, and dry matter content. On the other hand, the second principal component predominantly reflects starch and dietary fiber characteristics, while the third principal component represents protein and quinic acid. In the principal component expression, the magnitude of each factor coefficient can provide information about the extent of the factor’s contribution to that specific principal component. Notably, a load greater than 0.3 is generally considered significant in PCA. We constructed expressions for the three principal components, as shown in Equations (2)–(4), as per the eigenvectors of the correlation matrix of the nutrient components in the cassava tubers mentioned in Table 6.
F 1 = 0.251 A 1 + 0.227 A 2 + 0.084 A 3 + 0.251 A 4 0.154 A 5 0.075 A 6 + 0.154 A 7 + 0.245 A 8
F 2 = 0.116 A 1 0.073 A 2 + 0.537 A 3 0.132 A 4 + 0.199 A 5 0.523 A 6 + 0.063 A 7 + 0.062 A 8
F 3 = 0.316 A 1 + 0.453 A 2 0.014 A 3 0.196 A 4 + 0.473 A 5 + 0.031 A 6 0.515 A 7 + 0.095 A 8
Considering both the principal component coefficients and their corresponding variance contribution rates, a comprehensive evaluation could be formulated, as shown in Equation (5).The principal components and composite scores of different varieties of cassava tubers were calculated according to Equations (2)–(5) as shown in Table 7.
F c = 0.41921 F 1 + 0.19369 F 2 + 0.14852 F 3
As shown in Table 8, the sequence of the first five cassava types consisted of Guangxi Muci, Gui Cassava 4, Glutinous Rice Cassava, Huifeng 60, and Dongguan Hongwei. Figure 6 shows the distribution map of the principal component scores, while Figure 7 illustrates the load diagram resulting from the PCA of diverse cassava tuber varieties. Varieties with higher comprehensive scores are predominantly localized in the positive direction of F1, revealing that principal component 1 exerts substantial influence over the composite score.

3.4. Cluster Analysis

The system clustering analysis strategy can be utilized to categorize test samples into separate groups for in-depth evaluation and study. This division of samples is termed Q-type clustering, while the categorization of variables is R-type clustering. The results derived from this system are comprehensive, objective, and based on scientific principles [25]. In the experiment of this study, PCA was utilized and clustering analysis was systematically applied to organize eight nutritional value indicators from different strains of cassava tuber into R-type clusters and 22 cassava tuber strains into Q-type clusters. Figure 8 depicts the results of the clustering process. With a Euclidean distance of 18, 22 cassava strains were classified into five main categories. Figure 8 shows that C1115, Hainan Hongxin, Nanzhi 199, and Rongyang 1, among others, were assigned to one category; SC9, Swiss P11, Switzerland D32, and TMS60444 were clustered in another category; Guangxi Muci and Gui Cassava 4 belonged to a separate category; and Dongguan Hongwei and BRA900 formed individual categories.
Table 8 outlines the clustering analysis of nutritional value indicators from the 22 cassava types. Category I comprises Guangxi Muci and Gui Cassava 4. These varieties are locally sourced, demonstrate favorable nutritional value, and possess high starch content, reducing sugars, dry matter content, and other nutrients, rendering them excellent candidates for selection. Category II includes Glutinous Rice Cassava, Huifeng 60, and 12 varieties, mainly from localities in Southern China. They offer moderate nutritional value, a balanced protein content, and a subset of varieties with elevated individual nutrient content—qualities that lend them significant potential for selection. Category III contains Dongguan Hongwei, which exhibits exceptional protein content, making it a suitable choice. Six strains, including SwissP11 and Switzerland D32, are included in Category IV. Despite the introduction of varieties, they have commendable fiber content. Regardless of these limitations, some varieties have a comparatively low content of certain nutrients but maintain a high selection value. Finally, category V, represented by BRA900, has a higher anthocyanin content but lower levels of individual nutrients. As a result, selection from this class of cassava would primarily depend on specific production circumstances.

4. Discussion

At present, multivariate statistical methods, specifically PCA, are widely used to assess the agronomic traits and nutritional quality of crops. In addition, there is cluster analysis, which can divide an unknown category of crops into several clusters to further explore the relationship between the species. In this study, the results from the PCA, which classified the eight nutrients into three primary components, and the cluster analysis, which grouped the 22 cassava root tubers into five different categories and the eight nutrients into three categories, significantly aided the comprehensive evaluation process of cassava quality. Five superior varieties were identified: Guangxi Muci, Gui Cassava 4, Glutinous Rice Cassava, Huifeng 60, and Dongguan Hongwei.
Plant growth, development, and physiological characteristics, such as those of cassava, are influenced by the plant’s genetic material and environmental conditions like temperature, light, and water. In various regions, local climate and cultivation methods significantly influence cassava’s nutrient content. During the cassava planting period in Guangxi, limited sunlight and excessive rainfall affect some imported varieties’ yield and nutritional value, leading to reduced quality of some varieties. On the other hand, large-scale discretization was performed after excluding the error of the experimental method. The reasons for this discretization included differences in harvest time and sample preservation time.
The 22 kinds of cassava in this study were comprehensively ranked according to the nutritional value of their tubers. In addition to its nutritional value, cassava can be economically beneficial. In practical applications, different kinds of cassava have varying advantages, including having edible, feeding, industrial, and medicinal value. In addition, the leaves of some varieties have high nutritional value. At present, cassava is widely used in many fields, and the development of production in poor mountainous areas could increase farmers’ economic income, reduce the feed burden of the livestock and poultry industries, and provide raw materials for enterprises for processing and production.

5. Conclusions

The nutritional value indicators of the cassava root samples showed significant differences and varying degrees of correlation. In the correlation analysis, there were positive or negative correlations between multiple nutrients. The PCA extracted three primary factors with a cumulative contribution rate exceeding 76%. The first was closely related to reducing sugars, anthocyanins, vitamin C, and dry matter content; the second was mainly linked to starch and dietary fiber; and the third was primarily associated with protein and quinic acid. Cluster analysis grouped 22 cassava root samples into 5 categories and the 8 nutrients into 3 categories. Each category displayed noticeable differences in nutrient composition and Euclidean distance, suggesting unique nutritional profiles. In conclusion, cassava roots hold excellent nutritional value and excellent potential for various utilizations. However, particular applications might vary significantly across cassava varieties, due to the geographical environment and other factors. The findings in this study provide a solid theoretical basis for the targeted breeding of different cassava species and the comprehensive use and productive development of each variety. In future studies, the cassava tuber sample size will be expanded, a more comprehensive evaluation of different uses will be conducted, and an additional evaluation analysis of cassava leaves will be considered. This could facilitate the development of cassava products in a more targeted manner, depending on classification characteristics.

Author Contributions

Conceptualization, L.C. and E.M.A.; Methodology, L.C., R.C., E.M.A. and X.M. (Xu Ma); Software, L.C., R.C. and E.M.A.; Formal analysis, H.J.; Investigation, X.M. (Xiangwei Mou); Resources, X.M. (Xiangwei Mou); Data curation, R.C.; Writing—original draft, L.C., R.C., H.J. and X.M. (Xu Ma); Writing—review & editing, L.C., E.M.A. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by financial support from the Guangxi Key R&D Program [grant No. 2021AB38023].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Xiao, M.; Liu, G.; Song, J.; Duan, C.; Jiang, T.; Zhang, L.; Yan, W.; Shen, S.; Zhou, Y.; Xiong, X.; et al. Analysis of Agronomic Characteristics of Different Cassava Varieties and Screening of High-Yielding Varieties. Crops 2022, 4, 77–82. [Google Scholar]
  2. Song, J.M.; Zhang, L.H.; Duan, C.F.; Jiang, T.; Liu, Q.; Xiong, X.; Chang, R.; Liu, G. Analysis and Comprehensive Evaluation of Main Agronomic Traits of Ten New Lines of Cassava. Guangdong Agric. Sci. 2020, 47, 8–15. [Google Scholar] [CrossRef]
  3. Zainuddin, I.M.; Lecart, B.; Sudarmonowati, E.; Vanderschuren, H. A method for rapid and homogenous initiation of post-harvest physiological deterioration in cassava storage roots identifies Indonesian cultivars with improved shelf-life performance. Plant Methods 2023, 19, 4. [Google Scholar] [CrossRef]
  4. Cao, S.; Chen, J.-F.; Huang, F.-Y.; Yan, H.-B.; Wei, C.-N.; Li, F.-S.; Lu, L.-Y.; Qin, X.-Y.; Chen, H.-X.; Li, H.-R. Development status and countermeasures of cassava industry in Guangxi. J. South. Agric. 2021, 52, 1468–1476. [Google Scholar]
  5. Ning, H. Research and Application of Functional Principal Component Analysis. Jilin Univ. 2021, 6–7. [Google Scholar] [CrossRef]
  6. Zhao, X.H. Research on Feature Extraction Based on Principal Component Analysis. Xidian Univ. 2016, 5–8. [Google Scholar] [CrossRef]
  7. Sun, J.G.; Liu, J.; Zhao, L.Y. Clustering Algorithms Research. J. Softw. 2008, 1, 48–61. [Google Scholar] [CrossRef]
  8. Gonçalves, M.G.M.; Avalos, F.A.P.; dos Reis, J.V.; Costa, M.V.; Silva, S.H.G.; Poggere, G.C.; Curi, N.; de Menezes, M.D. Pedology-based management class establishment: A study case in Brazilian coffee crops. Precis. Agric. 2022, 12, 1027–1050. [Google Scholar] [CrossRef]
  9. Ephzibah, E.P.; Remya, L. Dimensionality Reduction Using Principal Component Analysis and Multi-Label Fuzzy Classification for Rice Crop Disease in Rural Areas of India. Electrochem. Soc. Trans. 2022, 107, 16451. [Google Scholar] [CrossRef]
  10. Mengistu, S.; Asefa, M. Genetic Diversity Based on Cluster and Principal Component Analyses for Agro-morphological Traits of Wheat Germplasm. Int. J. Genet. Genom. 2022, 10, 79–80. [Google Scholar]
  11. Seid, E.; Mohammed, W.; Abebe, T. Genetic Diversity Based on Cluster and Principal Component Analyses in Potato (Solanum Tuberosum L.) for Yield and Processing Attributes. J. Hortic. 2021, 8, 1–6. [Google Scholar]
  12. He, W.; Zhang, X.F.; Guo, S.Y.; Yang, J.Y.; Li, H.R.; Liu, L.J. Comprehensive Evaluation of 22 Potato Germplasms Based on Principal Component Analysis and Cluster Analysis. Seed 2021, 40, 80–86. [Google Scholar] [CrossRef]
  13. Xu, Q.Y.; Yu, J.; Zhu, D.W.; Zheng, X.L.; Meng, L.Q.; Zhu, Z.W.; Shao, Y.F. Nutritional Quality Evaluation of Different Rice Varieties Based on Principal Component Analysis and Cluster Analysis. China Rice 2022, 28, 1–8. [Google Scholar] [CrossRef]
  14. Yu, H.; Zhao, Z.; Gao, A.; Luo, R. Quality Evaluation of 23 Species of Clausena lansium (Lour.) Skeels Germplasm Resources Based on Principal Component Analysis and Cluster Analysis. Chin. J. Trop. Crops 2022, 43, 1357–1364. [Google Scholar] [CrossRef]
  15. GB 5009.9-2016; National Standard for Food Safety, the Determination of Starch in Food. The National Health and Family Planning Commission of the People’s Republic of China: Beijing, China; The China Food and Drug Administration, and the Standardization Administration of the People’s Republic of China. China Standards Press: Beijing, China, 2016.
  16. Li, Z.X.; Nie, J.Y.; Li, J.; Kuang, L.X.; Shen, Y.M. Optimization of 3,5-Dinitrosalicylic Acid Colorimetry Determination Conditions of Reducing Sugar in Fruits by Response Surface Method. J. Instrum. Anal. 2016, 35, 1283–1288. [Google Scholar] [CrossRef]
  17. Zhang, C.; Zhang, Y.L.; Bai, L.L.; Zhao, D. Optimization of ultrasonic-assisted extraction of anthocyanins from Lonicera edulis by response surface methodology and anti-inflammatory activity studies. Biotechnology 2020, 30, 473–480. [Google Scholar] [CrossRef]
  18. Wu, P.; Jia, C.S.; Fan, S.Y.; Sun, Y. Principal component analysis and fuzzy comprehensive evaluation of fruit quality in cultivars of cherry. Trans. Chin. Soc. Agric. Eng. 2018, 34, 291–300. [Google Scholar] [CrossRef]
  19. GB 5009.88-2014; National Standard for Food Safety, the Determination of Dietary Fiber in Food; The National Health and Family Planning Commission of the People’s Republic of China. Standardization Administration of the People’s Republic of China: Beijing, China; China Standards Press: Beijing, China, 2014.
  20. Yao, G.F.; Zhang, S.L.; Wu, J.; Cao, Y.F.; Liu, J.; Han, K.; Yang, Z.J. Analysis of components and contents of soluble sugars and organic acids in ten cultivars of pear by high performance liquid chromatography. J. Nanjing Agric. Univ. 2011, 34, 25–31. [Google Scholar] [CrossRef]
  21. Liu, W.J.; Zhang, Y.; Zhang, Y.X.; Du, G.Q. Analysis and Evaluation of Fruit Quality and Aroma Components of ‘Yali’ Pear (Pyrus bretschneideri Rehd.) Pollinated with Eighteen Pollinizers. Food Sci. 2022, 43, 294–302. [Google Scholar] [CrossRef]
  22. Li, H.S. Experimental Principles and Techniques of Plant Physiology and Biochemistry; Higher Education Press: Beijing, China, 2000. [Google Scholar]
  23. Chen, M.; Lin, X.; Li, X.; Liu, X.; Gao, H.; Ming, J.; Dai, M.; Zhou, Z. Comprehensive Evaluation of Durian Quality Based on Principal Component Analysis and Cluster Analysis. Sci. Technol. Food Ind. 2023, 44, 278–286, (In Chinese with English Abstract). [Google Scholar] [CrossRef]
  24. Fujikoshi, Y.; Satoh, T.; Sugiyama, T. Asymptotic Distribution of the Contribution Ratio in High-Dimensional Principal Component Analysis. Am. J. Math. Manag. Sci. 2011, 31, 39–54. [Google Scholar] [CrossRef]
  25. Jiang, L.; Bao, Y.; Jia, Y.; Lai, Z.; Xu, F. Nutritional component analysis and comprehensive quality evaluation of 18 different varieties of Lonicera caerulea. Trans. Chin. Soc. Agric. Eng. 2022, 38, 326–335. [Google Scholar] [CrossRef]
Figure 1. Methodological framework.
Figure 1. Methodological framework.
Foods 13 01861 g001
Figure 2. Correlation analysis of nutritional value indicators of different varieties of cassava tubers. Note: * indicates a significant correlation (p < 0.05); ** indicates a highly significant correlation (p < 0.01).
Figure 2. Correlation analysis of nutritional value indicators of different varieties of cassava tubers. Note: * indicates a significant correlation (p < 0.05); ** indicates a highly significant correlation (p < 0.01).
Foods 13 01861 g002
Figure 3. A radar plot of nutritional value indicators for different varieties of cassava tubers.
Figure 3. A radar plot of nutritional value indicators for different varieties of cassava tubers.
Foods 13 01861 g003
Figure 4. Bottom lithotripsy map of main components.
Figure 4. Bottom lithotripsy map of main components.
Foods 13 01861 g004
Figure 5. PCA diagrams: (a) three-dimensional load diagram; (b) diagram of first and second principal component load; (c) diagram of second and third principal component load; (d) diagram of first and third principal component load.
Figure 5. PCA diagrams: (a) three-dimensional load diagram; (b) diagram of first and second principal component load; (c) diagram of second and third principal component load; (d) diagram of first and third principal component load.
Foods 13 01861 g005aFoods 13 01861 g005b
Figure 6. Principal component scores of cassava tuber nutrients.
Figure 6. Principal component scores of cassava tuber nutrients.
Foods 13 01861 g006
Figure 7. Load diagram of PCA of cassava tubers of different varieties.
Figure 7. Load diagram of PCA of cassava tubers of different varieties.
Foods 13 01861 g007
Figure 8. Results of systematic clustering analysis of cassava tubers: (a) Q-type clustering of nutritional value indicators of 22 cassava tubers; (b) R-type clustering of nutritional value indicators of 22 cassava tubers.
Figure 8. Results of systematic clustering analysis of cassava tubers: (a) Q-type clustering of nutritional value indicators of 22 cassava tubers; (b) R-type clustering of nutritional value indicators of 22 cassava tubers.
Foods 13 01861 g008
Table 1. Results of nutritional value determination of different cassava tuber varieties.
Table 1. Results of nutritional value determination of different cassava tuber varieties.
BreedCharacter
A1/(g·kg−1)A2/(mg·kg−1)A3/
(mg·kg−1)
A4/(g·kg−1)A5/(g·100 g−1)A6/(mg·g−1)A7/(mg·100 g−1)A8/
%
C111568036.25196.498.311.40.2441.6219.61
SC967636.13276.815.822.170.2924.118.67
Nanzhi 19967135.31263.246.091.620.1832.2118.46
GR 466426.49120.435.321.50.2635.7616.81
SC606867236.02249.456.091.80.2242.2318.96
Dongguan Hongwei68445.81312.927.871.230.1620.4518.36
Glutinous Rice Cassava69343.57335.467.751.840.1835.619.82
Qiongzhong 166320.95167.447.121.930.2134.2516.79
Guangdong 167836.76416.334.52.420.1632.1218.86
Colombia 8H66237.8812.674.922.240.1825.6815.56
Switzerland D3267939.46146.325.532.380.2228.4419.43
Hainan Hongxin68436.45101.537.131.340.2335.3819.68
Xinglong 168544.69194.166.12.010.2933.8816.34
Huifeng 6066938.65414.627.041.860.1538.9819.61
Guangxi Muci70949.62536.328.021.680.1541.1721.06
Huixian Baipi67037.55237.047.582.470.1939.0118.71
Rongyang167438.68235.967.641.690.237.5317.56
Swiss P1168043.13227.916.442.130.2425.8517.98
Gui Cassava 472853.24190.988.641.860.2139.4319.73
Shatian Bread67140.29340.296.21.990.1736.7318.21
TMS6044467737.56133.745.342.650.2224.4818.68
BRA90065320.71482.574.232.50.1829.6715.26
Note: A1 is starch; A2 is reduced sugar; A3 is anthocyanins; A4 is protein; A5 is dietary fiber; A6 is quinic acid; A7 is vitamin C; and A8 is dry matter content.
Table 2. Differential analysis of 22 different nutrient components of cassava tubers.
Table 2. Differential analysis of 22 different nutrient components of cassava tubers.
ItemMinimumMaximumAverageStandard DeviationsCoefficient of Variation/%
A1/(g·kg−1)653.00728.00685.234621.26743.10
A2/(mg·kg−1)2.7153.2414.267915.2364106.79
A3/(mg·kg−1)12.671198.32242.3659295.6314121.98
A4/(g·kg−1)4.238.647.12651.269717.82
A5/(g·100 g−1)1.212.651.69841.365980.42
A6/(mg·g−1)0.140.290.20590.0410219.92
A7/(mg·100 g−1)20.6042.303526.365875.33
A8/%15.2621.066.22771.268520.37
Table 3. Data standardization.
Table 3. Data standardization.
BreedCharacter
A1A2A3A4A5A6A7A8
C11150.10686−0.21925−0.448851.40851−1.358830.831121.301170.84253
SC9−0.14192−0.234610.17571−0.562830.573882.05010−1.468600.20360
Nanzhi 199−0.45290−0.339520.07020−0.34907−0.80663−0.63165−0.186480.06086
GR 4−0.88827−1.46802−1.04028−0.95868−1.107831.318710.37475−10.06066
SC6068−0.39070−0.24868−0.03703−0.34907−0.354820.343531.397600.40072
Dongguan Hongwei0.355651.003920.456501.06016−1.78553−1.11925−2.04564−0.00711
Glutinous Rice Cassava0.915410.717320.631770.96516−0.25442−0.631650.349450.98527
Qiongzhong 1−0.95047−2.17684−0.674740.46639−0.028520.099730.13603−1.07425
Guangdong 1−0.01753−0.154001.26061−1.607881.20138−1.11925−0.200700.33275
Colombia 8H−1.01266−0.01070−1.87822−1.275360.74958−0.63165−1.21882−1.91029
Switzerland D320.044670.19146−0.83896−0.792421.100980.34353−0.782480.72018
Hainan Hongxin0.35565−0.19367−1.187250.47430−1.509430.587330.314670.89011
Xinglong 10.417840.86062−0.46696−0.341150.172282.050100.07754−1.38012
Huifeng 60−0.577290.087821.247310.40305−0.20422−1.363040.883810.84253
Guangxi Muci1.922991.491402.193641.17892−0.65602−1.363041.230031.82811
Huixian Baipi−0.51510−0.05292−0.133530.830571.32688−0.387860.888550.23079
Rongyang1−0.266310.09166−0.141930.87807−0.63092−0.144060.65457−0.55087
Swiss P110.106860.66102−0.20453−0.071970.473480.83112−1.19194−0.26540
Gui Cassava 43.092271.95456−0.491691.66978−0.204220.099730.954950.92410
Shatian Bread−0.452900.297650.66933−0.261980.12208−0.875450.52810−0.10906
TMS60444−0.07972−0.05164−0.93679−0.942851.778680.34353−1.408530.21040
BRA900−1.57243−2.207551.77569−1.821641.40218−0.63165−0.58803−2.11420
Table 4. Variance contribution rate from the PCA of cassava tubers.
Table 4. Variance contribution rate from the PCA of cassava tubers.
IngredientInitial EigenvalueExtracted Sum of Load Squares
TotalVariance %Accumulation %TotalVariance %Accumulation %
13.35441.92141.9213.35441.92141.921
21.55019.36961.2901.55019.36961.290
31.18814.85276.1421.18814.85276.142
40.7008.74884.891
50.4525.65390.544
60.3684.60195.145
70.2473.08298.228
80.1421.772100.000
Table 5. Principal component load matrix of the cassava tuber nutrients.
Table 5. Principal component load matrix of the cassava tuber nutrients.
ItemPrincipal Component
123
A30.843−0.1800.375
A80.843−0.204−0.232
A70.8220.0960.112
A20.760−0.1130.538
A10.2830.832−0.017
A5−0.252−0.8110.036
A60.5170.097−0.612
A4−0.5180.3080.562
Table 6. Eigenvectors of the nutrient composition correlation matrix of the cassava tubers.
Table 6. Eigenvectors of the nutrient composition correlation matrix of the cassava tubers.
ItemPrincipal Component
123
A30.251−0.1160.316
A80.227−0.0730.453
A70.0840.537−0.014
A20.251−0.132−0.196
A1−0.1540.1990.473
A5−0.075−0.5230.031
A60.1540.063−0.515
A40.2450.0620.095
Table 7. Principal component and comprehensive score of different varieties of cassava tubers.
Table 7. Principal component and comprehensive score of different varieties of cassava tubers.
BreedF1Sort-1F2Sort-2F3Sort-3Comprehensive Score (Fc)Sort-c
C11150.846610574−0.9942186419−1.5423606221−0.06673399112
SC9−0.633806517−0.8356597171.067399034−0.26902684617
Nanzhi 199−0.11468116120.322954668−0.5287822815−0.06405714511
GR 4−1.0146561420−1.1741749920−1.5201237120−0.87854872822
SC60680.097041189−0.0477149610−1.0060531319−0.11798028814
Dongguan Hongwei0.6637490150.0913889690.5265555190.3741553745
Glutinous Rice Cassava1.0696842730.4161329170.18991147110.5572287783
Qiongzhong 1−0.9176552219−0.270602913−1.5509306522−0.66744754121
Guangdong 1−0.38749749161.7348919820.890722260.3058784666
Colombia 8H−1.458273321−0.43771123150.732602927−0.58729685320
Switzerland D32−0.3540699615−0.33029663141.270714132−0.023678369
Hainan Hongxin0.519563967−1.2598174721−0.8249361717−0.14872715816
Xinglong 1−0.3310915714−1.43562142220.569298188−0.33231024418
Huifeng 600.5571794161.457304684−0.75307342160.403994064
Guangxi Muci2.1419596211.4635848530.20914306101.2124745691
Huixian Baipi0.07408384100.419461446−0.16775428120.0873873078
Rongyang10.236243278−0.211046611−0.905073918−0.0762636513
Swiss P11−0.2422032713−0.59298305161.188536223−0.0398675210
Gui Cassava 41.995094642−0.96117721181.044665650.8053469452
Shatian Bread0.04582152110.933482375−0.2180308130.1676331057
TMS60444−0.812124818−0.26698425121.746675241−0.1327468115
BRA900−1.98096505221.978815521−0.4190898114−0.50940679919
Note: F1F3 represents the scores of the main components of the different varieties of cassava tubers.
Table 8. Clustering analysis of nutritional value indicators for 22 types of cassava tubers.
Table 8. Clustering analysis of nutritional value indicators for 22 types of cassava tubers.
ItemMean Value
Category
I
(2)
Category
II
(12)
Category
III
(1)
Category
IV
(6)
Category
V
(1)
A1/(g·kg−1)718.60674.08684.00676.50653.00
A2/(mg·kg−1)51.4335.5845.8139.8120.71
A3/(mg·kg−1)363.65256.52312.92165.27482.57
A4/(g·kg−1)8.336.737.875.694.23
A5/(g·100 g−1)1.771.821.232.262.50
A6/(mg·g−1)0.180.200.160.240.18
A7/(mg·100 g−1)40.3036.7920.4527.0729.67
A8/%20.3918.5918.3617.7815.26
Note: A1 is starch; A2 is reducing sugars; A3 is anthocyanins; A4 is protein; A5 is dietary fiber; A6 is quinic acid; A7 is vitamin C; and A8 is dry matter content.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, L.; Chen, R.; Atwa, E.M.; Mabrouk, M.; Jiang, H.; Mou, X.; Ma, X. Nutritional Quality Assessment of Miscellaneous Cassava Tubers Using Principal Component Analysis and Cluster Analysis. Foods 2024, 13, 1861. https://doi.org/10.3390/foods13121861

AMA Style

Chen L, Chen R, Atwa EM, Mabrouk M, Jiang H, Mou X, Ma X. Nutritional Quality Assessment of Miscellaneous Cassava Tubers Using Principal Component Analysis and Cluster Analysis. Foods. 2024; 13(12):1861. https://doi.org/10.3390/foods13121861

Chicago/Turabian Style

Chen, Lintao, Rui Chen, Elsayed M. Atwa, Mahmoud Mabrouk, Huanyu Jiang, Xiangwei Mou, and Xu Ma. 2024. "Nutritional Quality Assessment of Miscellaneous Cassava Tubers Using Principal Component Analysis and Cluster Analysis" Foods 13, no. 12: 1861. https://doi.org/10.3390/foods13121861

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop