Next Article in Journal
Research on Tobacco Field Semantic Segmentation Method Based on Multispectral Unmanned Aerial Vehicle Data and Improved PP-LiteSeg Model
Previous Article in Journal
Prediction of Anthracnose Risk in Large-Leaf Tea Trees Based on the Atmospheric Environmental Changes in Yunnan Tea Gardens—Cox Regression Model and Machine Learning Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nutritional Diagnosis of Potato Crops Using the Multivariate Method

by
Danilo dos Reis Cardoso Passos
1,
Arthur Bernardes Cecílio Filho
1,*,
Rogério Peres Soratto
2,
Danilo Eduardo Rozane
3,
Danilo Ricardo Yamane
1,
Adalton Mazetti Fernandes
2,
Emerson de Freitas Cordova de Souza
4,
Fabiana Morbi Fernandes
5,
André Luiz Gomes Job
6 and
Camila Seno Nascimento
1
1
Department of Plant Production, São Paulo State University (Unesp), Jaboticabal 14884-900, Brazil
2
Department of Crop Science, São Paulo State University (Unesp), Botucatu 18610-034, Brazil
3
Department of Agronomy and Natural Resources, São Paulo State University (Unesp), Registro 11900-000, Brazil
4
Department of Soil, Water, and Climate, University of Minnesota, Saint Paul, MN 55108, USA
5
Timac Agro Industria e Comércio de Fertilizantes Ltda., Campinas 13069-960, Brazil
6
McCain Brasil Alimentos Ltda., Araxá 38180-555, Brazil
*
Author to whom correspondence should be addressed.
Agronomy 2024, 14(7), 1500; https://doi.org/10.3390/agronomy14071500
Submission received: 24 May 2024 / Revised: 4 July 2024 / Accepted: 8 July 2024 / Published: 10 July 2024
(This article belongs to the Section Plant-Crop Biology and Biochemistry)

Abstract

:
The compositional nutrient diagnosis (CND) method considers the multiple relationships among nutrients and has been proposed to evaluate the nutritional status of plants in place of the univariate and bivariate methods. As it is mathematically based and considers the interactions among all nutrients at the same time, it avoids the errors and trends observed in the calculations of other methods estimating nutritional status, enabling a greater relationship with productivity. The objective of this study was to obtain the CND norms for high-yielding populations of potato crops. For this, 587 samples were used from 21 experimental areas in the state of São Paulo, Brazil to correlate the leaf nutrient contents and the yields of potato crops. Crops with yields higher than 48,993.24 kg ha−1 were considered to have high yields, and the Mahalanobis distance separated the balanced samples from the nutritionally unbalanced ones. Thus, the CND-ilr method generated the norms and classified the 587 samples as nutritionally balanced with a high yield (5% of the total), nutritionally unbalanced with a low yield (92%), nutritionally unbalanced with a high yield (0.3%), or nutritionally balanced with a low yield (2.7%), with accuracy, sensitivity, specificity, NPV, and PPV scores of 96.9, 97.1, 93.6, 64.4, and 99.6%, respectively.

1. Introduction

Potato (Solanum tuberosum L.) is a key component of global food security, being grown on nearly 18 million hectares in more than 150 countries [1,2]. In Brazil, it is cultivated on 118.1 thousand hectares, with a production of 3.6 million tons in 2020 [3,4]. In the last ten years, there has been a 28.1% increase in the yield of potatoes grown in Brazil, driven by improvements in cultivation techniques and the adoption of more efficient varieties [2]. Such an increase ranks the country as the 21st largest producer in the world [5], with production concentrated in the south and southeast regions, mainly in the states of Minas Gerais, São Paulo, and Paraná [4].
There are several factors responsible for the yield and profitability of potato crops. One of the main ones is fertilization, since potato is a highly nutrient-demanding crop [6], which makes fertilizers responsible for 19% of the crop’s production cost [7]. Thus, nutritional monitoring is essential to evaluate and adjust fertilization management to ensure balanced nutrition, maximize the production potential of the crop, and avoid environmental damage [8].
Among the possibilities for monitoring nutritional status, the compositional nutrient diagnosis (CND) method has received increased attention from researchers. This method employs a multivariate approach that considers the interactions among all nutrients in order to evaluate the balance of the nutritional status of the plant [9,10]. When applied to foliar nutrient analysis, the method associates each nutrient of interest with the geometric mean of all nutrients in the leaf dry matter and allows researchers to infer the analytically determined balance of nutrients, as well as other elements not directly assayed [9,11].
The CND method makes it possible to group nutrients from the most limited, due to deficiency, to those at an excessive level. As a result, the generated norms are more independent of local conditions compared to the norms generated by calibration curves. The CND standards are a set of reference values for a plant’s nutrients that are used to assess its nutritional status. They allow one to more accurately identify the nutritional needs of each plant, avoiding nutrient deficiencies and excesses, as they take into account the interactions among nutrients [12]. The method also makes it possible to assign the same weights to deficiencies and excesses in the imbalance, which can be detected with the use of Mahalanobis distance [13]. This makes it possible to define the contribution of each nutrient to the nutritional composition of the dry matter and to assign a single standard deviation to each sample, with the possibility of its identification in the compositional space, and thus the exclusion of outliers, which increases the reliability in the interpretation of the results [13,14]. Compositional space refers to a coordinate system that represents the nutritional composition of a sample in relation to the content of all analyzed nutrients. Each nutrient occupies an axis in this space, and the position of the sample in this space is defined by the concentrations of each nutrient in its composition [9]. The identification of outliers in this compositional space is important to ensure the reliability of the interpretation of the results. By excluding these outliers from the compositional space, it is possible to obtain a more accurate view of the real distribution of the nutrient content in the samples, which facilitates the identification of patterns and trends in the nutritional composition of potato plants [15].
The methodology has been widely applied to annual crops, such as soybean [16,17], fruit trees [11,14], and maize [13,18]. However, few studies on the nutritional diagnosis of potato crop by CND have been conducted [19,20,21], among which none have been carried out under Brazilian conditions, which differ in terms of cultivars, climate, cultural management, and CND norms.
The arrangement of orthonormal balances with the use of the CND-ilr (isometric log ratio) methodology complements the CND-clr method because it employs an orthogonal balance between two groups of components (chemical elements evaluated from dry matter), not overlapped and arranged in the order of D−1 contrasts, where D is the number of components of the dry matter. Thus, the concept of CND-ilr was formulated, in which binary sequential partitions reflect the interactions of the nutrients. Binary sequential partitions can be defined according to the already known interactions of nutrients [11,14]. CND-clr uses the “centered log-ratio” (clr) transformation, which divides each nutrient content by the sum of all nutrient contents in the sample and then applies the natural logarithm of the result. This transformation seeks to centralize the data around zero and ensure that the scale of values is the same for all nutrients [22]. However, CND-clr has some limitations, such as sensitivity to the presence of outliers (extreme values) and heteroscedasticity (unequal variation in variance) in the data. To overcome these limitations, CND-ilr was proposed as an evolution of CND-clr [13]. CND-ilr uses the “isometric log-ratio” (ilr) transformation, which divides each pair of nutrient contents by the sum of the contents and applies the natural logarithm of the result. This transformation compares the nutrient contents in pairs, instead of comparing them with the sum of all contents, as in CND-clr [14]. Thus, the CND-ilr method represents a significant advance compared to CND-clr in several aspects, especially in the interpretation and precision of nutritional diagnosis, as it has a greater capacity to identify nutrient deficiencies and excesses, especially in cases of complex interactions among them, increasing the sensitivity of the method to detect subtle nutritional imbalances, which makes it more accurate and reliable [11,23].
The use of a meta-analysis database allows for a robust statistical evaluation, considering the variability between different studies and cultivation contexts. This approach is advantageous because it provides a broad database that can increase the generalizability of results and provide more robust insights into the usefulness of the CND-ilr method [24]. Thus, CND-ilr can effectively contribute to improving soil fertility management, potato nutrition, and increasing crop productivity.
In view of the advantages of the method, the proven feasibility of performing the nutritional diagnosis of crops, and the scarcity of studies on CND-ilr in potato crops, it is believed that the CND-ilr norms for the high-yielding populations of potato crops contribute to the evaluation of the nutritional status of potato crops, classifying the samples into nutritionally balanced and nutritionally unbalanced in high- and low-yield scenarios. Therefore, the objective of this study was to obtain the CND norms for high-yielding populations of potato crops.

2. Materials and Methods

2.1. Database

A database of potato tuber yields and the associated leaf nutrient contents for 752 experimental units was constructed from 21 experiments carried out between 2011 and 2019 in municipalities in the state of São Paulo, Brazil (Avaré, Bernardino de Campos, Botucatu, Cerqueira César, Itaí, Jaboticabal, São Manuel, and Taquarituba) (Table 1).
The potato cultivars used in the experiments are the main ones used in the state of São Paulo. Agata has a high yield and is the most planted cultivar in Brazil, occupying approximately 50% of the planted area [29]. Asterix is the most widely used cultivar in the pre-fried and frozen potato stick industry. Atlantic accounts for the largest demand in the domestic market for potato chips [30]. Electra is used for the fresh market. Markies is used for cooking and frying [31]. Mondial has a high yield and is suitable for the fresh market.
The climatic classification of experiments 1 to 19 is the Cfa type (a hot, humid, temperate (mesothermal) climate), and the average temperature of the hottest month was above 22 °C. Experiments 20 and 21 were carried out in a tropical climate, the Aw type, with drought in winter [32]. Irrigation was carried out using a center pivot or conventional sprinkler, and phytosanitary management was carried out according to the technical recommendations for the crop [33].

2.2. Foliar Diagnosis and Tuber Yield

In each experimental unit, the nutritional diagnostic leaves were sampled. The third leaf of the stem was collected approximately 30 days after the emergence of the apical tuft of fifteen plants per plot, with one complete leaf (leaf + petiole) per plant, as recommended by Lorenzi et al. [34]. The sampled leaves were washed, dried in an oven with forced ventilation at 65 °C until reaching a constant mass, and then ground in a Wiley mill with a 120-mesh sieve. Then, the dry leaves were digested, and the nutrient contents (N, P, K, Ca, Mg, S, B, Cu, Fe, Mn, and Zn) were read, according to methodologies proposed by Malavolta et al. [35]. In each experimental unit, in addition to the leaves for nutritional diagnosis, the tuber yield was evaluated, estimating the production for one hectare.

2.3. Obtaining the Norms Using the CND-ilr Method

The contents of all nutrients are expressed in mg kg−1, and the yield is expressed in kg ha−1. The samples in the database were sorted in descending order according to yield. For each nutrient, outliers were removed using interquartile variation (IQR) and a boxplot to display the distribution of the data. The boxplot was divided into quartiles, and the lower and upper limits were defined. Quartiles divide a series of ordered data into four equal parts: the first quartile (Q1) separates the bottom 25% from the top 75%; the median (Q2) divides the data into the bottom 50% and the top 50%; and the third quartile (Q3) separates the bottom 75% from the top 25%. This analysis is crucial to understanding the dispersion and central tendency of the data. The IQR was the difference between the Q1 and Q3 quartiles, so all values that exceeded the limits were considered outliers, as proposed by Rousseeuw and Croux [36], using the Mahalanobis distance [13]. This procedure is recommended for improving the evaluation parameters of the sample classifications and, consequently, the quality of the CND norms [10,11,14,23].
After excluding the outliers, the data normality was checked with the Shapiro–Wilk test, as indicated by Hair et al. [37], since data normality is a condition required for making valid inferences about population parameters.
Subsequently, for each sample, the filling value (Fv) was calculated by deducting the sum of the nutrient contents from 1,000,000 in the leaf dry matter. The filling value (Fv) represents the amount, in ppm, not accounted for by nutrients in the leaf analysis. Therefore, by subtracting the nutrient contents from 1,000,000 in the dry matter of the leaves, the Fv is obtained, which indicates the remaining concentration that is not composed of the analyzed nutrients [21]. The geometric mean (g) of the nutrient contents of each sample in the database [38] was calculated as follows:
g = ( N × P × × Z n × F v ) 1 ( d + 1 )
where d is the number of nutrients evaluated.
For the 12 components (D) corresponding to the 11 nutrients and the Fv, the 11 orthonormal balances (D−1) and sequential binary partition were established [9,14]. The balances were defined using the interpretation of the data according to Seaborn, which is a data visualization library of the Python program, establishing high-level correlations among the nutrients. Using the clustermap function with predefined parameters (spearman parameters (method), Euclidean distance (metric), and a scale pattern equal to 1), a heatmap was constructed and is presented in the form of a dendrogram. The isometric logarithmic ratio (ilr) was calculated as proposed by Egozcue and Pawlowsky-Glahn [10]:
i l r j = n j + n j n j + + n j l n g ( c j + ) g ( c j )
where n j + is the number of components in the numerator (r); n j is the number of components in the denominator(s), determined from the sequential binary partition; g ( c j + ) is the geometric mean between the components in the numerator; and g ( c j ) is the geometric mean between the components in the denominator.
For each ilr calculated, the mean of the high-yielding population was calculated, and then the covariance matrix of the high-yielding population was constructed. The Mahalanobis distance (D2) was calculated to separate the nutritionally balanced from the nutritionally unbalanced samples [13], based on the Euclidean space [14], as follows:
D 2 = i l r j i l r j * T C O V 1 ( i l r j i l r j * )
where i l r j * is the barycenter of a reference population, COV is the covariance matrix of the reference population (TN), and i l r j is the sample being used in the calculation.
Also, using the Cate–Nelson method allows for maximizing the sum of squares between two partitions, with the following formula:
R 2 = ( i = 1 k Y i ) 2 k + ( j = k n Y j ) 2 ( n k ) C F
where Y is the yield; k is an elementary count that begins with the first observation ordered above n ; n is a variable that represents a certain value or cut-off point of the high-yielding population; and CF is a correction factor.
The D2 value and its corresponding yield were observed in the sample that obtained the highest R2 value (Figure 1).
The D2 value and the yield represent the two cut-off points for determining the Cate–Nelson partition of the samples into four quadrants, which were named by Parent et al. [14] as follows: (1) True negative (TN) samples are nutritionally balanced and have a high yield and adequate nutritional status; they are classified as the “reference population”. (2) False positive (FP) samples indicate a Type I error; they are nutritionally unbalanced samples with a high yield. FP represents samples with a “luxury consumption” of nutrients or a high nutrient use efficiency. (3) False negative (FN) samples indicate a Type II error. FN corresponds to samples influenced by other production factors (climate, pests, diseases, and others), negatively affecting the production performance of the crop. (4) True positive (TP) samples have a low yield and are nutritionally unbalanced, where at least one nutrient is causing the imbalance.
The parameters to evaluate the classifications of the samples in the quadrants were calculated according to Parent et al. [14]:
Accuracy (ACC) is the probability of a sample being correctly identified as balanced or unbalanced, calculated by:
ACC = (TN + TP)/(TN + FN + TP + FP)
Sensitivity (SEN) is the probability that a low-yield sample is unbalanced, calculated by:
SEN = TP/(TP + FN)
A positive predicted value (PPV) is the probability that an unbalanced sample will return to poor performance, calculated by:
PPV = TP/(TP + FP)
Specificity (SPC) is the probability of a high-yield sample being balanced, calculated by:
SPC = TN/(TN + FP)
A negative predicted value (NPV) is the probability that a balanced sample will return to high performance, calculated by:
NPV = TN/(TN + FN)
Thus, the CND-ilr norms corresponded to the mean and the multiplication matrix, which were calculated from the CND-ilr coordinates of the reference population (TN).

3. Results

The normality test was performed on the 752 samples that made up the initial database, which showed normality (Shapiro–Wilk W = 0.8897, p = 0.999). However, 165 outliers were removed—140 using the IQR method and 25 using the two-tailed probability (p) of the chi-square distribution (p < 0.05). The test was performed again on the 587 remaining data, which showed normality, with W = 0.9206 and p = 0.999.

CND-ilr, Norms, Sequential Binary Partition, and Cate–Nelson Partition

In the heatmap, which shows the groups hierarchically, the correlations among the nutrients according to the study were observed using the Seaborn tool in Python (Figure 2), based on which 11 balances and sequential binary partitioning were constructed (Table 2).
After optimizing the true negative (TN; n = 29), false positive (FP; n = 2), true positive (TP; n = 540), and false negative (FN; n = 16) quadrants (Figure 3), to achieve the highest accuracy (97%), cut-off points whose D2 was 3.87 and yield was 48,993 kg ha−1 were obtained with n = 587, establishing a high-yielding population with an increase of 63.3% compared to the Brazilian average of 30,000 kg ha−1, as indicated by IBGE [4].
The present study indicates that the high-yielding population has a production variation between 48,993 and 61,328 kg ha−1; that is, the samples classified in the TN group have a yield above the national average. The population classified as nutritionally unbalanced with a low yield (TP) had minimum and maximum yields of 2778 and 48,638 kg ha−1, respectively, which indicates great potential for an increased yield with adequate nutritional balances of the production areas. Despite being nutritionally unbalanced, there were 92 samples in the TP quadrant that had a yield above the Brazilian average [4]. This indicates that the Brazilian average potato yield could be improved if the crop generally responds to enhanced nutrient balance.
Of the total samples, 29 samples (5%) were classified as TN, i.e., as a reference population. Of these, 20 and 9 samples were from the N and P experiments, respectively. Parent et al. [14], Modesto et al. [18], and Marchand et al. [39], in similar studies with mango, sweet corn, and cranberry crops, found that 13, 10, and 15% of the database samples were classified as TN, respectively. Regarding the TP group, it was represented by 540 (92%) of the samples, with 148, 318, and 74 samples from the N, P, and K experiments, respectively (Table 3).
The results of the accuracy, sensitivity, specificity, NPV, and PPV parameters were 96.9, 97.1, 93.6, 64.4%, and 99.6%, respectively. The high values of the parameters can be attributed to the fact that the database consists of samples obtained from experiments with nutrient doses that allowed for greater discrepancies among the yields as a function of the leaf nutrient contents.
For the initial database, without the elimination of outliers, the values obtained for accuracy, sensitivity, specificity, NPV, and PPV were 70.6, 70.7, 50.0, 0.45, and 99.8%, respectively. Therefore, with the removal of outliers, the parameters of accuracy, sensitivity, specificity, and NPV improved by 37, 37, 87, and 14%, while the PPV parameter decreased by 0.2%. When comparing the nutrient balances between the TN and the TP, FN, and FP classes, the t-test detected significant differences (Table 4).
When comparing the TN and TP classes, it is possible to observe differences in balances 1, 2, 5, 7, 8, 9, and 10, indicating that there were differences between the nutritional balance classes and the main sources of potential nutritional imbalances that impaired the crop yield in the TP group. Thus, there were four differences in the balances between the TN and FN classes (Table 4), although the samples of both classes were nutritionally balanced.
The CND-ilr norms generated corresponded to the multiplication matrix (multi-nutrient) and to the mean calculated from the CND-ilr coordinates of the reference population (TN) (Table 5).
Figure 4 presents a balance dendrogram derived using an ad hoc sequential binary partition (SBP) that obtained the linkages among groups of components. SBP is a (D−1) × D matrix, where the parts labeled “+1” (group numerator) are balanced with parts labeled “−1” (group denominator) in each row, and the part labeled “0” is excluded (Table 2). The composition was partitioned sequentially in each ordered row into two contrasts until the subcompositions (+1) and (−1) each contained a single role, with the overall mean values of ilr in the fulcrums and univariate confidence intervals of 0.05 for the TN and TP classes.

4. Discussion

Variations in nutrient contents can establish significant differences in a given balance, such as in the [Mg, N, K|Ca, Zn] and [S|B] balances. However, when these are part of another, more comprehensive balance considering more nutrients, for example, [P, Fe, Mn|Mg, N, K, Ca, Zn, Cu, S, B], these contents are understood as being in balance (Table 4), as also observed by Parent et al. [14].
In the comparisons between the TN and FP classes, there was a significant difference only in balance 10 (Table 4), which shows nutritional imbalance despite having a high yield. This was probably due to the nutrient content being within the range considered “luxury consumption”, to the contamination of the leaf sample with the nutrient not belonging to the leaf content, or, exceptionally, to high nutrient use efficiency [14]. Balances can be represented metaphorically, such as by a moving autonomous diagram with fulcrums or the weighing pans of a scale, where the nutrient contents directly impact the nutritional balances in the fulcrums after a change [11,14].
The balances that were significant (Figure 4) corroborate the results found in Table 4, whose ilrs were 1, 2, 5, 7, 8, 9, and 10. Between the TN and TP classes, there was a significant difference in all balances in which N was present, except for the balance that compared all nutrients with the Fv, which was not significant (Figure 4). Also, the balances of [N|K], [N, K|Mg], [Mg, N, K,|Ca, Zn], and [Mg, N, K, Ca, Zn|Cu, S, B] had negative means in the samples in the TP quadrant, which were −0.54, −2.69, −4.46, and −6.61, respectively.
N was mainly responsible for causing imbalance. The N deficiency of the samples in TP, represented by values of −0.54 in the [N|K] balance and −2.69 in the [N, K|Mg] balance (Figure 4), is directly related to the yield due to its high demand by potato crops, being the second most accumulated nutrient [40], with a strong association with leaf growth, photosynthesis, and tuber production [40,41]. In the [Mg, N, K|Ca, Zn] balance, there are also competitive interactions among the K, Ca, and Mg cations, as reported by Hawkesford [42].
In the [Fe|Mn] balance, there was no difference between the TN and TP samples. Fe and Mn are directly related to the genesis of the Latossolo Vermelho (Oxisol) in the experimental areas, being supplied in sufficient quantities to the plant. However, in the [P|Fe, Mn] balance, there was an imbalance caused by the excess of P, which can be justified by the very high doses evaluated in the experiments, even for soils with high contents of the nutrient. In well-conducted fertilizer calibration experiments, it was indicated that the high doses optimized the fitting of the response curves; however, it was not optimized in the partition for the best nutritional balances.
Difficulties in fitting deterministic models to facts and data, controllable or not, in order to derive indications of the adequate doses of nutrients, highlight the historical difficulties [43] of this technique employing a limited number of calibration experiments. Rozane et al. [23] and Lima Neto [44] showed that the number of experiments with fertilizers could be reduced by observing large amounts of data from current production bases, as well as through the compositional analysis of numerous calibration experiments [11,39], as performed in the present study.
For the [S|B] balance, the indices of the TP samples did not differ from the indices of the TN samples. The [S|B] balance in the TP quadrant is probably similar to that observed for a population with a high yield (nutritionally balanced); i.e., this does not seem to be the problem of the TP samples. However, the [Cu|S, B] balance was nutritionally unbalanced, which indicates that the limiting factor was Cu deficiency and/or that the samples in the TP quadrant did not receive fungicide applications, which generally contain high concentrations of Cu, sufficiently equal to TN, as also reported by Rozane et al. [23] and Modesto et al. [18], and that the imbalance caused by Cu is not effectively an imbalance, but rather an increase in the Cu content in TP due to the deposition of the element on the leaves, generating contamination. Also, excess P in the soil can reduce the availability of Cu to plants, which explains the low Cu content in leaf tissue, since Cu deficiency is not common in clay soils. Thus, the imbalance in the [Mg, N, K, Ca, Zn|Cu, S, B] balance was caused by N and Cu, both due to deficiency. In the [P, Fe, Mn|Mg, N, K, Ca, Zn, Cu, S, B] balance, the main nutrients responsible for nutritional imbalances were P on the left side, due to excess, and N and Cu on the right side, due to deficiency (Figure 4).
There was no significant difference between the TN and TP groups for the following balances: [Ca|Zn], [Fe|Mn], [S|B] and [Fv|P, Fe, Mn, Mg, N, K, Ca, Zn, Cu, S, B], which probably did not negatively affect the production of potato tubers. All nutrients evaluated were compared with the filling value (Fv), as nutrient accumulation in the leaf tissues or as a dilution measure [45].
From the observed balances (Figure 4), it was found that the nutritional balances were more important than the absolute values themselves, since there was no significant difference between the TN and TP groups for [Fv|P, Fe, Mn, Mg, N, K, Ca, Zn, Cu, S, B]. Thus, nutritional interactions should be considered in the concept of nutritional balance, where nutrient groups need to be optimally balanced, rather than just considering the individual nutrient contents [10,11,14].

5. Conclusions

Considering the Mahalanobis distance of 3.87 and yield of 48,993.24 kg ha−1, the CND-Ilr method generated the following norms—Ilr1 (−0.0974), Ilr2 (−1.9180), Ilr3 (−3.5445), Ilr4 (−0.7164), Ilr5 (−3.7686), Ilr6 (−3.2896), Ilr7 (−2.0332), Ilr8 (0.6681), Ilr9 (−4.5701), Ilr10 (1.5875), and Ilr11 (−6.2811)—and classified 5% of the 587 samples as nutritionally balanced with a high yield, 92% as nutritionally unbalanced with a low yield, 0.3% as nutritionally unbalanced with a high yield, and 2.7% as nutritionally balanced with a low yield, with accuracy, sensitivity, specificity, NPV, and PPV scores of 96.9, 97.1, 93.6, 64.4, and 99.6%, respectively.

Author Contributions

Conceptualization, A.B.C.F.; methodology, A.B.C.F. and D.E.R.; formal analysis, D.d.R.C.P., A.B.C.F., D.E.R. and D.R.Y.; investigation, R.P.S., A.M.F., E.d.F.C.d.S., F.M.F., A.L.G.J. and C.S.N.; resources, R.P.S. and A.B.C.F.; data curation, R.P.S., A.M.F., E.d.F.C.d.S., F.M.F., A.L.G.J. and C.S.N.; writing—original draft preparation, D.d.R.C.P. and D.R.Y.; writing—review and editing, A.B.C.F. and D.E.R.; supervision, A.B.C.F. and D.E.R.; project administration, A.B.C.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data are available on request from the corresponding author.

Acknowledgments

We would like to thank CAPES and CNPq.

Conflicts of Interest

Author Fabiana Morbi Fernandes was employed by the company Timac Agro Industria e Comércio de Fertilizantes Ltda., Author André Luiz Gomes Job was employed by the company McCain Brasil Alimentos. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflicts of interest.

References

  1. Çalışkan, M.E.; Yousaf, M.F.; Yavuz, C.; Zia, M.A.B.; Çalışkan, S. History, Production, Current Trends, and Future Prospects. In Potato Production Worldwide; Elsevier: Amsterdam, The Netherlands, 2023; pp. 1–18. [Google Scholar]
  2. FAOSTAT. Production: Crops. Available online: https://www.fao.org/faostat/en/#data/QCL (accessed on 12 March 2024).
  3. ABBA (Associação Brasileira da Batata). Previsão Consumo de Batata. Batata Show. Available online: https://www.abbabatatabrasileira.com.br/revista/edicao-58/ (accessed on 15 March 2024).
  4. IBGE (Instituto Brasileiro de Geografia e Estatística). Levantamento Sistemático da Produção Agrícola. Available online: https://biblioteca.ibge.gov.br/visualizacao/periodicos/2415/epag_2021_dez.pdf (accessed on 15 March 2024).
  5. Salvador, C.A.; Pereira, J.R. Prognóstico Agropecuário de Olericultura 2021/2022, 37th ed.; Governo do Estado do Paraná: Amsterdam, The Netherlands, 2021; ISBN 2764-2887. [Google Scholar]
  6. EMBRAPA. Sistema de Produção de Batata. Available online: https://ainfo.cnptia.embrapa.br/digital/bitstream/item/132923/1/Sistema-de-Producao-da-Batata.pdf (accessed on 13 December 2023).
  7. CEPEA (Centro de Estudos Avançados em Economia Aplicada). PIB Do Agronegócio. CEPEA/ESALQ/USP. Available online: https://www.cepea.esalq.usp.br/br/categoria/batata-cepea.aspx (accessed on 20 February 2024).
  8. Esteves, E.; Locatelli, G.; Bou, N.A.; Ferrarezi, R.S. Sap Analysis: A Powerful Tool for Monitoring Plant Nutrition. Horticulturae 2021, 7, 426. [Google Scholar] [CrossRef]
  9. Parent, L.E.; Dafir, M. A Theoretical Concept of Compositional Nutrient Diagnosis. J. Am. Soc. Hortic. Sci. 1992, 117, 239–242. [Google Scholar] [CrossRef]
  10. Egozcue, J.J.; Pawlowsky-Glahn, V.; Mateu-Figueras, G.; Barceló-Vidal, C. Isometric Logratio Transformations for Compositional Data Analysis. Math. Geol. 2003, 35, 279–300. [Google Scholar] [CrossRef]
  11. Parent, L.-É. Diagnosis of the Nutrient Compositional Space of Fruit Crops. Rev. Bras. Frutic. 2011, 33, 321–334. [Google Scholar] [CrossRef]
  12. Dias, J.R.M.; Wadt, P.G.S.; Partelli, F.L.; Espindula, M.C.; Perez, D.V.; Souza, F.R.; Bergamin, A.C.; Delarmelinda, E.A. Normal Nutrient Ranges and Nutritional Monitoring of ‘Pêra’ Orange Trees Based on the CND Method in Different Fruiting Stages. Pesqui. Agropecuária Bras. 2017, 52, 776–785. [Google Scholar] [CrossRef]
  13. Parent, L.E.; Natale, W.; Ziadi, N. Compositional Nutrient Diagnosis of Corn Using the Mahalanobis Distance as Nutrient Imbalance Index. Can. J. Soil Sci. 2009, 89, 383–390. [Google Scholar] [CrossRef]
  14. Parent, S.-É.; Parent, L.E.; Rozane, D.E.; Natale, W. Plant Ionome Diagnosis Using Sound Balances: Case Study with Mango (Mangifera Indica). Front. Plant Sci. 2013, 4, 449. [Google Scholar] [CrossRef] [PubMed]
  15. Aguinis, H.; Gottfredson, R.K.; Joo, H. Best-Practice Recommendations for Defining, Identifying, and Handling Outliers. Organ. Res. Methods 2013, 16, 270–301. [Google Scholar] [CrossRef]
  16. Urano, E.O.M.; Kurihara, C.H.; Maeda, S.; Vitorino, A.C.T.; Gonçalves, M.C.; Marchetti, M.E. Determinação de Teores Ótimos de Nutrientes em Soja Pelos Métodos Chance Matemática, Sistema Integrado de Diagnose e Recomendação e Diagnose da Composição Nutricional. Rev. Bras. Ciência Solo 2007, 31, 63–72. [Google Scholar] [CrossRef]
  17. Souza, E.D.F.C.D. Fontes e Manejo de Nitrogênio Na Cultura da Batata Em Solos Arenosos. Ph.D. Thesis, Universidade Estadual Paulista “Júlio de Mesquita Filho”, São Paulo, Brazil, 2014. [Google Scholar]
  18. Modesto, V.C.; Parent, S.-É.; Natale, W.; Parent, L.E. Foliar Nutrient Balance Standards for Maize (Zea Mays L.) at High-Yield Level. Am. J. Plant Sci. 2014, 5, 497–507. [Google Scholar] [CrossRef]
  19. Parent, L.E.; Cambouris, A.N.; Muhawenimana, A. Multivariate Diagnosis of Nutrient Imbalance in Potato Crops. Soil Sci. Soc. Am. J. 1994, 58, 1432–1438. [Google Scholar] [CrossRef]
  20. Ganeshamurthy, A.N.; Govindakrishnan, P.; Raghupathi, H.B.; Mahendra Kumar, M.B. Compositional Nutrient Diagnosis (CND) Norms and Indices for Potato (Solanum Tuberosum L.). J. Hortic. Sci. 2019, 14, 142–148. [Google Scholar]
  21. Khiari, L.; Parent, L.-É.; Tremblay, N. The Phosphorus Compositional Nutrient Diagnosis Range for Potato. Agron. J. 2001, 93, 815–819. [Google Scholar] [CrossRef]
  22. Bendaly, L.M.; Khiari, L.; Gallichand, J.; Kebede, F.; Kadri, N.; Ben Ammar, N.; Ben Hmida, F.; Ben Mimoun, M. Nutrient Diagnosis Norms for Date Palm (Phoenix dactylifera L.) in Tunisian Oases. Agronomy 2020, 10, 886. [Google Scholar] [CrossRef]
  23. Rozane, D.E.; Parent, L.E.; Natale, W. Evolution of the Predictive Criteria for the Tropical Fruit Tree Nutritional Status. Científica 2015, 44, 102. [Google Scholar] [CrossRef]
  24. Nakagawa, S.; Yang, Y.; Macartney, E.L.; Spake, R.; Lagisz, M. Quantitative Evidence Synthesis: A Practical Guide on Meta-Analysis, Meta-Regression, and Publication Bias Tests for Environmental Sciences. Environ. Evid. 2023, 12, 8. [Google Scholar] [CrossRef]
  25. Fernandes, F.M. Produtividade de Tubérculos e Acúmulo, Partição e Exportação de Nutrientes Em Duas Cultivares de Batata Sob Influência Do Manejo Do Nitrogênio. Ph.D. Thesis, Universidade Estadual Paulista “Júlio de Mesquita Filho”, São Paulo, Brazil, 2021. [Google Scholar]
  26. Fernandes, A.M. Adubação Fosfatada Em Cultivares de Batata (Solanum Tuberosum L.). Ph.D. Thesis, Universidade Estadual Paulista “Júlio de Mesquita Filho”, São Paulo, Brazil, 2013. [Google Scholar]
  27. Job, A.L.G. Doses e Parcelamento Da Adubação Potássica Na Cultura Da Batata (Solanum Tuberosum L.). Ph.D. Thesis, Universidade Estadual Paulista “Júlio de Mesquita Filho”, São Paulo, Brazil, 2014. [Google Scholar]
  28. Nascimento, C.S. Yield and Quality of Potato Tubers in Response to Nitrogen Management. Ph.D. Thesis, Universidade Estadual Paulista “Júlio de Mesquita Filho”, São Paulo, Brazil, 2022. [Google Scholar]
  29. Ribeiro, G.H.M.R.; Samartini, C.Q.; Silva, L.F.L.; Vieira, S.D.; Resende, L.V. Cultivares. In Batata do Plantio à Colheita; Nich, C., Borém, A., Eds.; Universidade de Viçosa: Viçosa, Brazil, 2017; pp. 77–93. ISBN 9788572695756. [Google Scholar]
  30. Pereira, A.M.; da Silva Guimarães, M.E.; da Silva Galdino, A.G.; de Paula Gomes, M.; Cruz, R.R.P.; Ribeiro, F.C.S.; Ribeiro, W.S.; Finger, F.L. Potato Cultivars Evaluation for Processing Industry. Res. Soc. Dev. 2020, 9, e7291210803. [Google Scholar] [CrossRef]
  31. Gonçalves, D.N.; Pereira, A.M.; Soares, L.G.; da Silva Guimarães, M.E.; Petrucci, K.P.D.O.S.; de Oliveira, L.S.; de Mendonça Neto, A.B.; Finger, F.L. Post-Harvest Quality of Cv. Markies Potatoes Submited to Mechancial Damages. J. Agric. Sci. 2020, 12, 124. [Google Scholar] [CrossRef]
  32. Köppen, W.; Geiger, R. Klima Der Erde (Climate of the Earth). Wall Map 1:16 Mill. Available online: http://koeppen-geiger.vu-wien.ac.at/koeppen.htm (accessed on 10 March 2024).
  33. EMBRAPA. A Cultura da Batata. Available online: https://www.embrapa.br/hortalicas/batata/clima (accessed on 10 November 2023).
  34. Lorenzini, J.O.; Monteiro, P.A.; Miranda Filho, H.S.; Raij, B. Raízes e Tubérculos. In Recomendação de Adubação e Calagem para o Estado de São Paulo; Van Raij, B., Cantarela, H., Quaggio, J.A., Furlani, A.M.C., Eds.; Instituto Agronômico de Campinas: Campinas, Brazil, 1997; pp. 221–229. ISBN 0100-3100. [Google Scholar]
  35. Malavolta, E.; Vitti, G.C.; Oliveira, S.A.D. Avaliação Do Estado Nutricional Das Plantas: Princípios e Aplicações, 1st ed.; Associação Brasileira para Pesquisa da Potassa e do Fosfato, Ed.; Potafos: Piracicaba, Brazil, 1997. [Google Scholar]
  36. Rousseeuw, P.J.; Croux, C. Explicit Scale Estimators with High Breakdown Point. In L1-Statistical Anal. Relat. Methods; North Holland: Amsterdam, The Netherlands, 1992; Volume 1, pp. 77–92. [Google Scholar]
  37. Hair, J.J.F.; Black, W.C.; Sant’Anna, A.S. Análise Multivariada de Dados, 6th ed.; Grupo A—Bookman: Porto Alegre, Brazil, 2005; ISBN 9788577805341. [Google Scholar]
  38. Aitchison, J. The Statistical Analysis of Compositional Data, 1st ed.; Chapman and Hall: London, UK, 1986. [Google Scholar]
  39. Marchand, S.; Parent, S.-É.; Deland, J.-P.; Parent, L.-É. Nutrient Signature of Quebec (Canada) Cranberry (Vaccinium macrocarpon Ait.). Rev. Bras. Frutic. 2013, 35, 292–304. [Google Scholar] [CrossRef]
  40. Alemayehu, M.; Jemberie, M.; Yeshiwas, T.; Aklile, M. Integrated Application of Compound NPS Fertilizer and Farmyard Manure for Economical Production of Irrigated Potato (Solanum Tuberosum L.) in Highlands of Ethiopia. Cogent Food Agric. 2020, 6, 1724385. [Google Scholar] [CrossRef]
  41. Silva, J.G.; França, M.G.C.; Gomide, F.T.F.; Magalhaes, J.R. Different Nitrogen Sources Affect Biomass Partitioning and Quality of Potato Production in a Hydroponic System. Am. J. Potato Res. 2013, 90, 179–185. [Google Scholar] [CrossRef]
  42. Hawkesford, M.; Horst, W.; Kichey, T.; Lambers, H.; Schjoerring, J.; Møller, I.S.; White, P. Functions of Macronutrients. In Mineral Nutrition of Higher Plants; Marschner, H., Ed.; Academic Press: Amsterdam, The Netherlands, 2012; pp. 135–189. ISBN 978-0-12-384905-2. [Google Scholar]
  43. Kyveryga, P.M.; Caragea, P.C.; Kaiser, M.S.; Blackmer, T.M. Predicting Risk from Reducing Nitrogen Fertilization Using Hierarchical Models and On-farm Data. Agron. J. 2013, 105, 85–94. [Google Scholar] [CrossRef]
  44. De Lima Neto, A.J.; Natale, W.; de Deus, J.A.L.; Rozane, D.E. Establishment of Critical Nutrient Levels in the Soil and Leaf of ‘Prata’ Banana Using the Boundary Line. Sci. Hortic. 2024, 328, 112923. [Google Scholar] [CrossRef]
  45. Koch, M.; Busse, M.; Naumann, M.; Jákli, B.; Smit, I.; Cakmak, I.; Hermans, C.; Pawelzik, E. Differential Effects of Varied Potassium and Magnesium Nutrition on Production and Partitioning of Photoassimilates in Potato Plants. Physiol. Plant. 2019, 166, 921–935. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Binary distribution of data (samples) as a function of Mahalanobis distance and R2.
Figure 1. Binary distribution of data (samples) as a function of Mahalanobis distance and R2.
Agronomy 14 01500 g001
Figure 2. Correlations among potato leaf nutrient contents using Python’s Seaborn library.
Figure 2. Correlations among potato leaf nutrient contents using Python’s Seaborn library.
Agronomy 14 01500 g002
Figure 3. Distribution of 587 potato samples (nutritional status and yield) in true negative (TN), false positive (FP), false negative (FN), and true positive (TP) quadrants as a function of Cate–Nelson partition.
Figure 3. Distribution of 587 potato samples (nutritional status and yield) in true negative (TN), false positive (FP), false negative (FN), and true positive (TP) quadrants as a function of Cate–Nelson partition.
Agronomy 14 01500 g003
Figure 4. Moving averages and confidence intervals of the orthonormal balances obtained with samples from the high-yield and nutritionally balanced (TN) and low-yield and nutritionally unbalanced (TP) classes of samples.
Figure 4. Moving averages and confidence intervals of the orthonormal balances obtained with samples from the high-yield and nutritionally balanced (TN) and low-yield and nutritionally unbalanced (TP) classes of samples.
Agronomy 14 01500 g004
Table 1. Characterization of the experiments that make up the database.
Table 1. Characterization of the experiments that make up the database.
Id. 1MunicipalityCultivarNutrient
Studied
Growing Season
1São ManuelAgataN9 May to 20 August 2011
2São ManuelAgataN9 May to 20 August 2011
3AvaréAgataN9 May to 20 August 2011
4São ManuelAgataN9 May to 20 August 2011
5São ManuelAgataN9 May to 20 August 2011
6AvaréAgataN9 May to 20 August 2011
7BotucatuAgataN13 May to 20 August 2011
8BotucatuAgataN6 August to 6 November 2016
9BotucatuAgataN27 July to 4 November 2017
10BotucatuElectraN13 May to 20 August 2016
11BotucatuElectraN6 August to 6 November 2016
12BotucatuElectraN27 July to 4 November 2017
13AvaréAgata, Asterix, Atlantic, Markies, and MondialP28 April to 18 August 2011
14ItaíAgata, Asterix, Atlantic, Markies, and MondialP20 April to 23 July 2011
15CerqueiraAgata, Asterix, Atlantic, Markies, and MondialP20 May to 25 August 2011
16TaquaritubaAgata and MondialP17 June to 24 September 2012
17TaquaritubaAgataK15 June to 22 September 2012
18BernardinoAgataK18 April to 20 July 2013
19BotucatuAgataK12 July to 13 November 2013
20JaboticabalAsterixN16 April to 11 July 2019
21JaboticabalAgataN16 April to 4 July 2019
1 Id: Identification of experiments: 1 to 6 [16]; 7 to 12 [25]; 13 to 16 [26]; 17 to 19 [27]; 20 to 21 [28].
Table 2. Sequential binary partition (SBP) of n−1 (11) orthonormal balances.
Table 2. Sequential binary partition (SBP) of n−1 (11) orthonormal balances.
ilrBalancesNKPSCaMgBCuZnMnFeFvrs
1[N|K]−11000000000011
2[N, K|Mg]−1−1000100000012
3[Ca|Zn]0000−1000100011
4[Fe|Mn]0000000001−1011
5[Mg, N, K|Ca, Zn]−1−1001−100100023
6[S|B]000−10010000011
7[P|Fe, Mn]00−100000011021
8[Cu|S, B]0001001−1000021
9[Mg, N, K, Ca, Zn|Cu, S, B]−1−101−1−111−100035
10[P, Fe, Mn|Mg, N, K, Ca, Zn,
Cu, S, B]
11−1111111−1−1083
11[Fv|P, Fe, Mn, Mg, N, K, Ca,
Zn, Cu, S, B]
11111111111−1111
Fv: filling value; r and s: number of positives (numerator) and negatives (denominator), respectively, of the formula for calculating multi-nutrient relationships ( i l r j ).
Table 3. Partitioning the database samples into true negative (TN), false positive (FP), false negative (FN), and true positive (TP) quadrants.
Table 3. Partitioning the database samples into true negative (TN), false positive (FP), false negative (FN), and true positive (TP) quadrants.
QuadrantsNumber of Experimental SamplesTotal of Samples%
NPK
TN2090295.0
TP1483187454092.0
FP20020.3
FN6100162.7
Total17633774587100.0
TN = reference population, nutritionally balanced with high yield; TP = nutritionally unbalanced population with low yield; FP = nutritionally unbalanced population with high yield; and FN = nutritionally balanced population with low yield.
Table 4. Results of the t-test comparing the means of the samples classified as true negative (TN) and the TP, FN, and FP classes of orthonormal balances.
Table 4. Results of the t-test comparing the means of the samples classified as true negative (TN) and the TP, FN, and FP classes of orthonormal balances.
ilrBalancesTNTPFNFP
1[N|K]−0.100.03 **−0.01 ns−0.27 ns
2[N, K|Mg]−1.91−1.80 **−1.88 ns−1.86 ns
3[Ca|Zn]−3.54−3.66 ns−3.30 ns−3.77 ns
4[Fe|Mn]−0.71−0.66 ns−0.76 ns−0.68 ns
5[Mg, N, K|Ca, Zn]−3.78−3.46 **−3.44 *−3.95 ns
6[S|B]−3.29−3.28 ns−3.38 *−3.03 ns
7[P|Fe, Mn]−1.99−2.25 *−2.33 ns−1.64 ns
8[Cu|S, B]0.561.46 **0.96 ns0.63 ns
9[Mg, N, K, Ca, Zn|Cu, S, B]−4.49−5.30 **−4.96 *−4.75 ns
10[P, Fe, Mn|Mg, N, K, Ca, Zn, Cu, S, B]1.621.37 **1.84 ns0.89 *
11[Fv|P, Fe, Mn, Mg, N, K, Ca, Zn, Cu, S, B]−6.28−6.27 ns−6.22 *−6.33 ns
ns = not significant; * and ** = significant to 5 and 1% of probability according to F test, respectively; TN = reference population, nutritionally balanced with high yield; TP = nutritionally unbalanced population with low yield; FP = nutritionally unbalanced population with high yield; and FN = nutritionally balanced population with low yield.
Table 5. CND-ilr norms (multi-nutrient and average matrix) calculated from the CND-ilr coordinates of the reference population (TN).
Table 5. CND-ilr norms (multi-nutrient and average matrix) calculated from the CND-ilr coordinates of the reference population (TN).
ilr1ilr2ilr3ilr4ilr5ilr6ilr7ilr8ilr9ilr10ilr11
ilr10.0307
ilr20.00970.0209
ilr30.05330.03710.1758
ilr4−0.0121−0.0026−0.02800.0391
ilr50.05450.03620.1537−0.01260.1659
ilr6−0.0120−0.0085−0.0425−0.0010−0.03730.0332
ilr7−0.0782−0.0524−0.20000.0363−0.18440.04210.3075
ilr80.10780.06290.2691−0.04790.2572−0.0707−0.38350.6397
ilr9−0.1058−0.0714−0.25900.0425−0.25680.04840.3757−0.55790.5553
ilr100.06510.04110.1493−0.01220.1562−0.0442−0.20970.2538−0.25490.2095
ilr110.00840.00410.0249−0.00620.0252−0.0032−0.02580.0318−0.03450.02210.0059
Average *−0.0974−1.9180−3.5445−0.7164−3.7686−3.2896−2.03320.6681−4.57011.5875−6.2811
* Average reference population (TN) of the CND-ilr.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Passos, D.d.R.C.; Cecílio Filho, A.B.; Soratto, R.P.; Rozane, D.E.; Yamane, D.R.; Fernandes, A.M.; Souza, E.d.F.C.d.; Fernandes, F.M.; Job, A.L.G.; Nascimento, C.S. Nutritional Diagnosis of Potato Crops Using the Multivariate Method. Agronomy 2024, 14, 1500. https://doi.org/10.3390/agronomy14071500

AMA Style

Passos DdRC, Cecílio Filho AB, Soratto RP, Rozane DE, Yamane DR, Fernandes AM, Souza EdFCd, Fernandes FM, Job ALG, Nascimento CS. Nutritional Diagnosis of Potato Crops Using the Multivariate Method. Agronomy. 2024; 14(7):1500. https://doi.org/10.3390/agronomy14071500

Chicago/Turabian Style

Passos, Danilo dos Reis Cardoso, Arthur Bernardes Cecílio Filho, Rogério Peres Soratto, Danilo Eduardo Rozane, Danilo Ricardo Yamane, Adalton Mazetti Fernandes, Emerson de Freitas Cordova de Souza, Fabiana Morbi Fernandes, André Luiz Gomes Job, and Camila Seno Nascimento. 2024. "Nutritional Diagnosis of Potato Crops Using the Multivariate Method" Agronomy 14, no. 7: 1500. https://doi.org/10.3390/agronomy14071500

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop