Next Article in Journal
Evaluation of the Acaricidal Activity of Lithium Chloride against Tetranychus urticae (Acari: Tetranychidae)
Next Article in Special Issue
Calcium Nutrition in Fig Orchards Enhance Fruit Quality at Harvest and Storage
Previous Article in Journal
Potential of Debaryomyces hansenii Strains on the Inhibition of Botrytis cinerea in Blueberry Fruits (Vaccinium corymbosum L.)
Previous Article in Special Issue
Nutrient Content with Different Fertilizer Management and Influence on Yield and Fruit Quality in Apple cv. Gala
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Site-Specific Nutrient Diagnosis of Orange Groves

by
Danilo Ricardo Yamane
1,*,
Serge-Étienne Parent
2,
William Natale
3,
Arthur Bernardes Cecílio Filho
1,
Danilo Eduardo Rozane
4,
Rodrigo Hiyoshi Dalmazzo Nowaki
1,
Dirceu de Mattos Junior
5 and
Léon Etienne Parent
2,6
1
Department of Plant Production, São Paulo State University (UNESP), Jaboticabal 14884-900, SP, Brazil
2
Department of Soils and Agri-Food Engineering, Université Laval, Québec, QC G1V 0A6, Canada
3
Department of Plant Science, Federal University of Ceará, Fortaleza 60020-181, CE, Brazil
4
Department of Agronomy, São Paulo State University (UNESP), Registro 11900-000, SP, Brazil
5
Instituto Agronômico de Campinas (IAC)/Centro de Citricultura Sylvio Moreira, Cordeirópolis 13490-000, SP, Brazil
6
Department of Soils, Federal University of Santa Maria, Santa Maria 97105-900, RS, Brazil
*
Author to whom correspondence should be addressed.
Horticulturae 2022, 8(12), 1126; https://doi.org/10.3390/horticulturae8121126
Submission received: 27 July 2022 / Revised: 7 November 2022 / Accepted: 15 November 2022 / Published: 30 November 2022

Abstract

:
Nutrient diagnosis of orange (Citrus sinensis) groves in Brazil relies on regional information from a limited number of studies transferred to other environments under the ceteris paribus assumption. Interpretation methods are based on crude nutrient compositions that are intrinsically biased by genetics X environment interactions. Our objective was to develop accurate and unbiased nutrient diagnosis of orange groves combining machine learning (ML) and compositional methods. Fruit yield and foliar nutrients were quantified in 551 rainfed 7–15-year-old orange groves of ‘Hamlin’, ‘Valência’, and ‘Pêra’ in the state of São Paulo, Brazil. The data set was further documented using soil classification, soil tests, and meteorological indices. Tissue compositions were log-ratio transformed to account for nutrient interactions. Ionomes differed among scions. Regression ML models showed evidence of overfitting. Binary ML classification models showed acceptable values of areas under the curve (>0.7). Regional standards delineating the multivariate elliptical hyperspace depended on the yield cutoff. A shapeless blob hyperspace was delineated using the k-nearest successful neighbors that showed comparable features and reported realistic yield goals. Regionally derived and site-specific reference compositions may lead to differential interpretation. Large-size and diversified data sets must be collected to inform ML models along the learning curve, tackle model overfitting, and evaluate the merit of blob-scale diagnosis.

1. Introduction

Brazil is the world leader in orange production with a total area of 682.167 ha, of which 63% is located in the state of São Paulo [1]. The successful combinations of scions and rootstocks, balanced fertilization, and pest management form the first rampart against productivity loss in Brazilian orange orchards [2,3,4,5]. Nevertheless, the productivity of Brazilian orange orchards remains below crop potential due in part to imbalanced fertilization [6,7,8]. Balanced crop nutrition may in turn reduce pest problems [9].
Compared to surface soil testing, tissue testing is more suitable to guide fertilization because trees have access to nutrients deeper in the soil [10]. [11]. However, ionomes may differ among scions and rootstocks [3,12,13]. There is a growing interest in Brazil to update tissue nutrient references at the grove scale where genetic, managerial, and environmental growth factors vary widely [14,15].
Tissue analytical data are commonly interpreted using ‘critical’ nutrient concentration ranges that neglect nutrient interrelationships [16]. Indeed, plant nutrition is regulated by a network of genetically controlled physiological processes that impact tissue compositions [17]. Multiple levels of nutrient interactions result not only in synergistic or antagonistic effects [18], but also evolve from chemical similarities between elements that are networking and self-adjusting to each other within the tissue compositional space [19]. Tissue compositions are unique assemblages of intrinsically multivariate data in which components cannot be interpreted in isolation [20,21,22,23]. Proximate [24] and remote sensing [25] studies showed that the interpretation of spectra to diagnose nutrient stresses should consider several nutrients simultaneously. There is thus a need to diagnose plant nutrition as full compositional networks [19,26].
Nutrient interactions have been reported traditionally as dual ratios [27]. Diagnosis and Recommendation Integrated System (DRIS) dual-ratio standards [27] have been elaborated for orange orchards in the USA [28], Venezuela [29], and Brazil [12,30]. The DRIS standards elaborated so far in Brazil were based on a small number of groves [30]. Nevertheless, DRIS has conceptual flaws. Dual-ratio standards are biased by including false positive specimens (cases of luxury consumption, suboptimal concentration, etc.). The DRIS variables are neither reflective nor additive despite attempts to correct such defaults [28,31]. The DRIS also promotes universality and timelessness [32], which were proved to be wrong [33,34,35]. Distortions in the DRIS were addressed using log-ratio transformations [20]. Log-ratios such as centered log-ratios (clr) project the constrained compositional data (e.g., between 0 and 100%, 1000 g kg−1, or 106 mg kg−1) into the real space (±∞) to allow the conducting of a multivariate analysis in the Euclidian space [36]. Log-ratios are useful in discriminating the ionomes of fruit species [23] and scions [4], as well as in ranking nutrients in the order of their limitation to yield to support fertilization decisions [15].
The distribution of tissue nutrient data has been traditionally determined using a multinormal distribution represented by ellipsoids with a centroid and variance [23]. However, successful agroecosystems may be unevenly distributed in the compositional hyperspace, and some of them may even be located outside the ellipses [37]. As a result of the complexity of agroecosystems and inherent interactions, the tissue nutrient space of nutritionally balanced and high-yielding specimens may show shapeless distribution patterns such as ‘islands’ or ‘blobs’ [14,37,38]. A ‘blob’ is a collection of nutritionally balanced and high-yielding groves that show features comparable to those of the diagnosed grove but are the limiting ones. The ‘blob’ concept reflects a grower’s propension to make comparisons with successful neighbors [11].
Successful agroecosystems can be detected using machine learning regression or classification models. Machine learning (ML) is a general term representing a wide variety of models used to process data sets and make predictions [39]. The ML can incorporate numerical and categorical variables [40,41]. The ML methods require much fewer assumptions than the traditional statistical methods [42]. The prediction ability of ML models is retrained by overfitting [43]. The ML models are increasingly applied to make predictions in biological and agronomic sciences [14,15,34,44,45,46] such as soil classification and mapping [47,48], carbon sequestration [49], image analysis [50,51], disease diagnosis [52], and the prediction of crop yields [53,54,55].
We hypothesized that: (1) the yield of orange groves could be predicted accurately by ML models; and (2) compositional nutrient diagnoses would be similar at regional and blob scales. The objective of this paper was to develop site-specific diagnostic tools to elevate low fruit yields to the locally documented yield potential.

2. Material and Methods

2.1. Experimental Setup

The data set comprised 551 observational data surveyed from 2012 to 2014 in rainfed orange orchards across the Central South region of the state of São Paulo, Brazil. Sites were located between −23.5812 and −21.7764 LAT and −49.5197 and −48.0364 LONG. The altitude of the orchards varied from 488.55 to 718.87 m. The 7–15-year-old groves belonged to the young-age group (>6 years of age) in which tree nutrition and production are stabilized [12,56]. There were two rootstocks (‘Citrumelo Swingle’ and ‘Tangerina Sunki’) and three scions (‘Hamlin’, ‘Valência’, and ‘Pêra’) (Table 1). Because ‘Hamlin’ and ‘Valência’ were grafted uniquely onto ‘Citrumelo Swingle’ and ‘Pêra’ onto ‘Tangerina Sunki’, the rootstock effect could not be tested. The flowering period did not vary among the scion X rootstock combinations (Table 1). ‘Hamlin’ X ‘Citrumelo Swingle’ showed the earliest harvesting period, followed by the intermediate ‘Pêra’ X ‘Tangerina Sunki’ and the late ‘Valência’ X ‘Citrumelo Swingle’. Fruit yields were reported as kg tree−1 or tons ha−1. Tree planting density varied from 220 to 830 plants ha−1. The plot area averaged 15 ha.
The climatic regime of the region is Aw according to the Köppen–Geiger classification. The climate is tropical with a dry winter season and the rainfall concentrated in the summer season. The prevailing soils are Oxisols (Ustox) and Ultisols (Udults and Ustults) [57]. Oxisols (lateritic soils) and Ultisols (tropical Podzols) are altered soils high in Fe and Al oxi-hydroxides and contain kaolin clay. There were 205 Red Oxisols, 210 Red-Yellow Oxisols, and 136 Red-Yellow Ultisols in the data set. ‘Hamlin’ X ‘Citrumelo Swingle’ was grown on 48 Red Oxisols, 43 Red-Yellow Oxisols, and 30 Red-Yellow Ultisols. ‘Pêra’ X ‘Tangerina Sunki’ was grown on 26 Red Oxisols, 45 Red-Yellow Oxisols, and 55 Red-Yellow Ultisols. ‘Valência’ X ‘Citrumelo Swingle’ was grown on 131 Red Oxisols, 122 Red-Yellow Oxisols, and 51 Red-Yellow Ultisols. Soils may show hard setting layers at 20+ cm [58,59], but the depth to hardpan was not documented. Both soil groups are naturally acidic low-nutrient soils that require liming and fertilization [60].

2.2. Crop Management

Mineral fertilization [12] is intended to offset crop nutrient removal and loss and meet the nutrient demand for fruit development and new growth of leaves, branches, and roots [4]. In Brazil, the average nutrient offtake per tons of fresh fruit was estimated at 1.2 kg of nitrogen (N), 0.18 kg of phosphorus (P), 1.54 kg of potassium (K), 0.57 kg of calcium (Ca), 0.12 kg of magnesium (Mg), 0.09 kg of sulfur (S), 1.6 g of boron (B), 0.39 g of copper (Cu), 2.1 g of iron (Fe), 0.38 g of manganese (Mn), and 0.40 g of zinc (Zn) [61]. Based on a soil chemical analysis and the expected yield, the P and K application rates were 0–160 kg P ha−1 and 0–200 kg K ha−1, respectively [4]. The N rates varied from 70 to 240 kg N ha−1 based on the leaf N concentration and expected yield. Fertilizers were split-applied on three occasions during the spring and summer. The B, Zn, and Mn were supplied as foliar sprays in the spring and summer [4]. Some orchards received 2 kg B ha−1 year−1 as a soil-applied fertilizer. Copper was mainly supplied through fungicide applications or metallic copper. The objective of liming was to reach 70% base saturation in the 0–20 cm layer during a long period [60]. Gypsum was also indicated to tackle Al in the 20–40 cm layer where Al saturation of CEC exceeds 40% and to supply calcium. Other management practices were carried out as recommended [61].

2.3. Meteorological Data

The daily heat units (DHU) were computed as follows [62]:
D H U   ( ° C ) = T 2 M 13 ° C  
where T 2 M is the daily average daily temperature at a 2 m height and 13 °C is the minimum temperature for growth. The T 2 M was reported via satellite using Boltzman’s law to transform the surface radiation into temperature followed by spatialization (0.5° × 0.65°). Because D H U and T 2 M were perfectly correlated, we retained T 2 M as a meteorological index. Precipitations were recorded on daily basis then cumulated per month [59]. Where temperatures exceeded 30–35 °C and the rainfall and relative air humidity were low, flowers and fruits may have dropped but were not recorded. The most critical period extended from flowering (September/October) to a fruit diameter of up to 50 mm (end of December).

2.4. Tissue Analysis

Twenty-five trees were randomly selected for leaf sampling in each plot. Four mature (six-month-old) leaves were collected per tree from fruit-bearing shoots (3rd or 4th leaf from fruit) when the fruit size was 2–4 cm in diameter [4]. In total, 100 leaf samples were composited in each commercial grove. Leaves were gently washed successively with distilled water, a detergent solution (0.1%), a solution with hydrochloric acid (0.3%), and deionized water to reduce the surface contamination by dust and fungicides. The samples were oven-dried at 65 °C for 48–96 h and ground to less than 2 mm. The nitrogen (N) was analyzed using the micro-Kjeldahl method. Phosphorus (P), potassium (K), calcium (Ca), magnesium (Mg), sulfur (S), boron (B), copper (Cu), zinc (Zn), manganese (Mn), and iron (Fe) were quantified using ICP after acid digestion [63].

2.5. Soil Analysis

Soils were sampled in the 0–20 and 20–40 cm layers in each grove. Samples were composited, air-dried, and then ground and sieved to <2 mm. Soils were analyzed for pH (0.01 M CaCl2), organic matter content, K, Ca, Mg, and (H + Al) [64]. The P was extracted using an exchange resin Amberlite IRA-400 (20–50 mesh), quantified via colorimetry using the ascorbic acid method, and reported as mg dm−3. The K, Ca, and Mg were extracted using an exchange resin Amberlite IRA-120 (20–50 mesh), quantified via flame photometry (K) or atomic absorption spectrophotometry (Ca, Mg), and reported as mmolc dm−3. The potential acidity (H+Al) was derived from the SMP pH buffer methodology [65]. The cation exchange capacity (CEC) was computed as the sum of the cationic species (K, Ca, and Mg) and the exchangeable capacity. The CEC of Brazilian Oxisols may vary from 17 to 134 cmolc kg−1 compared 7–88 cmolc kg−1 for Brazilian Ultisols, indicating a large variation in clay and organic matter contents [66].

2.6. Centered Log-Ratio Transformation

The tissue compositional simplex comprised N, P, K, Ca, Mg, S, Cu, Zn, Mn, Fe, and B concentrations expressed using the same measurement unit (g kg−1). The filling value ( F v ) used to allow the back-transformation of the log-ratio-transformed data into familiar concentration values [4,23] was computed as follows:
F v = 1,000,000 ( sum   of   quantified   nutrient   concentrations   reported   in   g   kg 1 )
The clr of each component (clrx) was computed as the ratio of nutrient concentration of any component xi to the geometric mean across the D components (G) to account for nutrient interactions as follows [20]
c l r x i = l n x G
where the geometric mean (G) was computed as follows:
G = ( [ N ] × [ P ] × [ K ] × [ Ca ] × [ Mg ] × [ S ] × [ B ] × [ Cu ] × [ Fe ] × [ Mn ] × [ Zn ] × [ F v ] ) 1 D
As a result, the clr variable integrated all pairwise ratios in the compositional simplex as follows:
c l r x i = l n ( x i N × P × × F v ) = l n ( x i N × x i P × × x i F v )
The clr variables added up to zero.

2.7. Statistical Analysis

A discriminant analysis (DA) was conducted to compare the ionomes, the soil properties in upper (0–20 cm) and lower (20–40 cm) layers, and the soil classes. Tissue nutrient balances were transformed into orthonormal isometric log-ratios (ilrs) prior to the DA analysis [14,15]. Machine learning (ML) models were run using the R ‘caret’ package [67]. A supervised ML classification model approximated the function that predicts the outcome of interest based on the relationship between target variable and features or predictors [68]. The data matrix (Xij; xi = [xi1, xi2 …, xij]) was a sample composed of j predictors. The target variable was crop yield. The features were scion X rootstock combinations, plant age, tree density (for crop yields expressed as kg tree−1), centered log-ratio-transformed tissue compositions (N, P, K, Ca, Mg, S, Fe, Mn, Cu, Zn, and B), soil classification, soil test results for the 0–20 and 20–40 cm layers (organic matter content, pH, P, K, Ca, Mg, and exchangeable acidity), and monthly meteorological indices (T2M and monthly precipitation). The clay content and cation exchange capacity (CEC) were excluded as features because they were derived from already-documented features (organic matter content and cationic species) and thus provided redundant information.
Since any ML algorithms can produce overfitting [69], a validation data set was required to evaluate the model. The data set was split into training (70%) and testing (30%) sets [67]. Data were preprocessed with ‘zv’ to identify zero-variance predictors, ‘center’ and ‘scale’ to provide a simple location and scale the transformation of each predictor, ‘spatialSign’ to project predictor values onto a unit circle, applying x* = x/ | | x | | , and the 11 nearest neighbors (kmax = 11), a distance of two, and a kernel set at optimum. Statistical analyses were conducted in the R statistical environment [70]. Log-ratio transformations were computed using the R ‘compositions’ package [71]. Machine learning modeling was conducted using the R ‘caret’ package [67].
Features were selected for their contribution to model accuracy whatever their statistical significance [72]. The most accurate ML model was random forest. The ML models were first run as regression models. The model precision was measured as the R2 value and as the root-mean-square error (RMSE). The target variables were the fruit yield expressed in kg tree−1; plant density was added as feature. To run the binary random forest classification models, the target variable was fruit yield in tons per ha−1; i.e., the product of fruit yield per tree and tree density per ha, as a familiar unit to select a yield goal by growers. Yield cutoff values for the binary classification were set at 50 or 60 tons ha−1. The latter cutoff value is the one generally chosen by Brazilian growers. The 50-ton ha−1 yield cutoff was tested to include ‘Pêra’ X ‘Tangerina Sunki’ specimens among high yielders at the regional scale. Model precision was measured as the area under the curve (AUC) and classification accuracy (CA). An AUC of 0.5 had no diagnostic interest [73]. The model was little informative in the AUC range of 0.5 to 0.7, moderately informative if 0.7 ≤ AUC < 0.9, and very informative if AUC ≥ 0.9. Where the AUC of training and testing models were acceptable, data classification was further conducted in cross-validation after merging the training and testing data sets.
Observations were partitioned by the confusion matrix of the ML classification model into four groups as follows [23]: (1) true negative (TN) specimens (high performance for nutritionally balanced groves), which formed altogether the reference population; (2) true positive (TP) specimens (low performance for nutritionally imbalanced groves); (3) false negative (FN) specimens (low performance for nutritionally balanced groves (type II error); and (4) false positive (FP) specimens (high performance for nutritionally imbalanced groves (type I error). Accuracy (Acc), the proportion of specimens correctly classified as balanced or imbalanced [23], was computed as follows:
[ A c c = [ ( T N + T P ) / ( T N + F N + T P + F P ) ] ]
The negative predictive value (NPV) was the probability that nutrient balance returned a high grove performance. The positive predictive value (PPV) was the probability that the nutrient imbalance returned a low grove performance. Specificity was the probability that a high grove performance was nutritionally balanced. Sensitivity was the probability that a low grove performance was nutritionally imbalanced. The NPV, PPV, specificity, and sensitivity were computed as follows [23]:
N P V = T N ( T N + F N )
N P V = T P ( T P + F P )
S p e c i f i c i t y = T N ( T N + F P )
S e n s i t i v i t y = T P ( T P + F N )

2.8. Delineation of the Regional and Blob Spaces

The regional multivariate space was the multinormal elliptical distribution of TN grove specimens set apart from the regional survey data set (Figure 1). Some TN specimens may have been located outside the swarm of TN specimens, thereby losing important information on successful groves. While the ML classification model returned a probability to reach the cutoff yield, there was no indication of an attainable yield goal at the grove scale. The shapeless blob multivariate space was delineated by local features [38]. The nutrient compositions of the diagnosed specimen were compared to those of TN specimens in the blob where most features were similar under the ceteris paribus assumption (Figure 1).

2.9. Regional and ‘Blob’-Scale Nutrient Standards

Tissue nutrient standards were computed at the regional scale as the clr means ( c l r i * ) and standard deviations ( S D i * ). The number of TN specimens depended on the yield cutoff used to run the classification model. While the concentration values were not additive to allow computing contrasts between the compositions, the clrs of two equal-length compositions could be contrasted. The clrs could rank nutrient indices in the order of nutrient limitations. The I c l r i indices were computed as follows [20]:
I c l r i   = c l r i c l r i * S D i *
where c l r i is the ith clr value of the diagnosed specimen. Positive and negative I c l r i   indices indicate relative nutrient excess and shortage, respectively, and can be reported in a histogram.
It is common for growers to compare abnormal to normal plants both growing in otherwise similar conditions [11]. We selected k TN specimens compositionally close to the diagnosed specimen to rank nutrients in the order of their limitation to yield. The k-closest successful TN neighbors formed a locally representative blob, which is a concept of agroecosystem similarity developed by [38] that has been successfully applied to agricultural crops [37,74,75]. Depending on the number of TN specimens in the blob, the clr indices ( I c l r i b l o b ) were computed using blob clr centroids as follows and reported in histograms:
I c l r i b l o b = c l r i c l r i b l o b
or, depending on the number of close neighbors:
I c l r i b l o b = c l r i c l r i b l o b S D i b l o b
To delineate the blob, we selected k TN specimens that presented features close to those of the diagnosed specimen. Because the clr variables had a Euclidian geometry, a distance ε between two equal-length compositions a and b could be computed as follows:
ε = i = 1 D ( c l r i a c l r i b ) 2
The plant, meteorological, and soil features of the TN specimens were retrieved from the data set to further support the comparison between the diagnosed specimen and those of the TN blob. Equation (12) can also be reported as a ‘perturbation’ vector ( p ) of the nutrient concentration ratios between the diagnosed and reference compositions (*) as follows [15]:
p = X x = { N N * , P P * , , F v F v * } ,
The perturbation index was zero-scaled as X x 1 to illustrate the relative deficiency, sufficiency, or excess [76]. The perturbation vector returned the same ranking as the clr difference between two equal-length compositions if the geometric means were exactly the same (i.e., G i = G i b l o b ) as shown below:
I c l r i b l o b = c l r i c l r i b l o b = l n ( x i / G i x i b l o b / G i b l o b ) l n ( x i / x i b l o b ) ,
The assumption of geometric mean similarity may thus hold where G G i b l o b .

3. Results

3.1. Results of Tissue and Soil Tests

The concentration quartiles of the surveyed groves showed some deviations from the current Brazilian concentration ranges (Table 2). An excess of Cu in foliar tissues of the surveyed groves was the result of disease management that used Cu-based fungicides. Excessive K levels were related to high K requirements and K fertilization; these can be antagonistic to tissue Ca and Mg. There was a large variation in the foliar Fe for ‘Hamlin’ X ‘Citrumelo Swingle’ and ‘Valência’ X ‘Citrumelo Swingle’, which indicated differences in the soil properties.
The soil properties are presented in Table 3. There were apparently large differences in nutrient and lime management among growers. The P and K centroids were at low to medium levels according to Brazilian guidelines [78]. Soils were generally acidic with pH values ranging from 3.8 to 7.0 in the 0–20 cm layer and from 3.8 to 6.9 in the 20–40 cm layer. The base saturation of CEC ranged from 8 to 96% in the 0–20 cm layer and from 17 to 94% in the 20–40 cm layer compared to the 70% recommended for fruit crops [60]. Organic matter contents varied from 8 to 64 g dm−3 in the 0–20 cm layer and from 8 to 67 g dm−3 in the 20–40 cm layer. Clay contents ranged from 17 to 344 g kg−1 in the 0–20 cm layer and from 15 to 360 g kg−1 in the 20–40 layer. The CEC values, which reflected the large variation in clay and organic matter contents, ranging from 24 to 296 mmolc dm−3 in the 0–20 cm layer and from 21 to 204 mmolc dm−3 in the 20–40 cm layer compared to 17 to 134 mmolc dm−3 for Oxisols and 7–88 mmolc dm−3 for Ultisols documented in Brazil [66]. The average T2M seasonally varied in the range of 15.5 to 27.4 °C. The monthly rainfall ranged from 0.3 to 446.3 mm. Highly variable features may explain the strong variations in fruit yields.
The discriminant analysis showed that the centroids of small ellipses differed significantly (p = 0.05) among the scion X rootstock combinations and soil classes (Figure 2). As a result, the scion X rootstock combinations, soil classes, and soil properties in the 0–20 cm and 20–40 cm layers could be included as predictors that contributed to the outcome.

3.2. Random Forest Regression Models

The feature contribution to the regression model was tested by removing successively meteorological indices, soil tests in the 20–40 cm layer, soil tests in the 0–20 cm layer, and plant features (scion, age, and nutrient balances). The performance of the regression model showed a close fit for the training data set (R2 = 0.860 to 0.905) but did not perform as well on the test data (R2 = 0.086 to 0.520) (Table 4). The smaller error rates in the training compared to the testing showed evidence of overfitting due to noisy information learned and memorized under the training. Because the R2 values increased and the RMSE decreased as more features were added to capture the complexity of the agroecosystems, the model’s performance improved along the learning curve. Additional key features were likely needed to support the model but were not documented in the data set. Nonetheless, we tested whether the classification random forest models were still acceptable using in-hand features.

3.3. Random Forest Classification Models

The random forest binary classification models were validated using the test data set and via cross-validation. Model performance varied with the yield cutoff and feature selection (Table 5). The AUC exceeded 0.7 whatever the selected yield cutoff when at least scion, age, and nutrient balances were included as features. The soil type, meteorological indices, and soil tests contributed to a smaller extent than plant features (scion, age, and tissue tests). At yield cutoff of 50 tons ha−1, plant features, and soil properties in the 0–20 cm layer sufficed to reach a high AUC value (0.801). A minimum data set would comprise the tissue test and soil test in the 0–20 cm layer, the scion and tree age as routinely acquired by growers, and the daily meteorological data and soil classification available from state records.
The 121 ‘Hamlin’ X ‘Citrumelo Swingle’, 126 ‘Pêra’ X ‘Tangerina Sunki’, and 304 ‘Valência’ X ‘Citrumelo Swingle’ were partitioned into four quadrants in the confusion matrix (Table 6). At a yield cutoff of 60 tons ha−1, there were 150 TN specimens but no ‘Pêra’ X ‘Tangerina Sunki’. At a yield cutoff of 50 tons ha−1, there were 261 TN specimens, of which 21 were ‘Pêra’ X ‘Tangerina Sunki’. The NPV and specificity were the highest among ‘Hamlin’ X ‘Citrumelo Swingle’ groves, which indicated that high fruit yields were closely associated with adequate nutrient balances. ‘Pêra’ X ‘Tangerina Sunki’ groves showed the highest sensitivity. Quartile nutrient concentration ranges for TN specimens were computed at the scion X rootstock level and a yield cutoff value of 50 tons ha−1 (Table 7). Compared to the other scion X rootstock combinations, the B concentration range was lower in the foliar tissues of ‘Pêra’ X ‘Tangerina Sunki’.

3.4. Nutrient Standards at Regional Scale

Regional TN clr standards were computed for each scion X rootstock combination (Table 8). The centroids and variances differed among scions, which indicated that the three scions should be diagnosed separately; this confirmed the results of the discriminant analysis (Figure 2) and of the random forest binary classification model (Table 5). The clr values of tissue N, P, K, and Mg of the early maturing ‘Pêra’ X ‘Tangerina Sunki’ tended to be high compared to the tissue components of other scions at a yield cutoff of 50 tons ha−1. In contrast, the Ca, B, Cu, and Fe levels appeared to be low.

3.5. Order of Nutrient Limitations at the Regional and Blob Scales: Example

The random forest binary classification model could predict the performance of a given grove from features documented in the data set. The model classified the yield as a probability of high or low under the ceteris paribus assumption. At the blob scale, comparable genetic, managerial, and environmental conditions could be set apart in the data set to delineate the referential blobs of the comparable TN specimens [38]. Thereafter, nutrients could be ranked against the clr nutrient benchmarks in the selected referential blob.
Let us diagnose a defective (true positive) tissue specimen of ‘Valência’ X ‘Citrumelo Swingle’ (Table 9). The fruit yield of the diagnosed specimen was 20 tons ha−1. Compared to the current Brazilian nutrient ranges, there was an apparent relative excess of Cu and a sufficiency of others. The excess Cu was indicative of difficulties in disease control. This would mean little or no change in the fertilization regime despite the low yield for a specimen classified as true positive (low-yielding, nutritionally imbalanced specimen).
The relative order of the nutrient limitations to the yield was computed at the blob scale using the k-closest TN compositional neighbor (k = 1) or the clr means and standard deviations of the six closest TN neighbors (k = 6). The six closest neighbors were selected by ranking the Euclidean distances of the tissue compositions between the diagnosed specimen and the TN specimens. The soil type of the defective specimen was Red-Yellow Ultisol. The meteorological indices were similar between the diagnosed and referential TN specimens. The soil pH, organic matter content, and extractable elements (P, K, Ca, and Mg) in the upper layer (0–20 cm) were also comparable except for one Red-Yellow Ultisol that showed a low pH (pH 4.3) compared to those of the diagnosed specimen (pH 5.8). The yield was 53 tons ha−1 in the k = 1 blob and averaged 67 tons ha−1 in the k = 6 blob, which indicated a large yield gap between the diagnosed grove and the successful neighboring groves.
The regional diagnosis returned a relatively large N excess; some P, Mg, and B excess; and a large Mn shortage (Figure 3). The blob diagnosis (k = 6) returned a large Ca excess and Mn shortage; some B and Cu excess; and some P, K, and S shortage. The blob diagnosis (k = 1) returned relative excess of N, Ca, and Cu and a relative shortage of K and Mn. This differential diagnosis posed a new challenge that has not been reported before.

4. Discussion

4.1. Contribution of Documented Features to ML Models

Orange yields in the surveyed region were impacted primarily by plant genetics, age, and nutrient balances (Table 4 and Table 5). Soil properties in the 0–20 cm layer were also contributive. Indeed, the root system of the young orange trees was much more abundant in the 0–30 cm layer than below [59,79]. While features not yet documented such as the fertilization regime and disease control could likely have contributed to the yield, they were assumed to be applied as recommended. The fertilization regime and related features (placement, timing, source, and rate) could vary among growers. Where available, a grower’s fertilization data could provide information on corrective measures to reach the yield potential at the minimum cost under comparable conditions. Scions could be grafted on different rootstocks depending on pedoclimatic conditions. Citrus rootstocks are selected for resistance to diseases and tolerance to drought, especially in hard-setting soils in which a cohesive soil mass located 20–70 cm below the surface limits water movement and root penetration [58,59]. The blob paradigm follows a grower’s propension for comparisons with successful neighbors and Alexander von Humboldt’s concept of natural systems that combines local features and knowledge [11,14]. The blob concept of local-scale diagnosis should be further tested via field experimentation and compared to current diagnostic methods.
The experimental data could validate the model prediction, measure the sensitivity of nutrient balances to nutrient additions, and provide response curves to support corrective measures [37,74,75]. Experimental data exist in various formats in Brazil [2,4,80,81,82,83,84], but data assemblage from different sources was beyond the scope of this study. Another issue was the carryover effects of carbohydrate and nutrient accumulations in the plant tissues of perennial plants that impact the following fertilization regime [85,86,87,88]. Several years of experimentation are needed to monitor the nutrition and fertilizer requirements of perennial crops given the information on carryover effects such as prior plant nutrient status, tree pruning, trunk diameter, and crop yield.

4.2. Nutrient Diagnosis

Differences between the nutrient concentration ranges assessed in the present study and others reported in Brazil [4] were attributable to: (1) TN specimens only in our reference group (excluding FP specimens), (2) tissue compositions analyzed as assemblages of nutrients rather than processed separately [34], (3) concentrations ranges tending to narrow down as more information on nutrient compositions and interactions was integrated [89], and (4) varietal effects. We compared the nutrient diagnoses using regional standards and blob standards for the k-nearest neighbors. The blob provided realistic attainable yields under comparable growing conditions.
New questions arose on the relevance of current methods to diagnose the nutrient status of orange groves. Using blob or regional standards, the results of nutrient diagnosis may differ due to differences in centroids and supporting assumptions such as ceteris paribus (all factors but the ones being examined were assumed to be at equal or optimum levels), thus impacting the decision to adjust the fertilization regime to local conditions. Nutrient standards may also vary with the yield cutoff selected.

4.3. Boron Limitations

The B concentration ranges for ‘Pêra’ X ‘Tangerina Sunki’ differed compared to the other scions. This may have been due to genetics, managerial conditions, or environmental growing conditions. Boron deficiency is common in the tropics [89,90,91] because soil B availability is generally low [92]. Soil B is mobile in its predominantly nonionic form (boric acid) and may be leached through excessive rainfall [90]. Boron plays a central structural role in the formation of the xylem, root growth, and water transportation [93]. Boron concentrations in reproductive plant parts that contain higher levels of pectin, phenols, or sugar alcohol most often exceed the B concentrations detected in the vegetative parts. Boron is a hardener for plant cells and hence a protective mechanism against pests. Boric acid binds to sugars in cell walls and cross-links two chains of pectic polysaccharide through borate–diester bonding at the rhamnogalacturonan II region, which forms a network of polysaccharides [94]. In rainfed orange orchards, long drought periods affect root B transfer due to reduced mass flow [95]. Considering the low B mobility in the phloem of citrus [93], foliar fertilization is inefficient to fully meet the demand of new flushes of shoot growth and should be complemented by soil B application [61,90,96,97]. The B is commonly applied as a foliar spray [61,97].

4.4. Nutrient Excess and Shortage

The clr standards that incorporate all nutrients interacting in the plant tissue could also indicate overfertilization. There is a trend of overfertilizing orchards with N, P, and K because the macronutrient offtake through fruit harvest is substantial [61]. As a result, the fruit yield and quality may be affected by overfertilization driven by nutrient budgets rather than field experimentation and success stories documented in data sets [82,83,84]. N overfertilization may result in poor-quality fruits and a higher susceptibility to disease and insect feeding [9,82]. As is the case for other perennials [85,86,87,88], N requirements may vary widely in orange orchards [4] due in part to the carryover effects of carbohydrate and nutrient accumulations over time. Carbohydrate and nutrient reserves accumulated in off-years or previous years can be remobilized at a high rate in on-years [98]. The N stored in the tree biomass can be redistributed to developing leaves and fruits via the phloem [61]. Therefore, fruit production might not be impacted by a low nutrient supply during the current season. Nonetheless, if nutrient reserves are not replenished regularly, trees may undergo a gradual reduction in canopy density, which results in decreased fruit production in subsequent seasons [4]. This aspect could be addressed in field experiments and monitored via crop logging.
Due to strong sorption of phosphate ions by aluminum and iron oxyhydroxides in tropical soils such as Oxisols and Ultisols [99], nutrient-use efficiency of the fertilizer P is thought to be low [100,101,102] At low soil-test P values, citrus response to P fertilization was found to stimulate root–shoot growth [80] and increase the fruit yield [2,82]. The fruit yield can respond to P fertilization when the soil-test P is higher than 20 mg resin-P dm−3 [2,82]. However, a yield loss can occur when the soil-test P exceeds 40 mg resin-P dm−3 [81]. There is a high risk of P overfertilization [89]. The co-precipitation of P- and Zn-conducting vessels is exacerbated by an excess of P, which inhibits Zn transport from root to shoot and causes metabolic disorders [103]. While P fertilization has been guided traditionally by tissue diagnosis, soil analysis, and expected yield [4], a blob-scale diagnosis could provide the realistic yield goals documented in the data set. The blob provides successful fertilization regimes under comparable conditions and can be used to derive response curves to added nutrients under comparable experimental conditions.
The K offtake due to a fruit harvest is substantial [61] and requires heavy K fertilization [82,83,84]. However, this may lead to a K excess in the soil and a relative K excess in foliar tissues, especially if the soil test is not taken into account or when K surpluses are large [61]. An excessive K dosage may lead to a yield loss due to K-Ca antagonism [83,104]. Ca accumulates even more than N in the tree biomass and correlates positively with fruit yield [105,106]. A soil pH correction through liming may not suffice to meet plant Ca demand. It has been recommended to complement Ca nutrition with gypsum [102] or calcium nitrate when plant Ca demand peaks [107]. Parsimonious K supply can contribute to re-establishing tissue K-Ca balance. To account for Ca, K, and Mg interactions in the soil, the relationship between the crop yield and soil test as exchangeable K, Ca, Mg, and acidity could be revisited using the nutrient-balance concept [108].
Relative Cu excess is common in the foliage of orange orchards due to Cu applications to tackle foliar diseases. Any Cu toxicity due to Cu accumulation may increase the oxidative stress and reduce plant growth (especially root growth), nutrient, water absorption, and photosynthesis rates [109,110]. While high soil-test Cu values are also commonly reported in orchards, other nutrients should be properly balanced to increase the resistance of orange trees to common diseases [6]. A concomitant S shortage may be problematic. S plays a central role in regulating cross-talk with cationic microelements [19]. Mn and Zn may be in short supply in Oxisols and Ultisols. Mn and Zn are sorbed strongly by soil colloids and are commonly at low levels in the parent material of tropical soils [3,61,111]. Mn and Zn are often supplied through foliar sprays [3,105,106]. Soil applications of those micronutrients are complementary to foliar sprays in sandy soils that are low in organic matter [111].

5. Conclusions

The random forest regression model that related fruit yields to meteorological, soil, and plant features showed evidence of overfitting. A random forest binary classification model was found to be acceptable based on the area under the curve. The scion X rootstock combination and grove age contributed more to the fruit yield than the meteorological and soil variables. The scion X rootstock combination, plant age, tissue nutrient balances, and soil properties in the 0–20 cm layer formed the minimum data set to reach an acceptable model AUC. The nutrient concentration ranges of the surveyed TN specimens at a regional scale depended on the yield cutoff.
There was a need to adjust the search for nutrient standards to yield a cutoff specific to the scion X rootstock combination and to focus on the local conditions delineated in the blob of the k neighboring successful agroecosystems. Using a defective ‘Valência’ X ‘Citrumelo Swingle’ TP specimen as an example, the regional and blob diagnoses differed. The predictions of the blob model, which were based Alexander von Humboldt’s concept of natural systems that combines local features and knowledge and on grower’s propension for comparisons with successful neighbors, should be further evaluated via field experimentation.
Citrus growers could adopt a concept of yield-limiting clr nutrient balances specific to their agroecosystems in which groups of nutrients are optimally balanced in referential blobs. Most nutrient standards currently used to adjust crop fertilization are derived from regional data sets with few features. More local key features such as those related to the soil profile, carryover effects, and pest and fertilizer management should be collected to build a diversified data set that enables an increase in the model’s accuracy and improve the yield prediction. The blob paradigm based on Alexander von Humboldt’s concept of natural systems that combines local features and knowledge and is deciphered using artificial intelligence tools [112] should be further compared to the current diagnostic methods via field experimentation.

Author Contributions

D.R.Y. collected and organized the data, elaborated the models, and cowrote the paper; S.-É.P. conceived, wrote, and calibrated models; W.N., A.B.C.F., D.E.R., R.H.D.N., and D.d.M.J. collected data and revised the paper; L.E.P. cowrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the financial support of the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) and the Natural Sciences and Engineering research Council of Canada #2254.

Acknowledgments

This project was elaborated under an interinstitutional agreement between Universidade Estadual Paulista (UNESP), Jaboticabal-SP, Brazil; and the Université Laval, Québec (QC), Canada.

Conflicts of Interest

The authors declare that this research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. AGRIANUAL. Anuário Da Agricultura Brasileira; Agribusiness Intelligence/Informa IEG/FNP: São Paulo, Brazil, 2022. [Google Scholar]
  2. Neves, C.S.V.J.; Murata, I.M.; Stenzel, N.M.C.; Medina, C.d.C.; Borges, A.V.; Okumoto, S.H.; Lee, R.H.C.; Kanai, H.T. Root Distribution of Rootstocks for “Tahiti” Lime. Sci. Agric. 2004, 61, 94–99. [Google Scholar] [CrossRef] [Green Version]
  3. Quaggio, J.A.; Cantarella, H.; Van Raij, B. Phosphorus and Potassium Soil Test and Nitrogen Leaf Analysis as a Base for Citrus Fertilization. Nutr. Cycl. Agroecosyst. 1998, 52, 67–74. [Google Scholar] [CrossRef]
  4. Quaggio, J.A.; Mattos, D., Jr.; Boaretto, R.M. Citros. Boas Práticas para Uso Eficiente de Fertilizantes; Prochnow, L.I., Casarin, V., Stipp, S.R., Eds.; International Plant Names Index: Piracicaba, Brazil, 2010; pp. 371–409. [Google Scholar]
  5. Rozane, D.E.; Mattos, D.; Parent, S.-É.; Natale, W.; Parent, L.E. Meta-Analysis in the Selection of Groups in Varieties of Citrus. Commun. Soil Sci. Plant Anal. 2015, 46, 1948–1959. [Google Scholar] [CrossRef]
  6. Mendonça, L.; Zambolim, L.; Badel, J. Bacterial Citrus Diseases: Major Threats and Recent Progress. J. Bacteriol. Mycol. Open Access 2017, 5, 1–12. [Google Scholar] [CrossRef] [Green Version]
  7. Behlau, F. An Overview of Citrus Canker in Brazil. Trop. Plant Pathol. 2021, 46, 1–12. [Google Scholar] [CrossRef]
  8. Machado, F.J.; Lanza, F.E.; Ferretti, M.O.; Fialho, R.O.; Behlau, F.; Silva-Junior, G.J. Citrus Black Spot Severity Related to Premature Fruit Drop in Sweet Orange Orchards. Plant Pathol. 2022, 71, 400–410. [Google Scholar] [CrossRef]
  9. Martinez, D.A.; Loening, U.E.; Graham, M.C.; Gathorne-Hardy, A. When the Medicine Feeds the Problem; Do Nitrogen Fertilisers and Pesticides Enhance the Nutritional Quality of Crops for Their Pests and Pathogens? Front. Sustain. Food Syst. 2021, 5, 701310. [Google Scholar] [CrossRef]
  10. Smith, G.; Asher, G.J.; Clark, C. Kiwifruit Nutrition Diagnosis of Nutritional Disorders, 3rd ed.; Agpress Communications Ltd.: Washignton, DC, USA, 1997; ISBN 0-9597693-0-7. [Google Scholar]
  11. Munson, R.D.; Nelson, W.L. Principles and Practices in Plant Analysis. In Soil Testing and Plant Analysis; Westerman, R.L., Ed.; Soil Science Society of America: Madison, WI, USA, 1990; pp. 359–387. [Google Scholar]
  12. Mourão Filho, F.d.A.A.; Azevedo, J.C. DRIS Norms for “Valencia” Sweet Orange on Three Rootstocks. Pesqui. Agropecuária Bras. 2003, 38, 85–93. [Google Scholar] [CrossRef]
  13. Filho, F.d.A.A.M. DRIS and Sufficient Range Approaches in Nutritional Diagnosis of “Valencia” Sweet Orange on Three Rootstocks. J. Plant Nutr. 2005, 28, 691–705. [Google Scholar] [CrossRef]
  14. Betemps, D.L.; Paula, B.V.d.; Parent, S.-É.; Galarça, S.P.; Mayer, N.A.; Marodin, G.A.B.; Rozane, D.E.; Natale, W.; Melo, G.W.B.; Parent, L.E.; et al. Humboldtian Diagnosis of Peach Tree (Prunus persica) Nutrition Using Machine-Learning and Compositional Methods. Agronomy 2020, 10, 900. [Google Scholar] [CrossRef]
  15. Lima Neto, A.J.d.; Deus, J.A.L.d.; Rodrigues Filho, V.A.; Natale, W.; Parent, L.E. Nutrient Diagnosis of Fertigated “Prata” and “Cavendish” Banana (Musa spp.) at Plot-Scale. Plants 2020, 9, 1467. [Google Scholar] [CrossRef]
  16. Bates, T.E. Factors Affecting Critical Nutrient Concentrations in Plants and Their Evaluation: A Review. Soil Sci. 1971, 112, 116–130. [Google Scholar] [CrossRef]
  17. Baxter, I. Should We Treat the Ionome as a Combination of Individual Elements, or Should We Be Deriving Novel Combined Traits? J. Exp. Bot. 2015, 66, 2127–2131. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Wilkinson, S.R.; Grunes, D.L.; Sumner, M.E. Nutrient Interactions in Soil and Plant Nutrition. In Handbook of Soil Fertility and Plant Nutrition; Sumner, M.E., Ed.; CRC Press: London, UK, 2000; p. 91. [Google Scholar]
  19. Courbet, G.; Gallardo, K.; Vigani, G.; Brunel-Muguet, S.; Trouverie, J.; Salon, C.; Ourry, A. Disentangling the Complexity and Diversity of Crosstalk between Sulfur and Other Mineral Nutrients in Cultivated Plants. J. Exp. Bot. 2019, 70, 4183–4196. [Google Scholar] [CrossRef] [PubMed]
  20. Parent, L.E.; Dafir, M. A Theoretical Concept of Compositional Nutrient Diagnosis. J. Am. Soc. Hortic. Sci. 1992, 117, 239–242. [Google Scholar] [CrossRef] [Green Version]
  21. Hernandes, A.; Parent, S.-É.; Natale, W.; Parent, L.É. Balancing Guava Nutrition with Liming and Fertilization. Rev. Bras. Frutic. 2012, 34, 1224–1234. [Google Scholar] [CrossRef]
  22. Parent, S.-É.; Parent, L.E.; Egozcue, J.J.; Rozane, D.E.; Hernandes, A.; Lapointe, L.; Hébert-Gentile, V.; Naess, K.; Marchand, S.; Lafond, J.; et al. The Plant Ionome Revisited by the Nutrient Balance Concept. Front. Plant Sci. 2013, 4, 39. [Google Scholar] [CrossRef] [Green Version]
  23. Parent, S.-É.; Parent, L.E.; Rozane, D.E.; Natale, W. Plant Ionome Diagnosis Using Sound Balances: Case Study with Mango (Mangifera indica). Front. Plant Sci. 2013, 4, 449. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Khairiddine, N.; Khiari, L.; Leblanc, M.; Cambouris, A.; Parent, L.E. Dependence of SPAD Readings on Nutrient Balance in Potato (Solanum tuberosum L.). In Proceedings of the 13th International Symposium for Soil and Plant Analysis, Queenstown, New Zealand, 8–12 April 2013. [Google Scholar]
  25. Liu, S.; Yang, X.; Guan, Q.; Lu, Z.; Lu, J. An Ensemble Modeling Framework for Distinguishing Nitrogen, Phosphorous and Potassium Deficiencies in Winter Oilseed Rape (Brassica napus L.) Using Hyperspectral Data. Remote Sens. 2020, 12, 4060. [Google Scholar] [CrossRef]
  26. D’Oria, A.; Courbet, G.; Lornac, A.; Pluchon, S.; Arkoun, M.; Maillard, A.; Etienne, P.; Diquélou, S.; Ourry, A. Specificity and Plasticity of the Functional Ionome of Brassica Napus and Triticum Aestivum Exposed to Micronutrient or Beneficial Nutrient Deprivation and Predictive Sensitivity of the Ionomic Signatures. Front. Plant Sci. 2021, 12, 641678. [Google Scholar] [CrossRef]
  27. Beaufils, E. Diagnosis and Recommendation Integrated System (DRIS), 1st ed.; University of Natal: Pietermaritzburg, South Africa, 1973. [Google Scholar]
  28. Beverly, R.B. Modified Dris Method for Simplified Nutrient Diagnosis of ‘Valencia’ Oranges. J. Plant Nutr. 1987, 10, 1401–1408. [Google Scholar] [CrossRef]
  29. Rodríguez, O.; Rojas, E.; Sumner, M. Valencia Orange DRIS Norms for Venezuela. Commun. Soil Sci. Plant Anal. 1997, 28, 1461–1468. [Google Scholar] [CrossRef]
  30. Hernandes, A.; de Souza, H.A.; de Amorim, D.A.; Natale, W.; Lavres, J., Jr.; Boaretto, A.E.; Camacho, M.A. DRIS Norms for Pêra Orange. Commun. Soil Sci. Plant Anal. 2014, 45, 2853–2867. [Google Scholar] [CrossRef] [Green Version]
  31. Beverly, R.B. Comparison of DRIS and Alternative Nutrient Diagnostic Methods for Soybean. J. Plant Nutr. 1987, 10, 901–920. [Google Scholar] [CrossRef]
  32. Walworth, J.L.; Sumner, M.E. The Diagnosis and Recommendation Integrated System (DRIS). Adv. Soil Sci. 1987, 6, 149–188. [Google Scholar] [CrossRef]
  33. Souza, H.A.; Parent, S.É.; Rozane, D.E.; Amorim, D.A.; Modesto, V.C.; Natale, W.; Parent, L.E. Guava Waste to Sustain Guava (Psidium Guajava) Agroecosystem: Nutrient “Balance” Concepts. Front. Plant Sci. 2016, 7, 1252. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Paula, B.V.; Squizani Arruda, W.; Etienne Parent, L.; Frank de Araujo, E.; Brunetto, G. Nutrient Diagnosis of Eucalyptus at the Factor-Specific Level Using Machine Learning and Compositional Methods. Plants 2020, 9, 1049. [Google Scholar] [CrossRef] [PubMed]
  35. Rodrigues Filho, V.A.; Neves, J.C.L.; Donato, S.L.R.; Guimarães, B.V.C. Universality of Kenworthy and DRIS Norms for Prata and Cavendish Bananas Grown in Two Environments. Rev. Bras. Ciência Do Solo 2021, 45, e0200120. [Google Scholar] [CrossRef]
  36. Aitchison, J. The Statistical Analysis of Compositional Data. J. R. Stat. Soc. Ser. B 1982, 44, 139–160. [Google Scholar] [CrossRef]
  37. Parent, S.-É.; Lafond, J.; Paré, M.C.; Parent, L.E.; Ziadi, N. Conditioning Machine Learning Models to Adjust Lowbush Blueberry Crop Management to the Local Agroecosystem. Plants 2020, 9, 1401. [Google Scholar] [CrossRef]
  38. Parent, S.-É. Why We Should Use Balances and Machine Learning to Diagnose Ionomes. Authorea 2020, 1–13. [Google Scholar] [CrossRef]
  39. Witten, I.H.; Frank, E.; Hall, M.A. Data Mining: Practical Machine Learning Tools and Techniques (Google EBook), 2nd ed.; Morgan Kaufmann: Burlington, MA, USA, 2011. [Google Scholar]
  40. McQueen, R.J.; Garner, S.R.; Nevill-Manning, C.G.; Witten, I.H. Applying Machine Learning to Agricultural Data. Comput. Electron. Agric. 1995, 12, 275–293. [Google Scholar] [CrossRef] [Green Version]
  41. Behmann, J.; Mahlein, A.-K.; Rumpf, T.; Römer, C.; Plümer, L. A Review of Advanced Machine Learning Methods for the Detection of Biotic Stress in Precision Crop Protection. Precis. Agric. 2015, 16, 239–260. [Google Scholar] [CrossRef]
  42. Pietersma, D.; Lacroix, R.; Lefebvre, D.; Wade, K.M. Performance Analysis for Machine-Learning Experiments Using Small Data Sets. Comput. Electron. Agric. 2003, 38, 1–17. [Google Scholar] [CrossRef]
  43. Demšar, J.; Zupan, B. Hands-on Training about Overfitting. PLoS Comput. Biol. 2021, 17, e1008671. [Google Scholar] [CrossRef]
  44. Kampichler, C.; Wieland, R.; Calmé, S.; Weissenberger, H.; Arriaga-Weiss, S. Classification in Conservation Biology: A Comparison of Five Machine-Learning Methods. Ecol. Inform. 2010, 5, 441–450. [Google Scholar] [CrossRef]
  45. Scott, I.M.; Lin, W.; Liakata, M.; Wood, J.E.; Vermeer, C.P.; Allaway, D.; Ward, J.L.; Draper, J.; Beale, M.H.; Corol, D.I.; et al. Merits of Random Forests Emerge in Evaluation of Chemometric Classifiers by External Validation. Anal. Chim. Acta 2013, 801, 22–33. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Shekoofa, A.; Emam, Y.; Shekoufa, N.; Ebrahimi, M.; Ebrahimie, E. Determining the Most Important Physiological and Agronomic Traits Contributing to Maize Grain Yield through Machine Learning Algorithms: A New Avenue in Intelligent Agriculture. PLoS ONE 2014, 9, e97288. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Brungard, C.W.; Boettinger, J.L.; Duniway, M.C.; Wills, S.A.; Edwards, T.C. Machine Learning for Predicting Soil Classes in Three Semi-Arid Landscapes. Geoderma 2015, 239–240, 68–83. [Google Scholar] [CrossRef] [Green Version]
  48. Kirkwood, C.; Cave, M.; Beamish, D.; Grebby, S.; Ferreira, A. A Machine Learning Approach to Geochemical Mapping. J. Geochem. Explor. 2016, 167, 49–61. [Google Scholar] [CrossRef]
  49. Grunwald, S. Artificial Intelligence and Soil Carbon Modeling Demystified: Power, Potentials, and Perils. Carbon Footpr. 2022, 1, 6. [Google Scholar] [CrossRef]
  50. Li, H.; Lee, W.S.; Wang, K. Identifying Blueberry Fruit of Different Growth Stages Using Natural Outdoor Color Images. Comput. Electron. Agric. 2014, 106, 91–101. [Google Scholar] [CrossRef]
  51. Li, H.; Li, M.; Luo, J.; Cao, X.; Qu, L.; Gai, Y.; Jiang, X.; Liu, T.; Bai, H.; Janz, D.; et al. N-Fertilization Has Different Effects on the Growth, Carbon and Nitrogen Physiology, and Wood Properties of Slow- and Fast-Growing Populus Species. J. Exp. Bot. 2012, 63, 6173–6185. [Google Scholar] [CrossRef] [PubMed]
  52. Chaudhary, A.; Kolhe, S.; Kamal, R. A Hybrid Ensemble for Classification in Multiclass Datasets: An Application to Oilseed Disease Dataset. Comput. Electron. Agric. 2016, 124, 65–72. [Google Scholar] [CrossRef]
  53. Romero, J.R.; Roncallo, P.F.; Akkiraju, P.C.; Ponzoni, I.; Echenique, V.C.; Carballido, J.A. Using Classification Algorithms for Predicting Durum Wheat Yield in the Province of Buenos Aires. Comput. Electron. Agric. 2013, 96, 173–179. [Google Scholar] [CrossRef]
  54. Natarajan, R.; Subramanian, J.; Papageorgiou, E.I. Hybrid Learning of Fuzzy Cognitive Maps for Sugarcane Yield Classification. Comput. Electron. Agric. 2016, 127, 147–157. [Google Scholar] [CrossRef]
  55. Pantazi, X.E.; Moshou, D.; Alexandridis, T.; Whetton, R.L.; Mouazen, A.M. Wheat Yield Prediction Using Machine Learning and Advanced Sensing Techniques. Comput. Electron. Agric. 2016, 121, 57–65. [Google Scholar] [CrossRef]
  56. Aregay, N.; Belew, D.; Zenebe, A.; Grima, A.; Haile, M.; Gebresamuel, G. Influences of Rootstock and Agro-Climatic Condition on Physico-Chemical and Bioactive Compounds of Gunda Gundo Orange (Citrus sinensis L. Osbeck) in the Northern Ethiopia. Hortic. Plant J. 2021, 7, 509–519. [Google Scholar] [CrossRef]
  57. Edition, T. Soil Survey Staff-Keys to Soil Taxonomy, 12th ed.; USDA-Natural Resources Conservation Service: Washington, DC, USA, 2014.
  58. Lepsch, I.F.; Buol, S.W.; Daniels, R.B. Soil-landscape Relationships in the Occidental Plateau of São Paulo State, Brazil: II. Soil Morphology, Genesis, and Classification. Soil Sci. Soc. Am. J. 1977, 41, 109–115. [Google Scholar] [CrossRef]
  59. Ribeiro, L.d.O.; Costa, D.P.; Ledo, C.A.d.S.; Carvalho, L.M.d.; Carvalho, H.W.L.d.; Soares Filho, W.d.S.; Girardi, E.A. ‘Tropical Sunki’ Mandarin and Hybrid Citrus Rootstocks under ‘Pera’ Sweet Orange in Cohesive Soil and As Climate without Irrigation. Bragantia 2021, 80, e1321. [Google Scholar] [CrossRef]
  60. Natale, W.; Rozane, D.E.; Parent, S.-E.; Etienne, L. Soil Acidity and Liming in Tropical Fruit Orchards. In Soil Fertility; InTech: London, UK, 2012. [Google Scholar] [CrossRef] [Green Version]
  61. Mattos, D., Jr.; Quaggio, J.A.; Cantarella, H.; Alva, A.K. Nutrient Content of Biomass Components of Hamlin Sweet Orange Trees. Sci. Agric. 2003, 60, 155–160. [Google Scholar] [CrossRef]
  62. Hardy, S.; Khurshgid, T. Calculating Heat Units for Citrus. Primefact 2021, 749, 1–4. [Google Scholar]
  63. Bataglia, O.C.; Furlani, A.M.C.; Teixeira, J.P.F.; Furlani, P.R.; Gallo, J.R. Métodos de Análise Química de Plantas, 1st ed.; Instituto Agronômico: Campinas, Brazil, 1983.
  64. van Raij, B.; Andrade, J.C.; Cantarella, H.; Quaggio, J.A. Análise Química Para a Avaliação Da Fertilidade de Solos Tropicais, 1st ed.; Instituto Agronômico: Campinas, Brazil, 2001.
  65. Quaggio, J.A.; van Raij, B.; Malavolta, E. Alternative Use of the SMP-buffer Solution to Determine Lime Requirement of Soils. Commun. Soil Sci. Plant Anal. 1985, 16, 245–260. [Google Scholar] [CrossRef]
  66. Soares, M.R.; Alleoni, L.R.F. Contribution of Soil Organic Carbon to the Ion Exchange Capacity of Tropical Soils. J. Sustain. Agric. 2008, 32, 439–462. [Google Scholar] [CrossRef]
  67. Kuhn, M. Building Predictive Models in R Using the Caret Package. J. Stat. Softw. 2008, 28. [Google Scholar] [CrossRef] [Green Version]
  68. Hastie, T.; Tibshirani, R.; Friedman, J. Springer Series in Statistics the Elements of Statistical Learning-Data Mining, Inference, and Prediction; Springer: Berkeley, CA, USA, 2009; Volume 2. [Google Scholar]
  69. Weiss, S.M.; Kulikowski, C.A. Computer Systems That Learn: Classification and Prediction Methods from Statistics Nets, Machine Learning, and Expert Systems; Morgan Kaufmann Publishing: San Mateo, CA, USA, 1991. [Google Scholar]
  70. R Foundation for Statistical Computing R Core Team. R: A Language and Environment for Statistical Computing. 2017. Available online: https://www.R-project.org/ (accessed on 6 October 2022).
  71. Van Den Boogaart, D.G.; Tolosana-Delgado, R.; Bren, M. Compositions: Compositional Data Analysis in R Package. 2013. Available online: https://cran.r-project.org/web/packages/compositions/compositions.pdf (accessed on 6 October 2022).
  72. Amrhein, V.; Greenland, S.; McShane, B. Scientists Rise up against Statistical Significance. Nature 2019, 567, 305–307. [Google Scholar] [CrossRef] [Green Version]
  73. Delacour, H.; Servonnet, A.; Perrot, A.; Vigezzi, J.F.; Ramirez, J.M. ROC (Receiver Operating Characteristics) Curve: Principles and Application in Biology. Ann. Biol. Clin. 2005, 63, 145–154. [Google Scholar]
  74. Coulibali, Z.; Cambouris, A.N.; Parent, S.-É. Cultivar-Specific Nutritional Status of Potato (Solanum tuberosum L.) Crops. PLoS ONE 2020, 15, e0230458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. Hahn, L.; Parent, L.-É.; Feltrim, A.L.; Rozane, D.E.; Ender, M.M.; Tassinari, A.; Krug, A.V.; Berghetti, Á.L.P.; Brunetto, G. Local Factors Impact Accuracy of Garlic Tissue Test Diagnosis. Agronomy 2022, 12, 2714. [Google Scholar] [CrossRef]
  76. Montañés, L.; Heras, L.; Abadía, J.; Sanz, M. Plant Analysis Interpretation Based on a New Index: Deviation from Optimum Percentage (DOP). J. Plant Nutr. 1993, 16, 1289–1308. [Google Scholar] [CrossRef] [Green Version]
  77. Quaggio, J.; Mattos, D., Jr.; Boaretto, R.M.; Zambrosi, F.C.; Cantarella, H. Citros. In Recomendações de Adubação e Calagem para o Estado de São Paulo; Cantarella, H., Quaggio, J.A., Mattos, D., Jr., Boaretto, R., van Raij, B., Eds.; Campinas, Brazil, 2022; pp. 187–198. ISBN 9786588414095. Available online: https://d1wqtxts1xzle7.cloudfront.net/56601012/boletim_tecnico_100_pdf-with-cover-page-v2.pdf?Expires=1670230027&Signature=DD5V28udQ6-suRcCw6VRjrt57xzWwTeulenQR~q77yurrFCYXLKIK8rG0aOjan082AQM1DihplrUbKIVVW9q2OyA9i7Fs9R78xUw0RvlQNA9STjML2kC-CG3K4014D1NGpCGmPArfjVv3-Sytk7QM0KpFrdBVYU3ME0ecST4iJMa-hLbbe~5Qd89i4QrCQhKe-0spanNoXDFmQe3SmCyaLjAI9NCbqeB4H~cxSMIwQ2PoTc8aTPEI4d5B~smP8o2aWE7VjrKUbpIkj0SYgLjTsIpnRbHaS63ukhNBPUC3DmjFyj9tz6y7UaMHlbwZeb6nb~T7F7EJLMlsmoMuq0MNg__&Key-Pair-Id=APKAJLOHF5GGSLRBV4ZApdf (accessed on 6 October 2022).
  78. Brunetto, G.; Ernani, P.R.; Melo, G.W.B.d.; Nava, G. Frutíferas. In Manual de Calagem e Adubação para os Estados do Rio Grande do Sul e de Santa Catarina; Comissão de Química e Fertilidade Do Solo RS/SC; Santa Maria, Brazil, 2016; pp. 189–232. ISBN 978-85-66301-80-9. Available online: https://www.sbcs-nrs.org.br/docs/Manual_de_Calagem_e_Adubacao_para_os_Estados_do_RS_e_de_SC-2016.pdf (accessed on 6 October 2022).
  79. Bevington, K.B.; Castle, W.S. Annual Root Growth Pattern of Young Citrus Trees in Relation to Shoot Growth, Soil Temperature, and Soil Water Content. J. Am. Soc. Hortic. Sci. 1985, 110, 840–845. [Google Scholar] [CrossRef]
  80. Zambrosi, F.C.B.; Mattos, D.; Quaggio, J.A.; Cantarella, H.; Boaretto, R.M. Phosphorus Uptake by Young Citrus Trees in Low-P Soil Depends on Rootstock Varieties and Nutrient Management. Commun. Soil Sci. Plant Anal. 2013, 44, 2107–2117. [Google Scholar] [CrossRef]
  81. Quaggio, J.; Mattos, D.; Cantarella, H.; Almeida, E.L.; Cardoso, S.A. Lemon Yield and Fruit Quality Affected by NPK Fertilization. Sci. Hortic. 2002, 96, 151–162. [Google Scholar] [CrossRef]
  82. Quaggio, J.A.; Mattos, D.; Cantarella, H. Fruit Yield and Quality of Sweet Oranges Affected by Nitrogen, Phosphorus and Potassium Fertilization in Tropical Soils. Fruits 2006, 61, 293–302. [Google Scholar] [CrossRef] [Green Version]
  83. Quaggio, J.A.; Mattos Junior, D.; Boaretto, R.M. Sources and Rates of Potassium for Sweet Orange Production. Sci. Agric. 2011, 68, 369–375. [Google Scholar] [CrossRef]
  84. Alva, A.K.; Mattos, D.; Paramasivam, S.; Patil, B.; Dou, H.; Sajwan, K.S. Potassium Management for Optimizing Citrus Production and Quality. Int. J. Fruit Sci. 2006, 6, 3–43. [Google Scholar] [CrossRef]
  85. Parent, L.E.; Jamaly, R.; Atucha, A.; Jeanne Parent, E.; Workmaster, B.A.; Ziadi, N.; Parent, S.-É. Current and Next-Year Cranberry Yields Predicted from Local Features and Carryover Effects. PLoS ONE 2021, 16, e0250575. [Google Scholar] [CrossRef]
  86. Nava, G.; Reisser Júnior, C.; Parent, L.-É.; Brunetto, G.; Moura-Bueno, J.M.; Navroski, R.; Benati, J.A.; Barreto, C.F. Esmeralda Peach (Prunus persica) Fruit Yield and Quality Response to Nitrogen Fertilization. Plants 2022, 11, 352. [Google Scholar] [CrossRef] [PubMed]
  87. Badra, A.; Parent, L.E. Early Nutrient Diagnosis of Kentucky Bluegrass Combining Machine Learning and Compositional Methods. Am. J. Plant Sci. 2022, 13, 1247–1260. [Google Scholar] [CrossRef]
  88. Brunetto, G.; Stefanello, L.O.; Kulmann, M.S.d.S.; Tassinari, A.; Souza, R.O.S.d.; Rozane, D.E.; Tiecher, T.L.; Ceretta, C.A.; Ferreira, P.A.A.; Siqueira, G.N.d.; et al. Prediction of Nitrogen Dosage in ‘Alicante Bouschet’ Vineyards with Machine Learning Models. Plants 2022, 11, 2419. [Google Scholar] [CrossRef] [PubMed]
  89. Nowaki, R.H.D.; Parent, S.-É.; Cecílio Filho, A.B.; Rozane, D.E.; Meneses, N.B.; Silva, J.A.d.S.d.; Natale, W.; Parent, L.E. Phosphorus Over-Fertilization and Nutrient Misbalance of Irrigated Tomato Crops in Brazil. Front. Plant Sci. 2017, 8, 825. [Google Scholar] [CrossRef]
  90. Mattos, D.; Hippler, F.W.R.; Boaretto, R.M.; Stuchi, E.S.; Quaggio, J.A. Soil Boron Fertilization: The Role of Nutrient Sources and Rootstocks in Citrus Production. J. Integr. Agric. 2017, 16, 1609–1616. [Google Scholar] [CrossRef] [Green Version]
  91. Liu, G.; Dong, X.; Liu, L.; Wu, L.; Peng, S.; Jiang, C. Boron Deficiency Is Correlated with Changes in Cell Wall Structure That Lead to Growth Defects in the Leaves of Navel Orange Plants. Sci. Hortic. 2014, 176, 54–62. [Google Scholar] [CrossRef]
  92. Goldberg, S.; Suarez, D.L. Influence of Soil Solution Cation Composition on Boron Adsorption by Soils. Soil Sci. 2011, 176, 80–83. [Google Scholar] [CrossRef] [Green Version]
  93. Mesquita, G.L.; Zambrosi, F.C.B.; Tanaka, F.A.O.; Boaretto, R.M.; Quaggio, J.A.; Ribeiro, R.V.; Mattos, D. Anatomical and Physiological Responses of Citrus Trees to Varying Boron Availability Are Dependent on Rootstock. Front. Plant Sci. 2016, 7, 224. [Google Scholar] [CrossRef] [Green Version]
  94. Matoh, T. Boron in Plant Cell Walls. Plant Soil 1997, 193, 59–70. [Google Scholar] [CrossRef]
  95. Brown, P.H.; Shelp, B.J. Boron Mobility in Plants. Plant Soil 1997, 193, 85–101. [Google Scholar] [CrossRef]
  96. Boaretto, R.M.; Quaggio, J.A.; de Assis Alves Mourão Filho, F.; Giné, M.F.; Boaretto, A.E. Absorption and Mobility of Boron in Young Citrus Plants. Commun. Soil Sci. Plant Anal. 2008, 39, 2501–2514. [Google Scholar] [CrossRef]
  97. Boaretto, R.M.; Quaggio, J.A.; Mattos, D.; Muraoka, T.; Boaretto, A.E. Boron Uptake and Distribution in Field Grown Citrus Trees. J. Plant Nutr. 2011, 34, 839–849. [Google Scholar] [CrossRef]
  98. Tagliavini, M.; Zavalloni, C.; Rombolà, A.D.; Quartieri, M.; Malaguti, D.; Mazzanti, F.; Millard, P.; Marangoni, B. Mineral Nutrient Partitioning to Fruits of Deciduous Trees. Acta Hortic. 2000, 512, 131–140. [Google Scholar] [CrossRef]
  99. Roy, E.D.; Richards, P.D.; Martinelli, L.A.; Coletta, L.D.; Lins, S.R.M.; Vazquez, F.F.; Willig, E.; Spera, S.A.; VanWey, L.K.; Porder, S. The Phosphorus Cost of Agricultural Intensification in the Tropics. Nat. Plants 2016, 2, 16043. [Google Scholar] [CrossRef] [PubMed]
  100. Novais, R.; Kamprath, E.J. Phosphorus Supplying Capacities of Previously Heavily Fertilized Soils. Soil Sci. Soc. Am. J. 1978, 42, 931–935. [Google Scholar] [CrossRef]
  101. Baligar, V.C.; Bennett, O.L. Outlook on Fertilizer Use Efficiency in the Tropics. Fertil. Res. 1986, 10, 83–96. [Google Scholar] [CrossRef]
  102. Baligar, V.C.; Bennett, O.L. NPK-Fertilizer Efficiency—A Situation Analysis for the Tropics. Fertil. Res. 1986, 10, 147–164. [Google Scholar] [CrossRef]
  103. Sumner, M.E.; Farina, M.P.W. Phosphorus Interactions with Other Nutrients and Lime in Field Cropping Systems. Adv. Soil Sci. 1986, 5, 201–236. [Google Scholar] [CrossRef]
  104. Mattos Junior, D.; Quaggio, J.A.; Cantarella, H.; Carvalho, S.A. De Superfícies de Resposta Do Tangor “Murcott” à Fertilização Com N, P e K. Rev. Bras. Frutic. 2004, 26, 164–167. [Google Scholar] [CrossRef] [Green Version]
  105. Boaretto, A.E.; Muraoka, T.; Boaretto, R.M. Absorção E Translocação De Micronutrientes, Aplicados Via Foliar, Pelos Citros. Laranja 2003, 24, 177–197. [Google Scholar]
  106. Boaretto, A.E.; Boaretto, R.M.; Muraoka, T.; Nascimento Filho, V.F.; Tiritan, C.S.; Mourão Filho, F.A.A. Foliar Micronutrient Application Effects on Citrus Fruit Yield, Soil and Leaf Zn Concentrations and 65Zn Mobilization within the Plant. Acta Hortic. 2002, 594, 203–209. [Google Scholar] [CrossRef]
  107. Quaggio, J.A.; Souza, T.R.; Bachiega Zambrosi, F.C.; Marcelli Boaretto, R.; Mattos, D. Nitrogen-Fertilizer Forms Affect the Nitrogen-Use Efficiency in Fertigated Citrus Groves. J. Plant Nutr. Soil Sci. 2014, 177, 404–411. [Google Scholar] [CrossRef]
  108. Parent, S.-É.; Parent, L.E. Balance Designs Revisit Indices Commonly Used in Agricultural Science and Eco-Engineering. In Proceedings of the 7th International Workshop on Compositional Data Analysis, Abbadia San Salvatore, Italy, 5–9 June 2017; pp. 95–227. [Google Scholar]
  109. Brunetto, G.; Bastos de Melo, G.W.; Terzano, R.; Del Buono, D.; Astolfi, S.; Tomasi, N.; Pii, Y.; Mimmo, T.; Cesco, S. Copper Accumulation in Vineyard Soils: Rhizosphere Processes and Agronomic Practices to Limit Its Toxicity. Chemosphere 2016, 162, 293–307. [Google Scholar] [CrossRef] [Green Version]
  110. Brunetto, G.; Ferreira, P.A.A.; Melo, G.W.; Ceretta, C.A.; Toselli, M. Heavy Metals in Vineyards and Orchard Soils1. Rev. Bras. Frutic. 2017, 39, e-263. [Google Scholar] [CrossRef] [Green Version]
  111. Hippler, F.W.R.; Boaretto, R.M.; Quaggio, J.A.; Azevedo, R.A.; Mattos, D. Towards Soil Management with Zn and Mn: Estimates of Fertilisation Efficacy of Citrus Trees. Ann. Appl. Biol. 2015, 166, 484–495. [Google Scholar] [CrossRef]
  112. Keppel, G.; Kreft, H. Integration and Synthesis of Quantitative Data: Alexander von Humboldt’s Renewed Relevance in Modern Biogeography and Ecology. Front. Biogeogr. 2019, 11, e43187. [Google Scholar] [CrossRef]
Figure 1. Elliptical distribution of true negative (TN) specimens under the classical assumption of multinormal distribution where nutrient centroids are represented by the empty circle. Data are distributed as ‘Islands’ or ‘blobs’. Some TN specimens are located in ‘blobs’ outside the ellipse. The diagnosed specimen (dark circle) is located at shorter distance from two blobs’ centroids compared to the regional centroids.
Figure 1. Elliptical distribution of true negative (TN) specimens under the classical assumption of multinormal distribution where nutrient centroids are represented by the empty circle. Data are distributed as ‘Islands’ or ‘blobs’. Some TN specimens are located in ‘blobs’ outside the ellipse. The diagnosed specimen (dark circle) is located at shorter distance from two blobs’ centroids compared to the regional centroids.
Horticulturae 08 01126 g001
Figure 2. Discriminant analysis of scion ionomes (upper figure) and soil test results in the 0–20 cm and 20–40 cm layers of soil classes (lower figure). P = ‘Pêra’ X ‘Tangerina Sunki’ (●); HAM = ‘Hamlin’ X ‘Citrumelo Swingle’ (▪); VAL = ‘Valência’ X ‘Citrumelo Swingle’ (+); RYO—Red-Yellow Oxisol (●); RO = Red Oxisol (▪); U = Red-Yellow Ultisol (▲); ilr = orthonormal isometric log-ratio.
Figure 2. Discriminant analysis of scion ionomes (upper figure) and soil test results in the 0–20 cm and 20–40 cm layers of soil classes (lower figure). P = ‘Pêra’ X ‘Tangerina Sunki’ (●); HAM = ‘Hamlin’ X ‘Citrumelo Swingle’ (▪); VAL = ‘Valência’ X ‘Citrumelo Swingle’ (+); RYO—Red-Yellow Oxisol (●); RO = Red Oxisol (▪); U = Red-Yellow Ultisol (▲); ilr = orthonormal isometric log-ratio.
Horticulturae 08 01126 g002aHorticulturae 08 01126 g002b
Figure 3. Order of nutrient limitations in a defective ‘Valência’ X ‘Citrumelo Swingle’ specimen diagnosed at regional or blob (k = 1 or 6) scales. The signs of clr indices indicate relative nutrient excess (+) or deficiency (−). Fruit yield averaged 53–67 tons ha−1 in the blobs compared to 20 tons ha−1 for the defective specimen.
Figure 3. Order of nutrient limitations in a defective ‘Valência’ X ‘Citrumelo Swingle’ specimen diagnosed at regional or blob (k = 1 or 6) scales. The signs of clr indices indicate relative nutrient excess (+) or deficiency (−). Fruit yield averaged 53–67 tons ha−1 in the blobs compared to 20 tons ha−1 for the defective specimen.
Horticulturae 08 01126 g003aHorticulturae 08 01126 g003b
Table 1. Number of observations, yield ranges, and flowering and harvesting periods for surveyed ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Valência’ X ‘Citrumelo Swingle’, and ‘Pêra’ X ‘Tangerina Sunki’.
Table 1. Number of observations, yield ranges, and flowering and harvesting periods for surveyed ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Valência’ X ‘Citrumelo Swingle’, and ‘Pêra’ X ‘Tangerina Sunki’.
‘Hamlin’‘Pêra’‘Valência’
No. of observations121126300
tons ha−1
Minimum yield18.86.41.1
Median yield62.839.452.1
Maximum yield136.4102.4141.4
2012 season
Flowering periodSeptember–October 2012September–October 2012September–October 2012
Harvest periodMay–June 2013July–October 2013October–December 2013
2013 season
Flowering periodSeptember–October 2013September–October 2013September–October 2013
Harvest periodMay–June 2014July–October 2014October–December 2014
Table 2. Tissue test results for ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Valência’ X ‘Citrumelo Swingle’, and ‘Pêra’ X ‘Tangerina Sunki’ compared to current Brazilian concentration ranges [3,77].
Table 2. Tissue test results for ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Valência’ X ‘Citrumelo Swingle’, and ‘Pêra’ X ‘Tangerina Sunki’ compared to current Brazilian concentration ranges [3,77].
‘Hamlin’ X ‘Citrumelo Swingle’‘Valência’ X ‘Citrumelo Swingle’
MinimumMedianMaximumMinimumMedianMaximum
g kg−1g kg−1
N21.625.633.018.225.736.1
P0.91.22.70.81.23.0
K8.013.970.76.5013.687.1
Ca13.834.649.717.535.756.0
Mg1.83.37.92.03.78.4
S1.82.621.31.72.623.4
B0.0380.0990.2110.0280.0970.251
Cu0.0050.0520.3330.0050.0640.545
Zn0.0140.0370.1470.0120.0410.154
Mn0.0180.0520.2000.0110.0490.176
Fe0.0400.1293.9740.0480.1163.746
‘Pêra’ X ‘Tangerina Sunki’Current Brazilian standards
MinimumMedianMaximumLower BoundCentroidUpper Bound
g kg−1g kg−1
N20.124.135.02527.530
P0.81.21.71.21.41.6
K8.813.727.11012.515
Ca12.532.856.03542.550
Mg1.63.15.33.54.25.0
S1.62.53.22.02.53.0
B0.0360.0650.2010.0500.1000.150
Cu0.0070.0630.4860.0100.0150.020
Zn0.0150.0430.2180.0350.0530.070
Mn0.0190.0450.1650.0300.0450.060
Fe0.0410.1070.2920.0500.0100.150
Table 3. Statistics of soil properties in the 0–20 and 20–40 cm layers.
Table 3. Statistics of soil properties in the 0–20 and 20–40 cm layers.
ScionpH (CaCl2)SOMPKCaMg(H+Al) †CECBase Saturation
g dm−3mg dm−3mmolc dm−3%
0–20 cm layer
‘Hamlin’5.19 ± 0.5525 ± 1234 ± 253 ± 230 ± 2614 ± 1029 ± 1475 ± 3857 ± 16
‘Valência’5.30 ± 0.5422 ± 1332 ± 202 ± 126 ± 1712 ± 824 ± 1263 ± 2960 ± 15
‘Pêra’5.06 ± 0.5615 ± 628 ± 212 ± 119 ± 109 ± 521 ± 850 ± 1756 ± 14
20–40 cm layer
‘Hamlin’5.06 ± 0.7325 ± 1532 ± 492 ± 128 ± 2412 ± 926 ± 1169 ± 3557 ± 15
‘Valência’5.05 ± 0.5021 ± 1324 ± 242 ± 124 ± 1811 ± 823 ± 860 ± 2957 ± 14
‘Pêra’4.73 ± 0.4614 ± 618 ± 162 ± 116 ± 118 ± 622 ± 747 ± 1850 ± 15
† Exchangeable acidity.
Table 4. Performance of the random forest regression model after partitioning the data set into training (70%) and testing (30%).
Table 4. Performance of the random forest regression model after partitioning the data set into training (70%) and testing (30%).
FeaturesYield as tons ha−1 §Yield as kg Tree−1
R2RMSER2RMSE
Training data set
Temperature, rainfall, scion, age, tissue nutrients, soil classification, S1, S20.9057.2950.91314.544
Scion, age, tissue nutrients, soil classification, S1, S20.8987.5710.90715.061
Scion, age, tissue nutrients, soil classification, S10.8977.5930.90515.196
Scion, age, tissue nutrients, soil classification0.8987.5750.90814.970
Scion, age, nutrient balances0.8977.5860.89915.647
Nutrient balances0.8608.8520.88516.674
Testing data set
Temperature, rainfall, scion, age, tissue nutrients, soil classification, S1, S20.28517.8740.50636.804
Scion, age, tissue nutrients, soil classification, S1, S20.32117.4150.51535.691
Scion, age, tissue nutrients, soil classification, S10.25718.2170.52036.274
Scion, age, tissue nutrients, soil classification0.10919.9510.48937.438
Scion, age, nutrient balances0.14419.5620.49437.248
Nutrient balances0.08620.2100.42339.759
§ Computed as the product of tree density and yield per tree; † plant density added as feature in those models; S1, S2: soil properties in the 0–20 and 20–40 cm layer, respectively.
Table 5. Performance of the random forest binary classification model at yield cutoff values of 50 and 60 tons ha−1 using cross-validation (10 folds).
Table 5. Performance of the random forest binary classification model at yield cutoff values of 50 and 60 tons ha−1 using cross-validation (10 folds).
Features50 tons ha−160 tons ha−1
AUCAccuracyAUCAccuracy
Temperature, rainfall, scion, age, tissue nutrients, soil classification, S1, S20.7960.7480.8060.740
Scion, age, tissue nutrients, soil classification, S1, S20.7970.7300.8130.750
Scion, age, tissue nutrients, soil classification, S10.8110.7420.8010.755
Scion, age, tissue nutrients, soil classification0.7990.7480.7990.731
Scion, age, nutrient balances0.7830.7350.7830.728
Nutrient balances0.6830.6620.6580.633
Table 6. Partitioning of the true negative (TN), false negative (FN), false positive (FP), and true positive (TP) results among scion X rootstock combinations in the confusion matrix of the binary random forest model that included all features.
Table 6. Partitioning of the true negative (TN), false negative (FN), false positive (FP), and true positive (TP) results among scion X rootstock combinations in the confusion matrix of the binary random forest model that included all features.
Scion X RootstockTNFNFPTPTotalNPVPPVSpecificitySensitivityAccuracy
60 tons ha−1
‘Hamlin’ X ‘Citrumelo Swingle’671219231210.850.550.780.660.74
‘Pêra’ X ‘Tangerina Sunki’04211011260.000.830.000.960.80
‘Valência’ X ‘Citrumelo Swingle’8343451333040.660.750.650.760.71
15059852575510.720.750.640.810.74
50 tons ha−1
‘Hamlin’ X ‘Citrumelo Swingle’871012121210.900.500.880.550.82
‘Pêra’ X ‘Tangerina Sunki’.211721671260.550.760.500.800.70
‘Valência’ X ‘Citrumelo Swingle’1534929733040.760.720.840.600.74
26176621525510.770.710.810.670.75
Table 7. Lower and upper quartiles of true negative specimens of ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Pêra’ X ‘Tangerina Sunki’, and ‘Valência’ X ‘Citrumelo Swingle’ at yield cutoff value of 50 tons ha−1.
Table 7. Lower and upper quartiles of true negative specimens of ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Pêra’ X ‘Tangerina Sunki’, and ‘Valência’ X ‘Citrumelo Swingle’ at yield cutoff value of 50 tons ha−1.
Nutrients‘Hamlin’ X ‘Citrumelo Swingle’‘Pêra’ X ‘Tangerina Sunki’‘Valência’ X ‘Citrumelo Swingle’
Lower QuartileUpper QuartileLower QuartileUpper QuartileLower QuartileUpper Quartile
N24.327.422.625.924.326.8
P1.11.41.01.31.11.3
K11.816.912.315.511.715.4
Ca30.840.425.839.231.940.9
Mg2.83.82.63.53.24.0
S2.42.92.32.82.42.8
B0.0810.1180.0540.0800.0760.127
Cu0.0250.0800.0270.0620.0320.087
Zn0.0250.0560.0280.0480.0290.056
Mn0.0370.0740.0300.0620.0370.068
Fe0.0930.1440.0750.1210.0960.140
Table 8. Nutrient standards as clr mean and standard deviation (SD) of ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Pêra’ X ‘Tangerina Sunki’, and ‘Valência’ X ‘Citrumelo Swingle’ TN specimens at a yield cutoff of 50 tons ha−1.
Table 8. Nutrient standards as clr mean and standard deviation (SD) of ‘Hamlin’ X ‘Citrumelo Swingle’, ‘Pêra’ X ‘Tangerina Sunki’, and ‘Valência’ X ‘Citrumelo Swingle’ TN specimens at a yield cutoff of 50 tons ha−1.
Nutrients‘Hamlin’ X ‘Citrumelo Swingle’‘Pêra’ X ‘Tangerina Sunki’‘Valência’ X ‘Citrumelo Swingle’
clr Meanclr SDclr Meanclr SDclr Meanclr SD
50 tons ha−1
N2.8660.2092.9230.2202.7890.155
P−0.1420.239−0.1160.236−0.2720.198
K2.2610.3262.3540.2112.1570.323
Ca3.1530.2473.0400.2783.1410.171
Mg0.8120.2410.8630.2550.8330.216
S0.6070.2810.5910.1630.5580.320
B−2.5380.370−2.7280.262−2.5400.314
Cu−3.3550.430−3.5440.309−3.4040.370
Zn−3.6740.875−3.3300.765−3.3280.668
Mn−3.7320.414−3.6600.255−3.5540.438
Fe−2.6780.335−2.9240.408−2.7440.366
Fv6.3650.1346.5320.2116.4210.199
60 tons ha−1
N2.8890.213--2.7800.140
P−0.1160.226--−0.3220.161
K2.2610.332--2.0930.247
Ca3.1660.261--3.1600.143
Mg0.8270.245--0.8270.191
S0.6360.299--0.4900.114
B−2.5240.388--−2.5950.212
Cu−3.3210.434--−3.3850.436
Zn−3.7870.846--−3.2330.481
Mn−3.7640.416--−3.4560.467
Fe−2.6970.364--−2.7320.396
Fv6.4310.212--6.3720.112
Table 9. Nutrient diagnosis of a defective ‘Valência’ X ‘Citrumelo Swingle’ specimen.
Table 9. Nutrient diagnosis of a defective ‘Valência’ X ‘Citrumelo Swingle’ specimen.
ComponentConcentrationBrazilian Nutrient Ranges §SpecimenRegional †
g kg−1clr
N31.025303.0242.7890.155
P1.31.21.6−0.188−0.2720.198
K13.010152.1562.1570.323
Ca36.535503.1863.1410.171
Mg4.23.55.01.0190.8330.216
S2.32.03.00.4350.5580.320
B0.1340.0500.150−2.619−2.5400.314
Cu0.0560.0100.020−3.950−3.4040.370
Zn0.0350.0350.070−3.287−3.3280.668
Mn0.0290.0300.060−3.760−3.5540.438
Fe0.1100.0500.150−2.421−2.7440.366
Filling value---6.4046.4210.199
† Across features under the ceteris paribus assumption; § Quaggio et al. (2022) [77].
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yamane, D.R.; Parent, S.-É.; Natale, W.; Cecílio Filho, A.B.; Rozane, D.E.; Nowaki, R.H.D.; Mattos Junior, D.d.; Parent, L.E. Site-Specific Nutrient Diagnosis of Orange Groves. Horticulturae 2022, 8, 1126. https://doi.org/10.3390/horticulturae8121126

AMA Style

Yamane DR, Parent S-É, Natale W, Cecílio Filho AB, Rozane DE, Nowaki RHD, Mattos Junior Dd, Parent LE. Site-Specific Nutrient Diagnosis of Orange Groves. Horticulturae. 2022; 8(12):1126. https://doi.org/10.3390/horticulturae8121126

Chicago/Turabian Style

Yamane, Danilo Ricardo, Serge-Étienne Parent, William Natale, Arthur Bernardes Cecílio Filho, Danilo Eduardo Rozane, Rodrigo Hiyoshi Dalmazzo Nowaki, Dirceu de Mattos Junior, and Léon Etienne Parent. 2022. "Site-Specific Nutrient Diagnosis of Orange Groves" Horticulturae 8, no. 12: 1126. https://doi.org/10.3390/horticulturae8121126

APA Style

Yamane, D. R., Parent, S. -É., Natale, W., Cecílio Filho, A. B., Rozane, D. E., Nowaki, R. H. D., Mattos Junior, D. d., & Parent, L. E. (2022). Site-Specific Nutrient Diagnosis of Orange Groves. Horticulturae, 8(12), 1126. https://doi.org/10.3390/horticulturae8121126

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop