Next Article in Journal
Optimizing Continuous Casting through Cyber–Physical System
Previous Article in Journal
Techno-Economic Analysis of Ethylene Adsorptive Separation Using Zeolite 13X in Oxidative Coupling of Methane Integrated Process
Previous Article in Special Issue
Tiny Particles, Big Problems: The Threat of Microplastics to Marine Life and Human Health
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integration of Slurry–Total Reflection X-ray Fluorescence and Machine Learning for Monitoring Arsenic and Lead Contamination: Case Study in Itata Valley Agricultural Soils, Chile

by
Guillermo Medina-González
1,*,
Yelena Medina
2,
Enrique Muñoz
3,4,
Paola Andrade
5,
Jordi Cruz
6,
Yakdiel Rodriguez-Gallo
7 and
Alison Matus-Bello
1
1
Department of Environmental Chemistry, Faculty of Sciences, Universidad Católica de la Santísima Concepción, Concepción 4090541, Chile
2
EMOingenieros Ltda., Concepción 4090070, Chile
3
Departament of Civil Engineering, Faculty of Engineering, Universidad Católica de la Santísima Concepción, Concepción 4090541, Chile
4
Centro de Investigación en Biodiversidad y Ambientes Sustentables CIBAS, Universidad Católica de la Santísima Concepción, Concepción 4090541, Chile
5
Departament of Ecology, Faculty of Sciences, Universidad Católica de la Santísima Concepción, Concepción 4090541, Chile
6
Escola Universitària Salesiana de Sarrià (EUSS School of Engineering, Barcelona), 08017 Barcelona, Spain
7
Faculty of Engineering, Don Bosco University, Calle a Plan del Pino Km 1 1/2, Soyapango 1874, El Salvador
*
Author to whom correspondence should be addressed.
Processes 2024, 12(8), 1760; https://doi.org/10.3390/pr12081760
Submission received: 10 July 2024 / Revised: 16 August 2024 / Accepted: 19 August 2024 / Published: 20 August 2024
(This article belongs to the Special Issue Solid and Hazardous Waste Disposal and Resource Utilization)

Abstract

:
The accuracy of determining arsenic and lead using the optical technique Slurry–Total Reflection X-ray Fluorescence (Slurry-TXRF) was significantly enhanced through the application of a machine learning method, aimed at improving the ecological risk assessment of agricultural soils. The overlapping of the arsenic Kα signal at 10.55 keV with the lead Lα signal at 10.54 keV due to the relatively low resolution of TXRF could compromise the determination of lead. However, by applying a Partial Least Squares (PLS) machine learning algorithm, we mitigated interference variations, resulting in improved selectivity and accuracy. Specifically, the average percentage error was reduced from 15.6% to 9.4% for arsenic (RMSEP improved from 5.6 mg kg−1 to 3.3 mg kg−1) and from 18.9% to 6.8% for lead (RMSEP improved from 12.3 mg kg−1 to 5.03 mg kg−1) compared to the previous univariable model. This enhanced predictive accuracy, within the set of samples concentration range, is attributable to the efficiency of the multivariate calibration first-order advantage in quantifying the presence of interferents. The evaluation of X-ray fluorescence emission signals for 26 different synthetic calibration mixtures confirmed these improvements, overcoming spectral interferences. Additionally, the application of these models enabled the quantification of arsenic and lead in soils from a viticultural subregion of Chile, facilitating the estimation of ecological risk indices in a fast and reliable manner. The results indicate that the contamination level of these soils with arsenic and lead ranges from moderate to considerable.

1. Introduction

In environmental geochemistry, it is crucial to understand the concepts of geological baselines and critical levels. This understanding is important for differentiating between naturally occurring substance levels and pollution caused by human activities [1]. Several ecological indicators based on these geological baselines have been suggested to evaluate the extent and impact of soil contamination by trace elements, including the Geoaccumulation Index (GEOI), Enrichment Factor (EF), and Contamination Factor (CF) [2].
Establishing the baseline levels of trace elements and ecological indices in a particular location is important for detecting contamination. Still, it requires the systematic gathering and examination of a large number of samples [3].
Quantitative analyses in soil studies often utilize various optical analytical methods, including traditional approaches like inductively coupled plasma mass spectrometry (ICP-MS) [4], flame atomic absorption spectroscopy (FAAS) [5], inductively coupled plasma emission spectroscopy (ICP-OES) [6], graphite furnace atomic absorption spectroscopy (GF-AAS) [7], and cold vapor atomic absorption spectrometry (CV-AAS) [8].
Total X-ray fluorescence spectroscopy (TXRF) is a reliable and convenient alternative method for determining trace element concentrations. This versatile technique can be applied with or without sample digestion, making it a highly favored choice [2,9,10].
However, these methods often lead to significant spectral interferences. Studies have been carried out to explore the use of machine learning algorithms such as Partial Least Squares Regression (PLS-R) in liquid matrices [11], the Synchrotron Radiation X-ray Fluorescence technique [12], and Cd-K in soils by TXRF [13], where these models improve the accuracy of quantification in the presence of spectral interferences.
Several studies have examined the levels of heavy metals or potentially toxic elements in areas primarily used for agriculture and mining [2,14,15,16,17,18,19,20,21]. However, there is a lack of studies on effectively monitoring arsenic and lead contamination in agricultural soils, particularly in regions like Itata Valley. While there are standardized methods suggested by international quality standards for the determination of arsenic and lead [22], these methods are often slow, expensive, and environmentally unfriendly.
Therefore, it is crucial to continue developing and improving analytical methods for trace element analysis in soil samples, as well as exploring innovative techniques and approaches such as machine learning and X-ray fluorescence to overcome challenges and improve quantification accuracy in the context of low-resolution analytical methods. [23].
This study aimed to assess the effectiveness of the specific method of slurry-TXRF combined with machine learning models in analyzing arsenic and lead levels in viticultural soils. This method offers a rapid and efficient way to estimate and monitor contamination levels using various ecological indices, providing an innovative alternative for gaining valuable insights into the extent and potential risks of heavy-metal contamination in vineyard soils. These insights can help make informed decisions about soil management practices and ensure sustainable agriculture for the quality and safety of products.

2. Materials and Methods

This study on improving lead and arsenic measurement using slurry-TXRF and machine learning is part of a larger research effort focused on studying heavy-metal contamination in agricultural soils. Medina-Gonzalez et al.’s (2023) [2] study served as the basis for our methodology with certain adaptations.

2.1. Study Area and Samples

A well-known wine-producing region in southern Chile was chosen as the focus of this study. This region covers around 311,418 hectares, with approximately 13,030 hectares allocated to agricultural activities, specifically for wine production [24]. The Itata Valley is notable for its cultivation of grape varieties like País, Muscat of Alexandria, and Cinsault, with the latter being the most widely grown in terms of cultivation area [25].
Geographically, the Itata Valley is part of the Central Valley and Coastal Range, with elevations not exceeding 400 m above sea level. The area benefits from a Mediterranean climate, with distinct rainy and dry seasons, influenced by the Itata River, which flows from east to west across the valley. The presence of the Coastal Range creates climatic variations, with the western slopes receiving more rainfall compared to the drier eastern slopes [24].
The valley’s soil originates primarily from three parent materials:
  • Eroded Metamorphic Rock: Found in higher elevations with steep or complex slopes, this soil is characterized by shale, sandstone, phyllite, and slates. It has a clayey texture with slow water infiltration and is prone to forming Catena due to topography and drainage patterns.
  • Granite Origin: Derived from granite and diorite rocks, this soil also has a clayey texture and low water infiltration, making it susceptible to erosion. It is typically found in hilly areas with complex slopes.
  • Fine Alluvial Sediment: Formed from fluvio-glacial sediments transported by rivers, this soil has a loamy–clayey texture and poor drainage. The thickness of these sediments varies significantly across the valley.
In summary, the soils in the Itata Valley are primarily derived from eroded metamorphic rocks, granite and diorite, and fine alluvial sediments, each contributing to unique characteristics such as soil texture, water infiltration, and erosion susceptibility [25].
The soil sampling was carried out in two phases, namely June and October of 2023. These samples were obtained from five vineyards (A, B, C, D, E) at a total of 48 sampling points within the area corresponding to the Itata Valley, following the owners’ authorization and using the combined method as recommended by Servicio Agrícola y Ganadero de Chile [26] (Figure 1).
Approximately two kilograms of the sample were collected at each sampling location by taking five subsamples from the corners and center of a 1 × 1 m grid to create a composite sample. The topsoil samples (0–20 cm depth) were collected while removing any plant cover present and deep soil samples at a depth of 150 cm using a Dutch auger. Surface soils are most affected by human activities, whereas deep soils represent relatively undisturbed samples considered free from contamination [27].
The samples were packed in polyethylene bags for transport to the laboratory and then dried in an oven at 40 °C for 48 h. Next, each sample was sieved using a nylon sieve to remove coarse material and other residues, leaving only the fine material (<2 mm). The sieved samples were stored in tightly sealed plastic bags for further analysis [28].

2.2. Sample Preparation

2.2.1. PLS Model Samples

Standard calibration and external validation samples were prepared using 1000 mg L−1 solutions of As and Pb. Ultra-pure deionized water with a resistance of 18.2 MΩ cm−1 obtained from a Milli-Q quality purification system was used for dilution purposes. A silicone solution in isopropanol was applied to create a silicon film on reflective disks of quartz glass before the deposition of the sample. Working solutions with concentrations of 100 mg L−1 were utilized to generate a total of 26 synthetic calibration solutions (Table 1) and 8 external validation samples (Table 2) for these elements in 2 mL Eppendorf tubes.
Each sample preparation included a random mixing of concentrations to avoid any potential correlations that could affect the calibration process. The concentration ranges were intentionally chosen to facilitate the measurement of arsenic and lead in environmental samples with varying levels of contamination. Sample preparation involved using the drop-drying method, which entails applying a small volume (such as 10 μL) of the mixture onto 30 mm diameter quartz sample disks and drying them with an infrared lamp.
The resulting solid residues on the sample holders were then analyzed in an S4TStar TXRF spectrometer after the evaporation of the liquid phase [2].
To create a model in milligrams per kilogram (mg kg−1) concentration units from liquid calibration samples, enabling us to evaluate its suitability in solid matrices, conversions in concentrations were conducted using Equation (1):
mgkg 1 = mgL 1 × V dep S dep
where Vdep represents the volume of liquid calibration sample deposited in liters, while Sdep denotes the quantity of dry solid sample deposited in kilograms, based on the specific methodologies employed.

2.2.2. Slurry Sample Preparation

The soil sludge samples were prepared in sterilized 2.5 mL Eppendorf vials by combining 30 mg of dried and pulverized soil with 1500 μL of Triton-X as a surfactant using an agate mortar. The mixtures were homogenized for 300 s with an electronic shaker. After this, 10 μL of the thoroughly homogenized suspension was extracted and placed into the TXRF’s 30 mm diameter quartz sample holder immediately to prevent potential sedimentation. A silicone solution in isopropanol was then applied to create a silicon film on the quartz glass disk before depositing the sample, which was followed by drying it with an infrared lamp. To evaluate accuracy and suitability, loamy clay certified reference material for trace metals from Sigma-Aldrich was used to test the model’s performance in soil samples.

2.3. TXRF Spectra and Data Acquisition

This study used a TXRF S4TSTAR analytical system (Bruker® AXS Microanalysis GmbH, Berlin, Germany) for fluorescent emission measurements. The X-ray source operated at 50 W with an electron flow of 1 mA at 50 kV and was cooled by air. The system included a multi-layer carbon–nickel monochromator of 150 nm and a high-resolution silicon semiconductor detector XFlashR with an active area of 100 mm2 and energy resolution < 140 eV. The specific range of interest was defined from 10.27 keV to 12.78 keV, focused on the overlap of the arsenic Kα line (10.54 keV) and lead Lα line (10.55 keV). Counts were recorded using the SPECTRA 7.5.0.3 software for data analysis [29], generating a .txt format file processed through PIROUETTE 4.5 software [30].

2.4. Quantification Methods

For the aim of assessing the efficacy of the PLS model in quantifying arsenic and lead, in comparing its performance to classical deconvolution methods utilizing the SPECTRA 7.5.0.3 software, specifically the deconvolution routine (SuperBayes), the quantification was carried out employing internal standardization with germanium (10 ng).
A set of 26 synthetic mixtures served as the basis for PLS analysis. These mixtures were deliberately correlated with their known concentrations and employed as calibration samples. The fluorescence spectra within the calibration set underwent preprocessing. Specifically, mean centering was applied as the sole transformation method without additional alterations.
Latent variables were chosen through the cross-validation procedure employing the leaving-one-out method. This method systematically excluded each sample from the calibration set one at a time, utilizing it as a prediction object. With this, the root mean square error of cross-validation (RMSECV), calculated as per Equation (2), served as the criterion to determine the optimal number of latent variables essential for establishing the calibration model.
To assess the effectiveness of the Partial Least Squares model in quantifying arsenic and lead, its performance was compared to those of classical deconvolution methods using Spectra 7.5.0.3 software’s SuperBayes routine with internal standardization using germanium (10 ng). A set of 26 synthetic mixtures correlated with known concentrations served as calibration samples for PLS analysis. These fluorescence spectra underwent mean-centering preprocessing without any further transformation. The optimal number of latent variables required for establishing the calibration model was determined through cross-validation and root mean square error of cross-validation [31], calculated according to Equation (2):
RMSECV = y i y ^ i 2 / n
where ŷi represents the estimated concentration values obtained through cross-validation, with each sample i being estimated using a model constructed from the sample set excluding that particular sample. Here, yi denotes the known concentration variable value, and n denotes the total number of samples in the calibration set.
The same sample set used for the direct quantification of arsenic and lead through deconvolution (Table 2) using the SPECTRA software also played another important role: external validation of the model. External validation is a crucial practice in chemometrics that helps guard against overfitting in calibration models, ensuring their robustness and generalizability [32].
The root mean square error of prediction (RMSEP) was used to evaluate the predictive accuracy of the multivariate model [33]. It is calculated similarly to RMSECV, but in this case, each predicted sample (yi) from the final model is compared with the corresponding reference value from an external validation set (ŷ(i,ref)), where N represents the size of the external validation set. The calculation for the RMSEP follows Equation (3):
RMSEP = y i y ^ i , ref 2 / N
Linear regression can identify biases in the predictions of external samples, similar to its use in internal validation procedures. Equation (4) shows the linear regression approach used to assess systematic biases within the predictions for external samples.
y ^ = a + y b
The comparison between predicted values ( y ^ ) and reference values (y) is crucial for evaluation. Ideally, predictions made by the calibration model should exhibit a linear relationship with known values—a line with a slope (b) of 1 and an intercept (a) of 0. Any deviations from this linear behavior indicate issues arising during the analytical procedure [34]. The Elliptical Joint Confidence Region (EJCR) method was employed to assess the correlation of results and obtain a statistically significant confidence level of 95% [35].

Analytical Figures of Merit (AFOM’s)

Figures of Merit for the PLS approach were calculated according to the NAS theory. This concept works by isolating relevant signals associated with a particular component of interest from the various interfering elements present in the spectra. The specific method utilized in this research follows the approach described by Short et al. [36], concentrating on deriving figures of merit using the NAS theory applied to spectral data.

2.5. Background Values and Ecological Indices

The assessment of natural or baseline values required the examination of data from deep soil samples following the procedures outlined in ISO 19258 [37]. There is no standardized method for setting threshold values to identify samples with abnormally high levels of element concentrations, as noted by Reimann et al. [38]. Nonetheless, this research applies the “MAD” technique, which makes use of the range from Median ± 2MAD (median absolute deviation) to pinpoint values suggestive of elevated element concentrations.
Various environmental indicators are frequently utilized to evaluate possible soil pollution, aiding in the differentiation between natural trace element accumulation and that caused by human actions [18,39]. One commonly used method is the Geoaccumulation Index, which is determined using a specific equation (Equation (5)):
GEOI = log 2 ( C i / 1 . 5 C b )
where Ci is the measured content of the element in the soil, and Cb is the estimated background value of that element in the soil. According to Muller [40], the GEOI for a particular element is calculated and then classified as shown in Table 3.
The Enrichment Factor (EF) is determined by standardizing the concentration of a measured element with respect to a reference element. The selection of the reference element relies on its minimal variability in occurrence within the specific environment [38] and its independence from anthropogenic influences [41]. Commonly used reference elements comprise Sc, Mn, Ti, Al, and Fe. This research chose Fe as the reference element due to its significantly low variation in viticultural soils in the Itata Valley [1]. Its abundance in the Earth’s crust and its connection with the soil matrix validate its choice as a stable reference element [42]. The EF is calculated using Equation (6):
EF = ( C i / CFe i ) / ( C b / CFe b )
where Ci represents the concentration of the particular element, CFei denotes the quantity of Fe as a reference constituent, Cb is the reference content derived via the MAD method, while CFeb signifies background content for Fe. Enrichment Factor values are divided into five categories based on Table 4:
The Contamination Factor (CF) is calculated using Equation (7):
CF = C i / C b
where Ci represents the concentration of the examined element in the soil, and Cb is the content of background values obtained using the MAD method. Hakanson [43] defines the categories for CF as presented in Table 5.

3. Results

3.1. Classical Regression: Deconvolution

Table 6 shows the results obtained for the quantification of arsenic and lead in conditions where there is overlap, using the classical method of internal standardization followed by signal deconvolution.
The results indicate significant positive biases for both arsenic and lead, with relative biases ranging from 11.9% to 150% for arsenic and from 5.9% to 74.3% for lead. These values, respectively, in relation to the reference values of the external validation set, reveal a noticeable systematic error. The calculated RMSEP for this quantification method is 5.6 mg kg−1 for arsenic and 12.3 mg kg−1 for lead, resulting in an average percentage error of 15.6% and 18.9%, respectively, within the concentration range of the sample set.

3.2. Machine Learning Regression: Partial Least Squares

The X-ray fluorescent emission spectra of the 26 samples, which were utilized to develop the PLS models, cover a range of energies, including the Kα (10.54 keV) and Kβ (11.73 keV) lines for arsenic, as well as Lα (10.55 keV) and Lβ (12.61 keV) lines for lead, as shown in Figure 2.
When considering two latent variables, the model shows RMSECVs of 2.95 mg kg−1 and 3.92 mg kg−1. For external validation, there is an RMSEP of 3.31 mg kg−1 and 5.03 mg kg−1 for As and Pb, respectively. The discrepancy between both parameters is expected due to the tendency of cross-validation to be overly optimistic in comparison to external sample validation.
Additionally, it is observed that both cross-validation and external validation predictions for these elements exhibit a high level of correlation with the real concentrations of the samples, achieving correlation coefficients rVal = 0.981 and rPred = 0.972 for As and rVal = 0.986 and rPred = 0.982 for Pb (Figure 3).
The quantification biases for arsenic and lead, observed in both multivariate models and classical quantification methods (internal standardization and deconvolution), have significantly reduced. There is no longer a pronounced positive trend, and the percentage of biases has notably decreased. The summary of all these parameters for the PLS model is presented in Table 7.
When examining the regression vectors depicted in Figure 4, it can be observed that the multivariate model’s predictions for arsenic concentration predominantly rely on signals produced within the energy range corresponding to its primary emission line (Kα). Signals from the secondary emission line (Kβ) also have a lesser impact. This finding highlights how incorporating multiple variables and assessing their contributions improves the accuracy of predicting arsenic levels, effectively managing interference from lead’s fluorescent signal.
In contrast, there is a more evident overlap of signals, particularly from lead’s most intense emission line (Lα). Consequently, signals from its weaker line (Lβ) become more significant within the model. This indicates that reliance on signals from the Lβ zone is higher due to interference with the stronger Lα line.

3.3. Elliptical Joint Confidence Region (EJCR)

In employing the EJCR correlation method at a significance level (α) of 0.05, it has been statistically shown that determinations made using univariate methods like internal standardization and deconvolution lack accuracy. This conclusion is drawn from the failure to meet the criteria of a slope of 1 and an intercept of 0, indicating a deviation from the ideal correlation.
Figure 5 visually illustrates this difference. The ellipse in the figure demonstrates the disparity between the ideal point representing slope vs. intercept and the observed correlation, highlighting that there is a significant distance between the observed data points and their ideal position.
The deconvolution via software prior internal standardization fails to satisfy the EJCR criteria, as evidenced by significant deviations in slope and intercept from their ideal values (α = 0.05).
In contrast, multivariate calibration using Partial Least Squares demonstrates accuracy in quantification. Adhering to the EJCR criteria with an ideality point within the ellipse confirms that both slope and intercept statistically approach 1 and 0, respectively (α = 0.05).
This comparison emphasizes how effective multivariate calibration through PLS is in addressing overlapping X-ray fluorescence signals of arsenic and lead. Consequently, this method produces more precise and accurate results than deconvolution processes.

3.4. AFOM

Table 8 displays the analytical performance metrics for each quantification approach. It is evident from the table that the PLS method has resulted in an enhanced limit of detection by two orders of magnitude and limit of quantification by three orders and two orders of magnitude for As and Pb, respectively, compared to internal standard calibration. The analytical sensitivities obtained through PLS models have also seen important improvements.

3.5. Application in Soil Samples

Due to the lack of noticeable differences in the spectra between liquid and well-prepared solid samples within the relevant range, a comparison of techniques was conducted using a validated soil sample (Clay 2—CRM051). The assessment employing a certified soil sample, based on the t-test criteria (α = 0.05, n = 3), reveals that both techniques yield results that are not statistically different from the certified value (Table 9). However, in terms of bias and accuracy, the PLS model surpasses the quantification achieved through internal standardization and signal deconvolution. The PLS model demonstrates lower bias and higher accuracy compared to other methods. Subsequently, the developed PLS models for arsenic and lead quantification were utilized to evaluate these metals in the viticultural soils of Itata Valley.

3.6. Ecological Assessment of Viticultural Soils

Table 10 provides a statistical overview of the contents of As and Pb in topsoil samples. The difference between the highest and lowest concentrations of As and Pb indicates spatial diversity within the topsoil samples. This variance suggests natural disparities in soil composition among the locations sampled, potentially influenced by variations in mineral content, organic matter, geological origins, or human activities [2].
After establishing the upper limits based on the background levels, an evaluation was conducted to determine how many samples exceed these limits for both arsenic and lead, as detailed in Table 10. The results indicate that a majority of samples—approximately 95% for both elements—have concentrations exceeding these established upper limits.
Figure 6 graphically illustrates the results of the GEOI analysis, suggesting that the soils exhibit relatively low to moderate levels of contamination. This evaluation likely considers specific element concentrations in relation to their natural or background levels. EF assessments indicate a significant number of sampling points showing moderate to substantial enrichment, particularly when compared to iron concentrations, which show minimal vertical variation in the soil. The more rigorous CF index, not considering natural variations, indicates that most sampling points demonstrate moderate to considerable contamination levels for both elements. These conclusions align with the areas identified by the GEOI as moderately contaminated and reinforce their importance in terms of contamination severity.

4. Discussion

While lower RMSE values are preferable, the average percentage errors of 15.6% for arsenic and 18.9% for lead quantification using classical regression and deconvolution are considered reasonably acceptable, particularly given the analytical technique and concentration range involved. Such error magnitudes are common in this field, as evident from various studies on solid matrices [2,44,45,46].
Conversely, the utilization of the proposed PLS machine learning algorithm demonstrates improved performance, with the average percentage error notably reduced from 15.6 to 9.4% for arsenic (from an RMSEP of 5.6 mg kg−1 to 3.3 mg kg−1) and from 18.9% to 6.8% for lead (from an RMSEP of 12.3 mg kg−1 to 5.03 mg kg−1) compared to the previous univariable model. This enhanced predictive accuracy within the sample set’s concentration range is attributable to the efficiency of the multivariate calibration’s first-order advantage in quantifying in the presence of interferents, which enables the identification and use of lower-intensity signals to mitigate the errors generated by signal overlaps [46]. The RMSEP results undoubtedly demonstrate improvement; however, they do not surpass the performance achieved for liquid samples in TXRF using PLS models. Notably, the multivariate model developed by Nagata et al. (2006) [12] facilitated the determination of lead and arsenic with RMSEPs of 0.03 and 0.24 mg l−1, respectively. However, the Slurry-TXRF with PLS improvements are limited in comparison to liquid sample analysis, primarily due to two factors. Firstly, liquid samples present fewer physical interferences for total reflection [10]. Achieving a comparable state from solid samples would necessitate significant resources, including time and reagents for digestion. Secondly, the TXRF equipment utilized in the aforementioned research employs synchrotron radiation TXRF (SR-TXRF), which enhances the instrument performance [12].
According to the EJCR criteria, the deconvolution method provided by the univariate calibration software for mitigating the overlap of fluorescent arsenic and lead signals appears inadequate in generating reliable quantifications, in contrast with the superior performance of the PLS method. However, as mentioned earlier, the quality of results obtained through classical calibration is commonly encountered within this analytical technique and concentration range.
Alongside the improved quality of results observed with PLS quantification, all figures of merit are also enhanced, potentially due to the multivariate calibration’s ability to exploit modeled interferents, thereby improving selectivity and reducing background noise [36].
The proposed methodology also reduces costs and analysis time compared to traditional methods [11]. This novel approach, based on Slurry-TXRF and PLS quantification models, facilitates soil monitoring for the measurement of arsenic and lead contents, and it is presented as an alternative with a positive technical and environmental impact. This is a preliminary advancement in an era where automation and unmanned systems, known for their reliability and cost-effectiveness, are becoming increasingly popular. [47].
The enhanced methodology for quantifying arsenic and lead indicates that approximately 95% of the agricultural soil samples from the Itata Valley exceed the background limit, which is expected for a soil type influenced by productive anthropogenic activity where the addition of these elements is possible through the use of fertilizers [48] and fossil fuels from machinery [49]. Additionally, the geomorphological fluvial–glacial–volcanic plain zone that the Itata Valley covers [24] could significantly contribute to the presence of these two elements in the environment, especially arsenic [50].
In general, the ecological indices suggest that the agricultural soils in the studied area are moderately contaminated by these elements, consistent with previous findings on other potentially toxic substances. However, this level of contamination does not necessarily imply immediate adverse biological effects on the resident communities or pose significant health risks to humans.

5. Conclusions

The methodology proposed in our study, which integrates Slurry-TXRF with a Partial Least Squares (PLS) machine learning algorithm, offers a significant advancement in the field of environmental monitoring, particularly for arsenic and lead contamination in soils. While the accuracy of our results does not surpass that of traditional methods, such as hydride generation for arsenic (which typically achieves errors of less than 5% with quantification limits of 0.2 mg/kg) [51] and flame atomic absorption for lead (with expected errors around 2%) [52], the application of machine learning techniques has notably improved the performance of Slurry-TXRF. Specifically, our study achieved a reduction in the average percentage error from 15.6% to 9.4% for arsenic (an RMSEP from 5.6 mg kg−1 to 3.3 mg kg−1) and from 18.9% to 6.8% for lead (an RMSEP from 12.3 mg kg−1 to 5.03 mg kg−1) just through using multivariate calibration.
These improvements, while not surpassing the precision and accuracy of the most established methods, provide significant practical advantages for environmental monitoring. The proposed methodology is faster, simpler, less costly, and more environmentally friendly, making it a valuable alternative in situations where these factors are critical.
Additionally, the PLS calibration effectively corrects the mutual influence of overlapping signals between arsenic and lead, enhancing the reliability of quantitative analyses. This is particularly advantageous in complex soil matrices where traditional methods may struggle with signal interference.
Overall, while the traditional methods remain superior in numerical accuracy, the combined use of Slurry-TXRF and machine learning represents a promising alternative that balances reasonable accuracy with substantial operational benefits. This approach is well suited for efficient environmental and ecological evaluations, particularly in regions where rapid and cost-effective monitoring is necessary.
Our assessment indicates moderate to significant contamination levels with arsenic and lead in the soils of the Itata Valley, which is typical for agriculturally used soils. However, further studies would be required to assess the potential risks to human health, including an evaluation of factors such as elemental speciation and migration from soil to plant parts, which are outside the scope of the present study.

Author Contributions

Conceptualization, G.M.-G.; data curation, Y.M. and P.A.; formal analysis, G.M.-G., Y.M., and A.M.-B.; investigation, G.M.-G., E.M., and P.A.; methodology, G.M.-G. and J.C.; project administration, Y.M. and P.A.; resources, G.M.-G.; supervision, G.M.-G.; founding, E.M.; validation, G.M.-G., E.M., and J.C.; visualization, Y.R.-G. and A.M.-B.; writing—original draft, P.A. and A.M.-B.; writing—review and editing, G.M.-G., Y.M., E.M., J.C., and Y.R.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Agencia Nacional de Investigación, grant number 21170516. The APC was funded by the Universidad Católica de la Santísima Concepción by means of projects Ingeniería 2030 (ING222010004) and InES Ciencia Abierta (INCA210005).

Data Availability Statement

The data supporting this article will be made available by the authors upon request.

Acknowledgments

The authors are deeply grateful for the support of project REDTPI4.0-320RT0006 CYTED program.

Conflicts of Interest

Author Yelena Medina was employed by the company EMOingenieros Ltda. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Guo, J.; Liu, L.; Feng, Q.; Okonkwo, C.U.; Zhang, X.; Chen, J.; Yang, S.; Yang, C. Heavy Metal Pollution near an Abandoned Mercury-Bearing Waste Recovery Enterprise in Southwestern China: Spatial Distribution and Its Sources in Soil and Plants. Environ. Res. 2024, 252, 118694. [Google Scholar] [CrossRef]
  2. Medina-González, G.; Medina, Y.; Muñoz, E.; Fuentes, P. Rapid and Convenient Assessment of Trace Element Contamination in Agricultural Soils through Slurry-TXRF and Ecological Indices: The Ñuble Region, Chile as a Case Study. Sustainability 2023, 15, 9190. [Google Scholar] [CrossRef]
  3. Saha, J.K.; Selladurai, R.; Coumar, M.V.; Dotaniya, M.L.; Kundu, S.; Patra, A.K. Assessment of Heavy Metals Contamination in Soil. In Soil Pollution—An Emerging Threat to Agriculture; Environmental Chemistry for a Sustainable World; Springer Singapore: Singapore, 2017; Volume 10, pp. 155–191. ISBN 978-981-10-4273-7. [Google Scholar]
  4. Mao, J.; Liu, X.; Chen, B.; Luo, F.; Wu, X.; Jiang, D.; Luo, Z. Determination of Heavy Metals in Soil by Inductively Coupled Plasma Mass Spectrometry (ICP-MS) with Internal Standard Method. Electron. Sci. Technol. Appl. 2017, 4. [Google Scholar] [CrossRef]
  5. Yahaya, S.M.; Abubakar, F.; Abdu, N. Ecological Risk Assessment of Heavy Metal-Contaminated Soils of Selected Villages in Zamfara State, Nigeria. SN Appl. Sci. 2021, 3, 168. [Google Scholar] [CrossRef]
  6. Al-Momani, I.F.; Al-Massaedh, A.A. Assessment of Heavy Metal Contamination in Roadside Soils along Irbid-Amman Highway, Jordan by ICP-OES. Jordan J. Chem. 2020, 15, 1–12. [Google Scholar] [CrossRef]
  7. Ahmad, I.; Ansari, T.M. An Assessment of Toxic Heavy Metals in Soil and Plants (Allium Cepa and Daucus Carota) by GFAAS. Int. J. Environ. Anal. Chem. 2022, 102, 1029–1048. [Google Scholar] [CrossRef]
  8. Ismičić-Tanjo, D.; Huremović, J.; Selović, A.; Šehović, E. Human Health Risk Assessment of Mercury in Soil–Plants System. Int. J. Environ. Res. 2021, 15, 797–805. [Google Scholar] [CrossRef]
  9. Carvalho, P.M.; Marguí, E.; Kubala-Kukuś, A.; Banaś, D.; Machado, J.; Casal, D.; Pais, D.; Santos, J.P.; Pessanha, S. Evaluation of Different Analytical Approaches Using Total Reflection X-Ray Fluorescence Systems for Multielemental Analysis of Human Tissues with Different Adipose Content. Spectrochim. Acta Part B At. Spectrosc. 2022, 198, 106548. [Google Scholar] [CrossRef]
  10. Klockenkämper, R.; Bohlen, A. Von Total-Reflection X-Ray Fluorescence Analysis and Related Methods; John Wiley & Sons: Hoboken, NJ, USA, 2014; ISBN 978-1-118-98587-8. [Google Scholar]
  11. Liu, C.; Zhang, F.; Wang, X.; Chan, N.W.; Rahman, H.A.; Yang, S.; Tan, M.L. Assessing the Factors Influencing Water Quality Using Environment Water Quality Index and Partial Least Squares Structural Equation Model in the Ebinur Lake Watershed, Xinjiang, China. Environ. Sci. Pollut. Res. 2022, 29, 29033–29048. [Google Scholar] [CrossRef]
  12. Nagata, N.; Peralta-Zamora, P.G.; Poppi, R.J.; Perez, C.A.; Bueno, M.I.M.S. Multivariate Calibrations for the SR-TXRF Determination of Trace Concentrations of Lead and Arsenic in the Presence of Bromine. X-Ray Spectrom. 2006, 35, 79–84. [Google Scholar] [CrossRef]
  13. González, G.M.; Castillo, R.d.P.; Neira, J.Y. Multivariate Calibration for the Improvement of the Quantification of Cadmium in the Presence of Potassium as Interferent by Total Reflection X-Ray Fluorescence. X-Ray Spectrom. 2019, 48, 700–707. [Google Scholar] [CrossRef]
  14. Carvajal, M.; Jeldres, P.; Vergara, A.; Lobaina, E.; Olivares, M.; Meza, D.; Velásquez, A.; Dorta, F.; Jorquera, F.; Seeger, M.; et al. Bioremoval of Copper by Filamentous Fungi Isolated from Contaminated Soils of Puchuncaví-Ventanas Central Chile. Environ. Geochem. Health 2023, 45, 4275–4293. [Google Scholar] [CrossRef] [PubMed]
  15. Corradini, F.; Meza, F.; Calderón, R. Trace Element Content in Soil after a Sediment-Laden Flood in Northern Chile. J. Soils Sediments 2017, 17, 2500–2515. [Google Scholar] [CrossRef]
  16. Madrid, E.; Pino, J.A.; Muñoz, S.; Cardemil, F.; Martinez, F.; Berasaluce, M.; San Martín, S.; Reyes, J.; González-Miranda, I. Oxidative Damage Associated with Exposure to Heavy Metals Present in Topsoils in Central Chile. Environ. Geochem. Health 2023, 45, 9891–9901. [Google Scholar] [CrossRef] [PubMed]
  17. Rubinos, D.A.; Jerez, Ó.; Forghani, G.; Edraki, M.; Kelm, U. Geochemical Stability of Potentially Toxic Elements in Porphyry Copper-Mine Tailings from Chile as Linked to Ecological and Human Health Risks Assessment. Environ. Sci. Pollut. Res. 2021, 28, 57499–57529. [Google Scholar] [CrossRef]
  18. Tume, P.; González, E.; Reyes, F.; Fuentes, J.P.; Roca, N.; Bech, J.; Medina, G. Sources Analysis and Health Risk Assessment of Trace Elements in Urban Soils of Hualpen, Chile. Catena 2019, 175, 304–316. [Google Scholar] [CrossRef]
  19. Tume, P.; Melipichún, T.; Ferraro, F.; Sepúlveda, B.; Roca, N.; Bech, J. Contamination of As, Cd, Cr, Hg and Pb in Soils in Arica Commune (Chile). Environ. Geochem. Health 2023, 45, 9199–9213. [Google Scholar] [CrossRef]
  20. Li, Z.; He, M.-Y.; Li, B.; Wen, X.; Zhou, J.; Cheng, Y.; Zhang, N.; Deng, L. Multi-Isotopic Composition (Li and B Isotopes) and Hydrochemistry Characterization of the Lakko Co Li-Rich Salt Lake in Tibet, China: Origin and Hydrological Processes. J. Hydrol. 2024, 630, 130714. [Google Scholar] [CrossRef]
  21. Liu, J.; Wang, Y.; Li, Y.; Peñuelas, J.; Zhao, Y.; Sardans, J.; Tetzlaff, D.; Liu, J.; Liu, X.; Yuan, H.; et al. Soil Ecological Stoichiometry Synchronously Regulates Stream Nitrogen and Phosphorus Concentrations and Ratios. Catena 2023, 231, 107357. [Google Scholar] [CrossRef]
  22. US EPA, O. SW-846 Test Method 6010D: Inductively Coupled Plasma-Optical Emission Spectrometry (ICP-OES). Available online: https://www.epa.gov/hw-sw846/sw-846-test-method-6010d-inductively-coupled-plasma-optical-emission-spectrometry-icp-oes (accessed on 16 August 2024).
  23. Yang, Z.; Lu, W.; Long, Y.; Bao, X.; Yang, Q. Assessment of Heavy Metals Contamination in Urban Topsoil from Changchun City, China. J. Geochem. Explor. 2011, 108, 27–38. [Google Scholar] [CrossRef]
  24. Serra, I.; Calderón-Orellana, A.; Hidalgo, M. Heroic Viticulture in Itata Valley, Chile: Characteristics and Challenges for the Development of Unique Wines in Southern Chilean Vineyards. In Latin American Viticulture Adaptation to Climate Change: Perspectives and Challenges of Viticulture Facing up Global Warming; Gutiérrez Gamboa, G., Fourment, M., Eds.; Springer International Publishing: Cham, Switzerland, 2024; pp. 215–228. ISBN 978-3-031-51325-1. [Google Scholar]
  25. Stolpe, B.N. Descripciones de Los Principales Suelos de La VIII Región de Chile; Universidad de Concepción: Concepción, Chile, 2006. [Google Scholar]
  26. Protocolo de Toma de Muestras de Suelos|SAG. Available online: https://www.sag.gob.cl/content/protocolo-de-toma-de-muestras-de-suelos (accessed on 21 June 2024).
  27. Kang, Z.; Wang, S.; Qin, J.; Wu, R.; Li, H. Pollution Characteristics and Ecological Risk Assessment of Heavy Metals in Paddy Fields of Fujian Province, China. Sci. Rep. 2020, 10, 12244. [Google Scholar] [CrossRef]
  28. Feng, F.; Jiang, Y.; Jia, Y.; Lian, X.; Shang, C.; Zhao, M. Exogenous-Organic-Matter-Driven Mobilization of Groundwater Arsenic. Environ. Sci. Ecotechnology 2023, 15, 100243. [Google Scholar] [CrossRef] [PubMed]
  29. XRF Software. Available online: https://www.bruker.com/en/products-and-solutions/elemental-analyzers/xrf-spectrometers/xrf-software.html (accessed on 21 June 2024).
  30. Pirouette Software, Version 4.5; Infometrix Inc.: Bothell, WA, USA, 2014.
  31. Kusumiyati; Mubarok, S.; Sutari, W.; Hadiwijaya, Y. Application of Spectra Pre-Treatments on Firmness Assessment of Intact Sapodilla Using Vis-Nir Spectroscopy. IOP Conf. Ser. Earth Environ. Sci. 2021, 644, 012001. [Google Scholar] [CrossRef]
  32. Montesinos López, O.A.; Montesinos López, A.; Crossa, J. Overfitting, Model Tuning, and Evaluation of Prediction Performance. In Multivariate Statistical Machine Learning Methods for Genomic Prediction; Montesinos López, O.A., Montesinos López, A., Crossa, J., Eds.; Springer International Publishing: Cham, Switzerland, 2022; pp. 109–139. ISBN 978-3-030-89010-0. [Google Scholar]
  33. Westad, F.; Marini, F. Variable Selection and Redundancy in Multivariate Regression Models. Front. Anal. Sci. 2022, 2, 897605. [Google Scholar] [CrossRef]
  34. Monago-Maraña, O.; Domínguez-Manzano, J.; Muñoz de la Peña, A.; Durán-Merás, I. Second-Order Calibration in Combination with Fluorescence Fibre-Optic Data Modelling as a Novel Approach for Monitoring the Maturation Stage of Plums. Chemom. Intell. Lab. Syst. 2020, 199, 103980. [Google Scholar] [CrossRef]
  35. Beh, E.J. Elliptical Confidence Regions for Simple Correspondence Analysis. J. Stat. Plan. Inference 2010, 140, 2582–2588. [Google Scholar] [CrossRef]
  36. Short, S.M.; Cogdill, R.P.; Anderson, C.A. Determination of Figures of Merit for Near-Infrared and Raman Spectrometry by Net Analyte Signal Analysis for a 4-Component Solid Dosage System. AAPS PharmSciTech 2007, 8, 96. [Google Scholar] [CrossRef] [PubMed]
  37. ISO 19258:2018(En); Soil Quality—Guidance on the Determination of Background Values. Available online: https://www.iso.org/obp/ui/en/#iso:std:iso:19258:ed-2:v1:en (accessed on 21 June 2024).
  38. Reimann, C.; de Caritat, P. Distinguishing between Natural and Anthropogenic Sources for Elements in the Environment: Regional Geochemical Surveys versus Enrichment Factors. Sci. Total Environ. 2005, 337, 91–107. [Google Scholar] [CrossRef] [PubMed]
  39. Kowalska, J.B.; Mazurek, R.; Gąsiorek, M.; Zaleski, T. Pollution Indices as Useful Tools for the Comprehensive Evaluation of the Degree of Soil Contamination—A Review. Environ. Geochem. Health 2018, 40, 2395–2420. [Google Scholar] [CrossRef]
  40. Muller, G. Index of Geoaccumulation in Sediments of the Rhine River. GeoJournal 1969, 2, 108–118. [Google Scholar]
  41. Samuel, B.; Solomon, S.; Daniel, F.; Zinabu, G.M.; Riise, G. Heavy Metals Contamination of Soil in the Vicinity of Hawassa Industrial Zone, Ethiopia. J. Appl. Sci. Environ. Manag. 2020, 24, 1447–1454. [Google Scholar] [CrossRef]
  42. Zhu, Y.; Wang, L.; Zhao, X.; Lian, J.; Zhang, Z. Accumulation and Potential Sources of Heavy Metals in Soils of the Hetao Area, Inner Mongolia, China. Pedosphere 2020, 30, 244–252. [Google Scholar] [CrossRef]
  43. Hakanson, L. An Ecological Risk Index for Aquatic Pollution Control.a Sedimentological Approach. Water Res. 1980, 14, 975–1001. [Google Scholar] [CrossRef]
  44. Towett, E.K.; Shepherd, K.D.; Cadisch, G. Quantification of Total Element Concentrations in Soils Using Total X-Ray Fluorescence Spectroscopy (TXRF). Sci. Total Environ. 2013, 463–464, 374–388. [Google Scholar] [CrossRef] [PubMed]
  45. Bilo, F.; Borgese, L.; Cazzago, D.; Zacco, A.; Bontempi, E.; Guarneri, R.; Bernardello, M.; Attuati, S.; Lazo, P.; Depero, L.E. TXRF Analysis of Soils and Sediments to Assess Environmental Contamination. Environ. Sci. Pollut. Res. 2014, 21, 13208–13214. [Google Scholar] [CrossRef] [PubMed]
  46. Espinoza-Quiñones, F.R.; Módenes, A.N.; dos Santos, J.; Obregón, P.L.; de Pauli, A.R. Insights on Limits of Detection, Precision and Accuracy in TXRF Analysis of Trace and Major Elements in Environmental Solid Suspensions. Appl. Radiat. Isot. 2018, 137, 80–90. [Google Scholar] [CrossRef] [PubMed]
  47. Liu, L.; Gong, Y.; Miao, Y.; Guo, J.; Long, H.; Feng, Q.; Chen, Y. New Trends in Pollution Prevention and Control Technology for Healthcare and Medical Waste Disposal in China. Processes 2024, 12, 7. [Google Scholar] [CrossRef]
  48. Khatun, J.; Intekhab, A.; Dhak, D. Effect of Uncontrolled Fertilization and Heavy Metal Toxicity Associated with Arsenic(As), Lead(Pb) and Cadmium (Cd), and Possible Remediation. Toxicology 2022, 477, 153274. [Google Scholar] [CrossRef] [PubMed]
  49. Inumaru, J.; Hasegawa, T.; Shirai, H.; Nishida, H.; Noda, N.; Ohyama, S. 1—Fossil Fuels Combustion and Environmental Issues. In Advances in Power Boilers; Ozawa, M., Asano, H., Eds.; JSME Series in Thermal and Nuclear Power Generation; Elsevier: Amsterdam, The Netherlands, 2021; Volume 2, pp. 1–56. ISBN 978-0-12-820360-6. [Google Scholar]
  50. Bundschuh, J.; Schneider, J.; Alam, M.A.; Niazi, N.K.; Herath, I.; Parvez, F.; Tomaszewska, B.; Guilherme, L.R.G.; Maity, J.P.; López, D.L.; et al. Seven Potential Sources of Arsenic Pollution in Latin America and Their Environmental and Health Impacts. Sci. Total Environ. 2021, 780, 146274. [Google Scholar] [CrossRef]
  51. Nakagoe, S.; Olivares, J.C.; García, L.; Yubero, F.; Monteiro, M.; Mereles, L.; Diez-Pérez-Núñez, D. Validación de un método analítico para la determinación de arsénico total en muestras de suelo por espectrofotometría de absorción atómica con generación de hidruros. Innotec 2023, 25, e635. [Google Scholar] [CrossRef]
  52. Sabol, J. Major Analytical Methods for Determining Lead in Environmental and Biological Samples. In Lead in Plants and the Environment; Gupta, D.K., Chatterjee, S., Walther, C., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 1–13. ISBN 978-3-030-21638-2. [Google Scholar]
Figure 1. The study area shows the vineyards where samples were collected.
Figure 1. The study area shows the vineyards where samples were collected.
Processes 12 01760 g001
Figure 2. X-ray fluorescence spectra of the 26 calibration samples. The spectra of each sample are represented in different colors.
Figure 2. X-ray fluorescence spectra of the 26 calibration samples. The spectra of each sample are represented in different colors.
Processes 12 01760 g002
Figure 3. Actual vs. predicted values of arsenic (a) and lead (b) in cross-validation and external validation procedure.
Figure 3. Actual vs. predicted values of arsenic (a) and lead (b) in cross-validation and external validation procedure.
Processes 12 01760 g003
Figure 4. Regression coefficients for the PLS models for the quantification of arsenic (a) and lead (b), including lines Kα (10.54 keV) and Kβ (11.73 keV) of arsenic and Lα (10.55 keV) and Lβ (12.61 keV) of lead.
Figure 4. Regression coefficients for the PLS models for the quantification of arsenic (a) and lead (b), including lines Kα (10.54 keV) and Kβ (11.73 keV) of arsenic and Lα (10.55 keV) and Lβ (12.61 keV) of lead.
Processes 12 01760 g004
Figure 5. EJCR for the quantification of arsenic (a) and lead (b) by means of internal standardization (orange ellipse) and PLS (blue ellipse). The black dot indicates the ideal: a slope of 1 and an intercept of 0.
Figure 5. EJCR for the quantification of arsenic (a) and lead (b) by means of internal standardization (orange ellipse) and PLS (blue ellipse). The black dot indicates the ideal: a slope of 1 and an intercept of 0.
Processes 12 01760 g005
Figure 6. Ecological indices of every viticultural soil sample (black dots) from the Itata Valley for As (arsenic) and Pb (lead). Geoaccumulation Index (a), Enrichment Factor (b), and Contamination Factor (c).
Figure 6. Ecological indices of every viticultural soil sample (black dots) from the Itata Valley for As (arsenic) and Pb (lead). Geoaccumulation Index (a), Enrichment Factor (b), and Contamination Factor (c).
Processes 12 01760 g006
Table 1. Composition of synthetic calibration mixtures.
Table 1. Composition of synthetic calibration mixtures.
SampleAs (mg kg−1)Pb (mg kg−1)
C11034
C21272
C34813
C44220
C53767
C63315
C73319
C83249
C9839
C104365
C113536
C12495
C13104
C144578
C151331
C163369
C174136
C182422
C19436
C20772
C212823
C223336
C234218
C243647
C25043
C261227
Range4974
Minimum04
Maximum4978
Table 2. Composition of synthetic mixtures for external validation.
Table 2. Composition of synthetic mixtures for external validation.
SampleAs (mg kg−1)Pb (mg kg−1)
V1366
V21954
V3132
V41251
V53378
V61642
V72171
V8217
All samples contained 10 ng of germanium as internal standard.
Table 3. Contamination level according to the Geoaccumulation Index [18].
Table 3. Contamination level according to the Geoaccumulation Index [18].
GEOI GroupLevel of Contamination
GEOI ≤ 0Uncontaminated
0 < GEOI ≤ 1Slightly contaminated
1 < GEOI ≤ 2Moderately contaminated
2 < GEOI ≤ 3Moderately to heavily contaminated
3 < GEOI ≤ 4Heavily contaminated
4 < GEOI ≤ 5Heavily to extremely contaminated
GEOI > 5Extremely contaminated
Table 4. Contamination level according to the Enrichment Factor (EF) [18].
Table 4. Contamination level according to the Enrichment Factor (EF) [18].
EF GroupLevel of Contamination
EF < 2Deficiency to minimal enrichment
2 < EF < 5Moderate enrichment
5 < EF < 20Significant enrichment
20 < EF < 40Very high enrichment
EF > 40Extremely high enrichment
Table 5. Contamination level according to Contamination Factor (CF) [18].
Table 5. Contamination level according to Contamination Factor (CF) [18].
CF GroupLevel of Contamination
CF < 1Low contamination
1 < CF < 3Moderate contamination
3 < CF < 6Considerable contamination
CF > 6Very high contamination
Table 6. Comparison of As and Pb quantification results by Internal Standardization vs. PLS.
Table 6. Comparison of As and Pb quantification results by Internal Standardization vs. PLS.
SampleConcentration
(mg kg−1)
Internal
Standardization
(mg kg−1)
Relative Bias
(%)
PLS
(mg kg−1)
Relative Bias
(%)
AsPbAsPbAsPbAsPbAsPb
A36645.38.125.835.039.87.210.620.0
B195424.268.227.426.315.262.4−20.015.6
C1322.533.9150.05.91.931.490.0−1.9
D125116.260.335.018.29.356.6−22.511.0
E338040.387.622.19.538.583.416.74.3
F164218.552.215.624.318.237.413.8−11.0
G217128.398.134.838.217.364.2−17.6−9.6
H21723.512.211.974.322.310.86.254.3
RMSEP (mg kg−1) 5.612.3 3.35.0
RMSEP (%) 15.618.9 9.46.8
Table 7. Characteristics and parameters of the PLS model for As and Pb.
Table 7. Characteristics and parameters of the PLS model for As and Pb.
As ModelPb Model
RMSECV (mg kg−1)2.953.92
rVal0.9810.986
RMSEP (mg kg−1)3.315.03
rPred0.9720.982
Calibration Samples26
Number of Variables257
PretreatmentMean-centering
Latent Variables2
Range [As] (mg kg−1)0 to 49
Range [Pb] (mg kg−1)4 to 78
Table 8. Analytical figures of merit using internal standardization and PLS methods are presented, including limit of detection (LOD), limit of quantification (LOQ), sensitivity, and analytical sensitivity.
Table 8. Analytical figures of merit using internal standardization and PLS methods are presented, including limit of detection (LOD), limit of quantification (LOQ), sensitivity, and analytical sensitivity.
Figure of MeritInternal StandardPLS
AsPbAsPb
LOD (mg kg−1)5.42 × 10−26.28 × 10−22.56 × 10−43.48 × 10−4
LOQ (mg kg−1)1.64 × 10−11.90 × 10−17.69 × 10−41.05 × 10−3
Sensitivity (counts (mg kg−1)−1)432298474402
Analytical Sensitivity ((mg kg−1)−1)18.312.759.663.3
Table 9. Comparison of accuracy for arsenic (As) and lead (Pb) quantification in Certified Soil Sample Clay 2—CRM051 using internal standardization and PLS methods.
Table 9. Comparison of accuracy for arsenic (As) and lead (Pb) quantification in Certified Soil Sample Clay 2—CRM051 using internal standardization and PLS methods.
Certified ValueInternal Standard (n = 3)PLS Model (n = 3)
AsPbAsPbAsPb
Mean (mg kg−1)45.568.155.379.251.263.2
SD 14.457.66.2311.27.238.23
RSD 2 (%)9.7811.1611.2714.1414.1213.02
p (α = 0.05)--0.0770.2150.310.483
1 Standard deviation; 2 relative standard deviation.
Table 10. Statistical summary of As and Pb concentrations (mg kg−1) in soils from Itata Valley.
Table 10. Statistical summary of As and Pb concentrations (mg kg−1) in soils from Itata Valley.
MedianMinPercentileMaxBackground
Upper Limit
Outliers (%)
525507595
As2.340.560.921.242.343.454.725.010.8395.8
Pb22.56.027.4513.022.531.142.745.56.9095.6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Medina-González, G.; Medina, Y.; Muñoz, E.; Andrade, P.; Cruz, J.; Rodriguez-Gallo, Y.; Matus-Bello, A. Integration of Slurry–Total Reflection X-ray Fluorescence and Machine Learning for Monitoring Arsenic and Lead Contamination: Case Study in Itata Valley Agricultural Soils, Chile. Processes 2024, 12, 1760. https://doi.org/10.3390/pr12081760

AMA Style

Medina-González G, Medina Y, Muñoz E, Andrade P, Cruz J, Rodriguez-Gallo Y, Matus-Bello A. Integration of Slurry–Total Reflection X-ray Fluorescence and Machine Learning for Monitoring Arsenic and Lead Contamination: Case Study in Itata Valley Agricultural Soils, Chile. Processes. 2024; 12(8):1760. https://doi.org/10.3390/pr12081760

Chicago/Turabian Style

Medina-González, Guillermo, Yelena Medina, Enrique Muñoz, Paola Andrade, Jordi Cruz, Yakdiel Rodriguez-Gallo, and Alison Matus-Bello. 2024. "Integration of Slurry–Total Reflection X-ray Fluorescence and Machine Learning for Monitoring Arsenic and Lead Contamination: Case Study in Itata Valley Agricultural Soils, Chile" Processes 12, no. 8: 1760. https://doi.org/10.3390/pr12081760

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop