1. Introduction
Soil is a complex medium composed of minerals, organic matter, air, and water. Among these constituents, soil moisture is one of the most critical variables in agricultural and hydrological processes because it governs the exchange of water and energy between the land surface and atmosphere. Soil water content (SWC), calculated as the volumetric proportion of water contained within soil pores, largely dictates the soil’s physical and mechanical behavior [
1]. As a key nexus variable linking weather, climate, ecosystem dynamics, and surface energy balance, SWC underpins feedback mechanisms related to climate change and is indispensable for predicting landslides and soil erosion [
2,
3]. Recent research shows that accurate knowledge of soil moisture can enhance climate prediction modeling, drought monitoring, yield forecasting, and crop growth management [
4,
5,
6].
Soil water data can be collected using several methods. The oven-drying method remains the canonical laboratory procedure for quantifying soil moisture, while neutron scattering, time-domain reflectometry, capacitance probes, and frequency-domain reflectometry are widely used alternatives [
7,
8,
9,
10]. However, the accuracy of these techniques is strongly soil-dependent and easily affected by texture, salinity, porosity, and organic matter, and their deployment is often limited by high cost, intricate installation, and site-specific calibration requirements [
11,
12,
13]. Moreover, conventional measurements are typically obtained at the point or plant scale, restricting their applicability to larger areas [
14].
To bridge these spatial gaps, researchers have interpolated point observations [
15] and developed diverse approaches for large-area monitoring, including remote sensing. Internet-of-Things sensor networks now deliver real-time data streams linking soil moisture to yield and simultaneously record temperature, nitrogen, and other agronomic indicators [
16,
17]. Shallow soil moisture retrieval using ground-penetrating radar has also been explored; however, this method depends on subsurface reflectors and a stable dielectric–moisture relationship, making generalization across soil types difficult [
18]. Satellite platforms have achieved regional-to-global estimates of surface soil moisture, and recent work has combined vegetation indices with thermal infrared data for efficient retrievals [
19,
20,
21]. Moreover, open-source tools have demonstrated the potential of automated satellite imagery processing, allowing large-scale agricultural monitoring [
22]. Nevertheless, scale mismatches between coarse satellite footprints and ground truth can lead to substantial errors [
6].
Therefore, unmanned aerial vehicles (UAVs) with RGB cameras have gained traction for cost-effective mesoscale mapping. Many UAV studies have employed linear regression–based statistical models for moisture estimation [
23,
24]. Soil reflectance in the visible range (≈400–700 nm) typically decreases as soil water content rises because thin water films suppress multiple scattering and increase the effective refractive index, lengthening photon path lengths and enhancing absorption by pigments and organics. Consequently, soil generally darkens across R, G, and B channels with increasing moisture. Moreover, linear relationships with both wavelengths and reflectance were reported in a domain of 5-25% of SWC, which indicates that simple linear models could be useful in these data [
25,
26,
27].
However, because image data are sensitive to ambient illumination, sensor quality, and environmental conditions, ensuring robust accuracy and reliability is essential. A basic attempt to achieve this goal is to use different color spaces or light ranges. While recent studies show that a variety of color spaces can be used when analyzing an image, RGB and HSV color spaces are considered highly related to SWC and are therefore widely used [
28,
29,
30]. Camera-based SWC estimation using RGB/HSV under controlled or field illumination has been repeatedly demonstrated, and recent studies emphasize illumination or color calibration as a prerequisite for reliable color–moisture links [
31,
32]. Consequently, numerous studies now fuse traditional color-space statistics with machine learning algorithms to capture the nonlinear relationship between spectral signatures and SWC [
32,
33].
Machine learning approaches leverage large training sets to model complex, nonlinear links between moisture and remote-sensing variables. Beyond numerical simulations of infiltration and subsurface redistribution [
34], recent UAV-based works have combined RGB and thermal imagery with techniques such as Least Absolute Shrinkage and Selection Operator (LASSO), support vector machines, and gradient-boosted models to predict moisture at multiple depths [
35,
36]. However, performance often degrades in topographically complex terrain. For example, both Soil Moisture Analytical Relationship (SMAR) and regression-based models yield accurate root-zone soil moisture at calibrated sites, but their accuracy varies significantly with terrain and season [
37]. High accuracy also hinges on abundant, region-specific training data, which can be operationally burdensome [
34,
38]. Crucially, missing soil moisture observations cannot be resampled, and even oven drying can cause evaporative losses during sample handling, complicating the acquisition of precise target moisture levels.
To mitigate these challenges, this study proposes an alternative strategy that exploits color-space histogram analysis and interpolation. We assessed its reliability using the dice score (DS), Bhattacharyya distance (BD), and Earth Mover’s Distance (EMD) [
39,
40]. Compared with machine learning models, this approach requires fewer computational resources, and relative to oven drying and other methods, it saves considerable time, labor, and cost.
2. Materials and Methods
2.1. Soil Sampling
Soil samples were collected from an agricultural parcel in Bubuk-myeon, Miryang-si, Republic of Korea (
Figure 1). This field, typically used for forage crops, rice, and soybeans, was fallow at the time of sampling, and the soil type is shown as silt loam according to the USDA (United States Department of Agriculture Soil Taxonomy) standards. Immediately after collection, the samples were sealed in airtight containers and transported to the laboratory. Upon arrival, coarse organic debris such as roots and leaves was removed. The soil was then oven-dried for 24 h at 105 °C to achieve constant mass. Gravel and other coarse fractions were eliminated by sieve separation, retaining material with a particle diameter of ≤2.00 mm. Finally, approximately 210 g of the processed soil was placed in a square acrylic mold (80 mm × 80 mm, 22 mm high), and its surface was leveled with a tamping rod.
For each moisture level, water mass (W
w, g) corresponding to a prescribed percentage of the dry mass (W
d, g) was added using a pipette, with the moisture content increasing in 3% increments, and the soil was thoroughly mixed. After a 5 min equilibration period to allow uniform wetting, a surface image was recorded. All data were collected on 16 December 2024.
2.2. Image Acquisition Setup and Preprocessing
To ensure uniform imaging conditions, a dedicated studio was constructed, as shown in
Figure 2. The acrylic mold was centered beneath two ceiling-mounted LED lamps positioned 30 cm above the sample to minimize shading. Images were captured from above using a remotely triggered smartphone camera (Samsung Galaxy S22 Ultra, Suwon, Republic of Korea) [
41]. Key camera parameters are summarized in
Table 1. Remote operations eliminated motion blur and guaranteed consistent image quality.
The region of interest was restricted to the soil surface by cropping the acrylic frame, background, and peripheral areas. The resulting image tiles measured 4000 × 4000 pixels (
Figure 3a).
While the intended SWCs were 3%, 6%, 9%, 12%, 15%, 18%, and 21% (on a dry mass basis), the actual values obtained were 3.83%, 5.33%, 6.75%, 8.41%, 10.32%, 11.91%, 13.16%, 14.83%, 15.97%, 17.26%, 19.17%, and 19.75% (12 images in total;
Figure 3b). Deviations from the target levels and the unequal spacing between successive contents were attributed to evaporative loss, imperfect homogenization, and the high sensitivity of the small sample mass to incremental water additions.
2.3. Processing and Interpolation of Histograms
This study employs both the RGB and HSV color spaces. However, because the interpolation method is identical for both, we illustrate the procedure using RGB as an example. Each pixel contains red, green, and blue intensities; thus, an empirical histogram can be computed for every channel. Because soil color varies continuously with added moisture, the color statistics associated with an intermediate target content can, in principle, be inferred from neighboring images.
Nevertheless, raw histograms do not follow ideal normal shapes, and both skewness and dispersion differ across channels. Therefore, direct interpolation of these empirical distributions yields poor estimates. Accordingly, for each channel, the arithmetic mean (μ) and standard deviation (σ) were extracted, and a theoretical normal probability density function (PDF) was generated under the assumption of Gaussian behavior. The PDF was then scaled so that its peak height matched that of the observed histogram, which will be hereinafter referred to as the “tuned histogram” (
Figure 4). A fine adjustment of the tail ratio ensured optimal alignment between the two curves, thereby capturing the moisture-induced color shift more faithfully. Finally, synthetic pixel values were drawn from the scaled PDF to create artificial images (
Figure 5).
2.4. Generation of Target Moisture Images via Linear Interpolation
To reproduce the color statistics corresponding to an arbitrary target SWC, linear interpolation was applied channel-wise to three descriptive statistics: mean (
), standard deviation (
), and peak height (
). Two reference images—A and C—with SWCs W
A and W
C that bracket the desired level W
B were selected, where
<
<
. The distance ratio (
) was computed as follows:
The statistics for the target image were estimated as
Assuming near-Gaussian behavior, the PDF for each channel was drawn according to and and then rescaled according to . Random sampling produced replicable RGB and HSV images representative of the target SWC.
2.5. Similarity Assessment Between Histograms
Because all histograms share an identical domain (0–255 for R, G, B, S, V; 0–199 for H) but differ in total pixel count after probability conversion, scaling, and interpolation, their similarity was quantified using two complementary metrics: DS, which is insensitive to absolute scaling; BD, which compares the shapes of the normalized PDFs; and EMD, which could show distribution change.
2.5.1. Dice Score
DS gauges the overlap between two distributions based on the harmonic mean of their intersection and individual magnitudes as follows:
In short, DS is calculated by dividing twice the number of overlapping pixels between two histograms by the total number of pixels in both histograms. Values of DS range from 0 to 1, while those approaching 1 indicate high similarity [
39].
2.5.2. Bhattacharyya Distance
Let
and
be the normalized probabilities in bin i for the two histograms. The Bhattacharyya coefficient (BC) is expressed as
and the corresponding BD is
Values of BD approaching 0 indicate high similarity, whereas values near 1 signify substantial divergence [
39].
2.5.3. Earth Mover’s Distance
EMD was used to compare two normalized signatures
and
, where
, and
are bin centroids with frequencies of
and
. Let
be the ground distance between
and
and the flow that minimizes the total cost between the two be
.
The optimal flow
solves the following:
Values of EMD decreasing to 0 indicate high similarity, and increasing to 1 indicates the opposite [
40].
2.5.4. Derivation of the Similarity Metric Threshold and Statistical Analysis
Using a new dataset of seven original soil-surface images ordered by SWC, we estimated a threshold that reflects natural variability among distinct SWC states (
Figure 6). Consecutive images were treated as near neighbors, and the threshold was set to the smallest value observed among the similarity values calculated by the neighboring pairs. This choice yields a strict, utility-oriented cutoff: a generated image must be at least as close to its target as any two different originals ever are to each other. Although the SWC spacing in this new dataset is uneven and sometimes wider than in the original study, several pairs like 0% and 2.63%, 2.63% and 5.17%, and 14.67% and 18.42% remain visually close, making this cutoff conservative but informative.
In evaluation, a generated image is deemed “similar” to its target original if its R, G, and B similarity results do not exceed the threshold. For higher-is-better scores such as dice, the similarity results should exceed the threshold. The threshold is pre-specified from original–original comparisons only and is not tuned on generated images, avoiding circularity.
We evaluated similarity against pre-specified thresholds using one-sided Wilcoxon signed-rank tests (DS: DS − TDS > 0; BD/EMD: T − metric > 0; significance levels: p < 0.05 (***), p < 0.01 (**), p < 0.001 (*), ns = not significant). Channel effects within each color space (RGB; HSV) were assessed with Friedman tests followed by Wilcoxon signed-rank post hoc comparisons with Holm correction. Monotonic associations with soil water content (SWC) were analyzed using Spearman’s ρ. We contrasted an operating window (10.32–14.83% SWC) against other conditions using one-sided Mann–Whitney U tests on threshold-centered differences. For Wilcoxon tests, we report the test statistic (W), p-value, and Hodges–Lehmann median difference with 95% bootstrap confidence intervals. Statistical significance was set at α = 0.05 (two-sided unless otherwise stated). All analyses were nonparametric (n = 10 per series).
4. Discussion
This study demonstrates that the RGB and HSV histograms of soil surface images can be roughly modeled using channel-wise Gaussian PDFs parameterized by their empirical mean and SD. After additional scaling via SD to match the peak height of the histogram of the original image, linear interpolation of these parameters enables reliable reconstruction of missing soil water levels. Then, we evaluated a histogram-based interpolation framework that generates intermediate soil-color distributions as a function of SWC. Using RGB and HSV color spaces and three complementary similarity metrics, we found clear, channel-specific behaviors that are consistent across datasets.
First, luminance-linked channels such as R and V were the most stable. DS remained high across the SWC range, BD increased only beyond mid-SWC, and EMD exceeded its threshold chiefly at the maximum SWC, indicating strong global overlap, modest shape divergence, and limited drift until late stages. G was reliable at low–mid SWC but exhibited earlier drift: EMD crossed its threshold from the low-teens, and BD later signaled growing shape mismatch. By contrast, B and H were structurally fragile: B showed persistent drift and illumination sensitivity, and H failed almost universally under a stringent EMD threshold and is intrinsically unstable at low saturation due to circular wrap-around, which seemed to originate from the point that the histogram of H itself is highly concentrated in a narrow range.
Second, the metrics captured different but convergent signals. DS was permissive until global overlap degraded, BD flagged peak/tail mismatches earlier at higher SWC, and EMD provided the earliest warnings of distributional shift, which was notable for G. BD and EMD were strongly correlated channel-wise, indicating that shape change and position shift often co-occur. Change-point analyses identified a behavioral transition near ~10–12% SWC. EMD growth slowed after ~10%, whereas BD steepened after ~12%, suggesting early relocation of distributions followed by progressive reshaping. Accordingly, cases with EMD rising while DS remains high indicate dominant mean shift, BD rising with DS high indicates dispersion/shape change, and DS dropping signals global overlap loss, providing a diagnostic map from metric patterns to underlying distributional causes.
Third, the most consistent agreement across metrics and color spaces occurred between 10.32% and 14.83% SWC. In this window, S and V were simultaneously strong, R passed throughout, and G passed in most intervals, yielding the highest multi-metric, multi-channel consensus. Beyond ~16% SWC, both BD and EMD deteriorated (particularly V_BD, V_EMD, and G_EMD), reducing similarity confidence. High consistency across metrics and color spaces in this window indicates that both location and shape are preserved, supporting the robustness of histogram-based interpolation for SWC retrieval in this regime.
Limitations and avenues for improvement are clear. Thresholds were strictly calculated, and H is penalized by both circular geometry and low saturation; adopting a circular EMD with saturation-weighted distances would yield fairer hue judging. B’s sensitivity to illumination and white balance argues for color-constancy and brightness-standardization preprocessing before assessment. Finally, while Gaussian tuning captured central tendencies well, channels exhibiting nonmonotonic or multi-modal behavior may benefit from mixture models or nonlinear/quantile-based interpolation and even evolve into a better model that could predict inconsistent changes in color space histograms.