Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques

Cryer, Steven; Raymond, John

doi:10.3390/fluids9020040

Open AccessArticle

Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques

by

Steven Cryer

^1,*

and

John Raymond

²

¹

Advanced Modeling and Artificial Intelligence, Corteva Agriscience, Indianapolis, IN 46268, USA

²

Crop Protection Discovery and Development, Corteva Agriscience, Indianapolis, IN 46268, USA

^*

Author to whom correspondence should be addressed.

Fluids 2024, 9(2), 40; https://doi.org/10.3390/fluids9020040

Submission received: 12 December 2023 / Revised: 23 January 2024 / Accepted: 27 January 2024 / Published: 1 February 2024

(This article belongs to the Special Issue Multiphase Flow for Industry Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The atomization of liquid spray solutions through nozzles is a mechanism for delivering many pesticides to the target. The smallest drop sizes (<150 μm) are known as driftable fines and have a propensity for wind-induced convection. Many agricultural applications include oil-in-water formulations. The experimental metrics obtained from spray images of these formulations include the distance from the nozzle origin to the drop centroid once a drop has formed; the hole location and surface area for holes that form in the liquid sheet (all hole areas approximated as polygons); the angles formed between polygon segments (whose vertices are represented as boundary points); and the ligament dimensions that form from intersecting holes, such as the ligament aspect ratio (R/L), ligament length (L), and ligament radius (width), along with the number of drops a ligament breaks up into. These metrics were used in a principal component regression (PCR) analysis, and the results illustrated that 99% of the variability in the response variable (DT10) was addressed by 10 principal components. Angles formed by the colliding holes, hole distance from the nozzle, drop distance, hole number, ligament number, and drop number were negatively correlated to the atomization driftable fine fraction, while hole area, ligament distance, ligament area, and boundary area were positively correlated. Thus, to decrease/minimize driftable fines, one needs to increase the negatively correlated metrics.

Keywords:

atomization; spray patterns; drift; oil-in-water; driftable fines; agriculture

1. Introduction

Foliar adherence and drift propensity are important considerations for pesticide formulations. Two of the most important factors influencing the spray properties of an agricultural formulation are nozzle selection and adjuvant additions. Both factors have been well studied using a variety of experimental and computational techniques [1,2,3,4,5,6,7,8,9]. However, the spraying phenomenon largely remains an enigma and persists as an obstacle in the product formulation process. Flat fan spray nozzles operate by discharging a liquid sheet, which subsequently breaks up into droplets. In agricultural settings, droplets that are too small can become entrained in ambient air currents and be carried off target, while larger droplets often reduce pesticide coverage and efficacy.

Three distinct mechanisms of sheet breakup are observed based on the properties of the formulation being sprayed, the nozzle geometry, and process conditions. Mechanisms leading to sheet rupture include wave growth, rim breakup, and collisions of growing holes within the sheet in multiphase systems [10]. Many agricultural chemicals are dissolved in an oil phase and subsequently formulated as an emulsifiable concentrate that spontaneously forms an oil-in-water emulsion when mixed with water. Spray patterns of an oil-in-water emulsion (and breakup mechanisms) are very different from those of pure water in terms of the droplet breakup mechanism (e.g., flapping sheet vs. hole growth), as are the resulting droplet size distribution and mechanisms [10,11]. It is hypothesized that the oil drops in the multiphase system create nucleation sites for hole formation as the oil penetrates the water/air interface. An oil-in-water spray pattern can be segregated into three regimes: (i) continuous sheet nucleation, (ii) hole growth and collision, and (iii) ligament formation and breakup into drops [11,12,13].

High-speed imaging of the spray trajectory is one technique that has been used to study spray phenomena [14,15,16,17]. Typically, these efforts have attempted to serve as a proxy technology for establishing the droplet size distribution. Our proposed technique departs from previous efforts in both scope and approach. Rather than attempt to provide a numerical descriptor (e.g., droplet size, etc.), our objective is to empirically provide a navigable landscape with which to explore the potential formulation space (for a specific nozzle/pressure) visually and iteratively. By establishing a set of known reference formulations (e.g., solution controls as well as various commercial formulations) in the exploration landscape and projecting any new putative formulation onto this landscape, it is then possible to qualitatively establish where the new product formulation stands with respect to known formulations. The overall result of any iterative modifications can then be readily visualized.

2. Materials and Methods

High-speed videos were taken of atomization sprays, from which characteristic metrics were defined and explored [14,15]. Physical and empirical metrics were defined, and algorithms were written (Matlab v. 2019a or C++) such that the video images could be distinguished from each other based on the differences in image analysis (IA) metrics. Videos of 21 different oil-in-water formulations being sprayed through a conventional flat fan nozzle (XR8002, TeeJet, Glendale Heights, IL, USA) were taken at two different pressures (40 and 10 psi) and oil-phase concentrations (2.5 and 5.0%). Thus, the two-phase fluid being sprayed was mostly water with a small fraction (2.5–5%) of oil, which is representative of typical agricultural sprays. A HELOS Sympatec laser diffraction system (Sympatec GmbH, Clausthal-Zellerfeld, Germany) was used to measure the resulting droplet size distribution upon atomization. The laser was aligned at the center of the spray, located below the continuous sheet at 330 mm below the nozzle (thus, the sheet was fully broken up into droplets). The XR8002 flat fan nozzle is a conventional nozzle with a spray angle of 80°. Further details of the experimental system can be found elsewhere [12].

2.1. Numerical Descriptors

The image analysis toolbox of Matlab v. 2019 was used to program customized functions for determining physical metrics from a series of steady-state images logged by a high-speed video camera, as shown in Table 1. Each metric quantified a physical or geometrical feature(s) that characterizes the spray pattern (e.g., destabilization and breakup into drops of a continuous liquid sheet emanating from an agricultural flat fan nozzle). The nozzle in the images had a known size (as measured by a micrometer) and was used as the reference for all measurements taken from the image. The diameter of the ligament was assumed to be cylindrical, with the diameter given by the measured ligament width. Holes within the sheet were approximated as polygons. The detected features were the average angle of the polygons representing holes (angle); average distance from holes’ centroids to the nozzle (hole_dis); average area of holes (hole_area); average distance from ligaments’ centroids to the nozzle (liga_dis); average area of ligaments (liga_area); aspect ratio (average radius over length of ligaments, R/L); average distance from droplets’ centroids to the nozzle (drop_dis); average area of droplets (drop_area); average radius of droplets (drop_radius); average distance from continuous water sheets’ centroid to the nozzle (bdr_dis); average area of continuous water sheets (bdr_area); and average number of holes, ligaments, and droplets (hole_n, liga_n, and drop_n, respectively). The physical and empirical image analysis (IA) metrics presented in Table 1 were used in statistical analysis to uncover possible relationships to the droplet size distribution, specifically focusing on the driftable fine fraction (e.g., drop sizes < 150 μm in size).

2.2. Visual Navigable Landscape (Empirical)

As a proof of concept, a set of experiments was performed (distinct formulations and nozzle configurations) to generate high-speed videos for several spray events. At any given time point (i.e., a single video frame), the phenomenologically chaotic arrangement exhibited by the spray pattern can be characterized numerically using image texture features (e.g., entropy, fractal number, and statistical regularity; see Figure 1). Essentially, this technique strives to ascribe a descriptor of the image pattern that is numerically consistent within images of the same category (i.e., class) yet distinct from images of different categories [18]. For the most part, these computations are irreversible and noninterpretable, but they offer the ability to numerically categorize distinct, seemingly intractable image patterns, as shown in Table 2. The 21 videos resulted in 3281 images.

Ideally, a dynamic texture approach [18,19,20] would best describe the given system; however, here, we chose a simpler approximation scheme for the proof of concept. For each video, we randomly selected approximately 150 single frames, labeled each image with the corresponding formulation/nozzle combination (i.e., class), and then learned the best combination of image features to numerically categorize each image into its appropriate class. It is important to note that the more powerful dynamic texture approach extracts numeric features representing temporal changes in texture throughout the course of the video, whereas the current approach simply uses the texture at a static point in time. Therefore, any successful proof of concept for the single frame approach would support the assumption that the dynamic texture approach would offer better performance.

To account for experimental variability in the video capture (e.g., camera placement, lighting differences, nozzle geometry, etc.), each image frame was standardized so that the center of each nozzle lip was cropped and registered (e.g., aligned) with respect to all other frames. In addition, the background pixel intensities were suppressed. All image manipulation was performed using the CImg 1.5.1 C++ image handling libraries. The continuous sheet was characterized by the area (pixels), the farthest distance of the sheet from the nozzle before breakup, and by a centroid in relation to the origin (e.g., nozzle tip), as shown in Figure 1. Approximately 150 frames from each of the 21 video experiments (a subset of the 80 videos analyzed) were extracted for analysis. Different spray formulations and additives, such as solid particles, were investigated. Some of these 21 experiments were replicates, resulting in 17 unique classes.

Table 2. Representative empirical IA feature metrics determined for each frame in a video.

Empirical Feature (Metric)	Description
Run Length	Characterizes textures via patterns of sequence segments of similar pixel intensities (Tang, 1998 [21])
Haralick	Characterizes images via estimation of pixel entropy, pixel differences, etc. (Haralick et al., 1973 [22])
Simple statistics	Characterizes images via statistical values of pixels and their neighborhoods (e.g., average, variance, skew, kurtosis, etc.)
Lacunarity	Fractal-type descriptors for quantifying spatial complexity (Plotnick et al., 1996 [23]; Vernon-Carter et al., 2009 [24])

2.3. Experimental

The experimental setup consisted of an enclosed spray chamber, a nozzle system, a Sympatec laser, an LED lighting system, tripod supports, and a high-speed video camera. Other metrics obtained by image analysis are presented in Figure 2. Another method to quantify sheet breakup involves representing holes that form within the liquid sheet using a polygon approximation method for these holes or “blobs” (Figure 2). This simplified approach reduces the number of vertices needed to describe the hole and allows the polygon area and side lengths of a hole to be computed. Details of the experimental setup used in the paper can be found elsewhere [12]. Figure 3 illustrates the flowchart used to represent the image processing procedure.

2.4. Theoretical

Perforations in the liquid spray sheet lead to hole formation. These holes expand and collide, which leads to the formation of ligaments (Figure 4). These ligaments subsequently stretch and ultimately succumb to finite-amplitude disturbances and break up into drops. It is the breakup of multiple ligaments that leads to the atomization drop size distribution for the oil-in-water breakup mechanism [10]. Liquid ligaments are treated as liquid cylinders whose endpoints are either free or constrained (pinned). Details of this linear stability analysis can be found elsewhere [25]. IA metrics for ligaments include ligament length, radius, and the distance from the nozzle (origin), where they are located. The stability of the ligament is governed by the aspect ratio (R/L); thus, we can compare the observed ligament stability from IA metrics with theoretical calculations for the onset of ligament breakup [25].

3. Results

3.1. Visual Navigable Landscape (Empirical)

For each image, dozens of numerical image features were extracted ([21,24,26,27]. Using the class labels (e.g., formulation type), the image features were subjected to a feature selection algorithm that selects a small subset of features that are most correlated with the class label structure [28,29]. The 10 most correlated features were linearly combined (i.e., weighted and summed; principal components) and then used to classify each sample image. This was performed so that three distinct linear combinations were computed that best separate the classes into three dimensions [30], thus allowing the images to be “projected” into a 3D graphical visualization. The resulting projection for the 21-video subset using different formulations is depicted in Figure 5, with bre1, bre2, and bre3 representing the three principal components used. Separation between the class label structures (e.g., different formulation types) indicates that image analysis (IA) metrics could distinguish between atomization (breakup) patterns for all the formulations investigated in the navigable landscape exercise. Thus, metrics generated from images of spray patterns can be used as dependent parameters in data mining and statistical techniques and in offering insight/quantification for mechanistic modeling of sheet atomization.

3.2. Physical Image Analysis (IA) Metrics

Histograms for metric measurements for a single atomization video for a two-phase system (1% oil) using an AIXR 110° nozzle are provided in Figure 6. The metrics included the centroid of the drops following atomization, liquid ligaments, and holes within each video frame, along with the area and the centroid of the continuous sheet of liquid before succumbing to the instability responsible for sheet breakup. Lastly, the aspect ratio (R/L) for each of the many ligaments found in an image was measured and tabulated as a histogram (bottom right in Figure 6).

The breakup of a cylinder ligament is not instantaneous. In Altieri, Cryer, and Acharya [10], it was assumed mathematically that the Plateau–Rayleigh instability immediately destroyed the liquid ligaments that were formed. The presence of fluid webs seen in the images is evidence that this did not happen and can be attributed to two factors: (1) the rate of growth of the cylinder instability and (2) ligament stabilization by “pinning” cylinder ends. Constraining the ends of the ligament (approximated as a cylinder) to have fixed contact lines stabilizes the ligament (based on the linear stability theory [11]) when contrasted against a liquid cylinder with “natural” endpoints (Figure 7). Not only does the pinned growth rate decrease from the corresponding free case but the R/L value for the maximum growth rate also shifts. Thus, cylinders of fluid can grow longer before becoming unstable, and it takes longer for the disturbance to grow along the cylinder interface, which ultimately leads to cylinder breakup and drop formation.

Holes that formed within the continuous liquid sheet were approximated as polygons with up to 12 edges. The angles that formed between the polygon edges were interpreted as representing the number of distinct holes that collided (Figure 7). The angle between the polygon edges decreased as more holes collided. Thus, the representative histogram describing polygon angles for a hole is related to the number of colliding holes.

3.3. Statistical Analysis

Exploratory Data Analysis: On Physical Metrics

A data set with 14 detected features from the videos and five variables were determined after running our algorithm on this spray data. The 14 detected features were as follows: average angle of the polygons representing holes (angle); average distance from holes’ centroids to the nozzle (hole_dis); average area of holes (hole_area); average distance from ligaments’ centroids to the nozzle (liga_dis); average area of ligaments (liga_area); aspect ratio (average radius over length of ligaments, R/L); average distance from droplets’ centroids to the nozzle (drop_dis); average area of droplets (drop_area); average radius of droplets (drop_radius); average distance from continuous water sheets’ centroid to the nozzle (bdr_dis); average area of continuous water sheets (bdr_area); and average number of holes, ligaments, and droplets (hole_n, liga_n, and drop_n, respectively).

The five dependent variables predicted were the atomization drop sizes D10, D50, D90, f74, and f150. Here, D10, D50, and D90 are actual drop diameters for percentiles of 10, 50, and 90%, respectively, while f74 and f150 are percentages of drops below a size of 74 and 150 μm, respectively. A linear regression model was used to represent this relationship and clearly identify which feature(s) are important in determining the response variable. Data fitted into a linear regression model should meet the standards of (1) keeping the normality of each variable, (2) adding some possibly related new terms, and (3) eliminating the linear correlation between predictors. A representative statistical analysis using D10 as the dependent variable is provided. The regression model checked assumptions that require the error distribution to be normal (all predictors should be normally distributed such that the prerequisites of the linear model are satisfied). However, the relationship between one predictor and the response variable may not be a linear one, and if the local regression model is far from being linear, then quadratic terms are added. Quadratic terms for liga_dis, drop_dis, hole_n, and liga_n (outlined by red boxes in Figure 8) were added and identified as liga2, drop2, hn2, and ln2, respectively. The distributions for hole_area, bdr_area, ln2, and hn2 did not follow an obvious normal distribution, but the logarithm for these original variables was normally distributed.

Table 3 summarizes the parameters that were log transformed to meet normality constraints. A linear regression model will not be a good choice if the response variable is not normally distributed. However, we can find some transformations to convert the data into normal data via a Box–Cox transformation

\frac{y^{λ} - 1}{λ}

(equivalent to taking the logarithm while parameter λ = 0). Here,

y

is the response variable, and λ is the transformation parameter. So, the probability density function for a sample

(y_{1}, \dots, y_{n})

is

P (y_{1}, \dots, y_{n}) = \prod_{i = 1}^{n} \frac{{y_{i}}^{λ} - 1}{λ}

. In this way, the Jacobian can be written as follows:

(λ; y_{1}, \dots, y_{n}) = \prod_{i = 1}^{n} | d y_{i}^{(λ)} / d y | = \prod_{i = 1}^{n} y_{i}^{λ - 1} = G M {(y)}^{n (λ - 1)}

Here, GM represents the geometric mean, so the normal log likelihood at its maximum is as follows:

\log (L (\hat{μ}, \hat{σ})) = - \frac{n}{2} (\log (2 π {\hat{σ}}^{2}) + 1) + n (λ - 1) \log (G M (y)) = - \frac{n}{2} (\log (\frac{2 π {\hat{σ}}^{2}}{G M {(y)}^{2 (λ - 1)}}) + 1)

Minimizing the sum of square residuals is equivalent to maximizing the sum of the normal log likelihood of deviations from

\frac{y^{λ} - 1}{λ}

as well as the log of the Jacobian of the transformation. The best prediction for each response variable selected (proper transformation) is summarized in Table 4.

3.4. Principal Component Analysis (PCA)

The newly transformed data for both the predictors and response variables (e.g., D10) followed an approximately normal distribution. The next step was to eliminate the linear correlation between predictors using the classical method of principal component analysis (PCA) because PCA creates a new set of orthogonal linear combinations of the original predictors. The advantages of PCA include the following:

Orthogonal linear combinations compose a new feature space such that each feature is linearly uncorrelated with others.
Principal components (PC) are listed in order. The order describes the variance in the data that each component can explain.

A transforming matrix was obtained while running PCA to deliver information on how the new features are composed (and these new features are easily transformed back to the original feature set). A score plot is a representative example for the first two principal components (PC), as shown in Figure 9, where the x-axis (Dim 1) and y-axis (Dim 2) represent principal components 1 and 2, respectively. PCs are orthogonal and linearly independent of each other. Figure 9 is useful to visualize the original predictor variable representation on different principal components. For example, if one takes PC1 and PC2 as the graphical axes, then one can find that the hole distance (variable) is almost horizontal (PC1) but is not related to PC2. In addition, the vector representing the variable of ligament distance (liga_dis) in Figure 9 has a length of 0.9 and forms a

π / 4

angle with both the x and y axes. This implies both PC1 and PC2 contain

0.9 \times c o s (\frac{π}{4})

of liga_dis in their final representations.

Group clusters indicate the projection of each sample observation on the plane formed by PC1 and PC2. All the principal components form an orthogonal space, and each sample observation represents a point in this space. PC1 and PC2 will form a plane. After performing PCA, 15 new variables (PCs) were generated. It was found that 10 of the new PC variables could explain more than 99% of the variance in the predictors. Therefore, the first 10 PCs were kept in the analysis (e.g., PC1, PC2, …, PC10).

First, a full linear regression model was fitted to the data set to check whether linear regression requirements were satisfied. Figure 10 is the diagnostic plot for the full linear regression model. The diagnostic plot consisted of four parts: residual versus fitted plot, scale–location plot, normal Q–Q plot, and residual versus leverage plot and is used to evaluate whether the data and model assumptions of linear regression are satisfied. The residual versus fitted plot was relatively flat and had a magnitude of approximately absolute zero (meaning the data had an average value of 0, which implies the data followed a linear model). The normal Q–Q plot was almost a straight line, meaning that the normality condition (the error term in the linear regression model should follow a normal distribution) was true for our data set (an important assumption for building linear regression models). The scale–location plot shows the square root of the standardized residuals as a function of the fitted values. No obvious trend in this plot means the linearity of the data was satisfied. Finally, the fourth plot, the residuals versus leverage plot, shows outliers in the sample set. Outliers are defined by Cook’s distance, and usually one chooses 0.5 or 1 as the threshold. All points were located inside the (−0.5, 0.5) region, which means there were no outliers in the sample data set. The results summarized in Figure 10 imply that a linear regression model is reasonable.

A metric to balance accuracy and complexity is called AIC. Suppose there exists a full model that includes all predictors and a null model that includes no predictors. Between these two models, one can pick a set of predictors to build a linear regression model and obtain a corresponding AIC value. The model with the best AIC value is the final model that is used. A threefold cross-validation approach was used to test the robustness of this new AIC model, and the regression lines for each fold were similar, which implies that the model had similar performance on all three randomly picked subsets. Thus, one can conclude the final linear regression model is robust because it passed the cross-validation test (the finalized model is the one that used PCs), as shown in Table 5. Using the rotation matrix between the PCs and original features, we could translate the PC model back into original features (Table 6).

3.5. Individual Factor Analysis

The slope of the factor plots is indicative of the relative sensitivity of the IA metric, and a positive slope indicates the response variable (e.g., log(D10)) increases as the parameter increases and vice versa (Table 7). A short synopsis of the response variable is provided, with the drop angle of the holes that were formed being the most sensitive. As the angles formed within a hole increase, driftable fines decrease (this is related to the number of colliding holes). As more holes collide, more sides of a polygon will be needed to characterize the hole (and thus the polygon angles will decrease). Also, as the distance between the hole centroids and the nozzle increases, the driftable fines also decrease. Therefore, it is reasonable that a larger hole distance leads to a smaller response variable log(D10) (thus, we want holes to form near the nozzle, which corroborates well with the two-hole model results [11]). The angle used to approximate a hole is important and has been found to be the most sensitive parameter correlating to the prediction of driftable fines.

The current experimental observations for driftable fine reductions indicate that an upper limit for atomization drop sizes is achieved when air induction nozzles are used. This is likely attributed to the fact that the control of hole formation is lost or masked by the air being entrained into the sheet (which is an alternative way to control the number of holes that are colliding). There are several cross terms in this screening model that are sensitive, and all have an angle in the cross term. The average distance from the nozzle to the centroid of the continuous liquid sheet (Bdr_dis) analysis suggests that as this distance increases, driftable fines increase slightly. However, when one multiplies the coefficient for Bdr_dis with a scalar, then this parameter is not sensitive for DT10. Also, the number of holes in the continuous fluid sheet (Hole_n) decreases the driftable fines as this parameter increases. This is because more holes imply completeness of breakup, and smaller droplets occur; thus, the driftable fine fraction decreases. Analogous findings occur for the number of ligaments (Liga_n).

As the number of drops the sheet breaks up into increases, the number of driftable fines likewise decreases. More drops in this case means a higher fraction of smaller drops; thus, we want to keep the drop numbers to a minimum (e.g., the bigger drops). The more drops we have, the more completely the liquid sheet breaks up. The completeness of the breakup implies smaller droplet sizes.

Parameters that were found to be positively correlated were hole area, ligament distance, ligament area, and the biggest blog area. Thus, as the parameters increase in magnitude, so does the driftable fine fraction. A larger hole area is indicative of the incompleteness of the breakup procedure, so it leads to larger driftable fines. It is not intuitively clear, but the ligament distance indicates that only the short and chubby ligaments are seen a long way from the nozzle, and if this is the case, then the driftable fine fraction should decrease. An increase in ligament distance shows that ligament formation occurs, which means the breakup procedure is not complete, so an increase in drop size can be expected. A larger driftable fine fraction is implied as the ligament area increases because drops are formed when ligaments break up.

The average area of the continuous water sheet (biggest blob) is given by Bdr_area, which indicates a positive correlation to driftable fines. However, by looking at the scale and coefficient of the variable in the final model, this is not a sensitive parameter. Also, the square of the variables for ligament number and hole number should lead to a decrease in driftable fines, but the results suggest a slight increase in driftable fines. R/L, drop distance from the nozzle, and (ligament distance)² and (droplet distance)² are not sensitive parameters.

A comparison of the coefficients of each predictor in the linear regression model is given in Figure 11. The magnitude of each correlation is related to parameter sensitivity. A positive coefficient indicates that increasing the parameter increases D10 (with the converse also being true). The IA metrics found to be the most sensitive in predicting D10 were log(hole number²) and log(hole area). Details of deep learning and other software approaches are discussed in other articles [12,13].

4. Discussion

Photographs of atomization spray patterns for two-phase systems showed rich and diverse patterns that were dependent not only on the geometries/types and operating conditions of the nozzle but also on the composition of the formulation and its attributes, such as interfacial tensions between various phases, wettability, etc. Thus, metrics generated from images of spray patterns can be used as dependent parameters in data mining and statistical techniques and in offering insight/quantification for mechanistic modeling of sheet atomization. PCR analysis showed that 99% of the variability in the response variable for atomization droplet size (DT10) was addressed by 10 principal components. Sensitive parameters included the angles formed by colliding holes within the atomization sheet, hole distance from the nozzle, drop distance, hole number, ligament number, and drop number, which were negatively correlated to the atomization driftable fine fraction (drops smaller than 150 μm), while hole area, ligament distance, ligament area, and boundary area were positively correlated. Image analysis and machine learning techniques suggested the most sensitive metrics included log(hole number²) and log(hole_area), followed by hole_dis and log(ligament number²). Some variables that had a significant impact on the variance of the data were the angles that formed when holes collided and the log of hole area (log(hole_area)). Based on the coefficient and significance analysis and the analysis of variance, the variables angle, log(hole_area), drop_dis², hole_dis, and log(ligament number²) were found to be the five most important predictors that could be used to quantify the breakup of oil-in-water formulations into individual droplets, Table 8. PCA/PCR and linear regression were utilized to substantiate the findings in this paper.

Author Contributions

S.C. and J.R. are the principal investigators for this work and are responsible for all aspects, including data generation, analysis, and writing of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Corteva Agriscience.

Data Availability Statement

The data are not publicly available due to the confidentiality of the pesticides used.

Conflicts of Interest

Authors Steven Cryer and John Raymond were employed by the company Corteva Agriscience. All authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Vernay, C. Destabilization of Liquid Sheets of Dilute Emulsions. Ph.D. Dissertation, Universite de Montpellier, Montpellier, France, 2015. [Google Scholar]
Heidary, A.; Douzals, M.; Sinfort, J.; Vallet, A. Influence of spray characteristics on potential spray drift of field crop sprayers: A literature review. Crop Prot. 2014, 63, 120–130. [Google Scholar] [CrossRef]
Dorr, G.J.; Hewitt, A.J.; Adkins, S.W.; Hanan, J.; Zhang, H.; Noller, B. A comparison of initial spray characteristics produced by agricultural nozzles. Crop Prot. 2013, 53, 109–117. [Google Scholar] [CrossRef]
Donkersley, P.; Nuyttens, D. A meta analysis of spray drift sampling. Crop Prot. 2011, 30, 931–936. [Google Scholar] [CrossRef]
Nuyttens, D.; Baetens, K.; De Schampheleire, M.; Sonck, B. Effect of nozzle type, size and pressure on spray droplet characteristics. Biosyst. Eng. 2007, 97, 333–345. [Google Scholar] [CrossRef]
Ellis, M.B.; Tuck, C. How adjuvants influence spray formation with different hydraulic nozzles. Crop Prot. 1999, 18, 101–109. [Google Scholar] [CrossRef]
Kluza, P.A.; Kuna-Broniowska, I.; Parafiniuk, S. Modeling and Prediction of the Uniformity of Spray Liquid Coverage from Flat Fan Spray Nozzles. Sustainability 2019, 11, 6716. [Google Scholar] [CrossRef]
Sijs, R.; Kooij, S.; Holterman, H.J.; van de Zande, J.; Bonn, D. Drop size measurement techniques for sprays: Comparison of image analysis, phase Doppler particle analysis, and laser diffraction. AIP Adv. 2021, 11, 015315. [Google Scholar] [CrossRef]
Xiao, L.; Liu, M.; Zhu, H.; Cai, J.; Lin, J. Spray Droplet Size Characteristics of Different Biological Pesticides with Different Hydraulic Nozzle. Trans. Chin. Soc. Agric. Mach. 2018, 49, 100–106. [Google Scholar]
Altieri, A.; Cryer, S.A.; Acharya, L. Mechanisms, Experiment and Theory of Liquid Sheet Breakup and Drop Size from Agricultural Nozzles. At. Sprays 2014, 24, 695–721. [Google Scholar] [CrossRef]
Altieri, A.L.; Cryer, S.A. Rupture of sprayed emulsions from fan nozzles using a hole kinematics model. Biosyst. Eng. 2018, 169, 104–114. [Google Scholar] [CrossRef]
Li, H.; Cryer, S.A.; Acharya, L.; Raymond, J. Raymond. Video and image classification using atomisation spray image patterns and deep learning. Biosyst. Eng. 2020, 200, 13–22. [Google Scholar] [CrossRef]
Li, H.; Cryer, S.A.; Raymond, J.; Acharya, L. Interpreting Atomization Spray Image Patterns Using Latent Dirichlet Allocation Techniques. Artif. Intell. Agric. 2020, 4, 253–261. [Google Scholar] [CrossRef]
Vulgarakis Minov, S.; Cointault, F.; Vangeyte, J.; Pieters, J.; Nuyttens, D. Spray nozzle characterization using a backlighted high speed imaging technique. In Proceedings of the 8th Biennial Conference on International Advances in Pesticide Application, Oxford, UK, 8–10 January 2014; Volume 122, pp. 353–361. [Google Scholar]
Lu, P.; Li, R.; Wu, E. Research on the testing technology for the water-saving performance of shower nozzle based on image analysis. In Proceedings of the International Conference on Image Processing and Pattern Recognition in Industrial Engineering, Xi’an, China, 7–8 August 2010; p. 782011. [Google Scholar]
Choo, Y.; Kang, B. Extraction of sizes and velocities of spray droplets by optical imaging method. KSME Int. J. 2004, 18, 123601245. [Google Scholar]
Zhang, D.; Islam, M.M.; Lu, G. A review on automatic image annotation techniques. Pattern Recognit. 2012, 45, 346–362. [Google Scholar]
Zhu, Z.; You, X.; Yu, S.; Zou, J.; Zhao, H. Dynamic texture modeling and synthesis using multi-kernel Gaussian process dynamic model. Signal Process. 2016, 124, 63–71. [Google Scholar]
Gonclaves, W.N.; Machado, B.B.; Bruno, O.M. A complex network approach for dynamic texture recognition. Neurocomputing 2015, 153, 211–220. [Google Scholar]
Xu, Y.; Quan, Y.; Ling, H.; Ji, H. Dynamic texture classification using dynamic fractal analysis. In Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 6–13 November 2011; pp. 1219–1226. [Google Scholar]
Tang, X. Texture information in run-length matrices. IEEE Trans. Image Process. 1998, 7, 1602–1609. [Google Scholar] [CrossRef]
Haralick, R.M.; Shanmugam, K.; Dinstein, I.H. Textural Features for Image Classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef]
Plotnick, R.; Gardner, R.; Hargrove, W.; Prestegaard, K.; Perlmutter, M. Lacunarity Analysis: A General Technique for the Analysis of Spatial Patterns. Phys. Rev. E 1996, 53, 5461. [Google Scholar] [CrossRef]
Vernon-Carter, J.; Lobato-Calleros, C.; Escarela-Perez, R.; Rodriguez, E.; Alvarez-Ramirez, J. A suggested generalization for the lacunarity index. Phys. A Stat. Mech. Its Appl. 2009, 388, 4305–4314. [Google Scholar] [CrossRef]
Cryer, S.A.; Altieri, A.L. Use of Large Inhomogeneities in Spray Solutions to Initiate Liquid Sheet Rupture. Biosyst. Eng. 2017, 163, 103–115. [Google Scholar] [CrossRef]
Pietikäinen, M.; Ojala, T.; Xu, Z. Rotation-invariant texture classification using feature distributions. Pattern Recognit. 2000, 33, 43–52. [Google Scholar] [CrossRef]
Gipp, M.; Marcus, G.; Harder, N.; Suratanee, A.; Rohr, K.; König, R.; Männer, R. Haralick’s Texture Freatures Computation Accelerated by GPUs for Biological Applications. In Modeling, Simulation and Optimization of Complex Processes; Springer: Berlin/Heidelberg, Germany, 2012; pp. 127–137. [Google Scholar]
Brown, G.; Pocock, A.; Zhao, M.-J.; Luján, M. Conditional likelihood maximisation: A unifying framework for information theoretic feature selection. J. Mach. Learn. Res. 2012, 13, 27–66. [Google Scholar]
Butterworth, R.; Simovici, D.A.; Santos, G.S.; Ohno-Machado, L. A greedy algorithm for supervised discretization. J. Biomed. Inform. 2004, 37, 285–292. [Google Scholar] [CrossRef]
Wang, G.; Zhang, H.; Zhang, Z.; Lochovsky, F.H. A bernoulli relational model for nonlinear embedding. In Proceedings of the Fifth IEEE International Conference on Data Mining, Houston, TX, USA, 27–30 November 2005; p. 8-p. [Google Scholar]

Figure 1. Example of a standardized video frame image representation of the continuous liquid sheet from a flat fan nozzle before breakup for nucleation and collision of holes.

Figure 2. Physical image analysis features determined for each frame in a video.

Figure 3. Flowchart representing image processing procedure.

Figure 4. Polygon-shaped holes that form in the breakup of oil-in-water formulations. Red and blue polygons represent holes that form in the liquid sheet.

Figure 5. Statistical projection of each formulation/nozzle image frame (from two different vantage points).

Figure 6. Metrics summarized from a video (1% oil in the liquid, 110 degrees for the spray angle, and AIXR as the nozzle type). (a) Centroid location for the continuous liquid sheet (e.g., biggest blob), (b) area of the biggest blob, (c) centroid location for each hole that initially forms inside the biggest blob, (d) ligament aspect ratio (R/L).

Figure 7. Example of hole collision from a video image and interpretation of the angles that form (e.g., polygon) whose magnitude is dependent on the number of colliding holes. Length scale given by nozzle width at the top of the image.

Figure 8. Scatterplot, linear, and local linear fit of each predictor versus response variable D10 [red lines are indicative of linear regression model between a single predictor and response variable, and blue lines are local linear regression, which includes some quadratic terms (or higher-order terms)]. Thus, estimations with quadratic terms will ensue when the blue line differs dramatically from the red line). Red boxes in this figure indicate terms added to the original data set.

Figure 9. Illustration for principal components 1 and 2 in a plane.

Figure 10. Diagnostic plot for full linear regression model.

Figure 11. Comparison of coefficients of each predictor in the final linear model.

Table 1. Physical IA metrics (automated) that are characterized for each frame in a video.

Feature (Metric)	Variable Name	Detailed Description
Control volume	Bdr_area	Area of the continuous liquid sheet before being disrupted by hole formation
Breakup length	Bdr_dis	Farthest location from the origin of the control volume
Hole location	hole_dis	Spatial location and size of holes that form inside and outside of the continuous liquid sheet
Hole surface area	hole_area	Area of holes within the continuous sheet
Polygon representation for all holes	Poly	Approximation of hole surface area (as collide holes) as a polygon (vertices ≤ 12) that is optimally fit to each hole
Boundary points characterizing each hole	BP	Vertices of a polygon (≤12 vertices) that are optimally fit to each hole
Angles formed as holes collide	Angle	Angles formed as holes collide with other holes, forming liquid ligaments
Ligament aspect ratio	R/L	Aspect ratios (R/L) for liquid bridges (ligaments) formed from the collision of multiple holes as the holes grow
Ligament length	L	Length of the liquid bridges that form upon the collision of multiple holes
Ligament width	W = 2R	Width of the liquid bridges that form upon the collision of multiple holes
Ligament area	Liga_area	Area of a ligament
Ligament distance from origin	liga_dis	Distance from the flat fan nozzle (origin) to the centroid of the ligament
Droplet distance	Drop_dis	Distance from the flat fan nozzle (origin) to the drop centroid
Droplet area	Drop_area	Area of the drop
Droplet radius	Drop_radius	Radius of the drop (assumed to be spherical)
Number of holes in continuous sheet	Hole_n	Number of holes in the image
# ligaments	Liga_n	Number of ligaments in the image
# drops	Drop_n	Number of drops in the image

Table 3. Predictor explanations and their new names.

Predictor	Explanation	New Name
angle	Average angles of polygons representing holes	V1
hole_dis	Average distance from holes’ centroids to the nozzle	V2
hole_area	Average area of holes (take logarithm)	log(V3)
liga_dis	Average distance from ligaments’ centroids to the nozzle	V4
liga_area	Average area of ligaments	V5
drop_dis	Average distance from droplets’ centroids to the nozzle	V6
bdr_dis	Average distance from continuous water sheets’ centroid to the nozzle	V7
bdr_area	Average area of continuous water sheets (take logarithm)	log(V8)
hole_n	Average number of holes	V9
liga_n	Average number of ligaments	V10
drop_n	Average number of droplets	V11
liga2	Square of the average distance from ligaments’ centroids to the nozzle	V12
drop2	Square of the average distance from droplets’ centroids to the nozzle	V13
ln2	Square of the average number of ligaments (take logarithm)	log(V14)
hn2	Square of the average number of holes (take logarithm)	log(V15)

Table 4. Box–Cox transformation parameters and the corresponding functions.

	D10	D50	D90	f74	f150
Estimated λ	−0.2	0	−0.4	0.3	0.2
Suggested λ	0	0	−0.5	0.33	0.25
Transformation	$l o g (D 10)$	$l o g (D 50)$	1/ $\sqrt{D 90}$	$\sqrt[3]{f 74}$	$\sqrt[4]{f 150}$

Table 5. Formula of each response variable in principal components.

Response	Formula in PCs
D10	$\log (D 10) = 4.76 + 0.0008 P C 2 + 0.0009 P C 4 + 0.016 P C 5 + 0.019 P C 6 + 0.069 P C 7 + 0.348 P C 9$
D50	$\log (D 50) = 5.58 + 0.0008 P C 2 + 0.0011 P C 4 + 0.017 P C 5 + 0.019 P C 6 + 0.06 P C 7 + 0.317 P C 9$
D90	$\frac{1}{\sqrt{D 90}} = 10^{- 4} (470 - 0.1 P C 2 - 0.2 P C 3 - 0.2 P C 4 - 2 P C 5 - 3.7 P C 6 - 10.2 P C 7 - 72.1 P C 9)$
f74	$\sqrt[3]{f 74} = 1.37 - 0.001 P C 2 - 0.002 P C 3 - 0.002 P C 4 - 0.022 P C 5 - 0.024 P C 6 - 0.16 P C 7 - 0.338 P C 9 + 0.298 P C 10$
f150	$\sqrt[4]{f 150} = 2.07 - 0.001 P C 2 - 0.001 P C 4 - 0.019 P C 5 - 0.018 P C 6 - 0.073 P C 7 - 0.348 P C 9$

Table 6. Formula of each response variable in original features.

Variable Name	$\log (D 10)$	$\log (D 50)$	$\frac{1}{\sqrt{D 90}}$ $(\times 10^{- 4}$ )	$\sqrt[3]{f 74}$	$\sqrt[4]{f 150}$	Unit	Scale
Intercept	4.76	5.58	470	1.37	2.07	1	NA
V1	−0.00219	−0.00202	0.498	−0.00652	0.00214	$r a d$	1
V2	−0.0707	−0.063	10.5	0.174	0.0749	$m m$	10
log(V3)	0.278	0.253	−57.8	−0.147	−0.278	${m m}^{2}$	1
V4	0.0379	0.0345	−7.8	−0.0146	−0.0379	$m m$	10
V5	0.0273	0.0249	−5.68	0.0375	−0.0274	${m m}^{2}$	1
V6	−0.0179	−0.0163	3.68	0.0402	0.0179	$m m$	10
V7	−0.00426	−0.00736	1.63	−0.0294	0.00324	$m m$	10
log(V8)	0.00856	0.008	−2.34	−0.0425	−0.00791	${m m}^{2}$	10
V9	−0.015	−0.0152	2.63	0.0234	0.0175	1	10
V10	−0.00179	−0.00112	0.337	0.00221	0.00129	1	100
V11	−0.00001	−0.00002	0.00606	0.000123	0.00003	1	1000
V12	−0.00052	−0.000417	−0.0547	−0.00191	0.000627	${m m}^{2}$	100
V13	0.000153	0.000167	0.0577	0.000597	−0.00009	${m m}^{2}$	100
log(V14)	0.0792	0.0718	−15.7	0.0233	−0.0799	1	10
log(V15)	0.187	0.17	−38.8	−0.413	−0.187	1	10

“1” in the table means that we do not have any units for the corresponding predictor, such as intercept and number of holes in the picture. “Scale” means the scale of that variable; if we multiply the variable range with the coefficient, the importance of that variable is approximated.

Table 7. Correlation of primary IA metrics to driftable fine fraction (D10).

Negative Slope	Positive Slope
Angle	Hole area
Hole distance	Ligament distance
Drop distance	Ligament area
Boundary distance	Boundary area
Hole number	(Drop distance)²
Ligament number	(Ligament number)²
Drop number	(Hole number)²
(Ligament distance)²

Table 8. Variance explained in terms of attributes for magnitude for each correlation in the parameter sensitivity analysis.

Variable	Variance Explained Percentage (%)
angle	26.43
log(hole_area)	8.38
drop_dis²	4.91
hole_dis	4.01
log(ligament number²)	3.73
liga_dis²	3.63
liga_n	3.16

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cryer, S.; Raymond, J. Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques. Fluids 2024, 9, 40. https://doi.org/10.3390/fluids9020040

AMA Style

Cryer S, Raymond J. Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques. Fluids. 2024; 9(2):40. https://doi.org/10.3390/fluids9020040

Chicago/Turabian Style

Cryer, Steven, and John Raymond. 2024. "Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques" Fluids 9, no. 2: 40. https://doi.org/10.3390/fluids9020040

Article Menu

Interpreting Image Patterns for Agricultural Sprays Using Statistics and Machine Learning Techniques

Abstract

1. Introduction

2. Materials and Methods

2.1. Numerical Descriptors

2.2. Visual Navigable Landscape (Empirical)

2.3. Experimental

2.4. Theoretical

3. Results

3.1. Visual Navigable Landscape (Empirical)

3.2. Physical Image Analysis (IA) Metrics

3.3. Statistical Analysis

Exploratory Data Analysis: On Physical Metrics

3.4. Principal Component Analysis (PCA)

3.5. Individual Factor Analysis

4. Discussion

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI