Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives
Abstract
:1. Introduction
- RQ1: What are the most used ML algorithms in agriculture?
- RQ2: What are the impacts and outcomes of integrating ML in agriculture?
- RQ3: What are the challenges and future directions associated with integrating ML in agriculture and agricultural systems?
2. Principles and Methods
2.1. Identification Phase
2.2. Screening Phase
2.3. Eligibility Phase
2.4. Inclusion Phase
2.5. PRISMA Overview
3. Results and Discussion
3.1. Statistical Analysis
3.2. Application Domains in Agriculture
- Crop type:
- -
- “Plants” with 46.36% of the total count. In this group, it is possible to find the following: wheat (13.91%), maize (12.17%), rice (6.09%), vineyards (3.04%), grass (3.04%), rapeseed (2.61%), sugarcane (2.17%), tea (1.74%), cotton (0.87%), peach leaf (0.87%), alfalfa (0.87%), bok choy (0.43%), barley (0.43%), Arabidopsis (0.43%), jujube (0.43%), parsley (0.43%), green coffee plant (0.43%), mushrooms (0.43%), oil palm leaf (0.43%), almond orchard (0.43%), and banana leaf (0.43%);
- -
- “Not Specified” with 13.59% of the total crop analysis;
- -
- “Vegetables” with 12.22% of the total count: soybean (3.04%), potato (2.61%), vegetables—not specified (1.74%), lettuce (0.87%), carrot (0.87%), sugar beet (0.87%), asparagus (0.43%), leek (0.43%), onions (0.43%), and cabbage (0.43%);
- -
- “Fruits” with 11.96% of the total count: tomato (2.61%), citrus (1.74%), pineapple (1.31%), watermelon (0.87%), mango (0.87%), banana (0.43%), strawberry (0.43%), date (0.43%), avocado (0.43%), muskmelon (0.43%), kiwi (0.43%), apricot (0.43%), durian (0.43%), peach (0.43%), grape (0.43%), guava (0.43%), and cucumber (0.43%);
- -
- “Trees” with 9.84% of the total count: apple tree (1.72%), olive tree (0.87%), pine tree (0.87%), gum tree (0.43%), Oriental beech tree (0.43%), Cinnamon tree (0.43%), Caribbean tree (0.43%), and shrub (0.43%);
- -
- “Grain, seeds and nuts” with 6.04% of the total count: grains (3.01%), nuts—not specified (2.17%), pea seeds (0.43%), and radish seeds (0.43%).
- Animal type: cows and dairy cows (21.21%), chickens, broilers, and hens (18.18%), sheep (15.15%), fish (9.09%), pigs (9.09%), bees (6.06%), steers (3.03%), gorillas (3.03%), heifers (3.03%), horses (3.03%), lambs (3.03%), small ruminants (3.03%), and invasive insects—not specified (3.03%).
4. Machine Learning Trends
- Ensemble Learning: this category has the largest percentage in the SLR with 35.6% of the total distribution. Ensemble Learning [22,23,24] emerges as a key force for improving the performance and generalisation of ML models, making them more robust and reliable. This category includes RF (19.2%, frequency: 127), GBT (8.3%, frequency: 55), Extreme Gradient Boosting (XGBoost) (4.5%, frequency: 30), AdaBoost (0.9%, frequency: 6), Bagging (0.8%, frequency: 5), CatBoost (0.6%, frequency: 4), Stacking (0.3%, frequency: 2), and “not specified” ensemble methods (0.3%, frequency: 2). Of all the algorithms within this category, the RF presents the highest frequency. While the Decision Tree (DT) [25] offers a simple and interpretable model, RF leverages the power of multiple DT to provide robust predictions and classifications that are crucial for the optimisation of various agricultural processes. These processes can include crop yield estimation, disease detection, and land cover classification based on remote sensing data.
- Artificial Neural Networks: the second category within the scope of this SLR constitutes 24.9% of the overall content and encompasses a range of influential algorithms that fall under the domain of ANN [26,27]. The algorithms covered in this category include CNN (7.3%, frequency: 48), ANN-not specified (6.4%, frequency: 42), Long Short-Term Memory (LSTM) (3.0%, frequency: 20), Deep Neural Networks (DNN) (1.8%, frequency: 12), Multilayer Perceptron (MLP) (1.7%, frequency: 11), You Only Look Once (YOLO) (1.2%, frequency: 8), Extreme Learning Machines (ELM) (0.9%, frequency: 6), DL-not specified (0.6%, frequency: 4), Gated Recurrent Unit (GRU) (0.5%, frequency: 3), Recurrent Neural Network (RNN) (0.3%, frequency: 2), Generative Adversarial Networks (GAN) (0.2%, frequency: 1), and Encoder and Autoencoder (0.2%, frequency: 1). These algorithms are powerful tools capable of learning complex patterns, thereby facilitating accurate predictions and advancing the capabilities of numerous applications. Among these algorithms, the one that stands out the most is CNN, known to be specialised for image data analysis [28], making them valuable for tasks like crop disease identification, plant species recognition, and weed detection [1].
- Support Vector Machine: the third most prominent category, accounting for 15.9% (frequency: 105), is the SVM. This algorithm holds significant popularity and widespread application for tasks encompassing both classification and regression procedures [29]. SVM underscores its significance in guiding informed decisions for bolstering agricultural productivity in the era of Agriculture 4.0 [1]. Through its adeptness in crop mapping, yield estimation, and disease detection, SVM contributes to the ongoing transformation of agriculture into a more precise, efficient, and resilient practice, aligning seamlessly with the evolving demands of a dynamic global food landscape.
- Dimensionality Reduction: this category represents 6.2% of the total distribution and includes three different algorithms that can aid in dimensionality reduction [30] and feature engineering [31] from agricultural datasets, namely Partial Least Squares (PLS) algorithm (3.9%, frequency: 26), Principal Component Analysis (PCA) (1.2%, frequency: 8), and Linear Discriminant Analysis (LDA) algorithm (1.1%, frequency: 7).
- Generalised Linear Models: comprising 6.0% of the total distribution, this category underscores the significance of statistical models that transcend the constraints of simple linear regression. Of this category are Multiple Linear Regression (MLR) (2.3%, frequency: 15), Logistic Regression (1.4%, frequency: 9), Ridge Regression (1.2%, frequency: 8), Cubist Regression (0.8%, frequency: 5), and Multivariate Adaptive Regression Splines (MARS) (0.3%, frequency: 2).
- Nearest Neighbour: this category exclusively employs the k-Nearest Neighbors (KNN) algorithm (4.5%, frequency: 30). Among the various algorithms in the field of ML, the KNN algorithm stands out as one of the simplest yet extensively employed methods for classification purposes [32,33]. Its adaptive and comprehensible design contributes to its popularity in various classification tasks.
- Bayesian Models: this category focuses on Gaussian distributions and probabilistic models. These methods leverage the principles of Gaussian processes [34] (2.0%, frequency: 13) and Naïve Bayes (NB) [35] (2.3%, frequency: 15). These techniques offer solutions that adapt to the complexities of diverse datasets and applications.
- Decision Trees: constituting 4.1% (frequency: 27) of the overall distribution, these tree-like structure algorithms are versatile tools that facilitate data-driven decisions [25].
5. Machine Learning in Agriculture
5.1. Crop Management Domain
5.1.1. Crop Quality
5.1.2. Crop Mapping and Recognition
5.1.3. Crop Yield
5.1.4. Crop Disease
5.1.5. Pest and Weed Detection
5.2. Water Management Domain
5.3. Soil Management Domain
5.4. Animal Management Domain
5.5. Main Findings
5.5.1. Crop Management
5.5.2. Water Management
5.5.3. Soil Management
5.5.4. Animal Management
6. Challenges and Research Opportunities
7. Conclusions, Limitations, and Future Work
7.1. Conclusions
7.2. Limitations
7.3. Future Work
Author Contributions
Funding
Conflicts of Interest
Abbreviations
References
- Araújo, S.O.; Peres, R.S.; Barata, J.; Lidon, F.; Ramalho, J.C. Characterising the Agriculture 4.0 Landscape—Emerging Trends, Challenges and Opportunities. Agronomy 2021, 11, 667. [Google Scholar] [CrossRef]
- De Clercq, M.; Vats, A.; Biel, A. Agriculture 4.0: The future of farming technology. In Proceedings of the the World Government Summit, Dubai, United Arab Emirates, 11–13 February 2018; pp. 11–13. [Google Scholar]
- Zambon, I.; Cecchini, M.; Egidi, G.; Saporito, M.G.; Colantoni, A. Revolution 4.0: Industry vs. agriculture in a future development for SMEs. Processes 2019, 7, 36. [Google Scholar] [CrossRef]
- Liu, Y.; Ma, X.; Shu, L.; Hancke, G.P.; Abu-Mahfouz, A.M. From Industry 4.0 to Agriculture 4.0: Current Status, Enabling Technologies, and Research Challenges. IEEE Trans. Ind. Inform. 2020, 17, 4322–4334. [Google Scholar] [CrossRef]
- Zhai, Z.; Martínez, J.F.; Beltran, V.; Martínez, N.L. Decision support systems for Agriculture 4.0: Survey and challenges. Comput. Electron. Agric. 2020, 170, 105256. [Google Scholar] [CrossRef]
- Trendov, N.M.; Varas, S.; Zeng, M. Digital Technologies in Agriculture and Rural Areas; Briefing paper; FAO: Rome, Italy, 2019. [Google Scholar]
- Rose, D.C.; Chilvers, J. Agriculture 4.0: Broadening responsible innovation in an era of smart farming. Front. Sustain. Food Syst. 2018, 2, 87. [Google Scholar] [CrossRef]
- Ahmed, M.; Pathan, A.S.K. Data Analytics: Concepts, Techniques, and Applications; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
- Liakos, K.G.; Busato, P.; Moshou, D.; Pearson, S.; Bochtis, D. Machine learning in agriculture: A review. Sensors 2018, 18, 2674. [Google Scholar] [CrossRef]
- Mahesh, B. Machine learning algorithms-a review. Int. J. Sci. Res. (IJSR) 2020, 9, 381–386. [Google Scholar]
- Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G.; Group, P. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Med. 2009, 6, e1000097. [Google Scholar] [CrossRef]
- PRISMA. Prisma Transparent Reporting of Systematic Reviews and Meta-Analyses. Available online: http://www.prisma-statement.org/ (accessed on 6 July 2023).
- Clarivate. Journal Citation Reports. Available online: http://jcr.clarivate.com (accessed on 6 July 2023).
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Noble, W.S. What is a support vector machine? Nat. Biotechnol. 2006, 24, 1565–1567. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Li, Q.; Wen, Z.; He, B. Practical federated gradient boosting decision trees. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 4642–4649. [Google Scholar] [CrossRef]
- Albawi, S.; Mohammed, T.A.; Al-Zawi, S. Understanding of a convolutional neural network. In Proceedings of the 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey, 21–23 August 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar] [CrossRef]
- Li, Z.; Liu, F.; Yang, W.; Peng, S.; Zhou, J. A survey of convolutional neural networks: Analysis, applications, and prospects. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6999–7019. [Google Scholar] [CrossRef] [PubMed]
- Dietterich, T.G. Ensemble methods in machine learning. In Proceedings of the International Workshop on Multiple Classifier Systems, Cagliari, Italy, 21–23 June 2000; Springer: Berlin/Heidelberg, Germany, 2000; pp. 1–15. [Google Scholar]
- Sagi, O.; Rokach, L. Ensemble learning: A survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8, e1249. [Google Scholar] [CrossRef]
- Dong, X.; Yu, Z.; Cao, W.; Shi, Y.; Ma, Q. A survey on ensemble learning. Front. Comput. Sci. 2020, 14, 241–258. [Google Scholar] [CrossRef]
- De Ville, B. Decision trees. Wiley Interdiscip. Rev. Comput. Stat. 2013, 5, 448–455. [Google Scholar] [CrossRef]
- Krenker, A.; Bešter, J.; Kos, A. Introduction to the artificial neural networks. In Artificial Neural Networks: Methodological Advances and Biomedical Applications; InTech: London, UK, 2011; pp. 1–18. [Google Scholar]
- Walczak, S. Artificial neural networks. In Advanced Methodologies and Technologies in Artificial Intelligence, Computer Simulation, and Human-Computer Interaction; IGI Global: Hershey, PA, USA, 2019; pp. 40–53. [Google Scholar] [CrossRef]
- Liang, M.; Hu, X. Recurrent convolutional neural network for object recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3367–3375. [Google Scholar] [CrossRef]
- Ray, S. A quick review of machine learning algorithms. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 35–39. [Google Scholar] [CrossRef]
- Reddy, G.T.; Reddy, M.P.K.; Lakshmanna, K.; Kaluri, R.; Rajput, D.S.; Srivastava, G.; Baker, T. Analysis of dimensionality reduction techniques on big data. IEEE Access 2020, 8, 54776–54788. [Google Scholar] [CrossRef]
- Zheng, A.; Casari, A. Feature Engineering for Machine Learning: Principles and Techniques for Data Scientists; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2018. [Google Scholar]
- Cunningham, P.; Delany, S.J. k-Nearest neighbour classifiers-A Tutorial. ACM Comput. Surv. (CSUR) 2021, 54, 1–25. [Google Scholar] [CrossRef]
- Uddin, S.; Haque, I.; Lu, H.; Moni, M.A.; Gide, E. Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction. Sci. Rep. 2022, 12, 6256. [Google Scholar] [CrossRef]
- Rasmussen, C.E. Gaussian processes in machine learning. In Summer School on Machine Learning; Springer: Berlin/Heidelberg, Germany, 2003; pp. 63–71. [Google Scholar]
- Berrar, D. Bayes’ theorem and naive Bayes classifier. In Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics; Elsevier: Amsterdam, The Netherlands, 2018; Volume 403, p. 412. [Google Scholar]
- Thung, K.H.; Wee, C.Y. A brief review on multi-task learning. Multimed. Tools Appl. 2018, 77, 29705–29725. [Google Scholar] [CrossRef]
- Zhang, Y.; Yang, Q. An overview of multi-task learning. Natl. Sci. Rev. 2018, 5, 30–43. [Google Scholar] [CrossRef]
- FAO. The Future of Food and Agriculture—Trends and Challenges; Food and Agriculture Organization of the United Nations: Rome, Italy, 2017. [Google Scholar]
- Bauer, A.; Bostrom, A.G.; Ball, J.; Applegate, C.; Cheng, T.; Laycock, S.; Rojas, S.M.; Kirwan, J.; Zhou, J. Combining computer vision and deep learning to enable ultra-scale aerial phenotyping and precision agriculture: A case study of lettuce production. Hortic. Res. 2019, 6, 70. [Google Scholar] [CrossRef] [PubMed]
- Manthou, E.; Lago, S.L.; Dagres, E.; Lianou, A.; Tsakanikas, P.; Panagou, E.Z.; Anastasiadi, M.; Mohareb, F.; Nychas, G.J.E. Application of spectroscopic and multispectral imaging technologies on the assessment of ready-to-eat pineapple quality: A performance evaluation study of machine learning models generated from two commercial data analytics tools. Comput. Electron. Agric. 2020, 175, 105529. [Google Scholar] [CrossRef]
- Chawgien, K.; Kiattisin, S. Machine learning techniques for classifying the sweetness of watermelon using acoustic signal and image processing. Comput. Electron. Agric. 2021, 181, 105938. [Google Scholar] [CrossRef]
- Zhu, Y.; Chen, M.; Gu, Q.; Zhao, Y.; Zhang, X.; Sun, Q.; Gu, X.; Zheng, K. Machine learning methods for efficient and automated in situ monitoring of peach flowering phenology. Comput. Electron. Agric. 2022, 202, 107370. [Google Scholar] [CrossRef]
- Lu, J.; Dai, E.; Miao, Y.; Kusnierek, K. Improving active canopy sensor-based in-season rice nitrogen status diagnosis and recommendation using multi-source data fusion with machine learning. J. Clean. Prod. 2022, 380, 134926. [Google Scholar] [CrossRef]
- Gomes, W.P.C.; Gonçalves, L.; da Silva, C.B.; Melchert, W.R. Application of multispectral imaging combined with machine learning models to discriminate special and traditional green coffee. Comput. Electron. Agric. 2022, 198, 107097. [Google Scholar] [CrossRef]
- Zhang, Y.; Wang, X.; Liu, T.; Wang, R.; Li, Y.; Xue, Q.; Yang, P. Sustainable fertilisation management via tensor multi-task learning using multi-dimensional agricultural data. J. Ind. Inf. Integr. 2023, 34, 100461. [Google Scholar] [CrossRef]
- Yang, X.; Zhang, R.; Zhai, Z.; Pang, Y.; Jin, Z. Machine learning for cultivar classification of apricots (Prunus armeniaca L.) based on shape features. Sci. Hortic. 2019, 256, 108524. [Google Scholar] [CrossRef]
- Fernandes, A.M.; Utkin, A.B.; Eiras-Dias, J.; Cunha, J.; Silvestre, J.; Melo-Pinto, P. Grapevine variety identification using “Big Data” collected with miniaturized spectrometer combined with support vector machines and convolutional neural networks. Comput. Electron. Agric. 2019, 163, 104855. [Google Scholar] [CrossRef]
- Del Valle, T.M.; Jiang, P. Comparison of common classification strategies for large-scale vegetation mapping over the Google Earth Engine platform. Int. J. Appl. Earth Obs. Geoinf. 2022, 115, 103092. [Google Scholar] [CrossRef]
- Li, L.; Qiao, J.; Yao, J.; Li, J.; Li, L. Automatic freezing-tolerant rapeseed material recognition using UAV images and deep learning. Plant Methods 2022, 18, 1–13. [Google Scholar] [CrossRef] [PubMed]
- Syazwani, R.W.N.; Asraf, H.M.; Amin, M.M.S.; Dalila, K.N. Automated image identification, detection and fruit counting of top-view pineapple crown using machine learning. Alex. Eng. J. 2022, 61, 1265–1276. [Google Scholar] [CrossRef]
- Yang, Z.; Diao, C.; Gao, F. Towards Scalable Within-Season Crop Mapping With Phenology Normalization and Deep Learning. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 16, 1390–1402. [Google Scholar] [CrossRef]
- Ballesteros, R.; Intrigliolo, D.S.; Ortega, J.F.; Ramírez-Cuesta, J.M.; Buesa, I.; Moreno, M.A. Vineyard yield estimation by combining remote sensing, computer vision and artificial neural network techniques. Precis. Agric. 2020, 21, 1242–1262. [Google Scholar] [CrossRef]
- Chu, Z.; Yu, J. An end-to-end model for rice yield prediction using deep learning fusion. Comput. Electron. Agric. 2020, 174, 105471. [Google Scholar] [CrossRef]
- Zheng, C.; Abd-Elrahman, A.; Whitaker, V.; Dalid, C. Prediction of Strawberry Dry Biomass from UAV Multispectral Imagery Using Multiple Machine Learning Methods. Remote Sens. 2022, 14, 4511. [Google Scholar] [CrossRef]
- Chen, R.; Zhang, C.; Xu, B.; Zhu, Y.; Zhao, F.; Han, S.; Yang, G.; Yang, H. Predicting individual apple tree yield using UAV multi-source remote sensing data and ensemble learning. Comput. Electron. Agric. 2022, 201, 107275. [Google Scholar] [CrossRef]
- Segarra, J.; Araus, J.L.; Kefauver, S.C. Farming and Earth Observation: Sentinel-2 data to estimate within-field wheat grain yield. Int. J. Appl. Earth Obs. Geoinf. 2022, 107, 102697. [Google Scholar] [CrossRef]
- Wang, Y.; Shi, W.; Wen, T. Prediction of winter wheat yield and dry matter in North China Plain using machine learning algorithms for optimal water and nitrogen application. Agric. Water Manag. 2023, 277, 108140. [Google Scholar] [CrossRef]
- Selvaraj, M.G.; Vergara, A.; Montenegro, F.; Ruiz, H.A.; Safari, N.; Raymaekers, D.; Ocimati, W.; Ntamwira, J.; Tits, L.; Omondi, A.B.; et al. Detection of banana plants and their major diseases through aerial images and machine learning methods: A case study in DR Congo and Republic of Benin. ISPRS J. Photogramm. Remote Sens. 2020, 169, 110–124. [Google Scholar] [CrossRef]
- Wang, X.; Liu, J.; Zhu, X. Early real-time detection algorithm of tomato diseases and pests in the natural environment. Plant Methods 2021, 17, 1–17. [Google Scholar] [CrossRef] [PubMed]
- Nagasubramanian, G.; Sakthivel, R.K.; Patan, R.; Sankayya, M.; Daneshmand, M.; Gandomi, A.H. Ensemble classification and IoT-based pattern recognition for crop disease monitoring system. IEEE Internet Things J. 2021, 8, 12847–12854. [Google Scholar] [CrossRef]
- Amarasingam, N.; Gonzalez, F.; Salgadoe, A.S.A.; Sandino, J.; Powell, K. Detection of White Leaf Disease in Sugarcane Crops Using UAV-Derived RGB Imagery with Existing Deep Learning Models. Remote Sens. 2022, 14, 6137. [Google Scholar] [CrossRef]
- Abdulridha, J.; Ampatzidis, Y.; Qureshi, J.; Roberts, P. Identification and classification of downy mildew severity stages in watermelon utilizing aerial and ground remote sensing and machine learning. Front. Plant Sci. 2022, 13, 791018. [Google Scholar] [CrossRef] [PubMed]
- Sriwanna, K. Weather-based rice blast disease forecasting. Comput. Electron. Agric. 2022, 193, 106685. [Google Scholar] [CrossRef]
- Shin, M.Y.; Viejo, C.G.; Tongson, E.; Wiechel, T.; Taylor, P.W.; Fuentes, S. Early detection of Verticillium wilt of potatoes using near-infrared spectroscopy and machine learning modeling. Comput. Electron. Agric. 2023, 204, 107567. [Google Scholar] [CrossRef]
- Abbas, T.; Zahir, Z.A.; Naveed, M.; Kremer, R.J. Limitations of existing weed control practices necessitate development of alternative techniques based on biological approaches. Adv. Agron. 2018, 147, 239–280. [Google Scholar] [CrossRef]
- de Castro, A.I.; Peña, J.M.; Torres-Sánchez, J.; Jiménez-Brenes, F.M.; Valencia-Gredilla, F.; Recasens, J.; López-Granados, F. Mapping cynodon dactylon infesting cover crops with an automatic decision tree-OBIA procedure and UAV imagery for precision viticulture. Remote Sens. 2019, 12, 56. [Google Scholar] [CrossRef]
- Gée, C.; Denimal, E. RGB image-derived indicators for spatial assessment of the impact of broadleaf weeds on wheat biomass. Remote Sens. 2020, 12, 2982. [Google Scholar] [CrossRef]
- Sapkota, B.; Singh, V.; Neely, C.; Rajan, N.; Bagavathiannan, M. Detection of Italian ryegrass in wheat and prediction of competitive interactions using remote-sensing and machine-learning techniques. Remote Sens. 2020, 12, 2977. [Google Scholar] [CrossRef]
- El-Kenawy, E.S.M.; Khodadadi, N.; Mirjalili, S.; Makarovskikh, T.; Abotaleb, M.; Karim, F.K.; Alkahtani, H.K.; Abdelhamid, A.A.; Eid, M.M.; Horiuchi, T.; et al. Metaheuristic optimization for improving weed detection in wheat images captured by drones. Mathematics 2022, 10, 4421. [Google Scholar] [CrossRef]
- Zhang, C.; Hu, Z.; Xu, L.; Zhao, Y. A YOLOv7 incorporating the Adan optimizer based corn pests identification method. Front. Plant Sci. 2023, 14, 1174556. [Google Scholar] [CrossRef]
- Pereira, L.S.; Perrier, A.; Allen, R.G.; Alves, I. Evapotranspiration: Concepts and future trends. J. Irrig. Drain. Eng. 1999, 125, 45–51. [Google Scholar] [CrossRef]
- Filgueiras, R.; Almeida, T.S.; Mantovani, E.C.; Dias, S.H.B.; Fernandes-Filho, E.I.; da Cunha, F.F.; Venancio, L.P. Soil water content and actual evapotranspiration predictions using regression algorithms and remote sensing data. Agric. Water Manag. 2020, 241, 106346. [Google Scholar] [CrossRef]
- Brédy, J.; Gallichand, J.; Celicourt, P.; Gumiere, S.J. Water table depth forecasting in cranberry fields using two decision-tree-modeling approaches. Agric. Water Manag. 2020, 233, 106090. [Google Scholar] [CrossRef]
- Akhter, F.; Siddiquei, H.R.; Alahi, M.E.E.; Jayasundera, K.P.; Mukhopadhyay, S.C. An IoT-enabled portable water quality monitoring system with MWCNT/PDMS multifunctional sensor for agricultural applications. IEEE Internet Things J. 2021, 9, 14307–14316. [Google Scholar] [CrossRef]
- Zhao, L.; Zhao, X.; Zhou, H.; Wang, X.; Xing, X. Prediction model for daily reference crop evapotranspiration based on hybrid algorithm and principal components analysis in Southwest China. Comput. Electron. Agric. 2021, 190, 106424. [Google Scholar] [CrossRef]
- Ndlovu, H.S.; Odindi, J.; Sibanda, M.; Mutanga, O.; Clulow, A.; Chimonyo, V.G.; Mabhaudhi, T. A comparative estimation of maize leaf water content using machine learning techniques and unmanned aerial vehicle (UAV)-based proximal and remotely sensed data. Remote Sens. 2021, 13, 4091. [Google Scholar] [CrossRef]
- Vianny, D.M.M.; John, A.; Mohan, S.K.; Sarlan, A.; Ahmadian, A. Water optimization technique for precision irrigation system using IoT and machine learning. Sustain. Energy Technol. Assessments 2022, 52, 102307. [Google Scholar] [CrossRef]
- Yang, H.; Wang, P.; Chen, A.; Ye, Y.; Chen, Q.; Cui, R.; Zhang, D. Prediction of phosphorus concentrations in shallow groundwater in intensive agricultural regions based on machine learning. Chemosphere 2023, 313, 137623. [Google Scholar] [CrossRef] [PubMed]
- Na, A.; Isaac, W.; Varshney, S.; Khan, E. An IoT based system for remote monitoring of soil characteristics. In Proceedings of the 2016 International Conference on Information Technology (InCITe)—The Next Generation IT Summit on the Theme—Internet of Things: Connect your Worlds, Noida, India, 6–7 October 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 316–320. [Google Scholar] [CrossRef]
- Taneja, P.; Vasava, H.K.; Daggupati, P.; Biswas, A. Multi-algorithm comparison to predict soil organic matter and soil moisture content from cell phone images. Geoderma 2021, 385, 114863. [Google Scholar] [CrossRef]
- Yuan, J.; Wen, T.; Zhang, H.; Zhao, M.; Penton, C.R.; Thomashow, L.S.; Shen, Q. Predicting disease occurrence with high accuracy based on soil macroecological patterns of Fusarium wilt. ISME J. 2020, 14, 2936–2950. [Google Scholar] [CrossRef] [PubMed]
- Glenn, A.J.; Moulin, A.P.; Roy, A.K.; Wilson, H.F. Soil nitrous oxide emissions from no-till canola production under variable rate nitrogen fertilizer management. Geoderma 2021, 385, 114857. [Google Scholar] [CrossRef]
- Fournier, B.; Steiner, M.; Brochet, X.; Degrune, F.; Mammeri, J.; Carvalho, D.L.; Siliceo, S.L.; Bacher, S.; Peña-Reyes, C.A.; Heger, T.J. Toward the use of protists as bioindicators of multiple stresses in agricultural soils: A case study in vineyard ecosystems. Ecol. Indic. 2022, 139, 108955. [Google Scholar] [CrossRef]
- Li, P.; Hao, H.; Mao, X.; Xu, J.; Lv, Y.; Chen, W.; Ge, D.; Zhang, Z. Convolutional neural network-based applied research on the enrichment of heavy metals in the soil–rice system in China. Environ. Sci. Pollut. Res. 2022, 29, 53642–53655. [Google Scholar] [CrossRef]
- Zhao, J.; Zhang, C.; Min, L.; Guo, Z.; Li, N. Retrieval of farmland surface soil moisture based on feature optimization and machine learning. Remote Sens. 2022, 14, 5102. [Google Scholar] [CrossRef]
- Wan, H.; Qi, H.; Shang, S. Estimating soil water and salt contents from field measurements with time domain reflectometry using machine learning algorithms. Agric. Water Manag. 2023, 285, 108364. [Google Scholar] [CrossRef]
- Nasirahmadi, A.; Edwards, S.A.; Sturm, B. Implementation of machine vision for detecting behaviour of cattle and pigs. Livest. Sci. 2017, 202, 25–38. [Google Scholar] [CrossRef]
- Raju, K.R.S.R.; Varma, G.H.K. Knowledge based real time monitoring system for aquaculture using IoT. In Proceedings of the 2017 IEEE 7th International Advance Computing Conference (IACC), Hyderabad, India, 5–7 January 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 318–321. [Google Scholar] [CrossRef]
- Shi, X.; An, X.; Zhao, Q.; Liu, H.; Xia, L.; Sun, X.; Guo, Y. State-of-the-art internet of things in protected agriculture. Sensors 2019, 19, 1833. [Google Scholar] [CrossRef] [PubMed]
- Hu, S.; Ingham, A.; Schmoelzl, S.; McNally, J.; Little, B.; Smith, D.; Bishop-Hurley, G.; Wang, Y.G.; Li, Y. Inclusion of features derived from a mixture of time window sizes improved classification accuracy of machine learning algorithms for sheep grazing behaviours. Comput. Electron. Agric. 2020, 179, 105857. [Google Scholar] [CrossRef]
- Wagner, N.; Antoine, V.; Mialon, M.M.; Lardy, R.; Silberberg, M.; Koko, J.; Veissier, I. Machine learning to detect behavioural anomalies in dairy cows under subacute ruminal acidosis. Comput. Electron. Agric. 2020, 170, 105233. [Google Scholar] [CrossRef]
- Bovo, M.; Agrusti, M.; Benni, S.; Torreggiani, D.; Tassinari, P. Random forest modelling of milk yield of dairy cows under heat stress conditions. Animals 2021, 11, 1305. [Google Scholar] [CrossRef] [PubMed]
- Xu, J.; Zhou, S.; Xu, A.; Ye, J.; Zhao, A. Automatic scoring of postures in grouped pigs using depth image and CNN-SVM. Comput. Electron. Agric. 2022, 194, 106746. [Google Scholar] [CrossRef]
- Nasir, A.; Ullah, M.O.; Yousaf, M.H. Ai in apiculture: A novel framework for recognition of invasive insects under unconstrained flying conditions for smart beehives. Eng. Appl. Artif. Intell. 2023, 119, 105784. [Google Scholar] [CrossRef]
- Ranjan, R.; Tsukuda, S.; Good, C. Effects of image data quality on a convolutional neural network trained in-tank fish detection model for recirculating aquaculture systems. Comput. Electron. Agric. 2023, 205, 107644. [Google Scholar] [CrossRef]
- Mei, W.; Yang, X.; Zhao, Y.; Wang, X.; Dai, X.; Wang, K. Identification of aflatoxin-poisoned broilers based on accelerometer and machine learning. Biosyst. Eng. 2023, 227, 107–116. [Google Scholar] [CrossRef]
- SHAP. Welcome to the SHAP Documentation. Available online: https://shap.readthedocs.io/en/latest/ (accessed on 19 September 2023).
- Data Imaginist. LIME. Available online: https://lime.data-imaginist.com/ (accessed on 19 September 2023).
Group 1 | Group 2 | Group 3 |
---|---|---|
“agricultur*” | “machine learning” | “application” |
“farm*” | “implementation” | |
“case study” | ||
“experimental” | ||
“practical” |
ID | Criteria | Description |
---|---|---|
IC1 | Repositories | Web of Science and Scopus |
IC2 | Search period | From 2019 to 2023, both years included |
IC3 | Search within | Article title, abstract, and keywords |
IC4 | Document type | Articles |
IC5 | Language | English |
ID | Criteria | Description |
---|---|---|
SC1 | Duplicates | Duplicate records were removed |
SC2 | Records Screening | Must include the title, year, abstract, and DOI |
SC3 | Journal rank | Must include only Q1 journals (Clarivate [13]) |
SC4 | Article type | Must not include reviews/surveys without original research |
ID | Criteria | Description |
---|---|---|
EC1 | Content | Publications specifically for the agricultural sector |
EC2 | Content | Publications with ML-related content |
EC3 | Full-text | Must be available |
Ref. | Crop Type | Models Used | Summary |
---|---|---|---|
[39] | Lettuce | CNN, DNN | AirSurf platform developed for ultra-scale aerial phenotyping, crop counting, and crop quality assessment. AirSurf-Lettuce achieves high accuracy (>98%) in scoring and categorising iceberg lettuces and provides novel analysis functions for mapping lettuce size distribution to enhance precision agricultural practices. |
[40] | Kiwifruit | ANN, SVM, Gaussian Process, Ensemble learning | Non-destructive tactile sensing approach for estimating the stiffness of kiwifruits, achieving accurate ripeness estimation with regression-based ML, showcasing potential applications in real-time quality control and sorting of fruits throughout the supply chain. |
[41] | Watermelon | NB, Logistic regression, KNN, DT, RF, ANN, SVM, GBT | Proposes a fusion non-destructive method for classifying watermelon sweetness based on acoustic signals, image processing, and weight features. ML is used to develop sweetness classification models. GBT obtained the highest classification accuracy of 92%. |
[42] | Peach flower | RF, SVM, KNN, NB | Assesses different ML methods for estimating and monitoring peach flowering phenological stages using real-time flower images and meteorological data. RF has the highest F1 score of 98.82% on the testing set, demonstrating its potential for real-time monitoring and applications in peach breeding, heat stress management, and irrigation scheduling. |
[43] | Rice | Stepwise MLR, RF | Accurate and non-destructive in-season nitrogen (N) diagnosis and recommendation for rice crops. The study uses active canopy sensor data and combines it with environmental and agronomic variables to develop N status diagnosis and recommendation models. This approach can significantly enhance N management strategies in rice cultivation, contributing to sustainable development and food security. |
[44] | Green coffee | SVM, RF, XGBoost, CatBoost, PCA | Focuses on distinguishing between special and traditional green coffee beans using an advanced multispectral imaging technique based on reflectance and autofluorescence data and combined with ML techniques. SVM achieves the highest accuracy (0.96) for the test dataset. This approach showcases its potential as a non-destructive and real-time tool for classifying green coffee beans in the food industry. |
[45] | Wheat | Multi-task learning | Multi-task learning approach using a real-world agricultural dataset, showing superior accuracy and stability in fertilisation prediction, leading to the development of a precision fertilisation system for intelligent and personalised farm management. |
Ref. | Crop Type | Models Used | Summary |
---|---|---|---|
[46] | Apricot cultivars | DT, KNN, LDA, NB, SVM, BPNN | Demonstrates the feasibility of using ML to identify apricot cultivars based on their shape features, suggesting potential for non-destructive automatic identification systems. SVM achieved the best accuracy of 90.7% in the test set for classifying apricot cultivars. |
[47] | Grapevines | SVM, CNN | The study demonstrated the feasibility of using spectroscopy, Big Data, and ML to distinguish specific grapevine varieties (Touriga Nacional or Touriga Franca) from a larger group of other varieties. |
[48] | Various crops | RF | Compares various classification strategies for vegetation mapping over large-scale areas using Sentinel data within the Google Earth Engine platform and RF algorithms for classification. |
[49] | Rapeseed | DL | Low-cost approach using DL (AlexNet, VGGNet16, ResNet18, ResNet50, and GoogLeNet) and UAV images for recognising freezing-tolerant rapeseed materials. The method achieves high accuracy (over 92%), with ResNet50 providing the best performance (93.3%), outperforming traditional ML methods. |
[50] | Pineapple | ANN, SVM, RF, NB, DT, KNN | Method involving UAV-captured RGB images, image processing, and ML classifiers to identify pineapple crowns, classify them as fruit or non-fruit, and count them accurately. The process involves pre-processing and segmenting high spatial-resolution aerial images, extracting features based on shape, color, and texture, and optimising classifiers’ performance via feature fusion using one-way analysis of variance (ANOVA). |
[51] | Corn, soybean | DL, CNN | Innovative within-season emergence (WISE) phenology-normalised DL model for scalable within-season crop mapping using time-series remote sensing data. This approach accommodates spatiotemporal variations in crop phenological dynamics, yielding an over 90% overall accuracy for classifying corn and soybeans at the end of the season, as well as a satisfactory performance (85% overall accuracy) one to four weeks earlier than calendar-based approaches during the growing season. |
Ref. | Crop Type | Models Used | Summary |
---|---|---|---|
[52] | Vineyard | ANN | Combines remote sensing, computer vision, and ML for vineyard yield estimation. By using VIs and vegetated fraction cover obtained from UAV multispectral imagery, along with ANN techniques, the approach provides accurate yield predictions with higher accuracy than traditional methods, supporting decision making in viticulture practices and harvest planning. |
[53] | Rice | BPNN, RNN | Proposes an end-to-end model for rice yield prediction using DL fusion to learn deep spatial and temporal features from time-series meteorology and area data. The model achieves accurate predictions for both summer and winter rice yields. |
[54] | Strawberry | RF, MLR, MARS, XGBoost, SVM, ANN | The combination of canopy geometric parameters and VIs obtained from UAV imagery proved effective for estimating strawberry dry biomass using ML models. ANN showed the highest accuracy in cross-validation, and red-edge-related VIs were found to be the most influential variables. |
[55] | Apple tree | Ensemble learning, SVM, KNN | Develops an automatic processing channel to extract morphological and spectral features from UAV LiDAR and multispectral imagery data. The ensemble learning model outperforms other base learners (SVM and KNN) and provides accurate yield predictions for individual apple trees in the orchard. |
[56] | Wheat grain | RF, SVM, MLR, generalised boosting regression | The research explores various VI, Sentinel-2 bands, and the biophysical parameter LAI retrieved from radiative transfer models (RTM) as input data for the models. RFRandom forest regression stands out as the most effective model. |
[57] | Winter wheat | Linear regression, Ensemble learning, DT, SVM, Gaussian Process | The study employs ML and historical data to predict winter wheat yield and dry matter, with the Gaussian process model achieving the highest accuracy (R2 = 0.87 and R2 = 0.86, respectively). The results offer valuable insights into site-specific crop management and could aid in formulating water and nitrogen management strategies for global food security. |
Ref. | Crop Type | Models Used | Summary |
---|---|---|---|
[58] | Banana plants | RF, PCA | Detects banana plants and their major diseases using satellite and UAV images and ML for classification. The developed model effectively categorised both healthy and diseased plants. |
[59] | Tomato | YOLO (v3) | Employs a machine vision approach for early real-time detection of tomato diseases and pests in natural environments. The outcomes demonstrate an average recognition accuracy of 91.8%. The developed approach has been put into practice within real tomato cultivation settings, demonstrating its effectiveness in detecting small objects and leaves occlusion. |
[60] | Not specified | SVM, CNN, KNN, NB | IoT-based that uses sensors and cameras to collect data from plants, which are then analysed via ML models. The system proposes ensemble classification and pattern recognition for crop monitoring system to identify plant diseases at the early. |
[61] | Sugarcane | CNN, YOLO (v5) | Detects White Leaf Disease in sugarcane crops using UAV imagery and DL models. The proposed methodology provides technical guidelines for effective crop management and disease monitoring. |
[62] | Watermelon | MLP, DT | Uses remote sensing, VIs, and ML for identifying and classifying different severity stages of Downy Mildew disease in watermelon. The highest classification accuracy was achieved via the MLP method. |
[63] | Rice | MLP, SVM, NB, DT, KNN | Weather-based rice blast disease-forecasting system that uses an ensemble feature ranking approach to enhance predictive accuracy. By evaluating fifteen weather features, the proposed method identifies the most impactful ones. Among these features, average visibility, rainfall amount, sun exposure hours, maximum wind speed, and rainy days emerge as the most influential in rice blast prediction. |
[64] | Potatoes | ANN | Innovative approach to the early detection of Verticillium wilt in potatoes using near-infrared spectroscopy and ANN models. The models accurately predict physiological responses to infection and classify infected plants within just two days after inoculation, even before visible symptoms appear. |
Ref. | Crop Type | Models Used | Summary |
---|---|---|---|
[66] | Vineyard | DT with object-based image analysis | Innovative approach for mapping Cynodon dactylon (bermudagrass) infestations in vineyard cover crops using an automatic DT-OBIA algorithm combined with UAV imagery. This method is crucial due to the negative impacts of bermudagrass on vineyard productivity. |
[67] | Wheat | SVM with Radial Basis Function | Assesses weed impact on wheat biomass using RGB images and proximal sensing techniques. The SVM model discriminates between crop and weeds and generates indicators like weed pressure and local wheat biomass production. |
[68] | Wheat | DNN | Detection of Italian ryegrass in wheat fields using UAV imagery (RGB) and DNN, along with an extensive feature selection method to accurately detect ryegrass in wheat and estimate its canopy coverage. Predictive models were developed to relate early-season ryegrass canopy coverage with end-of-season ryegrass biomass and seed yield, as well as wheat biomass and grain yield reduction. |
[69] | Wheat | DL with SVM, KNN, NN | Novel approach for classifying weed and wheat in drone-captured images, integrating an optimised voting classifier with NN, SVM, and KNN to classify features extracted using AlexNet via transfer learning. |
[70] | Corn | YOLO (v7) | Identifies major pests (corn borer, armyworm, and bollworm) of corn using the YOLOv7 network combined with the Adam optimiser. The approach demonstrates the feasibility of using DL and advanced optimisation techniques for effective crop pests and disease identification, contributing to agricultural modernisation. |
Ref. | Crop Field | Models Used | Summary |
---|---|---|---|
[72] | Maize | Linear regression, RF, Cubist, PLS, PCA, GBT | Uses remote sensing data and regression algorithms for predicting ETa and soil water content to enable remote irrigation management. The study employs VIs for training and phenology observations. Cubist showed slightly better performance for predicting ETa and RF for soil water content. |
[73] | Cranberry | RF, XGBoost | Forecasts water table depth using DT-based modeling approaches for optimised irrigation management. XGBoost demonstrated superior predictive ability, accurately simulating water table depth fluctuations for longer periods than RF. Despite limitations with extrapolation and extreme events, the models hold potential with broader dataset ranges for practical applications. |
[74] | Not applicable | KNN | Portable smart sensing system based on IoT for detecting nitrate, phosphate, pH, and temperature in water. KNN algorithm is used to enhance the accuracy of the system’s analysis. The proposed system offers early hazard detection and promotes regular contaminant level evaluation. |
[75] | Not specified | PCA, SVM, GBT | Focuses on accurately predicting crop ETo for efficient water resource management and irrigation. The research employs PCA techniques to identify key factors influencing ETo that are then used as inputs for prediction models. PSO was used to optimise SVM and GBT models. The PSO-GBT model exhibits the highest accuracy. |
[76] | Maize | DT, RF, SVM, ANN, PLS | Uses UAV multispectral data and ML for estimating water content indicators, including equivalent water thickness, fuel moisture content, and specific leaf area of maize crops in smallholder farms. RF and SVM outperform others in predicting water content indicators. This approach offers accurate insights into drought-related water stress on smallholder farms. |
[77] | Banana plants | KNN, GBT, LSTM | Employs IoT components to gather data (soil moisture, temperature, and weather conditions) and ML to optimise irrigation requirements and reduce energy consumption. The hybrid model predicts real-time and time-series water needs based on various observations. The work is demonstrated using banana cultivation, achieving up to a 31.4% water optimisation for a single banana tree. |
[78] | Grains, vegetables, fruits, flowers | RF, NN, SVM | Predicts phosphorus concentrations in shallow groundwater in intensive agricultural regions. SVM achieved the highest accuracy (R2 = 0.60). These findings support groundwater phosphorus monitoring, early warning, and pollution management decision making in intensive agricultural regions. |
Ref. | Crop Field | Models Used | Summary |
---|---|---|---|
[81] | Various soil samples | RF, SVM, Logistic Regression | Predicts disease occurrence with high accuracy by analysing soil macroecological patterns of Fusarium wilt, a destructive soil-borne plant disease. The research employs a ML approach using bacterial and fungal data sets from diseased and healthy soils across various countries and plant varieties. The results reveal distinct differences in bacterial and fungal communities between healthy and diseased soils. |
[82] | Canola | RF | The research utilises a ML approach to determine key predictors of soil nitrous oxide (N2O) emissions, including soil temperature, moisture, and nitrate availability. The results highlight that N2O emissions were influenced by these factors, with emission factors being lower in high yield zones compared to low yield zones. |
[80] | Maize, soybean | DT, RF, Cubist, Gaussian Process, SVM, ANN | Estimates soil organic matter (SOM) and soil moisture content (SMC) based on 22 color and texture features extracted from cell phone images. The study demonstrates the potential of using computer vision and ML to create an efficient proximal soil sensor for quick and accurate predictions of soil properties. Gaussian Process and Cubist models performed the best for SMC prediction, while ANN and Cubist showed satisfactory accuracy for SOM prediction. |
[83] | Vineyard | NN regression, KNN, SVM with Linear Kernel, XGBoost, Cubist | Explores the potential of using soil protists as bioindicators to assess multiple stresses in agricultural soils. The findings indicate that changes in protist taxa occurrence and diversity metrics are effective predictors of key soil variables, with soil copper concentration, moisture, pH, and basal respiration being particularly well predicted. |
[84] | Rice | CNN | A CNN model is developed to predict heavy metal (Cadmium, Lead, Chromium, Arsenic, and Mercury) concentrations in soil–rice system using 17 environmental factors. The model exhibits strong predictive accuracy, especially for Cadmium and Mercury. The study emphasises the model’s stability and robustness, particularly for quick predictions during emergencies. |
[85] | Wheat, maize, peanut | RF, NN (regression, radial basis function), BPNN, ELM | Introduces a method for farmland surface soil moisture retrieval using feature (extracted from Sentinel-1/2 and Radarsat-2 remote sensing data) optimisation and ML. RF model exhibited the highest accuracy. The proposed method shows potential for accurate surface soil moisture retrieval and offers insights for future applications in other farmland surface types. |
[86] | Not specified | ANN, KNN, SVM, RF, GBT, XGBoost, MLR, Cubist | Estimates soil water, salt contents, and bulk density from time domain reflectometry measurements using various ML algorithms. The research demonstrates that soil particle-size fractions are crucial predictors for all the targeted soil properties. XGBoost is recommended for accurate soil gravimetric water content and bulk density estimation, while GBT is suggested for precise volumetric water content and soil salt content prediction. |
Ref. | Animal | Models Used | Summary |
---|---|---|---|
[90] | Sheep | RF, SVM, LDA | Uses inertial motion sensors on 17 Merino sheep to collect behaviour data. Three ML approaches were employed to classify sheep behaviours accurately. Incorporating features from a range of time window sizes, spanning 2 to 15 s, significantly improved behaviour classification accuracy compared to a single window size. Among the methods, RF yielded the best results. |
[91] | Dairy cows | DT, MLP, KNN, LSTM | Employs ML to detect abnormal behaviours in dairy cows with subacute ruminal acidosis (SARA), a condition known to induce behavioural changes. Monitoring 14 cows with SARA and 14 control cows involved tracking ruminal pH measurements and activity via stable-based positioning systems. KNN model exhibited the highest performance by identifying 83% of SARA cases. The study concludes that ML can successfully identify behaviour anomalies indicative of health issues. |
[92] | Dairy cows | RF | This research offers insights into enhancing dairy farm management by predicting milk production trends under heat stress conditions, thereby increasing both productivity and animal welfare. The results demonstrate that the RF model is effective in detecting the impact of extreme heat conditions on milk yield, with an average relative error of about 18% for single daily yield predictions and 2% for total milk production. |
[93] | Pigs | SVM, CNN | Automates the recognition and scoring of multiple postures of grouped pigs using depth images and a CNN-SVM model. The approach proves effective for detecting pig postures under commercial conditions, showing potential for improving pig welfare, health assessment, and behaviour analysis. |
[94] | Invasive insects | DT, SVM, CNN, KNN, Ensemble learning (Boosted and Bagged Trees) | The proposed framework uses multi-modal data, including 3-D trajectories and infrared imagery, along with a multi-evidence approach to detect invasive insects near beehives. The framework achieves a high classification accuracy of 97.1% for Vespa hornets and honeybees, showing the potential to ensure the safety and smart monitoring of beehives against invasive species. |
[95] | Fish | CNN, YOLO (v5) | The study uses a CNN for fish detection in recirculating aquaculture systems. The authors employ the one-stage YOLOv5 model and compare it with a two-stage Faster R-CNN model. The aim is to enhance fish production management via AI assistance. |
[96] | Broilers | KNN, SVM, DT, RF, GBT | Identification of aflatoxin-poisoned broilers via wearable accelerometers and ML. Poisoned broilers exhibit distinct behavioural changes, such as reduced time spent on feeding, drinking, walking, and standing, as well as increased sitting behaviour. The study successfully demonstrated that the used ML models can accurately identify poisoned broilers, particularly those with higher aflatoxin concentrations, with GBT showing the best performance. |
Challenge | Explanation | Proposed Solutions/Research Opportunities |
---|---|---|
Adaptability | Agricultural practices vary widely across regions, crops, and farming systems. Developing ML-based systems that are adaptable to diverse agricultural scenarios is a critical research frontier. | Developing adaptable models and algorithms that can be customised to suit diverse agricultural environments. Explore Transfer Learning techniques that allow models to leverage knowledge from one domain to another, making them more versatile and adaptable. |
Data accessibility | Encompass the efficient management of data, ensuring it is readily available to be used. For example, a delay in accessing data due to storage issues could hinder the real-time capabilities of ML applications. | Optimising data management systems and storage solutions, ensuring both efficiency and security. |
Data accuracy | Accurate data are critical for training ML models. Inaccurate data can lead to incorrect predictions or recommendations. | Ensuring that data are accurate, credible, and trustworthy by exploring methods for data validation and quality assurance. |
Data completeness | Incomplete data may result in biased or incomplete ML models. For example, missing data points in a crop monitoring dataset may hinder the model’s ability to accurately predict crop yield. | Exploring techniques for data imputation/extrapolation to address missing data in agricultural datasets. Investigating methods for optimising models’ performance in the presence of incomplete information (e.g., Feature Engineering). |
Data consistency | Consistent data ensures that ML models are reliable and reproducible. For example, inconsistent labeling of images in a crop disease detection dataset could lead to incorrect classification. | Exploring data validation and cleaning techniques to ensure consistency in agricultural datasets. Developing techniques that can identify and rectify inconsistencies. |
Data context | ML models need to be trained on data that are relevant to the specific agricultural task at hand. For example, using weather data from a different region may not provide accurate predictions for local farming conditions. | Investigating techniques for adapting ML models based on the specific agricultural context. A possible approach could be the use of Transfer Learning as it involves leveraging pre-trained models on similar tasks or domains and fine-tuning them using local data. |
Data security and privacy | Agricultural data are often sensitive information that requires compliance with data protection regulations. | Exploring mechanisms that encompass data anonymisation, access control, and compliance with evolving data protection regulations will be crucial in building a foundation of trust for ML-driven agricultural solutions. |
Data timeliness | Delayed/outdated data can lead to non-optimal results, impeding the potential benefits derived from ML-driven insights. However, it should be noted that there are scenarios in which historical data can be of significant use as it can offer invaluable insight into long-term trends, cyclical patterns, and the cumulative effects of farming practices. | Exploring methods for real-time data acquisition and processing that can adapt and make decisions based on the most up-to-date data, ensuring timely responses in ML applications. However, depending on the case at hand, a hybrid approach can be used, striking a balance between integrating real-time and historical data. This involves using real-time data for immediate decision making and integrating historical data for long-term strategic planning. |
Human–machine collaboration | ML-based systems should enhance, rather than replace, human expertise in agriculture. Designing systems that facilitate seamless collaboration between stakeholders is an emerging area of research. | Designing collaborative decision making frameworks that seamlessly integrate ML insights with human expertise. Developing interfaces that empower users to interact with and guide ML models in agricultural tasks. |
Interpretability and explainability | ML-based systems pose a significant challenge in gaining the trust and acceptance of farmers, stakeholders, and the agricultural industry. It is important to understand how models achieve their outputs. | Ensuring that ML models are transparent and that their inner workings are accessible. This means providing information on the features, variables, and algorithms that contribute to a model’s results. Techniques such as SHAP values [97] or LIME [98] can be useful to identify which features are most influential in a model’s predictions. |
Limited literacy | Generally, aged workers may have limited literacy on digital technologies that could cause resistance or difficulties in adopting and effectively utilising technologies from the Agriculture 4.0. | Investing on training methods (e.g., workshops, courses), knowledge transfer, and skill-building in the context of ML-based technologies. Designing user-friendly interfaces tailored to older workers. |
Resource constraints | ML-based systems often necessitate real-time processing and decision making. Remote regions or resource-constrained enterprises may lack the computational resources required for data processing. | Developing lightweight and efficient models that can operate effectively in low-resource scenarios. Investigating techniques for distributed and edge computing. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Araújo, S.O.; Peres, R.S.; Ramalho, J.C.; Lidon, F.; Barata, J. Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy 2023, 13, 2976. https://doi.org/10.3390/agronomy13122976
Araújo SO, Peres RS, Ramalho JC, Lidon F, Barata J. Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy. 2023; 13(12):2976. https://doi.org/10.3390/agronomy13122976
Chicago/Turabian StyleAraújo, Sara Oleiro, Ricardo Silva Peres, José Cochicho Ramalho, Fernando Lidon, and José Barata. 2023. "Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives" Agronomy 13, no. 12: 2976. https://doi.org/10.3390/agronomy13122976
APA StyleAraújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976