Use and Adaptations of Machine Learning in Big Data—Applications in Real Cases in Agriculture
Abstract
:1. Introduction
2. Background
2.1. Machine Learning
2.2. Big Data
3. Related Work
3.1. Machine Learning in Agriculture
3.2. Big Data in Agriculture
3.3. Challenges of Machine Learning in Big Data
3.4. ML Adaptations in Big Data
4. Research Methodology
4.1. Research Objetives
4.2. Research Questions
4.3. Search String
4.4. Screening of Relevant Papers
- Articles that do not used Big Data and ML system.
- Application in real cases in the field of agriculture.
- Papers published other than conferences, journals and technical reports.
- Articles without defining data sources.
- Articles not published in the English language.
- Papers published before 2015.
- Papers that are not relevant to the search string.
4.5. Keywording Using Abstract
4.6. Quality Assessment
4.7. Study Selection Process
4.8. Data Extraction Method
5. Analysis
5.1. Selection of Results
5.1.1. Answer for RQ1: What Are the Major Targeted Primary Publication Channels for Big Data and Machine Learning Research in Agriculture?
5.1.2. Answer for RQ2: How Has the Frequency of Architectures Been Changed of Big Data in Agriculture over Time?
5.1.3. Answer for RQ3: What Type of Problems Are Solved Using Big Data and Machine Learning in the Field of Agriculture?
5.1.4. Answer for RQ4: What Is the Area of Agriculture in Which Big Data Machine Learning Is Used?
5.1.5. Answer for RQ5: What Big Data and Machine Learning Architectures Are Used to Solve Problems in Agriculture?
5.1.6. Answer for RQ6: What Approaches Were Used to Solve the Problems?
5.1.7. Answer for RQ7: What Are the Main Application Domains of Big Data and Machine Learning in Agriculture?
5.1.8. Answer for RQ8: What Machine Learning Techniques Have Been Used in the Big Data Architectures Found?
5.1.9. Answer for RQ9: What Are the Adaptations That ML Uses in Big Data in the Field of Agriculture?
6. Discussion
6.1. Big Data and ML in Agriculture
6.2. Technologies Used in Big Data and ML Architectures for Agriculture
6.3. A Framework of Technological Challenges for the Use of ML and Big Data in Agriculture
7. Threats to Validity
7.1. Construct Validity
7.2. Internal Validity
7.3. External Validity
7.4. Conclusion Validity
8. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Slavin, P. Climate and famines: A historical reassessment. Wiley Interdiscip. Rev. Clim. Chang. 2016, 7, 433–447. [Google Scholar] [CrossRef]
- El Bilali, H.; Bassole, I.H.N.; Dambo, L.; Berjan, S. Climate change and food security. Agric. For. 2020, 66, 197–210. [Google Scholar]
- Pozza, L.E.; Field, D.J. The science of Soil Security and Food Security. Soil Secur. 2020, 1, 100002. [Google Scholar] [CrossRef]
- Gebbers, R.; Adamchuk, V.I. Precision Agriculture and Food Security. Science 2010, 327, 828–831. [Google Scholar] [CrossRef] [PubMed]
- WFS. Declaration on World Food Security and World Food Summit Plan of Action; WFS: Rome, Italy, 1996. [Google Scholar]
- Kamilaris, A.; Kartakoullis, A.; Prenafeta-Boldú, F.X. A review on the practice of big data analysis in agriculture. Comput. Electron. Agric. 2017, 143, 23–37. [Google Scholar] [CrossRef]
- Liakos, K.G.; Busato, P.; Moshou, D.; Pearson, S.; Bochtis, D. Machine learning in agriculture: A review. Sensors 2018, 18, 2674. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Sundmaeker, H.; Verdouw, C.; Wolfert, S.; Pérez Freire, L. Internet of Food and Farm. In Digitising the Industry—Internet of Things Connecting the Physical, Digital and Virtual Worlds; Vermesan, O., Friess, P., Eds.; River Publishers: Gistrup/Delft, Denmark, 2017. [Google Scholar]
- Wolfert, S.; Ge, L.; Verdouw, C.; Bogaardt, M.-J. Big data in smart farming—A review. Agric. Syst. 2017, 153, 69–80. [Google Scholar] [CrossRef]
- Nandyala, C.S.; Kim, H.-K. Big and Meta Data Management for U-Agriculture Mobile Services. Int. J. Softw. Eng. Appl. 2016, 10, 257–270. [Google Scholar] [CrossRef] [Green Version]
- Sun, A.Y.; Scanlon, B.R. How can Big Data and machine learning benefit environment and water management: A survey of methods, applications, and future directions. Environ. Res. Lett. 2019, 14, 073001. [Google Scholar] [CrossRef]
- Prudius, A.A.; Karpunin, A.A.; Vlasov, A.I. Analysis of machine learning methods to improve efficiency of big data processing in Industry 4.0. J. Phys. Conf. Ser. 2019, 1333, 032065. [Google Scholar] [CrossRef]
- Ryan, I.; Li-Minn, A.; Phooi, S.K.; Broster, J.C.; Pratley, J.E. Big data and machine learning for crop protection. Comput. Electron. Agric. 2018, 151, 376–383. [Google Scholar]
- Wu, H.; Meng, F.J. Review on Evaluation Criteria of Machine Learning Based on Big Data. J. Phys. Conf. Ser. 2020, 1486, 052026. [Google Scholar] [CrossRef] [Green Version]
- Sassi, I.; Ouaftouh, S.; Anter, S. Adaptation of Classical Machine Learning Algorithms to Big Data Context: Problems and Challenges: Case Study: Hidden Markov Models Under Spark. In Proceedings of the 2019 1st International Conference on Smart Systems and Data Science (ICSSD), Rabat, Morocco, 3–4 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–7. [Google Scholar]
- Al-Jarrah, O.Y.; Yoo, P.D.; Muhaidat, S.; Karagiannidis, G.K.; Taha, K. Efficient Machine Learning for Big Data: A Review. Big Data Res. 2015, 2, 87–93. [Google Scholar] [CrossRef] [Green Version]
- Che, D.; Safran, M.; Peng, Z. From Big Data to Big Data Mining: Challenges, Issues, and Opportunities. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2010, Beijing, China, 20–24 September 2010; Springer: Berlin, Germany, 2013; pp. 1–15. [Google Scholar]
- Chen, X.-W.; Lin, X. Big Data Deep Learning: Challenges and Perspectives. IEEE Access 2014, 2, 514–525. [Google Scholar] [CrossRef]
- Slavakis, K.; Giannakis, G.B.; Mateos, G. Modeling and Optimization for Big Data Analytics: (Statistical) learning tools for our era of data deluge. IEEE Signal Process. Mag. 2014, 31, 18–31. [Google Scholar] [CrossRef]
- Qiu, J.; Wu, Q.; Ding, G.; Xu, Y.; Feng, S. A survey of machine learning for big data processing. EURASIP J. Adv. Signal Process. 2016, 2016. [Google Scholar] [CrossRef] [Green Version]
- Bhatnagar, R. Machine Learning and Big Data Processing: A Technological Perspective and Review. Adv. Intell. Syst. Comput. 2018, 468–478. [Google Scholar] [CrossRef]
- James, M.; Michael, C.; Brad, B.; Jacques, B. Big Data: The Next Frontier for Innovation, Competition and Productivity; McKinsey Global Institute: New York, NY, USA, 2011. [Google Scholar]
- Neethirajan, S. The role of sensors, big data and machine learning in modern animal farming. Sens. Bio Sens. Res. 2020, 29, 100367. [Google Scholar] [CrossRef]
- L’Heureux, A.; Grolinger, K.; El Yamany, H.F.; Capretz, M.A.M. Machine Learning with Big Data: Challenges and Approaches. IEEE Access 2017, 5, 7776–7797. [Google Scholar] [CrossRef]
- Keele, S. Guidelines for Performing Systematic Literature Reviews in Software Engineering; Technical Report, Ver. 2.3; EBSE: Durham, UK, 2007. [Google Scholar]
- Cherkassky, V.; Mulier, F. Learning from Data: Concepts, Theory and Methods; Wiley: Hoboken, NJ, USA, 2007. [Google Scholar]
- Rudin, C.; Wagstaff, K.L. Machine learning for science and society. Mach. Learn. 2013, 95, 1–9. [Google Scholar] [CrossRef] [Green Version]
- Elshawi, R.; Sakr, S.; Talia, D.; Trunfio, P. Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service. Big Data Res. 2018, 14, 1–11. [Google Scholar] [CrossRef] [Green Version]
- Haig, B.D. Big data science: A philosophy of science perspective. Big Data Psychol. Res. 2020, 15–33. [Google Scholar] [CrossRef]
- De Mauro, A.; Greco, M.; Grimaldi, M. A formal definition of Big Data based on its essential features. Libr. Rev. 2016, 65, 122–135. [Google Scholar] [CrossRef]
- Demchenko, Y.; De Laat, C.; Membrey, P. Defining architecture components of the Big Data Ecosystem. In Proceedings of the 2014 International Conference on Collaboration Technologies and Systems (CTS), Minneapolis, MN, USA, 19–23 May 2014; pp. 104–112. [Google Scholar] [CrossRef] [Green Version]
- Santos, M.Y.; Sá, J.O.E.; Costa, C.; Galvão, J.; Andrade, C.; Martinho, B.; Lima, F.V.; Costa, E. A Big Data Analytics Architecture for Industry 4.0. Adv. Intell. Syst. Comput. 2017, 175–184. [Google Scholar] [CrossRef]
- Sowmya, R.; Suneetha, K. Data mining with big data. In Proceedings of the 2017 11th International Conference on Intelligent Systems and Control. (ISCO), Coimbatore, India, 5–6 January 2017; IEEE: Piscataway, NJ, USA; pp. 246–250. [Google Scholar]
- Ordonez, C.; Garcia-Alvarado, C.; Song, I.-Y. Special issue on DOLAP 2015: Evolving data warehousing and OLAP cubes to big data analytics. Inf. Syst. 2017, 68, 1–2. [Google Scholar] [CrossRef]
- Song, I.-Y.; Zhu, Y. Big data and data science: What should we teach? Expert Syst. 2015, 33, 364–373. [Google Scholar] [CrossRef]
- Sarker, M.N.I.; Islam, M.S.; Ali, M.A.; Islam, M.S.; Salam, M.A.; Mahmud, S.H. Promoting digital agriculture through big data for sustainable farm management. Int. J. Innov. Appl. Stud. 2019, 25, 1235–1240. [Google Scholar]
- Zhou, L.; Pan, S.; Wang, J.; Vasilakos, A.V. Machine learning on big data: Opportunities and challenges. Neurocomputing 2017, 237, 350–361. [Google Scholar] [CrossRef] [Green Version]
- Chan, K.Y.; Kwong, C.; Wongthongtham, P.; Jiang, H.; Fung, C.K.; Abu-Salih, B.; Liu, Z.; Wong, T.; Jain, P. Affective design using machine learning: A survey and its prospect of conjoining big data. Int. J. Comput. Integr. Manuf. 2018, 33, 645–669. [Google Scholar] [CrossRef] [Green Version]
- Isabella, S.J.; Srinivasan, S. An understanding of machine learning techniques in big data analytics: A survey. Int. J. Eng. Technol. 2018, 7, 666–672. [Google Scholar] [CrossRef]
- Mahajan, D.; Park, J.; Amaro, E.; Sharma, H.; Yazdanbakhsh, A.; Kim, J.K.; Esmaeilzadeh, H. TABLA: A unified template-based framework for accelerating statistical machine learning. In Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture (HPCA), Barcelona, Spain, 12–16 March 2016. [Google Scholar] [CrossRef]
- Rathor, A.; Gyanchandani, M. A review at Machine Learning algorithms targeting big data challenges. In Proceedings of the 2017 International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT), Mysuru, India, 15–16 December 2017; pp. 1–7. [Google Scholar] [CrossRef]
- Divya, K.S.; Bhargavi, P.; Jyothi, S. Machine Learning Algorithms in Big data Analytics. Int. J. Comput. Sci. Eng. 2018, 6, 63–70. [Google Scholar] [CrossRef]
- Swathi, R.; Seshadri, R. Systematic survey on evolution of machine learning for big data. In Proceedings of the 2017 International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 15–16 June 2017; pp. 204–209. [Google Scholar] [CrossRef]
- Dybå, T.; Dingsøyr, T. Empirical studies of agile software development: A systematic review. Inf. Softw. Technol. 2008, 50, 833–859. [Google Scholar] [CrossRef]
- Petersen, K.; Feldt, R.; Mujtaba, S.; Mattsson, M. Systematic mapping studies in software engineering. In Proceedings of the 12th International Conference on Evaluation and Assessment in Software Engineering (EASE), Bari, Italy, 26–27 June 2008; pp. 1–10. [Google Scholar]
- Farooq, M.S.; Riaz, S.; Abid, A.; Umer, T.; Bin Zikria, Y. Role of IoT Technology in Agriculture: A Systematic Literature Review. Electronics 2020, 9, 319. [Google Scholar] [CrossRef] [Green Version]
- Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Dutta, R.; Li, C.; Smith, D.; Das, A.; Aryal, J. Big Data Architecture for Environmental Analytics. New Trends Nonlinear Control Theory 2015, 578–588. [Google Scholar] [CrossRef] [Green Version]
- Balducci, F.; Impedovo, D.; Pirlo, G. Machine Learning Applications on Agricultural Datasets for Smart Farm Enhancement. Machines 2018, 6, 38. [Google Scholar] [CrossRef] [Green Version]
- Gómez, D.; Salvador, P.; Sanz, J.; Casanova, J.L. Potato Yield Prediction Using Machine Learning Techniques and Sentinel 2 Data. Remote Sens. 2019, 11, 1745. [Google Scholar] [CrossRef] [Green Version]
- Taghizadeh-Mehrjardi, R.; Nabiollahi, K.; Rasoli, L.; Kerry, R.; Scholten, T. Land Suitability Assessment and Agricultural Production Sustainability Using Machine Learning Models. Agronomy 2020, 10, 573. [Google Scholar] [CrossRef]
- Wei, M.C.F.; Maldaner, L.F.; Ottoni, P.M.N.; Molin, J.P. Carrot Yield Mapping: A Precision Agriculture Approach Based on Machine Learning. AI 2020, 1, 229–241. [Google Scholar] [CrossRef]
- Mosavi, A.; Sajedi-Hosseini, F.; Choubin, B.; Taromideh, F.; Rahi, G.; Dineva, A.A. Susceptibility Mapping of Soil Water Erosion Using Machine Learning Models. Water 2020, 12, 1995. [Google Scholar] [CrossRef]
- Abbas, F.; Afzaal, H.; Farooque, A.A.; Tang, S. Crop Yield Prediction through Proximal Sensing and Machine Learning Algorithms. Agronomy 2020, 10, 1046. [Google Scholar] [CrossRef]
- Tombe, R. Computer Vision for Smart Farming and Sustainable Agriculture. In Proceedings of the 2020 IST-Africa Conference (IST-Africa), Kampala, Uganda, 18–22 May 2020; IEEE: Piscataway, NJ, USA, 2020. [Google Scholar]
- Diaz, C.A.M.; Castaneda, E.E.M.; Vassallo, C.A.M. Deep Learning for Plant Classification in Precision Agriculture. In Proceedings of the 2019 International Conference on Computer, Control, Informatics and its Applications (IC3INA), Tangerang, Indonesia, 23–24 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 9–13. [Google Scholar]
- Priya, R.; Ramesh, D.; Khosla, E. Crop Prediction on the Region Belts of India: A Naïve Bayes MapReduce Precision Ag-ricultural Model. In Proceedings of the 2018 Int. Conf. Adv. Comput. Commun. informatics (ICACCI), Bangalore, India, 19–22 September 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 99–104. [Google Scholar]
- Doshi, Z.; Nadkarni, S.; Agrawal, R.; Shah, N. AgroConsultant: Intelligent Crop Recommendation System Using Machine Learning Algorithms. In Proceedings of the 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA), Pune, India, 16–18 August 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Shelestov, A.; Lavreniuk, M.; Vasiliev, V.; Shumilo, L.; Kolotii, A.; Yailymov, B.; Kussul, N.; Yailymova, H. Cloud Approach to Automated Crop Classification Using Sentinel-1 Imagery. IEEE Trans. Big Data 2020, 6, 572–582. [Google Scholar] [CrossRef]
- Nobrega, L.; Tavares, A.; Cardoso, A.; Goncalves, P. Animal monitoring based on IoT technologies. In Proceedings of the 2018 IoT Vertical and Topical Summit on Agriculture—Tuscany (IOT Tuscany), Monteriggioni, Italy,, 8–9 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–5. [Google Scholar]
- Lee, W.; Ham, Y.; Ban, T.-W.; Jo, O. Analysis of Growth Performance in Swine Based on Machine Learning. IEEE Access 2019, 7, 161716–161724. [Google Scholar] [CrossRef]
- Garcia, M.B.; Ambat, S.; Adao, R.T. Tomayto, Tomahto: A Machine Learning Approach for Tomato Ripening Stage Identification Using Pixel-Based Color Image Classification. In Proceedings of the 2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Laoag, Philippines, 29 November–1 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
- Alsahaf, A.; Azzopardi, G.; Ducro, B.; Hanenberg, E.; Veerkamp, R.F.; Petkov, N. Estimation of Muscle Scores of Live Pigs Using a Kinect Camera. IEEE Access 2019, 7, 52238–52245. [Google Scholar] [CrossRef]
- Kumar, C.S.; Sharma, V.K.; Yadav, A.K.; Singh, A. Perception of Plant Diseases in Color Images Through Adaboost. In Advances in Intelligent Systems and Computing; Springer: Berlin, Germany, 2020; pp. 506–511. [Google Scholar]
- Amani, M.; Kakooei, M.; Moghimi, A.; Ghorbanian, A.; Ranjgar, B.; Mahdavi, S.; Davidson, A.; Fisette, T.; Rollin, P.; Brisco, B.; et al. Application of Google Earth Engine Cloud Computing Platform, Sentinel Imagery, and Neural Networks for Crop Mapping in Canada. Remote Sens. 2020, 12, 3561. [Google Scholar] [CrossRef]
- Rehman, A.; Liu, J.; Keqiu, L.; Mateen, A.; Yasin, M.Q. Machine learning prediction analysis using IoT for smart farming. Int. J. Emerg. Trends Eng. Res. 2020, 8, 6482–6487. [Google Scholar]
- Gumma, M.K.; Thenkabail, P.S.; Teluguntla, P.G.; Oliphant, A.; Xiong, J.; Giri, C.; Pyla, V.; Dixit, S.; Whitbread, A.M. Agricultural cropland extent and areas of South Asia derived using Landsat satellite 30-m time-series big-data using random forest machine learning algorithms on the Google Earth Engine cloud. GIScience Remote Sens. 2020, 57, 302–322. [Google Scholar] [CrossRef] [Green Version]
- Gnanasankaran, N.; Ramaraj, E. The effective yield of paddy crop in Sivaganga district—An initiative for smart farming. Int. J. Sci. Technol. Res. 2020, 9, 6452–6455. [Google Scholar]
- Tarik, H.; Mohammed, O.J. Big Data Analytics and Artificial Intelligence Serving Agriculture. In Advances in Intelligent Systems and Computing; Springer: Berlin, Germany, 2020; pp. 57–65. [Google Scholar]
- Swain, M.; Singh, R.; Thakur, A.K.; Gehlot, A. A machine learning approach of data mining in agriculture 4.0. Int. J. Emerg. Technol. 2020, 11, 257–262. [Google Scholar]
- Wang, X.; Yang, K.; Liu, T. The Implementation of a Practical Agricultural Big Data System. In Proceedings of the 2019 IEEE 5th International Conference on Computer and Communications (ICCC), Chengdu, China, 6–9 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1955–1959. [Google Scholar]
- Fenu, G.; Malloci, F.M. An Application of Machine Learning Technique in Forecasting Crop Disease. In Proceedings of the 2019 3rd International Conference on Big Data Research, Paris, France, 20–22 November 2019. [Google Scholar] [CrossRef]
- Moon, T.; Hong, S.; Choi, H.Y.; Jung, D.H.; Chang, S.H.; Son, J.E. Interpolation of greenhouse environment data using multilayer perceptron. Comput. Electron. Agric. 2019, 166, 105023. [Google Scholar] [CrossRef]
- Aiken, V.C.F.; Dórea, J.R.R.; Acedo, J.S.; De Sousa, F.G.; Dias, F.G.; Rosa, G.J.D.M. Record linkage for farm-level data analytics: Comparison of deterministic, stochastic and machine learning methods. Comput. Electron. Agric. 2019, 163, 104857. [Google Scholar] [CrossRef]
- Ochoa, K.S.; Guo, Z. A framework for the management of agricultural resources with automated aerial imagery detec-tion. Comput. Electron. Agric. 2019, 162, 53–69. [Google Scholar] [CrossRef]
- Sathiaraj, D.; Huang, X.; Chen, J. Predicting climate types for the Continental United States using unsupervised clustering techniques. Environmetrics 2019, 30, e2524. [Google Scholar] [CrossRef]
- Vasumathi, M.T.; Kamarasan, M. Fruit disease prediction using machine learning over big data. Int. J. Recent Technol. Eng. 2019, 7, 556–559. [Google Scholar]
- Saggi, M.K.; Jain, S. Reference evapotranspiration estimation and modeling of the Punjab Northern India using deep learning. Comput. Electron. Agric. 2019, 156, 387–398. [Google Scholar] [CrossRef]
- Yang, J.; Liu, M.; Lu, J.; Miao, Y.; Hossain, M.A.; Alhamid, M.F. Botanical Internet of Things: Toward Smart Indoor Farming by Connecting People, Plant, Data and Clouds. Mob. Netw. Appl. 2017, 23, 188–202. [Google Scholar] [CrossRef]
- Yahata, S.; Onishi, T.; Yamaguchi, K.; Ozawa, S.; Kitazono, J.; Ohkawa, T.; Yoshida, T.; Murakami, N.; Tsuji, H. A hybrid machine learning approach to automatic plant phenotyping for smart agriculture. In Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–19 May 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1787–1793. [Google Scholar]
- Khine, P.P.; Wang, Z.S. Data lake: A new ideology in big data era. ITM Web Conf. 2018, 17, 03025. [Google Scholar] [CrossRef]
- Hirve, S.; Reddy, C.H.P. A Survey on Visualization Techniques Used for Big Data Analytics. Adv. Intell. Syst. Comput. 2019, 447–459. [Google Scholar] [CrossRef]
- Jun, S. Business Intelligence Visualization Technology and Its Application in Enterprise Management. In Proceedings of the 2020 2nd International Conference on Big Data Engineering and Technology; Association for Computing Machinery (ACM), Singapore, 3–5 January 2020; pp. 45–48. [Google Scholar]
- Chen, Q.; Zobel, J.; Verspoor, K. Evaluation of a machine learning duplicate detection method for bioinformatics data-bases. In Proceedings of the ACM Ninth International Workshop on Data and Text. Mining in Biomedical Informatics, Melbourne, Australia, 23 October 2015; pp. 4–12. [Google Scholar]
- Barga, R.; Fontama, V.; Tok, W.H. Cortana Analytics. Predict. Anal. Microsoft Azure Mach. Learn. 2015, 279–283. [Google Scholar] [CrossRef]
- Google. Google Cloud Machine Learning. 2016. Available online: https://cloud.google.com/products/machine-learning/ (accessed on 15 November 2016).
- A.W.S. Amazon. Machine Learning. 2016. Available online: https://aws.amazon.com/machine-learning/ (accessed on 7 June 2016).
- IBM. IBM Watson Ecosystem Program. 2014. Available online: http://www-03.ibm.com/innovation/us/watson/ (accessed on 8 January 2014).
- Padhi, B.K.; Nayak, S.; Biswal, B. Machine Learning for Big Data Processing: A Literature Review. Int. J. Innov. Res. Technol. 2018, 5, 359–368. [Google Scholar]
- Al-Fuqaha, A.; Guizani, M.; Mohammadi, M.; Aledhari, M.; Ayyash, M. Internet of Things: A survey on enabling technologies, protocols and applications. IEEE Commun. Surv. Tutor. 2015, 17, 2347–2376. [Google Scholar] [CrossRef]
- Grolinger, K.; Hayes, M.; Higashino, W.A.; L’Heureux, A.; Allison, D.S.; Capretz, M.A. Challenges for MapReduce in Big Data. In Proceedings of the 2014 IEEE World Congress on Services, Anchorage, AK, USA, 27 June–2 July 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 182–189. [Google Scholar]
Classification Type | Supervised Learning | Unsupervised Learning | Reinforcement Learning |
---|---|---|---|
Data Processing Tasks | Estimation Classification Regression | Clustering Prediction | Decision-making |
Learning Algorithms | Support vector machine Bayesian networks Neural networks Naive Bayes Hidden Markov model | Dirichlet process mixture model X-means K-means Gaussian mixture model | TD-learning Sarsa learning Q-learning R-learning |
Typology | Features |
---|---|
Agricultural Big Data | A lot of data collected from various sectors and stages of agriculture. Stored and processed in the computer for use and reuse for decision making. |
Precision agriculture | Sensor enabled hardware and software tools to manage agriculture in all aspects using modern technology. |
Prescription agriculture | Computer algorithm enabled prescription for agronomic practices for mixing yield. |
Enterprise agriculture | Computer enabled agribusiness platform considering field agriculture to human resources management, inventory, logistics, machinery, buying and selling the system and profit. |
Automated agriculture | Automation in agriculture through robotic technology and intelligent program using farm data and environmental data. |
Stages of the Sata Chain | Features | Key Issues |
---|---|---|
Data capture | Sensors, Open Data, data capture by UAVs, Biometric sensing, Genotype information | Availability, quality, formats |
Data Storage | Cloud-based platform, Hadoop, Distributed File System (HDFS), hybrid storage system, cloud-based data warehouse | Quick and safe Access to data, cost |
Data Transfer | Wireless, cloud-based platform, Linken Open Data | Safety, agreements on responsibilities and liabilities |
Data Transformation | ML algorithms, normalize, visualize, anonymize. | Heterogeneity of data source, automation of data cleansing and preparation |
Data Analytics | Yield models, Planting instructions, Benchmarking, Decision ontologies, Cognitive computing | Semantic heterogeneity, real-time analytics, scalability |
Data Marketing | Data visualization | Ownership, privacy, new business model |
Approaches * | Challenges | ||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Volume | Variety | Velocity | Veracity | ||||||||||||||||
A | B | C | D | E | F | G | H | I | J | K | L | M | N | L | O | P | Q | ||
ML | Deep Learning | - | - | - | - | √ | √ | - | - | - | √ | * | - | - | - | - | - | * | * |
Online Learning | √ | √ | * | - | - | - | - | - | √ | - | * | √ | √ | * | √ | - | - | * | |
Local Learning | √ | √ | √ | - | - | - | - | √ | √ | - | - | - | - | - | - | - | - | - | |
Transfer Learning | - | - | √ | - | - | - | - | - | - | √ | * | - | - | - | - | - | * | * | |
Lifelong Learning | √ | - | √ | - | - | - | - | - | - | √ | * | √ | √ | * | - | - | * | * | |
Ensemble Learning | √ | √ | - | - | - | - | - | - | - | - | - | - | - | √ | - | - | - | - |
Nº | Research Question | Main Motivation | Research Objectives |
---|---|---|---|
RQ1 | What are the major targeted primary publication channels for Big Data and Machine Learning research in agriculture? | Identify where Big Data and Machine Learning research for agriculture can be found, as well as good publication sources for future studies. | O1 |
RQ2 | How has the frequency of architectures been changed of Big Data in agriculture over time? | Identify the publication with the time related to Big Data in agriculture. | O1 |
RQ3 | What type of problems are solved using Big Data Machine Learning in the field of agriculture? | Identify the type of problem facing the agricultural company. | O1, O6 |
RQ4 | What is the heading of agriculture in which Big Data Machine Learning is used? | Identify the agricultural sector in which the technology is used. | O1 |
RQ5 | What Big Data Machine Learning architectures are used to solve problems in agriculture? | Identify the characteristics of the architectures used, their data sources, pre-processing, data analysis algorithms, results evaluation and visualization. | O1, O2, O6 |
RQ6 | What approaches were used to solve the problems? | Know the approaches used in the development of Big Data Machine Learning architectures in agriculture. | O1, O3 |
RQ7 | What are the main application domains of Big Data Machine Learning in agriculture? | Identify the main areas of agriculture in which Big Data Machine Learning is used for monitoring, control, simulation and prediction purposes. | O1, O3 |
RQ8 | What Machine Learning techniques have been used in the Big Data architectures found? | Identify the Machine Learning techniques and algorithms used. | O1, O2, O5 |
RQ9 | What are the adaptations of using ML in Big Data in the field of agriculture? | Identify adaptations made in ML when used in a Big Data context for agriculture. | O1, O4 |
Sources | Search String | Context |
---|---|---|
IEEE Xplore, ACM, Science Direct, Springer Link, MDPI and Scopus | “Big Data” AND “Machine Learning” AND (“Farm” OR “Agriculture”) | Agriculture |
Sources | Ranking | Score |
---|---|---|
Journal | Q1 | 2 |
Q2 | 1.5 | |
Q3 OR Q4 | 1 | |
If paper is not in a JCR ranking | 0 | |
Conference | CORE A | 1.5 |
CORE B | 1 | |
CORE C | 0.5 | |
If paper is not in a CORE ranking | 0 |
Phase | Process | Selection Criteria | IEEE Xplore | Springer | Science Direct | MDPI | Scopus | Total |
---|---|---|---|---|---|---|---|---|
1 | Search | Keywords | 538 | 486 | 106 | 270 | 580 | 1980 |
2 | Screening | Title | 413 | 278 | 87 | 78 | 312 | 1168 |
3 | Screening | Duplication Removal | 413 | 35 | 87 | 78 | 260 | 873 |
4 | Screening | Abstract | 48 | 17 | 37 | 25 | 97 | 224 |
5 | Inspection | Full Article | 9 | 1 | 0 | 6 | 18 | 34 |
Nº | Research Questions | Key Words |
---|---|---|
RQ1 | What are the major targeted primary publication channels for Big Data and Machine Learning research in agriculture? | The answer to this question is given by identifying the publication channels and the sources of all the articles. |
RQ2 | How has the frequency of architectures been changed of Big Data in agriculture over time? | Identify the frequency of approaches to each article, which has been classified according to the year of publication. |
RQ3 | What type of problems are solved using Big Data Machine Learning in the field of agriculture? | It is possible to find problems in the field of quality, production, disease control, climate prediction, among others. |
RQ4 | What is the heading of agriculture in which Big Data Machine Learning is used? | The heading can be diverse, such as Fruit, Livestock, Climate, among others. |
RQ5 | What Big Data Machine Learning architectures are used to solve problems in agriculture? | The components of the architectures will be detailed according to the data sources, data processing, analysis and visualization. |
RQ6 | What approaches were used to solve the problems? | The research approaches have been classified according to the development techniques in the selected studies, as if it is a proposal, method, model, application, survey, platform, ecosystem and framework. |
RQ7 | What are the main application domains of Big Data Machine Learning in agriculture? | The application domains can be control (management), growth tracking, control and prediction. |
RQ8 | What Machine Learning techniques have been used in the Big Data architectures found? | It is possible to find the techniques used and the design of the learning stages. |
RQ9 | What are the adaptations of using ML in Big Data in the field of agriculture? | ML adaptations have been made for use in Big Data. They have classified them according to the characteristics of Big Data, volume, variety, veracity and value. |
Classification | Quality Assessment | |||||||
---|---|---|---|---|---|---|---|---|
References | P. Channel | Heading | Domain | a | b | c | d | Scores |
[48] | Springer | Farmer’s decision making | Prediction | 1 | 1 | 0 | 0 | 2 |
[49] | MDPI | Crops | Prediction | 1 | 0.5 | 1 | 1.5 | 4 |
[50] | MDPI | Crops | Prediction | 1 | 0.5 | 1 | 2 | 4.5 |
[51] | MDPI | Land | Prediction | 1 | 0.5 | 0 | 2 | 3.5 |
[52] | MDPI | Crops | Prediction | 1 | 0.5 | 0 | 0 | 1.5 |
[53] | MDPI | Soil | Control | 1 | 0.5 | 0 | 1.5 | 3 |
[54] | MDPI | Crops | Prediction | 1 | 0.5 | 0 | 2 | 3.5 |
[55] | IEEE | Crops | Tracing | 1 | 1 | −1 | 0 | 1 |
[56] | IEEE | Biodiversity | Prediction | 1 | 0.5 | −1 | 0 | 0.5 |
[57] | IEEE | Crops | Prediction | 1 | 1 | 1 | 0 | 3 |
[58] | IEEE | Farmer’s decision making | Prediction | 1 | 1 | 0 | 0 | 2 |
[59] | IEEE | Crops | Control | 1 | 1 | 0 | 0 | 2 |
[60] | IEEE | Animal’s Research | Control | 1 | 1 | 1 | 0 | 3 |
[61] | IEEE | Animal’s Research | Prediction | 1 | 0.5 | −1 | 2 | 2.5 |
[62] | IEEE | Crops | Prediction | 1 | 0.5 | 0 | 0 | 1.5 |
[63] | IEEE | Animal’s Research | Prediction | 1 | 0.5 | 0 | 2 | 3.5 |
[64] | Scopus | Crops | Prediction | 1 | 0.5 | −1 | 1 | 1.5 |
[65] | Scopus | Land | Prediction | 1 | 0.5 | −1 | 2 | 2.5 |
[66] | Scopus | Farmers’ decision making/Weather and climate change | Prediction | 1 | 0.5 | −1 | 0 | 0.5 |
[67] | Scopus | Land | Tracing | 1 | 1 | 1 | 2 | 5 |
[68] | Scopus | Crops | Tracing | 1 | 0.5 | −1 | 1 | 1.5 |
[69] | Scopus | Farmers’ decision making/Crops | Prediction | 1 | 0.5 | −1 | 1 | 1.5 |
[70] | Scopus | Weather and climate change | Prediction | 1 | 0.5 | −1 | 1 | 1.5 |
[71] | Scopus | Crops | Tracing | 1 | 0.5 | −1 | 0 | 0.5 |
[72] | Scopus | Crops | Prediction | 1 | 1 | 0 | 0 | 2 |
[73] | Scopus | Soil | Prediction | 1 | 0.5 | 1 | 2 | 4.5 |
[74] | Scopus | Food availability and security | Tracing | 1 | 0.5 | 1 | 2 | 4.5 |
[75] | Scopus | Crops | Tracing | 1 | 0.5 | 0 | 2 | 3.5 |
[76] | Scopus | Weather and climate change | Prediction | 1 | 0.5 | 0 | 1.5 | 3 |
[77] | Scopus | Crops | Prediction | 1 | 0.5 | −1 | 0 | 0.5 |
[78] | Scopus | Soil | Prediction | 1 | 0.5 | 1 | 2 | 4.5 |
[13] | Scopus | Weeds | Control | 1 | 0.5 | 1 | 2 | 4.5 |
[79] | Scopus | Crops | Tracing | 1 | 0.5 | 1 | 2 | 4 |
[80] | Scopus | Biodiversity | Prediction | 1 | 0.5 | 1 | 0 | 2.5 |
Publication Sources | References | Chanel | Nº | % |
---|---|---|---|---|
Environmental Software Systems. Infrastructures, Services and Applications | [48] | Journal | 1 | 2.9 |
Machines | [49] | Journal | 1 | 2.9 |
Remote Sensing | [50,65] | Journal | 2 | 5.9 |
Agronomy | [51,54] | Journal | 2 | 5.9 |
AI (Multidisciplinary Digital Publishing Institute) | [52] | Journal | 1 | 2.9 |
Water | [53] | Journal | 1 | 2.9 |
IST-Africa Conference (IST-Africa) | [55] | Conference | 1 | 2.9 |
International Conference on Computer, Control, Informatics and its Applications (IC3INA) | [56] | Conference | 1 | 2.9 |
International Conference on Advances in Computing, Communications and Informatics (ICACCI) | [57] | Conference | 1 | 2.9 |
Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) | [58] | Conference | 1 | 2.9 |
IEEE Transactions on Big Data | [59] | Journal | 1 | 2.9 |
IoT Vertical and Topical Summit on Agriculture—Tuscany (IOT Tuscany) | [60] | Conference | 1 | 2.9 |
IEEE Access | [61,63] | Journal | 2 | |
IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) | [62] | Conference | 1 | 2.9 |
Advances in Intelligent Systems and Computing | [64,69] | Journal | 2 | 5.9 |
Remote Sensing | [65] | Journal | 1 | 2.9 |
International Journal of Emerging Trends in Engineering Research | [66] | Journal | 1 | 2.9 |
GIScience and Remote Sensing | [67] | Journal | 1 | 2.9 |
International Journal of Scientific and Technology Research | [68] | Journal | 1 | 2.9 |
International Journal on Emerging Technologies | [70] | Journal | 1 | 2.9 |
IEEE 5th International Conference on Computer and Communications, ICCC 2019 | [71] | Conference | 1 | 2.9 |
ACM International Conference Proceeding Series | [72] | Conference | 1 | 2.9 |
Computers and Electronics in Agriculture | [13,73,74,75,78] | Journal | 5 | 14.7 |
Environmetrics | [76] | Journal | 1 | 2.9 |
International Journal of Recent Technology and Engineering | [77] | Journal | 1 | 2.9 |
Mobile Networks and Applications | [79] | Journal | 1 | 2.9 |
Proceedings of the International Joint Conference on Neural Networks | [80] | Conference | 1 | 2.9 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cravero, A.; Sepúlveda, S. Use and Adaptations of Machine Learning in Big Data—Applications in Real Cases in Agriculture. Electronics 2021, 10, 552. https://doi.org/10.3390/electronics10050552
Cravero A, Sepúlveda S. Use and Adaptations of Machine Learning in Big Data—Applications in Real Cases in Agriculture. Electronics. 2021; 10(5):552. https://doi.org/10.3390/electronics10050552
Chicago/Turabian StyleCravero, Ania, and Samuel Sepúlveda. 2021. "Use and Adaptations of Machine Learning in Big Data—Applications in Real Cases in Agriculture" Electronics 10, no. 5: 552. https://doi.org/10.3390/electronics10050552
APA StyleCravero, A., & Sepúlveda, S. (2021). Use and Adaptations of Machine Learning in Big Data—Applications in Real Cases in Agriculture. Electronics, 10(5), 552. https://doi.org/10.3390/electronics10050552