A Multi-Considered Seed Coat Pattern Classification of Allium L. Using Unsupervised Machine Learning
Abstract
:1. Introduction
- We proposed to classify seed coat patterns using unsupervised machine learning.
- We then compared them to previous human-based classifications.
- Following that, we suggested our proposed classification based on unsupervised machine learning and possible combinations such as SI, SS, SM, NS, PS, and PD.
2. Results
2.1. Comparison between Results of Human-Based and Machine-Based Methods
2.2. Proposed Clustering
3. Discussion
4. Materials and Methods
4.1. The Proposed Module Overviews
4.2. General Module
4.2.1. Datasets
4.2.2. Preprocessing Module
4.2.3. Augmentation Module
4.2.4. Shuffling Raw Data
4.2.5. Calculating the Number of Clusters
4.2.6. Clustering Module
4.3. Performance Analysis
Visualization Module
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Friesen, N.; Fritch, R.M.; Blattner, F.R. Phylogeny and new intrageneric classification of Allium (Alliaceae) based on nuclear ribosomal DNA ITS sequences. Aliso 2006, 22, 372–395. [Google Scholar] [CrossRef]
- Choi, H.J.; Oh, B.U. A partial revision of Allium (Amaryllidaceae) in Korea and northeastern China. Bot. J. Linn. Soc. 2011, 167, 153–211. [Google Scholar] [CrossRef] [Green Version]
- Xie, D.F.; Tan, J.B.; Yu, Y.; Gui, L.J.; Su, D.M.; Zhou, S.D.; He, X.J. Insights into phylogeny, age and evolution of Allium (Amaryllidaceae) based on the whole plastome sequences. Ann. Bot. 2020, 125, 1039–1055. [Google Scholar] [CrossRef]
- POWO 2022. Plants of the World Online. Facilitated by the Royal Botanic Gardens, Kew. Available online: http://www.plantsoftheworldonline.org/ (accessed on 10 October 2022).
- Baasanmunkh, S.; Lee, J.K.; Jang, J.E.; Park, M.S.; Friesen, N.; Chung, S.; Choi, H.J. Seed morphology of Allium L. (Amaryllidaceae) from Central Asian countries and its taxonomic implications. Plants 2020, 9, 1239. [Google Scholar] [CrossRef]
- Yusupov, Z.; Ergashov, I.; Volis, S.; Makhmudjanov, D.; Dekhkonov, D.; Khassanov, F.; Tojibaev, K.; Deng, T.; Sun, H. Seed macro- and micromorphology in Allium (Amaryllidaceae) and its phylogenetic significance. Ann. Bot. 2022, 129, 869–911. [Google Scholar] [CrossRef]
- Baasanmunkh, S.; Choi, H.J.; Oyuntsetseg, B.; Friesen, N. Seed testa sculpture of species of Allium L. (Amaryllidaceae) and its taxonomic implications. Turczaninowia 2021, 24, 154–161. [Google Scholar] [CrossRef]
- Choi, H.J.; Giussani, L.M.; Jang, C.G.; Oh, B.U.; Cota-Santez, J.H. Systematics of disjunct northeastern Asian and northern North American Allium (Amaryllidaceae). Botany 2012, 90, 491–508. [Google Scholar] [CrossRef]
- Celep, F.; Koyuncu, M.; Fritsch, R.M.; Kahraman, A.; Dogan, M. Taxonomic importance of seed morphology in Allium (Amaryllidaceae). Syst. Bot. 2012, 37, 893–912. [Google Scholar] [CrossRef]
- Lin, C.Y.; Tan, D.Y. Seed testa micromorphology of thirty-eight species of Allium (Amaryllidaceae) from central Asia, and its taxonomic implications. Nord. J. Bot. 2017, 35, 189–200. [Google Scholar] [CrossRef]
- Barthlott, W. Epidermal and seed surface characters of plants: Systematic applicability and some evolutionary aspects. Nord. J. Bot. 1981, 1, 345–355. [Google Scholar] [CrossRef]
- Barthlott, W. Scanning electron microscopy of the epidermal surface in plants. In Scanning Electron Microscopy in Taxonomy and Function Morphology; Claugher, D., Ed.; Clarendon Press: Oxford, UK, 1990; Volume 41, pp. 69–94. [Google Scholar]
- Kruse, J. Rasterelektronenmikroskopische Untersuchungen an Samen der Gattung Allium L. Die Kult. 1984, 32, 89–101. [Google Scholar] [CrossRef]
- Kruse, J. Rasterelektronenmikroskopische Untersuchungen an Samen der Gattung Allium L. II. Die Kult. 1986, 34, 207–228. [Google Scholar] [CrossRef]
- Kruse, J. Rasterelektronenmikroskopische Untersuchungen an Samen der Gattung Allium L. III. Die Kult. 1988, 36, 355–368. [Google Scholar] [CrossRef]
- Kruse, J. Rasterelektronenmikroskopische Untersuchungen an Samen der Gattung Allium L. IV. Feddes Repert. 1994, 105, 457–471. [Google Scholar] [CrossRef]
- Geetharamani, G.; Pandian, A. Identification of plant leaf diseases using a nine-layer deep convolutional neural network. Comput. Elect. Eng. 2019, 76, 323–338. [Google Scholar] [CrossRef]
- Lee, S.H.; Chan, C.S.; Mayo, S.J.; Remagnino, P. How deep learning extracts and learns leaf features for plant classification. Pattern Recognit. 2017, 71, 1–13. [Google Scholar] [CrossRef] [Green Version]
- Reyes, A.K.; Caicedo, J.C.; Camargo, J.E. Fine-tuning Deep Convolutional Networks for Plant Recognition. CLEF 2015, 1391, 467–475. [Google Scholar]
- Saleem, M.H.; Potgieter, J.; Arif, K.M. Plant disease classification: A comparative evaluation of convolutional neural networks and deep learning optimizers. Plants 2020, 9, 1319. [Google Scholar] [CrossRef]
- Wang, X.; Liu, J.; Zhu, X. Early real-time detection algorithm of tomato diseases and pests in the natural environment. Plant Methods 2021, 17, 1–17. [Google Scholar] [CrossRef]
- Meunkaewjinda, A.; Kumsawat, P.; Attakitmongcol, K.; Srikaew, A. Grape leaf disease detection from color imagery using hybrid intelligent system. In Proceedings of the 2008 5th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, Krabi, Thailand, 14–17 May 2008; Volume 1, pp. 513–516. [Google Scholar] [CrossRef]
- Padol, P.B.; Yadav, A.A. SVM classifier based grape leaf disease detection. In Proceedings of the 2016 Conference on Advances in Signal Processing (CASP), Pune, India, 9–11 June 2016; pp. 175–179. [Google Scholar] [CrossRef]
- Sannakki, S.S.; Rajpurohit, V.S.; Nargund, V.B.; Kulkarni, P. Diagnosis and classification of grape leaf diseases using neural networks. In Proceedings of the 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT), Tiruchengode, India, 4–6 July 2013; pp. 1–5. [Google Scholar] [CrossRef]
- Mahlein, A.K. Plant disease detection by imaging sensors–parallels and specific demands for precision agriculture and plant phenotyping. Plant Dis. 2016, 100, 241–251. [Google Scholar] [CrossRef] [Green Version]
- Atila, Ü.; Uçar, M.; Akyol, K.; Uçar, E. Plant leaf disease classification using EfficientNet deep learning model. Ecol. Inform. 2021, 61, 101182. [Google Scholar] [CrossRef]
- Brahimi, M.; Boukhalfa, K.; Moussaoui, A. Deep learning for tomato diseases: Classification and symptoms visualization. Appl. Artif. Intell. 2017, 31, 299–315. [Google Scholar] [CrossRef]
- Amara, J.; Bouaziz, B.; Algergawy, A. A deep learning-based approach for banana leaf diseases classification. BTW 2017, 266, 79–88. [Google Scholar]
- Piazza, G.; Valsecchi, C.; Sottocornola, G. Deep Learning Applied to SEM Images for Supporting Marine Coralline Algae Classification. Diversity 2021, 13, 640. [Google Scholar] [CrossRef]
- Veiskarami, G.; Khodayari, H.; Heubl, G.; Zarre, S. Seed surface ultrastructure as an efficient tool for species delimitation in the Allium ampeloprasum L. alliance (Amaryllidaceae, Allioideae). Microsc. Res. Technol. 2018, 81, 1275–1285. [Google Scholar] [CrossRef]
- Koch, K.; Bhushan, B.; Barthlott, W. Multifunctional surface structures of plants: An inspiration for biomimetics. Prog. Mater. Sci. 2009, 54, 137–178. [Google Scholar] [CrossRef]
- Saravanan, C. Color image to grayscale image conversion. In Proceedings of the Second International Conference on Computer Engineering and Applications (ICCEA), Bali, Indonesia, 19–21 March 2010; Volume 2, pp. 196–199. [Google Scholar] [CrossRef]
- Kanan, C.; Cottrell, G.W. Color-to-grayscale: Does the method matter in image recognition? PLoS ONE 2012, 7, e29740. [Google Scholar] [CrossRef] [Green Version]
- Singh, T.R.; Roy, S.; Singh, O.I.; Sinam, T.; Singh, K. A new local adaptive thresholding technique in binarization. Int. J. Comput. Sci. 2012, 8, 271–277. [Google Scholar] [CrossRef]
- Al-amri, S.S.; Kalyankar, N.V.; Khamitkar, S.D. Image Segmentation by Using Threshold Techniques. arXiv 2010, arXiv:1005.4020. [Google Scholar]
- He, J.; Do, Q.D.M.; Downton, A.C.; Kim, J. A comparison of binarization methods for historical archive documents. Proc. Int. Conf. Doc. Anal. Recognit. 2005, 1, 538–542. [Google Scholar] [CrossRef]
- Di Stefano, L.; Bulgarelli, A. A simple and efficient connected components labeling algorithm. In Proceedings of the International Conference on Image Analysis and Processing, Venice, Italy, 27–29 September 1999; pp. 322–327. [Google Scholar] [CrossRef]
- He, L.; Chao, Y.; Suzuki, K.; Wu, K. Fast connected-component labeling. Pattern Recognit. 2009, 42, 1977–1987. [Google Scholar] [CrossRef]
- Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big. Data 2019, 6, 1–48. [Google Scholar] [CrossRef]
- Zhong, Z.; Zheng, L.; Kang, G.; Li, S.; Yang, Y. Random erasing data augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 13001–13008. [Google Scholar] [CrossRef]
- Elmahdy, A.; Mohajer, S. On the fundamental limits of coded data shuffling for distributed machine learning. IEEE Trans. Inf. Theory 2020, 66, 3098–3131. [Google Scholar] [CrossRef]
- Masud, M.A.; Huang, J.Z.; Wei, C.; Wang, J.; Khan, I.; Zhong, M. I-nice: A new approach for identifying the number of clusters and initial cluster centres. Inf. Sci. 2018, 466, 129–151. [Google Scholar] [CrossRef]
- Kim, S.W.; Gil, J.M. Research paper classification systems based on TF-IDF and LDA schemes. Hum. Cent. Comput. Inf. Sci. 2019, 9, 1–21. [Google Scholar] [CrossRef]
- Bholowalia, P.; Kumar, A. EBK-means: A clustering technique based on elbow method and k-means in WSN. Int. J. Comput. Appl. 2014, 105, 17–24. [Google Scholar] [CrossRef]
- Syakur, M.A.; Khotimah, B.K.; Rochman, E.M.S.; Satoto, B.D. Integration k-means clustering method and elbow method for identification of the best customer profile cluster. Mater. Sci. Eng. 2018, 336, 012017. [Google Scholar] [CrossRef]
- García, S.; Parejo, A.; Personal, E.; Guerrero, J.I.; Biscarri, F.; León, C. A retrospective analysis of the impact of the COVID-19 restrictions on energy consumption at a disaggregated level. Appl. Energy 2021, 287, 116547. [Google Scholar] [CrossRef]
- Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef] [Green Version]
- Sinaga, K.P.; Yang, M.S. Unsupervised K-means clustering algorithm. IEEE Access 2020, 8, 80716–80727. [Google Scholar] [CrossRef]
- Bahmani, B.; Moseley, B.; Vattani, A.; Kumar, R.; Vassilvitskii, S. Scalable K-Means++. Proc. VLDB Endow 2012, 5, 622–633. [Google Scholar] [CrossRef]
- Sculley, D. Web-scale k-means clustering. In Proceedings of the 19th International Conference on World Wide Web, Raleigh, NC, USA, 26–30 April 2010; pp. 1177–1178. [Google Scholar] [CrossRef]
- Aggarwal, C.; Subbian, K. Evolutionary network analysis: A survey. ACM Comput. Surv. 2014, 47, 1–36. [Google Scholar] [CrossRef]
- Zhang, T.; Ramakrishnan, R.; Livny, M. BIRCH: An efficient data clustering method for very large databases. SIGMOD Rec. 1996, 25, 103–114. [Google Scholar] [CrossRef]
Our Observations | Abbreviation | Definition |
---|---|---|
Strongly identical | SI | It refers to an image that is strongly similar. |
Strongly similar | SS | It is referring term refers to an image that is exactly similar. |
Similar | SM | It refers to one or more images that are similar to one another. |
Nearly similar | NS | It indicates that two or more images are similar to one another. |
Possibly similar | PS | It indicates that there is a chance that certain images are identical. |
Possibly different | PD | It is indicating that there is a chance that some images may be different. |
SI | SS | SM | NS | PS | PD |
---|---|---|---|---|---|
1385 | 1127 | 1127 | 845 | 824 | 869 |
Techniques (1) | Abbreviation (1) | Clusters (2) | Abbreviation (2) | Proposed Cluster (3) | Abbreviation (3) |
---|---|---|---|---|---|
K-Means | M1 | Cluster 1 | C1 | Elbow based four-grayscale | EG |
K-Means++ | M2 | Cluster 2 | C2 | Silhouette based six-grayscale | SG |
Minibatch K-means | M3 | Cluster 3 | C3 | Elbow based four-threshold | ET |
Spectral | M4 | Cluster 4 | C4 | Silhouette based four- threshold | ST |
Birch | M5 | Cluster 5 | C5 | Elbow based five-colored | EC |
Silhouette based eight- colored | SC |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ariunzaya, G.; Baasanmunkh, S.; Choi, H.J.; Kavalan, J.C.L.; Chung, S. A Multi-Considered Seed Coat Pattern Classification of Allium L. Using Unsupervised Machine Learning. Plants 2022, 11, 3097. https://doi.org/10.3390/plants11223097
Ariunzaya G, Baasanmunkh S, Choi HJ, Kavalan JCL, Chung S. A Multi-Considered Seed Coat Pattern Classification of Allium L. Using Unsupervised Machine Learning. Plants. 2022; 11(22):3097. https://doi.org/10.3390/plants11223097
Chicago/Turabian StyleAriunzaya, Gantulga, Shukherdorj Baasanmunkh, Hyeok Jae Choi, Jonathan C. L. Kavalan, and Sungwook Chung. 2022. "A Multi-Considered Seed Coat Pattern Classification of Allium L. Using Unsupervised Machine Learning" Plants 11, no. 22: 3097. https://doi.org/10.3390/plants11223097