Multitarget Intelligent Recognition of Petrographic Thin Section Images Based on Faster RCNN
Abstract
:1. Introduction
2. Materials and Methods
2.1. Faster RCNN
- Backbone network: The backbone network comprises a series of convolution, batch normalization, activation function and pooling operations. It is used to extract image features and generate feature maps.
- Region proposal networks (RPN): A large number of anchor boxes are indirectly generated on an image, and the IOU ratio between each anchor box and the ground truth box is calculated. The anchor boxes are labeled as positive samples and negative samples according to the IOU threshold, and the positive and negative samples are classified and trained by regression. The final output is 300 relatively accurate region proposals (ROI).
- ROI pooling: The region proposal output in the previous step is projected onto the feature map. The feature maps of these region proposals are different sizes, so it is necessary to standardize them and output feature maps with the same size of 7 × 7, which is convenient to connect the subsequent fully connected layer.
- Classification and regression: More accurate classification and regression are performed on the feature map output in the previous step.
2.1.1. VGG16
2.1.2. ResNet50
2.1.3. Transfer Learning
2.1.4. Average Precision
2.2. Experiments
3. Results
4. Discussion
- The complexity of image components and the lack of distinction between target texture and background features can lead to misclassification in detection. This is exemplified by the petrographic thin section image classification of Bai et al., (2019) [40], where the similarity between oolitic limestone and dolomite thin sections was so high that misclassification occurred. To address this issue, incorporating such samples into the training process may enable the model to learn the differences between them and thus reduce misjudgment.
- The uneven distribution of training image samples can have a significant impact on detection. This effect was demonstrated by the training set analysis, which revealed that the training set similar to Figure 9, Figure 10 and Figure 11 accounted for 54%, the training set similar to Figure 12 and Figure 13 accounted for 32%, and the training set similar to Figure 14 accounted for 14%. This skewed distribution of training set samples indicates that the model may not have sufficient learning experience, leading to better performance on simple images than on complex images.
- The limited size of the training image data sets can lead to insufficient generalization of the model. Data augmentation has been employed to address the issue of small data sets, thus improving the model′s generalization capacity [38,39]. After data augmentation, the overall result was promising, which shows the potential of this method. With the increase in data sets, the generalization ability of the model can be further enhanced, thus improving the detection performance.
5. Conclusions
- The AP value of the ooids test set using ResNet50 as the backbone was 92.25%, indicating good overall detection performance. This object detection model was found to be robust and generalizable, as it was able to identify both complete ooids in the middle of the image and partial ooids at the edge.
- The uneven distribution of samples in the training set and the complex composition of microscopic images affected the detection, with the former having a greater effect. Deep learning was used to learn features from the training set and make predictions on the test set, but the uneven distribution of samples caused the distribution of the learned features to deviate, resulting in missed detection in the prediction process. The complexity of the microscopic image composition, with a small difference between the target and the background, also contributed to misclassification and thus affected the detection performance, although to a lesser extent than the uneven distribution.
- This study sought to transition from the classification of petrographic thin section images to multitarget detection, incorporating richer content such as spatial, quantitative and categorical target information, as well as more complex tasks. The research scale was further refined, transitioning from rocks to the textures and structures within them, providing a reference for multitarget intelligent recognition on petrographic thin section images.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhou, Y.Z.; Zuo, R.G.; Liu, G.; Yuan, F.; Mao, X.C.; Guo, Y.J.; Xiao, F.; Liao, J.; Liu, Y.P. The great-leap-forward development of mathematical geoscience during 2010–2019: Big data and artificial intelligence algorithm are changing mathematical geoscience. Bull. Mineral. Petrol. Geochem. 2021, 40, 556–573. [Google Scholar] [CrossRef]
- Zhou, Y.Z.; Chen, S.; Zhang, Q.; Xiao, F.; Wang, S.G.; Liu, Y.P.; Jiao, S.T. Advances and prospects of big data and mathematical geoscience. Acta Petrol. Sin. 2018, 34, 255–263. [Google Scholar]
- Liu, C.; Chen, J.P.; Li, S.; Qin, T. Construction of Conceptual Prospecting Model Based on Geological Big Data: A Case Study in Songtao-Huayuan Area, Hunan Province. Minerals 2022, 12, 669. [Google Scholar] [CrossRef]
- Mao, X.C.; Liu, P.; Deng, H.; Liu, Z.K.; Li, L.J.; Wang, Y.S.; Ai, Q.X.; Liu, J.X. A Novel Approach to Three-Dimensional Inference and Modeling of Magma Conduits with Exploration Data: A Case Study from the Jinchuan Ni–Cu Sulfide Deposit, NW China. Nat. Resour. Res. 2023, 32, 901–928. [Google Scholar] [CrossRef]
- Deng, H.; Zhou, S.F.; He, Y.; Lan, Z.D.; Zou, Y.H.; Mao, X.C. Efficient Calibration of Groundwater Contaminant Transport Models Using Bayesian Optimization. Toxics 2023, 11, 438. [Google Scholar] [CrossRef]
- Zeng, L.; Li, T.B.; Huang, H.T.; Zeng, P.; He, Y.X.; Jing, L.H.; Yang, Y.; Jiao, S.T. Identifying Emeishan basalt by supervised learning with Landsat-5 and ASTER data. Front. Earth Sci. 2023, 10, 1097778. [Google Scholar] [CrossRef]
- Wang, J.; Zhou, Y.Z.; Xiao, F. Identification of multi-element geochemical anomalies using unsupervised machine learning algorithms: A case study from Ag-Pb-Zn deposits in north-western Zhejiang, China. Appl. Geochem. 2020, 120, 104679. [Google Scholar] [CrossRef]
- Zuo, R.G. Machine Learning of Mineralization-Related Geochemical Anomalies: A Review of Potential Methods. Nat. Resour. Res. 2017, 26, 457–464. [Google Scholar] [CrossRef]
- Wu, G.P.; Chen, G.X.; Cheng, Q.M.; Zhang, Z.J.; Yang, J. Unsupervised Machine Learning for Lithological Mapping Using Geochemical Data in Covered Areas of Jining, China. Nat. Resour. Res. 2021, 30, 1053–1068. [Google Scholar] [CrossRef]
- Yu, X.T.; Xiao, F.; Zhou, Y.Z.; Wang, Y.; Wang, K.Q. Application of hierarchical clustering, singularity mapping, and Kohonen neural network to identify Ag-Au-Pb-Zn polymetallic mineralization associated geochemical anomaly in Pangxidong district. J. Geochem. Explor. 2019, 203, 87–95. [Google Scholar] [CrossRef]
- Wu, B.C.; Li, X.H.; Yuan, F.; Li, H.; Zhang, M.M. Transfer learning and siamese neural network based identification of geochemical anomalies for mineral exploration: A case study from the Cu-Au deposit in the NW Junggar area of northern Xinjiang Province, China. J. Geochem. Explor. 2022, 232, 106904. [Google Scholar] [CrossRef]
- Li, H.; Li, X.H.; Yuan, F.; Jowitt, S.M.; Zhang, M.M.; Zhou, J.; Zhou, T.F.; Li, X.L.; Ge, C.; Wu, B.C. Convolutional neural network and transfer learning based mineral prospectivity modeling for geochemical exploration of Au mineralization within the Guandian-Zhangbaling area, Anhui Province, China. Appl. Geochem. 2020, 122, 104747. [Google Scholar] [CrossRef]
- Qin, Y.Z.; Liu, L.M. Quantitative 3D Association of Geological Factors and Geophysical Fields with Mineralization and Its Significance for Ore Prediction: An Example from Anqing Orefield, China. Minerals 2018, 8, 300. [Google Scholar] [CrossRef] [Green Version]
- Zuo, R.G.; Kreuzer, O.P.; Wang, J.; Xiong, Y.H.; Zhang, Z.J.; Wang, Z.Y. Uncertainties in GIS-Based Mineral Prospectivity Mapping: Key Types, Potential Impacts and Possible Solutions. Nat. Resour. Res. 2021, 30, 3059–3079. [Google Scholar] [CrossRef]
- Liu, L.M.; Cao, W.; Liu, H.S.; Ord, A.; Qin, Y.Z.; Zhou, F.H.; Bi, C.X. Applying benefits and avoiding pitfalls of 3D computational modeling-based machine learning prediction for exploration targeting: Lessons from two mines in the Tongling-Anqing district, eastern China. Ore Geol. Rev. 2022, 142, 104712. [Google Scholar] [CrossRef]
- Wang, Z.Y.; Yin, Z.; Caers, J.; Zuo, R.G. A Monte Carlo-based framework for risk-return analysis in mineral prospectivity mapping. Geosci. Front. 2020, 11, 2297–2308. [Google Scholar] [CrossRef]
- Lu, Y.; Liu, L.M.; Xu, G.J. Constraints of deep crustal structures on large deposits in the Cloncurry district, Australia: Evidence from spatial analysis. Ore Geol. Rev. 2016, 79, 316–331. [Google Scholar] [CrossRef]
- Jia, L.Q.; Yang, M.; Meng, F.; He, M.Y.; Liu, H.M. Mineral Photos Recognition Based on Feature Fusion and Online Hard Sample Mining. Minerals 2021, 11, 1354. [Google Scholar] [CrossRef]
- Wu, B.K.; Ji, X.H.; He, M.Y.; Yang, M.; Zhang, Z.C.; Chen, Y.; Wang, Y.Z.; Zheng, X.Q. Mineral Identification Based on Multi-Label Image Classification. Minerals 2022, 12, 1338. [Google Scholar] [CrossRef]
- Su, C.; Xu, S.J.; Zhu, K.Y.; Zhang, X.C. Rock classification in petrographic thin section images based on concatenated convolutional neural networks. Earth Sci. Inform. 2020, 13, 1477–1484. [Google Scholar] [CrossRef]
- Ma, H.; Han, G.Q.; Peng, L.; Zhu, L.Y.; Shu, J. Rock thin sections identification based on improved squeeze-and-Excitation Networks model. Comput. Geosci. 2021, 152, 104780. [Google Scholar] [CrossRef]
- Singh, N.; Singh, T.N.; Tiwary, A.; Sarkar, K.M. Textural identification of basaltic rock mass using image processing and neural network. Comput. Geosci. 2010, 14, 301–310. [Google Scholar] [CrossRef]
- Flügel, E.; Munnecke, A. Microfacies of Carbonate Rocks: Analysis, Interpretation and Application; Springer: Berlin, Germany, 2010. [Google Scholar]
- Mlynarczuk, M.; Gorszczyk, A.; Slipek, B. The application of pattern recognition in the automatic classification of microscopic rock images. Comput. Geosci. 2013, 60, 126–133. [Google Scholar] [CrossRef]
- Shu, L.; McIsaac, K.; Osinski, G.R.; Francis, R. Unsupervised feature learning for autonomous rock image classification. Comput. Geosci. 2017, 106, 10–17. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015: 18th International Conference, Part III 18, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 770–778. [Google Scholar] [CrossRef] [Green Version]
- Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely connected convolutional networks. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 4700–4708. [Google Scholar]
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 2818–2826. [Google Scholar]
- Ren, S.Q.; He, K.M.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [Green Version]
- Liu, X.B.; Wang, H.Y.; Jing, H.D.; Shao, A.L.; Wang, L.C. Research on Intelligent Identification of Rock Types Based on Faster R-CNN Method. IEEE Access 2020, 8, 21804–21812. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, M.C.; Han, S. Automatic identification and classification in lithology based on deep learning in rock images. Acta Petrol. Sin. 2018, 34, 333–342. [Google Scholar]
- Liu, C.Z.; Li, M.C.; Zhang, Y.; Han, S.; Zhu, Y.Q. An Enhanced Rock Mineral Recognition Method Integrating a Deep Learning Model and Clustering Algorithm. Minerals 2019, 9, 516. [Google Scholar] [CrossRef] [Green Version]
- Sun, Y.Q.; Chen, J.P.; Yan, P.B.; Zhong, J.; Sun, Y.X.; Jin, X.Y. Lithology Identification of Uranium-Bearing Sand Bodies Using Logging Data Based on a BP Neural Network. Minerals 2022, 12, 546. [Google Scholar] [CrossRef]
- Cheng, G.J.; Guo, W.H. Rock images classification by using deep convolution neural network. J. Phys. Conf. Ser. 2017, 887, 012089. [Google Scholar] [CrossRef] [Green Version]
- Polat, O.; Polat, A.; Ekici, T. Automatic classification of volcanic rocks from thin section images using transfer learning networks. Neural Comput. Appl. 2021, 33, 11531–11540. [Google Scholar] [CrossRef]
- Ran, X.J.; Xue, L.F.; Zhang, Y.Y.; Liu, Z.Y.; Sang, X.J.; He, J.X. Rock Classification from Field Image Patches Analyzed Using a Deep Convolutional Neural Network. Mathematics 2019, 7, 755. [Google Scholar] [CrossRef] [Green Version]
- Xu, S.T.; Zhou, Y.Z. Artificial intelligence identification of ore minerals under microscope based on deep learning algorithm. Acta Petrol. Sin. 2018, 34, 3244–3252. [Google Scholar]
- Bai, L.; Wei, X.; Liu, Y.; WU, C.; CHEN, L. Rock thin section image recognition and classification based on VGG model. Geol. Bull. China 2019, 38, 2053–2058. [Google Scholar] [CrossRef]
- Tan, X.C.; Zhao, L.Z.; Luo, B.; Jiang, X.F.; Cao, J.; Liu, H.; Li, L.; Wu, X.B.; Nie, Y. Comparison of basic features and origins of oolitic shoal reservoirs between carbonate platform interior and platform margin locations in the Lower Triassic Feixianguan Formation of the Sichuan Basin, southwest China. Petrol. Sci. 2012, 9, 417–428. [Google Scholar] [CrossRef] [Green Version]
- Hollis, C.; Lawrence, D.A.; de Periere, M.D.; Al Darmaki, F. Controls on porosity preservation within a Jurassic oolitic reservoir complex, UAE. Mar. Petrol. Geol. 2017, 88, 888–906. [Google Scholar] [CrossRef] [Green Version]
- Zhu, S.S.; Yang, W.Y.; Lu, B.B.; Huang, G.Y.; Hou, G.S.; Wei, S.P.; Zhang, Y.L. Micro image data set of some rock forming minerals, typical metamorphic minerals and oolitic thin sections. Sci. Data Bank 2020. [Google Scholar] [CrossRef]
- Jiao, L.; Zhang, F.; Liu, F.; Yang, S.; Li, L.; Feng, Z.; Qu, R. A survey of deep learning-based object detection. IEEE Access 2019, 7, 128837–128868. [Google Scholar] [CrossRef]
- Oksuz, K.; Cam, B.C.; Kalkan, S.; Akbas, E. Imbalance problems in object detection: A review. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 3388–3415. [Google Scholar] [CrossRef] [Green Version]
- Wu, X.W.; Sahoo, D.; Hoi, S.C.H. Recent advances in deep learning for object detection. Neurocomputing 2020, 396, 39–64. [Google Scholar] [CrossRef] [Green Version]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.H.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef] [Green Version]
- Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef] [Green Version]
- Kukačka, J.; Golkov, V.; Cremers, D. Regularization for deep learning: A taxonomy. arXiv 2017, arXiv:1710.10686. [Google Scholar]
- Davis, J.; Goadrich, M. The relationship between Precision-Recall and ROC curves. In Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, PA, USA, 25–29 June 2006; pp. 233–240. [Google Scholar]
- Tzutalin. Labellmg. Git Code. 2015. Available online: https://github.com/heartexlabs/labelImg (accessed on 10 March 2022).
- Bubbliiiing. Faster-RCNN-Pytorch. Git Code. 2022. Available online: https://github.com/bubbliiiing/faster-rcnn-pytorch (accessed on 12 March 2022).
- Everingham, M.; Van Gool, L.; Williams, C.K.I.; Winn, J.; Zisserman, A. The Pascal Visual Object Classes (VOC) Challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef] [Green Version]
- Zhu, X.X.; Vondrick, C.; Fowlkes, C.C.; Ramanan, D. Do We Need More Training Data? Int. J. Comput. Vis. 2016, 119, 76–92. [Google Scholar] [CrossRef] [Green Version]
- Warden, P. How Many Images Do You Need to Train a Neural Network? Pete Warden’s Blog. 2017. Available online: https://petewarden.com/2017/12/14/how-many-images-do-you-need-to-train-a-neural-network (accessed on 10 June 2023).
Actual Value: True | Actual Value: False | |
---|---|---|
Predicted values: Positive | TP | FP |
Predicted values: Negative | FN | TN |
Hardware/Software | Series/Version |
---|---|
CPU | [email protected] GHz |
GPU | 2080ti |
DRAM | 32 G |
SSD | 1.5 T |
OS | Windows10 Professional |
Python | 3.7.1 |
Torch | 1.8.1 |
Torchvision | 0.9.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, H.; Cao, W.; Zhou, Y.; Yu, P.; Yang, W. Multitarget Intelligent Recognition of Petrographic Thin Section Images Based on Faster RCNN. Minerals 2023, 13, 872. https://doi.org/10.3390/min13070872
Wang H, Cao W, Zhou Y, Yu P, Yang W. Multitarget Intelligent Recognition of Petrographic Thin Section Images Based on Faster RCNN. Minerals. 2023; 13(7):872. https://doi.org/10.3390/min13070872
Chicago/Turabian StyleWang, Hanyu, Wei Cao, Yongzhang Zhou, Pengpeng Yu, and Wei Yang. 2023. "Multitarget Intelligent Recognition of Petrographic Thin Section Images Based on Faster RCNN" Minerals 13, no. 7: 872. https://doi.org/10.3390/min13070872