Detection of Surface Defects of Barrel Media Based on PaE-VGG Model
Abstract
:1. Introduction
- (i)
- CNNs are favored for their focus on local relationships, better hardware support, and ease of training. Therefore, the backbone network for identification of surface defects of barrel media is proposed as a VGG-based model.
- (ii)
- To overcome the limitation that traditional convolutional neural networks have in terms of limited receptive fields, we propose the PaCC technique, which enhances the model’s ability to comprehend defect positions. This is achieved by leveraging circular convolution to capture features that are dependent on the position. Here, base instance kernels and position-embedding strategies are employed to manage variations in input size and to incorporate location information into the output feature maps, respectively.
- (iii)
- To ascertain the significance of each feature map channel and enhance those channels that are beneficial to the current task, we have employed an Efficient Channel Attention (ECA) [17]. This mechanism evaluates the contribution of each channel and optimizes the feature representation by adjusting their weights accordingly, thereby enhancing the performance of the PaE-VGG network.
2. Related Work
3. Proposed PaE-VGG Model for Surface Defect Recognition
3.1. VGG-Based Model
3.2. ParC Residual Block
- (1)
- Horizontal PaCC (PaCC-H): PaCC-H focuses on the horizontal rows of the image, that is, the pixels in the same row.
- (2)
- Vertical PaCC (PaCC-V): This type of convolution processes the vertical columns of the image; that is, it focuses on the pixels in the same column.
- (3)
- When the input feature map information is processed through the PaCC-H and PaCC-V directions, respectively, we obtain and . Subsequently, by concatenating and along the channel direction, the dimensionality of the output image is expanded.
3.3. Efficient Channel Attention
4. Dataset and Model Parameters for PaE-VGG Model
4.1. Dataset and Data Augmentation
4.2. Matrices for Model Analyses
- Accuracy is a commonly utilized measure for evaluating the efficacy of models, measuring the proportion of correct detections out of the total detections made [25].
- Recall is a measurement used to evaluate the effectiveness of detection when faced with class imbalance. Mathematically, it is calculated as the number of correctly identified positive cases divided by the sum of true positives (TP) and false negatives (FN) [26]:
- F1-Score is a harmonic mean of precision and recall scores [27]:
4.3. Hyperparameters for Considered PaE-VGG Models
5. Results and Discussion for PaE-VGG Model
5.1. Ablation Experiment
5.2. Comparison with Other Models
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
PaCC | Position-Aware Cyclic Convolution |
ECA | Efficient Channel Attention |
CNN | Convolutional Neural Networks |
ViT | Vision Transformer |
PRB | PaCC Residual Block |
PaCC-H | Horizontal PaCC |
PaCC-V | Vertical PaCC |
PE | Instance Position Encoding |
GAP | Global Average Pooling |
References
- Yang, S.; Li, W.; Chen, H. Surface Finishing Theory and New Technology; Springer: Berlin/Heidelberg, Germany, 2018; pp. 65–224. [Google Scholar]
- Tang, B.; Kong, J.; Wu, S. Review of surface defect detection based on machine vision. J. Image Graph. 2017, 22, 1640–1663. [Google Scholar]
- Konovalenko, I.; Maruschak, P.; Brezinová, J. Steel Surface Defect Classification Using Deep Residual Neural Network. Metals 2020, 10, 846. [Google Scholar] [CrossRef]
- Jia, P.; Tian, J.; Yang, Y. Defect Detection Method of Abrasive Block based on Machine Vision. Diam. Abrasives Eng. 2021, 41, 76–82. [Google Scholar]
- Park, J.; Kwon, B.; Park, J. Machine learning-based imaging system for surface defect inspection. Manufact 2016, 3, 303–310. [Google Scholar] [CrossRef]
- Zhang, C.; Nateghinia, E.; Miranda-Moreno, L.; Sun, L. Pavement distress detection using convolutional neural network (CNN): A case study in Montreal, Canada. Int. J. Transp. Sci. Technol. 2022, 11, 298–309. [Google Scholar] [CrossRef]
- Ehtisham, R.; Qayyum, W.; Camp, C.; Plevris, V.; Mir, J.; Khan, Q.; Ahmad, A. Computing the characteristics of defects in wooden structures using image processing and CNN. Autom. Constr. 2024, 158, 105211. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Touvron, H.; Cord, M.; Douze, M.; Massa, F.; Sablayrolles, A.; e J’egou, H. Training data-efficient image transformers and distillation through at tention. In Proceedings of the International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 10347–10357. [Google Scholar]
- Zhou, H.; Yang, R.; Hu, R.; Shu, C.; Tang, X.; Li, X. ETDNet: Efficient Transformer-Based Detection Network for Surface Defect Detection. IEEE Trans. Instrum. Meas. 2023, 72, 2525014. [Google Scholar] [CrossRef]
- Singh, S.; Kumar, A.; Desai, K. Comparative assessment of common pre-trained CNNs for vision-based surface defect detection of machined components. Expert Syst. Appl. 2023, 218, 119623. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.; Kaiser, L.; Polosukhin, I. An Algorithm for Detecting Surface Defects in Industrial Strip Steel based on Receptive Field and Feature Information Supplementation. In Proceedings of the 2024 27th International Conference on Computer Supported Cooperative Work in Design, Tianjin, China, 8–10 May 2024; pp. 3078–3085. [Google Scholar]
- Tammina, S. Transfer learning using VGG-16 with Deep Convolutional Neural Network for Classifying Images. Int. J. Sci. Res. Publ. 2019, 9, 143–150. [Google Scholar] [CrossRef]
- Yang, H.; Ni, J.; Gao, J.; Han, Z.; Luan, T. A novel method for peanut variety identification and classification by Improved VGG16. Sci. Rep. 2021, 11, 15756. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
- Zhang, H.; Hu, W.; Wang, X. ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer. In Proceedings of the ECCV, Tel Aviv, Israel, 23–27 October 2022. [Google Scholar]
- Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient channel attention for deep convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 11534–11542. [Google Scholar]
- Şeker, A. Evaluation of fabric defect detection based on transfer learning with pre-trained AlexNet. In Proceedings of the 2018 International Conference on Artificial Intelligence and Data Processing (IDAP), Malatya, Turkey, 28–30 September 2018; pp. 1–4. [Google Scholar]
- Xue, Y.; Wang, L.; Zhang, Y.; Shen, Q. Defect detection method of apples based on GoogLeNet deep transfer learning. Nongye Jixie Xuebao/Trans. Chin. Soc. Agric. Mach. 2020, 51. [Google Scholar]
- Sabeenian, R.; Paul, E.; Prakash, C. Fabric defect detection and classification using modified VGG network. J. Text. Inst. 2023, 114, 1032–1040. [Google Scholar] [CrossRef]
- Gao, M.; Song, P.; Wang, F.; Liu, J.; Mandelis, A.; Qi, D. A novel deep convolutional neural network based on ResNet-18 and transfer learning for detection of wood knot defects. J. Sens. 2021, 1–16. [Google Scholar]
- Tridgell, S.; Kumm, M.; Hardieck, M.; Boland, D.; Moss, D.; Zipf, P.; Leong, P. Unrolling ternary neural networks. ACM Trans. Reconfigurable Technol. Syst. (TRETS) 2019, 12, 1–23. [Google Scholar]
- Sengupta, A.; Ye, Y.; Wang, R.; Liu, C.; Roy, K. Going deeper in spiking neural networks: VGG and residual architectures. Front. Neurosci. 2019, 13, 95. [Google Scholar] [CrossRef]
- Wang, S.; Fernandes, S.; Zhu, Z.; Zhang, Y. AVNC: Attention-based VGG-style network for COVID-19 diagnosis by CBAM. IEEE Sens. J. 2021, 22, 17431–17438. [Google Scholar]
- Targ, S.; Almeida, D.; Lyman, K. Resnet in resnet: Generalizing residual architectures. arXiv 2016, arXiv:1603.08029. [Google Scholar]
- Chen, C.; Azman, A. Improved Deep Learning Model for Workpieces of Rectangular Pipeline Surface Defect Detection. Computers 2024, 13, 30. [Google Scholar] [CrossRef]
- Dwivedi, D.; Babu, K.; Yemula, P.; Chakraborty, P.; Pal, M. Identification of surface defects on solar pv panels and wind turbine blades using attention based deep learning model. Eng. Appl. Artif. Intell. 2024, 131, 107836. [Google Scholar]
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Dučinskas, K.; Karaliutė, M.; Šaltytė Vaisiauskė, L. Spatially Weighted Bayesian Classification of Spatio-Temporal Areal Data Based on Gaussian-Hidden Markov Models. Mathematics 2023, 11, 347. [Google Scholar] [CrossRef]
Type of Image | Number of Images |
---|---|
Normal | 2037 |
Concave | 417 |
Dark spots | 1473 |
Sintering marks | 93 |
Damage | 1638 |
Abnormal shape | 183 |
Adhesion | 774 |
Total | 6615 |
Hyperparameters | PaE-VGG Model |
---|---|
Batch Size | 32 |
Number of Epochs | 70 |
Optimizer | Adam |
Initial Learning Rate | 0.001 |
Loss Function | Cross Entropy |
Dropout | 0.25 |
Configuration/Variant | ECA | ParC | Accuracy (%) |
---|---|---|---|
VGG16_BN | No | No | 89.61 |
ECA-VGG | Yes | No | 91.88 |
ParC-VGG | No | Yes | 93.41 |
PaE-VGG | Yes | Yes | 94.37 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Peng, H.; Cheng, L.; Tian, J. Detection of Surface Defects of Barrel Media Based on PaE-VGG Model. Mathematics 2025, 13, 1104. https://doi.org/10.3390/math13071104
Peng H, Cheng L, Tian J. Detection of Surface Defects of Barrel Media Based on PaE-VGG Model. Mathematics. 2025; 13(7):1104. https://doi.org/10.3390/math13071104
Chicago/Turabian StylePeng, Hongli, Long Cheng, and Jianyan Tian. 2025. "Detection of Surface Defects of Barrel Media Based on PaE-VGG Model" Mathematics 13, no. 7: 1104. https://doi.org/10.3390/math13071104
APA StylePeng, H., Cheng, L., & Tian, J. (2025). Detection of Surface Defects of Barrel Media Based on PaE-VGG Model. Mathematics, 13(7), 1104. https://doi.org/10.3390/math13071104