Optimized YOLOv5 Architecture for Superior Kidney Stone Detection in CT Scans
Abstract
:1. Introduction
- By integrating the squeeze-and-excitation (SE) block within the C3 block of the YOLOv5 architecture, the proposed model significantly improves the recalibration of channel-wise dependencies, thereby enhancing the network’s ability to capture and differentiate intricate feature relationships. This leads to better detection accuracy and reliability in identifying kidney stones in CT images.
- The proposed YOLOv5m modification achieves a balanced performance in terms of model size, inference speed, and detection accuracy. With an inference speed of 8.2 ms per image and a model size of approximately 41 MB, it offers a viable solution for real-time medical applications requiring precise object detection without compromising speed.
- The use of a modified CSPDarknet53 as the backbone network enhanced the feature extraction efficiency. The incorporation of cross-stage partial (CSP) connections optimizes learning efficiency, reduces model size, and improves the overall detection capability across different scales.
- The integration of attention mechanisms into the YOLOv5m architecture enables the model to focus on the most pertinent parts of the input images. This selective attention enhances detection accuracy by allowing the model to better differentiate between significant and insignificant features within the images.
- The proposed model outperformed the standard YOLOv5 variants (nano-sized, small, and medium) in key performance metrics such as precision, recall, and mean average precision (mAP). This superior performance highlights its efficacy in detecting kidney stones, making it a suitable choice for medical imaging applications.
- The use of bilateral filtering for noise reduction ensures the preservation of critical features and sharpness in CT images, which are essential for accurate kidney stone detection. In addition, data augmentation techniques enhance the diversity and robustness of the training dataset, contributing to improved model performance.
- The proposed model employs a different color approach to improve the clarity of the kidney stone detection results. Using uniquely colored bounding boxes for closely located stones resolves potential overlap issues and facilitates a better analysis and understanding of detection performance.
2. Related Works
3. Proposed Methodology
3.1. YOLOv5m
3.2. C3 Block
3.3. Proposed Model
4. Experiments
4.1. Dataset
4.2. Evaluation Metrics
4.3. Experimental Setup
4.4. Experimental Result and Analysis
4.5. Different Coloring Approaches
4.6. Comparison
5. Discussion
6. Conclusions
7. Additional Information
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Akram, M.; Jahrreiss, V.; Skolarikos, A.; Geraghty, R.; Tzelves, L.; Emilliani, E.; Davis, N.F.; Somani, B.K. Urological guidelines for kidney stones: Overview and comprehensive update. J. Clin. Med. 2024, 13, 1114. [Google Scholar] [CrossRef] [PubMed]
- Jebir, R.M.; Mustafa, Y.F. Kidney stones: Natural remedies and lifestyle modifications to alleviate their burden. Int. Urol. Nephrol. 2024, 56, 1025–1033. [Google Scholar] [CrossRef] [PubMed]
- Cheraghian, B.; Meysam, A.; Hashemi, S.J.; Hosseini, S.A.; Malehi, A.S.; Khazaeli, D.; Rahimi, Z. Kidney stones and dietary intake in adults: A population-based study in southwest Iran. BMC Public Health 2024, 24, 955. [Google Scholar] [CrossRef]
- Ahmed, F.; Abbas, S.; Athar, A.; Shahzad, T.; Khan, W.A.; Alharbi, M.; Khan, M.A.; Ahmed, A. Identification of kidney stones in KUB X-ray images using VGG16 empowered with explainable artificial intelligence. Sci. Rep. 2024, 14, 6173. [Google Scholar]
- Liu, H.; Ghadimi, N. Hybrid convolutional neural network and Flexible Dwarf Mongoose Optimization Algorithm for strong kidney stone diagnosis. Biomed. Signal Process. Control. 2024, 91, 106024. [Google Scholar] [CrossRef]
- Muksimova, S.; Umirzakova, S.; Mardieva, S.; Cho, Y.I. Enhancing medical image denoising with innovative teacher–student model-based approaches for precision diagnostics. Sensors 2023, 23, 9502. [Google Scholar] [CrossRef]
- Muksimova, S.; Umirzakova, S.; Kang, S.; Im Cho, Y. CerviLearnNet: Advancing cervical cancer diagnosis with reinforcement learning-enhanced convolutional networks. Heliyon 2024, 10, e29913. [Google Scholar] [CrossRef]
- Mardieva, S.; Ahmad, S.; Umirzakova, S.; Rasool, M.A.; Whangbo, T.K. Lightweight image super-resolution for IoT devices using deep residual feature distillation network. Knowl.-Based Syst. 2024, 285, 111343. [Google Scholar] [CrossRef]
- Dangle, P.; Tasian, G.E.; Chu, D.I.; Shannon, R.; Spiardi, R.; Xiang, A.H.; Jadcherla, A.; Arenas, J.; Ellison, J.S. A systematic scoping review of comparative effectiveness studies in kidney stone disease. Urology 2024, 183, 3–10. [Google Scholar] [CrossRef]
- Umirzakova, S.; Ahmad, S.; Mardieva, S.; Muksimova, S.; Whangbo, T.K. Deep learning-driven diagnosis: A multi-task approach for segmenting stroke and Bell’s palsy. Pattern Recognit. 2023, 144, 109866. [Google Scholar] [CrossRef]
- Pan, W.; Yun, T.; Ouyang, X.; Ruan, Z.; Zhang, T.; An, Y.; Wang, R.; Zhu, P. A blood-based multi-omic landscape for the molecular characterization of kidney stone disease. Mol. Omics 2024, 20, 322–332. [Google Scholar] [CrossRef] [PubMed]
- Zhang, M.; Ye, Z.; Yuan, E.; Lv, X.; Zhang, Y.; Tan, Y.; Xia, C.; Tang, J.; Huang, J.; Li, Z. Imaging-based deep learning in kidney diseases: Recent progress and future prospects. Insights Into Imaging 2024, 15, 50. [Google Scholar] [CrossRef] [PubMed]
- Sudharson, S.; Kokil, P. Computer-aided diagnosis system for the classification of multi-class kidney abnormalities in the noisy ultrasound images. Comput. Methods Programs Biomed. 2021, 205, 106071. [Google Scholar] [CrossRef] [PubMed]
- Baygin, M.; Yaman, O.; Barua, P.D.; Dogan, S.; Tuncer, T.; Acharya, U.R. Exemplar Darknet19 feature generation technique for automated kidney stone detection with coronal CT images. Artif. Intell. Med. 2022, 127, 102274. [Google Scholar] [CrossRef]
- Chiou, T.; Meagher, M.F.; Berger, J.H.; Chen, T.T.; Sur, R.L.; Bechis, S.K. Software-estimated stone volume is better predictor of spontaneous passage for acute nephrolithiasis. J. Endourol. 2023, 37, 85–92. [Google Scholar] [CrossRef]
- Patro, K.K.; Allam, J.P.; Neelapu, B.C.; Tadeusiewicz, R.; Acharya, U.R.; Hammad, M.; Yildirim, O.; Pławiak, P. Application of Kronecker convolutions in deep learning technique for automated detection of kidney stones with coronal CT images. Inf. Sci. 2023, 640, 119005. [Google Scholar] [CrossRef]
- Xu, W.; Lai, C.; Mo, Z.; Liu, C.; Li, M.; Zhao, G.; Xu, K. Clinical-Inspired Framework for Automatic Kidney Stone Recognition and Analysis on Transverse CT Images. IEEE J. Biomed. Health Inform. 2024, 1–12. [Google Scholar] [CrossRef] [PubMed]
- Kilic, U.; Karabey Aksakalli, I.; Tumuklu Ozyer, G.; Aksakalli, T.; Ozyer, B.; Adanur, S. Exploring the Effect of Image Enhancement Techniques with Deep Neural Networks on Direct Urinary System (DUSX) Images for Automated Kidney Stone Detection. Int. J. Intell. Syst. 2023, 2023, 3801485. [Google Scholar] [CrossRef]
- Bayram, A.F.; Gurkan, C.; Budak, A.; Karataş, H. A detection and prediction model based on deep learning assisted by explainable artificial intelligence for kidney diseases. Avrupa Bilim Ve Teknol. Derg. 2022, 40, 67–74. [Google Scholar]
- Tahir, F.S.; Abdulrahman, A.A. Kidney stones detection based on deep learning and discrete wavelet transform. Indones. J. Electr. Eng. Compu. Sci. 2023, 31, 1829. [Google Scholar] [CrossRef]
- Chaki, J.; Ucar, A. An efficient and robust approach using inductive transfer-based ensemble deep neural networks for kidney stone detection. IEEE Access 2024, 12, 32894–32910. [Google Scholar] [CrossRef]
- Asif, S.; Zheng, X.; Zhu, Y. An optimized fusion of deep learning models for kidney stone detection from CT images. J. King Saud Univ.-Comput. Inf. Sci. 2024, 36, 102130. [Google Scholar] [CrossRef]
- Kumar, P.; Singh, D.; Samagh, J.S. A Hybrid Model for Kidney Stone Detection Using Deep Learning. IJSTM 2024, 13, 65–85. [Google Scholar]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar]
- TEZ. “TEZ_ROI_AUG Dataset”. Roboflow Universe, Roboflow, April 2023. Available online: https://universe.roboflow.com/tez-nwkf5/tez_roi_aug (accessed on 3 August 2024).
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Tan, M.; Pang, R.; Le, Q.V. Efficientdet: Scalable and efficient object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020; pp. 10781–10790. [Google Scholar]
- Ross, T.Y.; Dollár, G.K.H.P. Focal loss for dense object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2980–2988. [Google Scholar]
- Duan, K.; Bai, S.; Xie, L.; Qi, H.; Huang, Q.; Tian, Q. Centernet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6569–6578. [Google Scholar]
YOLOv5n | YOLOv5s | YOLOv5m | YOLOv5l | YOLOv5X |
---|---|---|---|---|
4 MB | 14 MB | 41 MB | 89 MB | 166 MB |
6.3 ms | 6.4 ms | 8.2 ms | 10.1 ms | 12.1 ms |
28.4 mAP | 37.2 mAP | 45.2 mAP | 48.8 mAP | 50.7 mAP |
Models | Train Box Loss | Train Object Loss | Train Class Loss | Val Box Loss | Val Object Loss | Val Class Loss |
---|---|---|---|---|---|---|
YOLOv5n (nano-sized) | 0.0723 | 0.0093 | 1.3021 | 0.0821 | 0.0085 | 1.1245 |
YOLOv5s (small) | 0.0671 | 0.0096 | 1.0122 | 0.0799 | 0.0088 | 0.9653 |
YOLOv5m (medium) | 0.0624 | 0.0084 | 0.9863 | 0.0785 | 0.0084 | 0.9403 |
Ours | 0.0607 | 0.0076 | 0.9746 | 0.0767 | 0.0079 | 0.9298 |
Models | Precision | Recall | [email protected] | Params | Flops(G) | Epochs |
---|---|---|---|---|---|---|
YOLOv5n (nano-sized) | 0.719 | 0.578 | 0.567 | 17 | 4.1 | 50 |
YOLOv5s (small) | 0.772 | 0.604 | 0.617 | 19 | 28.9 | 50 |
YOLOv5m (medium) | 0.808 | 0.628 | 0.655 | 21 | 47.9 | 50 |
Ours | 0.816 | 0.637 | 0.664 | 20.3 | 48.1 | 50 |
Model | Precision | Recall | [email protected] | Inference Time (ms) | Model Size (MB) |
---|---|---|---|---|---|
Faster R-CNN | 0.785 | 0.612 | 0.628 | 15.4 | 148 |
EfficientDet (D2) | 0.799 | 0.619 | 0.640 | 13.0 | 52 |
RetinaNet | 0.782 | 0.605 | 0.635 | 14.1 | 80 |
CenterNet | 0.804 | 0.625 | 0.649 | 12.7 | 70 |
YOLOv5n (Nano-Sized) | 0.719 | 0.578 | 0.567 | 6.3 | 17 |
YOLOv5s (Small) | 0.772 | 0.604 | 0.617 | 6.4 | 19 |
YOLOv5m (Medium) | 0.808 | 0.628 | 0.655 | 8.2 | 41 |
Proposed Model (Ours) | 0.816 | 0.637 | 0.664 | 8.2 | 41 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Abdimurotovich, K.A.; Cho, Y.-I. Optimized YOLOv5 Architecture for Superior Kidney Stone Detection in CT Scans. Electronics 2024, 13, 4418. https://doi.org/10.3390/electronics13224418
Abdimurotovich KA, Cho Y-I. Optimized YOLOv5 Architecture for Superior Kidney Stone Detection in CT Scans. Electronics. 2024; 13(22):4418. https://doi.org/10.3390/electronics13224418
Chicago/Turabian StyleAbdimurotovich, Khasanov Asliddin, and Young-Im Cho. 2024. "Optimized YOLOv5 Architecture for Superior Kidney Stone Detection in CT Scans" Electronics 13, no. 22: 4418. https://doi.org/10.3390/electronics13224418