Cloud-Based License Plate Recognition: A Comparative Approach Using You Only Look Once Versions 5, 7, 8, and 9 Object Detection
Abstract
:1. Introduction
2. Related Work
3. Methodology
3.1. Dataset
3.2. Data Preprocessing
3.3. Evaluation Metrics Used
4. Training and Validation
4.1. YOLOv5
4.2. Results: YOLOv5 Training and Validation Experiment
4.3. YOLOv7
4.4. Results: YOLOv7 Training and Validation Experiment
4.5. YOLOv8
4.6. Results: YOLOv8 Training and Validation Experiment
4.7. YOLOv9
4.8. Results: YOLOv9 Training and Validation
5. Cloud Deployment
- Vehicle Video Input: In order to record cars as they enter a specific location, such as a parking lot or a gated neighborhood, the system first receives a video input of vehicles. The cloud deployment testing experiment uses a video dataset of moving cars. This dataset can be found in [35]. The data are live-streamed, and the live stream is evaluated using its reported accuracy.
- Kinesis Video Stream on Amazon: Next, the video feed is sent to Amazon Kinesis Video Stream, a service that enables the ingestion of video data in real time. In order to provide smooth streaming to downstream components, this component is in charge of recording, processing, and storing the video feed.
- The YOLO model (versions 5, 7, 8, and 9): The trained YOLO models are used to process frames from the video stream and identify license plates in real time. The different versions of YOLO (v5, v7, v8, v9) are tested to evaluate performance, accuracy, and detection speed. The YOLO model runs on an EC2 instance to detect and extract license plates from each frame.
- Amazon EC2 (Elastic Compute Cloud): Amazon EC2 provides the computational resources necessary to run the YOLO model. It processes each frame for number plate detection and passes relevant information downstream. Detected frames with license plate details are sent to other AWS services for further processing, storage, and retrieval.
- Amazon S3 (Simple Storage Service): Processed images, such as frames containing detected license plates, are stored in Amazon S3 for durability and easy access. This storage serves as a repository for extracted images or frames and allows for further processing, such as Optical Character Recognition (OCR).
- Amazon Textract: Amazon Textract is used to perform OCR (Optical Character Recognition) on the extracted license plates stored in Amazon S3. This service identifies and extracts textual information from the images, enabling the system to read the license plate numbers accurately.
- AWS Lambda Function: AWS Lambda functions as a serverless computer service that orchestrates tasks and automates the workflow. It can be triggered by events, such as new images uploaded to S3, to initiate the Textract OCR process. Lambda can also handle data processing and forward the extracted license plate numbers to other AWS services, such as Amazon SQS and DynamoDB, for further handling.
- Amazon SQS (Simple Queue Service): Amazon SQS acts as a message queue, where processed data (like license plate information) are temporarily stored. This service ensures reliable data delivery and enables the decoupling of system components, handling messages between Lambda functions and databases (DynamoDB).
- Amazon DynamoDB: Amazon DynamoDB is an NoSQL database service that stores processed license plate information. It provides fast access to data for querying and can be used to log license plate data or store metadata about each detected vehicle, such as timestamps or vehicle details.
- Visualization (Local Computer): Finally, the extracted and processed information can be visualized on a local computer or dashboard, allowing users to see real-time data on detected license plates. This visualization can show live updates, reports, or alerts based on the system’s output.
6. Results Analysis
7. Comparative Analysis and Discussion
8. Limitations
9. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Malik, M.I.; Wani, S.H.; Rashid, A. Cloud Computing-Technologies. Int. J. Adv. Res. Comput. Sci. 2018, 9. [Google Scholar] [CrossRef]
- Kaur, P.; Kumar, Y.; Gupta, S. Artificial intelligence techniques for the recognition of multi-plate multi-vehicle tracking systems: A systematic review. Arch. Comput. Methods Eng. 2022, 29, 4897–4914. [Google Scholar] [CrossRef]
- Gupta, S. On Cloud Technologies. 2021. Available online: https://api.semanticscholar.org/CorpusID:235483763 (accessed on 2 September 2024).
- Prasad, N.N. Architecture of Cloud Computing. 2011. Available online: https://api.semanticscholar.org/CorpusID:61215687 (accessed on 2 September 2024).
- Mahesh, S.; Walsh, K.R. Cloud computing as a model. In Encyclopedia of Information Science and Technology, 3rd ed.; IGI Global: Hershey, PA, USA; New York, NY, USA, 2015; pp. 1039–1047. [Google Scholar]
- Tan, W. Development of a Cloud-Based Traffic Diagnosis and Management Laboratory Based on High-Coverage ALPR. Ph.D. Thesis, The University of Wisconsin-Milwaukee, Milwaukee, WI, USA, 2021. [Google Scholar]
- Zhou, L.; Zhang, H.; Zhang, K.; Wang, B.; Shen, D.; Wang, Y. Advances in applying cloud computing techniques for air traffic systems. In Proceedings of the 2020 IEEE 2nd International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Weihai, China, 14–16 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 134–139. [Google Scholar]
- Yan, D.; Hongqing, M.; Jilin, L.; Langang, L. A high performance license plate recognition system based on the web technique. In Proceedings of the ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No. 01TH8585), Oakland, CA, USA, 25–29 August 2001; IEEE: Piscataway, NJ, USA, 2001; pp. 325–329. [Google Scholar]
- Lynch, M. Automated License Plate Recognition (ALPR) System. 2012. Available online: https://www.eff.org/sites/default/files/filenode/20120905_alpr_lasd_system_information.pdf (accessed on 4 September 2024).
- Redmon, J.; Divvala, S.K.; Girshick, R.B.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 779–788. [Google Scholar]
- Redmon, J.; Farhadi, A. YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Vishwakarma, N. Real-Time Object Detection with SSDs: Single Shot MultiBox Detectors. 2023. Available online: https://www.analyticsvidhya.com/blog/2023/11/real-time-object-detection-with-ssds-single-shot-multibox-detectors/ (accessed on 4 September 2024).
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
- Bochkovskiy, A. Yolov4: Optimal speed and accuracy of object detection. arXiv 2020, arXiv:2004.10934. [Google Scholar]
- Chan, W.J. Artificial Intelligence for Cloud-Assisted Object Detection. Ph.D. Thesis, UTAR, Kampar, Malaysia, 2023. [Google Scholar]
- Abdellatif, M.M.; Elshabasy, N.H.; Elashmawy, A.E.; AbdelRaheem, M. A low cost IoT-based Arabic license plate recognition model for smart parking systems. Ain Shams Eng. J. 2023, 14, 102178. [Google Scholar] [CrossRef]
- Panganiban, C.F.G.; Sandoval, C.F.L.; Festin, C.A.M.; Tan, W.M. Enhancing real-time license plate recognition through edge-cloud computing. In Proceedings of the TENCON 2022—2022 IEEE Region 10 Conference (TENCON), Hong Kong China, 1–4 November 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–6. [Google Scholar]
- Laoula, E.M.B.; Elfahim, O.; El Midaoui, M.; Youssfi, M.; Bouattane, O. Multi-agent cloud based license plate recognition system. Int. J. Electr. Comput. Eng. 2024, 14, 4590. [Google Scholar]
- Chen, Y.L.; Chen, T.S.; Huang, T.W.; Yin, L.C.; Wang, S.Y.; Chiueh, T.c. Intelligent urban video surveillance system for automatic vehicle detection and tracking in clouds. In Proceedings of the 2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA), Barcelona, Spain, 25–28 March 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 814–821. [Google Scholar]
- Kamarozaman, M.H.B.; Syafeeza, A.; Wong, Y.; Hamid, N.A.; Saad, W.H.M.; Samad, A.S.A. Enhancing Campus Security And Vehicle Management with Real-Time Mobile License Plate Reader App Utilizing A Lightweight Integration Model. J. Eng. Sci. Technol. 2024, 19, 1672–1692. [Google Scholar]
- Car License Plate Detection. 2020. Available online: https://www.kaggle.com/datasets/andrewmvd/car-plate-detection (accessed on 15 August 2024).
- Pavithra, M.; Karthikesh, P.S.; Jahnavi, B.; Navyalokesh, M.; Krishna, K.L. Implementation of Enhanced Security System using Roboflow. In Proceedings of the 2024 11th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 14–15 March 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–5. [Google Scholar]
- Qing, Y.; Liu, W.; Feng, L.; Gao, W. Improved Yolo network for free-angle remote sensing target detection. Remote Sens. 2021, 13, 2171. [Google Scholar] [CrossRef]
- Torgo, L.; Ribeiro, R. Precision and recall for regression. In Proceedings of the Discovery Science: 12th International Conference, DS 2009, Porto, Portugal, 3–5 October 2009; Springer: Berlin/Heidelberg, Germany, 2009; pp. 332–346. [Google Scholar]
- Chum, O.; Philbin, J.; Sivic, J.; Isard, M.; Zisserman, A. Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval. In Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, Rio De Janeiro, Brazil, 14–21 October 2007; pp. 1–8. [Google Scholar]
- Flach, P.; Kull, M. Precision-recall-gain curves: PR analysis done right. Adva. Neural Inf. Process. Syst. 2015, 28. Available online: https://papers.nips.cc/paper/5867-precision-recall-gain-curves-pr-analysis-done-right (accessed on 15 August 2024).
- Wu, S.; McClean, S.I. Information Retrieval Evaluation with Partial Relevance Judgment. In Proceedings of the British National Conference on Databases, Belfast, Northern Ireland, UK, 18–20 July 2006. [Google Scholar]
- Rezatofighi, S.H.; Tsoi, N.; Gwak, J.; Sadeghian, A.; Reid, I.D.; Savarese, S. Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 658–666. [Google Scholar]
- Ding, X.; Zhang, X.; Ma, N.; Han, J.; Ding, G.; Sun, J. Repvgg: Making vgg-style convnets great again. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 21–25 June 2021; pp. 13733–13742. [Google Scholar]
- Huang, L.; Huang, W. RD-YOLO: An effective and efficient object detector for roadside perception system. Sensors 2022, 22, 8097. [Google Scholar] [CrossRef] [PubMed]
- Liu, Z.; Lv, H. YOLO_Bolt: A lightweight network model for bolt detection. Sci. Rep. 2024, 14, 656. [Google Scholar] [CrossRef] [PubMed]
- Chien, C.T.; Ju, R.Y.; Chou, K.Y.; Chiang, J.S. YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images. arXiv 2024, arXiv:2403.11249. [Google Scholar] [CrossRef]
- Qin, H.; Gong, R.; Liu, X.; Shen, M.; Wei, Z.; Yu, F.; Song, J. Forward and Backward Information Retention for Accurate Binary Neural Networks. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 2247–2256. [Google Scholar]
- Pexels. Traffic Flow in the Highway. 2024. Available online: https://www.pexels.com/video/traffic-flow-in-the-highway-2103099/ (accessed on 26 September 2024).
Model | Architecture | Speed | Accuracy | Training Time |
---|---|---|---|---|
YOLO [11] | One-stage detector | Fast | High | Faster |
Faster R-CNN [12] | Two-stage detector | Slower | High | Slower |
SSD [13] | One-stage detector | Moderate | Moderately | Data |
RetinaNet [14] | One-stage detector | Slow | High | Moderate |
Epoch | Loss | Precision | Recall | [email protected] | [email protected]:0.95 |
---|---|---|---|---|---|
0 | 0.0130 | 0.031317 | 0.40476 | 0.084282 | 0.021526 |
10 | 0.0060 | 0.8254 | 0.80952 | 0.81151 | 0.34709 |
20 | 0.0060 | 0.77676 | 0.72619 | 0.77403 | 0.37161 |
30 | 0.0058 | 0.78992 | 0.80578 | 0.82009 | 0.41581 |
40 | 0.0060 | 0.82605 | 0.71429 | 0.82677 | 0.43019 |
50 | 0.0061 | 0.83887 | 0.77381 | 0.83093 | 0.4513 |
60 | 0.0060 | 0.7892 | 0.82143 | 0.81796 | 0.41443 |
70 | 0.0058 | 0.8272 | 0.7381 | 0.82164 | 0.42238 |
80 | 0.0061 | 0.87129 | 0.67857 | 0.78743 | 0.43548 |
90 | 0.0060 | 0.80601 | 0.7619 | 0.82293 | 0.4379 |
99 | 0.0063 | 0.82638 | 0.75 | 0.81813 | 0.4402 |
Average | 0.7462 | 0.7258 | |||
F-1 Score | 0.7359 |
Epoch | Loss | Precision | Recall | [email protected] | [email protected]:0.95 |
---|---|---|---|---|---|
0 | 0.0119 | 0.02136 | 0.02381 | 0.0007604 | 0.0001638 |
10 | 0.0080 | 0.5871 | 0.5586 | 0.5173 | 0.2139 |
20 | 0.0051 | 0.7219 | 0.649 | 0.6419 | 0.2812 |
30 | 0.0064 | 0.725 | 0.5357 | 0.5362 | 0.2491 |
40 | 0.0054 | 0.794 | 0.6429 | 0.7117 | 0.3371 |
50 | 0.0051 | 0.8049 | 0.7857 | 0.7428 | 0.3793 |
60 | 0.0061 | 0.7749 | 0.7381 | 0.7598 | 0.3899 |
70 | 0.0061 | 0.7527 | 0.7976 | 0.7838 | 0.4023 |
80 | 0.0063 | 0.8332 | 0.7738 | 0.8017 | 0.4002 |
90 | 0.0061 | 0.7896 | 0.8095 | 0.7848 | 0.3959 |
99 | 0.0061 | 0.8441 | 0.7738 | 0.804 | 0.4117 |
Average | 0.6953 | 0.6444 | |||
F-1 score | 0.6689 |
Epoch | Loss | Recall | Precision | [email protected] | [email protected]:0.95 |
---|---|---|---|---|---|
0 | 0.1366 | 0.40476 | 0.15566 | 0.06434 | 2.045 |
10 | 0.0062 | 0.66667 | 0.66313 | 0.28776 | 2.0036 |
20 | 0.0056 | 0.63095 | 0.65044 | 0.31496 | 1.9185 |
30 | 0.0058 | 0.6667 | 0.73086 | 0.34433 | 1.8627 |
40 | 0.0060 | 0.78571 | 0.77806 | 0.40325 | 1.8364 |
50 | 0.0061 | 0.7421 | 0.78221 | 0.37707 | 1.9089 |
60 | 0.0060 | 0.71824 | 0.80047 | 0.38016 | 1.7991 |
70 | 0.0061 | 0.7619 | 0.80203 | 0.38201 | 1.9788 |
80 | 0.0.0060 | 0.75389 | 0.81453 | 0.40935 | 1.8239 |
90 | 0.0060 | 0.7381 | 0.78346 | 0.38291 | 1.9522 |
100 | 0.0063 | 0.77381 | 0.83843 | 0.40394 | 1.8727 |
Average | 0.6948 | 0.7090 | |||
F-1 score | 0.7018 |
Epoch | Loss | Recall | Precision | [email protected] | [email protected]:0.95 |
---|---|---|---|---|---|
10 | 0.0062 | 0.369 | 0.0294 | 0.00570 | 0 |
20 | 0.0059 | 0.285 | 0.2608 | 0.0005 | 0 |
30 | 0.0058 | 0.4167 | 0.373 | 0.137 | 0 |
40 | 0.0060 | 0.5357 | 0.549 | 0.211 | 0 |
50 | 0.0057 | 0.5966 | 0.6077 | 0.266 | 0 |
60 | 0.0059 | 0.559 | 0.630 | 0.2812 | 0 |
70 | 0.0061 | 0.6071 | 0.684 | 0.2920 | 0 |
80 | 0.0060 | 0.640 | 0.663 | 0.2975 | 0 |
90 | 0.0061 | 0.6941 | 0.7260 | 0.3487 | 0 |
99 | 0.0063 | 0.6215 | 0.7260 | 0.3465 | 0 |
Average | 0.5325 | 0.5287 | |||
F-1 score | 0.5287 |
YOLO Version | Accuracy: Validation (%) | Accuracy: Testing on Cloud (%) |
---|---|---|
YOLOv5 | 83 | 71 |
YOLOv7 | 84 | 52 |
YOLOv8 | 83 | 78 |
YOLOv9 | 73 | 70 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Asaju, C.B.; Owolawi, P.A.; Tu, C.; Wyk, E.V. Cloud-Based License Plate Recognition: A Comparative Approach Using You Only Look Once Versions 5, 7, 8, and 9 Object Detection. Information 2025, 16, 57. https://doi.org/10.3390/info16010057
Asaju CB, Owolawi PA, Tu C, Wyk EV. Cloud-Based License Plate Recognition: A Comparative Approach Using You Only Look Once Versions 5, 7, 8, and 9 Object Detection. Information. 2025; 16(1):57. https://doi.org/10.3390/info16010057
Chicago/Turabian StyleAsaju, Christine Bukola, Pius Adewale Owolawi, Chuling Tu, and Etienne Van Wyk. 2025. "Cloud-Based License Plate Recognition: A Comparative Approach Using You Only Look Once Versions 5, 7, 8, and 9 Object Detection" Information 16, no. 1: 57. https://doi.org/10.3390/info16010057
APA StyleAsaju, C. B., Owolawi, P. A., Tu, C., & Wyk, E. V. (2025). Cloud-Based License Plate Recognition: A Comparative Approach Using You Only Look Once Versions 5, 7, 8, and 9 Object Detection. Information, 16(1), 57. https://doi.org/10.3390/info16010057