MDPI - Publisher of Open Access Journals

7 pages, 17245 KB

Open Access Proceeding Paper

Image Classification of Asiatic Parakeets Using YOLOv5 and Residual Network 50

by Terenz Ace C. Flores, Francis Danielle G. Luna and Jocelyn F. Villaverde

Eng. Proc. 2026, 134(1), 70; https://doi.org/10.3390/engproc2026134070 - 22 Apr 2026

Viewed by 366

Parakeets exhibit many similar traits across species, with only subtle differences in features and coloration used for classification, which complicates detection and identification for birdwatchers, breeders, and researchers. Traditional classification methods rely on observation, while more expensive options involve DNA sampling. We developed a bird classification system that identifies Asiatic parakeets by combining You Only Look Once Version 5 (YOLOv5) for detection with ResNet-50 for the classification of four specific species: Alexandrine, Moustache, Plum-headed, and Indian Ringneck parakeets. The system uses a Raspberry Pi 4B and a Raspberry Pi Camera housed in a customized enclosure to capture images of the birds. Evaluation with a multi-class confusion matrix indicated an overall accuracy of 95.05%, demonstrating the effectiveness of the system as a reliable tool for avian identification and research.
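
As a rough illustration of how an overall accuracy figure is usually read off a multi-class confusion matrix, the sketch below divides the diagonal (correct predictions) by the total count. The function name `overall_accuracy` and the example counts are hypothetical and are not taken from the paper.

```python
def overall_accuracy(confusion):
    """Return the share of samples on the diagonal of a square confusion matrix."""
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


# Hypothetical counts (rows: actual class, columns: predicted class).
example = [
    [8, 1, 1],
    [0, 9, 1],
    [2, 0, 8],
]
print(f"overall accuracy: {overall_accuracy(example):.2%}")
```

Per-class figures would be computed separately, for example from each row or column of the same matrix.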

(This article belongs to the Proceedings of The 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025))

20 pages, 39376 KB

Open Access Proceeding Paper

AI-Powered Real-Time Image Recognition System with a Laser-Based Deterrent for Primate Pest Control in Orchards

by Sung-Wen Wang, Shih-Ming Cho, Min-Chie Chiu and Shao-Chun Chen

Eng. Proc. 2026, 134(1), 65; https://doi.org/10.3390/engproc2026134065 - 21 Apr 2026

Viewed by 676

Abstract

We developed an automated system to address orchard crop damage caused by Formosan macaques, a problem where traditional deterrent methods have proven to be ineffective. The system integrates an Internet Protocol camera with a You Only Look Once version 5 (YOLOv5) object detection model, which was trained on an augmented 6000-image dataset featuring a simulated monkey puppet in an indoor setting to validate its real-time identification capability through simulation. Upon target detection, a high-power laser, controlled via the Message Queuing Telemetry Transport protocol, is actuated to perform dynamic and non-invasive repelling. A web-based Human–Machine Interface (HMI) is provided, allowing users to remotely monitor and adjust strategies. This system offers a low-cost, highly efficient, and scalable solution for smart agriculture, with potential for expansion to other scenarios requiring a high degree of security and defense, such as warehouses and construction sites.

(This article belongs to the Proceedings of The 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025))

6 pages, 1809 KB

Open Access Proceeding Paper

Real-Time Classification of Guinea Pig Using You Only Look Once Version 9-Small and Raspberry Pi 5

by Jethro Ray P. Antiojo, John Patrick B. Bonilla and John Paul T. Cruz

Eng. Proc. 2026, 134(1), 59; https://doi.org/10.3390/engproc2026134059 - 17 Apr 2026

Viewed by 348

Abstract

We developed a real-time guinea pig breed classification system using You Only Look Once Version 9 (YOLOv9)-small, deployed on a Raspberry Pi 5 with Camera Module 3 and Hailo-8L acceleration module. The system targeted three breeds, Abyssinian, American, and Peruvian, using a dataset of 4500 images split into a 70:20:10 ratio for training, validation, and testing. After optimization for Hailo-8L, the model was tested on live samples, with hamsters included as an unknown class. A total of 600 frame blocks were extracted from the video input and analyzed using a multi-class confusion matrix. Results showed an 89% overall accuracy (94.67% for Abyssinian, 94.33% for American, 98.67% for Peruvian, and 90.33% for the unknown class). The results showed the feasibility of deploying YOLOv9-small on embedded devices for accurate and real-time animal classification.
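
The 70:20:10 split of 4500 images can be worked out with simple integer arithmetic. The sketch below is only an illustration; the helper name `split_counts` and the choice to give any rounding remainder to the first part are assumptions, not details from the paper.

```python
def split_counts(total, ratios):
    """Split `total` items by integer `ratios`; any rounding remainder goes to the first part."""
    weight = sum(ratios)
    counts = [total * r // weight for r in ratios]
    counts[0] += total - sum(counts)
    return counts


train, val, test = split_counts(4500, (70, 20, 10))
print(train, val, test)  # 3150 900 450
```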

(This article belongs to the Proceedings of The 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025))

12 pages, 796 KB

Open Access Proceeding Paper

Design of a Lightweight Video-Based Ear Biometric System on Raspberry Pi 5 Using You Only Look Once Version 12 and EfficientNet-4

by Kristian Emmanuel Padilla, Michael Robin Saculsan and John Paul Cruz

Eng. Proc. 2026, 134(1), 50; https://doi.org/10.3390/engproc2026134050 - 14 Apr 2026

Viewed by 517

Abstract

Recent advances in ear biometrics have yielded increasingly accurate detection and recognition methods, driven by the ear’s uniqueness and permanence as a non-invasive biometric modality. Nonetheless, several limitations persist, including computationally demanding models, inconsistent evaluation metrics, and portable systems restricted by manual capture and limited datasets. To address these challenges, we developed a lightweight, video-based ear biometric system implemented on the Raspberry Pi 5. The system integrates You Only Look Once Version 12 (YOLOv12) for ear detection, EfficientNet-4 for feature extraction, and k-Nearest Neighbors (k-NN) for recognition. Its hardware platform combines Raspberry Pi 5 with the Raspberry Pi AI Camera and AI HAT+. To train, fine-tune, and optimize YOLOv12 and EfficientNet-4, we used the Visual Geometry Group (VGG)Face-Ear dataset for training and the Unconstrained Ear Recognition Challenge 2019 dataset for validation, with k-NN employed for classification. The system is evaluated for classification accuracy and system-level performance. Thirteen participants, comprising 10 enrolled and three unenrolled subjects, participated in testing the system. The enrolled participants registered in the system were correctly identified, whereas unenrolled participants were excluded and rejected. The system achieved 92.31% accuracy, 95.45% precision, 96.97% recall, and an F1-score of 0.95, confirming the feasibility of deploying advanced ear biometric methods on embedded, resource-constrained devices.

(This article belongs to the Proceedings of The 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025))

27 pages, 7096 KB

Open Access Article

From Simulation to Reality: GAN-Based Transformation of Pavement Defect Images for YOLO Detection

by Jiangang Yang, Shukai Yu, Yuquan Yao, Shiji Cao and Xiaojuan Ai

Appl. Sci. 2026, 16(6), 2978; https://doi.org/10.3390/app16062978 - 19 Mar 2026

Viewed by 457

Abstract

The application of three-dimensional ground-penetrating radar (3D GPR) for intelligent pavement defect analysis is often constrained by the limited availability of labeled samples. To address this challenge, this study employed Ground Penetrating Radar Maxwell (GprMax) to simulate typical pavement defects, including cracks, loose materials, and interlayer debonding. A Cycle-Consistent Generative Adversarial Network (Cycle-GAN) was then introduced to perform style transfer on the simulated images, thereby reducing the domain gap between simulated and real radar images. Furthermore, four You Only Look Once (YOLO) models (YOLO version 5, YOLOX, YOLO version 7, and YOLO version 8) were systematically compared using real datasets to identify the best-performing model, which was subsequently used to evaluate the effect of different proportions of synthetic data on detection performance. The results demonstrated that the moderate inclusion of synthetic data improved the recognition accuracy of loose defects (from 76.7% to 78.9%), whereas its impact on crack and debonding detection was negative. Moreover, excessive reliance on synthetic data led to overfitting, thereby reducing the model’s generalization capability. Among the four models, YOLOv7 achieved the best overall performance, with a mean Average Precision (mAP) of 83.4% and a crack detection rate of 88.2%. This study thus provides a feasible technical pathway and model selection reference for automated GPR-based pavement defect identification, offering practical value for efficient and accurate road maintenance inspections.

(This article belongs to the Topic Service Safety and Green Maintenance Technology for Road Infrastructure in Complex Environments)

19 pages, 5757 KB

Open Access Article

A Progressive Hybrid Automatic Switching Visual Servoing Method for Apple-Picking Robots

by Jiangming Kan, Yue Wu, Ruifang Dong, Shun Yao, Xixuan Zhao, Tianji Zou, Boqi Kang and Junjie Li

Agriculture 2026, 16(5), 620; https://doi.org/10.3390/agriculture16050620 - 8 Mar 2026

Viewed by 859

Abstract

Position-Based Visual Servoing (PBVS) and Image-Based Visual Servoing (IBVS) struggle to balance end effector pose accuracy and robustness in apple picking. They are also prone to target loss and control singularities. A progressive Hybrid Automatic Switching Visual Servoing (HAVS) method is proposed and applied to an apple-picking robotic system. HAVS integrates PBVS and IBVS to coordinate control of the manipulator end effector pose. A depth-based switching function is designed. When target depth is below an optimal threshold, the controller switches to PBVS for precise final positioning. This reduces target loss and control singularities. An adaptive proportional-derivative (PD) controller with fuzzy gain scheduling updates the control gains online to enhance responsiveness and stability. The hardware consists of a six-axis manipulator, a depth camera, and a mobile base. You Only Look Once version 5 (YOLOv5) performs apple detection and generates control commands. Indoors, the success rate was 96%, which was 4 and 10 percentage points higher than PBVS only and IBVS only, respectively. The average picking time was 12.5 s, which was 0.3 s and 1.1 s shorter, respectively. Outdoors, the success rate was 87.5%, the average time was 13.2 s, and the damage rate was 4.2%. This method provides a reference implementation for visual servo control in agricultural picking robots.
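
The depth-based switching rule described in the abstract might be sketched as a simple threshold test. This is a simplified illustration: the function name `select_controller`, the 0.30 m default threshold, the hard (non-blended) switch, and the use of IBVS beyond the threshold are all assumptions rather than details from the paper.

```python
def select_controller(depth_m, threshold_m=0.30):
    """Return "PBVS" when the target is closer than the threshold, otherwise "IBVS".

    threshold_m is a hypothetical value; the abstract does not state the optimal threshold.
    """
    if depth_m < 0:
        raise ValueError("depth must be non-negative")
    return "PBVS" if depth_m < threshold_m else "IBVS"


for depth in (0.80, 0.45, 0.20):
    print(f"depth {depth:.2f} m -> {select_controller(depth)}")
```

A progressive scheme could also blend the two control laws near the threshold instead of switching abruptly; the abstract does not say which form the authors use.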

(This article belongs to the Special Issue Perception, Decision-Making, and Control of Agricultural Robots)

17 pages, 5135 KB

Open Access Article

UAV-Based Computer Vision Approach for Melon Fruit Detection and Yield Estimation

by Hassan Aldakn, Giovanna Dragonetti, Roula Khadra, Ahmed Ali Ayoub Abdelmoneim and Bilal Derardja

AgriEngineering 2026, 8(2), 53; https://doi.org/10.3390/agriengineering8020053 - 3 Feb 2026

Cited by 1 | Viewed by 1056

Abstract

Accurate and timely crop yield estimation remains a major challenge in agriculture due to the limitations of traditional field-based methods, which are labor-intensive, destructive, and unsuitable for large-scale applications. While recent advances in Unmanned Aerial Vehicles (UAVs) and Deep Learning (DL) have enabled non-destructive and scalable alternatives, melons (Cucumis melo L.) remain relatively understudied, and datasets for yield estimation are scarce. This study presents a computer vision pipeline for UAV-based fruit detection and yield estimation in melon crops. High-resolution UAV RGB imagery was processed using YOLOv12 (You Only Look Once, version 12) for fruit detection, followed by a volume-based regression model for weight estimation. The experiment was conducted during the May–August 2025 growing season in Apulia, southern Italy. The detection model achieved high accuracy, with strong agreement between estimated and actual fruit counts (R² = 0.99, MAPE = 5%). The regression model achieved an R² of 0.79 for individual weight estimation and a total yield error of 2.9%. By addressing the scarcity of melon-specific data, this work demonstrates that integrating UAV imagery with deep learning provides an effective and scalable approach for accurate yield estimation in melons.
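
For readers unfamiliar with the two agreement metrics quoted above, the sketch below computes the coefficient of determination (R²) and the mean absolute percentage error (MAPE) in plain Python. The function names and sample counts are hypothetical, and the paper may define R² differently (for example, as a squared correlation coefficient).

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical fruit counts per plot: ground truth vs. detector output.
counted = [100, 200, 150]
detected = [110, 180, 150]
print(round(r_squared(counted, detected), 3), round(mape(counted, detected), 2))
```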

20 pages, 4309 KB

Open Access Article

Targetless Radar–Camera Calibration via Trajectory Alignment

by Ozan Durmaz and Hakan Cevikalp

Sensors 2025, 25(24), 7574; https://doi.org/10.3390/s25247574 - 13 Dec 2025

Cited by 1 | Viewed by 1673

Abstract

Accurate extrinsic calibration between radar and camera sensors is essential for reliable multi-modal perception in robotics and autonomous navigation. Traditional calibration methods often rely on artificial targets such as checkerboards or corner reflectors, which can be impractical in dynamic or large-scale environments. This study presents a fully targetless calibration framework that estimates the rigid spatial transformation between radar and camera coordinate frames by aligning their observed trajectories of a moving object. The proposed method integrates You Only Look Once version 5 (YOLOv5)-based 3D object localization for the camera stream with Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Random Sample Consensus (RANSAC) filtering for sparse and noisy radar measurements. A passive temporal synchronization technique, based on Root Mean Square Error (RMSE) minimization, corrects timestamp offsets without requiring hardware triggers. Rigid transformation parameters are computed using Kabsch and Umeyama algorithms, ensuring robust alignment even under millimeter-wave (mmWave) radar sparsity and measurement bias. The framework is experimentally validated in an indoor OptiTrack-equipped laboratory using a Skydio 2 drone as the dynamic target. Results demonstrate sub-degree rotational accuracy and decimeter-level translational error (approximately 0.12–0.27 m depending on the metric), with successful generalization to unseen motion trajectories. The findings highlight the method’s applicability for real-world autonomous systems requiring practical, markerless multi-sensor calibration.
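
To give a feel for trajectory-based rigid alignment, the sketch below solves the planar (2D) least-squares problem in closed form. It is a simplified stand-in, not the authors' method: the Kabsch and Umeyama algorithms named above work in 3D via a singular value decomposition, and the function name `fit_rigid_2d` and the sample trajectory are hypothetical.

```python
import math


def fit_rigid_2d(source, target):
    """Least-squares rotation angle and translation mapping 2D source points onto target points."""
    n = len(source)
    sx = sum(p[0] for p in source) / n
    sy = sum(p[1] for p in source) / n
    tx = sum(q[0] for q in target) / n
    ty = sum(q[1] for q in target) / n
    dot = cross = 0.0
    for (px, py), (qx, qy) in zip(source, target):
        ax, ay = px - sx, py - sy          # centered source point
        bx, by = qx - tx, qy - ty          # centered target point
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx
    theta = math.atan2(cross, dot)         # optimal rotation angle
    c, s = math.cos(theta), math.sin(theta)
    # Translation that maps the rotated source centroid onto the target centroid.
    return theta, (tx - (c * sx - s * sy), ty - (s * sx + c * sy))


# Hypothetical trajectory seen by two sensors related by a 0.5 rad rotation and a shift.
radar_track = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 1.0)]
true_angle, true_shift = 0.5, (1.5, -2.0)
camera_track = [
    (math.cos(true_angle) * x - math.sin(true_angle) * y + true_shift[0],
     math.sin(true_angle) * x + math.cos(true_angle) * y + true_shift[1])
    for x, y in radar_track
]
theta, shift = fit_rigid_2d(radar_track, camera_track)
print(round(theta, 6), tuple(round(v, 6) for v in shift))
```

With noisy or mismatched points the recovered transform is only a least-squares fit, which is why the paper filters the radar measurements before alignment.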

(This article belongs to the Section Radar Sensors)

30 pages, 7833 KB

Open Access Article

Design of Fruit Harvesting Robot System Based on a Reachability and Inverse Reachability Map

by Jae-Woong Han, Jae-Hoon Cho and Yong-Tae Kim

AgriEngineering 2025, 7(12), 417; https://doi.org/10.3390/agriengineering7120417 - 4 Dec 2025

Viewed by 2545

Abstract

This paper proposes a fruit-harvesting robot system that improves harvesting efficiency by utilizing a Reachability Map (RM) and an Inverse Reachability Map (IRM). The proposed system accurately detects fruit locations using You Only Look Once version 5 (YOLOv5)–based object detection and camera calibration. Through coordinate transformation and hand–eye calibration, the manipulator is precisely guided to the fruit’s 3D position. During the construction of the reachability map, the reachability index, manipulability isotropy, and harvesting index are jointly considered to quantitatively evaluate manipulator performance. Fruits accessible by the manipulator are prioritized for harvesting. For fruits that cannot be directly reached, the system computes the optimal base pose using the inverse reachability map, enabling the mobile manipulator to reposition itself for harvesting. To further enhance efficiency, multiple fruits are grouped to minimize unnecessary movements. The integrated system is implemented on the Robot Operating System 2 (ROS 2), where fruit detection, autonomous navigation, and harvesting are executed as independent nodes to support scalable and modular operation. Finally, the proposed system is validated in a simulated orchard environment, confirming its effectiveness in improving autonomous fruit-harvesting performance.

(This article belongs to the Section Agricultural Mechanization and Machinery)

24 pages, 2375 KB

Open Access Article

Label-Efficient PCB Defect Detection with an ECA–DCN-Lite–BiFPN–CARAFE-Enhanced YOLOv5 and Single-Stage Semi-Supervision

by Zhenxia Wang, Nurulazlina Ramli and Tzer Hwai Gilbert Thio

Sensors 2025, 25(23), 7283; https://doi.org/10.3390/s25237283 - 29 Nov 2025

Cited by 1 | Viewed by 1114

Abstract

Printed circuit board (PCB) defect detection is critical to manufacturing quality, yet tiny, low-contrast defects and limited annotations challenge conventional systems. This study develops an ECA–DCN-lite–BiFPN–CARAFE-enhanced YOLOv5 detector by modifying You Only Look Once (YOLO) version 5 (YOLOv5) with Efficient Channel Attention (ECA) for channel re-weighting, a lightweight Deformable Convolution (DCN-lite) for geometric adaptability, a Bi-Directional Feature Pyramid Network (BiFPN) for multi-scale fusion, and Content-Aware ReAssembly of FEatures (CARAFE) for content-aware upsampling. A single-cycle semi-supervised training pipeline is further introduced: a detector trained on labeled images generates high-confidence pseudo-labels for unlabeled data, and the combined set is used for retraining without ratio heuristics. Evaluated on PKU-PCB under label-scarce regimes, the full model improves supervised mean Average Precision at an Intersection-over-Union threshold of 0.5 (mAP@0.5) from 0.870 (baseline) to 0.910, and reaches 0.943 mAP@0.5 with semi-supervision, with consistent class-wise gains and faster convergence. Ablation experiments validate the contribution of each module and identify robust pseudo-label thresholds, while comparisons with recent YOLO variants show favorable accuracy–efficiency trade-offs. These findings indicate that the proposed design delivers accurate, label-efficient PCB inspection suitable for Automated Optical Inspection (AOI) in production environments. This work supports SDG 9 by enhancing intelligent manufacturing systems through reliable, high-precision AI-driven PCB inspection.
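
The Intersection-over-Union threshold behind mAP@0.5 can be illustrated with a few lines of Python. The sketch below assumes axis-aligned boxes given as (x_min, y_min, x_max, y_max); the function names `iou` and `is_match` are hypothetical and are not part of any detector's API.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x_min, y_min, x_max, y_max)."""
    overlap_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    overlap_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    intersection = overlap_w * overlap_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def is_match(predicted, ground_truth, threshold=0.5):
    """A prediction counts as a true positive at mAP@0.5 when IoU >= 0.5."""
    return iou(predicted, ground_truth) >= threshold


print(iou((0, 0, 2, 2), (1, 0, 3, 2)))       # overlap 2, union 6
print(is_match((0, 0, 2, 2), (0, 0, 2, 2)))  # identical boxes
```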

(This article belongs to the Section Sensing and Imaging)

24 pages, 9586 KB

Open Access Article

Optimized Recognition Algorithm for Remotely Sensed Sea Ice in Polar Ship Path Planning

by Li Zhou, Runxin Xu, Jiayi Bian, Shifeng Ding, Sen Han and Roger Skjetne

Remote Sens. 2025, 17(19), 3359; https://doi.org/10.3390/rs17193359 - 4 Oct 2025

Cited by 1 | Viewed by 1245

Abstract

Collisions between ships and sea ice pose a significant threat to maritime safety, making it essential to detect sea ice and perform safety-oriented path planning for polar navigation. This paper uses an optimized You Only Look Once version 5 (YOLOv5) model, designated as YOLOv5-ICE, to detect sea ice in satellite imagery; the detections supply obstacle coordinates to a ship path planning system. The enhancements include the Squeeze-and-Excitation (SE) attention mechanism, improved spatial pyramid pooling, and the Flexible ReLU (FReLU) activation function. The improved YOLOv5-ICE shows enhanced performance, with its mAP increasing by 3.5% compared to the baseline YOLOv5 and by 1.3% compared to YOLOv8. YOLOv5-ICE demonstrates robust performance in detecting small sea ice targets within large-scale satellite images and excels in high ice concentration regions. For path planning, the Any-Angle Path Planning on Grids algorithm is applied to simulate routes based on detected sea ice floes. The objective function incorporates the path length, number of ship turns, and sea ice risk value, enabling path planning under varying ice concentrations. By integrating detection and path planning, this work proposes a novel method to enhance navigational safety in polar regions.
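
One plausible shape for an objective that combines path length, number of turns, and ice risk is a weighted sum, sketched below. The abstract does not give the actual functional form, so the weights, the `risk_at` callback, and the function name `path_cost` are all hypothetical.

```python
import math


def path_cost(waypoints, risk_at, w_length=1.0, w_turn=1.0, w_risk=1.0):
    """Weighted sum of path length, heading changes, and accumulated risk (assumed form)."""
    length = sum(math.dist(a, b) for a, b in zip(waypoints, waypoints[1:]))
    turns = 0
    for a, b, c in zip(waypoints, waypoints[1:], waypoints[2:]):
        v1 = (b[0] - a[0], b[1] - a[1])
        v2 = (c[0] - b[0], c[1] - b[1])
        cross = v1[0] * v2[1] - v1[1] * v2[0]
        dot = v1[0] * v2[0] + v1[1] * v2[1]
        # Any change of heading, including a full reversal, counts as one turn.
        if abs(cross) > 1e-9 or dot < 0:
            turns += 1
    risk = sum(risk_at(p) for p in waypoints)
    return w_length * length + w_turn * turns + w_risk * risk


# Hypothetical route with one turn and a zero-risk field.
route = [(0, 0), (3, 0), (3, 4)]
print(path_cost(route, risk_at=lambda point: 0.0))  # 7 length + 1 turn = 8.0
```

In practice the risk term would be derived from the detected ice floes, and the weights would be tuned to the ice concentration.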

(This article belongs to the Special Issue Remote Sensing of River and Lake Ice/Water Using Spaceborne, Airborne, and Ground Platforms)

26 pages, 6234 KB

Open Access Article

Automated Identification and Spatial Pattern Analysis of Urban Slow-Moving Traffic Bottlenecks Using Street View Imagery and Deep Learning

by Zixuan Guo, Hong Xu and Qiushuang Lin

ISPRS Int. J. Geo-Inf. 2025, 14(9), 351; https://doi.org/10.3390/ijgi14090351 - 15 Sep 2025

Cited by 1 | Viewed by 2080

Abstract

With rapid urbanization and increasing emphasis on sustainable mobility, slow-moving traffic systems, including pedestrian and cycling infrastructure, have become critical to urban transportation and quality of life. Conventional assessment methods are labor-intensive, time-consuming, and limited in coverage. Leveraging advances in deep learning and computer vision, this study develops a framework for bottleneck detection using street-level imagery and the You Only Look Once version 5 (YOLOv5) model. An evaluation system comprising 15 indicators across continuity, safety, and comfort is established. In a case study of Wuhan’s Third Ring Road, the YOLOv5 model achieved 98.9% mean Average Precision (mAP)@0.5, while spatial hotspot analysis (p < 0.05) identified severe demand–infrastructure mismatches in southeastern Wuhan, contrasted with fewer problems in the northern region due to stronger management. To ensure adaptability, a dynamic optimization mechanism integrating temporal imagery updates, transfer learning, and collaborative training is proposed. The findings demonstrate the effectiveness of street-level remote sensing for large-scale urban diagnostics, extend the application of deep learning in mobility research, and provide practical insights for data-driven planning and governance of slow-moving traffic systems in high-density cities.

24 pages, 26968 KB

Open Access Article

Using a High-Precision YOLO Surveillance System for Gun Detection to Prevent Mass Shootings

by Jonathan Hsueh and Chao-Tung Yang

AI 2025, 6(9), 198; https://doi.org/10.3390/ai6090198 - 22 Aug 2025

Cited by 1 | Viewed by 5944

Abstract

Mass shootings are a loosely defined form of violent crime, typically involving four or more casualties by firearm, and have become increasingly frequent. Organized and speedy responses from police are necessary to mitigate harm and neutralize the perpetrator. Recent, widely publicized police responses to mass shooting events have been criticized by the media, government, and public. With the advancements in artificial intelligence, specifically single-shot detection (SSD) models, computer programs can detect harmful weapons within efficient time frames. We utilized YOLO (You Only Look Once), an SSD with a Convolutional Neural Network, and used versions 5, 7, 8, 9, 10, and 11 to develop our detection system. For our data, we used a Roboflow dataset that contained almost 17,000 images of real-life handgun scenarios, designed to skew towards positive instances. We trained each model on our dataset and varied the hyperparameters, conducting a randomized trial. Finally, we evaluated the performance based on precision metrics. Using a Python-based design, we tested our model’s capabilities for surveillance functions. Our experimental results showed that our best-performing model was YOLOv10s, with an mAP-50 (mean average precision 50) of 98.2% on our dataset. Our model showed potential in edge computing settings.

19 pages, 4394 KB

Open Access Article

Research on Optimized YOLOv5s Algorithm for Detecting Aircraft Landing Runway Markings

by Wei Huang, Hongrui Guo, Xiangquan Li, Xi Tan and Bo Liu

Processes 2025, 13(8), 2572; https://doi.org/10.3390/pr13082572 - 14 Aug 2025

Viewed by 957

Abstract

During traditional aircraft landings, pilots face significant challenges in identifying runway numbers with the naked eye, particularly at decision height under adverse weather conditions. To address this issue, this study proposes a novel detection algorithm based on an optimized version of the YOLOv5s model (You Only Look Once, version 5) for recognizing runway markings during civil aircraft landings. By integrating a data augmentation strategy with external datasets, the method effectively reduces both false detections and missed targets through expanded feature representation. An Alpha Complete Intersection over Union (CIOU) Loss function is introduced in place of the original CIOU Loss function, offering improved gradient optimization. Additionally, the model incorporates several advanced modules and techniques: a Convolutional Block Attention Module (CBAM), Soft Non-Maximum Suppression (Soft-NMS), cosine annealing learning rate scheduling, the FReLU activation function, and deformable convolutions in the backbone and neck of the YOLOv5 architecture. To further enhance detection, a specialized small-target detection layer is added to the head of the network and the resolution of feature maps is improved. These enhancements enable better feature extraction and more accurate identification of smaller targets. As a result, the optimized model shows significantly improved recall (R) and precision (P). Experimental results, visualized using custom-developed software, demonstrate that the proposed optimized YOLOv5s model achieved increases of 5.66% in P, 2.99% in R, and 2.74% in mean average precision (mAP) compared to the baseline model. This study provides valuable data and a theoretical foundation to support the accurate visual identification of runway numbers and other reference markings during aircraft landings.
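
Cosine annealing, one of the techniques listed above, lowers the learning rate along half a cosine wave. The sketch below uses the standard textbook formula; the function name, the step counts, and the learning-rate values are hypothetical, and the paper's exact schedule (for example, with warm-up or restarts) may differ.

```python
import math


def cosine_annealing_lr(step, total_steps, lr_max, lr_min=0.0):
    """Learning rate at `step`, decaying from lr_max to lr_min along a half cosine."""
    progress = min(max(step, 0), total_steps) / total_steps
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + math.cos(math.pi * progress))


# Hypothetical schedule: 100 steps from 0.01 down to 0.0001.
for step in (0, 25, 50, 75, 100):
    print(step, round(cosine_annealing_lr(step, 100, 0.01, 0.0001), 6))
```

The rate falls slowly at first, fastest around the midpoint, and slowly again near the end, which tends to give gentler final convergence than a step schedule.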

(This article belongs to the Special Issue Modelling and Optimizing Process in Industry 4.0)

17 pages, 2115 KB

Open Access Article

Surface Defect Detection of Magnetic Tiles Based on YOLOv8-AHF

by Cheng Ma, Yurong Pan and Junfu Chen

Electronics 2025, 14(14), 2857; https://doi.org/10.3390/electronics14142857 - 17 Jul 2025

Cited by 3 | Viewed by 1952

Abstract

Magnetic tiles are an important component of permanent magnet motors, and the quality of magnetic tiles directly affects the performance and service life of a motor. It is necessary to perform defect detection on magnetic tiles in industrial production and remove those with defects. The YOLOv8-AHF algorithm is proposed to improve the network's ability to extract feature information, to address missed or poor detections in surface defect detection caused by the small volume of permanent magnet motor tiles, and to reduce the deviation between the predicted box and the true box. First, a hybrid module combining atrous convolution and depthwise separable convolution (ADConv) is introduced in the backbone of the model to capture global and local features in magnetic tile detection images. In the neck section, a hybrid attention module (HAM) is introduced to focus on the regions of interest in the magnetic tile surface defect images, which improves the ability of information transmission and fusion. The Focal-Enhanced Intersection over Union loss function (Focal-EIoU) is optimized to improve localization. We conducted comparative experiments, ablation experiments, and corresponding generalization experiments on the magnetic tile surface defect dataset. The experimental results show that the evaluation metrics of YOLOv8-AHF surpass mainstream single-stage object detection algorithms. Compared to the You Only Look Once version 8 (YOLOv8) algorithm, the performance of the YOLOv8-AHF algorithm was improved by 5.9%, 4.1%, 5%, 5%, and 5.8% in terms of mAP@0.5, mAP@0.5:0.95, F1-Score, precision, and recall, respectively. This algorithm achieved significant performance improvement in the task of detecting surface defects on magnetic tiles.

Search Results (91)
