MDPI - Publisher of Open Access Journals

21 pages, 1192 KiB

Open AccessArticle

Video Stabilization Algorithm Based on View Boundary Synthesis

by Wenchao Shan, Hejing Zhao, Xin Li, Qian Huang, Chuanxu Jiang, Yiming Wang, Ziqi Chen and Yao Tong

Symmetry 2025, 17(8), 1351; https://doi.org/10.3390/sym17081351 - 19 Aug 2025

Abstract

Video stabilization is a critical technology for enhancing visual content quality in dynamic shooting scenarios, especially with the widespread adoption of mobile photography devices and Unmanned Aerial Vehicle (UAV) platforms. While traditional digital stabilization algorithms can improve frame stability by modeling global motion [...] Read more.

Video stabilization is a critical technology for enhancing visual content quality in dynamic shooting scenarios, especially with the widespread adoption of mobile photography devices and Unmanned Aerial Vehicle (UAV) platforms. While traditional digital stabilization algorithms can improve frame stability by modeling global motion trajectories, they often suffer from excessive cropping or boundary distortion, leading to a significant loss of valid image regions. To address this persistent challenge, we propose the View Out-boundary Synthesis Algorithm (VOSA), a symmetry-aware spatio-temporal consistency framework. By leveraging rotational and translational symmetry principles in motion dynamics, VOSA realizes optical flow field extrapolation through an encoder–decoder architecture and an iterative boundary extension strategy. Experimental results demonstrate that VOSA enhances conventional stabilization by increasing content retention by 6.3% while maintaining a 0.943 distortion score, outperforming mainstream methods in dynamic environments. The symmetry-informed design resolves stability–content conflicts and outperforms mainstream methods in dynamic environments, establishing a new paradigm for full-frame stabilization. Full article

(This article belongs to the Special Issue Symmetry/Asymmetry in Image Processing and Computer Vision)

► Show Figures

Figure 1

28 pages, 9030 KiB

Open AccessArticle

UAV Path Planning via Semantic Segmentation of 3D Reality Mesh Models

by Xiaoxinxi Zhang, Zheng Ji, Lingfeng Chen and Yang Lyu

Drones 2025, 9(8), 578; https://doi.org/10.3390/drones9080578 - 14 Aug 2025

Viewed by 340

Abstract

Traditional unmanned aerial vehicle (UAV) path planning methods for image-based 3D reconstruction often rely solely on geometric information from initial models, resulting in redundant data acquisition in non-architectural areas. This paper proposes a UAV path planning method via semantic segmentation of 3D reality [...] Read more.

Traditional unmanned aerial vehicle (UAV) path planning methods for image-based 3D reconstruction often rely solely on geometric information from initial models, resulting in redundant data acquisition in non-architectural areas. This paper proposes a UAV path planning method via semantic segmentation of 3D reality mesh models to enhance efficiency and accuracy in complex scenarios. The scene is segmented into buildings, vegetation, ground, and water bodies. Lightweight polygonal surfaces are extracted for buildings, while planar segments in non-building regions are fitted and projected into simplified polygonal patches. These photography targets are further decomposed into point, line, and surface primitives. A multi-resolution image acquisition strategy is adopted, featuring high-resolution coverage for buildings and rapid scanning for non-building areas. To ensure flight safety, a Digital Surface Model (DSM)-based shell model is utilized for obstacle avoidance, and sky-view-based Real-Time Kinematic (RTK) signal evaluation is applied to guide viewpoint optimization. Finally, a complete weighted graph is constructed, and ant colony optimization is employed to generate a low-energy-cost flight path. Experimental results demonstrate that, compared with traditional oblique photogrammetry, the proposed method achieves higher reconstruction quality. Compared with the commercial software Metashape, it reduces the number of images by 30.5% and energy consumption by 37.7%, while significantly improving reconstruction results in both architectural and non-architectural areas. Full article

► Show Figures

Figure 1

20 pages, 1971 KiB

Open AccessArticle

FFG-YOLO: Improved YOLOv8 for Target Detection of Lightweight Unmanned Aerial Vehicles

by Tongxu Wang, Sizhe Yang, Ming Wan and Yanqiu Liu

Appl. Syst. Innov. 2025, 8(4), 109; https://doi.org/10.3390/asi8040109 - 4 Aug 2025

Viewed by 681

Abstract

Target detection is essential in intelligent transportation and autonomous control of unmanned aerial vehicles (UAVs), with single-stage detection algorithms used widely due to their speed. However, these algorithms face limitations in detecting small targets, especially in aerial photography from unmanned aerial vehicles (UAVs), [...] Read more.

Target detection is essential in intelligent transportation and autonomous control of unmanned aerial vehicles (UAVs), with single-stage detection algorithms used widely due to their speed. However, these algorithms face limitations in detecting small targets, especially in aerial photography from unmanned aerial vehicles (UAVs), where small targets are often occluded, multi-scale semantic information is easily lost, and there is a trade-off between real-time processing and computational resources. Existing algorithms struggle to effectively extract multi-dimensional features and deep semantic information from images and to balance detection accuracy with model complexity. To address these limitations, we developed FFG-YOLO, a lightweight small-target detection method for UAVs based on YOLOv8. FFG-YOLO incorporates three modules: a feature enhancement block (FEB), a feature concat block (FCB), and a global context awareness block (GCAB). These modules strengthen feature extraction from small targets, resolve semantic bias in multi-scale feature fusion, and help differentiate small targets from complex backgrounds. We also improved the positioning accuracy of small targets using the Wasserstein distance loss function. Experiments showed that FFG-YOLO outperformed other algorithms, including YOLOv8n, in small-target detection due to its lightweight nature, meeting the stringent real-time performance and deployment requirements of UAVs. Full article

(This article belongs to the Topic Application of IoT on Manufacturing, Communication and Engineering)

► Show Figures

Figure 1

24 pages, 12286 KiB

Open AccessArticle

A UAV-Based Multi-Scenario RGB-Thermal Dataset and Fusion Model for Enhanced Forest Fire Detection

by Yalin Zhang, Xue Rui and Weiguo Song

Remote Sens. 2025, 17(15), 2593; https://doi.org/10.3390/rs17152593 - 25 Jul 2025

Viewed by 778

Abstract

UAVs are essential for forest fire detection due to vast forest areas and inaccessibility of high-risk zones, enabling rapid long-range inspection and detailed close-range surveillance. However, aerial photography faces challenges like multi-scale target recognition and complex scenario adaptation (e.g., deformation, occlusion, lighting variations). [...] Read more.

UAVs are essential for forest fire detection due to vast forest areas and inaccessibility of high-risk zones, enabling rapid long-range inspection and detailed close-range surveillance. However, aerial photography faces challenges like multi-scale target recognition and complex scenario adaptation (e.g., deformation, occlusion, lighting variations). RGB-Thermal fusion methods integrate visible-light texture and thermal infrared temperature features effectively, but current approaches are constrained by limited datasets and insufficient exploitation of cross-modal complementary information, ignoring cross-level feature interaction. A time-synchronized multi-scene, multi-angle aerial RGB-Thermal dataset (RGBT-3M) with “Smoke–Fire–Person” annotations and modal alignment via the M-RIFT method was constructed as a way to address the problem of data scarcity in wildfire scenarios. Finally, we propose a CP-YOLOv11-MF fusion detection model based on the advanced YOLOv11 framework, which can learn heterogeneous features complementary to each modality in a progressive manner. Experimental validation proves the superiority of our method, with a precision of 92.5%, a recall of 93.5%, a mAP50 of 96.3%, and a mAP50-95 of 62.9%. The model’s RGB-Thermal fusion capability enhances early fire detection, offering a benchmark dataset and methodological advancement for intelligent forest conservation, with implications for AI-driven ecological protection. Full article

(This article belongs to the Special Issue Advances in Spectral Imagery and Methods for Fire and Smoke Detection)

► Show Figures

Figure 1

23 pages, 13739 KiB

Open AccessArticle

Traffic Accident Rescue Action Recognition Method Based on Real-Time UAV Video

by Bo Yang, Jianan Lu, Tao Liu, Bixing Zhang, Chen Geng, Yan Tian and Siyu Zhang

Drones 2025, 9(8), 519; https://doi.org/10.3390/drones9080519 - 24 Jul 2025

Viewed by 523

Abstract

Low-altitude drones, which are unimpeded by traffic congestion or urban terrain, have become a critical asset in emergency rescue missions. To address the current lack of emergency rescue data, UAV aerial videos were collected to create an experimental dataset for action classification and [...] Read more.

Low-altitude drones, which are unimpeded by traffic congestion or urban terrain, have become a critical asset in emergency rescue missions. To address the current lack of emergency rescue data, UAV aerial videos were collected to create an experimental dataset for action classification and localization annotation. A total of 5082 keyframes were labeled with 1–5 targets each, and 14,412 instances of data were prepared (including flight altitude and camera angles) for action classification and position annotation. To mitigate the challenges posed by high-resolution drone footage with excessive redundant information, we propose the SlowFast-Traffic (SF-T) framework, a spatio-temporal sequence-based algorithm for recognizing traffic accident rescue actions. For more efficient extraction of target–background correlation features, we introduce the Actor-Centric Relation Network (ACRN) module, which employs temporal max pooling to enhance the time-dimensional features of static backgrounds, significantly reducing redundancy-induced interference. Additionally, smaller ROI feature map outputs are adopted to boost computational speed. To tackle class imbalance in incident samples, we integrate a Class-Balanced Focal Loss (CB-Focal Loss) function, effectively resolving rare-action recognition in specific rescue scenarios. We replace the original Faster R-CNN with YOLOX-s to improve the target detection rate. On our proposed dataset, the SF-T model achieves a mean average precision (mAP) of 83.9%, which is 8.5% higher than that of the standard SlowFast architecture while maintaining a processing speed of 34.9 tasks/s. Both accuracy-related metrics and computational efficiency are substantially improved. The proposed method demonstrates strong robustness and real-time analysis capabilities for modern traffic rescue action recognition. Full article

(This article belongs to the Special Issue Cooperative Perception for Modern Transportation)

► Show Figures

Figure 1

16 pages, 3620 KiB

Open AccessArticle

Wind Tunnel Experimental Study on Dynamic Coupling Characteristics of Flexible Refueling Hose–Drogue System

by Yinzhu Wang, Jiangtao Huang, Qisheng Chen, Enguang Shan and Yufeng Guo

Aerospace 2025, 12(7), 646; https://doi.org/10.3390/aerospace12070646 - 21 Jul 2025

Viewed by 215

Abstract

During the process of flexible aerial refueling, the flexible structure of the hose drogue assembly is affected by internal and external interference, such as docking maneuvering, deformation of the hose, attitude changes, and body vibrations, causing the hose to swing and the whipping [...] Read more.

During the process of flexible aerial refueling, the flexible structure of the hose drogue assembly is affected by internal and external interference, such as docking maneuvering, deformation of the hose, attitude changes, and body vibrations, causing the hose to swing and the whipping phenomenon, which greatly limits the success rate and safety of aerial refueling operations. Based on a 2.4 m transonic wind tunnel, high-speed wind tunnel test technology of a flexible aerial refueling hose–drogue system was established to carry out experimental research on the coupling characteristics of aerodynamics and multi-body dynamics. Based on the aid of Videogrammetry Model Deformation (VMD), high-speed photography, dynamic balance, and other wind tunnel test technologies, the dynamic characteristics of the hose–drogue system in a high-speed airflow and during the approach of the receiver are obtained. Adopting flexible multi-body dynamics, a dynamic system of the tanker, hose, drogue, and receiver is modeled. The cable/beam model is based on an arbitrary Lagrange–Euler method, and the absolute node coordinate method is used to describe the deformation, movement, and length variation in the hose during both winding and unwinding. The aerodynamic forces of the tanker, receiver, hose, and drogue are modeled, reflecting the coupling influence of movement of the tanker and receiver, the deformation of the hose and drogue, and the aerodynamic forces on each other. The tests show that during the approach of the receiver (distance from 1000 mm to 20 mm), the sinking amount of the drogue increases by 31 mm; due to the offset of the receiver probe, the drogue moves sideways from the symmetric plane of the receiver. Meanwhile, the oscillation magnitude of the drogue increases (from 33 to 48 and from 48 to 80 in spanwise and longitudinal directions, respectively). The simulation results show that the shear force induced by the oscillation of the hose and the propagation velocity of both the longitudinal and shear waves are affected by the hose stiffness and Mach number. The results presented in this work can be of great reference to further increase the safety of aerial refueling. Full article

(This article belongs to the Section Aeronautics)

► Show Figures

Figure 1

21 pages, 3834 KiB

Open AccessArticle

Rural Landscape Transformation and the Adaptive Reuse of Historical Agricultural Constructions in Bagheria (Sicily): A GIS-Based Approach to Territorial Planning and Representation

by Santo Orlando, Pietro Catania, Carlo Greco, Massimo Vincenzo Ferro, Mariangela Vallone and Giacomo Scarascia Mugnozza

Sustainability 2025, 17(14), 6291; https://doi.org/10.3390/su17146291 - 9 Jul 2025

Viewed by 493

Abstract

Bagheria, located on the northern coast of Sicily, is home to one of the Mediterranean’s most remarkable ensembles of Baroque villas, constructed between the 17th and 18th centuries by the aristocracy of Palermo. Originally situated within a highly structured rural landscape of citrus [...] Read more.

Bagheria, located on the northern coast of Sicily, is home to one of the Mediterranean’s most remarkable ensembles of Baroque villas, constructed between the 17th and 18th centuries by the aristocracy of Palermo. Originally situated within a highly structured rural landscape of citrus groves, gardens, and visual axes, these monumental residences have undergone substantial degradation due to uncontrolled urban expansion throughout the 20th century. This study presents a diachronic spatial analysis of Bagheria’s territorial transformation from 1850 to 2018, integrating historical cartography, aerial photography, satellite imagery, and Geographic Information System (GIS) tools. A total of 33 villas were identified, georeferenced, and assessed based on their spatial integrity, architectural condition, and relationship with the evolving urban fabric. The results reveal a progressive marginalization of the villa system, with many heritage assets now embedded within dense residential development, severed from their original landscape context and deprived of their formal gardens and visual prominence. Comparative insights drawn from analogous Mediterranean heritage landscapes, such as Ortigia (Siracusa), the Appian Way (Rome), and Athens, highlight the urgency of adopting integrated conservation frameworks that reconcile urban development with cultural and ecological continuity. As a strategic response, the study proposes the creation of a thematic cultural route, La città delle ville, to enhance the visibility, accessibility, and socio-economic relevance of Bagheria’s heritage system. This initiative, supported by adaptive reuse policies, smart heritage technologies, and participatory planning, offers a replicable model for sustainable territorial regeneration and heritage-led urban resilience. Full article

► Show Figures

Figure 1

26 pages, 23518 KiB

Open AccessArticle

Avalanche Hazard Dynamics and Causal Analysis Along China’s G219 Corridor: A Case Study of the Wenquan–Khorgas Section

by Xuekai Wang, Jie Liu, Qiang Guo, Bin Wang, Zhiwei Yang, Qiulian Cheng and Haiwei Xie

Atmosphere 2025, 16(7), 817; https://doi.org/10.3390/atmos16070817 - 4 Jul 2025

Viewed by 407

Abstract

Investigating avalanche hazards is a fundamental preliminary task in avalanche research. This work is critically important for establishing avalanche warning systems and designing mitigation measures. Primary research data originated from field investigations and UAV aerial surveys, with avalanche counts and timing identified through [...] Read more.

Investigating avalanche hazards is a fundamental preliminary task in avalanche research. This work is critically important for establishing avalanche warning systems and designing mitigation measures. Primary research data originated from field investigations and UAV aerial surveys, with avalanche counts and timing identified through image interpretation. Snowpack properties were primarily acquired via in situ field testing within the study area. Methodologically, statistical modeling and RAMMS::AVALANCHE simulations revealed spatiotemporal and dynamic characteristics of avalanches. Subsequent application of the Certainty Factor (CF) model and sensitivity analysis determined dominant controlling factors and quantified zonal influence intensity for each parameter. This study, utilizing field reconnaissance and drone aerial photography, identified 86 avalanche points in the study area. We used field tests and weather data to run the RAMMS::AVALANCHE model. Then, we categorized and summarized regional avalanche characteristics using both field surveys and simulation results. Furthermore, the Certainty Factor Model (CFM) and the parameter Sensitivity Index (Sa) were applied to assess the influence of elevation, slope gradient, aspect, and maximum snow depth on the severity of avalanche disasters. The results indicate the following: (1) Avalanches exhibit pronounced spatiotemporal concentration: temporally, they cluster between February and March and during 13:00–18:00 daily; spatially, they concentrate within the 2100–3000 m elevation zone. Chute-confined avalanches dominate the region, comprising 73.26% of total events; most chute-confined avalanches feature multiple release areas; therefore the number of release areas exceeds avalanche points; in terms of scale, medium-to-large-scale avalanches dominate, accounting for 86.5% of total avalanches. (2) RAMMS::AVALANCHE simulations yielded the following maximum values for the region: flow height = 15.43 m, flow velocity = 47.6 m/s, flow pressure = 679.79 kPa, and deposition height = 10.3 m. Compared to chute-confined avalanches, unconfined slope avalanches exhibit higher flow velocities and pressures, posing greater hazard potential. (3) The Certainty Factor Model and Sensitivity Index identify elevation, slope gradient, and maximum snow depth as the key drivers of avalanches in the study area. Their relative impact ranks as follows: maximum snow depth > elevation > slope gradient > aspect. The sensitivity index values were 1.536, 1.476, 1.362, and 0.996, respectively. The findings of this study provide a scientific basis for further research on avalanche hazards, the development of avalanche warning systems, and the design of avalanche mitigation projects in the study area. Full article

(This article belongs to the Special Issue Climate Change in the Cryosphere and Its Impacts)

► Show Figures

Figure 1

18 pages, 6847 KiB

Open AccessArticle

Numerical Simulation of Slope Excavation and Stability Under Earthquakes in Cataclastic Loose Rock Mass of Hydropower Station on Lancang River

by Wenjing Liu, Hui Deng and Shuo Tian

Appl. Sci. 2025, 15(13), 7480; https://doi.org/10.3390/app15137480 - 3 Jul 2025

Viewed by 469

Abstract

This study investigates the excavation of the cataclastic loose rock slope at the mixing plant on the right bank of the BDa Hydropower Station, which is situated in the upper reaches of Lancang River. The dominant structural plane of the cataclastic loose rock [...] Read more.

This study investigates the excavation of the cataclastic loose rock slope at the mixing plant on the right bank of the BDa Hydropower Station, which is situated in the upper reaches of Lancang River. The dominant structural plane of the cataclastic loose rock mass was obtained using unmanned aerial vehicle tilt photography and 3D point cloud technology. The actual 3D numerical model of the study area was developed using the 3DEC discrete element numerical simulation software. The excavation response characteristics and overall stability of the cataclastic loose rock slope were analyzed. The support effect was evaluated considering the preliminary shaft micropile and Macintosh reinforced mat as slope support measures, and the stability was assessed by applying seismic waves. The results showed the main deformation and failure area after slope cleaning excavation at the junction of the cataclastic loose rock mass and Q^edl deposits in the shallow surface of the excavation face. Moreover, the maximum total displacement could reach 18.3 cm. Subsequently, the overall displacement of the slope was significantly reduced, and the maximum total displacement decreased to 2.78 cm. The support effect was significant. Under an earthquake load, the slope with support exhibited considerable displacement in the shallow surface of the excavation slope, with collapse deformation primarily occurring through shear failure. Full article

(This article belongs to the Section Civil Engineering)

► Show Figures

Figure 1

29 pages, 18908 KiB

Open AccessArticle

Toward Efficient UAV-Based Small Object Detection: A Lightweight Network with Enhanced Feature Fusion

by Xingyu Di, Kangning Cui and Rui-Feng Wang

Remote Sens. 2025, 17(13), 2235; https://doi.org/10.3390/rs17132235 - 29 Jun 2025

Cited by 4 | Viewed by 787

Abstract

UAV-based small target detection is crucial in environmental monitoring, circuit detection, and related applications. However, UAV images often face challenges such as significant scale variation, dense small targets, high inter-class similarity, and intra-class diversity, which can lead to missed detections, thus reducing performance. [...] Read more.

UAV-based small target detection is crucial in environmental monitoring, circuit detection, and related applications. However, UAV images often face challenges such as significant scale variation, dense small targets, high inter-class similarity, and intra-class diversity, which can lead to missed detections, thus reducing performance. To solve these problems, this study proposes a lightweight and high-precision model UAV-YOLO based on YOLOv8s. Firstly, a double separation convolution (DSC) module is designed to replace the Bottleneck structure in the C2f module with deep separable convolution and point-by-point convolution fusion, which can reduce the model parameters and calculation complexity while enhancing feature expression. Secondly, a new SPPL module is proposed, which combines spatial pyramid pooling rapid fusion (SPPF) with long-distance dependency modeling (LSKA) to improve the robustness of the model to multi-scale targets through cross-level feature association. Then, DyHead is used to replace the original detector head, and the discrimination ability of small targets in complex background is enhanced by adaptive weight allocation and cross-scale feature optimization fusion. Finally, the WIPIoU loss function is proposed, which integrates the advantages of Wise-IoU, MPDIoU and Inner-IoU, and incorporates the geometric center of bounding box, aspect ratio and overlap degree into a unified measure to improve the localization accuracy of small targets and accelerate the convergence. The experimental results on the VisDrone2019 dataset showed that compared to YOLOv8s, UAV-YOLO achieved an 8.9% improvement in the recall of mAP@0.5 and 6.8%, while the parameters and calculations were reduced by 23.4% and 40.7%, respectively. Additional evaluations of the DIOR, RSOD, and NWPU VHR-10 datasets demonstrate the generalization capability of the model. Full article

(This article belongs to the Special Issue Geospatial Intelligence in Remote Sensing)

► Show Figures

Figure 1

29 pages, 7229 KiB

Open AccessArticle

The Non-Destructive Testing of Architectural Heritage Surfaces via Machine Learning: A Case Study of Flat Tiles in the Jiangnan Region

by Haina Song, Yile Chen and Liang Zheng

Coatings 2025, 15(7), 761; https://doi.org/10.3390/coatings15070761 - 27 Jun 2025

Viewed by 660

Abstract

This study focuses on the ancient buildings in Cicheng Old Town, a typical architectural heritage area in the Jiangnan region of China. These buildings are famous for their well-preserved Tang Dynasty urban layout and Ming and Qing Dynasty roof tiles. However, the natural [...] Read more.

This study focuses on the ancient buildings in Cicheng Old Town, a typical architectural heritage area in the Jiangnan region of China. These buildings are famous for their well-preserved Tang Dynasty urban layout and Ming and Qing Dynasty roof tiles. However, the natural aging, weathering, and biological erosion of the roof tiles seriously threaten the integrity of heritage protection. Given that current detection methods mostly depend on manual checks, which are slow and cover only a small area, this study suggests using deep learning technology for heritage protection and creating a smart model to identify damage in flat tiles using the YOLOv8 architecture. During this research, the team used drone aerial photography to collect images of typical building roofs in Cicheng Old Town. Through preprocessing, unified annotation, and system training, a damage dataset containing 351 high-quality images was established, covering five types of damage: breakage, cracks, the accumulation of fallen leaves, lichen growth, and vegetation growth. The results show that (1) the model has an overall mAP of 73.44%, an F1 value of 0.75 in the vegetation growth category, and a recall rate of 0.70, showing stable and balanced detection performance for various damage types; (2) the model performs well in comparisons using confusion matrices and multidimensional indicators (including precision, recall, and log-average miss rate) and can effectively reduce the false detection and missed detection rates; and (3) the research team applied the model to drone images of the roof of Fengyue Painted Terrace Gate in Cicheng Old Town, Jiangbei District, Ningbo City, Zhejiang Province, and automatically detected and located multiple tile damage areas. The prediction results are highly consistent with field observations, verifying the feasibility and application potential of the model in actual heritage protection scenarios. Full article

(This article belongs to the Special Issue Surface Engineering in the Diagnostics, Conservation and Restoration of Cultural Heritage)

► Show Figures

Figure 1

24 pages, 4066 KiB

Open AccessArticle

Analysing the Market Value of Land Accommodating Logistics Facilities in the City of Cape Town Municipality, South Africa

by Masilonyane Mokhele

Sustainability 2025, 17(13), 5776; https://doi.org/10.3390/su17135776 - 23 Jun 2025

Viewed by 489

Abstract

The world is characterised by the growing volumes and flow of goods, which, amid benefits to economic development, result in negative externalities affecting the sustainability of cities. Although numerous studies have analysed the locational patterns of logistics facilities in cities, further research is [...] Read more.

The world is characterised by the growing volumes and flow of goods, which, amid benefits to economic development, result in negative externalities affecting the sustainability of cities. Although numerous studies have analysed the locational patterns of logistics facilities in cities, further research is required to examine their real estate patterns and trends. The aim of the paper is, therefore, to analyse the value of land accommodating logistics facilities in the City of Cape Town municipality, South Africa. Given the lack of dedicated geo-spatial data, logistics firms were searched on Google Maps, utilising a combination of aerial photography and street view imagery. Three main attributes of land parcels hosting logistics facilities were thereafter captured from the municipal cadastral information: property extent, street address, and property number. The latter two were used to extract the 2018 and 2022 property market values from the valuation rolls on the municipal website, followed by statistical, spatial, and geographically weighted regression (GWR) analyses. Zones near the central business district and seaport, as well as areas with prime road-based accessibility, had high market values, while those near the railway stations did not stand out. However, GWR yielded weak relationships between market values and the locational variables analysed, arguably showing a disconnect between spatial planning and logistics planning. Towards augmenting sustainable logistics, it is recommended that relevant stakeholders strategically integrate logistics into spatial planning, and particularly revitalise freight rail to attract investment to logistics hubs with direct railway access. Full article

(This article belongs to the Special Issue Sustainable Transport and Land Use for a Sustainable Future)

► Show Figures

Figure 1

28 pages, 1707 KiB

Open AccessReview

Video Stabilization: A Comprehensive Survey from Classical Mechanics to Deep Learning Paradigms

by Qian Xu, Qian Huang, Chuanxu Jiang, Xin Li and Yiming Wang

Modelling 2025, 6(2), 49; https://doi.org/10.3390/modelling6020049 - 17 Jun 2025

Viewed by 1314

Abstract

Video stabilization is a critical technology for enhancing video quality by eliminating or reducing image instability caused by camera shake, thereby improving the visual viewing experience. It has deeply integrated into diverse applications—including handheld recording, UAV aerial photography, and vehicle-mounted surveillance. Propelled by [...] Read more.

Video stabilization is a critical technology for enhancing video quality by eliminating or reducing image instability caused by camera shake, thereby improving the visual viewing experience. It has deeply integrated into diverse applications—including handheld recording, UAV aerial photography, and vehicle-mounted surveillance. Propelled by advances in deep learning, data-driven stabilization methods have emerged as prominent solutions, demonstrating superior efficacy in handling jitter while achieving enhanced processing efficiency. This review systematically examines the field of video stabilization. First, this paper delineates the paradigm shift from classical to deep learning-based approaches. Subsequently, it elucidates conventional digital stabilization frameworks and their deep learning counterparts along with establishing standardized assessment metrics and benchmark datasets for comparative analysis. Finally, this review addresses critical challenges such as robustness limitations in complex motion scenarios and latency constraints in real-time processing. By integrating interdisciplinary perspectives, this work provides scholars with academically rigorous and practically relevant insights to advance video stabilization research. Full article

► Show Figures

Graphical abstract

18 pages, 4854 KiB

Open AccessArticle

Comparing UAV-Based Hyperspectral and Satellite-Based Multispectral Data for Soil Moisture Estimation Using Machine Learning

by Hadi Shokati, Mahmoud Mashal, Aliakbar Noroozi, Saham Mirzaei, Zahra Mohammadi-Doqozloo, Kamal Nabiollahi, Ruhollah Taghizadeh-Mehrjardi, Pegah Khosravani, Rabindra Adhikari, Ling Hu and Thomas Scholten

Water 2025, 17(11), 1715; https://doi.org/10.3390/w17111715 - 5 Jun 2025

Viewed by 986

Abstract

Accurate estimation of soil moisture content (SMC) is crucial for effective water management, enabling improved monitoring of water stress and a deeper understanding of hydrological processes. While satellite remote sensing provides broad coverage, its spatial resolution often limits its ability to capture small-scale [...] Read more.

Accurate estimation of soil moisture content (SMC) is crucial for effective water management, enabling improved monitoring of water stress and a deeper understanding of hydrological processes. While satellite remote sensing provides broad coverage, its spatial resolution often limits its ability to capture small-scale variations in SMC, especially in landscapes with diverse land-cover types. Unmanned aerial vehicles (UAVs) equipped with hyperspectral sensors offer a promising solution to overcome this limitation. This study compares the effectiveness of Sentinel-2, Landsat-8/9 multispectral data and UAV hyperspectral data (from 339.6 nm to 1028.8 nm with spectral bands) in estimating SMC in a research farm consisting of bare soil, cropland and grassland. A DJI Matrice 100 UAV equipped with a hyperspectral spectrometer collected data on 14 field campaigns, synchronized with satellite overflights. Five machine-learning algorithms including extreme learning machines (ELMs), Gaussian process regression (GPR), partial least squares regression (PLSR), support vector regression (SVR) and artificial neural network (ANN) were used to estimate SMC, focusing on the influence of land cover on the accuracy of SMC estimation. The findings indicated that GPR outperformed the other models when using Landsat-8/9 and hyperspectral photography data, demonstrating a tight correlation with the observed SMC (R² = 0.64 and 0.89, respectively). For Sentinel-2 data, ELM showed the highest correlation, with an R² value of 0.46. In addition, a comparative analysis showed that the UAV hyperspectral data outperformed both satellite sources due to better spatial and spectral resolution. In addition, the Landsat-8/9 data outperformed the Sentinel-2 data in terms of SMC estimation accuracy. For the different land-cover types, all types of remote-sensing data showed the highest accuracy for bare soil compared to cropland and grassland. This research highlights the potential of integrating UAV-based spectroscopy and machine-learning techniques as complementary tools to satellite platforms for precise SMC monitoring. The findings contribute to the further development of remote-sensing methods and improve the understanding of SMC dynamics in heterogeneous landscapes, with significant implications for precision agriculture. By enhancing the SMC estimation accuracy at high spatial resolution, this approach can optimize irrigation practices, improve cropping strategies and contribute to sustainable agricultural practices, ultimately enabling better decision-making for farmers and land managers. However, its broader applicability depends on factors such as scalability and performance under different conditions. Full article

(This article belongs to the Special Issue Applications of Multi-Source Remote Sensing Technologies in Soil Moisture Monitoring)

► Show Figures

Figure 1

29 pages, 6039 KiB

Open AccessArticle

Tree Species Detection and Enhancing Semantic Segmentation Using Machine Learning Models with Integrated Multispectral Channels from PlanetScope and Digital Aerial Photogrammetry in Young Boreal Forest

by Arun Gyawali, Mika Aalto and Tapio Ranta

Remote Sens. 2025, 17(11), 1811; https://doi.org/10.3390/rs17111811 - 22 May 2025

Viewed by 1059

Abstract

The precise identification and classification of tree species in young forests during their early development stages are vital for forest management and silvicultural efforts that support their growth and renewal. However, achieving accurate geolocation and species classification through field-based surveys is often a [...] Read more.

The precise identification and classification of tree species in young forests during their early development stages are vital for forest management and silvicultural efforts that support their growth and renewal. However, achieving accurate geolocation and species classification through field-based surveys is often a labor-intensive and complicated task. Remote sensing technologies combined with machine learning techniques present an encouraging solution, offering a more efficient alternative to conventional field-based methods. This study aimed to detect and classify young forest tree species using remote sensing imagery and machine learning techniques. The study mainly involved two different objectives: first, tree species detection using the latest version of You Only Look Once (YOLOv12), and second, semantic segmentation (classification) using random forest, Categorical Boosting (CatBoost), and a Convolutional Neural Network (CNN). To the best of our knowledge, this marks the first exploration utilizing YOLOv12 for tree species identification, along with the study that integrates digital aerial photogrammetry with Planet imagery to achieve semantic segmentation in young forests. The study used two remote sensing datasets: RGB imagery from unmanned aerial vehicle (UAV) ortho photography and RGB-NIR from PlanetScope. For YOLOv12-based tree species detection, only RGB from ortho photography was used, while semantic segmentation was performed with three sets of data: (1) Ortho RGB (3 bands), (2) Ortho RGB + canopy height model (CHM) + Planet RGB-NIR (8 bands), and (3) ortho RGB + CHM + Planet RGB-NIR + 12 vegetation indices (20 bands). With three models applied to these datasets, nine machine learning models were trained and tested using 57 images (1024 × 1024 pixels) and their corresponding mask tiles. The YOLOv12 model achieved 79% overall accuracy, with Scots pine performing best (precision: 97%, recall: 92%, mAP50: 97%, mAP75: 80%) and Norway spruce showing slightly lower accuracy (precision: 94%, recall: 82%, mAP50: 90%, mAP75: 71%). For semantic segmentation, the CatBoost model with 20 bands outperformed other models, achieving 85% accuracy, 80% Kappa, and 81% MCC, with CHM, EVI, NIRPlanet, GreenPlanet, NDGI, GNDVI, and NDVI being the most influential variables. These results indicate that a simple boosting model like CatBoost can outperform more complex CNNs for semantic segmentation in young forests. Full article

► Show Figures

Graphical abstract

Search Results (387)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (387)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI