
Development of a High-Precision Lidar System and Improvement of Key Steps for Railway Obstacle Detection Algorithm

1 Laboratory of All-Solid-State Light Sources, Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
2 College of Materials Science and Opto-Electronic Technology, University of Chinese Academy of Sciences, Beijing 101407, China
3 Shenghong (Taizhou) Laser Technology Co. Ltd., Taizhou 318001, China
* Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(10), 1761; https://doi.org/10.3390/rs16101761
Submission received: 27 March 2024 / Revised: 4 May 2024 / Accepted: 13 May 2024 / Published: 16 May 2024

Abstract:
In response to the growing demand for railway obstacle monitoring, lidar technology has emerged as an up-and-coming solution. In this study, we developed a mechanical 3D lidar system and meticulously calibrated the point cloud transformation to monitor specific areas precisely. Based on this foundation, we have devised a novel set of algorithms for obstacle detection within point clouds. These algorithms encompass three key steps: (a) the segmentation of ground point clouds and extraction of track point clouds using our RS-Lo-RANSAC (region select Lo-RANSAC) algorithm; (b) the registration of the BP (background point cloud) and FP (foreground point cloud) via an improved Robust ICP algorithm; and (c) obstacle recognition based on the VFOR (voxel-based feature obstacle recognition) algorithm from the fused point clouds. This set of algorithms has demonstrated robustness and operational efficiency in our experiments on a dataset obtained from an experimental field. Notably, it enables monitoring obstacles with dimensions of 15 cm × 15 cm × 15 cm. Overall, our study showcases the immense potential of lidar technology in railway obstacle monitoring, presenting a promising solution to enhance safety in this field.

1. Introduction

Railway safety is a topic of great public concern, and in recent years train accidents caused by obstacles have occurred frequently [1]. Because of the complexity of railway terrain, obstacle intrusion is random, sudden, and hard to predict [2]. Real-time monitoring of obstacles has therefore become an urgent demand [3].
Most research on railway obstacle intrusion focuses on visual detection [4]. However, current optical systems face several challenges, including susceptibility to lighting conditions; reduced obstacle recognition in harsh environments such as rain, snow, and fog; and difficulty in accurately recovering the distance of obstacles [5]. Perić et al. [6] proposed a machine learning-based strategy for detecting obstacles from pixel-level differences, to overcome the low pixel resolution of camera systems under adverse environmental conditions. Meanwhile, Kyatsandra et al. [7] introduced a vision enhancement system that combines infrared cameras and other devices to improve the detection rate in adverse weather. Lidar is an active sensor: it emits laser pulses towards an object and calculates the distance by collecting and analyzing the echo signals reflected from the object. In recent years, lidar sensors have been widely used in obstacle detection because they are immune to ambient light, respond quickly, and effectively detect distant obstacles [8,9]. The mainstream 3D lidars used in obstacle measurement include MEMS (Micro-Electro-Mechanical System) lidar and traditional mechanical lidar. Owing to its faster scanning speed, MEMS lidar is used more often for dynamic obstacle detection. Zheng et al. [11] combined a 2D MEMS lidar with drones to calibrate coordinates on sparse point cloud maps and used a clustering algorithm to detect obstacles in motion. MEMS lidar uses voltage to drive a two-axis MEMS mirror to scan in three dimensions [10]. When it is used to detect railway obstacles, the scanning method shown in Figure 1a is usually adopted to realize long-range detection; however, this method introduces scanning blind areas. Traditional mechanical lidar uses a motor to carry a single-line lidar to realize three-dimensional scanning. It can achieve a denser distribution of scan lines by slowing down the motor speed, so its imaging detail is outstanding (Figure 1b). Amaral et al. [12] generated 3D point clouds along the railway by combining a 2D lidar with inertial measurement units and detected railway obstacles in a small crossing area.
Similarly, there has been extensive research on algorithms for point cloud obstacle detection [13,14]. Gao et al. [15] analyzed the geometric properties of obstacles in non-uniform-density point clouds and proposed an improved dynamic clustering algorithm that enhances the accuracy of traditional density-based clustering in autonomous driving; however, the algorithm's stability decreases when dealing with parallel or distant obstacles. Before clustering to identify obstacles, Anand et al. [16] performed segmentation preprocessing on the ground point cloud: by dividing the ground point cloud into squares and statistically analyzing the points within each square, they achieved ground segmentation under different slopes. This algorithm demonstrates better environmental adaptability than traditional segmentation approaches such as RANSAC and voxel-based methods. Machine learning-based algorithms have also been introduced into lidar obstacle detection [17,18], but achieving a balance between timeliness and accuracy has proven difficult for them. Currently, the application of lidar in obstacle detection still focuses on large obstacles such as cars and pedestrians, and only a few studies have applied it to detecting small obstacles [19]. Amaral et al. [12] used a 2D laser scanner to design an obstacle detection device for railway crossings, successfully recognizing obstacles 30 cm high; however, the operating distance of the device is limited to 10 m. Li [20] proposed an obstacle detection algorithm that eliminates ground disturbances using a 3D lidar and a biaxial rotating platform, enabling the device to detect obstacles 10 cm high, but its operating range is also greatly limited. Therefore, developing a lidar device and a robust algorithm that can detect small obstacles within a useful range has become an urgent need.
Based on the above discussion, we have designed a regionally adjustable 3D lidar system for railway obstacle monitoring and developed a new obstacle-monitoring method based on the fusion of background and foreground point clouds. Our lidar system and obstacle recognition algorithm avoid the repetitive labor of manual inspections and the errors caused by subjective judgment. Meanwhile, the limited resolution of video surveillance in low-light conditions such as darkness and tunnels seriously affects the accuracy of obstacle detection; our lidar system can overcome this limitation.
During the system design, we carried out the complete process from hardware selection to system implementation. We utilized a high-precision servo motor, a PLC (programmable logic controller), and a single-line lidar to create a 3D mechanical lidar with an adjustable scanning area (by adjusting the scanning speed and the pitch angle range of the servo motor, we can obtain different scanning areas). Furthermore, we thoroughly discussed the calibration of coordinate errors for objects during the scanning process.
Then, we developed an obstacle detection algorithm suitable for large-scale ground point clouds and focused on several of its key steps. In the first stage, we preprocessed the point cloud in the track area using a statistical outlier filter. In the second stage, to improve the efficiency and robustness of ground point cloud segmentation, we proposed a region-select random sample consensus algorithm (RS-Lo-RANSAC) to segment the ground and extract the track model. In the following stage, we used an improved Robust ICP algorithm to register the background point cloud (a fixed point cloud of the track area without obstacles) and the foreground point cloud (real-time point clouds of the track area that may contain obstacles), achieving good consistency. Finally, we proposed a voxel-based feature obstacle recognition (VFOR) algorithm to achieve the stable detection of small obstacles in the railway region.
The contributions of this article are as follows:
(a) We have developed a region-adjustable 3D mechanical lidar suitable for railway obstacle detection, covering the entire process from hardware selection to software implementation, and performed accurate corrections on the lidar point clouds.
(b) We have developed a novel set of obstacle detection algorithms for lidar point clouds, which enables the high-precision monitoring of obstacles in the track area.
(c) We have improved the Lo-RANSAC and Robust ICP algorithms and verified them experimentally, enhancing their robustness and operational efficiency and making them applicable in most cases.
This paper is organized as follows. In Section 2, the components of the system are introduced, including the coordinate transformation of point cloud information and precise error correction. Section 3 discusses the algorithm's processes and various steps, while Section 4 presents the experimental results. Finally, Section 5 summarizes the conclusions.

2. Overall System Structure

As shown in Figure 2a, the hardware of the system comprises a 2D scanning lidar rangefinder, a high-precision servo motor, and a PLC. The design and implementation of the 2D lidar rangefinder are beyond the scope of this article; it scans along the lateral line at a fixed angular resolution. The servo motor carries the 2D lidar to acquire 3D information.
The 2D lidar emits pulses at a fixed frequency, performing horizontal scans that return the distance R of the detected object at a fixed lateral angular resolution δ, while the PLC receives real-time elevation angle measurements. The scanning principle is shown in Figure 3a. Simultaneously, a high-precision servo motor drives the 2D lidar in pitch while under load, achieving precise real-time acquisition of pitch angles during operation. With this information, the three-dimensional Cartesian coordinates of a detection point can be obtained (Figure 3b):
$$x = R \sin\omega \cos\theta \tag{1}$$
$$y = R \sin\theta \tag{2}$$
$$z = H - R \cos\omega \cos\theta \tag{3}$$
where ω represents the pulse’s pitch angle, θ represents the horizontal angle, R represents pulse distance, and H represents the height of the system.

2.1. Precise Coordinate Error Correction

In the above transformation, because of the servo motor's limited feedback precision and time delay, it is common to represent the pitch angle of all pulses in a scan line by a single value. However, this does not hold in the actual scanning process. As shown in Figure 4, although each 2D lidar scan cycle is fast, not every pulse has the same pitch angle; instead, the pitch angle varies linearly along the direction of pitch motion (assuming uniform scanning). To address this issue, we propose a precise coordinate error correction formula:
$$\theta_{ij} = i \times \delta \tag{4}$$
$$\kappa_{ij} = i \times (\omega_{j+1} - \omega_j) / (2n) \tag{5}$$
$$\omega_{ij} = \omega_j + \kappa_{ij} \tag{6}$$
In this set of equations, δ represents the fixed angular resolution of the 2D lidar; θ_ij and ω_ij represent the horizontal scanning angle and the actual pitch angle of the i-th pulse on the j-th scan line, respectively; ω_j represents the starting pitch angle of the j-th scan line; κ_ij is the angle compensation for that pulse; and n is the total number of pulses in the j-th scan line. κ_ij takes a positive value when the lidar moves upwards and a negative value when it moves downwards.
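For illustration, the corrected transformation of Equations (1)-(6) can be written as a short NumPy routine (a sketch with our own function and argument names, not the system's production code):

```python
import numpy as np

def pulse_to_xyz(R, i, omega_j, omega_j1, n, delta, H):
    """Corrected Cartesian coordinates of the i-th pulse on the j-th scan line.

    R: pulse distance; i: pulse index within the line; omega_j, omega_j1:
    starting pitch angles of lines j and j+1 (rad); n: pulses per line;
    delta: horizontal angular resolution (rad); H: mounting height.
    """
    theta = i * delta                            # Eq. (4)
    # Eq. (5): per-pulse pitch compensation; the sign of (omega_j1 - omega_j)
    # encodes whether the lidar is pitching upwards or downwards
    kappa = i * (omega_j1 - omega_j) / (2 * n)
    omega = omega_j + kappa                      # Eq. (6)
    x = R * np.sin(omega) * np.cos(theta)        # Eq. (1)
    y = R * np.sin(theta)                        # Eq. (2)
    z = H - R * np.cos(omega) * np.cos(theta)    # Eq. (3)
    return x, y, z
```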
High-precision point cloud information is vital for our algorithm. In the next phase, we will use the accurate point cloud information obtained from the system to identify obstacles.

3. Point Cloud Obstacle Detection Algorithm

This section describes each stage of the proposed point cloud-based obstacle detection algorithm for the track area. As shown in Figure 5, we mainly focused on designing the three key steps in the three red boxes, which are as follows:
(a) Large-scale ground point cloud segmentation and railway model extraction using RS-Lo-RANSAC;
(b) Fine registration of the background point cloud (BP) and foreground point cloud (FP) (including obstacles) using the improved Robust ICP;
(c) Obstacle recognition based on the VFOR algorithm.

3.1. Large-Scale Ground Point Cloud Segmentation and Railway Track Extraction

After calibrating the point cloud coordinates, we obtained a large-scale, detailed point cloud of the scanning area. With the system installed beside the railway tracks, the point cloud contains features such as the ground, railway tracks, obstacles, or trains. Our algorithm requires one frame of obstacle-free point cloud as the background point cloud, while the foreground point cloud may contain obstacles. We first apply an outlier filter to remove isolated noise from the two input point clouds. Then, we perform plane segmentation on the long-range ground point cloud.
The classic RANSAC [21] algorithm fits the samples by randomly selecting a minimal subset. Points within a defined error threshold are considered inliers. The loop terminates when the probability of missing an uncontaminated sample set of size m within k samples falls below a predefined threshold. The selection of the sample size should be as follows:
$$\eta = (1 - \rho_m)^k \tag{7}$$
where $\rho_m$ represents the likelihood of randomly selecting a minimal sample of m points that are all inliers, I represents the theoretical number of inliers, and N represents the total number of points in the dataset.
$$\rho_m = \left( \frac{I}{N} \right)^m \tag{8}$$
$$k = \frac{\log(\eta)}{\log(1 - \rho_m)} \tag{9}$$
However, this assumption is not always valid because RANSAC cannot guarantee that the sampled points are uncontaminated during random selection. Therefore, in situations with significant noise, the algorithm is more easily affected and typically requires substantially more iterations than the theoretical value.
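As a concrete check of this theoretical value: for a plane model (m = 3), an inlier ratio of 50%, and a failure probability η = 0.01, Equations (7)-(9) give roughly 35 iterations:

```python
import math

m, inlier_ratio, eta = 3, 0.5, 0.01        # plane model, 50% inliers, 1% failure risk
rho_m = inlier_ratio ** m                  # Eq. (8): 0.125
k = math.log(eta) / math.log(1 - rho_m)    # Eq. (9)
print(math.ceil(k))                        # 35 theoretical iterations
```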
During iteration, η represents the lack of confidence in the correct model estimate, while $\rho_m$ can be interpreted as the probability of the model being correctly estimated. Let x denote the correct model parameters and $\hat{x}$ the estimated ones; the error is quantified by a function $f(x, \hat{x})$. $X[n]$ denotes the dataset of n selected samples, $X[\mathrm{inliers}]$ the samples belonging to the inliers, and $X[\mathrm{outliers}]$ the samples belonging to the outliers. According to [22], Equation (9) can be rewritten as follows:
$$\eta = \left[ P\left( f(x,\hat{x}) > \epsilon \right) \right]^k, \quad x \in X[\mathrm{inliers}] \tag{10}$$
$$k = \frac{\log(\eta)}{\log\left( 1 - P\left( f(x,\hat{x}) < \epsilon \mid X[n] \right) \right)} \tag{11}$$
$$P\left( f(x,\hat{x}) < \epsilon \mid X[n] \right) = \rho_n \, P\left( f(x,\hat{x}) < \epsilon \mid X[\mathrm{inliers}] \right) + (1 - \rho_n) \, P\left( f(x,\hat{x}) < \epsilon \mid X[\mathrm{inliers+outliers}] \right) \tag{12}$$
$$P\left( f(x,\hat{x}) < \epsilon \mid X[n] \right) \geq \rho_n \, P\left( f(x,\hat{x}) < \epsilon \mid X[\mathrm{inliers}] \right) \tag{13}$$
for some ϵ > 0.
When the probability distributions of the inliers, outliers, and noise overlap significantly, and assuming that the inlier noise follows a Gaussian distribution, the following formula describes the optimal number of model points, denoted n*, for a known signal-to-noise ratio:
$$n^* = \arg\max_n \left( \rho_n \, P\left( f(x,\hat{x}) < \epsilon \mid X[\mathrm{inliers}] \right) \right) \tag{14}$$
Under the above assumptions, Equation (14) follows a quadratic distribution. Based on the experimental findings of [22], the optimal sample size n* for reconstructing the model satisfies the following relationship:
$$n^* = \begin{cases} m + 4, & \rho_n \geq 0.9 \\ m + 3, & 0.8 \leq \rho_n < 0.9 \\ m + 2, & 0.6 \leq \rho_n < 0.8 \\ m + 1, & 0.4 \leq \rho_n < 0.6 \\ m, & \rho_n < 0.4 \end{cases} \tag{15}$$
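Formula (15) transcribes directly into a lookup (a sketch; here rho_n would be estimated from the current best inlier ratio):

```python
def optimal_sample_size(m: int, rho_n: float) -> int:
    """Optimal sample size n* for model fitting, per Formula (15)."""
    if rho_n >= 0.9:
        return m + 4
    if rho_n >= 0.8:
        return m + 3
    if rho_n >= 0.6:
        return m + 2
    if rho_n >= 0.4:
        return m + 1
    return m
```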
To improve the stability of the model, Lo-RANSAC [21,22] introduces a local optimization step. After obtaining the best model so far, it samples and performs local optimization using the inlier points. This process generates a new model with a higher proportion of inliers for subsequent iterations. Experimental results have demonstrated that Lo-RANSAC, which incorporates sample-local optimization, exhibits superior robustness in plane segmentation compared to the classical RANSAC algorithm.
Inspired by the works of [22,23], we propose a strategy, called RS-Lo-RANSAC (Region Select Lo-RANSAC), to enhance the stability and operational efficiency of Lo-RANSAC [24] in ground segmentation while further reducing the impact of noise on model fitting. When fitting the minimal sample model, we use the optimal sample Formula (15) as a reference to guide the fitting process. In the Lo-RANSAC sample fitting process [25], the current sample is fitted by randomly selecting a number of sample points between the minimal sample size and the optimal sample size; we therefore call the method RS-Lo-RANSAC, indicating the selection of sample points from a specific region. The method consists of the following steps (the pseudocode is shown as Algorithm 1):
(a) Randomly select m to n sample points within an adaptively determined range capable of fitting the model, and use these points for least-squares sample fitting.
(b) Iterate through the loop and perform local optimization whenever a new best model is identified, using the Total Least Squares (TLS) method to optimize the inlier points of the model.
(c) Terminate the iteration early based on probabilistic principles: once conditions derived from probability statistics are satisfied, the model is considered stable and the iteration can stop.
Algorithm 1: RS-Lo-RANSAC algorithm for ground segmentation and track extraction
Input: point cloud set S, error threshold ϵ, failure probability η, sample interval [m, n]
Output: best model M*
1:  while iter < f(η, m) do
2:    S_random ← select [m, n] sample points from S randomly
3:    M ← model fitted from S_random
4:    S_inlier ← calculate the inlier points of the plane M
5:    if |S_inlier| > |S*_inlier| then
6:      M* ← M
7:      // run Local Optimization
8:      for iters_lo = 1; iters_lo ≤ max_iters_lo do
9:        S_Lo ← inlier points of M*
10:       M_Lo ← model estimated from S_Lo by TLS
11:       if |S_Lo| > |S*_inlier| then
12:         S*_inlier ← S_Lo
13:         M* ← M_Lo
14:       end if
15:       iters_lo++
16:     end for
17:   end if
18: return M*
The termination condition can be defined as the following:
$$k_{max} = \frac{\log(\eta)}{\log\left( 1 - (N^*/N)^m \right)} \tag{16}$$
where N* represents the number of inlier points of the best model so far, N represents the total number of points in the point cloud, and m is the minimal sample fit threshold.
By implementing these steps, our goal is to significantly improve the stability and operational efficiency of Lo-RANSAC in ground segmentation, while minimizing the influence of noise on model fitting.
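To make the three steps above concrete, the following NumPy sketch implements the main loop for plane segmentation under simplifying assumptions (ordinary least squares stands in for TLS in the local optimization, and all names are illustrative rather than the authors' released code):

```python
import numpy as np

def fit_plane_lsq(pts):
    """Least-squares plane through pts: unit normal nrm and offset d (nrm . x = d)."""
    centroid = pts.mean(axis=0)
    _, _, vt = np.linalg.svd(pts - centroid)
    nrm = vt[-1]                                 # direction of least variance
    return nrm, nrm @ centroid

def rs_lo_ransac(S, eps, eta=0.05, m=3, n_opt=7, lo_iters=10, max_iters=1000):
    rng = np.random.default_rng(0)
    best_model, best_inl, k, k_max = None, np.zeros(len(S), bool), 0, max_iters
    while k < k_max:
        # (a) sample between the minimal (m) and optimal (n_opt) sample size
        idx = rng.choice(len(S), size=rng.integers(m, n_opt + 1), replace=False)
        nrm, d = fit_plane_lsq(S[idx])
        inl = np.abs(S @ nrm - d) < eps
        if inl.sum() > best_inl.sum():
            best_model, best_inl = (nrm, d), inl
            # (b) local optimization: refit on the current inlier set
            for _ in range(lo_iters):
                nrm, d = fit_plane_lsq(S[best_inl])
                inl = np.abs(S @ nrm - d) < eps
                if inl.sum() <= best_inl.sum():
                    break
                best_model, best_inl = (nrm, d), inl
            # (c) early termination via Eq. (16)
            rho = best_inl.sum() / len(S)
            if 0 < rho < 1:
                k_max = min(k_max, np.log(eta) / np.log(1 - rho**m))
        k += 1
    return best_model, best_inl
```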

3.2. Fine Registration of BP and FP

After ground segmentation, we obtained point cloud maps of the respective track regions (BP and FP). We aim to achieve the fine registration of these point cloud maps at this stage.
The Iterative Closest Point (ICP) [26] algorithm is widely used for point cloud registration [27,28]. Its core principle is to iteratively refine a mapping transformation from a source point cloud to a target point cloud. The objective is to find the best alignment between source and target point clouds. The algorithm computes the optimal transformation by minimizing distance metrics such as Euclidean distance or point-to-plane distance. Through iterative updates, the ICP algorithm gradually converges to achieve an accurate registration result:
$$f(R,t) = \min_{R,t} \sum_{i=1}^{n} \left\| R p_i + t - q_i \right\|^2 \tag{17}$$
where $p_i \in P = \{p_1, p_2, \ldots, p_n\}$ is a point in the source point cloud, $q_i \in Q = \{q_1, q_2, \ldots, q_m\}$ is a point in the target point cloud, $R \in \mathbb{R}^{d \times d}$ is the rotation matrix, and $t \in \mathbb{R}^{d}$ is the translation vector. During the ICP iterations, commonly used strategies for finding correspondences between point clouds include point-to-point, point-to-line, and point-to-plane [29]. The steps of the traditional ICP algorithm are as follows [30]:
(a) Given an initial source point cloud set P and input set Q, find the subset X in P that corresponds to a set Y in Q, which can be described as $(Y_k, Q) = F((X_k, P))$, where k denotes the ICP iteration number, initialized to zero.
(b) Calculate the mapping matrix based on the point set relation between $Y_k$ and $X_k$; the mapping matrix is represented as (18):
$$T = \begin{bmatrix} R & t \\ 0 & 1 \end{bmatrix} \in \mathbb{R}^{(d+1) \times (d+1)} \tag{18}$$
(c) Apply the mapping matrix to the source point cloud for pose transformation, $P' = TP$, use P′ as the input source for the next round, and calculate the mean squared point matching error $d_k = D(P', Q)$.
(d) If $d_k$ is less than a given threshold, exit the loop and take the current mapping matrix T as the optimal solution; otherwise, continue iterating until the maximum number of iterations is reached.
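A single iteration of this classic point-to-point scheme can be sketched in a few lines of NumPy, solving step (b) in closed form with the SVD (a generic textbook formulation, not the specific implementation benchmarked later):

```python
import numpy as np

def icp_step(P, Q):
    """One point-to-point ICP iteration: nearest-neighbor correspondence,
    then the closed-form rigid transform minimizing Eq. (17) (Kabsch/SVD)."""
    # step (a): brute-force nearest neighbors (a KD-tree is used in practice)
    d2 = ((P[:, None, :] - Q[None, :, :]) ** 2).sum(-1)
    Y = Q[d2.argmin(axis=1)]
    # step (b): closed-form R, t minimizing sum ||R p_i + t - y_i||^2
    mu_p, mu_y = P.mean(0), Y.mean(0)
    U, _, Vt = np.linalg.svd((P - mu_p).T @ (Y - mu_y))
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])  # no reflections
    R = Vt.T @ D @ U.T
    t = mu_y - R @ mu_p
    # steps (c)-(d): transformed source and mean matching error d_k
    P_new = P @ R.T + t
    return P_new, R, t, np.linalg.norm(P_new - Y, axis=1).mean()
```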
In the correspondence step, the l2-norm is usually used to represent the registration error. However, the convergence of the Iterative Closest Point (ICP) algorithm may be hindered by outliers, partial overlap, occlusion, and other issues in the source and target point clouds. The TrICP algorithm proposed by Chetverikov et al. [31] addresses this problem by introducing the concept of overlap during the iterative process to remove outlier points. Assuming Equation (19) is satisfied for source point set A and target point set B, computing the overlap allows points that fail the overlap criterion to be filtered out between the two sets during iteration, minimizing the influence of misalignment caused by distant outliers. However, computing the optimal overlap can be computationally expensive.
$$\forall A_i \in A,\ \exists B_i \in B,\ i = 1, 2, 3, \ldots, n_A : \left\| A_i - B_i \right\| < \varepsilon, \qquad \zeta = \frac{n_A}{N_B} \tag{19}$$
where ζ represents the overlap defined during the iterative process of TrICP.
The Sparse ICP [32] algorithm promotes sparsity in the registration residuals between the source and target point cloud sets by replacing the l2-norm with an lp-norm in the error metric. The loss function is denoted as (20):
$$\min_{R,t} \sum_{i=1}^{n} \left\| R x_i + t - y_i \right\|_2^p, \quad p \in (0, 1] \tag{20}$$
While this mitigates the penalty for point pairs with large spatial separation to some extent, the alternating direction method of multipliers (ADMM) used in the solution process significantly increases the computational complexity.
To overcome the limitations posed by the previously mentioned methods and effectively tackle the challenges arising from partial overlap and outlier points during the registration process, Zhang et al. [33] proposed a novel registration approach. This method incorporates Welsch’s function as a loss function, strategically formulating the correspondence step in the following manner:
$$\min_{R,t} \sum_{i=1}^{n} \varphi_\nu \left( \min_{q_i \in Q} \left\| R p_i + t - q_i \right\| \right) + I_{SO(d)}(R) \tag{21}$$
where $I_{SO(d)}(R)$ is the indicator function of the special orthogonal group $SO(d)$, and $\varphi_\nu$ is Welsch's function, denoted as (22):
$$\varphi_\nu(x) = 1 - \exp\left( -\frac{x^2}{2\nu^2} \right) \tag{22}$$
By introducing a quadratic surrogate function $\chi_\nu(x, y) = \varphi_\nu(y) + \frac{x^2 - y^2}{2\nu^2} \exp\left( -\frac{y^2}{2\nu^2} \right)$, the alignment step is formulated as a weighted least-squares problem:
$$\left( R^{(k+1)}, t^{(k+1)} \right) = \arg\min_{R,t} \sum_{i=1}^{n} \omega_i^{(k)} \left\| R p_i + t - q_i \right\|^2 + I_{SO(d)}(R) \tag{23}$$
$$\omega_i^{(k)} = \exp\left( -\frac{\left\| R^{(k)} p_i + t^{(k)} - q_i \right\|^2}{2\nu^2} \right) \tag{24}$$
In Formula (22), ν is a free parameter that controls the shape of Welsch's function; its specific influence curve is shown in Figure 6.
The weight coefficient is associated with the distance between point pairs and follows a Gaussian form; its range is constrained to (0, 1] (as shown in Figure 6). When the distance between a registration point pair is sufficiently large, the weight diminishes, penalizing outliers and preventing erroneous registration. Conversely, when ν is large, the weights decay more slowly and more point pairs can participate in the registration process, improving precision and robustness. This method can be regarded as a soft-thresholding approach that penalizes points with larger errors during the iterative procedure, yielding more stable outcomes.
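The soft-thresholding behavior of Equation (24) is easy to inspect numerically; a residual near zero keeps a weight close to 1, while a residual a few ν away is effectively discarded (values below are illustrative):

```python
import numpy as np

def welsch_weight(residual, nu):
    """Gaussian weight of Eq. (24); large registration errors are soft-thresholded."""
    return np.exp(-residual**2 / (2 * nu**2))

r = np.array([0.0, 0.5, 1.0, 3.0, 10.0])
print(welsch_weight(r, nu=1.0))  # approx. [1.00, 0.88, 0.61, 0.011, 2e-22]
```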
In the iterative registration process, the update strategy and range of the Gaussian weight parameter ν play a crucial role. A well-designed iteration strategy enables more point pairs to be involved in the registration, thereby preventing overfitting and improving the algorithm's robustness and efficiency. Zhang et al. [33] proposed boundary conditions for ν based on the assumption that the sampled points in a point cloud have uniform density. Following the three-sigma rule, one boundary condition is set to $3\tilde{\nu}$ (where $\tilde{\nu}$ is the median of the registration errors); the other boundary condition is set to $E/(3\sqrt{3})$ (where E is the median distance between neighboring points) to enhance the tolerance of sampled points.
During the registration process, starting from one boundary condition, the intermediate transformation matrix’s change is monitored. If this change is below a threshold or the maximum iteration count is reached, the current iteration is terminated, and the weight is reduced by half until reaching the other boundary condition. This iterative approach ensures the fine-tuning of the registration process and achieves optimal results. However, in practical scenarios where the density of the same point cloud varies significantly, the above strategy may not perform well for certain samples and lead to overfitting. To address this issue, we propose a new improved point cloud registration strategy to enhance the usability of the Robust ICP algorithm. This new strategy demonstrates superior efficiency and robustness in test samples.
As shown in Figure 7, the density of point clouds within the point cloud map exhibits significant variation with distance. The original algorithm employed the median of neighbor points in the point cloud map to determine the upper and lower bounds of parameters. Considering this observation, we propose the following enhancements.
Regarding ν m a x , we aim to achieve a comprehensive matching relationship while minimizing the number of iterative cycles. To accomplish this, we transform the target point cloud using the mapping matrix and then analyze the resulting relationships after removing outliers. Through this analysis, we determine the maximum difference E m a x in the initial transformation of the point cloud, which serves as the definition for ν m a x .
In the case of ν m i n , we aim to improve registration point sparsity while adhering to specific constraints. To achieve this, we examine the point density within fixed-threshold neighborhoods in the point cloud map and eliminate any exceptional values. Instead of using a traditional ν m i n , we adopt the maximum value among all points in the neighborhood. We argue that this strategy yields smoother Gaussian weights across spatial regions, mainly for non-uniform density point cloud maps. This approach increases tolerance towards matching errors while partially constraining the function to deviate from the l0-norm. The intention is to strike a balance between accommodating significant error points and ensuring precise alignment in the results.
Lastly, throughout the iterative process, we observed that within a given cycle, once ν reaches a fixed value, the last few iterations cease to contribute significantly to the precise alignment of the samples. This diminished effect can be attributed to energy convergence. To tackle this issue, we devised an adaptive iteration strategy based on the convergence energy, summarized as follows.
The determination to terminate the current iteration is made by continuously monitoring the changes in the convergence energy. By observing this energy metric, we can identify when further iterations will yield diminishing returns in terms of improving alignment accuracy. This approach ensures that the iterative process is dynamically adjusted based on the convergence behavior, thus optimizing the efficiency and effectiveness of the algorithm.
$$E_{stop} = \alpha E_{initial} \tag{25}$$
In this case, $E_{stop}$ represents the final convergence energy, $E_{initial}$ the convergence energy of the initial iteration, and α our adaptive threshold.
The pseudocode for the key steps of the algorithm is shown as Algorithm 2:
Algorithm 2: Improved point-to-point Robust ICP using Welsch's function
Input: T_initial: initial transformation;
  ϵ: error threshold for T;
  max_iters: maximum iteration count;
  E_conv: convergence energy of the current iteration;
  E′_conv: convergence energy of the previous iteration;
  k: number of iterations;
  α: energy threshold
Initialize: E′_conv = +∞; k = 0
1:  while True do
2:    k = 0;
3:    while k < max_iters do
4:      if (E′_conv − E_conv) > α·E_conv then
5:        (R^(k), t^(k)) = (R, t); Q^(k) = min Σ_{i=1}^{n} ||R^(k) p_i + t^(k) − q_i||²
6:      else break;
7:      end if
8:      E_conv = E_ν((R^(k), t^(k)), Q^(k));
9:      (R, t) = argmin Σ_{i=1}^{n} ω_i^(k) ||R p_i + t − q_i||²;
10:     if ||(R, t) − (R^(k), t^(k))|| < ϵ then break;
11:     end if
12:     E′_conv = E_conv;
13:     // Using Anderson acceleration
14:     (R^(k+1), t^(k+1)) = Anderson((R^(k), t^(k))) [33,34]
15:     Q^(k+1) = min Σ_{i=1}^{n} ||R^(k+1) p_i + t^(k+1) − q_i||²
16:     k++;
17:   end while
18:   if ν == ν_min then return T*;
19:   end if
20:   ν = max(ν/2, ν_min);
21: end while
22: return T*
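Putting Algorithm 2 together (minus the Anderson acceleration of line 14, which we omit for brevity), a condensed NumPy sketch of the outer loop might look as follows; the correspondence search, parameter names, and the weighted Kabsch update are our own illustrative assumptions, not the authors' released implementation:

```python
import numpy as np

def improved_robust_icp(P, Q, nu_max, nu_min, alpha=0.01, max_iters=50):
    """Welsch-weighted ICP with an energy-based early stop (Eq. 25) and a
    nu-halving schedule between nu_max and nu_min (sketch of Algorithm 2)."""
    R, t, nu = np.eye(3), np.zeros(3), nu_max
    while True:
        E_prev = np.inf
        for _ in range(max_iters):
            Pt = P @ R.T + t
            # correspondence step (brute force for clarity)
            d2 = ((Pt[:, None, :] - Q[None, :, :]) ** 2).sum(-1)
            Y = Q[d2.argmin(1)]
            r2 = ((Pt - Y) ** 2).sum(1)
            w = np.exp(-r2 / (2 * nu**2))        # Gaussian weights, Eq. (24)
            E = (1.0 - w).sum()                  # convergence energy, cf. Eq. (22)
            if E_prev - E <= alpha * E:          # adaptive stop, cf. Eq. (25)
                break
            E_prev = E
            # alignment step: weighted least squares in closed form (weighted Kabsch)
            w_sum = w.sum() + 1e-12
            mu_p = (w[:, None] * Pt).sum(0) / w_sum
            mu_y = (w[:, None] * Y).sum(0) / w_sum
            U, _, Vt = np.linalg.svd((w[:, None] * (Pt - mu_p)).T @ (Y - mu_y))
            D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
            R_step = Vt.T @ D @ U.T
            t_step = mu_y - R_step @ mu_p
            R, t = R_step @ R, R_step @ t + t_step   # compose incremental update
        if nu <= nu_min:
            return R, t
        nu = max(nu / 2.0, nu_min)
```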

3.3. Extracting Voxel Features from Fused Point Clouds

Voxel grids are extensively utilized in the processing of large volumes of data within point clouds [35,36]. Grid-based voxelization, a common technique, is employed to downsample the point cloud data, thereby reducing density and introducing sparsity [37].
In the preceding steps, we employed the RS-Lo-RANSAC algorithm to extract the ground-removed point clouds from the source and target point clouds. Subsequently, using the improved Robust ICP algorithm, we fused the BP and FP point clouds into a coherent point cloud representation. Importantly, this fused point cloud incorporates feature labels (as shown in Figure 8).
We have devised an innovative obstacle recognition algorithm to harness the unique features within the fused point cloud. Our approach incorporates voxelization as a crucial step, dividing the point cloud into fixed-size voxel grids. The size parameter of these voxel grids, denoted by Voxelsize, is directly related to the desired obstacle resolution (for obstacle detection requirements with a minimum size of A cm × A cm × A cm, we usually set the empirical parameter Voxelsize to (A ± 5) cm).
First, we voxelize the fused point cloud map U using a voxel grid with side length Voxelsize. The voxel index, established in XYZ order, is $(V_i.x, V_i.y, V_i.z)$. For point retrieval, we leverage an Octree [38] to expedite the procedure:
$$V_i.x = \left\lfloor \frac{U_i.x - x_{min}}{Voxelsize} \right\rfloor, \quad V_i.y = \left\lfloor \frac{U_i.y - y_{min}}{Voxelsize} \right\rfloor, \quad V_i.z = \left\lfloor \frac{U_i.z - z_{min}}{Voxelsize} \right\rfloor, \quad \forall U_i \in U \tag{26}$$
Subsequently, according to Formulas (27) and (28), we analyze the point cloud within each voxel grid $U_i$ to determine whether it corresponds to an obstacle grid:
$$\frac{N_{FP}}{N_{BP}} > \varepsilon_\rho \tag{27}$$
$$\max\left( Z_Q^{Voxel} - Z_P^{Voxel} \right) > \mu \tag{28}$$
Here, $N_{FP}$ and $N_{BP}$ represent the respective counts of points belonging to the FP and BP within $U_i$, and $\varepsilon_\rho$ is a threshold value correlated with the point cloud density ρ. Meanwhile, $Z_Q^{Voxel}$ and $Z_P^{Voxel}$ denote the heights of the points attributed to the BP and FP point clouds in $U_i$, and μ is the height threshold for obstacle detection set by the experimenter.
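A sketch of the voxel test of Formulas (26)-(28) is shown below, using a Python dictionary keyed by voxel index in place of the Octree-accelerated lookup; the parameter names eps_rho and mu and the handling of empty background voxels are our own assumptions:

```python
import numpy as np
from collections import defaultdict

def detect_obstacle_voxels(fp, bp, voxel_size, eps_rho, mu):
    """Flag voxels where FP points outnumber BP points (Eq. 27) and rise
    above the background surface by more than mu (Eq. 28)."""
    origin = np.minimum(fp.min(0), bp.min(0))
    index = lambda pts: ((pts - origin) // voxel_size).astype(int)   # Eq. (26)
    cells = defaultdict(lambda: {"fp": [], "bp": []})
    for key, z in zip(map(tuple, index(fp)), fp[:, 2]):
        cells[key]["fp"].append(z)
    for key, z in zip(map(tuple, index(bp)), bp[:, 2]):
        cells[key]["bp"].append(z)
    obstacles = []
    for key, c in cells.items():
        if not c["fp"]:
            continue
        # Eq. (27): relative point count; an empty BP voxel counts as one point
        if len(c["fp"]) / max(len(c["bp"]), 1) > eps_rho:
            bp_top = max(c["bp"]) if c["bp"] else -np.inf  # no background surface
            if max(c["fp"]) - bp_top > mu:                 # Eq. (28)
                obstacles.append(key)
    return obstacles
```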

4. Experiment

The configuration of the system running our algorithm is as follows: an AMD Ryzen 5 5600G with Radeon Graphics, 32 GB of RAM, and an NVIDIA GeForce RTX 3060 Ti.

4.1. Point Cloud Coordinate Correction

In our experiment on precise error coordinate correction, we opted to employ the system operating at a frequency of 50 Hz and an angular resolution of 0.33° for data acquisition and processing. By carefully adjusting the vertical angular resolution, we examined the accuracy of coordinate correction across various angular resolutions and scanning distances.
To calculate the angle compensation for each pulse point, we utilized Equations (5) and (6). We also measured the error between the corrected and original coordinates using the Euclidean distance $R_{Euclidean}$, which quantifies the discrepancy between the two points:
$$R_{Euclidean} = \sqrt{ (\hat{x} - x)^2 + (\hat{y} - y)^2 + (\hat{z} - z)^2 } \tag{29}$$
In Figure 9a, we gathered distance measurements at various horizontal positions and elevation angles while maintaining a constant resolution. The point order denotes the sequential order of pulses in the horizontal direction, and the pulse distance represents the detection distance, allowing us to quantitatively observe the extent of error correction at each detection distance. The maximum distance captured in this instance was 9.5 m. In Figure 9b, we used Formulas (4)-(6) to rectify the error of each pulse. Notably, the compensation becomes more prominent as the elevation and lateral angles increase. The maximum effective compensation occurred at a lateral angle of 138° and an elevation angle of 58.26°, yielding a correction of approximately 27 mm; even the smallest error in the data was compensated by 10 mm. Figure 9c shows the linear correlation between different elevation angle resolutions and their compensatory effects at a constant horizontal angle resolution. In Figure 9d, we ran the control system at different elevation scan speeds to acquire data at a fixed position (30°) and applied coordinate correction at the various elevation angle resolutions. This implies that different scanning speeds produce different degrees of compensation near the same scan line position; specifically, higher scanning speeds result in larger compensation.
The result of the quantified correction effect demonstrates a noteworthy enhancement in the calibration of three-dimensional coordinates for the measured points, validating the effectiveness of our correction strategy.

4.2. RS-Lo-RANSAC

We collected five scenarios from our constructed experimental site to enable ground point cloud segmentation (Figure 10). During the evaluation of our proposed algorithm, we focused on three critical evaluation criteria:
(a) The level of visualization achieved in extracting the track regions of the point cloud.
(b) The computational efficiency of the algorithm, measured by its execution time.
(c) The accuracy of ground segmentation, assessed by calculating the Root Mean Square Error (RMSE).
The RMSE for the algorithm is calculated as follows:
$$RMSE = \sqrt{ \frac{ \sum_{i=1}^{n} d_i^2 }{ n } } \tag{30}$$
Here, d i represents the vertical distance from each point in the sample to the plane model, and n represents the number of all points in the sample. Lower RMSE values generally indicate better algorithm results.
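Equation (30), for a plane with unit normal n and offset d, is a one-liner in NumPy (the distance d_i here being the perpendicular point-to-plane distance):

```python
import numpy as np

def plane_rmse(points, normal, d):
    """RMSE of Eq. (30): distances d_i from each point to the plane normal . x = d."""
    dist = points @ normal - d          # signed distances, assuming a unit normal
    return np.sqrt((dist ** 2).mean())
```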
We tested three algorithms: traditional RANSAC, Lo-RANSAC, and our proposed algorithm. With ϵ (distance threshold) set to 80 and $k_{max}$ (maximum number of iterations) set to 1000, the traditional RANSAC algorithm ran faster than both Lo-RANSAC and our algorithm, as shown in Figure 11. However, our algorithm, an improvement on Lo-RANSAC, showed a clear gain in execution speed over Lo-RANSAC itself.
To further verify the stability of the algorithms for ground segmentation in practical complex situations, we introduced an "interference plane" using moving trains (Figure 12a,e,i) and set three different parameters (ϵ = 80, 85, and 90; k = 2000; η = 0.05) in Scenes A, B, and C. The RMSE and time cost of the three algorithms are recorded in Table 1. With little interference from the "interference plane" (Scenes A and B), the traditional RANSAC algorithm outperforms the other two algorithms in RMSE and time, while our RS-Lo-RANSAC algorithm consistently achieves better results than Lo-RANSAC. However, in the presence of a severe interference plane (Figure 12i), RANSAC fails to segment the ground (Figure 12j), while the other two algorithms complete the predetermined tasks. Overall, the RS-Lo-RANSAC algorithm performs more stably across situations, indicating that our strategy is suitable for segmentation tasks in large-scale point clouds. The RANSAC segmentation failure may stem from the limit on iterations and the uncertainty of non-target planar noise, which affect the algorithm's convergence. Meanwhile, our algorithm improved the RMSE compared to Lo-RANSAC in the majority of samples and reduced the runtime. Comparing the results in Table 1, our algorithm showed significant improvements in robustness over the RANSAC and Lo-RANSAC algorithms, particularly in specific scenarios. This can be attributed to our sample selection strategy, which enhanced the algorithm's ability to handle non-planar noise and its overall robustness.
By incorporating the proposed RS-Lo-RANSAC algorithm steps, we successfully improved upon the Lo-RANSAC algorithm while optimizing its execution speed. In our experiments, our algorithm consistently outperformed the traditional RANSAC and Lo-RANSAC algorithms regarding noise resistance in most scenarios. This enhanced robustness allowed us to obtain superior point cloud data in the track area of real-world scenes, significantly reducing the workload and complexity of subsequent algorithmic tasks (for more test data, refer to Table 2).

4.3. Improved Robust ICP

Regarding the proposed improved Robust ICP algorithm, a subset of test scenarios was randomly selected to evaluate its performance based on the following metrics (the visualization process is shown in Figure 13):
(a) Convergence ability, measured by the variation of the Gaussian weights ω during the correspondence step. This variation can be expressed as follows:
$$E_{conv} = 1 - \exp\left( -\frac{ \left\| q_i - \hat{q}_i(p_i) \right\|^2 }{ 2\nu^2 } \right) \tag{31}$$
(b) Mean Square Error (MSE) at each iteration:
$$MSE = \frac{1}{n} \sum_{i=1}^{n} \left\| q_i - \hat{q}_i(p_i) \right\|^2 \tag{32}$$
(c) The runtime t of the algorithm.
(d) The interval $d_{corr}$ between fixed marker points in the foreground and background point clouds, which indicates the registration quality; the smaller $d_{corr}$ is, the better the registration.
The results showed that our algorithm effectively controlled the number of iterations during the initial round of updating the Gaussian weights in the test scenarios, significantly reducing runtime. We observed that the strategy of Zhang et al. [33], applied to point cloud scenarios with significant density variations as shown in Figure 14, converged slowly in the first iteration; the excess iterations had minimal impact on reducing the MSE and led to over-registration, decreasing the algorithm's efficiency. As Figure 15a,b demonstrate, our proposed strategy effectively mitigated this phenomenon. Moreover, while keeping the MSE nearly the same or slightly improved, our algorithm improved the runtime, with a maximum improvement of 1.9 s in specific scenarios (Figure 15c). Furthermore, although traditional ICP ran faster on most samples, it was unstable in some instances. Figure 15d compares our algorithm with Robust ICP in these five scenarios, where the MSE was nearly equal; notably, our algorithm further reduced the registration time of the samples.
Furthermore, we collected a frame each of background and foreground point clouds and performed ground segmentation through the aforementioned steps (Figure 16a). We marked a pair of labeled points in the foreground and background point clouds and then registered the clouds using ICP, GICP (Generalized ICP) [39], NDT (Normal Distribution Transform) [40], RICP (Robust ICP) [33], and our registration algorithm, recording $d_{corr}$. Our strategy achieved the best value of 3.342 cm, indicating better registration performance on this sample. We also report the registered MSE and time cost in Table 3: our algorithm achieved significant advantages in both, especially runtime, requiring only 0.38 s, the best performance among all tested algorithms.

4.4. VFOR

To further illustrate the quantitative impact of errors in the key steps of our algorithm on the experimental results, we used the RMSE of the RS-Lo-RANSAC algorithm, RMSE_RS-Lo-RANSAC, as the benchmark and calculated the error ΔRMSE_Lo-RANSAC = RMSE_Lo-RANSAC − RMSE_RS-Lo-RANSAC of the Lo-RANSAC algorithm and the error ΔRMSE_RANSAC = RMSE_RANSAC − RMSE_RS-Lo-RANSAC of the RANSAC algorithm. Next, we used the d_corr^ours of our improved Robust ICP algorithm as the benchmark for the registration process and calculated Δd_corr^ICP = d_corr^ICP − d_corr^ours for ICP, Δd_corr^GICP = d_corr^GICP − d_corr^ours for GICP, Δd_corr^NDT = d_corr^NDT − d_corr^ours for NDT, and Δd_corr^RICP = d_corr^RICP − d_corr^ours for RICP. The results of each algorithm combination were then calibrated by combining these errors. In Figure 17, the red boxes mark the sample obstacles we placed for identification, the yellow boxes mark noise clusters that were incorrectly identified, and the blue boxes mark unrecognized obstacles.
In Figure 17, we annotated the 15 boxes that should theoretically be detected in the original scene; the results of each algorithm combination are also shown in Figure 17. It is worth noting that, because the ground segmentation algorithm influences the registration result, the registration results d_corr obtained with RANSAC and Lo-RANSAC (33.47 mm and 33.50 mm) are slightly larger than the 33.42 mm obtained with our RS-Lo-RANSAC algorithm. In the RS-Lo-RANSAC results, we marked the track point cloud after ground segmentation in green. Evidently, no noise clusters (yellow boxes) appear after the RS-Lo-RANSAC algorithm segments the ground, which reflects the impact of the ΔRMSE error on the results: a correct ground segmentation strategy yields a smaller ΔRMSE and reduces the probability of noise clusters appearing. Different registration algorithms, however, have a greater impact on missed obstacle detections (blue boxes). When the ground is correctly segmented by RS-Lo-RANSAC, a smaller registration error Δd_corr leads to a higher detection rate (fewer blue boxes), which quantitatively links the registration error to our results. Based on our current experience, to limit the impact of errors on the results at each step, ΔRMSE should be controlled within ±3 mm and Δd_corr within ±0.05 mm (empirical parameters).
Subsequently, we conducted more extensive testing. We evaluated the algorithm’s ability to recognize obstacles such as people, trains, and cardboard boxes at random locations adjacent to and on the tracks. The algorithm workflow is outlined as follows.
We used Table 4, which contains the various types of obstacles in our test dataset, to demonstrate the algorithm's recognition performance for these obstacles. We conducted 100 tests for each obstacle at each position; the percentage indicates how often the obstacle was successfully detected. If the intruding point cloud cluster is large enough to affect the RS-Lo-RANSAC and improved Robust ICP steps, we adopt the most effective track extraction model M* and mapping matrix T* from previous continuous runs. The experimental results suggest that our algorithm handles point cloud processing well, even when dealing with larger obstacles. For comparatively smaller obstacles, it achieves a detection range of approximately 50 m for threshold obstacles measuring 15 cm × 15 cm × 15 cm. Figure 18 shows the final visualization result of our algorithm.

5. Conclusions

Ensuring the safety of railway tracks against obstacles is a paramount task. In this study, we have devised a specialized 3D mechanical lidar system explicitly designed for obstacle detection. To reconstruct objects’ 3D information precisely, we meticulously quantified and calibrated the point cloud coordinate transformation relationship through thorough analysis.
Once we obtained high-precision point cloud data of the scene, we developed a comprehensive set of obstacle detection algorithms encompassing the following key steps:
(a) Ground filtering and track extraction based on RS-Lo-RANSAC;
(b) Fusion of the BP and the FP using the improved Robust ICP;
(c) Obstacle recognition based on voxel features extracted from the fused point cloud map.
We have optimized each algorithm step for robustness and speed, resulting in stable testing in our experimental scenarios. To evaluate the effectiveness of our system and algorithm, we conducted tests on obstacle size and distance within our constructed test site. The results demonstrated the exceptional performance of our algorithm in detecting intruding obstacles such as trains, pedestrians, and cardboard boxes under various test scenarios.
According to the current results, the algorithm may be disturbed under continuous rainfall, and we will strive to address this limitation. As part of our future work, we will study the algorithm's robustness to perturbations and assess its effectiveness in adverse weather conditions such as rain, snow, and fog. Meanwhile, because our algorithm is built around large-scale ground segmentation, it is suited to processing static point clouds containing large-scale ground. We will further extend the algorithm to more diverse scenarios, such as key highways, railway intersections, tunnels, and flat aircraft runway datasets.

Author Contributions

Conceptualization, Z.N. and G.Z.; methodology, Z.N.; writing—original draft preparation, X.Z.; writing—review and editing, X.Z. and X.L.; supervision, X.L.; funding acquisition, Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Natural Science Foundation of China (No. 62275244); the CAS Project for Young Scientists in Basic Research (Grant No. YSBR-065); the National Natural Science Foundation of China (Nos. 62225507, U2033211, 62175230, and 62175232); the Scientific Instrument Developing Project of the Chinese Academy of Sciences (Grant No. YJKYYQ20200001); the National Key R&D Program of China (Nos. 2022YFB3607800, 2022YFB3605800, and 2022YFB4601501); and the Key Program of the Chinese Academy of Sciences (ZDBS-ZRKJZ-TLC018).

Data Availability Statement

The authors confirm that the data supporting the findings of this study are available within the article.

Conflicts of Interest

Author Xu Zhang was employed by the company Shenghong (Taizhou) Laser Technology Co. Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Ye, T.; Zhang, X.; Zhang, Y.; Liu, J. Railway Traffic Object Detection Using Differential Feature Fusion Convolution Neural Network. IEEE Trans. Intell. Transp. Syst. 2021, 22, 1375–1387. [Google Scholar] [CrossRef]
  2. Pan, H.; Li, Y.; Wang, H.; Tian, X. Railway Obstacle Intrusion Detection Based on Convolution Neural Network Multitask Learning. Electronics 2022, 11, 2697. [Google Scholar] [CrossRef]
  3. Kapoor, R.; Goel, R.; Sharma, A. An intelligent railway surveillance framework based on recognition of object and railway track using deep learning. Multimed. Tools Appl. 2022, 81, 21083–21109. [Google Scholar] [CrossRef] [PubMed]
  4. Ristic-Durrant, D.; Franke, M.; Michels, K. A Review of Vision-Based On-Board Obstacle Detection and Distance Estimation in Railways. Sensors 2021, 21, 3452. [Google Scholar] [CrossRef] [PubMed]
  5. Dodge, D.; Yilmaz, M. Convex Vision-Based Negative Obstacle Detection Framework for Autonomous Vehicles. IEEE Trans. Intell. Veh. 2023, 8, 778–789. [Google Scholar] [CrossRef]
  6. Perić, S.; Milojković, M.; Stan, S.-D.; Banić, M.; Antić, D. Dealing with Low Quality Images in Railway Obstacle Detection System. Appl. Sci. 2022, 12, 3041. [Google Scholar] [CrossRef]
  7. Kyatsandra, A.K.; Saket, R.K.; Kumar, S.; Sarita, K.; Vardhan, A.S.S.; Vardhan, A.S.S. Development of TRINETRA: A Sensor Based Vision Enhancement System for Obstacle Detection on Railway Tracks. IEEE Sens. J. 2022, 22, 3147–3156. [Google Scholar] [CrossRef]
  8. Shao, W.; Zhang, H.; Wu, Y.; Sheng, N.; Zhou, S. Application of fusion 2D lidar and binocular vision in robot locating obstacles. J. Intell. Fuzzy Syst. 2021, 41, 4387–4394. [Google Scholar] [CrossRef]
  9. Miao, Y.; Tang, Y.; Alzahrani, B.A.; Barnawi, A.; Alafif, T.; Hu, L. Airborne Lidar Assisted Obstacle Recognition and Intrusion Detection Towards Unmanned Aerial Vehicle: Architecture, Modeling and Evaluation. IEEE Trans. Intell. Transp. Syst. 2021, 22, 4531–4540. [Google Scholar] [CrossRef]
  10. Wang, D.; Watkins, C.; Xie, H. MEMS Mirrors for Lidar: A review. Micromachines 2020, 11, 456. [Google Scholar] [CrossRef]
  11. Zheng, L.; Zhang, P.; Tan, J.; Li, F. The Obstacle Detection Method of UAV Based on 2D Lidar. IEEE Access 2019, 7, 163437–163448. [Google Scholar] [CrossRef]
  12. Amaral, V.; Marques, F.; Lourenço, A.; Barata, J.; Santana, P. Laser-Based Obstacle Detection at Railway Level Crossings. J. Sens. 2016, 2016, 1719230. [Google Scholar] [CrossRef]
  13. Xie, D.; Xu, Y.; Wang, R. Obstacle detection and tracking method for autonomous vehicle based on three-dimensional LiDAR. Int. J. Adv. Robot. Syst. 2019, 16, 1729881419831587. [Google Scholar] [CrossRef]
  14. Sun, Y.; Zuo, W.; Huang, H.; Cai, P.; Liu, M. PointMoSeg: Sparse Tensor-Based End-to-End Moving-Obstacle Segmentation in 3-D Lidar Point Clouds for Autonomous Driving. IEEE Robot. Autom. Lett. 2021, 6, 510–517. [Google Scholar] [CrossRef]
15. Gao, F.; Li, C.; Zhang, B. A Dynamic Clustering Algorithm for Lidar Obstacle Detection of Autonomous Driving System. IEEE Sens. J. 2021, 21, 25922–25930.
16. Anand, B.; Senapati, M.; Barsaiyan, V.; Rajalakshmi, P. LiDAR-INS/GNSS-Based Real-Time Ground Removal, Segmentation, and Georeferencing Framework for Smart Transportation. IEEE Trans. Instrum. Meas. 2021, 70, 1–11.
17. Fang, J.; Zhou, D.; Yan, F.; Zhao, T.; Zhang, F.; Ma, Y.; Wang, L.; Yang, R. Augmented Lidar Simulator for Autonomous Driving. IEEE Robot. Autom. Lett. 2020, 5, 1931–1938.
18. Chen, J.; Kira, Z.; Cho, Y.K. LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation. IEEE Robot. Autom. Lett. 2021, 6, 2799–2806.
19. Ci, W.Y.; Xu, T.; Lin, R.Z.; Lu, S.; Wu, X.L.; Xuan, J.Y. A Novel Method for Obstacle Detection in Front of Vehicles Based on the Local Spatial Features of Point Cloud. Remote Sens. 2023, 15, 1044.
20. Li, J.; Li, R.; Wang, J.Z.; Yan, M. Obstacle information detection method based on multiframe three-dimensional lidar point cloud fusion. Opt. Eng. 2019, 58, 116102.
21. Fischler, M.A.; Bolles, R.C. Random sample consensus. Commun. ACM 1981, 24, 381–395.
22. Sangappa, H.K.; Ramakrishnan, K.R. A probabilistic analysis of a common RANSAC heuristic. Mach. Vis. Appl. 2018, 30, 71–89.
23. Chum, O.; Matas, J.; Kittler, J. Locally optimized RANSAC. In Proceedings of the Pattern Recognition: 25th DAGM Symposium, Magdeburg, Germany, 10–12 September 2003; pp. 236–243.
24. Huang, X.; Walker, I.; Birchfield, S. Occlusion-aware multi-view reconstruction of articulated objects for manipulation. Robot. Auton. Syst. 2014, 62, 497–505.
25. Jiang, S.; Jiang, W.; Li, L.; Wang, L.; Huang, W. Reliable and Efficient UAV Image Matching via Geometric Constraints Structured by Delaunay Triangulation. Remote Sens. 2020, 12, 3390.
26. Estépar, R.; Brun, A.; Westin, C.-F. Robust Generalized Total Least Squares Iterative Closest Point Registration. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2004: 7th International Conference, Saint-Malo, France, 26–29 September 2004; pp. 234–241.
27. Zhong, S.; Guo, M.; Lv, R.; Chen, J.; Xie, Z.; Liu, Z. A Robust Rigid Registration Framework of 3D Indoor Scene Point Clouds Based on RGB-D Information. Remote Sens. 2021, 13, 4755.
28. Senin, N.; Colosimo, B.M.; Pacella, M. Point set augmentation through fitting for enhanced ICP registration of point clouds in multisensor coordinate metrology. Robot. Comput.-Integr. Manuf. 2013, 29, 39–52.
29. He, Y.; Yang, J.; Hou, X.; Pang, S.; Chen, J. ICP registration with DCA descriptor for 3D point clouds. Opt. Express 2021, 29, 20423–20439.
30. Li, W.; Song, P. A modified ICP algorithm based on dynamic adjustment factor for registration of point cloud and CAD model. Pattern Recognit. Lett. 2015, 65, 88–94.
31. Chetverikov, D.; Stepanov, D.; Krsek, P. Robust Euclidean alignment of 3D point sets: The trimmed iterative closest point algorithm. Image Vis. Comput. 2005, 23, 299–309.
32. Bouaziz, S.; Tagliasacchi, A.; Pauly, M. Sparse Iterative Closest Point. Comput. Graph. Forum 2013, 32, 113–123.
33. Zhang, J.; Yao, Y.; Deng, B. Fast and Robust Iterative Closest Point. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 3450–3466.
34. Walker, H.F.; Ni, P. Anderson Acceleration for Fixed-Point Iterations. SIAM J. Numer. Anal. 2011, 49, 1715–1735.
35. Kong, S.; Shi, F.; Wang, C.; Xu, C. Point Cloud Generation from Multiple Angles of Voxel Grids. IEEE Access 2019, 7, 160436–160448.
36. Shi, G.; Wang, K.; Li, R.; Ma, C. Real-Time Point Cloud Object Detection via Voxel-Point Geometry Abstraction. IEEE Trans. Intell. Transp. Syst. 2023, 24, 5971–5982.
37. Tzamarias, D.E.O.; Chow, K.; Blanes, I.; Serra-Sagristà, J. Fast Run-Length Compression of Point Cloud Geometry. IEEE Trans. Image Process. 2022, 31, 4490–4501.
38. Koh, N.; Jayaraman, P.K.; Zheng, J. Truncated octree and its applications. Vis. Comput. 2021, 38, 1167–1179.
39. Young, M.; Pretty, C.; McCulloch, J.; Green, R. Sparse point cloud registration and aggregation with mesh-based generalized iterative closest point. J. Field Robot. 2021, 38, 1078–1091.
40. Das, A.; Waslander, S.L. Scan registration using segmented region growing NDT. Int. J. Robot. Res. 2014, 33, 1645–1663.
Figure 1. (a) The scanning of the rail by MEMS lidar. (b) The scanning of the rail by 3D mechanical lidar.
Figure 2. (a) The main hardware components of our 3D lidar system. (b) The equipment deployed in the experimental field; the site includes standard railway tracks.
Figure 3. (a) The scanning mechanism of 3D mechanical lidar. (b) Coordinate transformation relationship for the point cloud.
Figure 4. Under constant-speed scanning, the distribution of pulses on the ground is not strictly horizontal.
Figure 5. The flowchart of the whole obstacle detection algorithm.
Figure 6. Welsch’s function.
Figure 7. In large-scale scene applications, the variation in point cloud density is very significant.
Figure 8. We use Octree technology to voxelize the fused point cloud. Next, we cycle through each voxel grid U_i to determine whether the point cloud within that grid belongs to the obstacle category.
Figure 9. (a) Pulse distance R in different scanning lines (δ = 0.33°, Δω = 0.1°); (b) R_Euclidean at four random pitch angles using Formula (6); (c) trend plot of ω_ij variations for different values of Δω (δ = 0.33°, ω_j = 30°); (d) R_Euclidean for different values of Δω (δ = 0.33°, ω_j = 30°).
Figure 10. Five random scenes for ground segmentation: (a) person on the track (point cloud count n = 120 k); (b) train with significant planar interference (n = 120 k); (c) simple track background (n = 120 k); (d) small planar train at a distance (n = 120 k); (e) dense point cloud background (n = 180 k).
Figure 11. (a) RMSE-based performance of the three algorithms in scenes 1–5 of Figure 10; (b) runtime of the algorithms in the five scenes of Figure 10 (ϵ = 85; η = 0.05; m = 3; n = 5; k_initial = 1000).
Figure 12. We introduced different interference planes to test the segmentation performance of the algorithms and set their respective thresholds for each scenario. (a–d) show the original point cloud in Scene A and the results of the three segmentation algorithms; (e–h) show the original point cloud and the results of the three segmentation algorithms in Scene B; (i–l) show the corresponding results for Scene C.
Figure 13. (a) Background point cloud after ground segmentation with RS-Lo-RANSAC; (b) foreground point cloud after the same steps; (c) in the registration step, the improved Robust ICP algorithm registers and fuses the point clouds of (a,b).
Figure 14. (a) Convergence of energy during the correspondence step of Robust ICP; (b) trend of the MSE produced by Robust ICP.
Figure 15. (a) Convergence trend of energy during initial iterations under our convergence strategy; (b) trend of MSE during the first iteration process under our convergence strategy; (c) comparison of runtime between our algorithm and Robust ICP in five random scenarios (different colors represent five different testing scenarios, while the width on the vertical axis represents the time difference between the two algorithms); (d) registration time comparison among the traditional ICP algorithm, Robust ICP algorithm, and our algorithm in five random scenarios.
Figure 16. (a) A foreground point cloud and a background point cloud selected from one scene; (b–f) registration results of the ICP, GICP, NDT, and RICP algorithms and our algorithm. We then computed the d_corr of the corresponding point pairs.
Figure 17. The impact of the errors ΔRMSE and Δd_corr on our results. The red boxes in the Original Scene represent the 15 boxes we placed (15 cm × 15 cm × 15 cm). The RANSAC, Lo-RANSAC, and RS-Lo-RANSAC columns show the results of the corresponding registration algorithms after the respective segmentation algorithms. Yellow boxes mark noise clusters, and blue boxes mark undetected boxes. In the RS-Lo-RANSAC column, the point cloud of the track area after ground segmentation is marked in green.
Figure 18. Final visualization results of the algorithm. Each photo shows the actual scene, and the red point cloud inside the transparent cube is the obstacle point cloud detected by our algorithm. The obstacle types are manually labeled in red font: (a) stationary train (large obstacle); (b) human obstacle (1.7 m) located 15 m from the lidar system; (c) three cubic box obstacles (20 cm, 25 cm, and 30 cm edges) located 10 m from the system; (d) a single cubic box obstacle (20 cm edge) located 5 m from the system; (e) human obstacle (1.7 m) located alongside the track.
Table 1. The RMSE and time cost of the RANSAC, Lo-RANSAC, and RS-Lo-RANSAC algorithms under different threshold settings (ϵ = 80, 85, and 90) in Scenes A, B, and C, measuring the effectiveness of the three algorithms for ground segmentation when interference planes appear.
                 RANSAC               Lo-RANSAC             RS-Lo-RANSAC
                 RMSE (mm)  Time (s)  RMSE (mm)  Time (s)   RMSE (mm)  Time (s)
Scene A  ϵ = 80  23.4768    0.0115    23.5502    0.02186    23.5212    0.0207
         ϵ = 85  24.2188    0.0112    29.5557    0.02675    26.8629    0.0237
         ϵ = 90  25.0101    0.0108    30.2094    0.02251    25.4805    0.0211
Scene B  ϵ = 80  22.8415    0.0112    36.2378    0.0237     28.5658    0.0212
         ϵ = 85  23.7015    0.0133    37.8211    0.0283     27.0003    0.0266
         ϵ = 90  24.6473    0.0110    32.8467    0.0255     29.3878    0.0231
Scene C  ϵ = 80  45.4117    0.0124    26.5277    0.0347     28.0463    0.0314
         ϵ = 85  48.2419    0.0131    37.8211    0.00301    37.9345    0.0219
         ϵ = 90  51.1454    0.0143    39.2738    0.0308     39.2418    0.0254
Table 2. The change in RMSE of Lo-RANSAC and RS-Lo-RANSAC relative to traditional RANSAC for ground segmentation in various scenarios. The “RANSAC (mm)” row gives the RMSE obtained with the RANSAC algorithm, and the following two rows give the percentage change in RMSE for the Lo-RANSAC and RS-Lo-RANSAC algorithms, computed as $\frac{RMSE_{method} - RMSE_{RANSAC}}{RMSE_{RANSAC}} \times 100\%$ (negative values indicate improvement).
Method            1       2       3      4      5      6      7
RANSAC (mm)       26.22   36.47   29.35  46.25  26.09  26.44  28.57
Lo-RANSAC (%)     −7.28   −28.58  −6.71  3.89   −4.10  −6.77  −11.34
RS-Lo-RANSAC (%)  −14.26  −32.60  −8.72  −4.97  −5.67  −6.88  −10.12
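As a worked instance of the caption's formula, take Scene 1: the RS-Lo-RANSAC entry of −14.26% together with the RANSAC RMSE of 26.22 mm implies

```latex
RMSE_{RS\text{-}Lo\text{-}RANSAC} \approx 26.22\ \mathrm{mm} \times (1 - 0.1426) \approx 22.48\ \mathrm{mm}.
```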
Table 3. Statistical analysis of the MSE, time cost, and d_corr of samples after different registration strategies for the scene in Figure 16a.
Method  MSE (mm²)  Time Cost (s)  d_corr (cm)
ICP     395.87     11.56          3.407
GICP    383.11     1.02           3.502
NDT     383.83     0.75           3.398
RICP    144.53     5.82           3.378
Ours    171.12     0.38           3.342
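The d_corr column can be read as a mean correspondence distance after registration. The sketch below shows one plausible way to compute such a metric with a nearest-neighbor pairing between the registered foreground and the background cloud; it is an illustrative assumption, not the paper's exact definition.

```python
import numpy as np
from scipy.spatial import cKDTree

def mean_corr_distance(fp_registered, bp):
    """One plausible d_corr: mean distance from each registered
    foreground point to its nearest background neighbor, in the
    same units as the input coordinates."""
    dists, _ = cKDTree(bp).query(fp_registered)
    return float(np.mean(dists))
```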
Table 4. Summary of our algorithm’s detection capabilities for different obstacles in the experimental field.
Obstacles  Size (cm)         Detection Distance (m)
                             ±5     ±10    ±15    ±20    ±25
bin        35 × 20 × 18      100%   100%   100%   100%   100%
person     50 × 50 × 170     100%   100%   100%   100%   100%
train      500 × 250 × 350   100%   100%   100%   100%   100%
box        30 × 30 × 30      100%   100%   100%   100%   100%
box        20 × 20 × 20      100%   100%   100%   100%   100%
box        15 × 15 × 15      100%   100%   100%   96%    91%