Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data

Chen, Banqiao; Ding, Chibiao; Ren, Wenjuan; Xu, Guangluan

doi:10.3390/ijgi10030122

Open AccessArticle

Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data

¹

Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

²

Key Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China

³

School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100190, China

⁴

National Key Laboratory of Science and Technology on Microwave Imaging, Beijing 100190, China

^*

Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2021, 10(3), 122; https://doi.org/10.3390/ijgi10030122

Submission received: 16 December 2020 / Revised: 6 February 2021 / Accepted: 24 February 2021 / Published: 1 March 2021

Download

Browse Figures

Versions Notes

Abstract

:

High-quality digital road maps are essential prerequisites of location-based services and smart city applications. The massive and accessible GPS trajectory data generated by mobile GPS devices provide a new means through which to generate maps. However, due to the low sampling rate and multi-level disparity problems, automatically generating road maps is challenging and the generated maps cannot yet meet commercial requirements. In this paper, we present a GPS trajectory data-based road tracking algorithm, including an active contour-based road centerline refinement algorithm as the necessary post-processing. First, the low-frequency trajectory data were transferred into a density estimation map representing the roads through a kernel density estimator, for a seeding algorithm to automatically generate the initial points of the road-tracking algorithm. Then, we present a template-matching-based road-direction extraction algorithm for the road trackers to conduct simple correction, based on local density information. Last, we present an active contour-based road centerline refinement algorithm, considering both the geometric information of roads and density information. The generated road map was quantitatively evaluated using maps offered by the OpenStreetMap. Compared to other methods, our approach could produce a higher quality map with fewer zig-zag roads, and therefore more accurately represents reality.

Keywords:

road network extraction; map generation; GPS trajectory; low-frequency trajectory data

1. Introduction

With the development of location-based services, digital road maps are increasingly needed, making road network extraction (RNE) an active research topic. Especially in dense metropolitan cities, people frequently access navigation, route planning, and real-time traffic services, increasing the need for accurate and up-to-date digital road maps. Compared to the time-consuming and labor-intensive conventional map survey, various RNE methods were developed based on remote sensing (RS) images [1,2], GPS trajectory [3,4], and light detection and ranging (LiDAR) point-cloud data [5], since the 1970s. These semi-automatic and automatic algorithms not only lower the cost but also shorten the time of producing and updating road maps. However, due to the inherent drawbacks of the input data, automatically generating high-quality road maps remains challenging [1,3].

Among the existing methods, extracting road networks from RS images is rather mature, and various methods were developed, such as classification-based [6], mathematic morphology [7], road tracking [8,9,10], and active contour [11]. However, these methods are susceptible to the occlusion and shadow in RS images, therefore discontinuous road segments are easy to appear. At present, semi-automatic methods, which require human input, generally perform better than automatic methods.

Road extraction from GPS trajectory data started in 1990; since then, it underwent two stages—map augmentation and map generation. Map augmentation requires an existing map as prior information; conversely, map generation constructs a road map with a topological structure from an empty map. Based on the widely deployed GPS sensors, algorithms can use the collective mobility pattern behind trajectory data to extract semantic information, such as turning restrictions and obtaining GPS trajectory data [12], which is much cheaper compared to RS images or LiDAR data. Given the volume of GPS trajectory data, most of these methods are automatic.

Existing GPS trajectory-based methods can be roughly categorized as follows—clustering-based [13,14,15], kernel density estimation (KDE) [16,17,18], and incremental track insertion [19,20]. These methods face challenges caused by (1) high GPS noise, (2) low frequency, and (3) multi-level disparity problems, which are ubiquitous among easily accessed GPS trajectory data. However, high-quality trajectory data are hard to obtain and covers fewer roads.

To overcome these issues, some methods adopt an intersection-first approach in constructing road maps [14,18]. As intersections are defined as the convergence of three or more road branches, they possess many features that can be detected with a designed algorithm. Detected intersections can serve as valuable information for identifying the road segments connecting them. There has been a recent surge in road intersection detection methods [18,21], which provide a better foundation for this approach. However, few GPS trajectory data-based methods can fully use known intersections, whereas RS-images-based road tracking is mostly driven by points like intersection-centers falling within the region of the road surface.

Therefore, we propose a GPS trajectory data-based road-tracking algorithm, including an active contour-based road centerline refinement algorithm, to extract smoothed and interconnected road centerlines from low-frequency trajectories. By adopting an automatic seeding technique, this map generation method is automatic and thus capable of handling massive trajectory data. The major contributions of our method include.

(1) We enrich the intersection-first approach in extracting road network from GPS trajectory data, making road tracking an available tool to process low-frequency trajectories, which can make better use of known intersections.

(2) We propose a template-matching-based seeding algorithm to automatically provide starting points for the road-tracking algorithm, using the density information of GPS points. To overcome the disparity problem between the heat zone of intersection region and road branches, a circular sampling approach is introduced.

(3) We propose a template-matching-based road direction extraction algorithm to perform correction when tracking road centerlines, which adapts the road-tracking algorithm to GPS trajectory data. To avoid the high GPS noise problem, the real-time correction is simple but robust.

(4) We propose an active contour-based road centerline refinement algorithm to conduct further correction of extracted road centerlines. We include road orientation restrictions in the active contour model to make better use of intersections. The extended post-processing stage makes use of geometric properties of roads, providing a general solution to zig-zag roads.

Our paper is organized as follows. Section 2 reviews the related work on extracting road networks from GPS trajectory data and RS images. Section 3 describes the proposed method for tracking road centerlines. Section 4 presents a set of experimental results and analyses. Finally, Section 5 discusses the conclusions.

2. Related Work

Since the input data are heterogenous, road network extraction methods based on GPS trajectory data are very different from methods based on RS images. Typically, the output road map is represented by a simple graph

G (V, E)

, where

V

is a set of nodes representing the terminal points of roads, and

E

is a set of edges denoting roads.

First, we introduce the GPS trajectory data-based method. As its input, GPS trajectory data consist of positions of a group of mobile targets across a span of time, sometimes including speed and heading direction. Existing GPS trajectory-based methods can be roughly divided into three categories—(1) clustering-based, (2) KDE, and (3) incremental track insertion.

Clustering-based methods first extract the basic node or edge elements of a road map using a clustering algorithm such as K-means. Then, the road map is constructed by connecting these extracted elements. Edelkamp and Schrödl [13] first extracted a set of seed points spread along the road centerlines, by employing the K-means algorithm to cluster the input GPS points. Then, the adjacent extracted points are connected, based on the connectivity of the trajectories. Karagiorgou [14] first extracted road intersections by employing an agglomerate clustering algorithm to the turn-points, which were defined as where the heading direction changed dramatically. Chen [15] first extracted road segments by employing a graph-based clustering algorithm to a sampling set of input points; then, the intersections were detected by training a support vector machine (SVM) classifier based on their proposed traj-SIFT feature. In conclusion, clustering-based methods generally use the trajectory motion information and therefore suffer more loss from low frequencies.

KDE methods begin with transforming GPS points into a density map through a kernel density estimator, with Gaussian kernel. In most cases, this process occurs in conjunction with discretizing the plane space into the grid of pixels; therefore, image-processing algorithms can be applied to that density map regarding it as a gray-scale image. Biagioni and Eriksson [16] applied an image thinning algorithm on a binarized density map to produce a set of road skeletons using various binary thresholds. These skeletons are merged into a gray-scale skeleton, where an edge’s confidence of being an actual road is represented by its gray scale. Wang [17] proposed a Morse-theory-based road centerlines extraction method, which can capture both heavily- and sparsely-sampled roads with respect to local density. Both methods can produce road maps with different levels of coverage by tuning the parameters and adopting a post-processing approach to better remove the spurious roads. In conclusion, KDE methods are robust to low frequency, but tend to generate zig-zag roads and spurious roads, which requires an effective map refinement algorithm as post-processing.

Incremental track insertion methods construct road maps by incrementally inserting trajectories into an existing map with the help of a map-matching algorithm. Cao and Krumm [18] first grouped similar input trajectories by simulating the physical attraction between them, then incrementally inserted every trajectory by checking whether its node should merge with an existing graph node. Ahmed and Wenk [19] employed Fréchet distance, allowing a trajectory to be partially matched to the map. Different from the first two categories, incremental track insertion methods support online updating, but require high-quality input trajectory data.

Two of the main challenges in extracting road networks from GPS trajectory data are the low frequency and the multi-level disparity of trajectory data; both are ubiquitous among the feasible trajectory data. First, low sampling rate (average sampling interval over 20 s) greatly increases the uncertainty of trajectory data, posing a serious limitation to clustering-based and incremental track insertion methods. Second, the disparity impacts different levels of the extracted road network. At the road level, the staying points of trajectories lead to abnormal local heat zone and distorted road segments. At the network level, the sheer traffic volume difference between different roads makes algorithm tend to mistake roads being travelled by very few trajectories as spurious roads. Finding a global threshold to separate false clusters or roads from correct ones is infeasible.

To address these challenges, researchers are increasingly focusing on the intersection-first approach. Following this approach, road network extraction is divided into two steps—road intersection detection and intersection-based map construction. The difficulty in each step is reduced by adopting a divide-and-conquer strategy. Karagiorgou [14] first proposed a turn-points clustering-based road intersection detection method; Deng [21] introduced a local

G^{*}

statistic in clustering-based detection; Zhang [18] proposed a KDE-based detection method, considering the intersection’s heat zone feature. The goal of intersection detection is to achieve both good coverage and precision, under the low-frequency input data.

However, in map construction, most of the existing GPS trajectory-data-based methods only use the intersection’s position; information such as orientations of its connected roads are neglected. Therefore, we combined suitable RS images-based road network extraction methods with this intersection-first approach to more effectively use the known intersections. Here, we focused on briefly introducing the road tracking and active contour methods; both accept a set of seed points as the input, and present roads with a similar parameter model.

Road tracking methods trace the road centerlines by repeating two key steps—prediction and correction. Until a stopping criterion is satisfied, the road tracker traces a single road starting from an initial seed point, which is provided manually or by an automatic seeding algorithm. At the prediction step, an algorithm predicts the next point most likely on the road centerline given the current state of the road tracker. At the correction step, the prediction is corrected by considering its nearby image information, then the state of the road tracker is also updated. Vosselman [8] applied Kalman filter to model the tracking process as a linear dynamic system, and a least squares profile matching was chosen to provide measurement input to the Kalman filter at the correction step. Movaghati [10] combined the extended Kalman filter (EKF) with a particle filter (PF) to find potential new road branches after the road tracker reached its stopping criterion. Road tracking methods are generally sensitive to initial seed points, and face challenge in overcoming intersections, abrupt changes in road direction, and obstructions during the tracing process.

Last, we introduce the active contour method—it defines an energy function on a deformable line, which recursively fits a deformable line to a contour or centerline of a road in the direction of minimal energy, starting from an initialization position. This model was first proposed by Kass [22]; also known as snake, its energy function consists of both internal and image forces. Internal force (elasticity and stiffness) models the geometric features of a road, whereas image force (intensity gradient) represents certain features of interest in the image. Fua and Leclerc [23] proposed the ribbon snakes model to include a width parameter in the image force. By tuning the ratio between internal force and image force, active contour methods can avoid the overfitting problem when extracting roads from an image, which often yields zig-zag roads. As a result, it can also be used as a geometric refinement algorithm for road networks, given background image information.

In conclusion, Figure 1 outlines the rough categories of GPS trajectory-data-based and RS-images-based road network extraction methods. Road tracking methods can also benefit from adopting the intersection-first approach—if intersections and their connected road branches can be detected with high precision and high coverage, the difficulty of road tracking would be significantly reduced. However, due to GPS noise and multi-level disparity problems, the RS images-based methods cannot be directly applied to the density estimation map of GPS points.

Our method falls into the intersection-first approach category. Based on previous road intersection detection works, we present a combination of KDE and road tracking methods to overcome low-frequency and multi-level disparity of GPS trajectory data in road network extraction. First, we propose a density-based seeding algorithm and a road direction extraction algorithm, both based on template matching, to adapt road tracking to the density estimation map. We adopt a simple real-time correction in this tracking process to avoid tackling drastic GPS noise with limited local information. Then, we apply an active contour-based refinement algorithm to the extracted road network to recursively redo the correction, considering both the geometric information of an entire road and the density information. We emphasize that the impact of disparity is many-fold; by adopting a divide-and-conquer strategy, this problem can be easier solved at the local level.

3. Tracking Road Centerlines from Low-Frequency Trajectory Data

Our road network extraction method starts with a set of known intersection positions, which can be automatically provided by previous road intersection detection works [18,21] or can be input manually. The proposed method consists of three key steps for extracting road centerlines from low-frequency GPS trajectories—(1) generating seed points that exclusively define one end of a road; (2) tracking road centerlines with simple correction; and (3) refining road centerlines based on the active contour model. The workflow of our road centerline tracking method is shown in Figure 2.

3.1. Automatic Seeding Based on Kernel Density Estimation Map

Seed points are the prerequisite of road-tracking algorithms. In our method, a seed point is defined as

(x, y, θ)

, where

(x, y) \in R^{2}

is the start point of road and

θ \in [0, 2 π]

is the orientation of road at that point (anti-clockwise, equals 0 when heading toward Earth’s true east). Every seed point is derived from an existing intersection, and exclusively defines one end of a road.

Given a known intersection’s center positio

I_{i} (x_{i}, y_{i})

, the possibility of having a road branch with orientation

θ

is quantitively evaluated by matching the corresponding slice image with a set of ideal road template images. Least squares correlation matching is applied, along with a circular sampling approach, tackling the disparity problem of GPS trajectories. Based on the minimum-matching-distance criterion, multiple seed points

{(x_{i}, y_{i}, θ_{i, j})}

can be drawn, and then be examined by the seeding algorithm. Here we use a point-based intersection model, which can cover a majority of intersections with cross-shape, T-shape, and Y-shape.

3.1.1. Density Estimation

Based on the well-known KDE algorithm [24], the sparsely sampled GPS points can be transformed into a consecutive density function

ρ (x, y)

on a rectangular domain

R \subset R^{2}

, which is the area of interest for road network extraction.

Since using a consecutive density function is calculation-intensive, a common practice is discretizing

R

into a grid

G_{R}

with a cell-length

c

, and then only computing

ρ (x, y)

at the grid center points. Therefore, the density estimation result can be viewed as a gray-scale image

D (i, j)

, where

D (i, j)

stands for the density at grid

i, j

.

Let

P = {{\vec{p}}_{1}, \dots, {\vec{p}}_{n}}

be the collection of all GPS points. First, a 2D histogram

H (i, j)

is produced by counting the number of GPS points falling within each grid. Then, the density image

D (i, j)

is computed as:

D (i, j) ∶ = \frac{1}{n} H (i, j) \otimes K_{σ} (u, v), K_{σ} (u, v) = \frac{1}{\sqrt{2 π σ^{2}}} \exp {- \frac{(u^{2} + v^{2}) c^{2}}{2 σ^{2}}}

(1)

where

K_{σ} (u, v)

is the popular Gaussian kernel and has a bandwidth parameter

σ

. Following a previous work [16], we used

σ = 8.5 m

.

Notice that

K_{σ} (u, v) = K_{σ} (u, 0) \cdot K_{σ} (0, v)

. It is faster to compute Equation (1) in two rounds of 1D convolution. Last, we discard the marginal information of

K_{σ} (u, v)

, keeping

\sum_{| u |, | v | \leq M} K_{σ} (u, v) \geq 0.995

as the central information. These improvements support using a smaller grid cell-length

c

and conducting faster template-matching-based algorithms.

3.1.2. Template Matching

The previous section demonstrated a method to generate global density image from input GPS points. However, due to the GPS noise and the disparity problem [16], automatic seeding requires focusing on the local density image (slice image) to apply pre-processing, such as median filters and image binarization.

Given a position

(x, y)

, Figure 3 shows how a slice image with orientation

θ

is generated. Let

S (x, y, θ)

denote the slice image after rotation, for the convenience of matching with a set of fixed road template images. Its pixel value

s (i, j)

stands for the density at the recalculated grid center

{\vec{x}}_{i j}

, which is computed as:

{\vec{x}}_{i j} = (x + i c \cdot c o s θ + j c \cdot s i n θ, y + i c \cdot s i n θ - j c \cdot c o s θ), - \frac{N}{2} \leq i, j \leq \frac{N}{2}

(2)

Then, density value

s (i, j)

is re-estimated using Equation (1) and the nearby GPS points:

s (i, j) = \sum_{| u |, | v | \leq M} \frac{1}{n} H (i + u, j + v) \cdot K_{σ} (| | {\vec{x}}_{i j} - {\vec{g}}_{i + u, j + v} | |), - \frac{N}{2} \leq i, j \leq \frac{N}{2}

(3)

where

K_{σ} (d)

is the Gaussian kernel function.

N

is the image size satisfying

N \cdot c = 80

m, which means the slice image covers an area of 80

\times

80 m.

Next, we prepare a set of template images based on an ideal road model. A previous work [25] showed that a Gaussian mixture model (GMM) can model the spread of GPS points within a road, reflecting the number of lanes and their corresponding traffic volume in reality. Here, we use single Gaussian model as a simplification, as shown in Figure 4. Therefore, in a road template image, density of one pixel is solely determined by the distance between the pixel center and road centerline.

Let

T e m p (α, B)

denote a road template image with size

N \times N

pixels, its pixel value

t (i, j)

can be computed as:

t (i, j) = K_{τ} (- s i n α \cdot i + c o s α \cdot j - B), - \frac{N}{2} \leq i, j \leq \frac{N}{2},

(4)

where

α

is the orientation of road, and

B

is the bias parameter. The bandwidth

τ

was determined by

3 τ = N / 2

to make the template image cover the entire density descending course. By shifting

T e m p (α, 0) B

pixels’ width perpendicular to its road orientation, we obtain

T e m p (α, B)

. The sizes of the slice image and template image were set to the same value throughout our method.

If the slice image was close enough to a template image, one road was detected in that slice image. When the slice image deviated from the road centerline, the road could still be detected from a broader template set

{T e m p (α, B) | B = 0, \pm 1, \dots, \pm K}

. Since a traffic volume difference might exist between different roads, we could not directly compare pixel value

s (i, j)

with

t (i, j)

. Therefore, we applied median filter and binarization to both images as pre-processing, before the matching. Figure 5 shows that the median filter could effectively remove the GPS noise, then the traffic volume was normalized by binarization.

Last, the matching should be based on a distance measure between two images. We chose the squared distance for calculation convenience. Given an image

S

, let

\tilde{S}

denote the image after the above pre-processing. The matching distance between two images was defined as:

D i s (S, T) = \sum_{i, j} {(\tilde{S} (i, j) - \tilde{T} (i, j))}^{2}

(5)

The smaller the matching distance, the more similar were the two images.

3.1.3. Seeding Based on Template Matching and Circular Sampling

After these preparations, we can now quantitively evaluate the possibility of a road connected to an intersection from arbitrary directions. The search was performed by circular sampling to avoid different road branches, overlapping within the intersection region.

As shown in Figure 6, we categorized the slice image into three classes—(1) road segment, (2) near intersection, and (3) highly noisy area. Experiments showed that the template-matching algorithm worked for road segments and highly noisy areas, but suffered a significant drop in accuracy for the areas near intersections. Detailed experimental results are provided in Section 4.

Therefore, we proposed a circular sampling-based seeding algorithm, as shown in Figure 7, to avoid the drawback of the near-intersection areas. Given an intersection’s center position

(x, y)

, we used the slice image

S (x + r \cdot c o s θ, y + r \cdot s i n θ, θ)

to detect whether there exists a road branch with orientation

θ

. By matching

S

with a set of template images, a matching distance function

f_{x, y} (θ)

with the orientation variable was defined as:

f_{x, y} (θ) = \underset{| B | \leq μ N}{m i n} D i s (S (x + r \cdot c o s θ, y + r \cdot s i n θ, θ), T e m p (0, B)),

(6)

where

μ

represents how far a deviation we accept from the road centerline; here, we set

μ = 0.125

. Radius

r

was set to 60 m through a series of experiments. Intuitively, smaller

f_{x, y} (θ)

represents a higher chance of having a road branch with orientation

θ

.

Figure 8 shows that each road branch corresponds to a local minimum of

f_{x, y} (θ)

. As a result, by finding all local minimum points

{θ_{k}}

of

f_{x, y} (θ)

, the corresponding seed points

{(x, y, θ_{k})}

were generated. Since the number of intersections was limited, we adopted a simple enumerating solution in finding these local minimum points. We only computed

f_{x, y} (θ)

on a discretized set

{θ_{i} = 2 i π / n | i = 0, \dots, n},

an angle

θ_{i}

that satisfied

f_{x, y} (θ_{i}) = \min_{| θ_{j} - θ_{i} | \leq π / 12} f_{x, y} (θ_{j})

would be selected by our seeding algorithm.

However, as shown in Figure 9, the fluctuation of

f_{x, y} (θ)

caused by GPS noise led to many false seed points. By applying Gaussian smoothing on

f_{x, y} (θ)

, this problem could only be mitigated; therefore, we proposed density-based connectivity to verify the generated seed points. Given two planar points

\vec{P}, \vec{Q}

, let

c (\vec{P}, \vec{Q})

denote the connectivity between these points.

c (\vec{P}, \vec{Q}) = \int_{0}^{1} \frac{ρ (x_{t}, y_{t})}{\max_{0 \leq t \leq 1} ρ (x_{t}, y_{t})} d t, (x_{t}, y_{t}) = \vec{P} + t (\vec{Q} - \vec{P}) .

(7)

The proposed connectivity ranged from 0 to 1. Assuming

\vec{P}

is the intersection’s center position, when

\vec{P Q}

refers to a false road branch, the density value drops quickly along

\vec{P Q}

, which results in low connectivity. Figure 10 shows that true road branches tend to have larger integral area under curve

ρ (x_{t}, y_{t})

, therefore, a connectivity threshold can effectively separate the false road branches from the true ones.

Figure 10 also shows that, in most cases, the start point

\vec{P}

has a much higher density value for traffic from different road branches overlapping at the intersection, which results in a lower

c (\vec{P}, \vec{Q})

in general. Therefore, when verifying a seed point

(x, y, θ)

by computing density-based connectivity, the segment falling within the intersection region should be truncated, as follows:

c ({\vec{P}}_{l}, {\vec{P}}_{L}) > φ,

(8)

where

{\vec{P}}_{d} = (x, y) + d \cdot (c o s θ, s i n θ)

and

φ

is a predefined connectivity threshold. We set

φ = 0.1

through a series of experiments. The other two parameters

l = 30

m and

L = 60

m were set through visualization, as shown in Figure 11. Seed points that satisfied Equation (8) were considered true, and were fed to the road-tracking algorithm in the next step.

3.2. Tracking Road Centerlines Based on Kernel Density Estimation Map

After the seeding algorithm generated a set of seed points from the density map, the tracking algorithm set up multiple road trackers to incrementally generate estimation curves of road centerlines. Each tracker corresponded to a single seed point

(x, y, θ)

. The estimation curves were further refined in the next post-processing course.

Here, we used a simple graph

G (V, E)

to model the output road network, with known intersections as its initial vertex set

V

. For every road tracker, the prediction and correction two-step process was repeated—at the prediction step, the algorithm calculated the position of a new vertex based on the current state of the road tracker (using a predefined road model), and the successive positions formed the edges of the graph

G

. At the correction step, the algorithm measured the local density map around the newest position to refine the prediction and update the current state of the road tracker. As such, new edges were incrementally added to the shared graph

G

, until a stopping criterion was satisfied. Figure 12 shows the flow chart of our road-tracking algorithm.

3.2.1. Road Model

We first introduced a road model that was used by previous methods [10], which models one road as a series of states

{X_{0}, X_{1}, \dots, X_{n} | X_{k} = {[x_{k}, y_{k}, θ_{k}, {\dot{θ}}_{k}]}^{T}}

. Here, attribute

θ_{k}

represented the orientation of the road at position

(x_{k}, y_{k})

,

{\dot{θ}}_{k}

represented the curvature of road at

(x_{k}, y_{k})

, and

{(x_{k}, y_{k})}_{0 \leq k \leq n}

formed a polyline representing the road centerline. Figure 13 shows that this definition of road state contained the necessary information to predict the most likely next point on the road centerline, along with other crucial road attributes.

Previous methods also modeled the tracking process as a stochastic process [8,10]. As a result, the prediction step could be written in an equation form:

X_{k} = [\begin{matrix} x_{k} \\ y_{k} \\ θ_{k} \\ {\dot{θ}}_{k} \end{matrix}] = [\begin{matrix} x_{k - 1} + s \cdot c o s θ_{k - 1} \\ y_{k - 1} + s \cdot s i n θ_{k - 1} \\ θ_{k - 1} + s \cdot {\dot{θ}}_{k - 1} \\ {\dot{θ}}_{k - 1} \end{matrix}] + W_{k} = f (X_{k - 1}) + W_{k},

(9)

where

s

is the tracking step length parameter and

W_{k}

represents the random variation in the road state between successive positions. The next step was measuring the local density map to support the correction step, which is referred to as another equation called the measurement equation:

Z_{k} = h (X_{k - 1}) + V_{k},

(10)

where

V_{k}

represents the measurement noise. Assuming that both Equations (9) and (10) were linear and both

W_{k}

and

V_{k}

were Gaussian noise, the classical correction based on the Kalman filter could be derived [8].

However, since precisely modeling

W_{k}

and

V_{k}

was infeasible for GPS trajectory data under complex urban environments, we adopted a simple correction in the tracking process by replacing

W_{k}

and

V_{k}

with a zero matrix. The underlying assumption was that roads have a fixed curvature except for a few turn-points.

3.2.2. Road Orientation Extraction Based on Template Matching

To support the correction step, we measured two attributes

({\hat{θ}}_{k} and B_{k})

from the local density map of the predicted state

X_{k | k - 1} = f (X_{k - 1})

, based on the template matching previously introduced in Section 3.1.2. Attribute

{\hat{θ}}_{k}

represented the road orientation at the predicted point

(x_{k | k - 1}, y_{k | k - 1})

, and

B_{k}

represented the grid deviation from the road centerline at a predicted point.

Since the tracking process required excessive measurements, we introduced another matching mode requiring far less computation. Recall that in Section 3.1.3, generating a slice image

S (x + r \cdot c o s θ, y + r \cdot s i n θ, θ)

by redoing the density estimation placed a heavy computation burden on the seeding algorithm. Therefore, in this section, we only computed a slice image with fixed

θ = 0

, and let

T (Δ θ, μ)

denote the enlarged template image set, based on discretized orientations:

T (Δ θ, μ) ∶ = {T e m p (θ, B) | θ = k Δ θ, 0 \leq θ \leq 2 π, | B | \leq μ N} .

(11)

Using the same minimum-matching-distance criterion, the measurement result was obtained as:

\begin{array}{l} {\hat{θ}}_{k}, B_{k} & = \arg \min_{{\hat{θ}}_{k}, B_{k}} D i s (S (x_{k | k - 1}, y_{k | k - 1}, 0), T e m p ({\hat{θ}}_{k}, B_{k})) \\ = \arg \min_{{\hat{θ}}_{k}} \arg \min_{B_{k}} D i s (S (x_{k | k - 1}, y_{k | k - 1}, 0), T e m p ({\hat{θ}}_{k}, B_{k})) \end{array},

(12)

where

T e m p ({\hat{θ}}_{k}, B_{k})

belonged to the pre-generated set

T (Δ θ, μ)

. Here, we used

Δ θ = 0.25

° and

μ = 0.125

, which could satisfy the simple correction needed.

Figure 14 demonstrates the difference between the two matching modes. By allowing the slice image to have an arbitrary orientation, the algorithm could use more local density information, but the computation cost was not equivalent to the gain.

3.2.3. Road Tracking Algorithm

In this section, we combined the road model and the measuring algorithm to present our road-tracking algorithm. Starting with the density map

D (i, j)

and a set of seed points, this algorithm incrementally built a road network

G (V, E)

with a topological structure. The tracking algorithm consisted of the following five steps:

(1) Initialization: For every seed point

(x, y, θ)

, create a road tracker with an initial state

X_{0} : = {[x, y, θ, 0]}^{T}

, and let distinguished seed positions

{(x, y)}

be the initial vertex set of road network

G

.

(2) Prediction: For every road tracker, based on the current state

X_{k - 1}

and Equation (9), compute the predicted state

X_{k | k - 1} = f (X_{k - 1})

.

(3) Measurement: Based on the predicted state

X_{k | k - 1}

and Equation (12), obtain measurement result

{\hat{θ}}_{k}

and

B_{k}

from density map

D

.

(4) Correction: Based on the measurement result and the following equation, compute the corrected state

X_{k | k}

:

\begin{matrix} (x_{k | k}, y_{k | k}) = (x_{k | k - 1}, y_{k | k - 1}) - B_{k} c \cdot (s i n {\hat{θ}}_{k}, - c o s {\hat{θ}}_{k}) \\ θ_{k | k} = {\hat{θ}}_{k} \\ {\dot{θ}}_{k | k} = ({\hat{θ}}_{k} - θ_{k | k - 1}) / s \end{matrix},

(13)

where

c

is the grid cell length and

s

is the tracking step length.

(5) Stopping Criterion: Based on the current state

X_{k - 1}

and corrected state

X_{k | k},

examine two stopping criteria with their positions

{\vec{v}}_{k - 1}

and

{\vec{v}}_{k | k}

. First, examine the absorbing criterion using the extracted road network and the tracking step length:

\exists \vec{P Q} \in E, \underset{0 \leq t \leq 1}{m i n} | | {\vec{v}}_{k | k} - (\vec{P} + t (\vec{Q} - \vec{P})) | | < \frac{s}{2} .

(14)

If Equation (14) is satisfied, then

{\vec{v}}_{k | k}

is absorbed by the closer vertex

\vec{P}

or

\vec{Q}

of the road network, and a new edge from

{\vec{v}}_{k - 1}

to the absorb-vertex is added to the road network. Then, the corresponding road tracker stops. Otherwise, examine the density-connectivity criterion using Equation (7).

c ({\vec{v}}_{k - 1}, {\vec{v}}_{k | k}) \leq φ,

(15)

where

φ

is the same connectivity threshold in Equation (8). If Equation (15) is satisfied, it is likely that the newly extracted road segment

〈 {\vec{v}}_{k - 1}, {\vec{v}}_{k | k} 〉

crossed a road edge or ran into a dead end (zero density area). The corresponding road tracker stopped and the new edge was nullified in this situation. Otherwise, a new vertex and edge were added to the road network, and the road tracker continued tracking with the newest state

X_{k} = X_{k | k}

, by repeating steps (2) to (5).

The tracking step length played a key role in our road-tracking algorithm. Intuitively, when the step length was too large, the road tracker could not handle the dramatic change in road orientation well. Conversely, when the step length was too small, the road tracker would be trapped or distorted by the local heat zone of the density maps, such as the intersection regions. Here, we chose

s = 50

m through a series of experiments.

As shown in Figure 15, after the input of ground-truth intersections, our road-tracking algorithm could generate continuous road centerlines interconnected to form a network, and achieved good coverage of existing roads (travelled by at least three trajectories). This provided a solid foundation for post-processing, to further enhance the quality of the road network.

In conclusion, the proposed template-matching-based road-orientation-extraction algorithm adapted the road-tracking algorithm for the GPS trajectory data. Although the correction step was simple, it considered 80

\times

80 m area of local density information and was robust enough. A road-intersection-detection algorithm with high recall could help the road-tracking algorithm to avoid tackling the challenging intersection regions in advance.

3.3. Road Centerline Refinement Based on Active Contour

Due to the high GPS noise, most existing KDE-based road-network-extraction methods produced a number of zig-zag roads in the extraction result, as shown in Figure 16. The proposed road tracking algorithm faced the same problem—due to the discretized output of the measurement algorithm, the extracted road centerlines often have fluctuating orientations on consecutive points, resulting in zig-zag roads, especially when the tracking step length is small.

Developing a density-aware road-centerline-refinement algorithm provided a general solution to this problem. We chose an active contour model for it defined an energy function on a deformable line, including a pair of structural internal force and external image force; therefore, it could balance between generating straight roads and over-fitting zig-zag roads. The refinement algorithm took a road centerline from a road-tracking algorithm as its input; based on a predefined energy function, the geometric refinement of the road centerline was done by minimizing its energy, yielding a smoothed road.

3.3.1. Active Contour Model

In this section, we introduce the definition of the original active contour model by Kass [22], along with the minimization procedure, through variational calculus. Let

\vec{v} (s) = (x (s), y (s))

denote a deformable line in plane space, its energy function is defined as:

\begin{matrix} E_{s n a k e} (\vec{v}) = E_{i n t} (\vec{v}) + E_{i m g} (\vec{v}) \\ = \frac{1}{2} \int (α (s) {| {\vec{v}}_{s} (s) |}^{2} + β (s) {| {\vec{v}}_{s s} (s) |}^{2}) d s + \int P (\vec{v} (s)) d s \end{matrix},

(16)

where

E_{i n t}

represents the internal energy, which controls the shape of curve

\vec{v} (s)

with its first-order term and second-order term;

E_{i m g}

represents the image energy when

\vec{v} (s)

is placed on a background image. Parameter

α (s)

is also known as the elasticity parameter, where setting a larger

α (s)

results in an optimal curve closer to the shortest path; and parameter

β (s)

is also known as the stiffness parameter, where setting a smaller

β (s)

allows the optimal curve to have a more drastic change in direction.

Based on Equation (16), the optimal curve was defined and obtained by minimizing the total energy. Kass proposed a numerical method using the Euler–Lagrange differential equation. If curve

\vec{v} (s)

extremized the functional

E_{s n a k e} (\vec{v}) = \int L (s, \vec{v}, {\vec{v}}_{s}, {\vec{v}}_{s s}) d s

, then the following equation was satisfied:

\frac{\partial L}{\partial \vec{v}} - \frac{d}{d s} \frac{\partial L}{\partial {\vec{v}}_{s}} + \frac{d^{2}}{d s^{2}} \frac{\partial L}{\partial {\vec{v}}_{s s}} = 0 .

(17)

Turning Equation (16) into discrete form yields an easier solution method, especially when

α (s)

and

β (s)

were not constant. Following Kass’ approach, two independent Euler–Lagrange equations could be drawn from Equation (17), and could be written in matrix form as:

{\begin{matrix} A X + f_{x} (X, Y) = 0 \\ A Y + f_{y} (X, Y) = 0 \end{matrix},

(18)

where

X = {[x_{1}, x_{2}, \dots, x_{n}]}^{T}

and

Y = {[y_{1}, y_{2}, \dots, y_{n}]}^{T}

represent the coordinates of nodes in the curve.

A

is a sparse coefficient matrix composed of

α (s)

and

β (s)

at these nodes.

f_{x} (X, Y)

=

{[\frac{\partial P}{\partial x} (x_{1}, y_{1}), \frac{\partial P}{\partial x} (x_{2}, y_{2}), \dots, \frac{\partial P}{\partial x} (x_{n}, y_{n})]}^{T}

and

f_{y} (X, Y)

=

{[\frac{\partial P}{\partial y} (x_{1}, y_{1}), \frac{\partial P}{\partial y} (x_{2}, y_{2}), \dots, \frac{\partial P}{\partial y} (x_{n}, y_{n})]}^{T}

are the derivatives of the image force.

Solving

(X, Y)

from Equation (18) yields the optimal curve, which could be achieved by recursively computing the following equations, starting from the initial curve

(X_{0}, Y_{0})

:

{\begin{array}{l} X_{t} = {(A + γ I)}^{- 1} (γ X_{t - 1} - f_{x} (X_{t - 1}, Y_{t - 1})) \\ Y_{t} = {(A + γ I)}^{- 1} (γ Y_{t - 1} - f_{y} (X_{t - 1}, Y_{t - 1})) \end{array},

(19)

where

γ

is a step parameter and

I

is an

n \times n

identity matrix. The inverse matrix of

(A + γ I)

could be computed by

L U

decomposition. When the total energy change was less than a threshold, the curve

(X_{t}, Y_{t})

converged to a local minima.

3.3.2. Geometric Refinement of the Extracted Road Centerlines

Based on the active contour model, the geometric refinement of a road centerline could be defined as minimizing its energy, yielding a smoothed road. The refinement algorithm took its input

{{\vec{v}}_{0}, {\vec{v}}_{1}, \dots, {\vec{v}}_{n}}

from the road-tracking algorithm.

To solve the optimal road centerlines, first, we computed the coefficient matrix

A

. In our case, curve

\vec{v} (s)

was unclosed and had two fixed ends, resulting in the boundary conditions

\vec{v} (0) = {\vec{v}}_{0}

and

\vec{v} (n) = {\vec{v}}_{n}

. As

\vec{v} (0)

and

\vec{v} (n)

were fixed, we could only optimize the remaining points; therefore,

A

was an

(n - 1) \times (n - 1)

pentadiagonal banded matrix:

A = [\begin{matrix} a & b & c & 0 & \dots & 0 \\ b & a & b & c & ⋱ & 0 \\ c & b & a & b & ⋱ & 0 \\ 0 & c & b & a & ⋱ & c \\ ⋮ & ⋱ & ⋱ & ⋱ & ⋱ & b \\ 0 & 0 & 0 & c & b & a \end{matrix}],

(20)

where

a = 2 α + 6 β, b = - (α + 4 β), c = β

by Kass’ approach [22].

Following the common assumption made by road-tracking methods that road centerlines generally have low curvatures, we could assume that

α (s) = α

and

β (s) = β

were constant, except for a few turn-points. If

{\vec{v}}_{i}

is a turn-point, then

β_{i}

is set to 0, and the

i

th row of

A

equals

[\dots 0 - α 2 α - α 0 \dots]

. The adaptive selection of turn-points is discussed later.

Next, we introduced orientation restrictions

{\vec{v}}_{s} (0) = {\vec{h}}_{0}

and

{\vec{v}}_{s} (n) = {\vec{h}}_{n}

to consider road orientations at both ends, as shown in Figure 16. We included orientation restrictions in a matrix form

B = {[{\vec{b}}_{1} {\vec{b}}_{2} \dots {\vec{b}}_{n - 1}]}^{T}

into Equation (18). Here

{{\vec{b}}_{i} = \vec{0} | 3 \leq i \leq n - 3}

was satisfied, as we want orientation restrictions to only directly affect the first points of a road from two ends.

To determine

B

, since orientation restrictions were independent from image information (we could assume that the image force equaled to zero), the following equation held true for the optimization result:

A {[{\vec{v}}_{1} {\vec{v}}_{2} \dots {\vec{v}}_{n - 1}]}^{T} + {[{\vec{b}}_{1} {\vec{b}}_{2} \dots {\vec{b}}_{n - 1}]}^{T} = 0,

(21)

Focusing on one side of the road, one could make a reasonable assumption that

[c b a b c] {[{\vec{v}}_{- 1} {\vec{v}}_{0} {\vec{v}}_{1} {\vec{v}}_{2} {\vec{v}}_{3}]}^{T} = 0

also held true, where

{\vec{v}}_{- 1} = {\vec{v}}_{0} - {\vec{h}}_{0}

was defined by prolonging the road in the opposite orientation. Based on these assumptions,

{\vec{b}}_{1}

and

{\vec{b}}_{2}

could now be solved using the first two rows of coefficient matrix

A

:

{\begin{matrix} {\vec{b}}_{1} = (b + c) {\vec{v}}_{0} - c {\vec{h}}_{0} \\ {\vec{b}}_{2} = c {\vec{v}}_{0} \end{matrix},

(22)

where

{\vec{h}}_{0} = (s \cdot c o s θ_{0}, s \cdot s i n θ_{0})

was computed by using the road orientation and tracking step length. By similarly computing

{\vec{b}}_{n - 2}

and

{\vec{b}}_{n - 1}

, yielded the complete restriction matrix

B = [B_{x} B_{y}]

.

Previous works demonstrated that road centerlines could be viewed as mountain ridges of the density field [17]; therefore, we define the image force as a negative density value

P (\vec{v} (s)) = - ρ (\vec{v} (s))

. However, due to the disparity problem, the image force varied largely from curve to curve, making it infeasible to apply a set of global parameters. Therefore, the image force must be normalized at the local level. Combining the orientation restrictions, Equation (18) was rewritten as:

{\begin{matrix} A X + (B_{x} + λ \cdot f_{x} (X, Y)) = 0 \\ A Y + (B_{y} + λ \cdot f_{y} (X, Y)) = 0 \end{matrix},

(23)

where

λ

is the normalization factor, which was computed using the density value along the curve

λ = 1 / \sqrt{\sum ρ {({\vec{v}}_{i})}^{2}}

. The new equation could still be solved by Kass’ approach (see Equation (19)).

Last, we identified four tuning strategies through a series of experiments:

(1) Set a rather small

α

; therefore, the image force plays a priority role in refining road centerlines;

(2) Set a rather large

β / α

; here, we use

β / α = 10

;

(3) Set a rather large step parameter

γ / α

; here, we use

γ / α = 5

;

(4) Allow the curve to have multiple adaptively selected turn-points, where

β = 0

is satisfied.

We briefly introduced these strategies. Since the input curve from the road-tracking algorithm was close enough to the real road centerline, a large step parameter could produce a robust iteration course. The impact of disparity was multi-level; local heat zones existed in the road due to the stay-points in the trajectories. Figure 17 shows that these heat zones might distort an active contour, as the image force dragged its points into a density peak. Setting a larger

β

could mitigate this problem; however, it produced a new problem—if

α (s)

and

β (s)

were constant, the active contour could not accurately fit a road with drastic changes in orientation (Figure 18a).

A previous work [11] showed that introducing turn-points could solve this problem (Figure 18b). At a turn-point,

β_{i}

was set to 0, allowing the optimal curve to drastically change in orientation. Turn-points could be adaptively selected by checking whether the orientation change

| θ_{i} - θ_{i - 1} |

in the initial curve was larger than a given threshold (e.g.,

π / 6

). Experiments showed that the adaptive turn-points could considerably improve the geometric refinement of road centerlines, which significantly reduced the zig-zag roads in the extracted road network.

4. Experimental Results

4.1. Datasets and Evaluation of the Extracted Road Network

Real trajectory data of taxis in the Shenzhen city [12] were used to verify the effectiveness of our road network extraction method using low-frequency GPS trajectory data. The Shenzhen dataset contained over 75 million GPS points from 14,716 taxis, with an average sample rate of 26.1 s [26], collected in one day, which represented feasible but low-quality GPS trajectory data. In our experiments, only a subset covering an area of 2.0

\times

2.0 km was used for quantitative evaluation (Figure 19). The ground-truth map was provided by OpenStreetMap (OSM) [27], and we only kept roads that were travelled by at least three trajectories. The partial Shenzhen taxi trajectory data and manually processed OpenStreetMap can be found in supplementary material.

To compare the extracted road network with ground-truth data, we adopted a graph-sampling-based distance measure proposed by Biagioni and Erikson [3]. This measure considered both the geometric and topological similarity of maps. The sample points on the extracted road network were called marbles, and the sample points on the ground-truth road network were called holes. If a marble was close enough to a hole (judging from a predefined distance threshold), a matched marble and a matched hole were yielded. Based on this intuition, two metrics could be computed by counting the matched marbles and holes: (1)

s p u r i o u s = s p u r i o u s_m a r b l e s / (s p u r i o u s_m a r b l e s +

m a t c h e d_m a r b l e s)

and (2)

m i s s i n g = e m p t y_h o l e s / (e m p t y_h o l e s + m a t c h e d_h o l e s)

. Combing these two metrics, the F-score could be computed as:

F - s c o r e = 2 \cdot \frac{p r e c i s i o n \times r e c a l l}{p r e c i s i o n + r e c a l l},

(24)

where

p r e c i s i o n = 1 - s p u r i o u s

and

r e c a l l = 1 - m i s s i n g

.

The F-score represents the proximity of the extracted road network to the ground-truth map, under a specific distance threshold. Changing the distance threshold could reflect different standards for evaluating a generated road network, from satisfying commercial requirements to visualization needs. In our experiments, a set of distance thresholds

{5 m, 10 m, 20 m}

was used.

4.2. Evaluating Extracted Road Network

4.2.1. Parameter Tuning

Our method consisted of three steps—automatic seeding, road tracking, and road centerline refinement. The tracking algorithm was sensitive to the generated seed points, therefore automatic seeding is a key step, which involves two critical parameters—the circular sampling radius

r

and the density-connectivity threshold

φ

.

The parameter tuning was based on the ground-truth intersections offered by OpenStreetMap. The generated seed points were evaluated by computing the error of road orientation using known road branches of the ground-truth intersections. Using a fixed degree threshold

7.5

°, Table 1 shows the precision and recall of the generated seed points (before density connectivity check), using different circular sampling radius

r .

It could be seen that as sampling radius increased, the number of generated seed points increased, however, the recall first increased and then decreased. Both precision and recall reached its maximum value when

r

was 60 m. Based on this parameter setting and a loosened degree threshold of 15°, Table 2 shows the precision and recall after the density-connectivity check, using different threshold

φ .

The density-connectivity check tried to remove the false seed points without misclassifying the true ones. About 5.2% of the generated seed points could not match with any ground-truth seed points, even when the degree threshold was greatly loosened. In this case, a connectivity threshold of 0.1 was able to remove half of the false seed points, while preventing the recall from decreasing too much.

The proposed road-tracking algorithm involved one key parameter, the tracking step length

s

. Since the road centerlines would be further refined in post-processing, here, we focus on the recall and complexity of the road network generated by road tracking. We use ground-truth seed points as input to exclude road tracking’s sensitivity to seed points. The distance threshold in evaluation of the road network was set to

20 m

, causing every extracted road be counted in the recall.

As shown in Table 3, given the correct seed points, the proposed road-tracking algorithm showed the robustness for covering the existing roads. Either too small or too large

s

led to redundant vertexes and edges, resulting in more spurious roads. Lastly, Table 4 lists the parameter settings for full experiments.

4.2.2. Evaluation of the Extracted Road Network

Since previous intersection detection methods [18,21] lacked a positioning error evaluation of the detection result, the first part of our experiments was based on the ground-truth intersections. The simulation result using intersections with different coverages and noise levels is discussed in the next section.

Table 5 shows that our method achieved both relatively high precision and high recall when extracting road networks, if the input intersection was high quality. In the recent road intersection detection method proposed by Deng [21], a high precision (94.52%) was achieved using low-frequency trajectory data in Wuhan, but the recall (55.65%) was rather low. Intuitively, a recall-focus intersection detection method could better meet our method’s needs to cover more roads in the extraction result.

Table 5 also shows that the quality of automatically generated road network was still inadequate for commercial applications such as navigation and automated driving, a more realistic usage could be identifying changes and missing roads in the existing map, at a daily level (noticed that the Shenzhen dataset we used was collected on one day).

Figure 20a shows that our method could generate smooth road centerlines from the noisy and unevenly distributed density map. Figure 20b demonstrates the performance of geometric refinement, which shows that the refined roads achieved a balance between straight road and zig-zag roads, connecting the local peaks of the density field. However, the refined road was still susceptible to the heat zone of intersection region near its two ends.

In conclusion, quantitative evaluation and visual inspection demonstrated our method’s effectiveness as a semi-automatic method, the automatic one would continuously benefit from the development of the road-intersection-detection method. However, the position of intersections was not refined by our method. There were also a small proportion of spurious roads in the extracted road network, derived from the false seed points, which required a more in-depth research on an active contour-based road network refinement.

4.3. Analysis

4.3.1. Analysis of Template Matching

To verify the template-matching algorithm, we randomly sampled 176 slice images of a road with ground-truth orientation offered by OpenStreetMap, and then manually labeled them as road segments (total of 111 slices), near intersection (total of 40 slices), or highly noisy area (total of 25 slices). Figure 21 shows that based on the minimum-matching-distance criterion, our orientation extraction algorithm worked for the dominant road segment area; therefore, it could satisfy the need for simple correction of the road-tracking algorithm. Figure 21 also shows that adopting a circular sampling approach in the automatic seeding algorithm was crucial to preventing the areas near intersections from undermining the matching distance function.

4.3.2. Analysis of Automatic Seeding

First, we verified the effectiveness of density connectivity check using ground-truth intersections as input. Figure 22a shows the statistical result of matching the generated seed points with ground-truth road branches, from which we observed a proportion of obvious false seed points having an unacceptable orientation error (

>

60°). Figure 22b shows that after the density connectivity check, these obvious false seed points were removed, increasing the precision of the generated seed points at a small cost of recall.

Table 6 lists the detail precision, recall, and the F-score of the generated seed points (after density connectivity check), using different degree thresholds. As shown in Table 6, our seeding algorithm could balance precision and recall.

Next, we analyzed the sensitivity to input intersection positions under the fixed degree threshold of 7.5°. Since the previous road-intersection-detection methods lacked the fundamental positioning error model, we modeled the positioning error as 2D Gaussian distribution, and introduced circular error probable (CEP) to represent the noise level. When the CEP was

L

meters, 50% of the positioning error were within

L

meters. Figure 23 shows that the highly noisy intersection input eventually lowered the performance of the seeding algorithm, posing a moderate requirement on the precision of the road intersection detection method.

4.3.3. Input Sensitivity Analysis

Last, we tested the stability of our road network extraction method using simulation intersections with different coverages and noise levels (CEP). We modeled three types of input intersection—(1) ground-truth, (2) precision-focused, and (3) recall-focused. Simulation intersections were generated by randomly picking from ground-truth intersections and then adding 2D Gaussian noise to the center position; the detail CEP and recalls of different inputs are listed in Table 7.

The ground-truth input represents the manually selected input, provided by human operators or OpenStreetMap. The precision-focused input represents previous precision-focused road intersection detection methods; here, the recall was determined from the recent work by Deng [21]. The recall-focused input represents ideal methods that could better meet our method’s need.

As shown in Figure 24, using the precision-focused input, our method could more accurately generate parts of the road network. However, it would eventually miss about 20% of the existing roads. Using recall-focused input, this automatic approach could cover over 90% of the existing roads in the extraction result, regardless of the moderate intersection position error. Its ability to cover existing roads was very close to the semi-automatic approach. Finally, if the vertex of extracted road network could also be refined, which was what our method lacked, the map quality could be further improved.

5. Conclusions

In this paper, a low-frequency GPS trajectory data-based road network extraction method was proposed. The feasibility of GPS trajectory data provided a new means to automatically generating road maps. However, the prevalence of low sampling rate and multi-level disparity complicated the achievement of both high precision and high coverage in the generated map. Among various road-network-extraction methods, the intersection-first approach used the valuable information from road intersections in a divide-and-conquer manner, and it could be further benefited from the development of intersection-detection methods. Therefore, we followed this approach to propose a combined method that applied a road-tracking algorithm on the density map estimated by the KDE algorithm, where information of road intersections was represented as automatically generated seed points.

We found that the divide-and-conquer strategy was the key to tackling the multi-level disparity problem. By ruling out the near intersection region in advance, the template matching-based road-orientation-extraction algorithm performed better in general. The road-tracking algorithm could trace a road connecting two known intersections with more robustness. We pointed out that a recall-focus road-intersection-detection method could better suffice the need of road network extraction in this approach. By adopting an active contour-based road-centerline-refinement algorithm, our method yielded smoothed road centerlines than the traditional KDE-based methods. Compared to the previous GPS-trajectory-data-based road-network extraction methods, our method achieved a good balance between precision and recall using low-frequency trajectories. Moreover, the proposed road centerline refinement algorithm could serve as a general solution to zig-zag roads, considering both road structure and image information.

However, our method lacks the ability to refine the position of road intersections and to handle complicated intersection structures (e.g., highways and roundabouts). There also exists a small proportion of spurious roads due to false seed points, which requires an in-depth road-network-refinement method. Therefore, our future work will focus on applying the active contour for optimizing the position of a known intersection, and developing its ability to extract the precise boundary of city blocks from a density map, which is helpful in identifying the dead-end roads.

Supplementary Materials

The following are available online at https://www.mdpi.com/2220-9964/10/3/122/s1, the partial Shenzhen taxi trajectory data and OpenStreetMap we used in experiment is added in the supplementary file.

Author Contributions

Conceptualization, Banqiao Chen, Chibiao Ding, and Wenjuan Ren; Methodology, Banqiao Chen, and Chibiao Ding; Software, Banqiao Chen; Validation, Banqiao Chen; Formal Analysis, Banqiao Chen; Investigation, Banqiao Chen; Resources, Guangluan Xu; Data Curation, Banqiao Chen; Writing—Original Draft Preparation, Banqiao Chen; Writing—Review and Editing, Chibiao Ding and Wenjuan Ren; Visualization, Banqiao Chen; Supervision, Chibiao Ding; and Project Administration, Guangluan Xu; All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available at https://www.cs.rutgers.edu/~dz220/data.html (accessed on 22 October 2018).

Acknowledgments

We would like to thank Assistant Desheng Zhang at Rutgers University for releasing the Shenzhen trajectory dataset. We express our thanks to the editors and anonymous reviewers for their valuable comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lian, R.; Wang, W.; Mustafa, N.; Huang, L. Road Extraction Methods in High-Resolution Remote Sensing Images: A Comprehensive Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5489–5507. [Google Scholar] [CrossRef]
Wang, W.; Yang, N.; Zhang, Y.; Wang, F.; Cao, T.; Eklund, P. A review of road extraction from remote sensing images. J. Traffic Transp. Eng. Engl. Ed. 2016, 3, 271–282. [Google Scholar] [CrossRef] [Green Version]
Biagioni, T.; Eriksson, J. Inferring road maps from global positioning system traces: Survey and comparative evaluation. Transp. Res. Rec. J. Transp. Res. Board 2012, 2291, 61–71. [Google Scholar] [CrossRef] [Green Version]
Ahmed, M.; Karagiorgou, S.; Pfoser, D.; Wenk, C. A comparison and evaluation of map construction algorithms using vehicle tracking data. GeoInformatica 2014, 19, 601–632. [Google Scholar] [CrossRef] [Green Version]
Guan, H.; Li, J.; Cao, S.; Yu, Y. Use of mobile LiDAR in road information inventory: A review. Int. J. Image Data Fusion 2016, 7, 219–242. [Google Scholar] [CrossRef]
Xu, Y.; Xie, Z.; Feng, Y.; Chen, Z. Road extraction from high-resolution remote sensing imagery using deep learning. Remote Sens. 2018, 10, 1461. [Google Scholar] [CrossRef] [Green Version]
Miao, Z.; Wang, B.; Shi, W.; Zhang, H. A semi-automatic method for road centerline extraction from VHR images. IEEE Geosci. Remote Sens. Lett. 2014, 11, 1856–1860. [Google Scholar] [CrossRef]
Vosselman, G.; Knecht, J. Road tracking by profile matching and Kalman filtering. In Automatic Extraction of Man-Made Objects from Aerial and Space Image; Gruen, A., Kuebler, O., Agouris, P., Eds.; Birkhäuser: Basel, Switzerland, 1995; pp. 265–274. [Google Scholar]
Kim, T.; Park, S.; Kim, M.; Jeong, S.; Kim, K. Tracking road centerlines from high resolution remote sensing images by least squares correlation matching. Photogramm. Eng. Remote Sens. 2004, 70, 1417–1422. [Google Scholar] [CrossRef]
Movaghati, S.; Moghaddamjoo, A.; Tavakoli, A. Road extraction from satellite images using particle filtering and extended kalman filtering. IEEE Trans. Geosci. Remote Sens. 2010, 48, 2807–2817. [Google Scholar] [CrossRef]
Youn, J.; Bethel, J.S. Adaptive snakes for urban road extraction. In Proceedings of the International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Istanbul, Turkey, 12–23 July 2004; Volume 35, pp. 465–470. [Google Scholar]
Urban Data Release V2. Available online: https://www.cs.rutgers.edu/~dz220/data.html (accessed on 22 October 2018).
Edelkamp, S.; Schrödl, S. Route Planning and Map Inference with Global Positioning Traces. Comput. Vis. 2003, 2598, 128–151. [Google Scholar]
Karagiorgou, S.; Pfoser, D. On vehicle tracking data-based road network generation. In Proceedings of the 20th International Conference on Intelligent User Interfaces Companion–IUI Companion’15, Lyon, France, 16–20 April 2012; p. 89. [Google Scholar]
Chen, C.; Lu, C.; Huang, Q.; Yang, Q.; Gunopulos, D.; Guibas, L. City-Scale Map Creation and Updating using GPS Collections. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining–KDD’16, San Francisco, CA, USA, 13–17 August 2016; pp. 1465–1474. [Google Scholar]
Biagioni, J.; Eriksson, J. Map inference in the face of noise and disparity. In Proceedings of the 20th International Conference on Intelligent User Interfaces Companion–IUI Companion’15, Lyon, France, 16–20 April 2012; pp. 79–88. [Google Scholar]
Wang, S.; Wang, Y.; Li, Y. Efficient map reconstruction and augmentation via topological methods. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, Washington, DC, USA, 3–6 November 2015; pp. 1–25. [Google Scholar]
Zhang, C.; Xiang, L.; Li, S.; Wang, D. An Intersection-First Approach for Road Network Generation from Crowd-Sourced Vehicle Trajectories. ISPRS Int. J. Geo-Inf. 2019, 8, 473. [Google Scholar] [CrossRef] [Green Version]
Cao, L.; Krumm, J. From GPS traces to a routable road map. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009. [Google Scholar]
Ahmed, M.; Wenk, C. Constructing Street Networks from GPS Trajectories. In Algorithms–ESA2012; Springer: Berlin/Heidelberg, Germany, 2012; pp. 60–71. [Google Scholar]
Deng, M.; Huang, J.; Zhang, Y.; Liu, H.; Tang, L.; Tang, J.; Yang, X. Generating urban road intersection models from low-frequency GPS trajectory data. Int. J. Geogr. Inf. Sci. 2018, 32, 1–25. [Google Scholar] [CrossRef]
Kass, M.; Witkin, A.; Terzopoulos, D. Snakes: Active contour models. Int. J. Comput. Vis. 1988, 1, 321–331. [Google Scholar] [CrossRef]
Fua, P.; Leclerc, Y.G. Model driven edge detection. Mach. Vis. Appl. 1990, 3, 45–56. [Google Scholar] [CrossRef] [Green Version]
Scott, D.W. Kernel Density Estimators. In Multivariate Density Estimation: Theory, Practice, and Visualization, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2015; pp. 137–216. ISBN 1118575482. [Google Scholar]
Chen, Y.H.; Krumm, J. Probabilistic modeling of traffic lanes from GPS traces. In Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA, 2–5 November 2010. [Google Scholar]
Zhang, D.; Zhao, J.; Zhang, F.; He, T. UrbanCPS: A cyber-physical system based on multi-source big infrastructure data for heterogeneous model integration. In Proceedings of the ACM/IEEE 6th International Conference on Cyber-Physical Systems (ICCPS’15), Seattle, WA, USA, 14–16 April 2015; pp. 238–247. [Google Scholar]
OpenStreetMap. Available online: http://www.openstreetmap.org/ (accessed on 22 October 2018).

Figure 1. Road network extraction methods based on two main types of data. RS—remote sensing; KDE—kernel density estimation.

Figure 2. Workflow for extracting road centerlines from GPS trajectories.

Figure 3. Generated slice image with arbitrary orientation—(a) the square local area; (b) re-estimating density value on every pixel using the nearby points; and (c) rotating the skewed slice image.

Figure 4. Model the spread of GPS points as single Gaussian distribution.

Figure 5. Pre-processing before matching—(a) the original slice image; (b) after applying the median filter; and (c) after binarization.

Figure 6. Three types of slice image—(a) road segment; (b) near intersection; and (c) highly noisy area.

Figure 7. Detecting road branches based on template matching and circular sampling.

Figure 8. Matching distance function reflects the possibility of having a road branch with orientation

θ

.

Figure 8. Matching distance function reflects the possibility of having a road branch with orientation

θ

.

Figure 9. Fluctuation in matching distance function results in false road branches.

Figure 10. Linearly sampled density value along a road branch’s orientation.

Figure 11. Seed points with drastic density change from inner circle to outer circle are considered false.

Figure 12. Flow chart of the road-tracking algorithm.

Figure 13. Road model.

Figure 14. Difference between two template-matching modes—(a) slice images with variable orientation allow for using more image information; and (b) fixed slice image is less computationally intensive.

Figure 15. The result of road tracking using the Shenzhen taxi trajectory data—(a) comparison with the complete map available online (extracted roads are marked with thick lines, thin lines stand for roads from OpenStreetMap, which are well overlapped with the color map offered by BaiduMap, and the Chinese labels stand for street name); and (b) coverage on roads travelled by at least three trajectories (green line means covered).

Figure 16. Orientation restrictions cause non-straight road to also be the optimization result (assuming that the image force equals to zero).

Figure 17. Geometric refinement result—(a) smooth in general; and (b) distorted by local heat zone.

Figure 18. Fitting a road with drastic changes in orientation—(a) considering no turn-points; and (b) allowing adaptive selection of turn-points.

Figure 19. Raw GPS trajectories collected by taxis in Shenzhen (partial).

Figure 20. Effectiveness of active contour-based road centerline refinement—(a) the smoothed road centerlines; and (b) comparison of tracked roads and refined roads.

Figure 21. Experimenting road orientation extraction algorithm for three types of slice images.

Figure 22. Error when determining the road orientation of seed points—(a) before and (b) after density connectivity check.

Figure 23. Evaluation of the generated seed points under different noise levels. CEP—circular error probable.

Figure 24. Comparison between three types of input intersections—(a) precision; and (b) recall.

Table 1. Comparison of generated seed points before connectivity check.

Circular Sampling Radius $r$	Number of Seed Points	Degree Threshold	Precision	Recall
50 m	206	7.5°	81.07%	81.07%
60 m	212		82.55%	84.95%
70 m	219		79.00%	83.98%
80 m	228		75.44%	83.50%
90 m	232		73.71%	83.10%

Table 2. Comparison of the generated seed points after connectivity check.

Connectivity Threshold $φ$	0.05	0.1	0.2	0.3
206 ground-truth seed points offered by OpenStreetMap
Number of Seed Points	203	198	180	169
Precision	96.06%	97.98%	98.33%	98.82%
Recall	96.12%	94.17%	85.92%	81.07%

Table 3. Comparison of the extracted road network

G (V, E)

before the road centerline refinement.

Table 3. Comparison of the extracted road network

G (V, E)

before the road centerline refinement.

Tracking Step Length $s$	10 m	20 m	50 m	75 m
Recall	92.33%	92.78%	93.38%	93.30%
$\| V \|$	118	98	90	103
$\| E \|$	168	134	121	143

Table 4. Parameter settings for experiments.

Grid Cell Length $c$	Pixels of Road Template $N$	Circular Sampling Radius $r$	Connectivity Threshold $φ$	Tracking Step Length $s$
2.5 m	32	60 m	0.1	50 m

Table 5. Comparison of the extracted road networks.

Distance Threshold	Methods	Precision	Recall	F-score
5 m	Proposed Biagioni [16]	66.11% 39.47%	69.26% 42.26%	0.67 0.41
10 m	Proposed Biagioni	86.78% 80.39%	89.97% 75.76%	0.88 0.78
20 m	Proposed Biagioni	96.62% 96.20%	94.28% 84.97%	0.95 0.90

Table 6. Evaluation of generated seed points.

Degree Threshold	Precision	Recall	F-Score
5°	72.73%	69.90%	0.71
7.5°	84.85%	81.55%	0.83
15°	97.98%	94.17%	0.96

Table 7. Three types of input intersections.

Type	CEP	Recall	Representation
Ground-truth	0 m	100%	OSM map
Precision-focused	10 m	55%	Deng’s method [21]
Recall-focused	25 m	75%	Ideal prerequisite

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, B.; Ding, C.; Ren, W.; Xu, G. Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data. ISPRS Int. J. Geo-Inf. 2021, 10, 122. https://doi.org/10.3390/ijgi10030122

AMA Style

Chen B, Ding C, Ren W, Xu G. Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data. ISPRS International Journal of Geo-Information. 2021; 10(3):122. https://doi.org/10.3390/ijgi10030122

Chicago/Turabian Style

Chen, Banqiao, Chibiao Ding, Wenjuan Ren, and Guangluan Xu. 2021. "Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data" ISPRS International Journal of Geo-Information 10, no. 3: 122. https://doi.org/10.3390/ijgi10030122

APA Style

Chen, B., Ding, C., Ren, W., & Xu, G. (2021). Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data. ISPRS International Journal of Geo-Information, 10(3), 122. https://doi.org/10.3390/ijgi10030122

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automatically Tracking Road Centerlines from Low-Frequency GPS Trajectory Data

Abstract

1. Introduction

2. Related Work

3. Tracking Road Centerlines from Low-Frequency Trajectory Data

3.1. Automatic Seeding Based on Kernel Density Estimation Map

3.1.1. Density Estimation

3.1.2. Template Matching

3.1.3. Seeding Based on Template Matching and Circular Sampling

3.2. Tracking Road Centerlines Based on Kernel Density Estimation Map

3.2.1. Road Model

3.2.2. Road Orientation Extraction Based on Template Matching

3.2.3. Road Tracking Algorithm

3.3. Road Centerline Refinement Based on Active Contour

3.3.1. Active Contour Model

3.3.2. Geometric Refinement of the Extracted Road Centerlines

4. Experimental Results

4.1. Datasets and Evaluation of the Extracted Road Network

4.2. Evaluating Extracted Road Network

4.2.1. Parameter Tuning

4.2.2. Evaluation of the Extracted Road Network

4.3. Analysis

4.3.1. Analysis of Template Matching

4.3.2. Analysis of Automatic Seeding

4.3.3. Input Sensitivity Analysis

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI