1. Introduction
Road interchanges are critical components of road networks, using grade-separations to align intersections of multiple surface transport routes at different heights (grades), preventing disruptions to traffic flow on other routes. They play a crucial role in optimizing traffic and reducing congestion [
1,
2] in urban areas. Research shows these areas are prone to accidents, especially in merging section [
3]. Accurate road information (geometry and topology structures) is essential for reliable navigation services, enhancing safety and efficiency for drivers [
4,
5]. Traditional approaches to field mapping are costly due to labor-intensive processes and inefficient real-time updates [
6].
Automatic extraction of road networks using multi-source geospatial data, e.g., remote sensing images, or crowdsourced trajectories, has long been a focus. In particular, extracting the geometric and topology structures of road interchanges from remote sensing images has garner significant scholarly attention. Hu et al. [
7] proposed a two-step approach involving road footprint classification for incremental growth and pruning of road trees, achieving 81% extraction accuracy in highway interchange areas. Zhang et al. [
8] developed the NodeConnect model, which extracts road nodes and infers connectivity to represent road networks as undirected graphs. However, spatial occlusion between overpasses (upper-level straight lanes) and ramps (lower-level straight lanes) often causes fragmented lower-level road generation or pseudo-intersections at overlapping areas, leading to topological errors [
7,
8,
9]. Additionally, the geometric complexity of interchanges, coupled with geometric and textural noise in remote sensing imagery (e.g., interference from vegetation, buildings, and vehicles), poses challenges in reconstructing interchanges with both geometric fidelity and directed topological connectivity. However, crowdsourced trajectory data goes beyond capturing road shapes and locations; it also unveils intricate road connections and densely clustered pathways that aerial images might miss. Long trajectory continuity enables the identification of underlying topological connections in multi-level road networks, effectively distinguishing geometric and topological structures of overlapping upper- and lower-level roads. Furthermore, trajectory flow direction aids in discerning divergence and convergence properties at key interchange nodes, facilitating the generation of directed road structures—attributes inherently unattainable from remote sensing images alone.
The use of trajectory data to infer road maps has been a subject of extensive research for many years. Map inference [
10,
11,
12,
13] entails leveraging trajectory data to identify, reconstruct, and refine road networks over time. However, previous methods often produce maps with low accuracy, especially in areas with complex structures [
14,
15,
16] such as highway interchanges. In this section, we divide the current methods of map inference based on trajectory data into four categories: clustering-based methods, entry/exit points-based methods, trace-merging methods, and intersection-linking methods. The clustering method employs distance and azimuth information from trajectory data to delineate clusters, thereby facilitating the construction of the internal structure of highway interchanges. Stanojevic et al. [
17] grouped successive observations in each trajectory into clusters with each cluster center as a vertex and edges formed between clusters based on their sequence. However, this approach considers only pairs of consecutive observations, neglecting the broader connectivity of the trajectory. As a result, it may lead to erroneous connections, such as fragmented or discontinuous patterns, particularly in areas with dense overpasses and multi-level trajectories. The entry/exit points-based methods detect intersection boundaries, identify interface points between intersection boundaries and connecting roads, reconstruct geometries by analyzing movement patterns between identified entry/exit nodes. Deng et al. [
16] calculated similarity using the longest common subsequence to cluster the turning relationships within the interchange. However, the topology connection is not extracted and missed. The trace-merging method iteratively traverses all trajectories, merging trajectory points with near-parallel distributions based on positional and directional information. This process gradually combines nodes and edges to construct a road network graph. By modeling the attractiveness of trajectory points and grouping similar trajectories, Cao et al. [
18] constructed a road network through traversal of the trajectory collection. He et al. [
19] detected direction changes in the trajectory flow using local peaks, traced new road sections at bifurcations, and merged them based on the future alignment of the flow. Intersection-linking methods divide the interchange into several small intersections. By identifying large-angle turning points, this method segment long trajectories at different elevations. Wang et al. [
20] extracted turning rules within intersections and centerlines of external road segments, and forms connections through boundary access points to create a road network. The method hinders the effective use of trajectory continuity. As a result, it fails to distinguish between adjacent roads and generates false intersections at the interweaving sections of upper and lower-level roads, leading to topological errors.
Table 1 systematically compares four trajectory-based road network construction method in terms of their characteristics and performance. In recent years, some researchers have fused vehicle trajectories and remote sensing images to recognize intersections or extract roads. Sun et al. [
21] enhances road detection accuracy and generalizability to unmapped areas via specialized 1D filters and augmented training strategies. Qin et al. [
22] proposes an incremental road network update method that integrates multi-source geospatial data through Hidden Markov Model-based change detection and deep learning-driven road extraction.
Additionally, some researchers have explored methods for generating the three-dimensional (3D) structure of highway interchanges using LiDAR data or crowdsourced trajectories with elevation information. For instance, Cheng et al. [
23] introduced the concept of “structure unit” to decompose a complex multilayer interchange into simple units. Zhou et al. [
24] designed the multi-constraint energy model to enhance curve reconstruction performance in different highway scenes. LiDAR data captures the fine details of highway interchanges, which is crucial for accurate modeling and reconstruction. However, Airborne LiDAR systems [
25] can be costly to deploy and maintain, which may restrict their use in certain projects. The overall expenses, including the specialized equipment for data collection and the labor required for both acquisition and processing, make LiDAR a more expensive option compared to crowdsourced trajectory data. Consequently, there has been increasing interest in using crowdsourced trajectory data, which offers a more cost-effective alternative for large-scale analysis of highway interchanges and roads. Several studies have aimed to extract 3D information about road networks from crowdsourced trajectories that include elevation records. For example, Ren et al. [
26] utilized semantic segmentation of elevation data to identify slopes and level sections in roads and ramps. Yang et al. [
27] further improved the accuracy of segmenting ramps and main roads, as well as detecting the vertical order of road segments. A major limitation of this method is that most trajectory datasets are typically collected without reliable elevation data, as many GPS-based systems do not consistently include elevation information. Even when elevation data is available, its quality and reliability are often questioned [
28]. Inaccuracies in elevation measurements can result from errors in GPS signals, inconsistencies in the device’s altitude readings, or the limitations of consumer-grade sensors [
29,
30]. As a result, the extraction of precise 3D road geometry from such data can be challenging, and the accuracy of the derived models may be compromised.
Constructing a high-accuracy road graph of highway interchanges using crowdsourced trajectory data encounters two main challenges. Firstly, the intricate geometry and topology of interchanges, which include crossing stacked roads at different heights, dense ramps, and indistinguishable adjacent roads, present significant obstacles. Secondly, the reliability of crowdsourced trajectory data is compromised by biases, inaccuracies, and inconsistencies arising from GPS errors, diverse driver behaviors and varying traffic conditions [
29]. The presence of non-vehicle trajectories, such as pedestrian and cyclist paths, further complicates the extraction of interchange road structures. Additionally, the spatial distribution of trajectories exhibits heterogeneity [
31] influenced by factors such as road class and regional disparity.
Although considerable research have been dedicated to automating the generation and real-time updating of road networks, recent studies highlight that existing algorithms tend to work well for common road structures [
10,
11,
12,
13] but fail to generate high-accuracy maps in complex interchanges [
14,
15,
16]. Most of the existing map inference methods are devoted to generating road interchange map without fully utilizing trajectory continuity and flow information. They often make wrong connections between adjacent roads [
17,
19] and misidentify intersections where different road levels cross but do not meet at the same grade [
20]. This paper presents a novel approach for generating road interchange map based on crowdsourced trajectories. This method is distinct to other approaches for the following reasons: Our approach combines intersection linking and trace-merging. We have found that by using the continuity of trajectories to track the splitting and merging of flow paths, and by incorporating direction and distribution information of forward movement, we can effectively identify intersections within road interchanges. This approach allows us to accurately pinpoint the critical nodes of road interchanges. In summary, we make the following contributions:
We develop a novel road network generation method particularly for road interchanges. This method adopts a divide-and-conquer strategy by decomposing road interchange networks into subnetworks for various types of road interchanges and addresses the challenge posed by the multi-level complex structure and crowdsourced data quality issues.
A forward and reverse tracking mechanism that leverages long-term trajectory continuity is proposed to accurately identify transition nodes at road interchanges. This method avoids generating false intersections in areas where upper and lower-level roads interweave. Transition nodes are categorized into divergence nodes (via forward tracking) and convergence nodes (via reverse tracking).
Extensive experiments were conducted using real-world trajectory data. The proposed method achieves an average improvement of 13.3% in score and 12.1% in score, outperforming two baseline methods. The method demonstrates robust performance across various types of road interchanges, including cloverleaf, turbo, and trumpet interchanges.
The remainder of this paper is organized as follows.
Section 2 reviews the related work.
Section 3 presents the proposed method for generating road interchange map. The experimental analyses are detailed in
Section 4. Conclusions and future works are discussed at the end of the paper.
2. Methodology
Traces represent sequences of discrete points that depict a vehicle’s driving path. Typically, vehicles enter the highway via an on-ramp and exit via an off-ramp. As illustrated in
Figure 1a, trajectory flows entering the same entrance continuously diverge at the intersection and exit the road-interchange region through different exits. Conversely, trajectory flows from different entrances continuously merge at the intersection and exit the region via the same exit (
Figure 1b). Divergence nodes are defined where trace bundles following the road network fork at intersections (diverge or off-ramp) while the convergence nodes are defined where trace bundles match with each other (merge or on-ramp).
By leveraging the continuity of trajectories, we introduce a method that starts at the entrance/exit points to detect divergence nodes through forward tracking (i.e., following the trajectory sequence), thus creating a subgraph of the highway interchange. Conversely, convergence nodes are identified through reverse tracking (i.e., opposite to the trajectory sequence), and the road subgraphs from multiple directions are then integrated to construct the road interchange network.
As illustrated in
Figure 2, the geometric and topological structures of a highway interchange are represented in a road network model. This model is a directed graph composed of nodes and edges. The nodes represent divergence and convergence nodes, which are the intersections within the interchange, while the edges correspond to the centerlines of road segments, indicating the direction and topological connections of the roads. The geometric structure of the interchange is concerned with the shape and distribution of the roadways. It focuses on the centerline ensemble of each road segment, with the exit and entrance points serving as the endpoints. The topological structure, on the other hand, deals with the connectivity and direction of the network. This is represented by nodes and directed paths that form connections between them. The generation of the topological structure is based on the extraction of the geometric structure. Essentially, the geometric layout of the roads provides the foundation for determining how they are connected in terms of directionality and intersections, which ultimately defines the interchange’s overall topological framework.
In this paper, we propose a three-step method that converts sequences of tracked points into a routable road-level map for highway interchanges, as illustrated in
Figure 3.
First, the crowdsourced trajectory data undergoes preprocessing to filter out noisy or irrelevant trajectories. Constraint rules are applied to exclude circuitous, low-precision, non-vehicle, and low-density trajectories based on factors such as morphological features, heading angle, speed, and other relevant features.
Next, the road subgraphs corresponding to multiple entrance/exit points are extracted. A single entrance/exit is selected as a seed point, and potential transition nodes, where trajectory flows either diverge or converge, are identified after each update of the tracking window. These potential transition nodes are further verified by combining the direction and distribution information of forward movement. Trajectory bifurcation is performed at the transition nodes, and tracking continues until the end of the trajectory bundles. After redundant branches are removed, the road interchange subgraph is extracted.
Finally, a two-stage fusion of the road interchange subgraphs is performed. First, the subgraphs from forward and reverse tracking are fused separately, considering heading, trajectory bundle information, and spatial relationships. Then, the forward fused structure is combined with the convergence nodes from reverse tracking to extract a geometrically complete and topologically accurate road interchange network.
2.1. Trajectory Preprocessing
Crowdsourced trajectory data comes from various sources and typically includes trajectories from categories such as automobiles, bicycles, and pedestrians. Upon observation, there may be a cluster of buildings near the road interchange, with footpaths, urban greenways, and other features underneath. In addition, due to GPS errors, signal blocking by tall buildings, and uncertainties in the behavior of moving objects, the original data often has quality issues such as positioning drift, messy patterns, and redundancy. If not addressed, these problems can negatively impact the extraction of road interchange networks.
After analysis, two types of trajectories need to be filtered: (1) trajectories from non-road-interchange regions, which include circuitous trajectories, non-vehicle trajectories, and low-density trajectories; and (2) abnormally deviating trajectories, which refer to low-precision trajectories that deviate locally or integrally from the road surface. High-confidence trajectories distributed on road interchanges typically exhibit the following characteristics: (1) narrow geometric shape with a large spatial span; (2) smooth trajectory with fewer drift points; (3) high average and maximum velocity; (4) high density and homogeneity. Based on these features, this paper proposes a trajectory filtering strategy consists of the following steps:
Circuitous trajectory filtering: Circuitous trajectories, characterized by frequent directional changes within localized spatial ranges, form compact morphologies distinguishable via geometric metrics, as illustrated in
Figure 4a. The minimum enclosing rectangle (MER) of such trajectories provides two key indicators: the aspect ratio, which identifies narrow or clustered patterns (low values), and the area, reflecting their limited spatial extent. These traits correlate with a low morphology index (SI). Trajectories with SI below a predefined a threshold value are classified as circuitous trajectories and eliminated during preprocessing. The threshold value is defined as the mth-percentile of all trajectory morphology indices.
SI is calculated as described below:
where
and
represent the values normalized to the aspect ratio and area of the minimum enclosing rectangle for each trajectory, respectively, and
is the influence weight. Since the aspect ratio and area of the minimum enclosing rectangle are equally important in determining whether the trajectory is distributed on the overpass, we set the value of
to 0.5.
Low-precision trajectory filtering: Low-precision trajectories are defined by a high proportion of points deviating significantly from the road surface, as shown in
Figure 4b. To quantify this deviation, we compare two orientation metrics: (1) the original heading angle (
) of individual trajectory points, reflecting instantaneous vehicle direction, and (2) the direction angle (
) derived from adjacent points, representing the local road segment’s morphology. The angular distance
between
and
serves as a deviation indicator—values exceeding a threshold
suggest abnormal lateral drift. Since angular differences in raw angle space do not linearly correlate with geometric distance, we project angles onto a unit circle to compute
accurately, as illustrated in
Figure 5. The
is calculated as follows:
where
represents the original heading angle of trajectory points,
represents the direction angle calculated from adjacent trajectory points,
are the coordinates of
on the unit circle. Trajectories with a high proportion of trajectory points exhibiting low horizontal positioning accuracy, exceeding a specified ratio, are filtered out.
Non-vehicle trajectory filtering: Non-vehicle trajectories (e.g., walking, cycling) are spatially confined to non-motorized lanes and exhibit distinct speed profiles compared to vehicular movement, as illustrated in
Figure 4c. Multi-level interchanges prioritize vehicular flow through separated ramps and straight lanes, enabling higher sustained speeds. We leverage this design feature to differentiate vehicle trajectories from non-vehicle ones: trajectories with an overall average speed
and maximum speed
are classified as non-vehicle trajectories. These trajectories are filtered during preprocessing to retain only vehicular movement patterns.
Low-density trajectory filtering: Low-density trajectories are defined by a sparse distribution of points deviating from dense vehicular trace bundles, as illustrated in
Figure 4d. Road interchanges, designed for high traffic capacity, exhibit concentrated vehicle trajectory clusters due to sustained traffic flow. We quantify trajectory sparsity using neighboring flux
, which measures local trajectory density over extended segments. Trajectories with
persistently below an empirically defined threshold
are identified as low-density outliers and filtered out during preprocessing. This ensures retention of statistically representative vehicular movement patterns while removing anomalous deviations.
Auxiliary length features are computed for each trajectory point, specifically the cumulative length from each point to the endpoints of its corresponding trajectory. This feature indirectly characterizes the connectivity of the trajectories. The data preprocessing results in a high-confidence trajectory set , located within the main body of the interchange, which serves as input for road subnetwork extraction.
2.2. Road Subnetwork Extraction
The spatial distribution of vehicle trajectories is inherently constrained by the geometry of the road network. At road interchanges, the bifurcation and merging processes of the network correspond to the diverging and merging behaviors observed in trajectory data. In this paper, we introduce a novel method for extracting road subnetworks. This approach integrates local directional information with the distribution patterns of forward movement to dynamically detect road diversion events. The method enables the extraction of both the geometric and topological connectivity of road interchanges, forming a road subnetwork represented by nodes and directed edges.
The road subnetwork extraction begins by selecting an entrance/exit (orange/green dots in
Figure 6a) as the seed. From this seed point, trajectory bundles (blue lines) flowing into the road interchange are extracted from the high-confidence trajectory set
. A circular window (yellow dashed line) is then used to continuously slide along the trajectory, updating at each step, until the end of the trajectory is reached. This process models individual substructures as directed graphs. Each center of the sliding window corresponds to a node in the subgraph, which records geometric position information. The relationship between adjacent windows, established by the sliding order, defines the connections between nodes. These connections are represented as directed edges (yellow arrows) between nodes. Transition nodes (pink dots in
Figure 6b) associate road segments (blue lines) with intersections, forming a directed graph that topologically links substructures.
After each update of the sliding window, the trajectory points within the window are collected. The peak heading angle of these points is then statistically computed, and potential transition nodes are identified and verified. If the center of the current window is identified as a potential transition node and passes the validation, its location is considered a credible road bifurcation. At this point, trajectories passing through the neighborhood of the transition node are diverted into separate trajectory bundles, which are then tracked recursively. If the center of the current window is not identified as a transition node, the window is updated and the procedure continues. This process continues until the end of the tracking, at which point redundant branches are removed to form the final road subnetwork.
2.2.1. Transition Node Detection
Most existing methods assume that aligned trajectory data of a road are nearly parallel, characterized by both positional and directional proximity. These methods typically set fixed thresholds for distance and heading angle differences to cluster similar trajectory points or to incrementally construct directed graphs. The method presented in this paper integrates heading angle difference and trajectory continuity for improved tracking.
The tracking window with radius is updated by sliding it along the specified direction and step size , and connecting edges are generated between the centers of the updated windows. Given that arterials in road interchanges typically have higher road grades, more lanes, and wider roads than other internal arterials or turn ramps, the tracking window should reflect the real road conditions. To ensure that the local information carried by the track points in the window accurately represents the road, the window’s radius is primarily determined by the width of the high-grade arterials. However, a larger window may cause track points to be unevenly distributed, and the tracking direction’s deviation will gradually accumulate over time. This leads to the tracked line shifting away from the road’s center and may even cause transition nodes to be missed due to insufficient local information. Therefore, after each window update, the center of mass of the track points in the window is used to correct the window’s position, ensuring that the tracking path remains aligned with the road’s cross-section and stays centered on the road.
Initially, the method utilizes heading angle divergence to identify potential transition nodes. The divergence trend is then verified by pre-tracking the distance difference between two branches, helping to identify high-confidence transition nodes. Specifically, the heading angle space is discretized into 360 intervals. Using kernel density estimation (KDE; Gaussian kernel) [
32,
33], a normalized probability density curve of the heading angles within the current window is generated. This curve is analyzed to obtain the peak heading angle and the number of peaks within the window. As illustrated in
Figure 7a, the center of the next window is marked as a potential transition node if the number of peaks increases compared to the current window.
The authenticity of the potential transition node is verified by examining the divergence trend of the subsequent trajectory sequence through pre-tracing (
Figure 7a). Specifically, the tracking window is slid
times from the potential transition node along each heading angle peak. Since the center of the sliding window is updated to the center of mass of the trajectory points within the window, and the distribution of branching trajectory points may vary, the spacing between the pre-tracking end windows does not directly reflect the divergence trend of the trajectories. To address this, the side of the two branches closer to the potential transition node is selected, and equal-length points are intercepted at that same distance on the other branch. Next, the distance between the corresponding points of the two branches is calculated. If this distance exceeds a predetermined threshold, the point is considered a valid transition node (
Figure 7b). Otherwise, it is deemed a spurious transition node (
Figure 7b).
2.2.2. Trajectory Diversion
In road bifurcations, traffic density and speed can vary significantly across different directions. To ensure driving safety and reduce conflicts, buffer zones, such as transition sections, are often implemented. After a bifurcation, the roads may not immediately separate in space, causing trajectory points from different branches to be in close proximity. This overlap can lead to interference in the tracking process. To address this issue, we propose a trajectory diversion method based on K-means++. This method leverages the continuity of vehicle trajectories and the semantic information of local trajectory segments. By recognizing patterns in the directional flow of these segments, we transform the problem of trajectory diversion into the identification of clumping patterns for different road branches. This approach helps separate the trajectories of different branches, reducing interference and improving tracking accuracy.
Distinguishing between trajectory bundles from different branch roads is computationally complex when relying on the similarity of complete trajectories. Vehicle trajectories represent restricted geographic flows [
34,
35], with turning behaviors strictly constrained by the road network. After passing the transition node, trajectories heading toward the same branch road exhibit similar patterns within a certain distance, forming a characteristic clumping pattern. This method utilizes local semantic information by intercepting a fixed-length (100 m) segment of the trajectory along the tracking direction, with the transition node as the center of the starting neighborhood. We adopt the polar coordinate model of flow clustering to abstractly represent each trajectory segment as a ternary group, consisting of the starting point, ending point, and direction:
. Here,
represents the coordinates of the starting point,
represents the coordinates of the ending point and
is the direction of the directed line from the starting point to the ending point. The direction
is measured in degrees, with 0° indicating true north.
By reducing the dimensionality of the trajectory representation, we retain its original features while enhancing computational efficiency for subsequent flow pattern recognition. Given that the intercepted trajectory segments have nearly equal lengths and start points near the bifurcation, it can be inferred that the endpoints of segments with the same pattern are adjacent, while endpoints from different patterns are separated. Consequently, the ternary representation can be simplified to a single directional element, and trajectory flow clumps can be identified by recognizing the aggregation of direction angles in angle space, completing the trajectory diversion.
This paper adopts the K-means++ method [
36] to identify clustering patterns in the direction of trajectory segments in an unsupervised manner. This approach not only converges quickly but also effectively avoids diversion failure caused by the proximity of randomly initialized clustering centers. The procedure begins by using the peak number of heading angles obtained from transition node detection as the prior number of clusters. Initial clustering centers are then randomly selected from the sample space of trajectory segment direction angles, ensuring that they are as far apart as possible. Next, points in the sample space are assigned to clusters based on minimizing angular differences, and the cluster centers are recalculated. The iteration process continues until the changes in cluster centers are smaller than 1°, at which point the algorithm halts. The trajectories corresponding to the direction angles within each cluster form distinct trajectory bundles after diversion.
2.2.3. Redundant Branch Removal
The trajectory data involves significant uncertainty during the positioning process, resulting in errors in accuracy that prevent full precision. At the initial stage of a road bifurcation, the trajectories of different branch roads are often intertwined. The trajectory diversion is imprecise, and trajectories from different directions may not be fully separated. However, as the road extends, the trajectories begin to diverge more distinctly. Despite this, transition nodes may be repeatedly detected in the same intersection area, leading to redundant branches. To ensure the accuracy of road subnetwork merging, these redundant branches must be eliminated. The algorithm consists of the following steps:
Repeated bifurcation detection: At road interchanges, roads typically do not diverge frequently over short distances. Instead, redundant branches are often associated with repeated transition nodes, with short branches serving as indicators of repeated bifurcation signal, as illustrated in
Figure 8a. These short branches are used to detect potential repeated transition nodes.
Subsequent branch matching: The two transition nodes associated with a short branch are defined as a high-level transition node and a low-level transition node, based on their occurrence in the direction of branch extension. The matching of subsequent branches at these transition nodes is determined by the maximum overlap of the outer rectangle of the branch
, the difference in branch direction
, and the projection distance of the branch endpoint
. The branch matching process is illustrated in
Figure 8b, with the meaning of each index shown in
Figure 9. When the conditions in Formula (3) are met, the two branches are considered successfully matched. Since the matched branches represent the same road section structure, redundant branches are eliminated.
Redundant Branch Pruning: As shown in
Figure 8c, when a branch is successfully matched and the end connected to the low-level transition node does not continue to bifurcate, the branch is directly deleted. If bifurcation continues, the local structure at the branch end is first integrated, and then redundant branches are eliminated. The integration of the local structure also involves sub-structure fusion, which will be discussed in detail in
Section 2.3.
2.3. Road Subnetwork Merging
In this paper, we apply the concept of divide and conquer to simplify and decompose the extraction of a complete road network in a road interchange into multiple sub-networks. Each sub-network captures a portion of both the geometric structure and topological connections of the interchange. Specifically, the forward-tracking subnetwork contains road diversion information, while the reverse-tracking subnetwork holds road merging information. To reconstruct the full road interchange structure, we propose a two-stage subnetwork fusion method. This approach fuses subnetworks with the same tracking direction and integrates complementary tracking networks to establish the complete connectivity of the road interchange.
2.3.1. Unidirectional-Tracing Subnetwork Merging
The traffic flow in a road interchange exhibits the characteristics of multidirectional intersections, where traffic from different entrances shares common road sections. As a result, the sub-networks extracted through forward tracking often exhibit varying degrees of information overlap. The merging of unidirectional-tracing sub-networks begins by using the same road diversion location. It then identifies transition nodes within sub-networks that correspond to the same road diversion location, allowing for the discovery and fusion of structural fragments that are spatially and morphologically consistent. Ultimately, the geometric and structural information from sub-networks with the same tracking direction is integrated into a unified whole. The algorithm follows these steps:
Dynamically identify clusters of transition nodes within all sub-networks that are spatially close to each other using the DBSCAN method.
For each cluster, select two transition nodes one at a time, and determine whether they represent the same road bifurcation by checking if the direction of the road at both nodes differs by no more than 20°. Once identified, match the subsequent branches of these two points (following the method outlined in
Section 2.2.3). The center of these two nodes is then taken as the new transition node after merging. Matched branches are aligned, resampled, and fused. Connection relationships between the fused transition node and branches are updated to preserve topological consistency before and after fusion. This process is repeated until all transition nodes within the cluster that represent the same road bifurcation are merged.
Following these steps, transition nodes, branches, and their clusters are updated iteratively. Structural fragment fusion terminates once each cluster retains a single transition node or clusters correspond to distinct road bifurcations.
Due to the heterogeneity of trajectory data distribution, not all transition nodes can be fully extracted from each sub-network, resulting in unmerged structural fragments. To address this, a 5-m buffer is applied around the existing transition nodes. If a branch with a directional deviation ≤20° from the transition node’s road orientation enters this zone, a structural inconsistency is flagged. The integration matching table records the transition node ID, target branch, and its nearest folding point to the node. In the consistency refinement integration phase, branches to be fused are interrupted according to the integration matching table. Structural fragments are then merged using the same method as described in the previous section. This process results in a geometrically complete and topologically consistent road interchange structure, incorporating information from all subnetworks with the same tracking direction, referred to as the unidirectional-tracing subnetwork.
2.3.2. Bidirectional-Tracing Subnetwork Merging
The unidirectional-tracing subnetwork provides essentially complete geometric structural information but only includes topological details about diversions or mergers. The bidirectional-tracing subnetwork achieves complete structure extraction by complementarily merging the forward-tracing subnetwork and convergence nodes of the reverse-tracing subnetwork, as illustrated in
Figure 10. This process can also be described as structural consistency refinement and integration.
The complementary fusion process centers on the convergence node for buffer analysis. It identifies branches in the forward-tracking structure that pass through the convergence node’s vicinity. When detecting structural inconsistencies, both the directional difference between the road at the convergence node and the branch passing through the buffer zone, as well as the degree of trajectory matching, are considered to determine whether a record should be added to the matching table. Once the matching table is constructed, the branches are interrupted sequentially based on the records, and the branch segments on the same road are fused systematically to complete the complementary fusion. This process results in the extraction of a geometrically and topologically complete centerline-level road interchange structure.
4. Conclusions
In this paper, we propose a novel algorithm for generating highly accurate, routable road network graphs for highway interchanges. We classify the interference in the study area into four types: circuitous, non-vehicle, low-density, and low-precision trajectories. A high-confidence trajectory screening model is developed, integrating the trajectory morphology index, angular differences, motion features, and neighborhood density. This model forms the basis for the precise extraction of the road interchange network. Next, we introduce a subnetwork tracking algorithm that incorporates both trajectory continuity and local directional information. This method uses overall trajectory continuity to separate multi-level trajectories, enhances the accuracy of road bifurcation identification, and minimizes the risk of generating pseudo-topology at road intersections. Potential transition nodes are verified through pre-tracking to confirm their authenticity. To address network redundancy caused by multiple trajectory diversions at road bifurcations, redundant branches are removed during subnetwork tracking. Finally, a two-stage fusion strategy combines forward tracking with convergence nodes from reverse tracking to create a geometrically and topologically complete road interchange network.
Existing studies focus on road-level networks of typical road structures or 3D road information for highway interchanges but struggle to generate high-accuracy maps for complex interchanges due to underutilized trajectory continuity and flow information. To address this, we propose a divide-and-conquer approach that layers multi-level trajectories using shared entry/exit points and tracks bifurcations via long-trajectory continuity within subnetworks. Unlike conventional methods relying on local distance and heading angle metrics for segment merging, our approach leverages forward movement distribution and subnetwork fusion from multiple entry/exit points. This strategy simplifies the reconstruction of intricate geometries, eliminates pseudo-intersections by accurately identifying nodes, and distinguishes dense parallel roads. Experimental results demonstrate superior performance in geometric completeness and topological accuracy compared to existing methods. The results indicate the effectiveness of our proposed method in extracting road interchange networks under complex scenarios. Our method generates road networks with higher structural integrity and fewer fragments. The proposed method offers an innovative and practical solution for the automatic generation of road interchange networks, directly supporting efficient navigation systems.
Future work should address two key issues. First, the identification of entry/exit points for automatic road network construction could be improved by using road-level maps of non-interchange areas, combined with road geometry and direction derived from remote sensing images. Second, we will extend this algorithm to evaluate its robustness when applied to interchanges of varying complexity across diverse urban environments.