Next Article in Journal
About Calculation and Forecast of Temperature in the Layer Cell of Self-Heating of Raw Materials in a Silo
Previous Article in Journal
Optimized Alternating Current Electric Field and Light Irradiance for Caulerpa lentillifera Biomass Sustainability—An Innovative Approach for Potential Postharvest Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Urban Road Network Extraction Based on Web Map API Hierarchical Rasterization and Improved Thinning Algorithm

1
School of Traffic and Transportation, Northeast Forestry University, Harbin 150040, China
2
School of Modern Service, Harbin Vocational and Technical College, Harbin 150081, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(21), 14363; https://doi.org/10.3390/su142114363
Submission received: 20 September 2022 / Revised: 30 October 2022 / Accepted: 1 November 2022 / Published: 2 November 2022
(This article belongs to the Section Sustainable Transportation)

Abstract

:
Most existing research on the vector road network is based on GPS trajectory travel information extraction, and urban GPS trajectory data are large and difficult to obtain. Based on this, this study proposes a road network extraction method based on network map API and designs a vector road network based on an improved image-processing algorithm using trajectory data. Firstly, a large number of trajectory data are processed by hierarchical rasterization. The trajectory points of the regional OD matrix are obtained by using the map API interface to generate the trajectory. Then, the image expansion processing is performed on the road network raster image to complete the information loss problem. The improved Zhang–Suen refinement algorithm is used to refine the idea to obtain the road center line, and the vector road network in the study area is obtained. Finally, taking the Harbin City of Heilongjiang Province as an example, compared with the road network of the network map, it has been demonstrated that using this technology may improve the traveler experience and the sustainability of urban traffic flow while reducing the number of manual procedures required, performing online incremental rapid change detection, and updating the present road network at a cheaper cost.

1. Introduction

With the rapid development of urban road construction, people’s demand for travel is increasing, but road network information is updated slowly. The vector road network has high application value in land space planning, intelligent transportation, urban construction, and disaster relief [1,2]. The traditional road network acquisition methods are manual surveying and mapping, geographic information collection, and satellite image processing. The update technology cycle is long, and the cost is high [3,4,5]. With the rapid development of information technology, the new road network generation algorithm has become a research topic for many scholars. The most common method is to generate a road network based on vehicle GPS trajectory data or walking GPS trajectory data [6,7,8]. Road network extraction based on GPS trajectory has the advantages of high precision, low acquisition cost, comprehensive data source, and strong real-time performance for urban roads when compared to the conventional methods of surveying, mapping, and image interpretation to obtain digital map road network data, but the road network data are often small-scale. However, the magnitude of GPS trajectory in an urban area is significant, and generating a road network using GPS trajectory is challenging. The Open Street Map (OSM) project is a free, open-source, editable map service for the online public.
Nevertheless, the OSM vector map obtained from the Internet has serious road missing problems in some underdeveloped areas [9,10,11,12,13,14]. In this study, considering the integrity of the urban road network, a method of trajectory generation and road network extraction based on map API is proposed, which can obtain the geometric information of the regional road network and make it possible to construct or update the road map in real time. In [15], for instance, relevant road information (such as geometric features, speed, latitude and longitude, direction angle, etc.) is extracted and updated from the trajectory data in the vicinity of the road segment; in [16], the K-means algorithm is used to cluster the trajectory points, and the spline curve is introduced to fit the road turning area; and in [17], the straight line and arc in the mathematical model are used to describe the GPS trajectory. According to the positioning point and the driving direction, a large number of GPS trajectory points are clustered and connected to form a trajectory chain, and the road center line is fitted by the nonlinear least squares method. In [18], the trajectory points are classified, the spatial and semantic relations are judged, and the trajectory points are merged into the candidate road network; in [19], a general map generation method is proposed based on trajectory merging, and the geometric features of the road are inferred by artificial intelligence algorithm. The clustering algorithm is used to incrementally generate the road network. The authors of [20] extracted the central axis of the road based on the constrained Delaunay triangulation to establish the street road network; Ref. [8] introduces the constrained Delaunay triangulation and its dual Voronoi diagram geometric model to extract the road boundary from the vehicle trajectory line. The authors of [21] rasterized the vehicle GPS trajectory data, merged the discrete data points based on mathematical morphology, and finally used the SPTA algorithm to refine the road network; Ref. [22] created a buffer for the vehicle trajectory data and then used the cross-number refinement algorithm to refine the taxi trajectory data to build the road network. In general, the grid refinement method converts a large number of vector trajectory data points into image operations, which improves computational efficiency.

2. Materials and Methods

Firstly, the research area is rasterized, and the path trajectory points are obtained through the map navigation service to generate the trajectory. The GPS trajectory data used in this study are from Didi Chuxing Technology Co., Ltd., Beijing, China, The Didi Chuxing GAIA Initiative. These GPS data are collected in the vehicle GPS equipment of the Didi Chuxing platform vehicle. As shown in Table 1, the data fields include the driver number (vehicle_id), order number (order_id), world time (utc), longitude (longi), and latitude (lati). Secondly, the accepted course is rasterized, and then the vector road network is constructed based on rasterization. The technical route of trajectory generation and road network extraction using the map API are shown in Figure 1.

2.1. Acquisition and Hierarchical Rasterization of Trajectory Data

This study proposes a rasterization method of a hierarchical index. In the face of massive trajectory data, to improve the efficiency of rasterization, the selected areas where the trajectory points fall into are graded, and the direction, speed, and time indexes are established to preserve and utilize the information in the grid, as shown in Figure 2.
According to the size of the selected area, the selected area is divided into three levels, corresponding to Div, Grid, and Cell in Figure 2. Div is the partitioning placement method used in the stacked stylesheet. This research refers to the study area; Grid describes an image that has been discretized on a two-dimensional plane and refers to brightness. The grid picture can be viewed as a matrix. Any element in the matrix can represent a point in the image, and the matching value can be used to represent the point’s grayscale. Pixels are the units that make up the digital matrix. The purpose of this classification is to categorize the research area’s level of gray; a Cell is the smallest record point unit, for the extraction of the road centerline, the size of the Cell is the same as that of the vehicle, about 5 m × 5 m, and then the direction index, time index, and speed index are established for Cell, as shown in Figure 2. Among them, the direction index divides the direction into eight different ranges, in which 0~7 represents different directions, the time index is in hours, 0~23 represents different periods, and the speed index divides the speed into (0~9) 10 different ranges according to the speed of the trajectory point.
The trajectory data is processed in days. For each trajectory point in the selected area Div, the Grid to which it belongs is first recorded. Then the Cell is found to which the corresponding Grid belongs and the trajectory points in the corresponding Cell are calculated. The trajectory points’ direction, time, and speed information are also counted within the respective ranges of the Cell direction, time, and speed indexes. This is used to record the number of trajectory points in the unit and count the number of trajectory points in different directions, time periods, and speed ranges in the unit.
The multi-day trajectory data can complement some of the missing road information, so the number of different trajectory points in the same Cell for multiple days and the statistics in different index ranges are added. Using the trajectory point speed information in the Cell speed index and the trajectory point azimuth information in the azimuth index, the trajectory points with high- and low-speed ranges or unreasonable directions in the Cell statistics are removed. If there are trajectory points in Cell, the pixel value is 0 if there are none, and 255 otherwise. This allows us to acquire the grid road network image.
Then, the research region is rasterized by fusing changes in the vehicle speed, urban latitude and longitude, and GPS data-collection period. To create grid center point C and represent the research region, a grid cell of 50 m × 50 m was built. Symbols can be simplified to:
A = { C o 1 , C o 2 , C o 3 , , C o n }
where A represents the study area, Co represents the center point of the grid cell, and n represents the total number of grid points in the study area. To obtain a more accurate road network, the resolution of the grid unit can be set to a smaller value, but it is not the smaller the better, as a 1 m × 1 m grid unit cannot distinguish whether the collected valid data or positioning drift error are caused by abnormal data points. The purpose of modeling is to obtain the study area’s starting- and endpoint matrix (OD matrix). Taking point Co as the starting point and its two adjacent units in the southeast and northwest as the endpoint, the OD matrix of point Co is constructed, which is expressed as: C o C d 1 C d 2 C d 3 C d 4 . Then, the OD matrix of the entire study area can be described as: C o 1 C o 2 C o 3 C o n C d 1 C d 1 C d 1 C d 1 C d 2 C d 2 C d 2 C d 2 C d 3 C d 3 C d 3 C d 3 C d 4 C d 4 C d 4 C d 4 . where Co is the center point of the grid unit, which is called the starting point, and n is the total number of grid points in the study area. Cd1, Cd2, Cd3, and Cd4 represent the grid center points of two adjacent units in the southeast–northwest direction corresponding to the starting point, which is called the endpoint here (see Figure 3).
In this study, the working image’s center point is determined using an estimating technique based on density weight. To achieve this, the grid density is utilized as the weight, and the weighted average algorithm is used to obtain the latitude and longitude of each point’s equivalent center. Symbols can be simplified to:
l o n = k = 1 n p k p s u m × l o n k
l a t = k = 1 n p k p s u m × l a t k
p s u m = i = 1 n p i
Here, n denotes the total number of grids in the research region, psum the overall density of the n grids, pk the density of the k th grid, and (lonk, latk) the latitude and longitude of the k th grid’s center of mass. The working area’s center may be found at the computed (lon, lat) coordinates.
The map navigation service obtains the path points based on the OD matrix. If the navigation trajectory between any two points is traversed, an n × n Cartesian product (n refers to the number of grid points in the study area) will be generated, which increases the difficulty of the operation, and a large number of repeated paths will occur. If the navigation path of each point and its four neighborhoods is traversed, the complexity of the operation is reduced from n × n to n × 4. Even while it makes calculations easier, some secondary roads or branches have fewer cars going through, which means the findings will result in a discontinuous trajectory, making it challenging to maintain the acquisition of the road network. According to the efficiency of the comprehensive operation and the rationality of the trajectory, the grid center point of the two adjacent units in the southeast and northwest corresponding to the starting point is selected as the endpoint. The trajectory point set of the entire region is obtained by traversing the entire region’s target point and its corresponding endpoint. The path trajectory points include the longitude, latitude, and road type (Table 2). Symbols can be simplified to:
p = ( x , y , i )
where p represents the trajectory point, x the longitude, y the latitude, and i the navigation command for the trajectory point.
Finally, the trajectory point generates a trajectory. The map navigation service returns a set of navigation trajectory points, which needs to be converted into a trajectory. According to the set of trajectory points belonging to the same command, a trajectory l is formed in chronological order, and all trajectory points are processed in the same way to obtain the network trajectory T of the whole study area.
The trajectories obtained by the map service will overlap substantially in different parts of the same road. This study draws on the grid clustering method to perform trajectory aggregation processing [23]. In practical applications, due to a large amount of GPS data processing, taking into account the real-time performance of the algorithm, this study does not use the original GPS trajectory data but is based on the grid unit equivalent centroid coordinates, using the least squares method to fit the road curve. That is, for a pile of discrete data points, find a curve y to minimize the sum of deviations from all discrete points. The polynomial curve fitting formula is:
y = a 0 + a 1 x + + a k x k
where ak is the coefficient of the kth factor, k is the polynomial degree, the fitting result is a straight line when k = 1, and the fitting result is a curve when k ≥ 2. The binarization method processes the trajectory data into a black-and-white binary image.
Hierarchical rasterization has two advantages: (1) The entire region is divided into different levels, and only the trajectory points fall into the previous level of Cell to perform the calculation; the entire selected area is not used to perform the calculation, so the speed compared to the traditional rasterization is improved. (2) Information such as the speed and direction of the track point is reserved in the grid point, and it can be used to judge whether the track point is reasonable or not, and the noise point can be removed to improve the accuracy of the grid road network image. However, there are two main points of hierarchical rasterization: (1) According to the selected area, the larger the area, the greater the amount of grading, and the minor area level must be Cell; the Grid between Cell and Div can be divided into multi-level according to the area. (2) The Cell size needs to be controlled; too much detailed information is easy to lose, and too small a neighborhood lane will increase the difficulty of centerline extraction.

2.2. Application of Fast Expansion and Improved Thinning Algorithm in Road Network Extraction

In the grid road network image obtained, there are cases wherein missing data lead to road disconnection and adjacent lanes. The separate road sections need to be connected, and the road centerline extraction needs to be merged with adjacent lanes. The expansion algorithm can expand the road section to the surrounding area, connect the road section, and merge the lanes. A simple and fast expansion algorithm is to scan the pixels in the image in turn, find the pixels with a gray value of 255, analyze the eight pixels around the pixel, and set the gray pixel value to 0 only if there are pixels with a gray value of 0 in the eight pixels.
After the expansion algorithm is processed, the road network image shows the characteristics of the road network. However, the road section is a multi-pixel width, and the extraction of the road section is more complex. The thinning algorithm can maintain the topological structure of the black part of the image. After processing, it shows a single pixel width. The skeleton line in the image is the center line of the road. This study uses an improved Zhang–Suen thinning algorithm. Binarizing the expanded image, for the point with road information (pixel value is 0), its pixel value is set to 1, called the target point, and for the non-road information point (pixel value is 255), its pixel value is set to 0, called the background point; a simple binary image can thus be obtained.
The process of improving the Zhang–Suen thinning algorithm is as follows:
(1)
Find eight neighborhoods centered on the target point, and the center point is P1. The eight points in the neighborhood are recorded as P2, P3,..., P9, and P2 on P1. Mark pixels that meet the following conditions:
2 ≤ N (P1) ≤ 6
S (P1) = 1 or B (P1) ∈{ 65, 5, 20, 80, 13, 22, 52, 133, 141, 54}
P2 × P4 × P8 = 0 or S (P2) ! = 1
P2 × P4 × P6 = 0 or S (P4) ! = 1
(2)
Remove all points that meet the mark, and assign the gray value of the target point to 0 and the gray value of the background point to 255.
Where N (P1) is the number of non-zero neighbors in the 8-neighborhood of P, and S (P1) is the number of changes from 0 to 1, starting from any point in the 8-neighborhood around P, in turn, along the counterclockwise direction. B (P1) is the binary code S of the P1 neighborhood point. The calculation formula is shown in the formula below, and the value of Pi is its pixel value.
S = Σ n = 2 9 ( P n × 2 n 2 )
The improved Zhang–Suen thinning algorithm uses a 5 × 5 neighborhood and changes the original Zhang–Suen algorithm thinning conditions based on increasing the neighborhood range. The effects of three thinning algorithms for an intersection are shown in Figure 4. The light color part is the effect of the intersection after expansion, and the dark part is the effect after thinning. Figure 4a shows the Zhang–Suen thinning algorithm, which is not a single pixel width after thinning, and the effect at the intersection of the road section is poor. Figure 4b is the improved thinning algorithm in [8], only changing Figure 4a to a single pixel width; Figure 4c is an improved Zhang–Suen refinement algorithm for this study. After refinement, it is a single pixel width, the intersection is more intuitive and realistic than the first two, and the refinement effect is better.
Figure 4a–c compared to the intersection topology changes, mainly because the refinement process not only depends on the previous iteration results but also depends on the iteration that has been processed pixel distribution. The number of iterations, neighborhood size, and refinement conditions significantly relate to one another. The essence of the three is to refine the multi-pixel width section into a central line. Although the local details, such as intersections, may be different, the overall direction of the section is consistent. There are still isolated and fewer pixels in the refined image, which is caused by noise such as drift or missing trajectory data. For extracting road segments, this part of the pixel is worthless and needs to be filtered.

2.3. Extracting Road Segments from Refined Road Network to Construct Vector Road Network

The ranks of pixels are their coordinates, and pixel coordinates represent the extracted road sections, road intersections, and endpoints. Before extracting the road section, it is necessary to obtain the road section’s end points and intersection points. The idea is to determine the number of other target pixels in the 3 × 3 neighborhood of a target pixel (pixel value is 0). If there are 2, the pixel is the endpoint, and more than 3 are the intersection points.
The road section extraction is divided into two categories. The first is the road section from the endpoint to the endpoint or the intersection. For example, in Figure 4, the number is one part of the road section, and the number is three parts of the intersection. As the refined image is a single pixel width, it can start from each endpoint, in turn, in the 3 × 3 neighborhood to find and store points with gray values, and then start with the stored points and repeat the search. When other intersections or endpoints stop, the road section can be extracted from the endpoint to the endpoint or intersection. The second is the road section between the intersection and intersection points. As shown in Figure 5, the number is two, starting from where each pixel value in the 3 × 3 neighborhood of a specific intersection is 0, similar to extracting the road section from the endpoint. However, when another intersection stops, all road sections between the intersection point and the intersection point can be extracted.
Based on the endpoints and intersections, all the road sections in the refined image are extracted, but the road sections composed of fewer pixels are not necessary. This type of road section is mainly composed of the road sections between the adjacent intersections in the image (which are the same road intersections in reality). These road sections need to be deleted, and then these intersections are merged into one intersection, and the dark part is the intersection, as shown in Figure 6b.
After the test, set to delete the lengths of less than or equal to 4 pixels of the road, most of which are closer to the intersection of the road section between the changes before and after the removal of some of the intersections; Figure 6a,b show two different intersections.
After deleting the short-circuit section, the point must be a natural intersection if an intersection is connected to more than three sections simultaneously. According to this idea, this kind of intersection is obtained, such as in Figure 6a. After this step, some fundamental intersection points are obtained; secondly, if there are other intersections around the intersection, as shown in Figure 6b, after deletion, it is only necessary to start from a particular intersection, search for other intersections within the neighborhood of the intersection, and use these intersections to interpolate a new intersection. The intersection in these road sections is replaced with a new intersection. One intersection can be connected to multiple road sections to obtain a natural intersection after merging, as shown in Figure 6b. The road section is better connected after merging the intersection, and a natural intersection is connected to at least three road sections.
Before constructing the vector road network, the DP (Douglas–Pucker) algorithm is used to compress the road segments. The DP algorithm makes the original curve closer to the natural road and maintains a smooth shape, but an appropriate threshold must be selected [24,25,26,27,28,29]. The threshold selected in this study is a 0.8-pixel width, which represents the accurate distance of 4 m. The DP algorithm compresses the extracted road section. Since the endpoint of the compressed road section must be the endpoint or the real intersection point, it is only necessary to generate vector road sections for each road section in turn. GPS trajectory data contain latitude and longitude coordinates, which are fitted after data processing and combined with the road network coordinates of the shp file exported by OSM, by calling the GDAL library, reflected in ArcGIS.

3. Results

The script for generating trajectory points is written in the Python language. The input parameters are the grid coordinate points of the research area, and the returned parameters are trajectory points p. The trajectory points are generated in ArcGIS software, and then the road network of the study area is obtained by rasterizing the trajectory and vectorizing the raster data. The study area is Harbin City in Northeast China, with a total area of 10,249 km2. The road network of the main urban area of Harbin was obtained, as shown in Figure 7.
To evaluate the effect of the extracted road network, this study first overlays the extracted road network with the OSM road. As shown in Figure 8, it can be seen that the OSM road network can reflect the road network structure of the main roads in the main urban area of Harbin, but there are severe deficiencies in the secondary roads and branches. The road network extracted in this paper and the existing roads of OSM also added a more detailed road network structure. To further evaluate the correspondence between the extracted road network and the existing road network, the extracted road network is compared with the map of the network (Figure 8). On the whole, the extracted road network can better reflect the topological structure of the urban road network, and the roads at all levels have a high degree of coincidence. The network map is used as a reference to manually vectorize a small part of the road network as the evaluation area, and the buffer method proposed in [30,31] is used to evaluate the road network accuracy of the evaluation area. A specific range of buffers is made for the extracted road network. The extraction accuracy is the ratio of the actual extracted road network length falling into the buffer zone to the total length of the existing road network. As the road network extracted intuitively has a high degree of agreement with the network, a smaller buffer with a radius of 5 m is set, and a road network extraction accuracy of 91.39% is obtained.

4. Discussion

In this study, 3,515,387 taxi trajectory data from 202,205,011 to 202,205,018 in a 5 km × 5 km area of Harbin were used for experiments. The minimum grid cell is 5 m × 5 m, the traditional grid method needs 22,749 ms, and the hierarchical rasterization only needs 21,317 ms. Compared with the two, the speed of hierarchical rasterization is improved. The road network of the selected area after hierarchical rasterization, expansion, refinement, and vectorization after denoising is shown in Figure 9.
Compared with Figure 9b, the road centerline in Figure 9a is obvious. The adjacent lanes are merged, and the road sections are disconnected, which is more conducive to extracting the road centerline, but the road centerline is a non-single pixel width. Figure 9c shows the region’s refined and denoised road network image. The road width is a single pixel width consistent with the road network topology information in Figure 9b. Figure 9d is the vector road network extracted based on Figure 6a. The road center lines in the two figures are the same, which shows that the generation process of the vector road network is correct. To illustrate the effectiveness of the generated road network, this study selects two areas in the vector road network to compare with the corresponding network map road network. As shown in Figure 10, the road sections in the vector road network are close to the road sections in the entire map road network. The topological structure characteristics of the road sections are the same. However, there are some missing road sections, mainly because the trajectory data do not cover these areas or the number of trajectory points in the area is small. Then, the latitude and longitude coordinates of the extracted real intersection point are compared with the actual road intersection point coordinates. Points A, B, and C of area 2 in Figure 10 are selected. The comparison results are shown in Table 3. By comparison, the coordinate difference is about 30 m, and this kind of error is mainly caused by turning the intersection into a point in extracting the intersection, which further proves that this study is feasible to extract the road network from the taxi trajectory data.

5. Conclusions

The vector road network data in the existing land spatial planning are challenging to obtain but are particularly needed in project planning, especially at the city level. The trajectory generation and road network extraction method based on map API proposed in this study has a good extraction effect, effectively solving the road network acquisition problem in spatial planning. Additionally, it provides a new idea for fine-scale road network extraction research. The method proposed in this study has the following advantages: (1) The trajectory generation method based on map API provides a new data source for the study of road network extraction; (2) This method is not limited to urban-level trajectory generation and road network extraction, but also applies to trajectory generation and road network extraction in smaller areas. For small areas, it is necessary to change the driving path service of the map API to the walking path service; (3) The existing Internet map API data are open-source sound, have low development difficulty, are convenient for non-computer professionals to learn quickly, and are conducive to the rapid development of the project and research. In addition, aiming at the problem of constructing a vector road network by using trajectory data, this paper proposes to use hierarchical rasterization and an improved Zhang–Suen refinement algorithm to realize fast extraction of the road network. The proposed road network generation algorithm only calculates the road’s center line and does not obtain detailed information such as the number of lanes. In future work, refined attributes such as the lane number and lane direction can be determined according to the distribution characteristics of GPS trajectory.

Author Contributions

Methodology, W.W. and W.Z.; formal analysis, W.W.; data curation, W.Z.; writing—original draft preparation, W.W.; writing—review and editing, W.Z.; visualization, W.W.; supervision, W.Z.; project administration, W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the China Fundamental Research Funds for the Central Universities Category D Project Carbon Neutralization Project (No. 2572021DT09).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank Hongxing Deng for providing us with the network data sets, his insightful feedback also pushed us to sharpen our thinking and brought our work to a higher level.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Andrade, A.C.; Alixandrini, M.J., Jr.; Carvalho, F.P.S.; Fernandes, V.O. Performance of The Support Vector Machine and Artificial Neural Network Classifiers For Roads Identification. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci 2021, XLIII-B3-2021, 9–14. [Google Scholar] [CrossRef]
  2. Ziems, M.; Rottensteiner, F.; Heipke, C. Verification of road databases using multiple road models. ISPRS J. Photogramm. Remote Sens. 2017, 130, 44–62. [Google Scholar] [CrossRef]
  3. Yang, X.; Wang, F.; Yang, X.; Zhou, Q. Fractal dimension in concrete and implementation for meso-simulation. Constr. Build. Mater. 2017, 143, 464–472. [Google Scholar] [CrossRef]
  4. Alattar, M.A.; Cottrill, C.; Beecroft, M. Public participation geographic information system (PPGIS) as a method for active travel data acquisition. J. Transp. Geogr. 2021, 96, 103180. [Google Scholar] [CrossRef]
  5. Sanhory, A.; El-Tahan, M.; Moghazy, H.M.; Reda, W. Natural and manmade impact on Rosetta eastern shoreline using satellite Image processing technique. Alex. Eng. J. 2021, 61, 6247–6260. [Google Scholar] [CrossRef]
  6. Soni, S.; Kaur, S. To Propose an improvement in Zhang-Suen algorithm for image thinning in image processing. Int. J. Sci. Technol. Eng. 2016, 3, 7481. [Google Scholar]
  7. Cai, L.; Jiang, F.; Zhou, W.; Li, K. Design and Application of an Attractiveness Index for Urban Hotspots Based on GPS Trajectory Data. IEEE Access 2018, 6, 55976–55985. [Google Scholar] [CrossRef]
  8. Cantisani, G.; Del Serrone, G. Procedure for the identification of existing roads alignment from georeferenced points database. Infrastructures 2020, 6, 2. [Google Scholar] [CrossRef]
  9. Dai, H.; Zhang, R.; Beer, M. A new method for stochastic analysis of structures under limited observations. Mech. Syst. Signal Process. 2023, 185, 109730. [Google Scholar] [CrossRef]
  10. Dai, H.; Zhang, R.; Beer, M. A new perspective on the simulation of cross-correlated random fields. Struct. Saf. 2022, 96, 102201. [Google Scholar] [CrossRef]
  11. Anderson, J.; Soden, R.; Anderson, K.M.; Kogan, M.; Palen, L. EPIC-OSM: A Software Framework for OpenStreetMap Data Analytics. In Proceedings of the 2016 49th Hawaii International Conference on System Sciences, Koloa, HI, USA, 5–8 January 2016; pp. 5468–5477. [Google Scholar]
  12. Bridgelall, R. Using artificial intelligence to derive a public transit risk index. J. Public Transp. 2022, 24, 100009. [Google Scholar] [CrossRef]
  13. Justiniano, E.F.; Junior, E.R.d.S.; de Melo, B.M.; Siqueira, J.V.N.; Morato, R.G.; Fantin, M.; Pedrassoli, J.C.; Martines, M.R.; Kawakubo, F.S. Proposal for an index of roads and structures for the mapping of non-vegetated urban surfaces using OSM and Sentinel-2 data. Int. J. Appl. Earth Obs. Geoinf. 2022, 109, 102791. [Google Scholar]
  14. Nduji, N.N.; Madu, C.N.; Okafor, C.C. A Low-Cost Web Application System for Monitoring Geometrical Impacts of Surface Subsidence. Sustainability 2022, 14, 14240. [Google Scholar] [CrossRef]
  15. Zhang, B.; Zhuo, K.; Lan, Y.; Chen, D. Intelligent Movement Method for Trajectory Map Matching. In Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, Virtual, 17–19 December 2021; pp. 1705–1709. [Google Scholar]
  16. Schroedl, S.; Wagstaff, K.; Rogers, S.; Langley, P.; Wilson, C. Mining GPS traces for map refinement. Data Min. Knowl. Discov. 2004, 9, 59–87. [Google Scholar] [CrossRef]
  17. Worrall, S.; Nebot, E. Automated Process for generating Digitised Maps Through GPS Data Compression. In Proceedings of the Australasian Conference on Robotics and Automation, Brisbane, Australia, 10–12 December 2007. [Google Scholar]
  18. Li, J.; Qin, Q.; Xie, C.; Zhao, Y. ZIntegrated use of spatial and semantic relationships for extracting road networks from floating car data. Int. J. Appl. Earth Obs. Geoinf. 2012, 19, 238–247. [Google Scholar]
  19. Bruntrup, R.; Edelkamp, S.; Jabbar, S.; Scholz, B. Incremental Map Generation with GPS traces. In Proceedings of Intelligent Transportation Systems, Vienna, Austria, 16 September 2005; pp. 574–579. [Google Scholar]
  20. Chen, D.; Guibas, L.J.; Hershberger, J.; Sun, J. Road network reconstruction for organizing paths. In Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 17–19 January 2010; pp. 1309–1320. [Google Scholar]
  21. Yang, W.; AI, T.H. The extraction of road boundary from crowdsourcing trajectory using constrained Delaunay triangulation. Acta Geod. Cartogr. Sin. 2017, 46, 237–245. [Google Scholar]
  22. Chen, C.; Cheng, Y.H. Roads Digital Map Generation with Multi-track GPS Data. In Proceedings of the International Workshop on Education Technology and Training & 2008 International Workshop on Geoscience and Remote Sensing, Shanghai, China, 21–22 December 2008; pp. 508–511. [Google Scholar]
  23. Shen, L.; Lu, J.; Long, M.; Chen, T. Identification of Accident Blackspots on Rural Roads Using Grid Clustering and Principal Component Clustering. Math. Probl. Eng. 2019, 2019, 2151284. [Google Scholar] [CrossRef]
  24. Xie, J.; Wang, M.; Lu, X.; Liu, X.; Grant, P.W. DP-k-modes: A self-tuning k-modes clustering algorithm. Pattern Recognit. Lett. 2022, 158, 117–124. [Google Scholar] [CrossRef]
  25. Dey, T.K.; Wang, J.; Wang, Y. Road network reconstruction from satellite images with machine learning supported by topological methods. In Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Chicago, IL, USA, 5–8 November 2019; pp. 520–523. [Google Scholar]
  26. Shang, F.; Xu, T.; Liu, Y.; Liu, H.; Shen, L.; Gong, M. Differentially Private ADMM Algorithms for Machine Learning. IEEE Trans. Inf. Forensics Secur. 2021, 16, 4733–4745. [Google Scholar] [CrossRef]
  27. Reimel, K.J.; Kumjian, M. Evaluation of K-DP Estimation Algorithm Performance in Rain Using a Known-Truth Framework. J. Atmos. Ocean. Technol. 2021, 38, 587–605. [Google Scholar] [CrossRef]
  28. Wu, Y.; Zhao, Q.; Li, Y. Top-to-down segment process based urban road extraction from high-resolution remote sensing image. Egypt. J. Remote Sens. Space Sci. 2022, 25, 851–861. [Google Scholar] [CrossRef]
  29. Jayasinghe, A.; Sano, K.; Abenayake, C.C.; Mahanama, P. A novel approach to model traffic on road segments of large-scale urban road networks. MethodsX 2019, 6, 1147–1163. [Google Scholar] [CrossRef] [PubMed]
  30. Goodchild, M.; Hunter, G. A Simple Positional Accuracy  Measure for Linear Feature. Int. J. Geogr. Inf. Syst. 1997, 11, 299–306. [Google Scholar] [CrossRef]
  31. Wang, M.; Yang, X.; Wang, W. Establishing a 3D aggregates database from X-ray CT scans of bulk concrete. Constr. Build. Mater. 2022, 315, 125740. [Google Scholar] [CrossRef]
Figure 1. Road network extraction technology based on map API.
Figure 1. Road network extraction technology based on map API.
Sustainability 14 14363 g001
Figure 2. Number indexing scheme.
Figure 2. Number indexing scheme.
Sustainability 14 14363 g002
Figure 3. OD matrix diagram.
Figure 3. OD matrix diagram.
Sustainability 14 14363 g003
Figure 4. The effect comparison of improved Zhang–Suen thinning algorithm. (a) Zhang–Suen thinning algorithm. (b) Improved Zhang–Suen thinning algorithm. (c) This study improves Zhang–Suen thinning algorithm.
Figure 4. The effect comparison of improved Zhang–Suen thinning algorithm. (a) Zhang–Suen thinning algorithm. (b) Improved Zhang–Suen thinning algorithm. (c) This study improves Zhang–Suen thinning algorithm.
Sustainability 14 14363 g004
Figure 5. Road extraction diagram.
Figure 5. Road extraction diagram.
Sustainability 14 14363 g005
Figure 6. Road section changes at some intersections. (a) Intersection Situation 1. (b) Intersection Situation 2.
Figure 6. Road section changes at some intersections. (a) Intersection Situation 1. (b) Intersection Situation 2.
Sustainability 14 14363 g006
Figure 7. Road network map of Harbin.
Figure 7. Road network map of Harbin.
Sustainability 14 14363 g007
Figure 8. Comparison of the extracted road network and OSM road network.
Figure 8. Comparison of the extracted road network and OSM road network.
Sustainability 14 14363 g008
Figure 9. Different types of road networks. (a) Rasterization road network. (b) Expansion road network. (c) Denoising road network. (d) Vector road network.
Figure 9. Different types of road networks. (a) Rasterization road network. (b) Expansion road network. (c) Denoising road network. (d) Vector road network.
Sustainability 14 14363 g009
Figure 10. Comparison between real map and road network map. (a,b) Region 1. (c,d) Region 2.
Figure 10. Comparison between real map and road network map. (a,b) Region 1. (c,d) Region 2.
Sustainability 14 14363 g010
Table 1. Partial original GPS trajectory data.
Table 1. Partial original GPS trajectory data.
Vehicle_idOrder_idUtcLongiLati
7n6dhjnad_Et0hsuh
k9ee7fjf2DE-ovu
gjhdafj6h@tz2rAy
hnga7lkgh2ytaruA
20220512T152036+08126.64830445.753281
9mde7Jj9aJEv6gsrh fk5e7d982EwanBnidlh48kcie6Dticiyyg
17i7cgb69vubcpwt
20220513T081325+08126.63752745.756314
icg876n8h7vq8hxq
gbj776hcb9Dscmyo
8hk99lh739qc]lkpz
9n9e7jk6cOAqleDq
20220513T105703+08126.59042945.730337
9mde7f)9a_JEv6gsrh fk5e7d982EyvanBmdlh48kcie6Diiciyyg
17i7cgb69vubqywt
20220515T142516+08126.52785945.814551
6nehde97f5Br. oD
fbhcffef5btE.jAp
efa5c6bf93wB8gv
wihl3hehab-rr. hBv
20220516T174506+08126.67845745.794172
ailbh7ly93DF0iux
aohe4gkl42zxllxv
gbdf38nid3DiiclDtb
9ih7ari6h.AE9duz
20220517T201538+08126.69215745.772399
Table 2. Navigation map parameter diagram.
Table 2. Navigation map parameter diagram.
Start_LngStart_LatEnd_LngEnd_LatRoad_Types
126.64104645.723925126.63714145.725064main road
126.63714145.725064126.64598145.737825main road
126.64598145.737825126.63061845.746900secondary road
126.63061845.746900126.64302045.757201main road
126.64302045.757201126.63490945.761033main road
Table 3. Comparison of intersection coordinates in region 2.
Table 3. Comparison of intersection coordinates in region 2.
IDCoordinates in (a) (°)Coordinates in (b) (°)Gap/m
A126.57347345.747516126.57319545.74747731
B126.55871045.749762126.55857145.74989822
C126.58437445.743233126.58426245.74343526
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wen, W.; Zhang, W. Research on Urban Road Network Extraction Based on Web Map API Hierarchical Rasterization and Improved Thinning Algorithm. Sustainability 2022, 14, 14363. https://doi.org/10.3390/su142114363

AMA Style

Wen W, Zhang W. Research on Urban Road Network Extraction Based on Web Map API Hierarchical Rasterization and Improved Thinning Algorithm. Sustainability. 2022; 14(21):14363. https://doi.org/10.3390/su142114363

Chicago/Turabian Style

Wen, Wen, and Wenhui Zhang. 2022. "Research on Urban Road Network Extraction Based on Web Map API Hierarchical Rasterization and Improved Thinning Algorithm" Sustainability 14, no. 21: 14363. https://doi.org/10.3390/su142114363

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop