Next Article in Journal
Application of the Maturity Model in Industrial Corporations
Next Article in Special Issue
Analysis of the Efficiency of Transport Infrastructure Connectivity and Trade
Previous Article in Journal
Load-Bearing Performance and Safety Assessment of Grid Pile Foundation
Previous Article in Special Issue
Impact of Various Operating Conditions on Simulated Emissions-Based Stop Penalty at Signalized Intersections
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Practical and Sustainable Approach to Determining the Deployment Priorities of Automatic Vehicle Identification Sensors

1
Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
2
Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Southeast University, Nanjing 211189, China
3
School of Transportation, Southeast University, Nanjing 211189, China
*
Authors to whom correspondence should be addressed.
Sustainability 2022, 14(15), 9474; https://doi.org/10.3390/su14159474
Submission received: 19 June 2022 / Revised: 23 July 2022 / Accepted: 30 July 2022 / Published: 2 August 2022
(This article belongs to the Special Issue Sustainable Transportation and Infrastructure Systems)

Abstract

:
Monitoring vehicles’ paths is important for the management and governance of smart sustainable cities, where traffic sensors play a significant role. As a typical sensor, an automatic vehicle identification (AVI) sensor can observe the whereabouts and movements of vehicles. In this article, we introduced an indicator called the deployment score to present the deployment priorities of AVIs for a better reconstruction of vehicles’ paths. The deployment score was obtained based on a programming method for maximizing the accuracy of a recurring vehicle’s path and minimizing the number of AVI sensors. The calculation process is data-driven, where a random-work method was developed to simulate massive path data (tracks of vehicles) according to travel characteristics extracted from finite GPS data. Then, for each simulated path, a path-level bi-level programming model (P-BPM) was constructed to find the optimal layout of the AVI sensors. The solutions of the P-BPM proved to be approximate Pareto optima from a data-driven perspective. Furthermore, the PageRank method was presented to integrate the solutions; thus, the deployment score was obtained. The proposed method was validated in Chengdu City, whose results demonstrated the remarkable value of our approach.

1. Introduction

1.1. Background

With the rapid development of intelligent transportation technology, there are increasingly diverse types of sensors and detectors in traffic operations, such as in intelligent transportation systems (ITSs), traffic management and control, and traffic safety analysis and prevention [1,2,3]. Two broad categories of sensors are generally considered: counting sensors and scanning sensors [4,5,6,7]. Here, we focus on a typical scanning sensor: the AVI sensor. AVI data show unique attributes [8]: (1) a large sample size—every passing vehicle’s information and attribute information will be captured, and an AVI system can provide 24/7 service without extra requirements for vehicles; (2) accurate observation—the accurate coordinates and the passing time of the tracked vehicle can be obtained, and the accurate travel time of the tracked vehicle can also be calculated based on the timestamps of observations from neighboring AVI sensors; (3) safety and stability—every detected vehicle has a virtual electronic license plate that is scanned. Specifically, AVI data can be adopted for origin–destination (OD) demand estimation, vehicle path reconstruction, link flow inference, and travel time estimation, though counting sensors cannot.
However, full coverage of AVI sensors over a whole network is more expensive than it is for loop detectors, and their deployment is also a long step-by-step process. In consequence, seeking the optimal locations and making a construction-priority recommendation for AVI sensors in a network are crucial for a powerful AVI system.
The AVI location problem (AVI-LP) has drawn sufficient attention in the areas of OD demand estimation, link flow inference, and travel time estimation, while few studies on the vehicle-path-reconstruction-based AVI-LP have been performed. Of particular concern is that, once each vehicle’s path is accurately and completely reconstructed, OD matrix estimation, link flow estimation, and travel time estimation will be naturally achieved. Questions have been raised about the enhancement of subjects of the path reconstruction model in the AVI-LP, whereas it is more difficult to realize an identified and applicable model compared to other methods. By using a data-driven method to simulate vehicles’ moving paths—in conjunction with the bi-level programming model of each path—this research provides a new indicator in order to recommend a path-reconstruction-oriented AVI sensor location plan.

1.2. Literature Review

The section essentially involves two viewpoints and perspectives, namely, the AVI-LP and its solution methods. Gentili [9] and Castillo [4] performed a state-of-the-art review of the network sensor location problem and outlined future challenges from different perspectives, including data, variables, constraints, objective functions, and examples. In what follows, we review previous studies related to our work.
To the best of our knowledge, limited attention has been paid to path reconstruction in sensor location problems compared to other objectives, such as OD estimation [10,11], link flow inference [12,13], and travel time estimation [14]. One primary problem is that path reconstruction is a complicated process, and the representation of the estimation of its error in an optimal location model is intractable, especially in the objective function. Gentili et al. [15] solved this problem by assuming that vehicles’ routes are accessible by AVI sensors, and a bi-level optimization model was presented. A more realistic scenario without the assumption was later adopted to infer the path [16,17]. It is worth noting that the estimation of OD demand and link flow was available in terms of the results of path reconstruction, which typically proved the powerful practicability of path reconstruction, despite the existing difficulty. The effects of sensors’ order on a path and path-differentiated congestion pricing were also considered [18,19]. Furthermore, both counting sensors and AVI sensors were located strategically for path reconstruction. The study showed that AVI sensors played a more critical role than counting sensors in the path reconstruction process [20]. Additional research considered the uncertainty of the link–path matrix for the path-reconstruction-oriented sensor location model [21]. Mostafa et al. recently solved the path-reconstruction-oriented sensor location problem in an environment of connected vehicles by using four different mixed-integer programming formulations for RSUs and AVIs [22]. In short, as far as we know, the research on the path-reconstruction-oriented sensor location problem is far from being desired for theoretical research and practical applications.
The traffic sensor location problem is a typical planning problem in realistic applications. In detail, modeling a traffic sensor location problem calls for multiple objectives, evaluation criteria, and complex subjections. Bianco et al. [23] proved that the sensor location problem is NP-complete on general directed and symmetric graphs and is polynomially solvable on directed and symmetric path; to the best of our knowledge, mathematical programming approaches and algebraic methods are the most commonly used solution methods. The stochastic programming model was demonstrated to be workable when solving stochastic vehicle routing problems [24], reliable facility location problems [25], and other transportation issues; hence, it has become the most commonly used model in the relevant areas.
The solution methods for stochastic programming models are divided into three groups: exact methods, approximate methods, and heuristic methods [26,27]. A heuristic method is a generally used algorithm that can find the optimal solutions to common problems, whereas it cannot guarantee global optimality. An approximate algorithm is merely suitable for a specific problem, resulting in infrequent application. An exact method, as a novel algorithm for solving location problems, is efficient in finding global optima, and can include the decomposition-based algorithms proposed in previous years [28,29]. Recently, Paul Rubin [30] was the first to address the exact algorithm for locating counting sensors in flow observability problems. Although the study overcame several limitations of the existing approaches and performed flexibly, it depended on many heuristic steps and some assumptions, and it showed limitations when locating AVI sensors. In addition, these studies were mainly concerned with the processes of building models and finding a robust methodology, while the practicability and generalizability in engineering practice were sometimes neglected.

1.3. Research Gaps and Contributions

To the best of our knowledge, existing methods show three main limitations:
(1) There is a limited number of studies formulizing a data-driven detection error when modeling the vehicle-path-reconstruction-based AVI location problem. On the one hand, the vast majority of AVI-LP models focus on traffic flow estimation (involving path flow, link flow, and OD flow) or travel time estimation, while the path reconstruction is seldom considered. On the other hand, the detection error of traffic sensors is merely used in evaluation, rather than in the model-building process.
(2) The existing implementation strategies for the AVI-LP consider mathematical exactness and accuracy rather than practical feasibility on the management side. Researchers prefer promoting the accuracy of the models. However, this gives rise to some problems, such as worse universalizability, greater complexity, and higher computational burden, which are undesired by administrators.
(3) The existing methods mostly contribute to a unique optimal solution for a specific scenario, but this solution is not practical in the real world. In academic research, authors devote efforts to finding the unique optimal solution mathematically with a fixed penetration of the layout, which makes little sense in practical applications. However, the deployment of traffic sensors is always city-wide and on a large scale, meaning that much greater costs and more time are needed. Thus, a unique optimal layout plan cannot meet the requirements for such a long-term engineering projects in transportation planning and construction.
To remedy these gaps, we proposed a novel approach. In particular, the contributions of this study are as follows:
(1) A data-driven approach was proposed to solve the vehicle-path-reconstruction-based AVI sensor location problem. In order to build a practical, universal, and implementable model, this study found the optimal layout of AVI sensors at the path level first, and then integrated the results of all paths to analyze the region-wide layout. In particular, we simulated massive path data (tracks of vehicles) according to travel characteristics extracted from finite GPS data by using a random walk method; then, for each path, a Path-level Bi-level Programming Model (P-BPM) was constructed to find the optimal layout of the AVI sensors; lastly, we integrated these layouts by using the PageRank algorithm.
(2) The detection error of the AVI sensors was calculated and applied as one of the optimization objectives when modeling for the AVI-LP. By deriving it from the tracks of vehicles, the traffic flow distribution of an intersection can be characterized. Then, the detection error can be calculated by finding the location of the AVI sensors in each path. The detection error was applied to an objective equation to ensure that the missing observations of AVI sensors were as few as possible.
(3) A novel indicator called the deployment score was introduced. The PageRank algorithm finally output the priority ranking set of the intersections to locate AVI sensors. As a consequence, the installation sequence of the AVI sensors was determined, which is more implementable and practical in decision making.
We introduced and formulated the traditional region-level BPM (R-BPM) and proved it to be non-deterministic polynomial hard (NP-hard) (Section 2). Our data-driven framework is described, and the random work method, the decomposed BPM, and a PageRank method for the computation of the deployment score are detailed (Section 3). Furthermore, experiments were performed and the computational results were summarized (Section 4). Lastly, Section 5 presents the conclusions and limitations.

2. Problem Description and Preliminaries

2.1. Problem Description and Assumptions

A road network can be defined as a directed graph G   = V ,   E , where V is the set of network nodes, V = v 1 , v 2 , , v n , representing the intersections, and E is the set of edges, E = e 1 , e 2 , , e m , representing road links.
In the AVI-LP, an AVI sensor is established at a candidate node v i , monitoring the next node that the vehicle will pass. For example, in the network shown in Figure 1, it is assumed that there are three paths and 100% traffic flow between the origin v o and destination v d , with Path-1 ( v 8 v 9 v 10 v 16 v 22 v 28 v 29 ) conveying 60% of the traffic flow, Path-2 ( v 8 v 9 v 15 v 21 v 22 v 28 v 29 ) conveying 35% of the traffic flow, and Path-3 ( v 8 v 9 v 11 v 17 v 23 v 29 ) conveying 5% of the traffic flow. Intuitively, there should be thirteen AVI sensors to identify all paths, but this is unnecessary. As a matter of fact, with only four AVI sensors established at v 8 , v 9 , v 10 , and v 29 , these paths can be inferred entirely. v 8 and v 29 indicate the origin and destination, respectively, while v 9 and v 10 distinguish the shunt flows. If there are three AVI sensors established at v 8 , v 9 , and v 29 , Path-2 with 35% of the traffic flow can be truly inferred, while the other 65% of the traffic flow is in dispute. With the AVI sensor at v 9 , the flow through v 9 can be identified, where 35% of the traffic flow will go to v 15 , while 65% of the traffic flow will go to v 10 , so Path-1 and Path-3 cannot be identified clearly. From a data-driven point of view, if a vehicle is observed to pass along the path v 8 v 9 v 10 v 29 , we can speculate that this vehicle may pass through Path-1 with a probability of 60 % / 60 % + 5 % or through Path-3 with a probability of 5 % / 60 % + 5 % . That is to say, if 5% error is reasonably acceptable, these three AVI sensors are enough. Hence, we obtain the optimal location/allocation of AVI sensors within the allowed error range (>5%). Furthermore, together with one more AVI sensor established at v 10 , these traffic flows can be generally obtained.
To formulate the problem, the following assumptions are made:
(a) The basic topology of the task network and the information of the location candidates are known beforehand;
(b) The detection error rate of the AVI sensors is free;
(c) Each AVI sensors is placed at a node, observing the next node that the vehicle will pass;
(d) In each vehicle’s path, the first node observed to be passed in the study region is regarded as the origin node, while the last one is the destination node.

2.2. Preliminary Representation and Traditional Region-Level BPM

Here, from the region-level point of view, a simple traditional BPM can be formulated as:
min f 1 = i C x i
min f 2 = r R 1 j , k N r 1 q j k r · l j k r q r · l r a j k r · 1 x j
x i ,   a j k r 0 , 1       i C , j N r , k N r
The objective function (1) is the upper level, which is devoted to determining the optimal locations of the AVI sensors by minimizing the count and, thus, minimizing the infrastructure investment. Objective function (2) is the lower level, which is devoted to maximizing the detection rate. Figure 2 introduces the calculation of the identified error in objective function (2).
It is clear that the BPM includes O(|N|2·|R|) binary variables, O(|N|·|R|) non-negative variables, and O(|N|2·|R|) constraints, in which |N| is the number of candidate nodes and |R| is the number of vehicle paths in the network. Hence, there is a great number of variables and constraints with greater values of |N| and |R|, making it hard for us to easily obtain optimal solutions. To develop the greatest potential of AVI sensors, huge-magnitude quantities of vehicle paths are necessary, i.e., |R| will be too enormous. Then, with an analysis of the computational complexity, the relationship between the AVI-LP and the mixed-integer linear programming problem (MILPP) is given in the following remark.
Remark 1.
The BPM is one of the variants of the MILPP and is NP-hard. To be specific, if the heterogeneous distribution of paths of vehicles becomes homogeneous, the BPM degenerates into the MILPP. Since it extends the classic MILPP, the BPM is one of the variants of the MILPP. Therefore, the BPM is also an NP-hard problem.
There are mainly two barriers to the BPM. On one hand, it is difficult to solve most of NP-hard problems to optimality, especially in the real world. On the other hand, classic location programming problems focus on seeking the only optimal solution, which is not practical for recommendations for the deployment of AVI sensors. In consequence, we try to seek out a novel framework for the AVI-LP to conquer the solution and meet the real demand. The proposed framework is detailed in the following sections.

3. Model Formulation and Methodology

Figure 3 shows an overview of our approach. Firstly, we preprocessed the data and the aforesaid R-BPM was given, and it was proven to be complicated and unpractical. Secondly, a random-walk method was proposed to simulate massive paths, and for these simulated paths, a path-level BPM (P-BPM) was proposed. Thirdly, each P-BPM could be solved with the Gurobi solver. Thus, each path had an optimal layout of AVI sensors. Finally, the results of the P-BPMs of all of the paths were integrated using the PageRank method, and then we obtained the optimal AVI sensor allocation for the whole network. The remainder of this section discusses each step.

3.1. Random-Walk Method

Random-walk-based sampling methods have been widely employed to characterize social networks. In consideration of the heterogeneous distribution of vehicle trajectories, we developed a data-driven constrained random-walk method (D-RWM) to generate a certain number of vehicle routings, avoiding influential bias compared with the practical status.
The data-driven initialization step aimed at obtaining the initial path set Φ = ϕ , V o , V d , ϕ i ϕ ,   v o i V o , v ˜ d i V d , where ϕ i is the i th path generated from the D-RWM, v o i is the origin node, and v ˜ d i is the virtual destination node of ϕ i , v o i , v ˜ d i C . The path of the ith sample ϕ i is a series of chronologically ordered nodes, v o i , , , v x i , , , v d i . v ˜ d i may not be the true destination node v d i of ϕ i , as the random-walk process was partially constrained when derived from v ˜ d i . The steps were as follows. Firstly, a data-driven initialization method was introduced to initialize v o i and v ˜ d i . Then, some characteristics of candidate nodes were extracted from traffic network structure data and massive trajectory data. Finally, we declared some constraints and regulations and expatiated the simulation process and results; thus, ϕ i was decided.

3.1.1. Transferring Probability Calculation

One of the key steps in random walking is to confirm the probability of moving to the next node while ‘walking’. We assumed that a walking agent starts a walk from one node to its neighbor based on a probability distribution function, which is called the transferring probability. The transferring probability was calculated using two indexes: betweenness centrality (denoted as BC, global and static) and transfer connectivity (denoted as TC, local and dynamic).
BC is a typical measure in complex network theory, and it considers both the travel path and the link weight. It represents the global importance of a node in connecting others through through-movement, which is also called global choice [31,32]. For node i ( v i ), BC is defined as the number of the shortest paths passing v i compared to the total number of the shortest paths between any two nodes in the network [33]. The definition is represented as:
BC i = i j k n p a t h j k i p a t h j k
where p a t h j k denotes the total number of the shortest paths from v j and v k , and p a t h j k i is the number of those paths that pass v i .
TC is derived from complex network theory [34], and it considers traffic characteristics. TC of v i is defined as the number of traces passing v i compared to the total number of traces between any two nodes around v i within a specific topological connection range; three phases of consecutive nodes of node i were used. The definition is represented as:
TC i = i j k j , k ϵ T v i , 𝓆 t r i p j k v i , 𝓆 i t r i p j k v i , 𝓆
where t r i p j k v i , 𝓆 is the total number of traces from v j and v k , and t r i p j k v i , 𝓆 i is the number of those traces that pass v i ; T v i , 𝓆 is a set of encoding nodes that link directly and indirectly to v i , and 𝓆 is the proximity of nodes to v i in T i , 𝓆 .
Then, based on a logistic function, BC and TC were considered to calculate the transferring probability as follows:
p v n o w , v n e x t i = exp BC ^ i + exp TC ^ i n e x t ϵ T ¯ v i , 1 exp BC ^ n e x t + exp TC ^ n e x t
BC ^ i = BC i n e x t ϵ T ¯ v i , 1 BC n e x t   ,   TC ^ i = TC i n e x t ϵ T ¯ v i , 1 TC n e x t  
where p v n o w , v n e x t i is the probability of transferring from v n o w to v n e x t , v n o w is the node at which the agent stays at the current time, v n e x t is the node adjacent to v n o w , i.e., the agent can walk from v n o w to v n e x t (but this does not mean that the agent can walk from v n e x t to v n o w ), BC ^ i and TC ^ i are the normalized BC i and TC i , respectively, and T ¯ v i , 1 is a set of encoding nodes, as well as the nodes adjacent to v i .
On the aspect of the influence of trip purpose on random walking, we utilized the deviation angle to modify the transferring probability as follows:
p ^ v n o w , v n e x t i = f Δ θ w a l k i n g i p v n o w , v n e x t i
f Δ θ w a l k i n g i = exp cos θ w a l k i n g i 1 , 0 < θ w a l k i n g i θ t h r e s h o l d 0 ,     θ w a l k i n g i > θ   t h r e s h o l d i = exp x v n e x t x v n o w x v ˜ d i x v n o w + y v n e x t y v n o w y v ˜ d i y v n o w l v n o w , v n e x t + l v n o w , v ˜ d i 1 , 0 < θ w a l k i n g i θ   t h r e s h o l d i 0 ,     θ w a l k i n g i > θ   t h r e s h o l d
where p ^ v n o w , v n e x t i is the modified transferring probability, θ w a l k i n g i is the angle between the direction of the destination and the current walking direction, i.e., the angle between the vector from v n o w to v n e x t and the vector from v n o w to v ˜ d i , θ t h r e s h o l d is an angle threshold to ensure that the agent walks toward the destination and avoids a circular path, x v n o w ,   y v n o w and x v n e x t ,   y v n e x t are the position coordinates of v n o w and v n e x t , respectively, and l v n o w , v n e x t and l v n o w , v ˜ d i are the straight-line distances of v n o w to v n e x t and v n o w to v ˜ d i , respectively. Thus, according to Equations (6)–(9), the modified transferring probability function could be rewritten as follows:
p ^ v n o w , v n e x t i = exp x v n e x t x v n o w x v ˜ d i x v n o w + y v n e x t y v n o w y v ˜ d i y v n o w l v n o w , v n e x t + l v n o w , v ˜ d i 1 exp BC ^ i + exp TC ^ i n e x t ϵ T ¯ v i , 1 exp BC ^ n e x t + exp TC ^ n e x t ,     0 < θ w a l k i n g i θ t h r e s h o l d 0 ,     θ w a l k i n g i > θ t h r e s h o l d

3.1.2. Simulation Process

We introduced the procedure of implementing the random-walk simulation proposed by Xuebin [35]. To begin with, an OD pair ( v o i , v ˜ d i ) in ϕ i will be selected in sequence, while v o i is the starting node of the walk and v ˜ d i is used to set the walking rules. Then, two stopping criteria for the agent were set: arriving at v ˜ d i or reaching a length threshold for the walking journey. The length threshold is calculated with the following formula:
l m a x = E l + S l 2 log 1 1 R 1 cos 2 π R 2
where E l is the expectation and S l is the standard deviation of the trip distance of the residents, and R 1 and R 2 are randomly generated numbers ranging from 0 to 1.
Next, the second point is selected according to the transferring probability and other rules. At the same time, the nodes contained in the walking path are checked for duplication to avoid a repeated selection of road segments. When a valid node—namely, not a repeated node or one in a circular path—is added to the walking path, the total length of the current walking track is calculated. On the condition that the total length exceeds the threshold length or the walking agent arrives at v ˜ d i , the random walking trip will stop. The nodes that the walking agent passes, one after the other, will be assigned to ϕ i . Algorithm 1 shows the random-walk process in detail.
Algorithm 1: The pseudo-code for the random walk
Input: The surveyed OD matrix, observed trajectory data of the vehicles
Output: The path set Φ = ϕ , V o , V d
1:Initialization of V o and V d in Φ , the length threshold l m a x
2:Calculate the transferring probability p v n o w , v n e x t i between each pair of nodes;
3:for i = 1   t o   n r do
4:    initialize ϕ i to null and add v o i into ϕ i ; walking agent begins at v o i ; p ^ v n o w , v n e x t i ;
5:    while the total length of ϕ i does not exceed l m a x and the walking agent is not at v d i  do
6:        select one of the agent’s adjacent nodes according to the p ^ v n o w , v n e x t i
7:        if ϕ i is not a repeated or circular path do
8:          add the selected adjacent node into ϕ i ;
9:        else do
10:          select another one of the agent’s adjacent nodes according to the p v n o w , v n e x t i , return to line 7;
11:        end if
12:        let walking agent move to the last added node;
13:return  ϕ i
14:end for

3.2. Formulating the P-BPM of Each Simulated Path

With the random-walk method, a mass of data-driven paths were produced. Then, a path-level BPM of these paths was built, instead of one at the region level. It is apparent that the P-BPM is simpler due to its lower dimensions, and it is more robust because the drawback of the heterogeneous distribution of paths no longer exists. The elements of the path were definite in the P-BPM, meaning that the node, link, and their topological relation were already known. A detailed description is provided in the subsequent section.

3.2.1. Problem Restatement

To better understand the P-BPM, we first introduce two data-driven parameters, the drop-fraction at the node (DN) and the drop-fraction at the link (DL), which are both calculated based on massive trajectory data to produce the ground-truth vehicle paths. The definitions are provided below, with Figure 4 as an illustration.
DN: The accumulated proportion of the traffic flow that does not pass through node v i in a path is denoted as φ i . In Figure 4,
φ 1 = 0 f l o w 0 = 0 ,   φ 2 = f l o w 0 f l o w 3 f l o w 0 = f l o w 1 f l o w 0 ,   φ d = f l o w 4 f l o w 0 = f l o w 1 + f l o w 2 f l o w 0 .
DL: The proportion of the traffic flow that does not pass through link l i j from node v i to node v j + 1 is denoted as ω i j . In Figure 4,
ω o 1 = 0 f l o w 0 = 0 , ω 1 2 = f l o w 1 f l o w 0 , ω 2 d = f l o w 2 f l o w 0
Visibly, φ i = ω i 1 i , since φ i is global and ω i 1 i is local. In particular, once an AVI sensor is established at v i , φ i will be reset to 0.
The developed P-BPM is formulated according to the example exhibited in Figure 5.
In this example, the selected path contains six nodes ( v 1 ,   v 2 ,   v 3 ,   v 4 ,   v 5 ,   and v 6 ) and five links ( l 12 , l 23 , l 34 , l 45 , and l 56 ), while the DL of each link is 0%, 10%, 8%, 9%, and 6%, and the distance of each link is 600, 800, 1500, 700, and 1000 m. Two allocation scenarios for AVI sensors are given as samples. In Scenario 1, an AVI sensor was established at   v 3 , since the distance of l 34 was at its maximum, while in Scenario 2, an AVI sensor was established at   v 4 , since φ 5 was at its maximum. It is noticeable that these two scenarios lead to different detection performance, while the cumulative RF s of each link are 100%, 90%, 100%, 91%, and 85.5% in Scenario 1 and 100%, 90%, 82.8%, 100%, and 94% in Scenario 2. As for the detection error, as shown in Figure 5, we get:
d e t e c t i o n   e r r o r   Scenario   1 = 4.36 % ,   d e t e c t i o n   e r r o r   Scenario   2 = 5.68 %
The allocation in Scenario 1 is superior to that in Scenario 2; thus, v 1 ,   v 3 , and v 6 are the suggested nodes for locating the AVI sensors in this example. Therefore, we developed a P-BPM with the aim of finding the optimal allocation scenario with a minimal d e t e c t i o n   e r r o r   , i.e., a maximal detection rate.

3.2.2. Formulating the P-BPM

In each definite artificial path obtained in Section 3.1, a P-BPM was reformulated as follows.
min f = 1 i N r 1 φ i · l i i + 1 r l a i i + 1 r · 1 x i + K · i C x i
This was subject to:
i N r , i 1 x i 2
φ i a i i + 1 r · 1 x i > ε q , i N r
1 l j k r l r a j k r · 1 x i > ε l , i , j N r
φ i 1 + ω i 1 i φ i · 1 x i = 0
φ i · x i = 0
x i ,   a j k r 0 , 1       i C , j N r , k N r
Let K be the penalty factor in the objective function (12), and it decides the priority between the maximum performance and minimum cost. The objective function (12) aims at minimizing the detection rate and the count of AVI sensors, while minimum detection rate refers to the maximum performance, and the minimum count of AVI sensors refers to the minimum cost. The default value of K is set to 1 2 n c , where n c is the number of candidate nodes.
Let ε q represent the maximum reliable DN, and let ε l represent the maximum of reliable links in proportion to the total links of path r. ε q and ε l are both constant and are the critical criteria for whether or not to place an AVI sensor. Constraints (13) ensure that there are at least two AVI sensors in each path to be used to detect the origin and destination, which is an essential requirement for observing where a vehicle enters and leaves the study area. Constraints (14) ensure that if, the drop-fraction at node i is greater than ε q , an AVI sensor is required, and constraints (15) ensure that, if the distance between node j and node k exceeds ε l (the distance of path r), an AVI sensor is required at node j. Constraints (16) indicate the correlations between the DN values of the consecutive points in order to determine if an AVI sensor is required. Generally speaking, φ i = j = max i | x i = 1 i ω j 1 j . Constraints (17) indicate that, once an AVI sensor is established at   v i , φ i will be reset to 0. Constraints (18) define the binary variables x i and a j , k r .
In particular, constraints (16) can be linearized using the big-M method [21].
φ i 1 + ω i 1 i φ i x i · M
where M is a series of constants that are large enough to create no limits on the original constraint and variable.
Finally, as a powerful and user-friendly commercial solver in mixed-integer linear programming, the Gurobi solver was used to solve the P-BPM, with Python 3.6 as a modeling language [36].

3.3. Calculating the Deployment Score

Once candidate node v i is allocated an AVI sensor, the score of v i increase by one; after all of the random-walk paths have obtained their optimal allocations, each candidate node receives its final score, and the cumulative score is the deployment score. The one with the highest deployment-score should be given a higher priority for allocation of an AVI sensor. Let the vector S = s v 1 , s v 2 , , s v n , where s v i represents the number of deployment scores of v i .
We assumed that the simulation paths based on random walks closely approximated the realistic paths. Then, the set of the candidate nodes’ deployment scores was the solution with approximate Pareto global optimality. Theorem 1 and Proof are provided in the following.
Theorem 1.
Among a certain number of single-path AVI-LPs, the candidate node  v i (or a set of nodes denoted as  V i   t o   j ) with the maximum time that is regarded as the optimal solution (i.e.,  v i  is one of the nodes in the optimal layout) is the solution of this location problem at the regional level with approximate Pareto global optimality.
Proof 1.
χ denotes the number of single-path AVI-LPs (denoted as single problems). Assume that each candidate node receives the same score after solving χ 1 single problems. In the χ th single problem, v i (or V i   to   j ) is the optimal solution at the path level. If v i (or V i   to   j ) is not chosen as the optimal solution at the regional level, the objective function’s value will be worse because the objective function’s value for the χ th single problem is worse, while the values of the objective function for the other χ 1 single problems remain the same. Hence, v i or V i   to   j ) with the maximum deployment-score is the Pareto optimal solution for these χ paths at the regional level. We call v i (or V i   to   j ) the approximate Pareto optimal solution due to the assumption that the simulation paths closely approximated realistic paths. □
To avoid ignoring the effects of the topological characteristics of the road network in the AVI-LP, the well-known PageRank algorithm was employed for further enhancement. Though it is widely known in search engine optimization, PageRank can also be applied to the measurement of a road network as follows:
γ j = 1 q n 1 n + q A γ j 1
where γ j = P r j v 1 , P r j v 2 , , P r j v n is the PageRank vector in the j th iteration, P r j v i is the deployment score of v i in the j th iteration, n is the total number of candidate nodes, q is the decay factor (this normally takes the value of 0.85), 1 n is an n-dimensional vector whose entries are 1 and A stands for the adjacency matrix indicating the adjacent relations between candidate nodes. Letting γ 0 = S , we can get the final γ , i.e., the modified deployment score of each candidate node. Thus, the priority of candidate nodes for which an AVI sensor is suggested can be obtained.

4. Experiments

4.1. Dataset

(1) Road Network: In our experiments, a part of the road network of Chengdu city in China, with a total length of 257 km, was used, as shown in Figure 6. This city-wide network graph covers a 3.5 × 5.2 km spatial range containing 587 nodes and 1397 road links.
(2) GPS Dataset: The GPS dataset was collected in Chengdu city from October 1st to October 31st, 2016, with support from Didi Chuxing. In the third quarter of 2016, Didi Intelligent Travel generated a total of 1 billion GPS trajectory records, which contained 35,450 cars and 202,505 orders in a region of 65 km2 in the northeast section of the Second Ring Area in Chengdu. Each point of the GPS trajectories of the trips was matched to a real road to ensure that the data could correspond to actual road information, as shown in Figure 7.
(3) AVI Dataset: Some virtual AVI sensors could be assumed to be located at the intersections of the road network by reference to the GPS trajectory data. If an intersection was assumed to be a location for an AVI sensor, the recorded GPS data within this intersection were valid as observed AVI data; otherwise, they were abandoned.

4.2. Evaluation Methodology

4.2.1. Implementation Procedure

A path-inferring algorithm used to infer the whole path is presented in Algorithm 2. Some path sets were defined as follows (as shown in Figure 8), where ψ = 1 , 2 , .. , n evaluation , n evaluation is the total number of evaluation samples, ϕ shortest ψ , ϕ popular ψ , and ϕ inferred   ψ denote the ψ th path, and v i ,   v j C .
The ground-truth path set: Φ t r u e   = ϕ t r u e   ψ contains the true paths of vehicles from the GPS data, and it is used as a reference for evaluation in the next section;
The shortest-path path set: Φ shortest = ϕ shortest v i v j contains the shortest paths between two arbitrary candidate nodes derived from the road map, and it serves as supplementary path data;
The most-popular path set: Φ popular = ϕ popular v i v j contains the most common paths between two arbitrary candidate nodes derived from ϕ t r u e   ψ , and it serves as supplementary path data;
The detected path set: Φ detected   = ϕ detected   ψ contains the paths detected by AVI sensors, and it is obtained from Φ t r u e   , which may be incomplete and may need to be complemented using Φ shortest or Φ popular ;
The inferred path set: Φ inferred   = ϕ inferred   ψ contains the paths inferred from Φ detected   , Φ shortest , and Φ popular , where the missing part of path ϕ detected   ψ is interpolated with the path in Φ shortest or Φ popular . The details are shown in Algorithm 2.
Algorithm 2: The pseudo-code for inferring the whole detected path
Input: The candidate graph G , Φ t r u e   , BC ^ i and TC ^ i of each candidate node
Output: Φ inferred  
1:Initialize Φ shortest and Φ popular from the road map and Φ t r u e , respectively
2:
3:for ψ = 1   to   n evaluation    do
4:     initialize ϕ detected   ψ from ϕ true   ψ , ϕ inferred   ψ = ϕ detected   ψ
5:     for each segment from v x 1   ψ to v x 2   ψ in ϕ detected   ψ do
6:         if  v x 1   ψ is not adjacent to v x 2   ψ do
7:          remove the path ϕ shortest v x 1   ψ v x 2   ψ   v x 1   ψ to v x 2   ψ from Φ shortest
8:          remove the path ϕ popular v x 1   ψ v x 2   ψ   v x 1   ψ to v x 2   ψ from Φ popular
9:          calculate the total value v a l u e shortest v x 1   ψ v x 2   ψ of the nodes in ϕ shortest v x 1   ψ v x 2   ψ except v x 1   ψ and v x 1   ψ
10:          calculate the total value v a l u e popular v x 1   ψ v x 2   ψ of the nodes in ϕ popular v x 1   ψ v x 2   ψ except v x 1   ψ and v x 1   ψ
11:          if  v a l u e shortest v x 1   ψ v x 2   ψ v a l u e popular v x 1   ψ v x 2   ψ  do
12:              insert ϕ shortest v x 1   ψ v x 2   ψ to ϕ detected   ψ between v x 1   ψ and v x 2   ψ as ϕ inferred   ψ
13:          else do
14:              insert ϕ popular v x 1   ψ v x 2   ψ to ϕ detected   ψ between v x 1   ψ and v x 2   ψ as ϕ inferred   ψ
15:          end if
16:         end if
17:     end for
18:return  ϕ inferred   ψ

4.2.2. Performance Metrics

Metric 1,F1 score of paths: We evaluated the results based on the F1 measure, which indicates both accuracy and recall. Accuracy is defined as the percentage of the length of the matched segments of the total paths S m a t c h e d to the length of the inferred segments of the total paths S inferred , where S m a t c h e d will be the total length of the common edges between the inferred paths and the ground-truth paths S truth . The recall is measured according to the ratio between the length of the matched segments and the length of the ground-truth paths. Hence, the F1 score of the paths is computed as follows:
S m a t c h e d = S inferred     S truth
a c c u r a c y = S m a t c h e d S inferred  
r e c a l l = S m a t c h e d S truth
F 1   s c o r e = 2 a c c u r a c y r e c a l l a c c u r a c y + r e c a l l
where S is the total length of edges, s S .
Metric 2,F1 score of nodes: Meanwhile, the F1 score of the nodes is also computed in the same was as Metric 1.
Metric 3,observability: Observability evaluates the scope of service of the allocation of AVI sensors, and it is computed as follows:
o b s e r v a b i l i t y = S c m a t c h e d S c t r u t h
where S c m a t c h e d represents the quantities of the total matched nodes of all detected paths, and S c t r u t h represents the quantities of the total nodes of all ground-truth paths.

4.3. Experimental Results

With Equations (4) and (5), the BC and TC of the candidate nodes were calculated, as shown in Figure 9, where the radius of the circle represents the value of the BC or TC.
The P-BPMs were solved by the commercial Gurobi solver. The time spent was about 20.34 s per 10,000 P-BPMs, running on a personal workstation with 3.50 GHz CPU and 16 GB of RAM. Then, the deployment scores of all candidates were globally optimized using PageRank. Finally, we estimated the performance when allocating AVI sensors with different coverage rates.
Figure 10d presents the distribution of the deployment scores of the candidates with a heat map. According to the deployment scores, we can make decisions on allocating diversified proportions of AVI sensors. Figure 10a–c provide examples of the results when deploying 10%, 30%, and 50% of the AVI sensors among the candidates, respectively. Some conclusions can be drawn.
The assessment metrics for the P-BPM are shown in Table 1, including the three types of metrics mentioned above, from 10% to 90% at 10% intervals. For example, 10% means that 10% of the nodes with the top 10% of deployment scores are selected for deployment of AVI sensors. The fine-grained results are visualized in Figure 11, where the difference between each metric and the proportion of AVI sensors (i.e., the red line where y = x) is represented by a histogram. The following conclusions can be drawn.
F 1   s c o r e s : As seen in Table 1, the F 1   s c o r e of the link and node are both high, even with low proportions of AVI sensors; in other words, the performance of the deployment-score is very good. For example, 10% of the AVI sensors lead to F 1   scores of 0.6335 and 0.5425 for the link and node, respectively, meaning that even when AVI sensors are placed at only 10% of the intersections, 63.35% of the total links and 54.25% of the total intersections of the passing vehicles’ paths can be inferred. Figure 11 indicates that the F 1   s c o r e of the link and node will exceed 0.8 if only approximately 25% or more of the AVI sensors are deployed. The proportion ranging from 10% to 45% of the AVI sensors shows significant superiority compared to the basic line at y = x, where the F 1   s c o r e is about 0.4 greater than the corresponding AVI sensor proportion, i.e., x % of the AVIs can monitor more than x + 40 % of the vehicles’ traces. This is somewhat counterintuitive, but is of great significance for the development of smart and sustainable cities.
Accuracy and recall : It is apparent from Table 1 and Figure 11b,c that the a c c u r a c y of link and node is extremely high—on average, more than 0.95—regardless of how many AVI sensors are used, while the r e c a l l is relatively low. Thus, the following conclusions can be drawn: (a) The suggested layouts hold a high identification accuracy regardless of how many AVI sensors there are (assuming that the proportion of AVI sensors is at least 10%); (b) some segments of the paths may be missed during identification if there are too few AVI sensors because fewer AVI sensors result in lower r e c a l l but higher a c c u r a c y ; (c) the marginal cost gets smaller and smaller as the number of AVI sensors increases because a larger AVI sensor proportion leads to a smaller value of r e c a l l from which the proportion of AVI sensors is subtracted, i.e., the growth rate of r e c a l l rapidly decays as the AVI sensor proportion increases.
Observability : On the one hand, similarly to the r e c a l l and F 1   s c o r e , fewer AVI sensors results in high o b s e r v a b i l i t y , e.g., 10% vs. 0.2638 and 20% vs. 0.4558. As a result, the suggested layout is positive to increase the service coverage of AVI sensors. On the other hand, the general difference between the o b s e r v a b i l i t y and the proportion of AVI sensors is inconspicuous, ranging from about 15% to 20% greater. That is to say, unlike other metrics, the growth of o b s e r v a b i l i t y is steadier, meaning that, to rapidly improve the service coverage of AVI sensors, the deployment score is helpful, but not enough.
More specifically, these results confirm that the deployment score is positive to reinforce the identification power of AVI sensors, especially when inferring vehicle paths. If the AVI sensors are placed in the suggested layout, more than 95% of the inferred paths are consistent with the true paths. Counterintuitively, only a minority of AVI sensors can contribute to the identification of a great number of the paths, which is of great significance in practice.

4.4. Comparative Experiments

We compared the path identification performance of the deployment score with other indicators in terms of various purposes. The following indicators were used.
Scenario 1: Betweenness-centrality-based indicator: This method focuses on the priority of the shortest paths, with reference to the complex network theory. The candidate nodes are scored according to their betweenness centrality, and then the layout is determined according to the scores.
Scenario 2: Transfer-connectivity-based indicator: This method is derived from complex network theory. It focuses on the priority of local truth paths and the road network. In the same as in Scenario 1, the candidate nodes are scored according to their transfer connectivity, which was defined in the preceding sections.
Scenario 3: Traffic-volume-based indicator: This method focuses on the priority of traffic volume, the most significant indicator in urban traffic. The candidate nodes are scored according to the traffic volume.
Figure 12 compares the performance of these indicators with the deployment score (computed with the P-BPM), and each item is explained in the following.
Figure 12a shows the F 1   s c o r e s of these indicators. Obviously, the deployment score outperforms the other indicators in terms of the F 1   s c o r e . It could achieve a high F 1   s c o r e with a low proportion of AVI sensors. What is striking in the figure is the dominance of the P-BPM at the beginning, indicating that the deployment score performs well in cases in which AVI sensors are sparse. We note that the estimated results can achieve F 1   s c o r e s for both the link and node of over 70% when the AVI sensor proportion is higher than 20%, while in other scenarios, an AVI sensor proportion of at least 30% is required.
Figure 12b describes the a c c u r a c y of these methods. The figure shows that the deployment score is steady and high, while others have sharp fluctuations, especially if the AVI sensor proportion is low. The a c c u r a c y of the deployment score maintains a high level, meaning that the layout suggested by the deployment score is robust and effective and can ensure the identification accuracy regardless of what the AVI sensor proportion is.
Figure 12c presents the r e c a l l of these methods. As revealed in figure, the deployment score’s recall for both the link and node stands out higher than the others. Thus, its path-reconstruction ability is more powerful in the same condition, and the reconstruction ratio of both the link and node can also be guaranteed. In contrast, the other r e c a l l scores are patchy with different proportions of AVI sensors, so the deployment score is more reliable.
Figure 12d shows the o b s e r v a b i l i t y . Generally speaking, compared with other metrics, the o b s e r v a b i l i t y of all of these methods is low, and the growth rate is also slow. Even in this case, the deployment score still performs better than the others. The o b s e r v a b i l i t y of the traffic-volume-based method is the worst. That is to say, the traffic-volume-based method just ensures the service coverage at the node level, rather than at the regional level, leading to failure when inferring paths. Thus, the deployment score could generate a more reasonable and homogeneous spatial distribution for AVI sensors.

5. Conclusions and Discussions

AVI sensors show unique advantages for vehicle detection in traffic management. This study developed a data-driven method for solving the AVI-LP for better path reconstruction, the final solution of which is a novel indicator called the deployment score, which reflects the priority of deploying AVI sensors. In consideration of practical and universal feasibility, this research constructed a path-level bi-level programming model (P-BPM), rather than a region-level model, which is complicated and may be trapped in an NP-hard problem when applied. A random-walk method was introduced to simulate the tracks of vehicles according to travel characteristics extracted from finite GPS data. Accordingly, the P-BPM for each path was modeled to find the optimal layout of AVI sensors. Each P-BPM was solved by the Gurobi solver. The total scores of the candidates were obtained (i.e., the deployment scores) once all P-BPMs were solved, and the accumulated scores proved to be the approximate Pareto optimal solutions at the regional level from a data-driven perspective. The PageRank method was also introduced to modify the deployment scores with respect to the topological structural characteristics of the road network. Our proposed deployment score was compared with three baseline indicators based on the shortest paths, the true paths, and the traffic volume, respectively. The experimental results show that the deployment score outperforms the other indicators in terms of performance metrics. To be specific, the layout of AVI sensors suggested by the deployment score has excellent and steady routing detection accuracy, great and reliable routing recall, and satisfactory service coverage.
Enabled by the development of ITS, big data are more and more popular and important in smart and sustainable cities due to their indispensable contributions to the management and operation of environmentally sustainable development. The reasonable and effective deployment of traffic sensors is capable of providing comprehensive traffic monitoring data and, simultaneously, allowing lower costs. As some of the most important traffic sensors, AVI sensors are expected to be placed in the road network, i.e., at segments or intersections, to capture the detailed sequential information of passing vehicles. This study discussed the case of AVI sensors placed at intersections to better infer the full traces of the vehicles. This study can support the decision makers in deploying AVI sensors more practicably and effectively. Sustainability is the trend of future transportation development. Deploying AVI sensors more reasonably makes traffic operation and management more efficient, and thus provides motivation for the development of more sustainable transportation.
This research is a preliminary step toward solving the AVI-LP. Random-walk can effectively reduce the complexity and eliminate the deficiency of input data, while some traffic preferences can also be taken into consideration in the random-walk procedure according to various decision objectives. A limitation of this research is that the candidate nodes were manually selected and some assumptions were made, the intention of which was to make it convenient to model and develop solution methods. As we know, in some cases, we cannot grasp the situations of all of intersections. Thus, decision makers prefer to input the restricted conditions of every intersection into the model to decide on the candidate nodes. Hence, as a further research direction, we will consider taking each node as a candidate node, and their restricted conditions will be used to model the P-BPM.

Author Contributions

Conceptualization, W.W.; Data curation, D.L.; Formal analysis, D.L.; Funding acquisition, W.W. and D.Z.; Investigation, D.Z.; Methodology, D.L.; Resources, W.W. and D.Z.; Validation, D.L.; Visualization, D.L.; Writing—original draft, D.L.; Writing—review & editing, D.L. and D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, under Grant number (51878166) and the China Scholarship Council, under Grant number (202106090220).

Data Availability Statement

The author confirms that the data supporting the findings of this study can be found at https://outreach.didichuxing.com/research/opendata/ (accessed on 1 February 2019). The data were supplied by Didi Chuxing.

Conflicts of Interest

The authors declare no conflict of interest.

Disclosure Statement

No potential conflict of interest was reported by the authors.

Variables

Various notation descriptions are given below.
VariablesDetails
x i Binary, equal to 1 if there is an AVI sensor established at node i , and 0 otherwise
a i j r Binary, equal to 1 if the path from i to j within path r is misidentified, and 0 otherwise
q i j r Integer, the misidentified traffic flow from node i to node j of path r
q r Integer, the total true traffic flow of path r
l i j r Float, the misidentified length from node i to node j path r
l r Float, the total length ofpath r
CThe set of candidate nodes, C V
LThe set of links between the candidate nodes
RThe set of the total actual vehicle paths; each path is represented as a sequence of candidate nodes that the vehicle passes
N r The set of the nodes in path r, L r R

References

  1. Fu, C.; Sayed, T. Random-Parameter Bayesian Hierarchical Extreme Value Modeling Approach with Heterogeneity in Means and Variances for Traffic Conflict–Based Crash Estimation. J. Transp. Eng. Part A Syst. 2022, 148, 04022056. [Google Scholar] [CrossRef]
  2. Mahmud, S.; Day, C.M. Leveraging Data-Driven Traffic Management in Smart Cities: Datasets for Highway Traffic Monitoring. In The Rise of Smart Cities; Butterworth-Heinemann: Oxford, UK, 2022; pp. 583–607. [Google Scholar] [CrossRef]
  3. Fu, C.; Sayed, T. Bayesian dynamic extreme value modeling for conflict-based real-time safety analysis. Anal. Methods Accid. Res. 2022, 34, 100204. [Google Scholar] [CrossRef]
  4. Castillo, E.; Grande, Z.; Calviño, A.; Szeto, W.Y.; Lo, H.K. A state-of-the-art review of the sensor location, flow observability, estimation, and prediction problems in traffic networks. J. Sens. 2015, 2015, 903563. [Google Scholar] [CrossRef] [Green Version]
  5. Sharma, S.K.; Phan, H.; Lee, J. An Application Study on Road Surface Monitoring Using DTW Based Image Processing and Ultrasonic Sensors. Appl. Sci. 2020, 10, 4490. [Google Scholar] [CrossRef]
  6. Jo, Y.; Ryu, S. Pothole Detection System Using a Black-box Camera. Sensors 2015, 15, 29316–29331. [Google Scholar] [CrossRef] [PubMed]
  7. Asad, M.H.; Khaliq, S.; Yousaf, M.H.; Ullah, M.O.; Ahmad, A. Pothole Detection Using Deep Learning: A Real-Time and AI-on-the-Edge Perspective. Adv. Civ. Eng. 2022, 2022, 1–13. [Google Scholar] [CrossRef]
  8. Cao, Q.; Ren, G.; Li, D.; Ma, J.; Li, H. Semi-supervised route choice modeling with sparse Automatic vehicle identification data. Transp. Res. Part C Emerg. Technol. 2020, 121, 102857. [Google Scholar] [CrossRef]
  9. Gentili, M.; Mirchandani, P.B. Locating sensors on traffic networks: Models, challenges and research opportunities. Transp. Res. part C Emerg. Technol. 2012, 24, 227–255. [Google Scholar] [CrossRef]
  10. Chen, H.; Chu, Z.; Sun, C. Sensor Deployment Strategy and Traffic Demand Estimation with Multisource Data. Sustainability 2021, 13, 13057. [Google Scholar] [CrossRef]
  11. Fu, H.; Lam, W.H.; Shao, H.; Kattan, L.; Salari, M. Optimization of multi-type traffic sensor locations for estimation of multi-period origin-destination demands with covariance effects. Transp. Res. Part E Logist. Transp. Rev. 2021, 157, 102555. [Google Scholar] [CrossRef]
  12. Owais, M.; Moussa, G.S.; Hussain, K.F. Sensor location model for O/D estimation: Multi-criteria meta-heuristics approach. Oper. Res. Perspect. 2019, 6, 100100. [Google Scholar] [CrossRef]
  13. Zhu, N.; Fu, C.; Zhang, X.; Ma, S. A network sensor location problem for link flow observability and estimation. Eur. J. Oper. Res. 2022, 300, 428–448. [Google Scholar] [CrossRef]
  14. Gentili, M.; Mirchandani, P.B. Review of optimal sensor location models for travel time estimation. Transp. Res. Part C Emerg. Technol. 2018, 90, 74–96. [Google Scholar] [CrossRef]
  15. Gentili, M.; Mirchandani, P.B. Locating Active Sensors on Traffic Networks. Ann. Oper. Res. 2005, 136, 229–257. [Google Scholar] [CrossRef]
  16. Mínguez, R.; Sánchez-Cambronero, S.; Castillo, E.; Jiménez, P. Optimal traffic plate scanning location for OD trip matrix and route estimation in road networks. Transp. Res. Part B Methodol. 2010, 44, 282–298. [Google Scholar] [CrossRef]
  17. Castillo, E.; Menéndez, J.M.; Jiménez, P. Trip matrix and path flow reconstruction and estimation based on plate scanning and link observations. Transp. Res. Part B Methodol. 2008, 42, 455–481. [Google Scholar] [CrossRef]
  18. Zangui, M.; Yin, Y.; Lawphongpanich, S. Sensor location problems in path-differentiated congestion pricing. Transp. Res. Part C Emerg. Technol. 2015, 55, 217–230. [Google Scholar] [CrossRef]
  19. Cerrone, C.; Cerulli, R.; Gentili, M. Vehicle-ID sensor location for route flow recognition: Models and algorithms. Eur. J. Oper. Res. 2015, 247, 618–629. [Google Scholar] [CrossRef]
  20. Fu, C.; Zhu, N.; Ling, S.; Ma, S.; Huang, Y. Heterogeneous sensor location model for path reconstruction. Transp. Res. Part B Methodol. 2016, 91, 77–97. [Google Scholar] [CrossRef]
  21. Ruiz, C.; Conejo, A.J. Pool Strategy of a Producer With Endogenous Formation of Locational Marginal Prices. IEEE Trans. Power Syst. 2009, 24, 1855–1866. [Google Scholar] [CrossRef]
  22. Salari, M.; Kattan, L.; Gentili, M. Optimal roadside units location for path flow reconstruction in a connected vehicle environment. Transp. Res. Part C Emerg. Technol. 2022, 138, 103625. [Google Scholar] [CrossRef]
  23. Bianco, L.; Confessore, G.; Gentili, M. Combinatorial aspects of the sensor location problem. Ann. Oper. Res. 2006, 144, 201–234. [Google Scholar] [CrossRef]
  24. Gendreau, M.; Laporte, G.; Séguin, R. Stochastic vehicle routing. Eur. J. Oper. Res. 1996, 88, 3–12. [Google Scholar] [CrossRef]
  25. Owen, S.H.; Daskin, M.S. Strategic facility location: A review. Eur. J. Oper. Res. 1998, 111, 423–447. [Google Scholar] [CrossRef]
  26. Alves, M.J.; Antunes, C.H. A new exact method for linear bilevel problems with multiple objective functions at the lower level. Eur. J. Oper. Res. 2022, 303, 312–327. [Google Scholar] [CrossRef]
  27. Torres, J.J.; Li, C.; Apap, R.M.; Grossmann, I.E. A Review on the Performance of Linear and Mixed Integer Two-Stage Stochastic Programming Software. Algorithms 2022, 15, 103. [Google Scholar] [CrossRef]
  28. van Slyke, R.M.; Wets, R. L-shaped linear programs with applications to optimal control and stochastic programming. SIAM J. Appl. Math. 1969, 17, 638–663. [Google Scholar] [CrossRef]
  29. Benders, J.F. Partitioning procedures for solving mixed-variables programming problems. Comput. Manag. Sci. 2005, 2, 3–19. [Google Scholar] [CrossRef]
  30. Rubin, P.; Gentili, M. An exact method for locating counting sensors in flow observability problems. Transp. Res. Part C Emerg. Technol. 2021, 123, 102855. [Google Scholar] [CrossRef]
  31. Freeman, L.C. A Set of Measures of Centrality Based on Betweenness. Sociometry 1977, 40, 35–41. [Google Scholar] [CrossRef]
  32. Wu, X.; Cao, W.; Wang, J.; Zhang, Y.; Yang, W.; Liu, Y. A spatial interaction incorporated betweenness centrality measure. PLoS ONE 2022, 17, e0268203. [Google Scholar] [CrossRef] [PubMed]
  33. Wang, X.; Niu, S.; Gao, J.; Zhang, J. A Study on the Highway Network Key Segments Identifying Method Based on the Structural Characteristics. CICTP 2014, 2014, 1819–1829. [Google Scholar] [CrossRef]
  34. Trolliet, T.; Cohen, N.; Giroire, F.; Hogie, L.; Pérennes, S. Interest Clustering Coefficient: A New Metric for Directed Networks Like Twitter. J. Complex Netw. 2021, 10, 597–609. [Google Scholar] [CrossRef]
  35. Xuebin, W. Optimizing Bus Stop Locations in Wuhan, China; University of Twente Faculty of Geo-Information and Earth Observation (ITC): Enschede, The Netherlands, 2010. [Google Scholar]
  36. Gurobi Optimization LLC. Gurobi Optimizer Reference Manual. 2018. Available online: https://www.gurobi.com/documentation/9.5/refman/index.html (accessed on 2 July 2022).
Figure 1. An example of a network and data (color).
Figure 1. An example of a network and data (color).
Sustainability 14 09474 g001
Figure 2. Example of the calculation of the detection rate.
Figure 2. Example of the calculation of the detection rate.
Sustainability 14 09474 g002
Figure 3. Conceptual framework of our strategy.
Figure 3. Conceptual framework of our strategy.
Sustainability 14 09474 g003
Figure 4. Illustration of the drop-fraction and remaining fraction.
Figure 4. Illustration of the drop-fraction and remaining fraction.
Sustainability 14 09474 g004
Figure 5. Illustration of the P-BPM of a single path.
Figure 5. Illustration of the P-BPM of a single path.
Sustainability 14 09474 g005
Figure 6. The coverage of the study area and experimental data.
Figure 6. The coverage of the study area and experimental data.
Sustainability 14 09474 g006
Figure 7. An example of the distribution of GPS trajectories (color).
Figure 7. An example of the distribution of GPS trajectories (color).
Sustainability 14 09474 g007
Figure 8. Illustration of the different paths (color).
Figure 8. Illustration of the different paths (color).
Sustainability 14 09474 g008
Figure 9. Illustration of the BC and TC of the candidate nodes.
Figure 9. Illustration of the BC and TC of the candidate nodes.
Sustainability 14 09474 g009
Figure 10. Suggested layout and deployment scores of the candidate nodes.
Figure 10. Suggested layout and deployment scores of the candidate nodes.
Sustainability 14 09474 g010
Figure 11. Results of the P-BPM of fine-grained AVI sensor proportions. (a) F1 scores. (b) Accuracy. (c) Recall. (d) Observability.
Figure 11. Results of the P-BPM of fine-grained AVI sensor proportions. (a) F1 scores. (b) Accuracy. (c) Recall. (d) Observability.
Sustainability 14 09474 g011
Figure 12. Performance metrics in comparative experiments. (a) F1 score; (b) accuracy; (c) recall; (d) observability.
Figure 12. Performance metrics in comparative experiments. (a) F1 score; (b) accuracy; (c) recall; (d) observability.
Sustainability 14 09474 g012aSustainability 14 09474 g012b
Table 1. Assessment metrics of the P-BPM with specific proportions of AVI sensors.
Table 1. Assessment metrics of the P-BPM with specific proportions of AVI sensors.
Deployment Proportion of AVI SensorsLinkNodeObservability
A c c u r a c y R e c a l l F 1   s c o r e A c c u r a c y R e c a l l F 1   s c o r e
10%0.95210.47470.63350.97810.37530.54250.2638
20%0.94210.60920.73990.96970.58070.72640.4558
30%0.94850.6680.78390.97120.68360.80240.5644
40%0.95720.76520.85050.97710.79790.87850.6878
50%0.95430.81640.880.97630.86930.91970.7599
60%0.96140.84830.90130.97880.90860.94240.8358
70%0.96320.86470.91130.97910.93470.95640.8788
80%0.9670.92380.94490.98130.96950.97540.929
90%0.96790.95690.96240.98140.98930.98530.9685
Note: The cells in the grid have a red background if the value is more than 0.80.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, D.; Wang, W.; Zhao, D. A Practical and Sustainable Approach to Determining the Deployment Priorities of Automatic Vehicle Identification Sensors. Sustainability 2022, 14, 9474. https://doi.org/10.3390/su14159474

AMA Style

Li D, Wang W, Zhao D. A Practical and Sustainable Approach to Determining the Deployment Priorities of Automatic Vehicle Identification Sensors. Sustainability. 2022; 14(15):9474. https://doi.org/10.3390/su14159474

Chicago/Turabian Style

Li, Dongya, Wei Wang, and De Zhao. 2022. "A Practical and Sustainable Approach to Determining the Deployment Priorities of Automatic Vehicle Identification Sensors" Sustainability 14, no. 15: 9474. https://doi.org/10.3390/su14159474

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop