Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction

Liu, He-Wei; Wang, Yi-Ting; Wang, Xiao-Kang; Liu, Ye; Liu, Yan; Zhang, Xue-Yang; Xiao, Fei

doi:10.3390/math11112509

Open AccessArticle

Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction

by

He-Wei Liu

¹,

Yi-Ting Wang

²,

Xiao-Kang Wang

³,

Ye Liu

⁴,

Yan Liu

⁴,

Xue-Yang Zhang

^5,* and

Fei Xiao

^4,*

¹

School of Business, Guilin University of Technology, Guilin 541004, China

²

School of Finance, Hunan University of Finance and Economics, Changsha 410205, China

³

College of Management, Shenzhen University, Shenzhen 518060, China

⁴

School of Business, Central South University, Changsha 410083, China

⁵

Institute of Big Data Intelligent Management and Decision-Making, College of Management, Shenzhen University, Shenzhen 518060, China

^*

Authors to whom correspondence should be addressed.

Mathematics 2023, 11(11), 2509; https://doi.org/10.3390/math11112509

Submission received: 18 April 2023 / Revised: 23 May 2023 / Accepted: 26 May 2023 / Published: 30 May 2023

Download

Browse Figures

Versions Notes

Abstract

:

Since traffic congestion during peak hours has become the norm in daily life, research on short-term traffic flow forecasting has attracted widespread attention that can alleviate urban traffic congestion. However, the existing research ignores the uncertainty of short-term traffic flow forecasting, which will affect the accuracy and robustness of traffic flow forecasting models. Therefore, this paper proposes a short-term traffic flow forecasting algorithm combining the cloud model and the fuzzy inference system in an uncertain environment, which uses the idea of the cloud model to process the traffic flow data and describe its randomness and fuzziness at the same time. First, the fuzzy c-means algorithm is selected to carry out cluster analysis on the original traffic flow data, and the number and parameter values of the initial membership function of the system are obtained. Based on the cloud reasoning algorithm and the cloud rule generator, an improved fuzzy reasoning system is proposed for short-term traffic flow predictions. The reasoning system cannot only capture the uncertainty of traffic flow data, but it also can describe temporal dependencies well. Finally, experimental results indicate that the proposed model has a better prediction accuracy and better stability, which reduces 0.6106 in RMSE, reduces 0.281 in MAE, and reduces 0.0022 in MRE compared with the suboptimal comparative methods.

Keywords:

traffic flow prediction; cloud model; fuzzy inference system; fuzzy C-means

MSC:

60G25; 62A86

1. Introduction

In recent years, rapid economic development has brought about rapid population growth and the increase of vehicle occupancy per capita, which impose a heavy burden on transportation infrastructure, such as insufficient parking spaces. According to the 2020 National Economic and Social Development Statistical Bulletin (http://www.gov.cn/xinwen/2021-02/28/content_5589283.htm, accessed on 22 October 2022), the total number of civilian vehicles in the country has increased by 7.41% year-over-year, and the total number has exceeded 280 million. The continuous increase in the number of motor vehicles has brought many problems to society, such as traffic congestion, a waste of resources, economic losses, excessive commuting times, and frequent traffic accidents. In addition, the pollution caused by the large number of cars may threaten human health [1]. Since traffic flow can reflect the number of vehicles that pass a point in a certain period of time [2], accurate traffic flow forecasting is of great significance to management departments and individuals, which can optimize the design and operation of transportation systems to improve traffic efficiency and safety. Thus, traffic flow predictions over a short period of time, i.e., short-term traffic flow predictions, have attracted much attention of scholars due to the randomness and dynamics of traffic conditions [3]. Many machine-learning methods [4,5] have been applied to short-term traffic flow predictions, which can be divided into parametric models, non-parametric models, and hybrid models.

As for parametric models, they quantitatively describe the relationship between inputs and outputs through an explicit model and estimate the parameters in the model. Common parametric models have Historical Average (HA) models and Auto-Regressive Integrated Moving Average (ARIMA) models. HA relies on the cyclical nature of traffic flow and only uses the average value of past traffic volume to predict future traffic flow [6]. Therefore, HA is simple in calculation and easy to apply in real life. Stephanedes et al. [7] employed HA to forecast future traffic volume and applied the model to the urban traffic control systems. Kaysi et al. [8] used HA for the traveler information systems. However, HA is unable to respond to dynamic changes in traffic systems, especially traffic accidents. To overcome this shortcoming, later models introduce real-time data into the prediction process, such as time-series method ARIMA. ARIMA interprets the past behavior of time series through mathematical models and applies the model to predict future traffic flow [9,10]. Van Der Voort et al. [11] introduced a new method of short-term traffic forecasting KARIMA by combing Kohonen maps with ARIMA time series models for solving the problem that ARIMA cannot deal with nonlinear traffic data. Considering the balance between the increased complexity and the increased forecast accuracy, Williams et al. [12] raised an ARIMAX model through combining ARIMA with explanatory variables for improving forecasting performance. Considering a huge historical database of traffic flow, Kumar et al. [13] proposed a Seasonal ARIMA (SARIMA) model for short-term traffic flow predictions. The model only utilized the prior three days of flow observation to predict the next-day flow values. However, the explicit models presupposed by the above methods are difficult to fit the real traffic flow data. Furthermore, they cannot reflect the emergent traffic conditions well. Consequently, non-parametric models are widely concerned.

Non-parametric models are a class of data-driven methods, which explore implicit relationships between inputs and predictions through large amounts of data without providing explicit functions. Some common non-parametric models include K-Nearest Neighbor (KNN), Support Vector Regression (SVR), and Artificial Neural Networks (ANN). KNN does not need prior knowledge, and it performs better than linear-model algorithms in terms of predictive performance. Zhang et al. [14] established a short-term urban expressway flow prediction system based on KNN from the historical database, the search mechanism and algorithm parameters, and the predication plan. Hou et al. [15] used a two-tier K-nearest neighbor algorithm to forecast short-term traffic flow considering the problem of calculation speed and parameter flexibility. Cai et al. [16] presented a sample-rebalanced and outlier-rejected k-nearest neighbor regression model for short-term traffic forecasting in order to handle the problem of imbalance and noise. In addition, SVR is also widely used in the nonlinear regression and time series problems. To improve traffic prediction accuracy, Lin et al. [17] put forward a method for screening spatial time-delayed traffic series based on the maximal information coefficients, which adopted the combination of support vector regression method and the k-nearest neighbors method for traffic flow prediction. Hong et al. [18] put forward a SVR traffic flow forecasting model, which employs the hybrid genetic algorithm-simulated annealing algorithm to determine its suitable parameter combination. Hong [19] applied SVR to seasonal trend time series data and proposed a traffic flow forecasting model SSVRCIA that combines the seasonal support vector regression model with a chaotic immune algorithm. Furthermore, since ANN has strong self-learning and self-adaptation abilities, many scholars proposed short-term traffic flow prediction models based on ANN [20]. Tang et al. [21] raised Neighbor Subset Deep Neutral Network (NSDNN) to forecast spatio-temporal data, which can extract useful inputs from nearby roads by conjoining a deep neutral network and the subset selection method. Considering the spatial correlation of traffic flow, the paper [22] proposed a method to predict the spatio-temporal characteristics of short-term traffic flow by combing the k-nearest neighbor algorithm and bidirectional long–short-term memory network model. However, a single short-term traffic flow prediction model is difficult to meet various situations in real life. Therefore, to improve the prediction ability and prediction accuracy, hybrid prediction models have received extensive attention, which takes full advantages of different models.

Some hybrid methods are raised for forecasting short-term traffic flow by combining several techniques [23,24,25]. Considering the forecasting performance is seriously deteriorated by non-Gaussian noises inside the traffic flow sequence, Fang et al. [26] presented an error distribution free deep learning for short term traffic flow forecasting. Liu et al. [27] put forward a hybrid short-term traffic flow forecasting method combining the neural networks and KNN. In order to improve the forecasting accuracy of short-term traffic flow and provide precise and reliable traffic information for traffic management units and travelers, Liu et al. [28] raised a hybrid forecasting model based on KNN and SVR. Luo et al. [29] proposed a spatiotemporal traffic flow prediction method by combining KNN and long–short-term memory network (LSTM), called KNN–LSTM. However, the above-mentioned methods ignore the uncertainty in the traffic flow data, which affects the accuracy and robustness of the traffic flow prediction model. The uncertainty involves ambiguity and randomness, and they often appear at the same time [30]. It is worth noting that fuzzy systems can describe the ambiguity well. Therefore, researchers often combine fuzzy systems with ANNs, which are called fuzzy neural networks or neuro-fuzzy models. Zhou et al. [31] proposed a novel deep-learning model for short-term traffic flow prediction by considering the inherent features of traffic data. In addition, a novel approach of the estimation of uncertainty is proposed, which is based on the notion of Intuitionistic fuzzy set (an extension of the Fuzzy set of Lotfi Zadeh) and an intuitionistic fuzzy traffic characterization [32].

Considering the Fuzzy Inference System (FIS) has the ability to autonomously imitate the human brain for reasoning, the Adaptive Neuro-Fuzzy Inference System (ANFIS) was developed by Jang Roger [33]. The system combined the learning mechanism of neural networks and the reasoning ability of FIS. ANFIS can adaptively extract network inference rules from data samples with the help of the neural network’s autonomous learning advantages. It shows unique characteristics and has been successfully applied in many fields. Keskin et al. [34] used the synthetic sequence generated by the ARIMA model as the training set of ANFIS and developed a flow prediction method based on the combination of ANFIS and the stochastic hydrological model. Ahmadianfar et al. [35] adopted the integration of an adaptive hybrid of differential evolution and particle warm optimizations with an adaptive neuro fuzzy inference system model for EC prediction. Acakpovi et al. [36] used ANFIS to predict the reliability of power demand. Mohiyunddin et al. [37] introduced a novel ANIFS for data protection to improve and determine the degree of security. Chen et al. [38] proposed a short-term traffic flow prediction based on ANFIS. Ghenai et al. [39] developed a short-term and accurate energy consumption forecast for educational building. This aims to balance the supply from renewable power systems and the building electrical load demand. Although ANFIS can describe the ambiguity in the traffic flow data, it cannot reflect the randomness of the data. The cloud model proposed by Li et al. [40] can simultaneously capture multiple uncertainties, especially randomness. In order to describe the ambiguity and randomness of traffic flow data simultaneously and improve the prediction performance of the model, we combine cloud models and FIS to solve traffic flow forecasting problems in ANN. In summary, the main contributions of our work are listed below:

(1) The cloud model and fuzzy inference system are combined to describe the ambiguity and randomness in the traffic flow. Put the cloud model in the network of the fuzzy inference system for training instead of using the inference rules to perform simple mapping between two cloud models.

(2) By calculating the weight of the historical time series of traffic flow, a weighted multi-dimensional cloud model is generated.

(3) Based on the weighted multi-dimensional cloud model, the improved fuzzy prediction system is constructed for short-term flow predictions; the system can describe the randomness problems and ambiguity of the data at the same time. It overcomes the shortcomings of fuzzy inference systems, which cannot capture the timing characteristics of long sequence data well.

The following content includes four sections. Section 2 introduces the basic knowledge related to the paper. Section 3 describes the improved fuzzy inferenced systems and explains the input layer, the cloudification fuzzy layer, the cloudification rule layer, the standardization layer, the inverse cloudification layer, and the output layer in detail. Section 4 demonstrates experiments for verifying the effectiveness of the raised model. Section 5 summarizes the whole paper.

2. Preliminaries

2.1. Fuzzy Inference Systems

A Fuzzy Inference System (FIS) is a system with the ability to handle fuzzy data based on fuzzy set theory and fuzzy logic methods, which simulate the fuzzy reasoning process of human beings by applying fuzzy sets and fuzzy rules to input data to generate fuzzy output results. Next, we will give the definition of fuzzy rules.

Definition 1

[41]. Suppose the input–output data records of fuzzy rules are given,

(x^{p}; y^{p}), p = 1, 2, \dots, N,

where

x^{p}

(

x^{p} \in R^{m}

) is the input,

y^{p} (y^{p} \in R)

is the output, and

p

denotes the pth sample. Then, single fuzzy IF–THEN rule performs as follows: IF

x^{p}

is

A

, THEN

y^{p}

is

B

, where

A

and

B

are fuzzy sets defined in R. Fuzzy systems mainly consist of a fuzzy input layer, a fuzzy inference method, a fuzzy rule base, and a defuzzification layer [42].

The fuzzy layer is responsible for mapping the exact values entering the fuzzy system to a fuzzy set over a given theoretical domain. Fuzzification methods include the fuzzy single value method, the triangular membership function method, and the Gaussian membership function method. Since the Gaussian membership function has a good anti-interference ability and the fuzzification results are closer to human cognition, it is mostly used in research.

The fuzzy rule base, which is the core part of the fuzzy inference system, consists of all the fuzzy rules in the system. It has two forms, including one-dimensional fuzzy rules and multi-dimensional fuzzy rules. The fuzzy inference engine is mainly responsible for calculating the incentive intensity of the rules in the rule base.

The defuzzification layer is to determine the best accurate value that can represent the fuzzy set. The method of defuzzification is not unique, as it mainly includes the maximum membership method, the center of gravity method, and the center average method.

2.2. Cloud Model

Inspired by probabilistic mathematics and the fuzzy set theory, Li et al. [40] created the cloud model, which is a new method to recognize uncertainty and an important way to realize two-way cognitive conversions between qualitative semantics and quantitative values. The cloud model allows for a certain degree of deviation between random phenomena and normal distribution and measures the deviation between them. At the same time, cloud models can describe the inherent correlation between randomness and fuzziness in uncertainty. Next, the definition of the cloud model is given.

Definition 2

[42]. Assume a universe

X = {x_{i}}

, where

x_{i}

is an exact value and there exists a set of linguistic terms,

T

with

X

. If

x

is a random instance on T, and the degree of certainty

u (x)

of x for

T

is a random number with a stable tendency within the interval

[0, 1]

, then the distribution of

x

is called a cloud on

X

, and each random instance of

x

is called a cloud drop on domain

X

.

The cloud model is generally described using three characteristic values:

E x

,

E n

, and

H e

. Among them,

E x

is the expectation, representing the expectation of the sample with a membership degree of 1 in

T

and reflecting the center position of the sample;

E n

is entropy and

H e

is hyperentropy, both of which are determined by the correlation between randomness and fuzziness within

T

simultaneously. The entropy

E n

can be used to measure the degree of randomness in the sample, manifested as the width of the cloud model (that is the distribution range of cloud droplets on the horizontal axis within the universe). Hyperentropy

H e

can reflect the degree of dispersion of

T

, manifested as the thickness of the cloud model, i.e., the degree of condensation of cloud droplets within the universe. A cloud model can be labeled as

C = (E x, E n, H e)

. The Gaussian cloud model, based on the Gaussian distribution function and Gaussian membership function, is the most important cloud model, which is defined as follows.

Definition 3

[42]. Let U be the universe of discourse and T be a linguistic terms set in U. If

x \in U

is a random instantiation of concept T and satisfies

x \sim N (E x, E {n^{'}}^{2})

,

E n^{'} \sim N (E n, H e^{2})

, then the certainty degree of x belonging to T satisfies

y = e^{\frac{- {(x - E x)}^{2}}{2 {(E n^{'})}^{2}}}

where y belongs to [0, 1].

The distribution of X in the universe U is named a one-dimensional normal cloud, and the cloud drop can be written as (x, y). The cloud can effectively describe both fuzziness and randomness of a concept by three quantitative variables, i.e., expectation Ex, entropy En, and hyper entropy He.

The one-dimensional cloud was originally applied to solving the problem of decision-making evaluations. When the number of evaluation factors increases, the evaluation results deviate significantly from the actual situation. Therefore, a multi-dimensional cloud model is proposed to overcome the above-mentioned problems. The multi-dimensional cloud model is an extension of the one-dimensional cloud model, which adopts the one-dimensional cloud method for each attribute of the multi-dimensional cloud [43]. In the following, we give the definition of the multi-dimensional cloud model.

Definition 4

[43]. Let

U

be a set of samples where

\forall X \in U

,

X = (x_{1}, x_{2}, \dots, x_{m})

, and T be a qualitative concept on the domain

U

.

\forall X \in U

, there is a membership degree

μ \in [0, 1]

of

X

with respect to

T

. That is:

U \to [0, 1]

.

Definition 5.

Assuming that the dimensions in the universe of discourse are independent of each other, then the m-dimensional cloud has 3m numerical eigenvalues:

(E x_{1}, E n_{1}, H e_{1}, E x_{2}, E n_{2}, H e_{2}, \dots E x_{m}, E n_{m}, H e_{m})

. Where

E x_{1}, E x_{2}, \dots, E x_{m}

is the expectation,

E n_{1}, E n_{2}, \dots, E n_{m}

is the entropy of the multidimensional normal cloud, and

H e_{1}, H e_{2}, \dots, H e_{m}

is super-entropy. A multi-dimensional cloud model can be expressed by the following formula, which is called MEHS (Mathematical Expected Hyper Surface):

M E H S (x_{1}, x_{2}, \dots, x_{m}) = \exp [- \frac{1}{2} \sum_{i = 1}^{m} \frac{{(x_{i} - E x_{i})}^{2}}{E n_{i}^{2}}]

(1)

2.3. Cloud Inference Algorithm

2.3.1. Front Part Cloud and Back Part Cloud

The foundation of uncertainty reasoning is uncertainty knowledge, and the uncertainty information contained in uncertainty knowledge is often extracted using IF–THEN fuzzy rules. IF–THEN fuzzy rules include one-dimensional fuzzy rules and multidimensional fuzzy rules. Among them, the one-dimensional fuzzy rules are: If x is

\tilde{A}

, then y is

\tilde{B}

, which is called an uncertainty inference machine. The condition

\tilde{A}

corresponds to the linguistic terms set of universe

U_{1}

, which is called the one-dimensional front part; the conclusion

\tilde{B}

corresponds to the linguistic terms set of universe

U_{2}

, which is called the one-dimensional back part. In cloud-reasoning algorithms,

\tilde{A}

is called a one-dimensional front part cloud for determining the membership

u

degree of x the linguistic terms set of

U_{1}

, and it generally uses the X conditional cloud generator.

\tilde{B}

is called a one-dimensional Back Part Cloud, and a Y-condition cloud generator is used to determine the membership

u

degree of x belonging to the linguistic terms set of

U_{2}

.

The one-dimensional precursor cloud generator, shown in Figure 1, converts the input data into cloud droplets and obtains the distribution range and pattern of the data. The mapping relationship between the input data and the membership degree is established. In the process of generating the membership degree, normal random numbers based on expectation and variance are used, and it considers the fuzziness and randomness of data in the overall calculation process. The detailed algorithm is as follows:

Input: A cloud model

(E_{x}, E_{n}, H e)

and a quantitative x.

Output: The membership degree μ of the quantitative values of x.

(1): Produce a normal random entropy based on the entropy $E_{n}$ and the hyperentropy $H e$ .
(2): Calculate the cloud droplet at the specified value x, $d r o p = \exp [- {(\frac{x - E x}{E n^{'}})}^{2}]$ .

The accuracy of the one-dimensional back-part cloud generator depends on the amount of data in the model. When the number of cloud droplets is large enough, its three parameter values can be calculated according to the statistical characteristics. The greater the number of cloud droplets, the better the statistical effect.

2.3.2. Cloud Model Inference Rule Generator

By connecting an antecedent cloud generator to a consequent cloud generator, a single rule generator is constructed. The operating mechanism of a single rule generator is to connect the two in sequence, so that the combination of the two conditional cloud generators can realize the preservation and transmission of the uncertainty of the data and complete the uncertainty inference. The execution process of the algorithm is as follows:

Step 1: Generate a normal random number

E x_{A}^{'}

with

E n_{A}

as the expected value and

H e_{A}

as the mean squared deviation.

Step 2: Calculation of the membership degree:

μ = \exp [\frac{- {(x_{A} - E x_{A})}^{2}}{2 (E n_{A}^{'})}]

(2)

Step 3: Generate a normal random number

E n_{B}^{'}

with

E n_{B}

as the expected value and

H e_{B}

as the mean squared deviation.

Step 4: When the quantization value

x_{A} \leq E x_{A}

, the antecedent cloud activates and rises along, the latter also activates and rises along this direction

x_{B} = E x_{B} - E n_{B}^{'} \times \sqrt{2 \ln u}

(3)

Step 5: When the quantization value

x_{A} > E x_{A}

, the cloud of antecedents is activated and descends along, then the latter also activates and descends in this direction.

x_{B} = E x_{B} - E n_{B}^{'} \times \sqrt{- 2 \ln u} .

(4)

In practice, the multi-rule inference algorithm is generally used, as shown in Figure 2. Through a logical calculation, uncertainty reasoning for multi-rule reasoning can be achieved. In the actual operation process, the number of conditions and rules is determined based on the specific manifestations of different datasets. For different inference rules, logical computation operators are mainly divided into “soft AND” and “soft OR” operators. When the result of reasoning needs to meet the requirements of all conditional attributes, a logical “AND” operation is performed, which is called the “soft AND” algorithm. When the inference result satisfies one or more of the conditions, a logical “OR” operation is performed, which is called the “soft OR” algorithm. In order to simplify the calculation, it is necessary to minimize the possibility of multiple conditions and rules appearing during the inference process. As the number of multiple conditions and rules in the system increases, the number of rules will rapidly increase, and the computational difficulty will then significantly increase. Therefore, it is necessary to perform a certain degree of “dimensionality reduction” on inference rules, split complex rules that are difficult to calculate. Thus, they reduce the computational workload and complexity of the model. In researching the literature, the “max” function is generally used to take the maximum value, and the “prod” function is used to calculate the cumulative result for the “soft sum” calculation to obtain the comprehensive membership degree.

3. Improved Fuzzy Inference System

Fuzzy inference systems usually use the membership function in the fuzzification layer to project the exact value of the input values into the fuzzy set. Common fuzzy membership functions include the triangular membership function, trapezoidal membership function, generalized bell-shaped membership function, Gaussian membership function, joint-Gaussian membership function, etc., among which the Gaussian membership function is most widely used. However, due to the different driving habits of drivers, there is a certain degree of randomness in the traffic flow data, and the above-mentioned functions cannot describe the randomness of the traffic flow data well, so this paper introduces the cloud model as the membership function.

For ease of understanding, Figure 3 shows the improved fuzzy inference system. The fuzzy inference system consists of five network layers, namely the input layer, the cloudification fuzzy layer, the cloudification rule layer, the standardization layer, the inverse cloudification layer, and the output layer, where

{X_{t} | t = 1, 2, 3, \dots, n}

denotes a time sequence of the observed traffic flow data and the output result represents the predicted traffic flow at time t + 1.

The execution function of each layer of the improved fuzzy inference system is as follows (for the convenience of symbolic representation, let the input of neuron i in network layer k be denoted as

I_{i}^{k}

and the output as

O_{i}^{k}

):

(1) Input layer: Each node on the input layer is directly connected to the clouded fuzzy layer and is primarily used to receive traffic flow data with a time window.

In addition to the strong cyclical correlation that traffic flow demonstrates with the same day of each week, there is also a cyclical similarity in traffic flow on a daily basis. If only daily variation is considered, the overall trend of traffic flow over a 24-h period is reflected; if only the weekly cyclical variation is considered, the overall trend of traffic flow over a 24-h period on the same day of each week is reflected. If only one of the above is considered, it does not fully reflect the traffic flow pattern and needs to be considered in a comprehensive manner. In order to model the daily and weekly periodicity of traffic flow, the periodic input matrix for time t is given as follows:

X^{d} = [x_{1} (t) x_{2} (t) x_{3} (t) \dots x_{d} (t)]

X^{w} = [x_{1} (t) x_{2} (t) x_{3} (t) \dots x_{w} (t)]

(5)

where x represents the time series,

x_{d} (t)

and

x_{w} (t)

represent the traffic flow data for the previous d days and w weeks, respectively. Therefore, the input layer of the fuzzy system is responsible for passing each component of the traffic flow history data

{X (t) | X^{d} (t), X^{w} (t)}

to the clouded fuzzy layer.

Input:

I_{i}^{1} = x_{i} (t)

; Output:

O_{i}^{1} (i = 1, 2, \dots, n)

; n = d + w, indicating the total number of nodes in the first layer of the network.

(2) Cloudification fuzzy layer: It performs uncertainty processing on the data and maps the exact traffic flow values to the uncertainty space. Each node in the clouded fuzzy layer represents a sub-subordinate cloud model generated by the time value t and the X-conditional cloud generator, which calculates the degree of certainty of each input temporal component. In this layer of network, the input time series data is clustered first according to the chapter fuzzy clustering algorithm. The number of membership functions in the system is equal to the number of clusters, and the initial parameters of the membership functions are determined by the clustering results.

Input:

I_{i}^{2} = O_{i}^{1}

;

Output:

O_{i}^{2} = μ_{i}^{j} = \exp (- \frac{{(x_{i} (t) - E x_{i}^{j} (t))}^{2}}{2 {(E^{'} n_{i}^{j})}^{2}})

, where

i = 1, 2, \dots, n

;

j = 1, 2, \dots, m_{i}

,

μ_{i}^{j}

denotes the jth cloud model affiliation function corresponding to the time series of the input system; mi denotes the number of discrete sub-clouds into which

x_{i} (t)

is divided.

(3) Cloudification rule layer: The rule layer is mainly responsible for cloud rule matching, and each fuzzy rule has a corresponding node in this layer. t-mode “AND” and “OR” are the most commonly used operators for fuzzy set combination, and the soft “AND” operator is activated on the rule layer. The activation of each cloud rule

a_{k}

can be determined by the soft “AND” operator, where

k = 1, 2, \dots, m

. The soft “AND” calculation process refers to the multi-dimensional normal cloud generator introduced in Definition 3 to calculate the membership degree of the multi-dimensional normal cloud.

This paper argues that the closer the historical traffic flow series is to the prediction time point, the higher the similarity with the prediction time period. Thus, this paper gives higher weights to the time series with high impact in the historical traffic flow sequence for compensating the lack of learning ability of the fuzzy inference system. Assuming that the input sequence is

{X (t) | x_{1} (t), x_{2} (t), \dots, x_{n} (t)}

, we assume that the ith time series of

{X (t) | x_{1} (t), x_{2} (t), \dots, x_{n} (t)}

has a high impact on the prediction result [44]. Therefore, we calculate the corresponding weights to be assigned to each time series to improve the prediction accuracy. Then, we perform multiple linear regression using multiple time series data, calculated as follows:

y_{t} = \sum_{n = 1}^{N} w^{n} x^{n} + b

(6)

where

w^{n}

is the corresponding weight and b is the bias. The weight and bias parameters in the cloud rule can be obtained by minimizing the equation

L (h_{θ} (x), y_{t})

.

h_{θ} (x)

is the predicted value. Finally, the weights can be obtained as:

{\tilde{w}}^{n} = \frac{\exp (w^{n})}{\sum_{n = 1}^{N} \exp (w^{n})}

(7)

where

W^{n}

is the weight of the nth day before the prediction time point. In this paper, the Softmax classifier function is used to ensure that the sum of all weights is 1.

The multi-dimensional normal cloud, processed by the fuzzy-rule enhancement mechanism, is calculated as follows.

μ_{i} = \exp [- \frac{1}{2} \sum_{j = 1}^{m} {\tilde{w}}_{j} \frac{{(x_{j i} - E x_{j})}^{2}}{y_{j i}^{2}}], i = 1, 2, \dots, n

(8)

The fuzzy inference rule for this layer is: If the sub-subordinate cloud function has a membership degree of

μ_{k 1}, μ_{k 2}, \dots, μ_{k n}

, then the combined membership of the rule is

μ_{k}

.

Input:

I_{i}^{3} = O_{i}^{2}

; Output:

O_{i}^{3} = a_{k} = \exp [- \frac{1}{2} \sum_{j = 1}^{m} {\tilde{w}}_{j} \frac{{(x_{j i} - E x_{j})}^{2}}{y_{j i}^{2}}]

.

(4) Standardization layer: It is mainly responsible for the standardization operation of values. The following formula is used to calculate the normalized activation intensity

{\bar{a}}_{k}

corresponding to the activation degree

{\bar{a}}_{k}

passed into this layer.

{\bar{a}}_{k} = \frac{a_{k}}{\sum_{k = 1}^{m} a_{k}}

(9)

Input:

I_{i}^{4} = O_{i}^{3}

; Output:

O_{i}^{4} = {\bar{a}}_{k}

.

(5) Inverse cloudification layer: Quantitatively transform the fuzzy membership degree

{\bar{a}}_{k}

and generate a subsequent cloud by the Y conditional cloud generator. Then output the inference result and its corresponding traffic flow value q_k.

Input:

I_{i}^{5} = O_{i}^{4}

; Output:

O_{i}^{5} = d r o p ({\bar{a}}_{k}, q_{k})

.

(6) Output layer: The results of the inverse clouding layer are averaged and weighted. Then, output the final results.

Input:

I_{i}^{6} = O_{i}^{5}

; Output:

O^{6} = \frac{\sum_{k = 1}^{m} q_{k}}{m}

.

4. The Experiments

4.1. Data Description and Indexes

We employ the traffic flow data of the observation point on the SR1 highway of Monterey City from 1 June 2019 to 31 August 2019 with a time interval of 5 min. These data are derived from the PeMS platform (http://pems.dot.ca.gov, accessed on 22 October 2022). The data from 1 June 2019 to 20 August 2019 (81 days) are used to train the prediction model, and the data from 21 August 2019 to 31 August 2019 (10 days) is used to evaluate the model. For reducing the impact of the difference between two datasets on the accuracy of the model and reducing the computational complexity, all data are processed by the mean normalization method. The partial initial data are shown in Figure 4, where data quality represents the detected proportion, i.e., vehicles detected and recorded in the current time period/the total number of vehicles.

In addition, we adopt three indexes for reflecting the performance of the improved fuzzy prediction system, which are Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Relative Error (MRE). The detailed calculation formulas of these three indexes are as follows:

RMSE = {[\frac{1}{N} \sum_{i = 1}^{N} {(y_{i}^{'} - y_{i})}^{2}]}^{1 / 2}

(10)

where

N

denotes the number of samples,

y_{i}^{'}

is the predicted value and

y_{i}

is the real value. RMSE is used to reflect the unbiasedness of sequence prediction. The smaller the value is, the smaller the dispersion degree of error distribution is and the better the prediction performance is.

MAE = \frac{1}{N} \sum_{i = 1}^{N} | y_{i}^{'} - y_{i} |

(11)

MAE is used to represent the average absolute deviation between the predicted value and the real value. The smaller MAE is, the smaller the error is, and the better the prediction effect is.

MRE = \frac{1}{N} \sum_{i = 1}^{N} \frac{| y_{i}^{'} - y_{i} |}{y_{i}}

(12)

MRE is the average value of relative error. The smaller MRE is, the closer the predicted value is to the actual value, and the better the prediction effect is.

4.2. Experiment Settings and Results

In this paper, the FCM (Fuzzy C-Means) method is used to generate fuzzy inference rules. The distribution matrix index is 2, the maximum number of iterations is 500, the model training method uses a BP neural network algorithm, and the time window of traffic flow history data is 5. All experiments in this chapter are performed on MATLAB R2017b, and the platform used is Windows 10 (CPU: i7-10875H).

The number of fuzzy clusters affects the number of membership functions and the number of rules in the fuzzy system, which affects the final model prediction effect in turn. This paper first increases the number of fuzzy clusters from 2 to 10 at an interval of 1, and then increases the number of fuzzy clusters from 10 to 100 at an interval of 10 before observing the change in the error value of the model output result. The prediction result and calculation time are shown in Table 1. In addition, we also present a visual Figure 5 of Table 1 for comparative analysis.

Among them, the rate of change represents the growth rate of the RMSE value (operation time) under each cluster number relative to the RMSE value (operation time) under the previous cluster number. It can be seen from Table 1 that as the number of membership functions increases, the RMSE value of the training set gradually decreases at the beginning. This paper deems that this is because the greater the number of fuzzy clusters, the more the parameters of each fuzzy membership function. It can reflect the characteristics of time series data in this category and finally obtain better training set RMSE results. When the number of clusters increases from 2 to 6, the RMSE of the test set also decreases. However, as the number of clusters continues to increase, the RMSE value of the prediction set gradually increases. This shows that with the increase of membership functions, the model can better fit time series data. However, when the number of membership functions exceeds a certain threshold, there is overfitting in the model, and the generalization ability of the model becomes worse. In addition, as the number of fuzzy clusters increases, the RMSE value of the training set decreases less, while the RMSE value of the prediction set increases more. Take the two sets of data with the number of clusters 20 and 30 as an example. When the number of clusters increases from 20 to 30, the RMSE value of the training set decreases by 0.888%, while the RMSE of the test set increases by 1.718%. The increase is the decrease of the training set. The increase is nearly twice the decrease of the training set. This means that for the current data set, a continuous increase in the number of membership functions does not effectively improve the performance of the model, but it actually leads to a very high error value in the prediction. In addition, when the number of clusters increases, the average running time of the model increases by 85.661%, indicating that the complexity of the model is greatly increased, the convergence speed is greatly slowed down, and the prediction effect is greatly reduced. In order to reduce the degree of model overfitting, this paper sets the number of fuzzy clusters to 6. At this time, the prediction effect of the model is relatively good, the model is relatively simple, and the algorithm converges faster.

4.3. Experimental Results

According to the experimental results in Section 4.2, the experiment in this section takes 6 as the number of fuzzy clusters, and the maximum number of iterations is 500. The traffic flow forecast results for one day on 30 August 2019 are shown in Figure 6. It can be seen that the peak flow of this monitoring point is about 300 vehicles, and it generally appears at about 15:00 pm to 16:00 pm. The general trend of traffic flow at the monitoring site is to decrease from about 0:00 in the morning and remain at about 20 for a duration of about 5 h. From 5:00, the traffic gradually increases until it reaches the maximum at 8:00 in the morning. The traffic flow reaches the peak of the day from 15:00, and then maintains a high flow value until around 20:00 and starts to slowly decrease. In addition, as shown in Figure 6, the traffic monitoring point was disturbed by a large amount of external noise during the time period from 0:00 to 5:00 on 30 August 2019, and a large number of data points with a traffic value of 0 appeared. According to the source data, the observation rate of the monitoring points in this time period is 0% (Observed = 0%), that is, the monitoring points are not working normally in this time period. On the day of 30 August 2019, there were a total of 288 observation points, of which 22 were missing points, accounting for 7.6% of the total observation data for that day. However, the improved fuzzy inference system proposed in this paper has not been affected by the missing data values. The model’s prediction effect on the day was good. The RMSE of the model was 19.32. After removing the missing points, the model’s predicted RMSE value on the day was reduced to 19.10, a decrease of only 1.2%. From this we know that the proposed model has good robustness and can effectively resist the external noise interference in the data.

4.4. Comparative Experiments

In order to verify the accuracy of the membership function of the cloud model adopted in this paper, the traditional Gaussian membership function, linear membership function, and triangular membership function are used in Comparative Experiment 1. In order to verify the rationality of the FCM algorithm for calculating the number of membership functions, Comparative Experiment 2 uses subtractive clustering and grid segmentation to cluster the data. According to the experimental results in Section 4.2, Comparative Experiments 1 and 2 both take 6 as the number of clusters, and the maximum number of iterations is 500. This section takes 30 August 2019 16:00–19:00 in the afternoon as an example.

The most common, simplest, and easy-to-implement fuzzification method is the linear function method. Because this method is simple enough, it can only process relatively accurate input data. Under this circumstance, the fuzzification performance of this method is good. When the interference noise contained in the input data gradually increases, it is necessary to change the fuzzification method to the triangular membership function method. Relatively speaking, the operation process of this function is relatively simple, and the robustness of the calculation result is improved to a certain extent. Although the Gaussian membership function is more complicated than the first two basic methods, its calculation result has good anti-interference ability, and the fuzzification result is closer to the characteristics of human cognition, so it is widely used in fuzzy reasoning systems. The prediction results of Experiment 1 are shown in Figure 7. The model evaluation matrix is shown in Table 2:

As can be seen from Figure 7, because the traffic flow data is interfered by a large amount of external noise, its data changes show a certain degree of twists and turns, and the range of changes is large. The prediction results of the triangular membership function, the Gaussian membership function, and the cloud model proposed in this paper constitute a consistent trend. The linear membership function has a relatively simple structure, and only the predicted trend is consistent with the actual value. However, there are many outliers in the prediction result, and the prediction effect is poor. This article considers that this is because the traffic flow data contains a lot of noise interference, and the anti-interference ability of the linear membership function is poor. From Table 2, it can be seen that three evaluation matrix results of the cloud model are better than the other three membership function models. Although the triangular membership function and Gaussian membership function models have certain robustness, the prediction effect is still inferior to the cloud model. This is because the cloud model not only considers the fuzziness of the data, but it also considers the randomness in the data so the improved fuzzy inference system proposed in this paper has certain advantages.

The grid segmentation and subtractive clustering algorithm are two commonly used clustering methods in fuzzy inference systems. In Comparative Experiment 2, the influence radius of subtractive clustering algorithm is set to 0.55. The experimental results are shown in Table 3:

It can be seen from the table that the model error results of the FCM algorithm are better than the other two clustering algorithms, which proves that the FCM algorithm used for generating the system membership function is reasonable. The grid-segmentation algorithm has the worst predictive effect. It may be because the grid-segmentation algorithm is a hard classification. All clusters are performed on the grid, and only regular clusters (such as cluster boundaries horizontal or vertical) can be explored, while the clusters with a strong correlation on the oblique boundary at the boundary point cannot be detected. In addition, traffic flow data belong to complex high-latitude data, and the grid cells divided in the grid-segmentation method have an exponential relationship with the data dimension. When the data dimension increases, the number of grid cells will explode, and the time complexity will also increase. Its running time is as long as 7681.32 s, which is much higher than the 104.1 s of the FCM algorithm and 104.977 s of the subtractive clustering algorithm. Although the subtractive clustering algorithm is an improved clustering algorithm proposed for large sample data, the efficiency of the algorithm has been improved. However, this paper thinks that the initial centers of subtractive clustering are all based on the points in the original data, which is not the true clustering center in theory. This kind of algorithm will gradually produce errors in the clustering process, and the errors will eventually lead to poor prediction results after multiple accumulations.

5. Conclusions

This paper proposes a cloud model-based fuzzy inference system for short-term traffic flow prediction. First, it briefly introduces the basic knowledge and algorithms of fuzzy mathematics. Then, it introduces a cloud model overview, cloud inference algorithm, and its rule generator, respectively. Finally, this paper uses the cloud model to fit the randomness and uncertainty of the traffic flow data and compares it with the typical road section, i.e., the fuzzy inference system based on the traditional Gaussian membership function, triangular membership function, and linear membership function. The experimental results show that the improved fuzzy prediction system has superiority and practicability under different conditions. The proposed model laid the theoretical foundation for the construction of the short-term traffic flow forecasting model based on the improved fuzzy theory.

There are still many research flaws in this paper. In future research, we will study the problem using the Intuitionistic fuzzy sets to model the uncertainty in the short-term traffic flow prediction.

Author Contributions

Data curation, X.-K.W., Y.L. (Ye Liu), Y.L. (Yan Liu) and X.-Y.Z.; Formal analysis, H.-W.L., Y.-T.W., Y.L. (Ye Liu), Y.L. (Yan Liu) and X.-Y.Z.; Funding acquisition, F.X.; Investigation, H.-W.L., Y.-T.W., Y.L. (Ye Liu), Y.L. (Yan Liu) and F.X.; Methodology, H.-W.L., Y.-T.W., X.-K.W., Y.L. (Yan Liu) and X.-Y.Z.; Project administration, F.X.; Resources, H.-W.L.; Validation, H.-W.L. and F.X.; Visualization, X.-K.W.; Writing—original draft, H.-W.L., Y.-T.W. and Y.L. (Yan Liu); Writing—review & editing, X.-K.W., Y.L. (Ye Liu), X.-Y.Z. and F.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are unavailable due to privacy.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

References

Chen, Z.-Y.; Xiao, F.; Wang, X.-K.; Hou, W.-H.; Huang, R.-L.; Wang, J.-Q. An interpretable diagnostic approach for lung cancer: Combining maximal clique and improved BERT. Expert Syst. 2023, e13310. [Google Scholar] [CrossRef]
Smith, B.L.; Demetsky, M.J. Short-term traffic flow prediction models-a comparison of neural network and nonparametric regression approaches. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, San Antonio, TX, USA, 2–5 October 1994; pp. 1706–1709. [Google Scholar]
Tong, M.; Duan, H.; Luo, X. Research on short-term traffic flow prediction based on the tensor decomposition algorithm. J. Intell. Fuzzy Syst. 2021, 40, 5731–5741. [Google Scholar] [CrossRef]
Chen, Z.-Y.; Xiao, F.; Wang, X.-K.; Deng, M.-H.; Wang, J.-Q.; Li, J.-B. Stochastic configuration network based on improved whale optimization algorithm for nonstationary time series prediction. J. Forecast. 2022, 41, 1458–1482. [Google Scholar] [CrossRef]
Wang, X.-K.; Hou, W.-H.; Zhang, H.-Y.; Wang, J.-Q.; Goh, M.; Tian, Z.-P.; Shen, K.-W. KDE-OCSVM model using Kullback-Leibler divergence to detect anomalies in medical claims. Expert Syst. Appl. 2022, 200, 117056. [Google Scholar] [CrossRef]
Smith, B.L.; Demetsky, M.J. Traffic flow forecasting: Comparison of modeling approaches. J. Transp. Eng. 1997, 123, 261–266. [Google Scholar] [CrossRef]
Stephanedes, Y.J.; Michalopoulos, P.G.; Plum, R.A. Improved estimation of traffic flow for real time control. Transp. Res. Rec. 1980, 7, 28–39. [Google Scholar]
Kaysi, I.; Ben-Akiva, M.E.; Koutsopoulos, H. An Integrated Approach to Vehicle Routing and Congestion Prediction for Real-Time Driver Guidance; Transportation Research Board: Washinton, DC, USA, 1993; Volume 1408. [Google Scholar]
Levin, M.; Tsao, Y.D. On forecasting freeway occupancies and volumes. Transp. Res. Rec. J. Transp. Res. Board 1980, 773, 47–49. [Google Scholar]
Hamed, M.M.; Al-Masaeid, H.R.; Said, Z.M.B. Short-term prediction of traffic volume in urban arterials. J. Transp. Eng. 1995, 121, 249–254. [Google Scholar] [CrossRef]
Van Der Voort, M.; Dougherty, M.; Watson, S. Combining kohonen maps with arima time series models to forecast traffic flow. Transp. Res. Part C Emerg. Technol. 1996, 4, 307–318. [Google Scholar] [CrossRef]
Williams, B.M. Multivariate vehicular traffic flow prediction: Evaluation of ARIMAX modeling. Transp. Res. Rec. J. Transp. Res. Board 2001, 1776, 194–200. [Google Scholar] [CrossRef]
Kumar, S.V.; Vanajakshi, L. Short-term traffic flow prediction using seasonal ARIMA model with limited input data. Eur. Transp. Res. Rev. 2015, 7, 21. [Google Scholar] [CrossRef]
Zhang, L.; Liu, Q.; Yang, W.; Wei, N.; Dong, D. An improved K-nearest neighbor model for short-term traffic flow prediction. Procedia Soc. Behav. Sci. 2013, 96, 653–662. [Google Scholar] [CrossRef]
Xiaoyu, H.; Yisheng, W.; Siyu, H. Short-term traffic flow forecasting based on two-tier K-nearest neighbor algorithm. Procedia Soc. Behav. Sci. 2013, 96, 2529–2536. [Google Scholar] [CrossRef]
Cai, L.; Yu, Y.; Zhang, S.; Song, Y.; Xiong, Z.; Zhou, T. A sample-rebalanced outlier-rejected K-nearest neighbor regression model for short-term traffic flow forecasting. IEEE Access 2020, 8, 22686–22696. [Google Scholar] [CrossRef]
Lin, G.; Lin, A.; Gu, D. Using support vector regression and K-nearest neighbors for short-term traffic flow prediction based on maximal information coefficient. Inf. Sci. 2022, 608, 517–531. [Google Scholar] [CrossRef]
Hong, W.-C.; Dong, Y.; Zheng, F.; Wei, S.Y. Hybrid evolutionary algorithms in a SVR traffic flow forecasting model. Appl. Math. Comput. 2011, 217, 6733–6747. [Google Scholar] [CrossRef]
Hong, W.-C. Application of seasonal SVR with chaotic immune algorithm in traffic flow forecasting. Neural Comput. Appl. 2012, 21, 583–593. [Google Scholar] [CrossRef]
Ma, C.; Zhao, Y.; Dai, G.; Xu, X.; Wong, S.C. A Novel STFSA-CNN-GRU Hybrid Model for Short-Term Traffic Speed Prediction. IEEE Trans. Intell. Transp. Syst. 2023, 24, 3728–3737. [Google Scholar] [CrossRef]
Tang, W.M.; Yiu, K.F.C.; Chan, K.Y.; Zhang, K. Conjoining congestion speed-cycle patterns and deep learning neural network for short-term traffic speed forecasting. Appl. Soft Comput. 2023, 138, 110154. [Google Scholar] [CrossRef]
Zhuang, W.; Cao, Y. Short-Term Traffic Flow Prediction Based on a K-Nearest Neighbor and Bidirectional Long Short-Term Memory Model. Appl. Sci. 2023, 13, 2681. [Google Scholar] [CrossRef]
Tan, M.-C.; Wong, S.C.; Xu, J.-M.; Guan, Z.-R.; Zhang, P. An aggregation approach to short-term traffic flow prediction. IEEE Trans. Intell. Transp. Syst. 2009, 10, 60–69. [Google Scholar]
Cetin, M.; Comert, G. Short-term traffic flow prediction with regime switching models. Transp. Res. Rec. 2006, 1965, 23–31. [Google Scholar] [CrossRef]
Dimitriou, L.; Tsekeris, T.; Stathopoulos, A. Adaptive hybrid fuzzy rule-based system approach for modeling and predicting urban traffic flow. Transp. Res. Part C Emerg. Technol. 2008, 16, 554–573. [Google Scholar] [CrossRef]
Fang, W.; Zhuo, W.; Song, Y.; Yan, J.; Zhou, T.; Qin, J. Δfree-LSTM: An error distribution free deep learning for short-term traffic flow forecasting. Neurocomputing 2023, 526, 180–190. [Google Scholar] [CrossRef]
Liu, Z.; Guo, J.; Cao, J.; Wei, Y.; Huang, W. A hybrid short-term traffic flow forecasting method based on neural networks combined with K-nearest neighbor. Promet-Traffic Transport. 2018, 30, 445–456. [Google Scholar] [CrossRef]
Liu, Z.; Du, W.; Yan, D.-m.; Chai, G.; Guo, J.-h. Short-term traffic flow forecasting based on combination of k-nearest neighbor and support vector regression. J. Highw. Transp. Res. Dev. (Engl. Ed.) 2018, 12, 89–96. [Google Scholar] [CrossRef]
Luo, X.; Li, D.; Yang, Y.; Zhang, S. Spatiotemporal traffic flow prediction with KNN and LSTM. J. Adv. Transp. 2019, 2019, 4145353. [Google Scholar] [CrossRef]
Wang, X.; Wang, S.; Zhang, H.; Wang, J.; Li, L. The recommendation method for hotel selection under traveller preference characteristics: A cloud-based multi-criteria group decision support model. Group Decis. Negot. 2021, 30, 1433–1469. [Google Scholar] [CrossRef]
Zhou, S.; Wei, C.; Song, C.; Fu, Y.; Luo, R.; Chang, W.; Yang, L. A Hybrid Deep Learning Model for Short-Term Traffic Flow Pre-Diction Considering Spatiotemporal Features. Sustainability 2022, 14, 10039. [Google Scholar] [CrossRef]
Poryazov, S.; Andonov, V.; Saranova, E.; Atanassov, K. Two Approaches to the Traffic Quality Intuitionistic Fuzzy Estimation of Service Compositions. Mathematics 2022, 10, 4439. [Google Scholar] [CrossRef]
Jang, J.-S. ANFIS: Adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man Cybermetics 1993, 23, 665–685. [Google Scholar] [CrossRef]
Keskin, M.E.; Taylan, D.; Terzi, Ö. Adaptive neural-based fuzzy inference system (ANFIS) approach for modelling hydrological time series. Hydrol. Sci. J. 2006, 51, 588–598. [Google Scholar] [CrossRef]
Ahmadianfar, I.; Shirvani-Hosseini, S.; He, J.; Samadi-Koucheksaraee, A.; Yaseen, Z.M. An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction. Sci. Rep. 2022, 12, 4934. [Google Scholar] [CrossRef] [PubMed]
Acakpovi, A.; Ternor, A.T.; Asabere, N.Y.; Adjei, P.; Iddrisu, A.-S. Time Series Prediction of Electricity Demand Using Adaptive Neuro-Fuzzy Inference Systems. Math. Probl. Eng. 2020, 2, 4181045. [Google Scholar] [CrossRef]
Mohiyuddin, A.; Javed, A.R.; Chakraborty, C.; Rizwan, M.; Shabbir, M.; Nebhen, J. Secure Cloud Storage for Medical IoT Data using Adaptive Neuro-Fuzzy Inference System. Int. J. Fuzzy Syst. 2022, 24, 1203–1215. [Google Scholar] [CrossRef]
Chen, B.-P.; Ma, Z.-Q. Short-term traffic flow prediction based on ANFIS. In Proceedings of the 2009 International Conference on Communication Software and Networks, Chengdu, Sichuan, China, 27–28 February 2009; pp. 791–793. [Google Scholar]
Ghenai, C.; Al-Mufti, O.A.A.; Al-Isawi, O.A.M.; Amirah, L.H.L.; Merabet, A. Short-term building electrical load forecasting using adaptive neuro-fuzzy inference system (ANFIS). J. Build. Eng. 2022, 52, 104323. [Google Scholar] [CrossRef]
Li, D.; Meng, H.; Shi, X. Membership clouds and membership cloud generators. Comput. Res. Dev. 1995, 32, 15–20. [Google Scholar]
de Campos Souza, P.V. Fuzzy neural networks and neuro-fuzzy networks: A review the main techniques and applications used in the literature. Appl. Soft Comput. 2020, 92, 106275. [Google Scholar] [CrossRef]
Li, D.; Liu, C.; Gan, W. A new cognitive model: Cloud model. Int. J. Intell. Syst. 2009, 24, 357–375. [Google Scholar] [CrossRef]
Wang, D.; Zeng, D.; Singh, V.P.; Xu, P.; Liu, D.; Wang, Y.; Zeng, X.; Wu, J.; Wang, L. A multidimension cloud model-based approach for water quality assessment. Environ. Res. 2016, 149, 113–121. [Google Scholar] [CrossRef]
Yang, B.; Sun, S.; Li, J.; Lin, X.; Tian, Y. Traffic flow prediction using LSTM with feature enhancement. Neurocomputing 2019, 332, 320–327. [Google Scholar] [CrossRef]

Figure 1. X-cloud generator.

Figure 2. Multi-rule inference algorithm.

Figure 3. The improved fuzzy inference system.

Figure 4. Schematic diagram of PeMS platform data.

Figure 5. The intuitive change in different numbers of fuzzy clusters. (a) Changes of RMSE in different cluster numbers; (b) Rate of changes in different cluster numbers; (c) Running time of model in different cluster numbers.

Figure 6. IFS model traffic flow forecast results on 30 August 2019.

Figure 7. IFS model comparative experimental results.

Table 1. Changes in different numbers of fuzzy clusters.

Cluster Number	Train-RMSE	Rate of Change	Test RMSE	Rate of Change	Running Time	Rate of Change
2	19.7353		24.942		38.805
3	19.6758	−0.301%	24.9411	−0.004%	53.717	38.426%
4	19.638	−0.192%	24.901	−0.161%	86.691	61.386%
5	19.6192	−0.096%	24.8935	−0.030%	93.121	7.417%
6	19.6169	−0.012%	24.8747	−0.076%	104.101	11.791%
7	19.5946	−0.114%	24.964	0.359%	119.344	14.643%
8	19.584	−0.054%	25.027	0.252%	138.032	15.659%
9	19.5678	−0.083%	25.055	0.112%	165.541	19.929%
10	19.549	−0.099%	25.105	0.199%	217.927	31.646%
20	19.395	−0.783%	25.462	1.422%	421.537	93.430%
30	19.223	−0.888%	25.899	1.718%	1134.649	169.170%
40	19.188	−0.183%	26.050	0.582%	1833.513	61.593%
50	19.067	−0.631%	26.161	0.427%	2370.110	29.266%
60	19.016	−0.269%	26.266	0.402%	4070.100	71.726%
70	18.933	−0.436%	26.685	1.593%	4869.200	19.633%
80	18.852	−0.428%	27.232	2.049%	6630.072	36.163%
90	18.794	−0.306%	27.431	0.734%	7743.372	16.792%
100	18.718	−0.403%	28.234	2.925%	28,896.017	273.171%
Mean	19.598	−0.253%	25.785	0.736%	3276.992	57.167%

Table 2. Experiment 1 results.

	RMSE	MAE	MRE
Linear membership function	21.9185	18.2887	0.1149
Triangular membership function	20.9546	16.6365	0.1024
Gaussian membership function	20.7267	16.9891	0.1040
Cloud model	20.1170	16.3555	0.1002

Table 3. Experiment 2.

	RMSE	MAE	MRE
FCM	20.1170	16.3555	0.1002
Grid segmentation	20.7267	16.9891	0.1040
Subtractive clustering	20.5367	16.6745	0.1025

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, H.-W.; Wang, Y.-T.; Wang, X.-K.; Liu, Y.; Liu, Y.; Zhang, X.-Y.; Xiao, F. Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction. Mathematics 2023, 11, 2509. https://doi.org/10.3390/math11112509

AMA Style

Liu H-W, Wang Y-T, Wang X-K, Liu Y, Liu Y, Zhang X-Y, Xiao F. Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction. Mathematics. 2023; 11(11):2509. https://doi.org/10.3390/math11112509

Chicago/Turabian Style

Liu, He-Wei, Yi-Ting Wang, Xiao-Kang Wang, Ye Liu, Yan Liu, Xue-Yang Zhang, and Fei Xiao. 2023. "Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction" Mathematics 11, no. 11: 2509. https://doi.org/10.3390/math11112509

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cloud Model-Based Fuzzy Inference System for Short-Term Traffic Flow Prediction

Abstract

1. Introduction

2. Preliminaries

2.1. Fuzzy Inference Systems

2.2. Cloud Model

2.3. Cloud Inference Algorithm

2.3.1. Front Part Cloud and Back Part Cloud

2.3.2. Cloud Model Inference Rule Generator

3. Improved Fuzzy Inference System

4. The Experiments

4.1. Data Description and Indexes

4.2. Experiment Settings and Results

4.3. Experimental Results

4.4. Comparative Experiments

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI