Next Article in Journal
A New Chaotic System with Stable Equilibrium: Entropy Analysis, Parameter Estimation, and Circuit Design
Previous Article in Journal
Transpiration and Viscous Dissipation Effects on Entropy Generation in Hybrid Nanofluid Flow over a Nonlinear Radially Stretching Disk
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Forecasting Model Based on High-Order Fluctuation Trends and Information Entropy

1
School of Management Science and Engineering, Shandong University of Finance and Economics, Jinan 250014, China
2
Rensselaer Polytechnic Institute, Troy, NY 12180, USA
3
School of Management, Jiangsu University, Zhenjiang 212013, China
*
Author to whom correspondence should be addressed.
Entropy 2018, 20(9), 669; https://doi.org/10.3390/e20090669
Submission received: 31 July 2018 / Revised: 22 August 2018 / Accepted: 3 September 2018 / Published: 4 September 2018
(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

:
Most existing high-order prediction models abstract logical rules that are based on historical discrete states without considering historical inconsistency and fluctuation trends. In fact, these two characteristics are important for describing historical fluctuations. This paper proposes a model based on logical rules abstracted from historical dynamic fluctuation trends and the corresponding inconsistencies. In the logical rule training stage, the dynamic trend states of up and down are mapped to the two dimensions of truth-membership and false-membership of neutrosophic sets, respectively. Meanwhile, information entropy is employed to quantify the inconsistency of a period of history, which is mapped to the indeterminercy-membership of the neutrosophic sets. In the forecasting stage, the similarities among the neutrosophic sets are employed to locate the most similar left side of the logical relationship. Therefore, the two characteristics of the fluctuation trends and inconsistency assist with the future forecasting. The proposed model extends existing high-order fuzzy logical relationships (FLRs) to neutrosophic logical relationships (NLRs). When compared with traditional discrete high-order FLRs, the proposed NLRs have higher generality and handle the problem caused by the lack of rules. The proposed method is then implemented to forecast Taiwan Stock Exchange Capitalization Weighted Stock Index and Heng Seng Index. The experimental conclusions indicate that the model has stable prediction ability for different data sets. Simultaneously, comparing the prediction error with other approaches also proves that the model has outstanding prediction accuracy and universality.

1. Introduction

For stock market forecasts, summarizing the rules that can be used for future predictions from historical data is crucial. At the same time, due to the noise that is contained in the actual data, Song and Chissom [1,2,3] proposed a fuzzy time series method for general rule extraction. On this basis, some scholars proposed first-order models [4,5,6], whereas others further studied high-order models [7,8]. These studies not only imply that high-order models can reflect historical trends, but also emphasize the importance of the relationship between historical trends and current state.
However, most of above models summarize the logical relationships between current and historical trends that are only based on discrete states. In fact, in addition to the simple state-to-state relationship, the relationships between the historical states and the current states are also related to other features. For example, Lee et al. [9] suggested that the fluctuation trends in related stock markets are linked with each other. From this point of view, Lee et al. proposed a prediction model that is based on two related stock markets that extended such models to two-factor high-order prediction models. In addition to the range of external influences, other internal features of a time series are ignored by traditional autoregressive models. For example, a typical phenomenon, called “callback”, occurs after a period of continuous rise, which means that the inconsistencies in historical fluctuation trends might have considerable impacts on future fluctuation trends. In this area, Zhang et al. [10] extracted the inconsistent features of different types of digital signals and selected the optimal feature subsets from 16 entropy features. Jiang et al. [11] proposed a feature fusion model that is based on information entropy (IE) and probabilistic neural network, which uses IE theory to extract the characteristic entropies in vibration signals. These studies inspired us to think that IE can be employed to describe the inconsistency in a period of fluctuation in a time series.
The concept of information entropy was first proposed by Shannon in 1948 [12]. IE is a tool for measuring the degree of ordering of systems. The lower the IE, the less chaos, and the higher the order. Conversely, the larger the IE, the lower the order. IE solves the problem of quantification and describing system complexity and inconsistency. For example, in order to indicate the inconsistency of a battery due to small deviations and uncertainties in its production process, Duan et al. [13] developed a comprehensive evaluation method for battery inconsistency based on IE. Xu et al. [14] proposed the information entropy risk measure, which applied IE to describe the uncertainty in decision risk and the inconsistency of decision information. Based on the uncertainty of rainfall distribution and the inconsistency of information, Wang et al. [15] employed IE to analyze the time-variation in similar-scale rainfall networks in cities. Information entropy has been widely used in many other situations, such as graphics detection [16], algorithm optimization [17], environmental evaluation [18], and big data analysis and mining [19].
In order to fully express the three states of up, down, and inconsistency of a period in history in a logical relationship and to conveniently compare them, the expression variable should have three dimensions. This is similar to the structure of neutrosophic sets (NSs) and the similarity comparisons of different NSs. The NS was introduced by Smarandache [20], which consists of true membership, indeterminacy membership, and false membership. From the perspective of information representation, scholars have proposed three specific concepts that are based on the neutrosophic set: single-valued NSs [21], interval-valued NSs [22], and simplified NSs [23], including single-valued and interval-valued NSs [24]. These concepts seek a more detailed representation of the information, enabling the NSs to express inconsistent information more accurately. At present, NSs have attracted the attention of scholars in various fields. For instance, Abdel-Basse et al. [25] discussed the role of NSs in urban construction and further analyzed its advantages in the construction of imperfect and incomplete information systems. Şahin et al. [26] discussed the application of single-valued NSs in decision making. Van et al. [27] applied NSs to green supplier evaluation and selection. In addition, since similarity is an important method for measuring NSs, similarity was also used in numerous studies [28,29,30].
As mentioned above, stock market fluctuation is a dynamic nonlinear system. A common question is how complex systems can be synchronized and which parameters have a direct impact on the system’s evolution. Researchers think that there are relationships between history, current, and future. It is more likely a small-world network [31]. Therefore, high-order fuzzy time series models have been developed to describe the fluctuations in a time period and generate forecasting rules that are based on discrete high-order states for future prediction. The most problem of these models is the lack of corresponding rules due to limited training samples. This would significantly reduce the accuracy and credibility of such models. The proposed model summarizes the discrete high-order fluctuation states to truth-membership of upper-trend, falsity-membership of upper-trend, and chaos of trends, and then maps them to the three dimensions of truth-membership, falsity-membership, and indeterminacy-membership of a neutrosophic set. It extends existing high-order fuzzy logical relationships to neutrosophic logical relationships. Therefore, the model can employ NS theory to deal with related issues, such as the comparison of high-order states during the location of rules. Compared with traditional discrete high-order fuzzy logical relationships (FLRs), the proposed to neutrosophic logical relationships (NLRs) have higher generality and can address the problem that is caused by the lack of rules in the forecasting stage.
Inspired by the above research, we propose a prediction model based on high-order fluctuation trends and information entropy. First, the original time series of the stock market is converted into a fluctuation time series, and then the fluctuation time series is blurred into a fuzzy fluctuation time series according to a predefined label. Second, the IE of its historical fluctuations is calculated based on the probability of different states of each current value. Third, the NS is used to represent the current state and to establish a neutrosophic logical relationships. Fourth, the Jaccard similarity measure is used to seek out similar appropriate logical relationship groups and calculate its expected NSs. Finally, we obtain the desired expected NSs, then the final predicted value is calculated through the process of deneutrosophication. For verification, the proposed model is implemented to forecast Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) and Heng Seng Index (HIS). The experimental conclusions indicate that the model has stable prediction ability for different data sets. Simultaneously, comparing the prediction error with other approaches also proves that the model has outstanding prediction accuracy and universality.
The remainder of this article can be summarized as follows: Section 2 introduces some basic concepts of fuzzy fluctuation time series, IE, deneutrosophication of a neutrosophic set, and neutrosophic logical relationships. Section 3 presents a new prediction method that is based on high-order fluctuation trends and NSs. In Section 4, the TAIEX data set (1997–2005) and HIS (1998–2012) are used to predict future values. Section 5 presents the conclusions of this paper and the potential issues for future research.

2. Preliminaries

2.1. Fuzzy Set (FS)

Fuzzy set theory was proposed by Zadeh [32], and it has been widely applied in several fields. A brief introduction to the basic concepts of fuzzy set follows.
Definition 1.
Denote the universe of discourse as U = {u1, u1, …, un}. A fuzzy set L = {L1, L2, …, Lg} in U can be defined by its membership function:
Lj = fLj(u1)/u1 + fLj (u2)/u2 +…+ fLj (un)/un (j = 1, 2, …, g)
where fLj: U → [0, 1] is the membership function of the fuzzy set Lj, and fLj(ui) is the membership degree of ui belonging to Lj, where fLj(ui) ∈ [0, 1] (I = 1, 2, …, n, j = 1, 2, ..., g). The symbol + is not the conventional operation of addition but union.
Let fuzzy set L = {L1, L2, …, Lg} be a finite and fully ordered discrete term set, where g is an odd number in real situations with the first (g − 1)/2 elements describe the degree of a property and the last (g − 1)/2 elements from the opposite description. For example, when g = 7, it might represent a set of linguistic variants as: L = {L1, L2, L3, L4, L5, L6, L7} = {very bad, bad, below fair, fair, above fair, good, very good}, and so on. The relationship between the element Li(I = 1, 2, …, g) and its subscript i is strictly monotonically increasing [33], so the function can be defined, as follows: f: Li = f(i). Clearly, the function f(i) is a strictly monotonically increasing function about a subscript i.

2.2. Fuzzy Time Series (FTS)

Fuzzy time series was proposed by Song and Chissom in 1993 [2]. It has been successfully used to solve many practical problems. The brief introduction of basic concepts of fuzzy time series, as follows.
Definition 2.
Let the time series {Y(t)|t = 1, 2, …, T}, a subset of real number, denote the universe of discourse. According to Definition 1, each element in the time series can be fuzzified to a fuzzy set element F(t)(t = 1, 2, …, T), then {F(t)|t = 1, 2, …, T} is called a fuzzy time series defined on the time series {Y(t)|t = 1, 2, …, T}. If there exists a fuzzy relation R(t – p, t), such that
F(t) = F(t − p) ◦ R(t − p, t)
where ◦ is a max-min composition operator, F(t) is called derived from F(t − p), denoted by the fuzzy logical relationship (FLR) F(t − p) → F(t). F(t – p) and F(t) are called the left-hand side (LHS) and the right-hand side (RHS) of the FLR, respectively. FLRs with the same LHS can be categorized into an ordered fuzzy logical group (FLG).
If F(t) is caused by F(t − 1), F(t − 2), …, F(tp), then the p-order FLR can be represented by: F(tp), F(tp + 1), …, F(t – 1) → F(t)

2.3. Fuzzy-Fluctuation Time Series (FFTS)

Definition 3.
Let {X(t)|t = 1, 2, …, T} be a time series, where t is the number of the time series. Let {Y(t)|t = 2,3…T} be a fluctuation time series, where {Y(t) = X(t) − X(t − 1)|t = 2, 3, …, T}. Then the corresponding fuzzy time series {Q(t)|t = 2, 3, …, T} is called a fuzzy-fluctuation time series (FFTS).
Definition 4.
Let Q(t) (t = m + 1, m + 2, …, T, m ≥ 1) be a FFTS. If Q(t) is caused by Q(t − 1), Q(t − 2), …, Q(t − m), then the fuzzy-fluctuation logical relationship is represented by:
Q(t − 1), Q(t − 2), …, Q(tm) → Q(t)
and it is called the m-order fuzzy-fluctuation logical relationship (FFLR) of the fuzzy-fluctuation time series, where Q(t − 1), Q(t − 2), …, Q(t − m) is called the left-hand side (LHS) and Q(t) is called the right-hand side (RHS) of the FFLR, and Q(k)(k = t, t − 1,t − 2,…, t − m) ∈ L.

2.4. Information Entropy

Information entropy (IE) [12] was firstly defined by Shannon in 1948 as a measure of event uncertainty. Shannon suggested that the smaller the possibility of an incident, the greater the amount of information that it contains. Conversely, the greater the likelihood of an event, the smaller the amount of information. So, the amount of information can be expressed as a function of the probability of occurrence of an event.
Definition 5.
E = i = 1 N p ( x i ) l o g 2 ( p ( x i ) )
where p(xi) represents the probability of occurrence of the ith event. In addition, the information entropy must satisfy the following conditions: i = 1 N p ( x i ) = 1 ,   0 < p ( x i ) < 1 and non-negativity: E 0 .

2.5. Neutrosophic Set (NS)

Neutrosophic set (NS) was proposed by Smarandache [20]. It has been widely used to describe complex phenomena.
Definition 6.
Let X be a space of points (objects), with a generic element in X denoted by x. A neutrosophic set A in X is characterized by a truth-membership function TA(x), a indeterminacy-membership function IA(x), and a falsity-membership function FA(x). The functions TA(x), IA(x), and FA(x) are real standard or nonstandard subsets of ]0,1+[. There is no restriction on the sum of TA(x), IA(x), and FA(x).

2.6. Neutrosophic Logical Relationship (NLR)

Definition 7.
Let i be the subscript of a fuzzy set element Li(i = 1, 2, …, g), A(t) be the LHS of an m-order FFLR Q ( t 1 ) , Q ( t 2 ) , , Q ( t m ) Q ( t ) , PiA(t) be the probabilities of corresponding Li(i = 1, 2, …, g) in A(t). PiA(t) can be generated by:
P A ( t ) i = j = 1 m w i , j m           i = 1 , 2 , , g
where w i , j = 1 if Q(t − j) = Li and 0 otherwise.
Let A(t) be the LHS of a m-order FFLR Q ( t 1 ) , Q ( t 2 ) , , Q ( t m ) Q ( t ) , EA(t) be the entropy of A(t), PiA(t) be the probabilities of corresponding Li(i = 1, 2, …, g) in A(t). A(t) can be represented by a neutrosphic set NA(t), with the truth-membership function T N A ( t ) , indeterminacy-membership I N A ( t ) , and falsity-membership function F N A ( t ) defined by:
{ T N A ( t ) = i = 1 g 1 2 α i P A ( t ) i I N A ( t ) = E A ( t ) F N A ( t ) = i = g 1 2 + 1 g β i P A ( t ) i
where α i , β i are the weights of Li(i = 1, 2, …, g) in terms of their contribution to represent the corresponding characteristics, i = 1 ( g 1 ) / 2 α i = 1, i = g 1 2 + 1 g β i = 1 .
The same as the definition of FLG, NA(t) is called the left-hand side (LHS) of a neutrosophic logical relationship (NLR). The similar LHSs of NLRs can be categorized and group their RHSs of corresponding FFLRs into an ordered fuzzy logical group (FLG) BA(t), which can also be represented by a neutrosophic set NA(t), referring to the above method.
{ T N A ( t ) = i = 1 ( g 1 ) / 2 α i P B A ( t ) i I N A ( t ) = E B A ( t ) F N A ( t ) = i = g 1 2 + 1 g β i P B A ( t ) i
where P B A ( t ) i represents the probability of corresponding Li(i = 1, 2, …, g) in BA(t).
Thus, a neutrosophic logical relationship (NLR) can be defined as: N A ( t ) N A ( t ) .
Definition 8.
Let NA(t1) and NA(t2) be two NSs. The Jaccard similarity [34] between NA(t1) and NA(t2) in vector space can be expressed as follows:
J ( N A ( t 1 ) , N A ( t 1 ) ) = T N A ( t 1 ) T N A ( t 2 ) + I N A ( t 1 ) I N A ( t 2 ) + F N A ( t 1 ) F N A ( t 2 ) ( T N A ( t 1 ) ) 2 + ( I N A ( t 1 ) ) 2 + ( F N A ( t 1 ) ) 2 + ( T N A ( t 2 ) ) 2 + ( I N A ( t 2 ) ) 2 + ( F N A ( t 2 ) ) 2 ( T N A ( t 1 ) T N A ( t 2 ) + I N A ( t 1 ) I N A ( t 2 ) + F N A ( t 1 ) F N A ( t 2 ) )

2.7. Deneutrosophication of a Neutrosphic Set

Definition 9.
Deneutrosophication of a neutrosophic fluctuation set refers to converting a neutrosophic set x into a fuzzy set y by the following function according to Ali et al. [35]:
f ( T ( x ) , I ( x ) , F ( x ) ) : ( [ 0 , 1 ] , [ 0 , 1 ] , [ 0 , 1 ] ) [ 0 , 1 ]
Specifically, in this paper, our equation for deneutrosophication is as follows:
y = γ T ( x ) + θ I ( x ) + δ F ( x )

3. Proposed Model Based on High-Order Fluctuation Trends and Information Entropy

This paper presents a prediction model that is based on high-order fluctuation trends and information entropy. Different from existing high-order fuzzy time series forecasting models, the proposed model summarizes the discrete high-order fluctuation states into truth-membership of upper-trend, falsity-membership of upper-trend, and chaos of trends. It coincides with the definition of neutrosophic set, which employs the three dimensions of truth-membership, falsity-membership, and indeterminacy-membership to describe a characteristic. Based on the NS expression of high-order fluctuation states, the NS theory can be used to deal with related issues, such as the comparison of high-order states during the location of rules. The most significant contribution of this model is that it extends existing high-order fuzzy logical relationships to neutrosophic logical relationships and it employs information entropy to represent the indeterminacy of fluctuation trends. When compared with traditional discrete high-order FLRs, the proposed NLRs have higher generality and can handle the problem caused by the lack of rules in the forecasting stage. The detailed steps are shown as follow steps and in Figure 1.
Step 1 Construct FFTS for the historical training data.
For each element X(t)(t = 1, 2, …, T) in the historical training time series, its fluctuation trend is determined by Y(t) = X(t) − X(t − 1) (t = 2, 3, …, T). Y(t) can be fuzzified into a linguistic set {L1, L2, L3, L4, L5} = {down, slightly down, equal, slightly up, up}. Let len be the whole mean of all the elements in the fluctuation time series Y(t)(t = 2, 3, …, T), and we define intervals u 1 = [ , 1.5 l e n ] , u 2 = [ 1.5 l e n , 0.5 l e n ] , u 3 = [ 0.5 l e n , 0.5 l e n ] , u 4 = [ 0.5 l e n , 1.5 l e n ] , and u 5 = [ 1.5 l e n , + ] . Then, Y(t)(t = 2, 3, …, T) can be fuzzified into a fuzzy fluctuation time series Q(t)(t = 2, 3, …, T).
Step 2 Establish m-order FFLRs for the training data set
According to Definition 4, each Q(t) (t > m) in the training set can be represented by its previous m days’ fuzzy-fluctuation numbers. Then, the m-order FFLRs of the prediction model can be established.
Step 3 Calculate fluctuation information entropy
According to the Definition 5, the information entropy of the m-order fluctuation can be separately calculated. Among them, p(x1), p(x2), p(x3), p(x4), and p(x5), respectively, indicate the probability of occurrence of the linguistic variants L1, L2, L3, L4, and L5 in the LHS. Then, the information entropy of the m-order FFLRs can be obtained according to Equation (4), which will be the element of the indeterminacy membership of the NSs.
Step 4 Convert the LHSs of FFLRs to NSs
Calculate the probability of corresponding Li(i = 1, 2, …, 5) in the LHSs of training data set, according to Equation (5). Combined with the information entropy of the m-order FFLRs obtained in the previous step, convert the left-hands of FFLRs into neutrosophic sets according to Equation (6).
Step 5 Grouping and optimization
Applying the Jaccard similarity measure method, categorize and group the converted LHSs of FFLRs according their similarities. Convert the RHSs of corresponding FFLRs into neutrosophic sets according to Equation (7). NLRs are obtained from the training data sets for future forecasting.
Step 6 Forecast test time series
The Jaccard similarity measurement method can be used to find the most appropriate NLR for each test data set. According to Definition 9, the RHS of the NLR can be converted to an expectation fuzzy value Y(i + 1). Then, calculate the real number of the fluctuation by: F ( i + 1 ) = Y ( i + 1 ) × l e n . Finally, the predicted value can be obtained from the actual value of the previous day X(i) and the predicted fluctuation value F(i + 1): X ( i + 1 ) = X ( i ) + F ( i + 1 ) .

4. Empirical Analysis

4.1. Forecasting Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX)

The Taiwan stock market has always been the focus of research in the field of stock market forecasting. The TAIEX is an indicator of the change in the value of Taiwan’s overall market stocks, which is seen as a window to the Taiwanese economy. In addition, a large number of studies have used TAIEX data as an example to illustrate their proposed prediction methods [36,37,38,39,40,41]. To facilitate a comparison with the accuracy of these models, we also used the TAIEX data set to illustrate our proposed method. Specifically, while referring to the general practice of the above studies, the forecasting process in this section is based on TAIEX’s 1999 data. Similarly, we also selected data from January 1999 to October 1999 as a training set, and selected from November to December 1999 as a test data set.
Step 1 Construct FFTS for the historical training data
The fluctuation trend is constructed based on the elements of the historical training data set. Then, while using the overall mean of the number of fluctuations in the training data set, the fluctuation trend is blurred to FFTS, e.g., the average value of the training data set for TAIEX. Therefore, len is 85. Then, X(1) = 6152.43, X(2) = 6199.91, and Y(2) = X(2) − X(1) = 47.48. Further, F(2) = L4, means slightly up. Thus, the training set can be converted to fuzzy fluctuation set.
Step 2 Establish m-order FFLRs for the training data set
Different orders directly affect the effect of the prediction. Here, we only need to explain the process, so we take the 9th-order as an example. Then, according to Definition 4, the 9th-order FFLRs of the prediction model were established.
Step 3 Calculating fluctuation information entropy
According to Equation (4), the information entropy of the 9th-order fluctuation time series can be calculated separately. For example, the 9th-order FFLRs on November 1, 1999 can be represented by (L3, L3, L2, L4, L3, L3, L3, L3, L5), which can be calculated as p(x1) = 0, p(x2) = 1/9, p(x3) = 6/9, p(x4) = 1/9, and p(x5) = 1/9. Therefore, its information entropy is 1.4466, which can be normalized to 0.5306.
Step 4 Convert the LHSs of FFLRs to NSs
Calculate the probability of corresponding Li(i = 1, 2, …, 5) in the LHSs of the training data set and then convert the left-hands of FFLRs into neutrosophic sets according to Equation (6).
Step 5 Grouping and optimization
Firstly, apply the Jaccard similarity measure method to categorize and group the converted LHSs of FFLRs. In this example, the threshold similarity value is set to 0.94. Then, convert the RHSs of the corresponding FFLRs into neutrosophic sets according to Equation (7). The conversion and grouping process is shown in Figure 2.
Step 6 Forecast test time series
For each test data point, calculate its 9th-order historical fluctuation states and convert them to a neutrosophic set. Locate the NLR with the highest similarity. Then, the corresponding RHS of the NLR is the forecasting neutrosophic value. For example, we chose 1 November 1999 as an example of the test data. The LHS of its fluctuation states is (L3, L3, L4, L3, L2, L3, L5, L4, L4, L3, L4, L4, L3, L4, L3, L2, L2, L3, L4, L5, L4, L4, L3, L3, L3), which can be represented by a neutrosophic set (0.0480, 0.6085, 0.1920). According to the similarity comparison, the most appropriate NLR for forecasting is (0.0444, 0.5306, 0.1111) → (0.0480, 0.6085, 0.1920).
Then, calculate the predicted fuzzy fluctuation:
Y ( i + 1 ) = ( 0.0480 ) + 0.1920 = 0.144
Calculate the real number of the fluctuation:
F ( i + 1 ) = Y ( i + 1 ) × l e n = 0.144 × 85 = 12.24
Finally, The predicted value can be obtained from the actual value of the previous day and the predicted fluctuation value:
X ( i + 1 ) = X ( i ) + F ( i + 1 ) = 7854.85 + 12.24 = 7867.09
According to the above steps, we can obtain other prediction results that are shown in Table 1 and Figure 3.
In experimental analysis, some of the methods are used to measure the accuracy of prediction in order to quantify the effect of model prediction. These methods are mainly used in the prediction field, including the mean percentage error (MPE), the mean square error (MSE), and the root mean squared error (RMSE). Since there is no significant difference in the effect of different error calculation methods, RMSE was chosen as the main formula for error calculation.
R M S E = t = 1 n ( f o r e c a s t ( t ) a c t u a l ( t ) ) 2 n
Since different orders affect the prediction effect, in order to reduce the experimental error and improve the accuracy of the prediction, it was necessary to select the optimal order. Experimental analysis showed that when the order was nine, the predictability of the model is better. Table 2 shows the experimental errors for different years under different orders.
It is important and necessary to accurately predict the trend of fluctuations. Some of the most recent articles have proposed many excellent prediction methods. Therefore, by comparing our method with the previous methods, the advantages and disadvantages of our model can be verified.
Table 3 shows the prediction errors for different methods from 1997 to 2005. By comparing with the latest research, we found that the overall prediction effect of this method was excellent. For example, the average error of Guan et al.’s model [40] is 94.5, the average error of Cheng et al.’s model [41] is 102.4, and the error of the proposed model is 92.16. Upon further analysis, from the annual prediction error results, we can see that this method can effectively predict the trend from 1997 to 2005, which is more universal.

4.2. Forecasting Hong Kong Heng Seng Index (HIS)

HIS is one of the representative indices in Asia. It is not only an important indicator of the Hong Kong stock market price, but is also a stock price index that reflects the most influential trends in the Hong Kong stock market prices. By comparing several authoritative prediction methods, we attempted to verify the universality of the model in other stock markets. Table 4 and Figure 4 compare the different prediction methods from 1998 to 2012.
As shown in Table 4, the average prediction error of the Yu [42] method is 359.66, the Wan [43] method is 395.26, and the Ren [44] method is 526.46. The average prediction error of proposed method is 248.23, which is the smallest error. In addition, it can be seen from Figure 4 that the prediction result of the method is more stable and the prediction effect is more prominent.

5. Conclusions

This paper proposed a prediction model that is based on logical rules abstracted from historical dynamic fluctuation trends and the corresponding inconsistencies. During the logical rule training stage, the two dimensions of truth-membership and false-membership of neutrosophic sets were mapped to the dynamic trend states of up and down, respectively. Information entropy was employed to quantify the inconsistency of a period of history and was mapped to the indeterminacy membership of the neutrosophic sets. In the forecasting stage, the similarities among neutrosophic sets were employed to locate the most similar left side of a logical relationship. Therefore, the two characteristics of fluctuation trends and inconsistency assisted with future forecasting. The proposed model was implemented to forecast TAIEX and HIS. The experimental results showed that the model has stable prediction ability for different data sets. Simultaneously, comparing the prediction error with other approaches also proved that our model has outstanding prediction accuracy and universality. This study used two datasets, TAIEX and HIS. Hence, the next study requires data from other stock markets to further validate the model. This study discussed high-order FFTS to characterize the historical fluctuation trends and high-order information fluctuation entropy to measure the inconsistency of historical fluctuations. While considering the relationships among high-order history, current and future, it is likely to use network theory to handle such forecasting problems. In future research, flexible network models should be constructed and more network approaches should be introduced to this area.

Author Contributions

Data curation, Z.D.; Supervision, H.G.; Validation, H.G., S.G. and A.Z.; Writing—original draft, Z.D.; Writing—review & editing, A.Z.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series—Part I. Fuzzy Sets Syst. 1993, 54, 1–9. [Google Scholar] [CrossRef]
  2. Song, Q.; Chissom, B.S. Fuzzy time series and its models. Fuzzy Sets Syst. 1993, 54, 269–277. [Google Scholar] [CrossRef]
  3. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series—Part II. Fuzzy Sets Syst. 1994, 62, 1–8. [Google Scholar] [CrossRef]
  4. Pritpal, S.; Gaurav, D. A hybrid fuzzy time series forecasting model based on granular computing and bio-inspired optimization approaches. J. Comput. Sci. 2018, 1877–7503. [Google Scholar]
  5. Che-Ngoc, H.; Tai, V.V.; Huynh-Le, Q.C.; Ho, V.; Nguyen-Trang, T.; Chu-Thi, M. An Improved Fuzzy Time Series Forecasting Model. Available online: https://www.researchgate.net/publication/321947685/download (accessed on 3 September 2018).
  6. Guo, H.; Pedrycz, W.; Liu, X. Fuzzy time series forecasting based on axiomatic fuzzy set theory. Neural Comput. Appl. 2018, 3, 1–12. [Google Scholar] [CrossRef]
  7. Gautam, S.S.; Singh, S.R. A refined method of forecasting based on high-order intuitionistic fuzzy time series data. In Progress in Artificial Intelligence; Springer: Berlin, Germany, 2018; pp. 1–12. [Google Scholar]
  8. Pannu, H.S.; Tripathi, A. High-order fuzzy time series forecasting model for advance prediction of temperature. In Proceedings of the International Conference on Inventive Computation Technologies, Coimbatore, India, 26–27 August 2016; pp. 1–6. [Google Scholar]
  9. Lee, L.W.; Wang, L.H.; Chen, S.M.; Leu, Y.-H. Handling forecasting problems based on two-factors high-order fuzzy time series. IEEE Trans. Fuzzy Syst. 2006, 14, 468–477. [Google Scholar] [CrossRef]
  10. Zhang, Z.; Li, Y.; Jin, S.; Zhang, Z.; Wang, H.; Qi, L.; Zhou, R. Modulation Signal Recognition Based on Information Entropy and Ensemble Learning. Entropy 2018, 20, 198. [Google Scholar] [CrossRef]
  11. Jiang, Q.; Shen, Y.; Li, H.; Xu, F. New Fault Recognition Method for Rotary Machinery Based on Information Entropy and a Probabilistic Neural Network. Sensors 2018, 18, 337. [Google Scholar] [CrossRef] [PubMed]
  12. Núñez, J.A.; Cincotta, P.M.; Wachlin, F.C. Information entropy. Celest. Mech. Dyn. Astron. 1996, 64, 43–53. [Google Scholar] [CrossRef]
  13. Duan, B.; Li, Z.; Gu, P.; Wang, Y.; Wu, J.; Wang, L.; Zou, X.; Liu, J.; Zou, Y.; He, R. Evaluation of battery inconsistency based on information entropy. J. Energy Storage 2018, 16, 160–166. [Google Scholar] [CrossRef]
  14. Xu, X.H.; Luo, X. Information entropy risk measure applied to large group decision-making method. Soft Comput. 2018, 1–11. [Google Scholar] [CrossRef] [Green Version]
  15. Wang, W.; Wang, D.; Singh, V.P.; Wang, Y.; Wu, J.; Wang, L.; Zou, X.; Liu, J.; Zou, Y.; He, R. Optimization of rainfall networks using information entropy and temporal variability analysis. J. Hydrol. 2018, 559, 136–155. [Google Scholar] [CrossRef]
  16. Huo, W.; Huang, Y.; Pei, J.; Zhang, Q.; Gu, Q.; Yang, J. Ship Detection from Ocean SAR Image Based on Local Contrast Variance Weighted Information Entropy. Sensors 2018, 18, 1196. [Google Scholar] [CrossRef] [PubMed]
  17. Liu, Z.; Shang, P. Generalized information entropy analysis of financial time series. Phys. A Stat. Mech. Appl. 2018, 505, 1170–1185. [Google Scholar] [CrossRef]
  18. Liang, X.; Si, D.; Xu, J. Quantitative Evaluation of the Sustainable Development Capacity of Hydropower in China Based on Information Entropy. Sustainability 2018, 10, 529. [Google Scholar] [CrossRef]
  19. Mahdi, O.A.; Pardede, E.; Cao, J. Combination of information entropy and ensemble classification for detecting concept drift in data stream. In Proceedings of the Australasian Computer Science Week Multiconference, Brisband, Queensland, Australia, 29 January–2 February 2018. [Google Scholar] [CrossRef]
  20. Smarandache, F. A unifying field in logic. In Multiple-Valued Logic; IEEE: Piscataway, NJ, USA, 1999; Volume 8, pp. 489–503. [Google Scholar]
  21. Wang, H.; Smarandache, F.; Zhang, Y.Q.; Sunderraman, R. Single valued neutrosophic sets. Rev. Air Force Acad. 2010, 4, 410–413. [Google Scholar]
  22. Wang, H.; Smarandache, F.; Zhang, Y.Q.; Sunderraman, R. Interval Neutrosophic Sets and Logic: Theory and Applications in Computing; Hexis: Phoenix, AZ, USA, 2005. [Google Scholar]
  23. Ye, J. A multicriteria decision-making method using aggregation operators for simplified neutrosophic sets. Intell. Fuzzy Syst. 2014, 26, 2459–2466. [Google Scholar]
  24. Ye, J.; Cui, W. Exponential Entropy for Simplified Neutrosophic Sets and Its Application in Decision Making. Entropy 2018, 20, 357. [Google Scholar] [CrossRef]
  25. Abdel-Basset, M.; Mai, M. The Role of Single Valued Neutrosophic Sets and Rough Sets in Smart City: Imperfect and Incomplete Information Systems. Measurement 2018, 124, 47–55. [Google Scholar] [CrossRef]
  26. Şahin, R.; Liu, P. Correlation coefficient of single-valued neutrosophic hesitant fuzzy sets and its applications in decision making. Neural Comput. Appl. 2017, 28, 1387–1395. [Google Scholar] [CrossRef]
  27. Van, L.H.; Yu, V.F.; Dat, L.Q.; Dung, C.C.; Chou, S.; Loc, N.V. New Integrated Quality Function Deployment Approach Based on Interval Neutrosophic Set for Green Supplier Evaluation and Selection. Sustainability 2018, 10, 838. [Google Scholar] [CrossRef]
  28. Wu, H.; Yuan, Y.; Wei, L.; Pei, L. On entropy, similarity measure and cross-entropy of single-valued neutrosophic sets and their application in multi-attribute decision making. In Soft Computing; Springer: Berlin, Germany, 2018. [Google Scholar]
  29. Uluçay, V.; Deli, I.; Şahin, M. Similarity measures of bipolar neutrosophic sets and their application to multiple criteria decision making. Neural Comput. Appl. 2018, 29, 739–748. [Google Scholar] [CrossRef]
  30. Ye, J.; Du, S. Some distances, similarity and entropy measures for interval-valued neutrosophic sets and their relationship. Int. J. Mach. Learn. Cybern. 2017. [Google Scholar] [CrossRef]
  31. Bucolo, M.; Fazzino, S.; Rosa, M.L.; Fortuna, L. Small-world networks of fuzzy chaotic oscillators. Chaos Solitons Fractals 2003, 17, 557–565. [Google Scholar] [CrossRef]
  32. Zadeh, L.A. The Concept of a Linguistic Variable and its Application to Approximate Reasoning. Inf. Sci. 1974, 8, 199–249. [Google Scholar] [CrossRef]
  33. Herrera, F.; Herrera-Viedma, E.; Verdegay, J.L. A model of consensus in group decision making under linguistic assessments. Fuzzy Sets Syst. 1996, 79, 73–87. [Google Scholar] [CrossRef]
  34. Fu, J.; Ye, J. Simplified neutrosophic exponential similarity measures for the initial evaluation/diagnosis of benign prostatic hyperplasia symptoms. Symmetry 2017, 9, 154. [Google Scholar] [CrossRef]
  35. Ali, M.; Minh, N.V.; Le, H.S. A Neutrosophic Recommender System for Medical Diagnosis Based on Algebraic Neutrosophic Measures. Appl. Soft Comput. 2017. [Google Scholar] [CrossRef]
  36. Chen, S.M.; Chang, Y.C. Multi-variable fuzzy forecasting based on fuzzy clustering and fuzzy rule interpolation techniques. Inf. Sci. 2010, 180, 4772–4783. [Google Scholar] [CrossRef]
  37. Chen, S.M.; Chen, C.D. TAIEX Forecasting Based on Fuzzy Time Series and Fuzzy Variation Groups. IEEE Trans. Fuzzy Syst. 2011, 19, 1–12. [Google Scholar] [CrossRef]
  38. Chen, S.M.; Manalu, G.M.; Pan, J.S.; Liu, H.C. Fuzzy Forecasting Based on Two-Factors Second-Order Fuzzy-Trend Logical Relationship Groups and Particle Swarm Optimization Techniques. IEEE Trans. Cybern. 2013, 43, 1102–1117. [Google Scholar] [CrossRef] [PubMed]
  39. Cheng, S.H.; Chen, S.M.; Jian, W.S. Fuzzy time series forecasting based on fuzzy logical relationships and similarity measures. Inf. Sci. 2016, 327, 272–287. [Google Scholar] [CrossRef]
  40. Guan, S.; Zhao, A. A Two-Factor Autoregressive Moving Average Model Based on Fuzzy Fluctuation Logical Relationships. Symmetry 2017, 9, 207. [Google Scholar] [CrossRef]
  41. Cheng, C.; Yang, J. Fuzzy time-series model based on rough set rule induction for forecasting stock price. Neurocomputing 2018, 302, 33–45. [Google Scholar] [CrossRef]
  42. Yu, H.K. A refined fuzzy time-series model for forecasting. Phys. A Stat. Mech. Appl. 2005, 346, 657–681. [Google Scholar] [CrossRef]
  43. Wan, Y.; Si, Y.W. Adaptive neuro fuzzy inference system for chart pattern matching in financial time series. Appl. Soft Comput. 2017, 57, 1–18. [Google Scholar] [CrossRef]
  44. Ren, Y.; Suganthan, P.N.; Srikanth, N. A Novel Empirical Mode Decomposition with Support Vector Regression for Wind Speed Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2016, 27, 1793–1798. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flow chart of prediction model based on high-order Fuzzy-Fluctuation time series (FFTS) and information entropy (IE).
Figure 1. Flow chart of prediction model based on high-order Fuzzy-Fluctuation time series (FFTS) and information entropy (IE).
Entropy 20 00669 g001
Figure 2. Conversion and group process of fuzzy-fluctuation logical relationship (FFLRs).
Figure 2. Conversion and group process of fuzzy-fluctuation logical relationship (FFLRs).
Entropy 20 00669 g002
Figure 3. The stock market fluctuation of predicted and actual values for November to December 1999.
Figure 3. The stock market fluctuation of predicted and actual values for November to December 1999.
Entropy 20 00669 g003
Figure 4. Root mean squared errors (RMSEs) of forecast errors for the Heng Seng Index (HIS) from 1998 to 2012.
Figure 4. Root mean squared errors (RMSEs) of forecast errors for the Heng Seng Index (HIS) from 1998 to 2012.
Entropy 20 00669 g004
Table 1. Predicted and actual Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) values from 1 November 1999 to 30 December 1999.
Table 1. Predicted and actual Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) values from 1 November 1999 to 30 December 1999.
Date (DD/MM/YYYY)ActualForecast(Forecast − Actual)2Date (DD/MM/YYYY)ActualForecast(Forecast − Actual)2
1/11/19997814.897867.092724.841/12/19997766.207723.081859.33
2/11/19997721.597826.6611,039.542/12/19997806.267768.721408.89
3/11/19997580.097723.1420,462.003/12/19997933.177807.1115,891.12
4/11/19997469.237577.8511,798.994/12/19997964.497932.851001.33
5/11/19997488.267466.21486.036/12/19997894.467964.174859.38
6/11/19997376.567485.2911,822.597/12/19997827.057893.824458.83
8/11/19997401.497370.82940.508/12/19997811.027825.78217.71
9/11/19997362.697395.821097.829/12/19997738.847810.165086.43
10/11/19997401.817357.091999.6610/12/19997733.777736.728.67
11/11/19997532.227404.9016,210.1513/12/19997883.617734.8322,134.74
15/11/19997545.037526.69336.3614/12/19997850.147882.731062.35
16/11/19997606.207541.784149.9415/12/19997859.897849.27112.73
17/11/19997645.787606.201566.5816/12/19997739.767868.7316,633.26
18/11/19997718.067653.404180.8317/12/19997723.227747.76602.21
19/11/19997770.817719.842598.3418/12/19997797.877730.154586.55
20/11/19997900.347784.5413,409.4620/12/19997782.947799.05259.55
22/11/19998052.317919.6817,589.4421/12/19997934.267784.1022,546.71
23/11/19998046.198058.99163.8022/12/19998002.767938.254161.84
24/11/19997921.858052.6417,105.5723/12/19998083.498002.586546.57
25/11/19997904.537925.99460.5824/12/19998219.458084.7318,148.43
26/11/19997595.447908.6298,080.8427/12/19998415.078224.4236,348.84
29/11/19997823.907597.7451,147.4028/12/19998448.848418.62913.11
30/11/19997720.877823.1310,456.55Root Mean Square Error(RMSE)102.05
Table 2. Comparison of forecasting errors for different m-orders.
Table 2. Comparison of forecasting errors for different m-orders.
m
67891011121314
RMSE102.76102.52102.2102.05102.61102.89102.9102.88103.4
Table 3. Performance comparison of prediction root mean squared error (RMSE) with other models for TAIEX from 1997 to 2005.
Table 3. Performance comparison of prediction root mean squared error (RMSE) with other models for TAIEX from 1997 to 2005.
MethodRMSE
199719981999200020012002200320042005Average
Chen and Chang [36]NN123.64131.1115.0873.0666.3660.48N94.95
Chen and Chen [37]140.86144.13119.32129.87123.1271.0165.1461.94N106.92
Chen et al. [38]138.41113.88102.34131.25113.6265.7752.2356.16N96.71
Cheng et al. [39]NN100.74125.62113.0462.9451.4654.24N84.67
Guan et al. [40]141.89119.8599.03128.62125.6466.2953.256.1155.8394.05
Cheng et al. [41]N120.8110.7150.6113.266.053.158.653.5102.4
Our model140.33114.35102.05129.97113.3266.2654.6655.1953.3392.16
Table 4. Performance comparison of prediction RMSE with other models for the Hong Kong Heng Seng Index (HIS) from 1998 to 2012.
Table 4. Performance comparison of prediction RMSE with other models for the Hong Kong Heng Seng Index (HIS) from 1998 to 2012.
Method199819992000200120022003200420052006200720082009201020112012Average
Yu [42]291.4469.6297.05316.85123.7186.16264.34112.4252.44912.67684.9442.64382.06419.67239.11359.66
Wan et al. [43]326.62637.1356.7299.43155.09226.38239.63147.2466.241847.82179437.24445.41688.04477.34595.26
Ren et al. [44]296.67761.9356.81254.07155.4199.58540.191127407.891028.7593.8435.18718.33578.7442.44526.46
Our model200.72224.81254.56158.88105.53122.99104.51103.66177.49686.79466.81311.76273.49348.57182.85248.23

Share and Cite

MDPI and ACS Style

Guan, H.; Dai, Z.; Guan, S.; Zhao, A. A Forecasting Model Based on High-Order Fluctuation Trends and Information Entropy. Entropy 2018, 20, 669. https://doi.org/10.3390/e20090669

AMA Style

Guan H, Dai Z, Guan S, Zhao A. A Forecasting Model Based on High-Order Fluctuation Trends and Information Entropy. Entropy. 2018; 20(9):669. https://doi.org/10.3390/e20090669

Chicago/Turabian Style

Guan, Hongjun, Zongli Dai, Shuang Guan, and Aiwu Zhao. 2018. "A Forecasting Model Based on High-Order Fluctuation Trends and Information Entropy" Entropy 20, no. 9: 669. https://doi.org/10.3390/e20090669

APA Style

Guan, H., Dai, Z., Guan, S., & Zhao, A. (2018). A Forecasting Model Based on High-Order Fluctuation Trends and Information Entropy. Entropy, 20(9), 669. https://doi.org/10.3390/e20090669

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop