Next Article in Journal
Unambiguous Models and Machine Learning Strategies for Anomalous Extreme Events in Turbulent Dynamical System
Next Article in Special Issue
Legendre Polynomial Fitting-Based Permutation Entropy Offers New Insights into the Influence of Fatigue on Surface Electromyography (sEMG) Signal Complexity
Previous Article in Journal
A Possibilistic Formulation of Autonomous Search for Targets
Previous Article in Special Issue
Benefits of Zero-Phase or Linear Phase Filters to Design Multiscale Entropy: Theory and Application
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Tsallis Entropy-Based Complexity-IPE Casualty Plane: A Novel Method for Complex Time Series Analysis

1
School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China
2
Cognitive Radio and Information Processing Key Laboratory Authorized by China’s Ministry of Education Foundation, Guilin University of Electronic Technology, Guilin 541004, China
*
Author to whom correspondence should be addressed.
Entropy 2024, 26(6), 521; https://doi.org/10.3390/e26060521
Submission received: 15 May 2024 / Revised: 11 June 2024 / Accepted: 14 June 2024 / Published: 17 June 2024
(This article belongs to the Special Issue Ordinal Pattern-Based Entropies: New Ideas and Challenges)

Abstract

:
Due to its capacity to unveil the dynamic characteristics of time series data, entropy has attracted growing interest. However, traditional entropy feature extraction methods, such as permutation entropy, fall short in concurrently considering both the absolute amplitude information of signals and the temporal correlation between sample points. Consequently, this limitation leads to inadequate differentiation among different time series and susceptibility to noise interference. In order to augment the discriminative power and noise robustness of entropy features in time series analysis, this paper introduces a novel method called Tsallis entropy-based complexity-improved permutation entropy casualty plane (TC-IPE-CP). TC-IPE-CP adopts a novel symbolization approach that preserves both absolute amplitude information and inter-point correlations within sequences, thereby enhancing feature separability and noise resilience. Additionally, by incorporating Tsallis entropy and weighting the probability distribution with parameter q, it integrates with statistical complexity to establish a feature plane of complexity and entropy, further enriching signal features. Through the integration of multiscale algorithms, a multiscale Tsallis-improved permutation entropy algorithm is also developed. The simulation results indicate that TC-IPE-CP requires a small amount of data, exhibits strong noise resistance, and possesses high separability for signals. When applied to the analysis of heart rate signals, fault diagnosis, and underwater acoustic signal recognition, experimental findings demonstrate that TC-IPE-CP can accurately differentiate between electrocardiographic signals of elderly and young subjects, achieve precise bearing fault diagnosis, and identify four types of underwater targets. Particularly in underwater acoustic signal recognition experiments, TC-IPE-CP achieves a recognition rate of 96.67%, surpassing the well-known multi-scale dispersion entropy and multi-scale permutation entropy by 7.34% and 19.17%, respectively. This suggests that TC-IPE-CP is highly suitable for the analysis of complex time series.

1. Introduction

The concept of entropy was first introduced by Clausius in 1854 as a measure of disorder in thermodynamic systems. In 1948, Shannon extended this concept to the field of communications by proposing information entropy, thereby enabling the quantification of information [1]. In recent years, entropy has garnered increasing attention and has been widely applied in various fields [2,3,4], including system identification, time series analysis, and signal recognition.
This paper focuses on the application of entropy in time series analysis. Currently, entropy algorithms used to measure the randomness of time series are primarily based on the definitions of conditional entropy and Shannon entropy. A notable example of the former is approximate entropy (ApEn), proposed by Pincus in 1991 [5]. ApEn evaluates the similarity between sequences in phase space, effectively measuring the complexity of time series. However, ApEn has significant drawbacks, including a strong dependence on the length of the time series and relatively high error rates. In 2002, Richman and colleagues improved upon ApEn and proposed sample entropy (SampEn), which addresses the self-matching issue in ApEn and enhances algorithm stability [6]. Chen and co-researchers argued that SampEn’s use of the Heaviside function, a hard threshold method for measuring sequence similarity, leads to inaccurate entropy estimation. To mitigate this, they replaced the Heaviside function with a fuzzy membership function, improving the algorithm’s performance. Despite these advancements, all three types of algorithms are affected by the tolerance factor r and have high computational complexity, making them unsuitable for real-time processing.
A notable representative of entropy algorithms based on Shannon entropy is permutation entropy (PE) [7], proposed by Bandt in 2002. PE utilizes “symbol patterns” to characterize time series, thereby reducing the computational complexity of the algorithm. This approach has seen widely applied across various fields. However, PE neglects amplitude information, leading to a diminished ability to distinguish between different time series. Although subsequent improvements such as weighted permutation entropy [8] and modified permutation entropy [9] have been proposed, these methods fail to simultaneously consider both the absolute amplitude information of sample points and the temporal correlation between them. In 2016, Mostafa and colleagues introduced the dispersion entropy (DispEn) algorithm [10], which not only offers fast computation but is also less affected by abrupt signal changes. In 2018, Azami and colleagues further refined DispEn by incorporating signal fluctuations, resulting in fluctuation dispersion entropy (FDispEn) [11], which exhibits enhanced noise resistance compared to DispEn. In addition, methods such as bubble entropy, distribution entropy, slope entropy, and attention entropy can measure the randomness of time series from different perspectives; however, none of these methods have been as widely applied in practice as PE and DispEn.
All above mentioned entropy algorithms operate at a single scale, providing a solitary entropy estimate for a given time series. However, in practical engineering scenarios, time series generated by complex systems exhibit inherent complexity that cannot be adequately captured by single-scale entropy features. Recognizing this challenge, Costa proposed a coarse-graining technique to decompose the original series into sequences at various scales for multi-scale entropy estimation [12]. This approach, known as multi-scale entropy estimation, enables a more comprehensive understanding of the system’s complexity. The entropy values obtained at different scales are then integrated into a feature vector. Through the integration of coarse-graining techniques with entropy algorithms, scholars have introduced a range of multi-scale entropy algorithms, such as multi-scale permutation entropy (MPE) [13], multi-scale sample entropy [12], and multi-scale dispersion entropy (MDispEn) [14]. The adoption of this coarse-graining strategy significantly enhances the discriminative power of entropy features across diverse signal types.
Entropy measures serve as effective tools for quantifying the randomness and disorder present in time series data. However, they often fall short in capturing the underlying structural correlations that are intricately woven within these sequences, thus failing to provide a comprehensive characterization of time series complexity. In 1995, Lopez, Mancini, and Calbert proposed a novel measure of complexity known as statistical complexity (SCM) [15]. SCM is derived by multiplying the imbalance of probability density distribution by the information entropy. Initially, SCM utilized histogram and bisection methodologies to depict the probability density distribution of time series. Subsequently, researchers amalgamated algorithms such as Shannon entropy and permutation entropy to devise the complexity-entropy causality space (CECS). Zhang et al. conducted financial data analysis utilizing complexity-entropy methodologies [16], while Li et al. employed similar techniques to investigate the dynamic characteristics of gas–liquid two-phase flow [17]. Silva et al. applied CECS to examine monthly rainfall time series. These studies collectively underscore the efficacy of CECS. However, it is noteworthy that the majority of these entropy computations rely on Shannon entropy or permutation entropy, thereby inheriting the limitations associated with amplitude information loss and inadequate handling of equal elements within embedding vectors.
This study introduces a Tsallis entropy-based complexity-improved permutation entropy casualty plane (TC-IPE-CP) to address challenges encountered in traditional entropy algorithms during the extraction of features from complex time series, including substantial data point requirements, limited separability, and weak noise resilience. Departing from conventional complexity-permutation entropy planes, this method adopts a novel symbolization approach that preserves both absolute amplitude information and inter-point correlations within sequences, thereby enhancing feature separability and noise resilience. Additionally, by incorporating Tsallis entropy and weighting the probability distribution P with parameter q [18,19], it integrates with statistical complexity to establish a feature plane of complexity and entropy, further enriching signal features. Through the integration of multiscale algorithms, a multiscale Tsallis-improved permutation entropy algorithm is also developed.
The paper’s structure is outlined as follows: Section 2 delineates the fundamental principles of TC-IPE-CP, Section 3 and Section 4 offer analyses of simulated signals and complex time series utilizing various multiscale entropy algorithms, and Section 5 furnishes a comprehensive summary of the paper’s entirety.

2. Methods

2.1. Tsallis Improved Permutation Entropy (TIPE)

The procedural delineation of the TIPE algorithm unfolds as follows:
  • Step 1. For a time series X = { x 1 , x 2 , , x N }, normalization is performed by employing Equation (1) for the cumulative distribution function, resulting in the normalized sequence, denoted as y i . In the equation, μ and δ 2 , respectively, represent the mean and variance.
    y i = 1 δ 2 π x i e t μ 2 2 δ 2 d t
  • Step 2. By setting the embedding dimension m and time delay τ , the time series is reconstructed as follows:
    Y j = y j , y j + τ , , y j + m 1 τ ,
    where Y j represents the j-th row of Y , and 1 j N m 1 τ .
  • Step 3. By applying the uniform quantization operator (UQO) as defined in Equation (3), the first column Y(:,1) is transformed into S(:,1), representing the first column of the symbolized phase space.
    U Q O u = 0 1 L 1 y m i n μ < Δ + y m i n y m i n + Δ μ < 2 Δ + y m i n y m a x Δ < μ y m a x
    In Equation (3), y m a x and y m i n are, respectively, the maximum and minimum values of y , L is the predetermined discretization parameter, and Δ represents the discrete interval, which satisfies Δ = ( y m a x y m i n )/L.
  • Step 4. Define Y(:,k) as the k-th column of Y, where 2 k m . Symbolize Y(:,k) using Equation (4) to obtain S(:,k).
    S j , k = S j , 1 + Y j , k Y j , 1 / Δ ,
    where 1 j N m 1 τ , and ⌊⌋ represents the floor function, which means rounding down to the nearest integer. After symbolizing all components of Y, the resulting entity is represented as the symbolic phase space S. Furthermore, each row of S is referred to as a symbolic pattern (SP), which incorporates both permutation relations and amplitude information. According to Equation (4), after symbolization, each pattern contains m elements, with each element having L possible states. Hence, the total number of symbolic patterns is L m .
  • Step 5. In contrast to other approaches, here we employ Tsallis entropy to compute entropy values, which is a generalized form of Shannon entropy.
    S q P = j = 1 L m P j ln q 1 P j ,
    where ln q x = x 1 q 1 1 q and q 1 , ln 1 x = ln x   for   any   x > 0 , q represents the weighting coefficient. When the probability distribution follows a uniform distribution, the maximum value of Tsallis q entropy is S q U = l n q L m ( U = { 1 / L m } j = 1 , , L m ), and L represents a pre-defined discretization factor. The normalized TIPE can be obtained as follows:
    H q [ P ] = S q P S q U = j = 1 L m P j ln q 1 P j l n q L m

2.2. Multiscale Tsallis-Improved Permutation Entropy

In the analysis of intricate time series, it is customary to employ multiple temporal scales to assess the sequence’s complexity, a feature that surpasses the explanatory capacity of traditional entropy measures reliant solely on single-scale analysis. Therefore, we propose the concept of multiscale Tsallis improved permutation entropy (MTIPE). The time series X = { x 1 , x 2 , , X n } undergoes a coarse-graining process [12] according to Equation (7) resulting in the output sequence y j S at scale s. The sequence y j S is input into the IPE algorithm to compute entropy values at scale s.
y j S = 1 s i = j 1 s + 1 j s x i , 1 j N / s

2.3. TC-IPE-CP

Complexity entropy casualty plane (CECP) has been demonstrated to have certain advantages in revealing the causal relationships between internal structure and behavior within systems [20,21,22,23]. But the CECP based on the permutation entropy algorithm inherits its limitations. We propose a complexity-IPE causality plane based on the Tsallis entropy.
The complexity measure adopts Jensen–Shannon divergence to assess the imbalance between probability distributions and uniform distributions, which is the same as CECP. The calculation steps of TC-IPE-CP are as follows:
  • Step 1. Define the imbalance Δ q [ P ] of the probability density distribution P according to Equation (8).
    Δ q P = D q P , U D q P , U
    where U = { 1 / L m } j = 1 , . . , L m represents the uniform distribution, and D q ( P , U ) represents the distance between the probability density distribution P and the uniform distribution U. D q P , U = max D q P , U is the maximum Jensen–Shannon divergence (JSD) distance. The calculation of D q ( P , U ) is shown in Equation (9).
    D q P , U = 1 2 Κ q P P + U 2 + 1 2 Κ q U P + U 2     = 1 2 j = 1 P j 1 L m P j ln q P j + 1 L m 2 P j 1 2 j = 1 L m 1 L m ln q ( P j + 1 L m ) L m 2  
  • Step 2. Multiplying the normalized Tsallis entropy H q [ P ] by the imbalance Δ q [ P ] yields the following complexity:
    C q [ P ] = Δ q P · H q P = D q P , U · H q P D q P , U
Plotting H q P as the horizontal axis and C q P as the vertical axis, we can obtain the TC-IPE-CP feature. Combining the coarse-graining process defined by Equation (7), different scales of TC-IPE-CP can be obtained. The multiscale version of TC-IPE-CP is expressed as MTC-IPE-CP in the following.

3. Synthetic Data Analysis

In this section, TC-IPE-CP is employed to analyze several different simulated signals. As indicated by the analysis in Section 2, this algorithm requires appropriate parameter settings, including embedding dimension m, discretization parameter L, parameter factor q, and time delay τ . According to the conclusions of the paper [24], the range of m from 3 to 7 and a time delay τ = 1 are considered to yield better results. Unless explicitly stated otherwise, we set the discretization parameter L in the range of 2 to 8, m = 4, τ = 1, and q ranges from 0 to 100 with a step size of 0.001.

3.1. Noise Signals

Noise in time series analysis can significantly impact the predictability of data and the accuracy of models. White noise and pink noise are two common types of noise in time series analysis, characterized by their power spectral density distributions represented as S W f = C W and S P f = C P / f , where C W and C P are constants.
In order to validate the reparability and reliability of the TC-IPE-CP under short time series conditions, we vary the data length L from 10 to 2010 with a step size of 50 sample points. For each data length, 20 sets of independent noise are generated. The results are depicted in the Figure 1 below. Figure 1a,b depicts the TC-IPE-CP characteristics of white noise and pink noise for data lengths of 210 and 510, respectively. The standard deviation of the results from multiple experiments is represented by shaded regions. In the Figure 1a, the blue curve represents the characteristic of white noise, where entropy H q decreases initially to 0.46 with increasing q, and then increases to 1; and complexity C q decreases initially to 0.2 with increasing q, and then increases to 0.43. The trend of H q   C q varying with the parameter q forms a curve on the plane. The red curve corresponds to the characteristic of pink noise, similarly forming a curve on the plane. From the distribution of the characteristic curves on the plane, it is easy to distinguish between these two types of noise. Furthermore, the observed consistency across multiple experiments, as indicated by the small magnitude of the shaded areas, underscores the algorithm’s robust stability. For a more comprehensive display of the influence of data length on entropy H q and complexity C q , we present curves depicting the variations of H q and C q with increasing length L, where q = 0.1, 1.01, and 2.01. The figures clearly demonstrate that classification of the two types of noise signals is feasible even when the length L is less than 100.

3.2. Autoregressive (AR) Time Series

AR model is a process in which regression variables are derived from the variable itself. It describes a linear regression model of a random variable at a future time by incorporating a linear combination of random variables from previous time steps. AR processes can be generated by Equation (11).
A R p t = i = 1 p α i A R t i + n ( t )
where n ( t ) is white Gaussian noise (WGN) with zero mean and unit variance, p denotes the order of the AR sequence, when p = 0 implies that A R 0 is equivalent to WGN. The α i represents autocorrelation coefficients. The parameter configurations for AR time series of varying orders are outlined in Table 1. Typically, a higher order of AR corresponds to increased sample correlation, enhanced predictability, and diminished randomness within the sequence.
Following the reference [25], we conducted feature analysis on AR processes of different orders using the TC-IPE-CP. We conducted independent analysis experiments on 20 sets of AR sequences ranging from order one to eight and WGN, with the length of the AR set to N = 10,000. The results are illustrated in Figure 2. Figure 2a depicts the results of 20 independent experiments conducted using TC-IPE-CP on AR sequences of eight different orders and WGN, and the shaded areas represent the standard deviation. In Figure 2a, the H q - C q characteristic curves for AR sequences, ranging from AR one to eight, systematically expand outward. Additionally, the clear differentiation between WGN and the eighth AR signals underscores the algorithm’s ability to effectively discriminate signals with diverse dynamical characteristics. Figure 2b presents the results of experiments on 20 sets of AR sequences using Renyi complexity permutation entropy casualty plane (RC-PE-CP) (with alpha values ranging from 0.001 to 4.01 with a step size of 0.001), keeping other parameters consistent with TIPE. By comparing Figure 2a,b, which represent two types of complexity-entropy plane methods, it can be concluded that the TC-IPE-CP exhibits superiority in discriminating AR signals.
Following the analysis method outlined in reference [19], we identify the characteristic point q H corresponding to the minimum value of H q for each signal type, which is annotated as a black in the figure. Similarly, the characteristic point q C , representing the maximum value of C q for each signal type, is identified and annotated as a black × in the figure. It is evident that there are significant differences in q H and q C across each signal type, suggesting this as a distinguishing feature. To validate this point, we plot the q- H q curve in Figure 2c and the q- C q curve in Figure 2d, while simultaneously annotating the positions of q H and q C . It is evident that utilizing q H and q C provides effective discrimination for each type of AR signal and WGN.

3.3. Noisy Lorenz Signal

In this section, we assess the algorithm’s performance under noisy conditions by generating signals at different SNR through the introduction of Gaussian white noise into the Lorenz time series. We solve the Lorenz system represented by Equation (12) using the fourth-order Runge–Kutta scheme with a time step of Δt = 0.001, recording 50,000 data points. For each signal-to-noise ratio condition, we conducted 20 independent experiments using both TC-IPE-CP and RC-PE casualty plane, and calculated the entropy and complexity. From the results in Figure 3a, it can be observed that the curves of H q and C q under different (SNR) conditions for Lorenz signals in the TC-IPE-CP exhibit a high degree of similarity. Moreover, the curve for −10 dB closely resembles the curve for the normal signal, indicating that noise has the least impact on the performance of the TC-IPE-CP. Furthermore, the trends of the q- H q and q- C q curves for Lorenz chaotic signals under different SNR in Figure 3c,d are largely consistent, providing additional evidence of the superior noise resistance performance of the TC-IPE-CP. In comparison, the curves for RC-PE-CP of Lorenz signals under different SNR in Figure 3b exhibit relatively scattered patterns.
x ˙ = 10 ( y x ) y ˙ = x 28 z y z ˙ = x y 8 3 x

4. Experimental Data Analysis

In this section, we validate the algorithm’s performance in handling real complex time series by conducting data analysis on experimental data of RR intervals, bearing fault signals, and acoustic signals using TC-IPE-CP.

4.1. RR Intervals

This section analyzes the RR intervals data from the Fantasia dataset [26], which includes data from young and elderly healthy participants. The age range of the young participants is 21 to 34 years, while the age range of the elderly participants is 68 to 85 years.
To quantitatively assess the differences in entropy values between young and elderly individuals, we used the non-parametric Mann–Whitney U test. This method determines the significance of group differences through the p-value, with a lower p-value indicating more significant differences. The analysis results in Figure 4a,b are derived from the author’s article [27], where the p-value for PE is 0.2792 and the p-value for DispEn is 0.0038. In Figure 4c,d, the same dataset is processed using TC-IPE-CP. The analysis, using the q H and q C proposed in this paper, yields p-values of 0.0173 and 0.0022 for q H and q C , respectively. In Figure 4, p-values smaller than 0.01 and 0.001247 are represented by ** and ***, respectively. The p-value for PE is 0.2792, which is greater than 0.01. The p-values for DispEn, q H , and q C are 0.0038, 0.0173, and 0.0022, respectively, all falling between 0.01 and 0.001. They can all be represented by **, providing strong evidence for their ability to distinguish between two signal types.

4.2. Bearing Fault Signals

In this section, we selected four types of signals for analysis: normal signals with a fault diameter of 0.5334 mm and a motor speed of 1750, as well as signals representing rolling element faults, inner race faults, and outer race faults. Each dataset consisted of 120,000 data points. The dataset consists of four types of signals: normal signals, representing ball faults (BFs), inner race faults (IRFs), and outer race faults (ORFs) [28]. For ease of analysis, each dataset was divided into 10 segments, each containing 12,000 sample points. The bearing fault data analyzed was sourced from the Bearing Data Center at Case Western Reserve University.
The results of multiscale entropy analysis for the four types of bearing fault signals are depicted in Figure 5. In Figure 5a, in the multiscale PE results, the entropy curves appear to be quite close to each other. In Figure 5b, the multiscale DispEn results show that the entropy information of the four types of faults is well separated from each other. In Figure 5c,d, it can be observed that the feature curves of q H and q C in the MTC-IPE-CP algorithm are well separated from each other. To demonstrate this point, Figure 5e and Figure 5f, respectively, depict the characteristic curve plots of H q - C q for different fault information at scales one and five. From the distribution of curves on the plane, it is easy to distinguish between different types of faults.

4.3. Underwater Acoustic Signals

In this section, the MTC-IPE-CP is utilized to analyze four types of underwater acoustic signals, i.e., ambient noise, passenger boats, ocean liners, and motorboats [29]. The traditional multiscale PE and multiscale DispEn are also used for comparison purpose. Due to the large volume of data, each type of signal is segmented into multiple samples with a length of 3 s. The sampling frequency is set to 52,734 Hz, with each sample consisting of 158,202 data points. Table 2 provides detailed information about the datasets used in this experiment, which include four types of signals. Each class of data involves signals collected from different vessels.
The analysis results of multiscale PE, multiscale DispEn, and MTC-IPE-CP with a scale factor of 20 are depicted in Figure 6. As shown in Figure 6a, the three types of underwater acoustic signals, ocean liners, ocean noise, and motorboats, can be easily distinguished based on the magnitude of q H values at different scales. We also noticed with surprise that the trend of passenger q H decreases and then increases with the increase in scale, which is distinctly different from the other three types of signals. In Figure 6b, the q C values of ocean liners, passenger boats, and motorboats exhibit a decreasing trend followed by stabilization with the increase in scale. Additionally, there is a significant difference in the magnitude of q C values between scale one and five for these three types of ships. Unlike the others, ocean noise exhibits relatively minor changes in q C values as the scale increases. The results of multiscale PE are shown in Figure 6c, where it can be observed that as the scale increases, the curves of ocean liners, ocean noise, and motorboat PE exhibit significant overlap. In Figure 6d, which depicts the results of multiscale DispEn, there are distinct overlapping regions in entropy values between ocean noise and motorboats at different scales. In addition, Figure 6e,f presents the H q - C q feature curves at scales one and five, respectively. From the distribution of curves on the plane, it is straightforward to distinguish between the four types of underwater acoustic signals. The above results demonstrate that compared to multiscale PE and multiscale DispEn, MTC-IPE-CP shows greater potential for classifying the four types of underwater acoustic signals.
To further substantiate this, we trained and recognized the features extracted by TC-IPE-CP using a probabilistic neural network (PNN), quantifying the algorithm’s discriminative efficacy. It is worth noting that we combined the two features of MTC-IPE-CP, q H and q C , before classifying using PNN. During testing, we divided the training and testing sets for PNN. We selected 150 segments for each type of underwater acoustic signal as the testing set, and the remaining segments were used for training. The classification results of TC-IPE-CP for the four types of underwater acoustic signals are presented in Table 3. Additionally, we provide the classification results of MPE and multiscale DispEn algorithms in Table 4 and Table 5, respectively.
The results from the three tables indicate that MTC-IPE-CP, multiscale DispEn algorithm, and multiscale PE all achieved impressive recognition rates of 100% for the ocean liner category. However, in the passenger category, MTC-IPE-CP achieved a recognition rate of 96.67%, which is higher than the other two methods. Moreover, the algorithm’s classification performance for the four types of vessels exceeded 88.67%. It is noteworthy that multiscale PE achieved a recognition rate of only 12.67% for motorboats, and multiscale DispEn achieved a recognition rate of only 58.00% for ocean noise. Overall, MTC-IPE-CP achieved an accuracy of 94.17%, which is 7.34% higher than multiscale DispEn and 19.17% higher than multiscale PE. These analyses results demonstrate the superior performance of the MTC-IPE-CP in accurately identifying and distinguishing between various types of ships.

5. Conclusions

In order to augment the discriminative power and noise robustness of entropy features in time series analysis, this paper introduces two novel time series analysis algorithms: TC-IPE-CP and MTC-IPE-CP. To validate the effectiveness of these methods, a comprehensive evaluation was conducted using both simulated signals and experimental signals. The simulation results indicate that TC-IPE-CP requires a small amount of data, exhibits strong noise resistance, and possesses high separability for signals. When applied to the analysis of heart rate signals, fault diagnosis, and underwater acoustic signal recognition, our experimental findings demonstrate that TC-IPE-CP can accurately differentiate between electrocardiographic signals of elderly and young subjects, achieve precise bearing fault diagnosis, and identify four types of underwater targets. Particularly in underwater acoustic signal recognition experiments, MTC-IPE-CP achieves a recognition rate of 96.67%, surpassing the well-known multi-scale dispersion entropy and multi-scale permutation entropy by 7.34% and 19.17%, respectively. This suggests that our method is highly suitable for the analysis of complex time series.

Author Contributions

Conceptualization, Z.C. and C.W.; methodology, Z.C.; software, C.W.; writing—original draft preparation, Z.C.; writing—review and editing, H.Q.; funding acquisition, J.W. and H.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Special Program of Guangxi Science and Technology Base and Talent under Grant No. GuiKeAD21220098.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  2. Xu, K.X.; Wang, J. Weighted fractional permutation entropy and fractional sample entropy for nonlinear Potts financial dynamics. Phys. Lett. A 2017, 381, 767–779. [Google Scholar] [CrossRef]
  3. Li, Y.; Li, G.; Yang, Y.; Liang, X.; Xu, M. A fault diagnosis scheme for planetary gearboxes using adaptive multi-scale morphology filter and modified hierarchical permutation entropy. Mech. Syst. Signal Process. 2018, 105, 319–337. [Google Scholar] [CrossRef]
  4. Zhou, S.; Qian, S.; Chang, W. A novel bearing multi-fault diagnosis approach based on weighted permutation entropy and an improved SVM ensemble classifier. Sensors 2018, 18, 1934. [Google Scholar] [CrossRef] [PubMed]
  5. Pincus, S. Approximate entropy (ApEn) as a complexity measure. Chaos 1995, 5, 110–117. [Google Scholar] [CrossRef] [PubMed]
  6. Richman, J.S.; Moorman, J.R. Physiological time-series analysis using approximate entropy and sample entropy. Am. J. Physiol. 2000, 278, H2039–H2049. [Google Scholar] [CrossRef] [PubMed]
  7. Bandt, C.; Pompe, B. Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 2002, 88, 174102. [Google Scholar] [CrossRef] [PubMed]
  8. Fadlallah, B.; Chen, B.D.; Keil, A.; Principe, J. Weighted-permutation entropy: A complexity measure for time series incorporating amplitude information. Phys. Rev. E 2013, 88, 022911. [Google Scholar] [CrossRef] [PubMed]
  9. Bian, C.; Qin, C.; Ma, Q.D.; Shen, Q. Modified permutation-entropy analysis of heartbeat dynamics. Phys. Rev. E 2012, 85, 021906. [Google Scholar] [CrossRef]
  10. Rostaghi, M.; Azami, H. Dispersion entropy: A measure for time-series analysis. IEEE Signal Process. Lett. 2016, 23, 610–614. [Google Scholar] [CrossRef]
  11. Azami, H.; Escudero, J. Amplitude- and Fluctuation-Based Dispersion Entropy. Entropy 2018, 20, 210. [Google Scholar] [CrossRef] [PubMed]
  12. Costa, M.; Goldberger, A.L.; Peng, C.K. Multiscale entropy to distinguish physiologic and synthetic RR time series. In Computers in Cardiology; IEEE: New York, NY, USA, 2002; pp. 137–140. [Google Scholar]
  13. Aziz, W.; Arif, M. Multiscale Permutation Entropy of Physiological Time Series. In Proceedings of the 2005 Pakistan Section Multitopic Conference, Karachi, Pakistan, 23–24 December 2005; p. 334494. [Google Scholar]
  14. Azami, H.; Rostaghi, M.; Abásolo, D.; Escudero, J. Refined Composite Multiscale Dispersion Entropy and its Application to Biomedical Signals. IEEE Trans. Biomed. Eng. 2017, 2017, 2679136. [Google Scholar]
  15. López-Ruiz, R.; Mancini, H.L.; Calbet, X. A statistical measure of complexity. Phys. Lett. A 1995, 209, 321–326. [Google Scholar] [CrossRef]
  16. Zhang, B.; Shang, P.; Liu, J. Transition-based complexity-entropy causality diagram: A novel method to characterize complex systems. Commun. Nonlinear Sci. Numer. Simul. 2021, 95, 105660. [Google Scholar] [CrossRef]
  17. Li, X.; Fan, C.; Qin, J.; Yang, R. Refined composite multivariate multiscale complexity-entropy causality plane analysis for gas-liquid two-phase flow. Z. Naturforschung A 2023, 78, 907–920. [Google Scholar] [CrossRef]
  18. Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
  19. Ribeiro, H.V.; Jauregui, M.; Zunino, L.; Lenzi, E.K. Characterizing time series via complexity-entropy curves. Phys. Rev. E 2017, 95, 062106. [Google Scholar] [CrossRef] [PubMed]
  20. Jauregui, M.; Zunino, L.; Lenzi, E.K.; Mendes, R.; Ribeiro, H. Characterization of time series via Rényi complexity–entropy curves. Phys. A Stat. Mech. Its Appl. 2018, 498, 74–85. [Google Scholar] [CrossRef]
  21. Freitas, C.G.; Rosso, O.A.; Aquino, A.L. Mapping Network Traffic Dynamics in the Complexity-Entropy Plane. In Proceedings of the 2020 IEEE Symposium on Computers and Communications, Rennes, France, 7–10 July 2020; pp. 1–6. [Google Scholar]
  22. Peng, K.; Shang, P. Characterizing ordinal network of time series based on complexity-entropy curve. Pattern Recognit. 2022, 124, 108464. [Google Scholar] [CrossRef]
  23. Lee, M. Early warning detection of thermoacoustic instability using three-dimensional complexity-entropy causality space. Exp. Therm. Fluid Sci. 2022, 130, 110517. [Google Scholar] [CrossRef]
  24. Chen, Z.; Li, Y.A.; Liang, H.T.; Yu, J. Improved permutation entropy for measuring complexity of time series under noisy condition. Complexity 2019, 2019, 1403829. [Google Scholar] [CrossRef]
  25. Zhang, Y.; Shang, P.; He, J.; Xiong, H. Cumulative Tsallis entropy based on power spectrum of financial time series. Chaos Interdiscip. J. Nonlinear Sci. 2019, 29, 103118. [Google Scholar] [CrossRef]
  26. Iyengar, N.; Peng, C.K.; Morin, R.; Goldberger, A.L.; Lipsitz, L.A. Age-related alterations in the fractal scaling of cardiac interbeat interval dynamics. Am. J. Physiol. Regul. Integr. Comp. Physiol. 1996, 271, 1078–1084. [Google Scholar] [CrossRef] [PubMed]
  27. Chen, Z.; Ma, X.; Fu, J.; Li, Y. Ensemble Improved Permutation Entropy: A New Approach for Time Series Analysis. Entropy 2023, 25, 1175. [Google Scholar] [CrossRef] [PubMed]
  28. Ragavesh, D.; Scott, M.; Gordon, M. A Novel Bearing Faults Detection Method Using Generalized Gaussian Distribution Refined Composite Multiscale Dispersion Entropy. IEEE Trans. Instrum. Meas. 2022, 71, 3517112. [Google Scholar]
  29. Santos-Domínguez, D.; Torres-Guijarro, S.; Cardenal-López, A. ShipsEar: An underwater vessel noise database. Appl. Acoust. 2016, 113, 64–69. [Google Scholar] [CrossRef]
Figure 1. The TC-IPE-CP for different lengths of white noise and pink noise judgment results. (a) H q - C q curves with L = 210; (b) H q - C q curves with L = 510; (c) error bar plot of H q with q = 0.1; (d) error bar plot of C q with q = 1.1; (e) error bar plot of H q with q = 1.01; (f) error bar plot of C q with q = 1.01; (g) error bar plot of H q with q = 2.01; and (h) error bar plot of C q with q = 2.01.
Figure 1. The TC-IPE-CP for different lengths of white noise and pink noise judgment results. (a) H q - C q curves with L = 210; (b) H q - C q curves with L = 510; (c) error bar plot of H q with q = 0.1; (d) error bar plot of C q with q = 1.1; (e) error bar plot of H q with q = 1.01; (f) error bar plot of C q with q = 1.01; (g) error bar plot of H q with q = 2.01; and (h) error bar plot of C q with q = 2.01.
Entropy 26 00521 g001
Figure 2. The analysis results of 20 sets of autoregressive time series and white noise on TC-IPE-CP and RC-PE casualty plane. (a) H q - C q curves of TC-IPE-CP; (b) H - C curves of RC-PE-CP; (c) q- H q curves of TC-IPE-CP; and (d) q- C q curves of TC-IPE-CP.
Figure 2. The analysis results of 20 sets of autoregressive time series and white noise on TC-IPE-CP and RC-PE casualty plane. (a) H q - C q curves of TC-IPE-CP; (b) H - C curves of RC-PE-CP; (c) q- H q curves of TC-IPE-CP; and (d) q- C q curves of TC-IPE-CP.
Entropy 26 00521 g002
Figure 3. The analysis results of TC-IPE-CP and RC-PE-CP under different signal-to-noise ratio conditions for the Lorenz time series: (a) H q - C q curves of TC-IPE-CP; (b) H - C curves of RC-PE-CP; (c) q- H q curves of TC-IPE-CP; and (d) q- C q curves of TC-IPE-CP.
Figure 3. The analysis results of TC-IPE-CP and RC-PE-CP under different signal-to-noise ratio conditions for the Lorenz time series: (a) H q - C q curves of TC-IPE-CP; (b) H - C curves of RC-PE-CP; (c) q- H q curves of TC-IPE-CP; and (d) q- C q curves of TC-IPE-CP.
Entropy 26 00521 g003
Figure 4. Boxplots of distinct entropy approaches computed from the RR intervals of healthy young and healthy elderly participants. (a) PE analysis result; (b) DE analysis result; (c) q H analysis result; and (d) q C analysis result. The symbol + in this figure represents outlier value.
Figure 4. Boxplots of distinct entropy approaches computed from the RR intervals of healthy young and healthy elderly participants. (a) PE analysis result; (b) DE analysis result; (c) q H analysis result; and (d) q C analysis result. The symbol + in this figure represents outlier value.
Entropy 26 00521 g004
Figure 5. Multiscale entropy analysis results of four types of bearing fault signals. (a) PE analysis result; (b) DE analysis result; (c) q H analysis result; (d) q C analysis result; (e) H q - C q curves of TC-IPE-CP with scale = 1; and (f) H q - C q curves of TC-IPE-CP with scale = 5. (Note: In this paper, q H refers to the value of q when the entropy is at its maximum, and q C refers to the value of q when the complexity is at its maximum. These will not be annotated further in subsequent sections).
Figure 5. Multiscale entropy analysis results of four types of bearing fault signals. (a) PE analysis result; (b) DE analysis result; (c) q H analysis result; (d) q C analysis result; (e) H q - C q curves of TC-IPE-CP with scale = 1; and (f) H q - C q curves of TC-IPE-CP with scale = 5. (Note: In this paper, q H refers to the value of q when the entropy is at its maximum, and q C refers to the value of q when the complexity is at its maximum. These will not be annotated further in subsequent sections).
Entropy 26 00521 g005
Figure 6. Multiscale entropy analysis results of four types of ship-radiated noise. (a) q H analysis result; (b) q C analysis result; (c) PE analysis result; (d) DE analysis result; (e) H q - C q curves of TC-IPE-CP with scale = 1; and (f) H q - C q curves of TC-IPE-CP with scale = 5.
Figure 6. Multiscale entropy analysis results of four types of ship-radiated noise. (a) q H analysis result; (b) q C analysis result; (c) PE analysis result; (d) DE analysis result; (e) H q - C q curves of TC-IPE-CP with scale = 1; and (f) H q - C q curves of TC-IPE-CP with scale = 5.
Entropy 26 00521 g006
Table 1. The correlation coefficients for generating AR processes.
Table 1. The correlation coefficients for generating AR processes.
α1α2α3α4α5α6α7α8
AR11/2-------
AR21/21/4------
AR31/21/41/8-----
AR41/21/41/81/16----
AR51/21/41/81/161/32---
AR61/21/41/81/161/321/64--
AR71/21/41/81/161/321/641/128-
AR81/21/41/81/161/321/641/1281/256
Table 2. Description of four types of ship-radiated noise.
Table 2. Description of four types of ship-radiated noise.
CategoriesShip NameNumber of Segments
PassengerMar de Cangas240
Mar de Onza130
Pirata de Salvora68
Ocean linerMSC Opera163
Adventure of the Seas95
Costa Voyager68
MotorboatSmall yacht114
Motorboat 2123
High speed motorboat92
Zodiac99
Ocean noiseNatural ambient noise sample 185
Natural ambient noise sample 299
Natural ambient noise sample 398
Natural ambient noise sample 493
Table 3. PNN classification results for four types of ships using MTC-IPE-CP features.
Table 3. PNN classification results for four types of ships using MTC-IPE-CP features.
CategoriesShip NameClassification Accuracy
PassengerOcean LinerMotorboatOcean Noise
Passenger14550096.67%
Ocean liner015000100%
Motorboat312133288.67%
Ocean noise101213791.33%
In total----94.17%
Table 4. PNN classification results for four types of ships using multiscale PE features.
Table 4. PNN classification results for four types of ships using multiscale PE features.
CategoriesShip NameClassification Accuracy
PassengerOcean LinerMotorboatOcean Noise
Passenger131001987.33%
Ocean liner015000100%
Motorboat101913012.67%
Ocean noise000150100%
In total----75.00%
Table 5. PNN classification results for four types of ships using multiscale DispEn features.
Table 5. PNN classification results for four types of ships using multiscale DispEn features.
CategoriesShip NameClassification Accuracy
PassengerOcean LinerMotorboatOcean Noise
Passenger14190094.00%
Ocean liner015000100%
Motorboat34143095.33%
Ocean noise00638758.00%
In total----86.83%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, Z.; Wu, C.; Wang, J.; Qiu, H. Tsallis Entropy-Based Complexity-IPE Casualty Plane: A Novel Method for Complex Time Series Analysis. Entropy 2024, 26, 521. https://doi.org/10.3390/e26060521

AMA Style

Chen Z, Wu C, Wang J, Qiu H. Tsallis Entropy-Based Complexity-IPE Casualty Plane: A Novel Method for Complex Time Series Analysis. Entropy. 2024; 26(6):521. https://doi.org/10.3390/e26060521

Chicago/Turabian Style

Chen, Zhe, Changling Wu, Junyi Wang, and Hongbing Qiu. 2024. "Tsallis Entropy-Based Complexity-IPE Casualty Plane: A Novel Method for Complex Time Series Analysis" Entropy 26, no. 6: 521. https://doi.org/10.3390/e26060521

APA Style

Chen, Z., Wu, C., Wang, J., & Qiu, H. (2024). Tsallis Entropy-Based Complexity-IPE Casualty Plane: A Novel Method for Complex Time Series Analysis. Entropy, 26(6), 521. https://doi.org/10.3390/e26060521

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop