Next Article in Journal
A Novel Image Encryption Algorithm Based on Improved Arnold Transform and Chaotic Pulse-Coupled Neural Network
Previous Article in Journal
Temperature Dependence of Structural Relaxation in Glass-Forming Liquids and Polymers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamics of Information Flow between the Chinese A-Share Market and the U.S. Stock Market: From the 2008 Crisis to the COVID-19 Pandemic Period

1
School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou 310018, China
2
Collaborative Innovation Center of Statistical Data Engineering, Technology & Application, Zhejiang Gongshang University, Hangzhou 310018, China
*
Author to whom correspondence should be addressed.
Entropy 2022, 24(8), 1102; https://doi.org/10.3390/e24081102
Submission received: 13 July 2022 / Revised: 8 August 2022 / Accepted: 9 August 2022 / Published: 10 August 2022

Abstract

:
The relationship between the Chinese market and the US market is widely concerned by researchers and investors. This paper uses transfer entropy and local random permutation ( L R P ) surrogates to detect the information flow dynamics between two markets. We provide a detailed analysis of the relationship between the two markets using long-term daily and weekly data. Calculations show that there is an asymmetric information flow between the two markets, in which the US market significantly affects the Chinese market. Dynamic analysis based on weekly data shows that the information flow evolves, and includes three significant periods between 2004 and 2021. We also used daily data to analyze the dynamics of information flow in detail over the three periods and found that changes in the intensity of information flow were accompanied by major events affecting the market, such as the 2008 financial crisis and the COVID-19 pandemic period. In particular, we analyzed the impact of the S&P500 index on different industry indices in the Chinese market and found that the dynamics of information flow exhibit multiple patterns. This study reveals the complex information flow between two markets from the perspective of nonlinear dynamics, thereby helping to analyze the impact of major events and providing quantitative analysis tools for investment practice.

1. Introduction

Both China and the United States are large economies with high influence, so analyzing the relationship between the two stock markets is of great significance for investment practice and risk management. Currently, there are many studies analyzing the relationship between the two markets based on different datasets, in which multiple empirical tools are used. For example, Goh et al. found that some economic variables in the United States can improve the forecast performance of the Chinese stock market [1]. Jian Chen et al. found that US economic variables can help predict the monthly volatility of the Chinese market [2]. In addition, previous studies have also shown that US economic policy uncertainty can significantly explain the returns of China’s A-share market [3]. Researchers not only pay attention to how economic variables and policies in the US market affect the Chinese market but also directly discuss the relationship between the returns of the two markets. These studies focus on the impact of special events on the relationship between markets, such as the 2007–2009 crisis [4,5,6,7,8,9,10], the Eurozone crisis [11], and the COVID-19 pandemic [12,13,14]. Empirical analysis shows that the financial crisis has had a significant impact on the BRICS and emerging markets, such as leading to significant risk spillover effects and risk contagion [5,6,7]. In particular, the Chinese market was significantly affected by the US market during the crisis [4,5,6,8,10,11]. These studies used many types of quantitative tools, such as multivariate GARCH models [4,6,10], Granger causality test [5], latent factor models [9], correlation contagion tests [8], etc. In particular, recently, the COVID-19 pandemic has significantly impacted the Chinese market and the US market [12]. For example, research reveals that China’s stock market and the US stock market have significant spillover effects during the pandemic [13]. In addition, the pandemic has also impacted the correlation structure between market indexes [14].
Existing studies have revealed that there are extensive links between markets, and the Chinese market is significantly affected by the US market, especially under the impact of major events. However, previous studies have not fully analyzed the long-term relationship between the two markets. For example, most studies only analyze the static relationship in a specified period but do not focus on dynamics. This study analyzes the long-term relationship between the two markets from the perspective of nonlinear information flow dynamics. We use transfer entropy ( T E ) to quantitatively characterize nonlinear relationships [15], and analyze the dynamics in detail by surrogate time series [16]. Surrogates is an effective technique for analyzing nonlinear time series, which generates simulated time series by keeping some characteristics of the original time series [17,18,19].
At present, transfer entropy has expanded into a “toolbox” that includes multiple tools. For example, Marschinski et al. introduced effective transfer entropy by shuffling the source time series, where the shuffling process is used to destroy the correlation [20]. Further, the group transfer entropy and the effective group transfer entropy can be constructed by considering multiple time series [21]. Jizba et al. constructed the Rényian transfer entropies by extending Shannon entropy to Rényi entropy [22]. Papana et al. extended the transfer entropy method to analyze non-stationary time series [23]. Recently, Nie proposed a method to detect local information flow [16]. This method obtains the baseline T E value from local surrogates so that it is possible to observe the contribution of observations within a small computational window to the original T E value. An advantage of this method is that changes in the intensity of the information flow can be clearly observed, so that we can identify the impact of events on the information flow [16].
Since transfer entropy is a non-parametric method that effectively extracts the causal relationship between variables, it is suitable for analyzing high-complexity financial time series. For example, methods based on transfer entropy have been widely used to analyze financial time series, such as the relationship between stock markets [24,25,26,27,28], the price—volume relationship [16,29,30,31], and the foreign exchange market [32]. In particular, early empirical research shows that the US market is a core source of information for global financial markets [28]. Kim et al. considered 10 important indexes and found that the structure of the information transfer network was affected by the crisis [33].
This study focuses on analyzing the evolutionary characteristics of the information flow between China’s A-share market and the US market. We use nonlinear analysis methods to characterize the complex information flow dynamics between the two markets and, in particular, identify periods of high information flow intensity by T E and L R P . This study reveals the details of information transfer between markets based on long-term data and from multiple time scales. In the following, we first describe the data set and analyze basic descriptive statistics. Secondly, we review the concept and calculation method of transfer entropy, and the steps of L R P -based analysis. Third, we use weekly data and daily data to analyze the information flow between the two markets. Finally, we analyze the information flow between the S&P500 index and the industry indices of the Chinese A-share market.

2. Data

This paper uses three datasets, which include closing indices for two market composite indices and some industry indices. The composite index data includes daily and weekly closing data of S&P500 and CSI300 indices, as well as weekly closing data of some industry indices in the Chinese A-share market. All data are extracted from the iFinD database of Tonghuashun company. For the original closing index time series P = { P ( t ) } , we calculate the series R = { R ( t ) } , where R ( t ) = l o g ( P ( t + 1 ) ) l o g ( P ( t ) ) .
For weekly data, we use R u s and R c n to represent the return series of the S&P500 index and the CSI300 Index, respectively. We considered the data from 31 December 2004 to 17 June 2022, a total of 890 observations, so that each preprocessed series includes 889 observations. Table 1 lists some descriptive statistics of the two series. In addition, the Pearson correlation coefficient of the two-time series is 0.2118, which implies a weak correlation.
In addition to the weekly data, we also analyzed the daily data of the two indexes. To clearly analyze the impact of major events, we divide the daily data into three periods. For the preprocessed index data, we use R c n 1 and R u s 1 to represent the data of the first period and use similar symbols for other periods. The time intervals corresponding to the three data sets are 4 January 2007–31 December 2010, 2 January 2014–29 December 2017, and 2 January 2018–31 December 2021 respectively. Since the trading days of the two markets are not synchronized, we remove the missing values in the two series, leaving observations with the same time stamp. The time series of the three periods without missing values include 944, 948, and 942 observations, respectively. Table 1 lists the basic descriptive statistics of each return series. Here, we always use the rounding method to keep four decimal places.
We can find from Table 1 that all the averages are close to zero. In three of the four pairs of time series, the standard deviation of the series in the Chinese market is greater than that in the US market. In addition, all skewness values are less than zero, and in particular, all excess kurtosis values are greater than zero. Here, we report the excess kurtosis obtained by subtracting 3 from the original kurtosis value. The kurtosis values imply that all distributions have peak characteristics and are not normal.
In addition to the market composite index, we used 31 industry indices compiled by SWS Research Co., Ltd. Here, we used three of them for dynamic analysis. Table 1 lists the descriptive statistics of the return series for the three indices. The symbols R i n d 1 , R i n d 2 and R i n d 3 correspond to the industry index codes, 801040.SL (Steel industry), 801180.SL (Real estate) and 801960.SL (Petrochemical industry), respectively. It can be found that the average of R i n d 2 is significantly larger than the other two indices, while the difference between the standard deviations is small. In particular, all skewness values are less than 0, and kurtosis values are greater than 0.

3. Method

3.1. Transfer Entropy

We consider two random variables I and J, the marginal probability distribution is p I ( · ) is p J ( · ) , and the joint distribution is p I , J ( · ) . We assume that the underlying dynamic structure conforms to a stationary Markov process, where I and J correspond to orders k and l, respectively. For I, the conditional distribution of state at time t + 1 ( i t + 1 ) is independent of state in t k ( i t k ), that is, p I ( i t + 1 | i t , , i n k + 1 ) = p I ( i t + 1 | i t , , i n k ) . Similarly, for J, p J ( j t + 1 | j t , , j n k + 1 ) = p J ( j t + 1 | j t , , j n k ) , where j t represents the state at time t. The transfer entropy of J I is defined as Equation (1) [15], where the symbols i t k = ( i t , , i n k + 1 ) and j t k = ( j t , , j n k + 1 ) . In this article, we set l = k = 1 .
T E J I = p ( i t + 1 , i t k , j t l ) l o g p ( i t + 1 | i t k , j t l ) p ( i t + 1 | i t k )

3.2. The Method Used to Estimate the Transfer Entropy

For a time series, we can encode it into a time series including states and then estimate the transfer entropy. Here, we use a method proposed in previous studies to encode the time series [25,26,34], which encodes the original time series through a group of quantiles. One of the advantages of this method is that a large weight can be assigned to the tail of the distribution, thereby focusing on the influence of extreme values in the financial market [25,26]. In addition, we use the R package RTransferEntropy developed by Behrendt et al. [34].
If the time series s = { s ( t ) } is coded into n states, n 1 quantiles Q = { q 1 , q 2 , , q n 1 } are required, as shown in Equation (2). For example, if the observation s ( t ) is between the quantiles q 2 and q 3 of the distribution, then s ( t ) = 2 . The converted time series s = { s ( t ) } includes n states, and can be used for conditional distribution and joint distribution.
s ( t ) = 1 s ( t ) q 1 2 s ( t ) ( q 2 , q 3 ) n 1 s ( t ) ( q n 2 , q n 1 ) n s ( t ) q n 1
In order to comprehensively analyze the significance of information flow at different quantile parameters, we set the following five sets of parameters: Q 1 = { q ( 0.05 ) , q ( 0.5 ) , q ( 0.95 ) } , Q 2 = { q ( 0.10 ) , q ( 0.5 ) , q ( 0.90 ) } , Q 3 = { q ( 0.15 ) , q ( 0.5 ) , q ( 0.85 ) } , Q 4 = { q ( 0.20 ) , q ( 0.5 ) , q ( 0.80 ) } and Q 5 = { q ( 0.25 ) , q ( 0.5 ) , q ( 0.75 ) } , where the symbol q ( α ) represents α -quantile. From the first to the fifth set of parameters, the weight of the extreme values of the tail decreases. In the first group, extreme values are given the greatest weight, and the distribution in the fifth group is equally divided into four parts.
Here, we use the method proposed by Dimpfl et al. to test the significance of T E [25,34]. The method first estimates the dynamics between two variables through the Markov process. Secondly, based on the Markov process, a pair of simulated time series whose dependence has been eliminated can be generated, and the T E value can be estimated. Third, we repeatedly generate the simulated time series N b o o t times to obtain a T E value distribution for comparison. Finally, the p-value is generated by comparing the original T E value with the quantile of the baseline distribution. In this article, we always set N b o o t = 1000 .

3.3. L R P -Based Analysis

The local random permutation ( L R P ) method uses R P surrogates locally to estimate the intensity of the information flow within a period [16]. Previous studies have confirmed that this method can effectively describe changes in information flow through toy models [16]. In particular, to maintain the correlation between series, L R P provides surrogates that keep the Pearson correlation unchanged.
We consider the information flow s 1 s 2 between the series s 1 = { s 1 ( t ) } and s 2 = { s 2 ( t ) } , where each series includes N observations. We assume that the length of the calculation window for constructing local R P surrogates is L w , and the T E s 1 s 2 value is significant.
1
We extract the observations in the time interval [ 1 , L w ] to obtain the sub-time series s 1 [ 1 , L w ] = { s 1 ( 1 ) , , s 1 ( L w ) } and s 2 [ 1 , L w ] = { s 2 ( 1 ) , , s 2 ( L w ) } , and shuffle the observations of the sub-sequences s 1 [ 1 , L w ] and s 2 [ 1 , L w ] . In the scrambling step, the correspondence between the timestamps of the two series does not change, that is, the series { ( s 1 ( k ) , s 2 ( k ) ) } composed of pairs is scrambled.
2
We calculate the transfer entropy value between the time series obtained by local scrambling in step 1, and denoted it as T E s 1 s 2 [ 1 , L w ] ( 1 ) .
3
We repeat steps 1 and 2 a total of M times to obtain a distribution of T E values used as a benchmark ( T E b e n 1 = { T E s 1 s 2 [ 1 , L w ] ( k ) | k = 1 , 2 , , M } ).
4
We calculate the Z-score Z [ 1 , L w ] = T E m [ 1 , L w ] s [ 1 , L w ] .
5
We move the calculation window one observation each time, and repeat steps 1–4 to get the distribution T E b e n i = { T E s 1 s 2 [ i , L w ] ( k ) | k = 1 , 2 , , M } and the Z-score. For example, the second Z-value is obtained by scrambling all observations in the interval [ 2 , L w + 1 ] . Finally, we get the Z-score sequence { Z [ i , i + L w 1 ] | i = 1 , 2 , , N L w + 1 } .
The estimation of the T E value requires setting the parameter Q k , so the parameter may affect the calculation result. Here, we use the following method to synthesize the calculation results of different parameters. For a sequence of Z-scores ( { z k ( t ) } ) generated by a parameter Q k , we construct a 0–1 sequence using Equation (3), where Z t h is the threshold. We set the threshold Z t h = 1.645 here. In this way, we obtain a series I k = { I k ( t ) | t = 1 , 2 , , N L w + 1 } based on Z-scores, in which observations only take values in { 0 , 1 } . If T E b e n i is a normal distribution, then the original T E value is greater than most of the values (95%) in T E b e n i , which implies that there is a significant information flow in the period [ i , i + L w 1 ] . In this article, we always set M = 1000 . Then, we calculate the series I = { I ( t ) } ( I ( t ) = k I k ( t ) ).
I k ( t ) = 1 , z k ( t ) z t h 0 , z k ( t ) < z t h
Since each Z-score corresponds to an interval, in this article, we use the following convention to plot the figure: the right end of the interval corresponding to the value is used as the time stamp in the figure. For example, for the value Z [ 1 , L w ] , the time label in the figure is the timestamp corresponding to L w . Similarly, we plot the time series { I ( t ) } .

4. Results

4.1. Information Flow Analysis Based on Weekly Data

4.1.1. T E Values of Different Parameters

We use a group of parameters Q k ( k = 1 , , 5 ) to calculate the information flow between R u s and R c n . Table 2 shows the calculated results, all T E u s c n values are significant at the significance level α = 0.05 , and three of the p-values are less than 0.01. In addition, all T E c n u s values are not significant. Table 2 suggests that there is an asymmetric information flow between the Chinese market and the US market, and the US market is a dominant information source.
The minimum p-value of 0.005 corresponds to the parameter Q 2 , which means that significant information flow can be detected when a large weight is assigned to extreme values. In addition, the p-value corresponding to the equally divided quantile interval is also less than 0.05. This means that significant u s c n information flow can be detected regardless of whether a greater weight is given to the extreme values.
In summary, there is sufficient evidence to support the existence of information flow u s c n . The change of the p-value with the change of the parameter implies that the analysis of the information flow depends on the choice of the parameter, and thus the calculation results of multiple parameters need to be integrated.

4.1.2. Information Flow Dynamics for Weekly Data

In the previous section, we analyzed the information flow between the two markets globally. However, the static analysis only shows the existence of information flow in the considered period, thus lacking local dynamic analysis. Below, we use L R P to analyze the dynamics of information flow, in particular, to detect periods with localized strong information flow.
We set five sets of parameters Q k ( k = 1 , , 5 ) and L w = 96 , where the calculation window corresponds to two years of trading weeks. The five subfigures of Figure 1 show the Z-score time series. We observe that all subfigures include local peaks larger than 1.645, implying non-trivial information flow dynamics between series. These peaks correspond to periods of localized strong information flow. For example, for Figure 1a, it can be seen that the Z-score increased rapidly in 2008 and dropped significantly thereafter. Furthermore, we can also observe that the number of peaks in all Z-score series is greater than 1. For example, three localized peaks can be observed in Figure 1b, implying the existence of three periods of strong information flow.
The numbers of local peaks for the five Z-score series are 2, 3, 2, 3, and 3, respectively. Table 3 lists the periods corresponding to these peaks. We use T 1 , T 2 , and T 3 to represent the time intervals in three periods, where each period corresponds to two endpoints. For example, the starting time corresponding to T 1 ( Q 1 ) is 8 June 2007, and the closing time is 24 April 2009. The first line indicates that there is a significant flow of information in the period 8 June 2007–24 April 2009. We list the Z-score at the corresponding time interval in the fourth column. The maximum value corresponds to the T 1 period, while the minimum value corresponds to the T 2 period.
We find that most of the local peaks correspond to periods that are close to each other, such as the T 1 period of parameter Q 1 and the T 1 period of Q 2 are 8 June 2007–24 April 2009 and 2 March 2007–19 January 2009 respectively. Table 3 shows that the 2008 crisis significantly affected the information flow of u s c n . In addition, the periods corresponding to several local peaks also include major events. For example, the third peak of Q 5 includes the oil crisis and the collapse of the US stock market. In summary, the L R P -based method identifies some local peaks, indicating that the dynamics of information flow are non-trivial.

4.1.3. Comprehensive Analysis of Z-Score Series

We roughly identified three important periods by Z-scores in the previous subsection. Below, we calculate the I series from the I k series of five Z-score series. The series I takes the value in the set { 0 , 1 , 2 , 3 , 4 , 5 } . If I k ( t ) = 0 , the observations in the time period [ t L w + 1 , t ] do not significantly affect the analysis results of the information flow. On the contrary, if I ( t ) = 5 , it implies that the information flow within the period [ t L w + 1 , t ] is significant and does not depend on the choice of parameters. A large I ( t ) value implies that there is information flow in multiple quantile intervals { Q k | k = 1 , , 5 . } .
Figure 2 shows the I series. There are three periods where most observations are greater than or equal to 1. In particular, the analysis shows that there is a period where most observations are greater than or equal to 3, that is, 8 February 2008–24 September 2010. During the 2008–2009 crisis in the US market, there was strong information flow u s c n . We use N i to represent the number of observations equal to i in the I series, as shown in Table 4. It can be found that if the calculation results of 5 parameters are integrated, nearly half of the observations are greater than or equal to 1. This implies that information flows widely exist between the two markets during the period under consideration. Table 5 lists the three periods identified by Figure 2. Each period includes some major events that affect the market. The first period corresponds to the 2007–2009 crisis in the US market. The second period includes the crisis in the Chinese stock market in 2015. In addition, before the stock market crash in 2015, the bubble was generated in the second half of 2014. The second period not only includes the breakage period of the bubble, but also the generation period. The third period includes a series of major events, such as the circuit breaker in the US market in March 2020, and the COVID-19 pandemic. Based on this table, we divide three periods for the analysis of daily data in the next section: 2007–2010, 2014–2017, and 2008–2021.

4.2. Information Flow Analysis of Daily Data

4.2.1. Global Analysis of Daily Data

In this section, we analyze the relationship between the daily data of the two indices. Table 6 lists the transfer entropy values T E u s c n of different parameters. For the first and third periods, all T E values are significant at significance level α = 0.05 . For the second period, the T E values of parameters Q 1 and Q 2 were not significant. However, the p-value for the T E value of Q 4 was less than 0.01. This implies that extreme values have little effect on information flow in the second period. In summary, we still found significant information flows in the three periods of daily data. Below, we use L R P -based analysis to discuss the dynamics of information flow in each period.
In the next subsection, for the first period and the second period, we choose the parameter Q corresponding to the maximum T E value in Table 6. In the analysis of the first and third periods, the parameter Q was set to Q 4 and Q 2 , respectively. In addition, the second period is a special period, in which both Q 1 and Q 2 correspond to insignificant T E values, suggesting that extreme values in this period weakly affect the level of information flow. For the second period, we set Q 3 and Q 4 for calculation.

4.2.2. Information Flow Dynamics for 2007–2010 Data

Figure 3 shows the Z-score series for the 2007–2010 data. If the threshold is 1.645, the series includes three significant periods: 26 November 2007 (28 June 2007)–25 June 2009, 28 October 2009 (1 June 2009)–8 January 2010, and 24 May 2010 (21 December 2009)–19 October 2010. Here, the date in parentheses is the left endpoint of the calculation window for the first Z-score greater than 1.645 within the period. In particular, the maximum value (3.9880) is in the first period and the calculation window is 3 November 2008–7 April 2009.
The calculation results clearly show that the 2007–2009 crisis in the US market significantly affected the Chinese market. In addition, we also observe that the u s c n information flow also exists in the post-crisis period. These results imply not one peak with strong information flow, but multiple peaks with different intensities.

4.2.3. Information Flow Dynamics for 2014–2017 Data

Figure 4 shows the information flow dynamics for the 2014–2017 data. Figure 4a includes a major significant period: 6 January 2016 (6 August 2015)–10 May 2016. The results show that the information flow in the u s c n direction is enhanced after the Chinese market bubble burst. Figure 4b shows the calculation results corresponding to Q 4 . We found that it also includes a major significant period: 30 December 2016 (1 August 2016)–1 March 2017. The period included an important event that affected US markets: the 2016 presidential election. A previous study showed that the event significantly affected correlation dynamics in the US stock market [35]. The analysis here suggests that there was a significant information flow from the US stock market to the Chinese stock market around this event. The two results in Figure 4 show that from 2 January 2014 to 29 December 2017, the US market significantly influenced the Chinese market in two periods, and both periods included major events.

4.2.4. Information Flow Dynamics for 2018–2021 Data

Figure 5 shows the calculation results for the 2018–2021 data. The period includes several sub-periods greater than the critical Z-score, suggesting that from 2 January 2018 to 31 December 2021, the flow of information between the two markets was active and strong. These sub-periods are as follows: 7 June 2018 (3 January 2018)–31 July 2018 ( T 1 ), 22 February 2019 (12 September 2018)–15 May 2019 ( T 2 ), 19 June 2019 (11 January 2019)–30 July 2019 ( T 3 ), 27 August 2019 (29 March 2019)–5 February 2020 ( T 4 ), 27 February 2020 (19 September 2019)–29 July 2020 ( T 5 ), 3 March 2021 (24 September 2020)–2 August 2021 ( T 6 ). We label the different subperiods with T i . It can be found that T 2 , T 3 , T 4 and T 5 can be combined into an important period. There were several major events in this period: Sino-US trade friction, the oil crisis, the COVID-19 pandemic, the US stock market crisis in March 2020. In addition, periods T 1 and T 6 still belong to the period of Sino-US trade friction and the period of the COVID-19 pandemic, respectively. In particular, the calculation window corresponding to the maximum value (3.4339) is 21 October 2019–23 March 2020. Several major events are included in this period, such as the oil crisis, the COVID-19 pandemic, and the March 2020 market crisis. To sum up, it is reasonable to speculate that the impact of multiple major events led to the active information flow during the period.

4.3. Information Flow between Industry Indices in the Chinese Market and the S&P500 Index

4.3.1. Global Information Flow Analysis between Industry and Market Index

Since the composite index includes stocks belonging to various industries, in this section, we analyze the relationship between the S&P500 index and the industry index in the Chinese market. Here, we consider the case where a larger weight is given to the tail of the distribution, i.e., Q = Q 1 . Table 7 lists the T E values and p-values, where the symbol “ind” represents the industry index, and the code is listed in the second column.
The fourth and sixth columns show that there are several industry indices that are closely related to the S&P500 index. For example, there is a significant bidirectional flow of information between the petrochemical industry and the S&P500 index. In addition, some unidirectional effects can also be found, such as in real estate. It should be noted that not all industry indices are affected by the S&P500 index, such as the public utilities index and the architectural ornament index. To sum up, Table 7 shows that the information flow from S&P500 to Chinese market industry indices is heterogeneous.

4.3.2. The Dynamics of Information Flow between Industry Index and S&P500 Index

In the previous section, we found diversity in the relationship between the S&P500 index and industry indices, with some industry indices having a stronger flow of information between them and the S&P500 index. A related issue is whether the dynamics of information flow for different industry indices have similar patterns. Here, we select three industry indices and compare information flow dynamics. The three indices are 801040.SL (Steel industry), 801180.SL (Real estate) and 801960.SL (Petrochemical industry), and all have significant u s i n d information flow.
We analyze dynamics using weekly data and always set Q = Q 1 and L w = 96 weeks. Figure 6 shows the dynamics of information flow for u s 801040 . S L . The Z-score series includes 794 values, and 396 values are greater than the critical threshold, implying that there is a continuous flow of information u s 801040 . S L during the period. In addition, it includes several local peaks, and the corresponding periods of the two main peaks are 18 April 2008 (2 June 2006)–15 April 2011 and 24 October 2014 (21 December 2012)–30 June 2017, respectively.
Figure 7 presents the Z-score series for the S&P500 Index → Real estate Index. For this Z-score series, there are only 124 values greater than the critical threshold. It shows a different pattern from Figure 6, such as observations in the period 4 September 2020 (26 October 2018)–16 July 2021 contribute significantly to the information flow. In addition, another major peak corresponds to the period 25 December 2009 (15 February 2008)–24 June 2011.
The information flow analysis result of u s 801960 . S L is shown in Figure 8. We find that it included a period with most values greater than the critical threshold: 10 October 2014–9 June 2017. This period is close to that of the second peak in Figure 6. However, for the series shown in Figure 8, it includes some values less than the critical threshold.
The results in Figure 6, Figure 7 and Figure 8 show that the information flow dynamics of u s i n d exhibit multiple patterns, thus indicating a high complexity of information flow between markets.

5. Discussion

This study examines periods of strong information flow between two important markets, where these periods are accompanied by major events. This implies that long-term major events may affect market relations more deeply. This provides a forward-looking analysis for analyzing the relationship between the two markets. In particular, we can analyze the dynamics of information flow in detail to identify changes in the relationship between markets when markets are impacted by major events. A related topic that can be studied is whether non-linear information flow helps predict future returns in the market.
Several previous studies have revealed that major events affect the relationship between markets [4,5,6,7,8,9,10,11,12,13,14]. Unlike previous studies, we analyze long-term data and focus on the dynamics of information flow. In particular, L R P -based analysis characterizes the strength of the local information flow, so that the time-varying characteristics of the information flow in a period can be observed in detail.

6. Conclusions

We discussed the long-term relationship between the Chinese market and the US market and found that there is an asymmetric information flow, where the US market is the source of information flow. Calculations show that the information flow between the Chinese market and the US market is not stable, but there are some significant periods. The L R P -based analysis identified three periods of significant information flow. All three periods include some major events affecting the market, such as the 2008 crisis, the oil crisis, and the COVID-19 pandemic, thus suggesting that the dynamics of information flow may be impacted by major events. In addition, we conducted a detailed dynamic analysis with daily data. Calculations show that each period includes nontrivial dynamics, some of which appear as local peaks, suggesting dramatic changes in information flow.
We also analyzed the information flow between the S&P500 index and some industry indices in the Chinese A-share market. Calculations show that the S&P500 index does not always significantly affect each industry index. Some industries are significantly affected, such as the steel industry and real estate. However, analysis based on weekly data shows that the dynamics of information flow between the industry indices and the S&P500 index exhibits multiple patterns. In addition, there are also some significant information flows from the Chinese market to the US market, thus demonstrating the heterogeneity of information flow.
This study demonstrates the high complexity of nonlinear causal relationships between two important markets and facilitates quantitative studies of the impact of major events. In particular, empirical analysis suggests that the heterogeneity of information flow in industry indices needs to be taken into account when discussing risk spillovers between markets. In this article, we only consider the information flow between the two markets. A related and important topic is the study of how major events affect information flow networks that include multiple markets. Since L R P -based methods involve two variables, for the information flow network, we need to construct a method that includes multiple variables. An alternative approach is to characterize changes in network structure by shuffling observations. In addition, the relationship between industry indices within a market can be analyzed in detail through L R P -based methods and high-frequency data. For example, for a trading day, we can analyze how important announcements impact the information flow network constructed by multiple industry indices.

Author Contributions

Methodology, C.-X.N.; formal analysis, C.-X.N. and J.X.; investigation, C.-X.N. and J.X.; writing—original draft preparation, C.-X.N.; writing—review and editing, C.-X.N.; visualization, C.-X.N. and J.X.; funding acquisition, C.-X.N. All authors have read and agreed to the published version of the manuscript.

Funding

The work is supported by the Characteristic & preponderant discipline of key construction universities in Zhejiang province (Zhejiang Gongshang University—Statistics).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Goh, J.C.; Jiang, F.; Tu, J.; Wang, Y. Can US economic variables predict the Chinese stock market? Pac.-Basin Financ. J. 2013, 22, 69–87. [Google Scholar] [CrossRef]
  2. Chen, J.; Jiang, F.; Li, H.; Xu, W. Chinese stock market volatility and the role of US economic variables. Pac.-Basin Financ. J. 2016, 39, 70–83. [Google Scholar] [CrossRef]
  3. Hu, Z.; Kutan, A.M.; Sun, P.W. Is US economic policy uncertainty priced in China’s A—shares market? Evidence from market, industry, and individual stocks. Int. Rev. Financ. Anal. 2018, 57, 207–220. [Google Scholar] [CrossRef]
  4. Kenourgios, D.; Samitas, A.; Paltalidis, N. Financial crises and stock market contagion in a multivariate time-varying asymmetric framework. J. Int. Financ. Mark. Inst. Money 2011, 21, 92–106. [Google Scholar] [CrossRef]
  5. Cheung, W.; Fung, S.; Tsai, S.C. Global capital market interdependence and spillover effect of credit risk: Evidence from the 2007–2009 global financial crisis. Appl. Financ. Econ. 2010, 20, 85–103. [Google Scholar] [CrossRef]
  6. Hwang, E.; Min, H.G.; Kim, B.H.; Kim, H. Determinants of stock market comovements among US and emerging economies during the US financial crisis. Econ. Model. 2013, 35, 338–348. [Google Scholar] [CrossRef]
  7. Samarakoon, L.P. Stock market interdependence, contagion, and the US financial crisis: The case of emerging and frontier markets. J. Int. Financ. Mark. Inst. Money 2011, 21, 724–742. [Google Scholar] [CrossRef]
  8. Wang, G.J.; Xie, C.; Lin, M.; Stanley, H.E. Stock market contagion during the global financial crisis: A multiscale approach. Financ. Res. Lett. 2017, 22, 163–168. [Google Scholar] [CrossRef]
  9. Dungey, M.; Gajurel, D. Equity market contagion during the global financial crisis: Evidence from the world’s eight largest economies. Econ. Syst. 2014, 38, 161–177. [Google Scholar] [CrossRef]
  10. Jin, X.; An, X. Global financial crisis and emerging stock market contagion: A volatility impulse response function approach. Res. Int. Bus. Financ. 2016, 36, 179–195. [Google Scholar] [CrossRef]
  11. Ahmad, W.; Sehgal, S.; Bhanumurthy, N. Eurozone crisis and BRIICKS stock markets: Contagion or market interdependence? Econ. Model. 2013, 33, 209–225. [Google Scholar]
  12. Sansa, N.A. The Impact of the COVID-19 on the Financial Markets: Evidence from China and USA. Electron. Res. J. Soc. Sci. Humanit. 2020, 2. [Google Scholar]
  13. Nguyen, D.T.; Phan, D.H.B.; Ming, T.C. An assessment of how COVID-19 changed the global equity market. Econ. Anal. Policy 2021, 69, 480–491. [Google Scholar]
  14. Zhang, D.; Hu, M.; Ji, Q. Financial markets under the global pandemic of COVID-19. Financ. Res. Lett. 2020, 36, 101528. [Google Scholar] [CrossRef] [PubMed]
  15. Schreiber, T. Measuring information transfer. Phys. Rev. Lett. 2000, 85, 461. [Google Scholar]
  16. Nie, C.X. Dynamics of the price–Volume information flow based on surrogate time series. Chaos Interdiscip. J. Nonlinear Sci. 2021, 31, 013106. [Google Scholar] [CrossRef] [PubMed]
  17. Theiler, J.; Eubank, S.; Longtin, A.; Galdrikian, B.; Farmer, J.D. Testing for nonlinearity in time series: The method of surrogate data. Phys. D Nonlinear Phenom. 1992, 58, 77–94. [Google Scholar]
  18. Small, M. Applied Nonlinear Time Series Analysis: Applications in Physics, Physiology and Finance; World Scientific: Singapore, 2005; pp. 115–129. [Google Scholar]
  19. Lancaster, G.; Iatsenko, D.; Pidde, A.; Ticcinelli, V.; Stefanovska, A. Surrogate data for hypothesis testing of physical systems. Phys. Rep. 2018, 748, 1–60. [Google Scholar]
  20. Marschinski, R.; Kantz, H. Analysing the information flow between financial time series. Eur. Phys. J. B-Condens. Matter Complex Syst. 2002, 30, 275–281. [Google Scholar]
  21. Dimpfl, T.; Peter, F.J. Group transfer entropy with an application to cryptocurrencies. Phys. A Stat. Mech. Its Appl. 2019, 516, 543–551. [Google Scholar] [CrossRef]
  22. Jizba, P.; Kleinert, H.; Shefaat, M. Rényi’s information transfer between financial time series. Phys. A Stat. Mech. Its Appl. 2012, 391, 2971–2989. [Google Scholar] [CrossRef]
  23. Papana, A.; Kyrtsou, C.; Kugiumtzis, D.; Diks, C. Detecting Causality in Non-stationary Time Series Using Partial Symbolic Transfer Entropy: Evidence in Financial Data. Comput. Econ. 2016, 47, 341–365. [Google Scholar] [CrossRef]
  24. Kwon, O.; Yang, J.S. Information flow between composite stock index and individual stocks. Phys. A Stat. Mech. Its Appl. 2008, 387, 2851–2856. [Google Scholar] [CrossRef]
  25. Dimpfl, T.; Peter, F.J. The impact of the financial crisis on transatlantic information flows: An intraday analysis. J. Int. Financ. Mark. Inst. Money 2014, 31, 1–13. [Google Scholar] [CrossRef]
  26. Dimpfl, T.; Peter, F.J. Using transfer entropy to measure information flows between financial markets. Stud. Nonlinear Dyn. Econom. 2013, 17, 85–102. [Google Scholar] [CrossRef]
  27. Daugherty, M.S.; Jithendranathan, T. A study of linkages between frontier markets and the U.S. equity markets using multivariate GARCH and transfer entropy. J. Multinatl. Financ. Manag. 2015, 32, 95–115. [Google Scholar] [CrossRef]
  28. Kwon, O.; Yang, J.S. Information flow between stock indices. EPL Europhys. Lett. 2008, 82, 68003. [Google Scholar] [CrossRef]
  29. Ong, M.A. An information theoretic analysis of stock returns, volatility and trading volumes. Appl. Econ. 2015, 47, 3891–3906. [Google Scholar] [CrossRef]
  30. Kyrtsou, C.; Kugiumtzis, D.; Papana, A. Further insights on the relationship between SP500, VIX and volume: A new asymmetric causality test. Eur. J. Financ. 2019, 25, 1402–1419. [Google Scholar] [CrossRef]
  31. Zheng, J.; Nie, C.X. The Dynamics of Price–Volume Information Transfer in the Cryptocurrency Markets. Adv. Complex Syst. 2020, 23, 2050014. [Google Scholar] [CrossRef]
  32. Sensoy, A.; Sobaci, C.; Sensoy, S.; Alali, F. Effective transfer entropy approach to information flow between exchange rates and stock markets. Chaos Solitons Fractals 2014, 68, 180–185. [Google Scholar] [CrossRef]
  33. Kim, Y.; Kim, J.; Yook, S.H. Information transfer network of global market indices. Phys. A Stat. Mech. Its Appl. 2015, 430, 39–45. [Google Scholar] [CrossRef]
  34. Behrendt, S.; Dimpfl, T.; Peter, F.J.; Zimmermann, D.J. RTransferEntropy—Quantifying information flow between different time series using effective transfer entropy. SoftwareX 2019, 10, 100265. [Google Scholar] [CrossRef]
  35. Nie, C.X. A network-based method for detecting critical events of correlation dynamics in financial markets. EPL Europhys. Lett. 2020, 131, 50001. [Google Scholar] [CrossRef]
Figure 1. The figure shows the Z-scores for different parameters Q, where (ae) correspond to Q 1 Q 5 , respectively.
Figure 1. The figure shows the Z-scores for different parameters Q, where (ae) correspond to Q 1 Q 5 , respectively.
Entropy 24 01102 g001
Figure 2. The I time series obtained from five sets of parameters, and takes values in { 0 , 1 , 2 , 3 , 4 , 5 } . It includes three local peaks, and the maximum corresponds to the 2008 crisis.
Figure 2. The I time series obtained from five sets of parameters, and takes values in { 0 , 1 , 2 , 3 , 4 , 5 } . It includes three local peaks, and the maximum corresponds to the 2008 crisis.
Entropy 24 01102 g002
Figure 3. The figure shows the Z-score time series between R u s 1 and R c n 1 , where L w = 100 .
Figure 3. The figure shows the Z-score time series between R u s 1 and R c n 1 , where L w = 100 .
Entropy 24 01102 g003
Figure 4. The figure shows the Z-score time series between R u s 2 and R c n 2 , where (a,b) correspond to parameters Q 3 and Q 4 , respectively.
Figure 4. The figure shows the Z-score time series between R u s 2 and R c n 2 , where (a,b) correspond to parameters Q 3 and Q 4 , respectively.
Entropy 24 01102 g004
Figure 5. The figure shows the Z-score series for R u s 3 and R c n 3 , which includes two main peaks ( Q = Q 2 , L w = 100 ).
Figure 5. The figure shows the Z-score series for R u s 3 and R c n 3 , which includes two main peaks ( Q = Q 2 , L w = 100 ).
Entropy 24 01102 g005
Figure 6. The figure shows the Z-score time series for the industry index 801040.SL (Steel industry) and the S&P500 index, including two clear local peaks.
Figure 6. The figure shows the Z-score time series for the industry index 801040.SL (Steel industry) and the S&P500 index, including two clear local peaks.
Entropy 24 01102 g006
Figure 7. The figure shows the Z-score time series for the industry index 801180.SL (Real estate) and the S&P500 index, with only a few Z-score values above the critical threshold.
Figure 7. The figure shows the Z-score time series for the industry index 801180.SL (Real estate) and the S&P500 index, with only a few Z-score values above the critical threshold.
Entropy 24 01102 g007
Figure 8. The figure shows the Z-score time series of the industry index 801960.SL (Petrochemical industry) and S&P500 index, and most of the Z-score values greater than the critical threshold are in the period 2014–2017.
Figure 8. The figure shows the Z-score time series of the industry index 801960.SL (Petrochemical industry) and S&P500 index, and most of the Z-score values greater than the critical threshold are in the period 2014–2017.
Entropy 24 01102 g008
Table 1. Descriptive statistics of preprocessed time series.
Table 1. Descriptive statistics of preprocessed time series.
SeriesAverageStdSkewnessKurtosis
R u s 0.00120.0261−2.055723.1000
R c n 0.00160.0359−0.29552.2400
R u s 1 1.2752 × 10 4 0.0179−0.48719.0371
R c n 1 4.3938 × 10 4 0.0242−0.51262.0974
R u s 2 3.9919 × 10 4 0.0077−0.23614.2776
R c n 2 5.8243 × 10 4 0.0161−0.97946.7738
R u s 3 6.0558 × 10 4 0.0138−1.040517.2147
R c n 3 2.0142 × 10 4 0.0134−0.30523.6399
R i n d 1 0.00070.0437−0.20442.0224
R i n d 2 0.00160.0460−0.36582.0825
R i n d 3 0.00070.0391−0.31571.7477
Table 2. Information flow analysis of long-term weekly data.
Table 2. Information flow analysis of long-term weekly data.
Q k TE cn us p cn us TE us cn p us cn
Q 1 0.0405370.0540.0484190.006
Q 2 0.03526220.2610.05193780.005
Q 3 0.03817430.1410.04494180.039
Q 4 0.03577500.1830.04925540.008
Q 5 0.0472300.0140.0443030.031
Table 3. Three significant periods of the information flow of parameter.
Table 3. Three significant periods of the information flow of parameter.
Q T i tZPeriod
Q 1 T 1 24 April 20094.05508 June 2007–24 April 2009
Q 1 T 2 17 November 20172.20841 January 2016–17 November 2017
Q 2 T 1 9 January 20093.07002 March 2007–19 January 2009
Q 2 T 2 15 May 20152.598019 July 2013–15 May 2015
Q 2 T 3 12 October 20182.45342 December 2016–12 October 2018
Q 3 T 1 8 May 20092.906722 June 2007–8 May 2009
Q 3 T 2 17 April 20152.968121 June 2013–17 April 2015
Q 4 T 1 2 January 20093.627916 February 2007–2 January 2009
Q 4 T 2 8 November 20131.834923 December 2011–8 November 2013
Q 4 T 3 28 February 20203.256413 April 2018–28 February 2020
Q 5 T 1 9 April 20103.020223 May 2008–9 April 2010
Q 5 T 2 21 November 20141.801718 March 2013–21 November 2014
Q 5 T 3 13 March 20202.094127 April 2018–13 March 2020
Table 4. Statistics of I series.
Table 4. Statistics of I series.
N i N 1 N 2 N 3 N 4 N 5 Sum
Number14280296624341
Table 5. Significant periods identified by Figure 2.
Table 5. Significant periods identified by Figure 2.
T i Period
T 1 8 February 2008–24 September 2010
T 2 15 August 2014–20 November 2015
T 3 13 April 2018–19 March 2021
Table 6. Analysis of T E value of daily data.
Table 6. Analysis of T E value of daily data.
Period2007–20102014–20172018–2021
Q 1 0.0573 (<0.001)0.0322 (0.262)0.0587 (<0.001)
Q 2 0.0490 (0.006)0.0286 (0.529)0.0630 (<0.001)
Q 3 0.0510 (0.002)0.0505 (0.002)0.0545 (<0.001)
Q 4 0.0601 (<0.001)0.0489 (0.006)0.0519 (<0.001)
Q 5 0.0600 (<0.001)0.0407 (0.041)0.0577 (<0.001)
Table 7. T E values of industry indices and S&P500 index.
Table 7. T E values of industry indices and S&P500 index.
IndustryCode TE ind us p ind us TE us ind p us ind
AFAF801010.SL0.03410.2540.04440.011
Basic chemicals801030.SL0.04480.0120.03940.061
Steel industry801040.SL0.03500.0930.06860.000
Nonferrous metals801050.SL0.03530.1250.04750.005
Electronics801080.SL0.03250.3340.04130.034
Household Electric Appliance801110.SL0.02800.5510.06220.000
Food and beverage801120.SL0.02930.4910.04430.012
Textile, clothing.801130.SL0.04150.0550.04830.004
Light manufacturing801140.SL0.03780.1020.05040.000
Bio-pharmaceutical industry801150.SL0.02940.4990.04180.036
Public utilities801160.SL0.04310.0220.03590.121
Transportation industry801170.SL0.03380.2570.05190.001
Real estate801180.SL0.03040.4270.04690.008
Retail business801200.SL0.03880.0790.03580.154
Social services801210.SL0.04070.0400.04280.028
Comprehensive company801230.SL0.02960.4820.04760.007
Building material801710.SL0.03470.2130.04680.009
Architectural ornament801720.SL0.03930.0720.03020.379
Power equipment801730.SL0.02790.5590.03470.160
National defense industry801740.SL0.03420.2390.03140.351
Computer industry801750.SL0.02530.7190.03830.096
Media industry801760.SL0.03090.3340.04840.006
Communication industry801770.SL0.04230.0470.04140.031
Banking801780.SL0.04570.0210.03330.220
Non-bank Financial Institutions801790.SL0.02510.7750.03720.087
Automobile industry801880.SL0.02700.5910.05080.004
Mechanical equipment801890.SL0.04430.0210.04470.017
Coal industry801950.SL0.04180.0310.04240.016
Petrochemical industry801960.SL0.05100.0010.04690.004
Environmental protection801970.SL0.02820.5950.04750.006
Beauty care801980.SL0.03370.2580.04360.017
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nie, C.-X.; Xiao, J. Dynamics of Information Flow between the Chinese A-Share Market and the U.S. Stock Market: From the 2008 Crisis to the COVID-19 Pandemic Period. Entropy 2022, 24, 1102. https://doi.org/10.3390/e24081102

AMA Style

Nie C-X, Xiao J. Dynamics of Information Flow between the Chinese A-Share Market and the U.S. Stock Market: From the 2008 Crisis to the COVID-19 Pandemic Period. Entropy. 2022; 24(8):1102. https://doi.org/10.3390/e24081102

Chicago/Turabian Style

Nie, Chun-Xiao, and Jing Xiao. 2022. "Dynamics of Information Flow between the Chinese A-Share Market and the U.S. Stock Market: From the 2008 Crisis to the COVID-19 Pandemic Period" Entropy 24, no. 8: 1102. https://doi.org/10.3390/e24081102

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop