1. Introduction
The current literature on bubble tests is confronted with the difficulty to conclude that a price bubble is not caused by time-varying or regime switching fundamentals (
Gürkaynak 2008). Recent tests proposed by (
Phillips et al. 2011,
2015) provide powerful tests, essentially based on the supremum of sequential unit root test statistics, and have been applied to the cryptocurrency markets by (
Cheung et al. 2015;
Corbet et al. 2018;
Hafner 2018), where the latter accounts for time-varying volatility. These tests, however, are purely statistical in nature and do not allow us to infer if structural breaks detected in the time series processes of asset prices are evidence of bubbles or are due to breaks in the underlying (unobserved) fundamentals (
Pesaran and Johnsson 2018). An inclusion of extracted sentiment information, representing the sentiment in the crypto community with their specific linguistic features, contributes to solving this inconclusive puzzle and adds economic and behavioral information into the statistical settings.
Alternative bubble tests have been proposed e.g., by
Pavlidis et al. (
2017) based on the gap between spot and futures prices and applied to equities and exchange rates, and
Pavlidis et al. (
2018) using market expectations of futures prices applied to the oil market—see also
Kruse and Wegener (
2019). With the lack of liquidity in futures prices of cryptocurrencies, it seems difficult to apply these tests to crypto markets today. Further bubble tests include
Cheah and Fry (
2015), who use a continuous time model to identify bubbles via anomalous behaviors of the drift and volatility components, and
Fry and Cheah (
2016), who develop models for financial bubbles and crashes based on statistical physics, with applications to Bitcoin and Ripple.
Bubbles are more prone to emerge in the crypto market than in the stock markets. Theoretical grounds for market efficiency rely crucially on the stabilizing powers of rational speculation—see e.g., (
De Long et al. 1990;
Glosten and Milgrom 1985;
Yang and Brown 2016). Given the presence of limits to arbitrage (e.g., no short-sale venue) and the limited fundamental information in the cryptocurrency market, rational speculation that pulls prices close to its fundamental value is not possible. These constraints result in a hurdle of price discovery.
In this paper, we postulate that a bubble-like behavior of prices is characterized by a smooth transition function that dynamically assigns the probability (loading) to the explosive regime and the random walk regime, given the exogenous sentiment information. By this construction, the speculative bubble can only be pumped up with anomalous sentiment. We therefore develop an econometric framework and a test for a sentiment-induced price bubble.
We target the cryptocurrency-related messages in Stocktwits which attracts the crypto community to share their information, opinions and sentimental moods. We use the sentiment measures constructed by
Nasekin and Chen (
2018) from this social media as their newly constructed sentiment index is viewed as a representative sentiment from the crypto community, with a consideration of their specific linguistic features. The information content of it is relevant for future market performance and can be used to predict the price and volatility evolution (
Chen et al. 2018). As mentioned before, due to the limited knowledge of a fundamental value in this new digital asset class, the mispricing caused by sentiments cannot be promptly corrected or revert to its fundamental value. This is the reason why sentiment entails a short-run predictability because of an inefficient crypto market that defers a price correction process. This slow correction makes sentiment accumulated and amplified; as a consequence, the bubble is able to grow and probably collapse once sentimental bias is finally being corrected.
The econometric framework is that of a smooth transition autoregressive model (STAR), where the transition variable is the sentiment index. The idea is that, in times of a very high sentiment index, corresponding to excessively bullish evaluations, the price dynamics will be driven by an explosive autoregression, while otherwise they follow a random walk. We allow for conditional heteroskedasticity and fat tails by specifying recently proposed score-driven models that are shown to fit the data well. Volatility is allowed to depend explicitly on the sentiment index. This complements previous studies on cryptocurrency volatility as in (
Conrad et al. 2018;
Kjaerland et al. 2018).
We apply the model to the CRIX index, which is a value weighted index of the cryptocurrency market with endogenously determined number of constituents using statistical criteria. The reallocation of the CRIX happens on a monthly and quarterly basis—see (
Trimborn and Härdle 2018) and
thecrix.de for details. We identify several bubble periods, primarily in 2017. Volatility is negatively depending on the sentiment index, meaning that bad sentiments or news increase volatility, a feature commonly called leverage effect in classical financial markets. Here, the leverage effect is explicitly driven by the sentiment index.
The paper is organized as follows. We first present the sentiment index for cryptocurrencies in
Section 2. Then, we introduce the econometric model in
Section 3 and discuss its application to the CRIX.
Section 4 provides the conclusions.
3. A Sentiment-Based Model for Locally Explosive Crypto Prices
Suppose we have a series of log prices for the CRIX, denoted
, and a series of sentiment indices for the crypto market, called
. The idea is to allow for bubble-like behavior of prices, given by a locally explosive autoregressive process, where the explosive regime is determined by a sentiment index of the crypto market. The transition between the random walk and the explosive regime is driven by a smooth transition function as in classical smooth transition AR models (STAR). Furthermore, we take into account conditional heteroskedasticity of the error term and fat tails of the conditional distribution. The model can be written as
where
,
is an i.i.d. error term with mean zero and unit variance,
is volatility, and
is the logistic function, i.e.,
with “steepness” parameter
and “threshold” parameter
. Essentially, the dynamics of
are a mixture of two regimes. When the index
is large, then
will be close to unity and more weight is given to the explosive regime, while if it is small, then
is close to zero and more weight is given to the random walk regime. In the limiting case,
one obtains as a special case the threshold autoregressive model, as
degenerates to the indicator function
. It is for this reason that we interpret the situation
as the bubble regime and
as the non-bubble regime, although in the smooth transition model there is strictly speaking a continuum of regimes. See also
van Dijk et al. (
2002), who adopt the same interpretation.
Estimation of the model can be done by nonlinear least squares—see, e.g.,
Teräsvirta (
1994). However, it will be more efficient to take into account conditional heteroskedasticity and fat tails of the distribution of
by using maximum likelihood estimation (MLE).
The volatility part of the model is taken to be the Beta-t-EGARCH model of (
Creal et al. 2011;
Harvey 2013). That is, we assume that
follows a student-
t distributed random variable with mean zero, scale one, and
degrees of freedom and the volatility dynamics are driven by the score of the likelihood function, i.e.,
By Proposition 12 of
Harvey (
2013), we can write alternatively
, where
is an IID beta distributed r.v. The reason for using a score driven EGARCH rather than the classical EGARCH model of
Nelson (
1991) is that many recent empirical studies have found that the news impact function of classical EGARCH tends to overweigh the impact of large shocks on volatility, while the impact functions of score driven models tend to give a more accurate account of the impact of large shocks—see, e.g.,
Harvey (
2013) for a detailed discussion and motivation for score driven models.
The exponential form of volatility is convenient to augment the volatility equation with explanatory variables without having to worry about the positivity of the variance. We consider an additional term based on the first difference of the sentiment index,
, i.e., the volatility equation becomes
The motivation for using the first differences of the sentiment index rather than the index level is that changes in the index might be more informative to explain price uncertainty, and hence volatility, than the index itself. The sign of the parameter is not a priori clear, as it may be that volatility increases when either the sentiment index increases or decreases. We have tried other functional forms for the impact of the sentiment index, such as , where c is a constant—for example, the sample mean of the sentiment index. However, and perhaps surprisingly, the best form turned out to be the linear one.
Estimation of the transition parameter
is often problematic when this parameter is large, as then the transition function is steep and a large number of observations in a neighborhood of
is required to obtain a reliable estimate of
—see, e.g.,
Granger and Teräsvirta (
1993) for a detailed discussion. In that case, they suggest to first reparameterize
g as
, where
is the sample standard deviation of the sentiment index, then set
to a fixed value, e.g., unity, and estimate the remaining parameters by MLE. The procedure can be reiterated by using a set of fixed values for
on a grid. We follow their advice here using the grid of integers from 1 to 10 and found that, after rescaling of the transition function,
maximizes the likelihood and gives the best results.
We compare our model with one that ignores the sensitivity index in the conditional mean and variance, i.e.,
We call this model M0, as opposed to the above complete model M1, and we would like to test model M1 versus model M0 to see whether the sensitivity index has a significant contribution to explain locally explosive behavior and volatility. Testing is however non-standard as under the null hypothesis,
, there are unidentified parameters,
and
. Thus, likelihood ratio test statistics do not have a chi-square distribution under the null. This is a well known problem in STAR models—see, e.g., (
Granger and Teräsvirta 1993;
van Dijk et al. 2002) for an overview. The simplest solution is to use an LM-type test by estimating the auxiliary regression
where
is an error term, and then test the hypothesis
. As shown by
Luukkonen et al. (
1988), testing
is equivalent to testing
, as the mean term in the auxiliary regression is the first order Taylor expansion of the logistic regression. See
van Dijk et al. (
2002) for details.
For the CRIX and sentiment index, daily observations from 8 August 2014 to 15 May 2018, the estimation results are reported in
Table 1. In the sentiment-free model M0, the constant
is positive and significant, while in model M1 the combined term of
and
is closer to zero. The estimate of
is small but significant, indicating that the explosive regime is important. In addition, the difference in the log likelihood values suggests that the goodness-of-fit of M1 is substantially higher than that of M0. A classical likelihood ratio test clearly would reject M0 in favor of M1. However, as outlined above, this test is non-standard in our context due to unidentified parameters under the null hypothesis. Instead, we perform the auxiliary regression approach in Equation (
3) and obtain the least squares estimator of
with a standard error of 0.0002, so that the
p-value of the t-test for
is very close to zero. Hence, we reject the hypothesis of a random walk in favor of STAR nonlinearity. Rather than estimating the degrees-of-freedom parameter
directly, we estimate its inverse,
, as this often yields more stable results numerically—see, e.g.,
Harvey (
2013). To summarize, our testing approach suggests a significant contribution of the sentiment index to explain locally explosive behavior of the CRIX.
The estimated transition function is given by
, where
is the standard deviation of the sentiment index. This function is shown in
Figure 3, indicating the “bubble regime” for
. Empirically, this regime occurs in about 16% of the sample period, as, for 219 of 1340 observations, the sentiment index is larger than the estimated value of
.
Note that the estimated volatility parameters, except for the sentiment term, are rather similar for the two models and characterized by high persistence, i.e.,
is close to one, and fat tails of the conditional student-t distribution given by a degrees of freedom parameter
of about 2.6 for both models. However, the parameter related to the sentiment index,
is significant and negative, indicating that volatility increases whenever there is a drop in the sentiment index. This is similar to financial markets, where negative news tend to have stronger impact on volatility than positive news, often referred to as the “leverage effect” and first noted by (
Black 1976;
Christie 1982), see
Bauwens et al. (
2012) for a recent overview. In our case, the asymmetry in the impact of positive and negative innovations on volatility is explicitly modeled by the change of the sentiment index.
Figure 4 shows the estimated log volatility process together with the estimated conditional mean of returns, i.e.,
. The shaded areas highlight the estimated bubble periods, which mainly occurred in 2017 and parts of 2018. Not surprisingly, the shaded bubble periods correspond to substantially higher conditional mean returns, while it is close to zero for the non-shaded areas. Unlike
Hafner (
2018) who finds a single bubble regime starting in May 2017 and whose sample ends in December 2017, we find multiple bubble periods, mainly during the period May 2017 to April 2018. Hence, the starting date of these periods coincides with the single regime of
Hafner (
2018), but, due to the volatility of the sentiment index, this regime is decomposed into several sub-regimes. While the procedure advocated by (
Hafner 2018;
Phillips et al. 2011) identifies bubble periods that are of long duration and quite inert to price decreases, our approach produces regimes of shorter duration because, as the sentiment index drops, one quickly leaves a bubble regime.
Furthermore, we find that volatility is generally higher in the bubble regimes, with an average log volatility of −3.58 compared with −4.04 outside of a bubble. However, short term movements of volatility tend to react negatively to changes of the sentiment index, as reflected by the negative estimate of
in model (
2). Hence, our approach of using a sentiment index for modelling cryptocurrencies not only identifies locally explosive bubble periods, but also measures its impact on volatility. Moreover, it can be used as a predictive device, on a daily basis, both for returns and volatility. The method we propose conveys regulation implications in the cryptocurrency markets. Very likely scams come to a play given investors’ irrational exuberance and a surge of initial coin offerings (ICOs). However, these challenge the regulators in the presence of bubbles.