Next Article in Journal
A Chaos-Enhanced Fractional-Order Chaotic System with Self-Reproduction Based on a Memcapacitor and Meminductor
Next Article in Special Issue
An Efficient Approach to Solving the Fractional SIR Epidemic Model with the Atangana–Baleanu–Caputo Fractional Operator
Previous Article in Journal
Certain Novel Fractional Integral Inequalities via Fuzzy Interval Valued Functions
Previous Article in Special Issue
Fractional Factor Model for Data Transmission in Real-Time Monitoring Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Discrete q-Exponential Limit Order Cancellation Time Distribution

Institute of Theoretical Physics and Astronomy, Vilnius University, Saulėtekio al. 3, 10257 Vilnius, Lithuania
Fractal Fract. 2023, 7(8), 581; https://doi.org/10.3390/fractalfract7080581
Submission received: 23 June 2023 / Revised: 23 July 2023 / Accepted: 26 July 2023 / Published: 28 July 2023
(This article belongs to the Special Issue Feature Papers for Mathematical Physics Section)

Abstract

:
Modeling financial markets based on empirical data poses challenges in selecting the most appropriate models. Despite the abundance of empirical data available, researchers often face difficulties in identifying the best fitting model. Long-range memory and self-similarity estimators, commonly used for this purpose, can yield inconsistent parameter values, as they are tailored to specific time series models. In our previous work, we explored order disbalance time series from the broader perspective of fractional L’evy stable motion, revealing a stable anti-correlation in the financial market order flow. However, a more detailed analysis of empirical data indicates the need for a more specific order flow model that incorporates the power-law distribution of limit order cancellation times. When considering a series in event time, the limit order cancellation times follow a discrete probability mass function derived from the Tsallis q-exponential distribution. The combination of power-law distributions for limit order volumes and cancellation times introduces a novel approach to modeling order disbalance in the financial markets. Moreover, this proposed model has the potential to serve as an example for modeling opinion dynamics in social systems. By tailoring the model to incorporate the unique statistical properties of financial market data, we can improve the accuracy of our predictions and gain deeper insights into the dynamics of these complex systems.

1. Introduction

In the realm of econometrics and econophysics, researchers have long explored the intriguing properties of long-range memory in natural and social systems, often characterized by self-similarity and power-law statistical distributions. In financial markets, the abundance of data related to volatility, trading activity, and order flow has provided fertile ground for empirical investigations into long-range memory properties [1,2,3,4,5]. Various models with fractional noise have been proposed in econometrics to describe volatility time series [1,6,7,8,9,10,11]. However, from the perspective of econophysics, these models tend to lack sufficient microscopic reasoning and primarily serve as macroscopic descriptions of complex social systems, often based on assumptions of long-range memory. As a result, predicting stock price movements, despite the application of advanced trading algorithms and machine learning techniques, remains an enduring challenge for researchers [12,13,14].
To deepen our understanding of long-range memory in social systems, it becomes essential to compare the macroscopic modeling with empirical analyses. Our previous review [15] raised the crucial question of whether observed long-range memory in social systems is a result of genuine long-range memory processes or merely an outcome of the non-linearity of Markov processes. In our endeavor to explore this question, we have reduced the macroscopic dynamics of financial markets to a set of stochastic differential equations (SDEs) and related them to a microscopic agent-based model capable of reproducing empirical probability density functions (PDFs) and power spectral densities (PSDs) of absolute returns [16,17,18,19]. Moreover, we have employed this model to interpret the scaling behavior of volatility return intervals [20]. This approach could also find relevance in other social systems, where non-linear SDEs derived from agent-based models describing opinion or population dynamics lead to macroscopic descriptions [21,22]. Given these complexities, selecting the most appropriate model for interpreting empirical time series poses a significant challenge.
A promising line of inquiry comes from the observation that market-order flows exhibit long-range persistence, attributed to the order-splitting behavior of individual traders [23]. This finding reinforces the presence of genuine long-range memory in financial systems, as recently confirmed in a comprehensive investigation [24]. Consequently, we anticipate the manifestation of persistence in the limit-order flow as well.
In our previous contribution, we explored order disbalance time series in financial markets from the perspective of fractional Lévy stable motion (FLSM) [25]. Although FLSM and auto-regressive fractionally integrated moving average (ARFIMA) processes offer generalized models for self-similar fractional time series [26,27,28], they require more comprehensive approaches to explain the observed statistical properties of order disbalance in financial markets.
In this study, we continue our analysis of limit order flow using LOBSTER data, see technical description [29] or short description https://lobsterdata.com/info/HowDoesItWork.php (accessed on 20 July 2023), aiming to demonstrate the possibility of simple, yet specific modeling of empirical time series. A key innovation of our approach lies in the discovery of a general statistical distribution governing limit order cancellation times. We propose the application of Tsallis statistics, a generalization of Boltzmann–Gibbs statistics [30,31,32], to fit the histograms of limit order cancellation times. Remarkably, the distribution’s parameters for various stocks and time periods appear close, suggesting a universal nature of the observed statistical property.
To augment the identification of the limit order cancellation times’ q-exponential distribution as a new stylized fact in limit order statistical properties, we further consider the assumption of limit order flow as fractional Lévy noise (FLN). Empirical data analysis supports the plausibility of such an assumption. Ultimately, we present a relatively straightforward model with empirical grounding, serving as an artificial model defining statistical properties of order flow and disbalance in financial markets. This new approach addresses certain contradictions uncovered in previous investigations from the perspective of FLSM [25].
The rest of this paper is organized as follows: Section 2 introduces the discrete Tsallis q-exponential distribution; Section 3 defines and investigates the empirical statistical properties of limit order cancellation times; Section 4 examines a modified version of the order disbalance time series and its statistical properties; Section 5 studies the statistical properties of the limit order submission sequence; Section 6 introduces the artificial order disbalance model, and finally, we conclude the results and summarize our findings.

2. The Discrete Tsallis q-Exponential Distribution

Let us introduce the discrete Tsallis q-exponential distribution to capture the probability mass function (PMF) for discrete variables k = 1 , 2 , 3 , based on the continuous Tsallis q-exponential probability density function (PDF) [30], given by:
P λ , q ( x ) = ( 2 q ) λ ( 1 ( 1 q ) x λ ) 1 ( 1 q ) ,
where ( 1 ( 1 q ) x λ ) > 0 . First, we explore an approach using the survival function of the q-exponential distribution [33]:
S P λ , q ( x ) = ( 1 + ( q 1 ) x λ ) 2 q 1 q .
Withthis survival function, we define the PMF of the discrete q-exponential distribution as:
P λ , q ( d s ) ( k ) = S P λ , q ( k 1 ) S P λ , q ( k ) = ( 1 + ( q 1 ) ( k 1 ) λ ) 2 q 1 q ( 1 + ( q 1 ) k λ ) 2 q 1 q .
The PMF (3) is normalized for k = 1 , 2 , . Interestingly, this power-law PMF converges to the geometric distribution as q 1 , which aligns with expectations that we have one more q-generalization of the geometric distribution [34,35]. We can express this convergence as:
lim q 1 P λ , q ( d s ) ( k ) = exp y λ ( exp λ 1 ) = ( 1 p ) k 1 p ,
where we denote p = 1 exp λ . This result confirms the suitability of the PMF (3) as a Tsallis q-generalization of the geometric distribution, making it a potential candidate for modeling event waiting times in social and other complex systems.
The PMF for the discrete variable k often is defined as follows, [36]:
P λ , q ( d ) ( k ) = P λ , q ( k ) i = 1 i = P λ , q ( i ) .
After substitution of Equation (1) into Equation (5) we obtain a version of the discrete Tsallis q-exponential distribution as:
P λ , q ( d ) ( k ) = { 1 + ( 1 q ) λ k } 1 1 q ( ( q 1 ) λ ) 1 1 q H u r w i t z Z e t a 1 q 1 , 1 + 1 λ ( q 1 ) .
The explicit form of the limit of this PMF when q 1 is currently unclear; thus, we will use Equation (3) instead of Equation (6) in this investigation.

3. Cancellation Times of Limit Orders in the Order Flow of Financial Markets

In this section, we analyze the cancellation times of limit orders in the order flow of financial markets using the LOBSTER data for all NASDAQ-traded stocks [29]. The LOB data that LOBSTER reconstructs originates from NASDAQ’s Historical TotalView-ITCHfiles (http://nasdaqtrader.com, accessed on 20 July 2023). Here we construct the daily time series of order flow from 3 to 31 August 2020, a total of 21 working days. This data exemplifies an empirical social system appropriate for investigating power-law statistical properties. The statistical properties of the cancellation times of limit orders are of particular interest in this study as they play a vital role in understanding order flow and disbalance dynamics.
We retrieve LOBSTER data files: message.csv and orderbook.csv for each selected trading day and ticker (stock). These files contain the complete list of events causing an update of LOB up to the ten levels of prices. Any event j changing the LOB state has a time value t j = j counted in the event space. Thus, we deal with a discrete time scale and avoid daily seasonality related to the fluctuating activity of traders. Every limit order has its identification code; therefore, it is straightforward to pair limit order submission and cancellation events. Let us define the limit order cancellation time as the time difference between the order cancellation and submission event. Seeking a simplified approach to the order flow modeling, we consider order cancellation and execution as equivalent events leading to the complete deletion of the previously submitted limit order. In the LOBSTER message.csv file, these events are denoted as event types 3 and 4.
We can calculate the list of all cancellation times τ i from empirical data or group this data according to the price levels or the volume of the limit orders. The dependence of the cancellation time PDF on the level of price is weak. The two sub-figures in Figure 1 illustrate the empirical PDFs of the cancellation times calculated separately for the four price levels of the two stocks AMZN and MA. It is worth noting that the dependence of the cancellation time PDF on the price level is weak, with slight differences observed between the first price level (red) and the subsequent levels (blue, green, and black).
We group the cancellation times τ i into four categories based on the volumes v i of the limit orders: (1) v i 3 ; (2) 3 < v i 23 ; (3) 23 < v i 123 ; (4) 123 < v i 623 . Each group provides sufficient τ i values from the joint period of 21 trading days, enabling the calculation of histograms and evaluation of empirical probability density functions (PDFs). The two sub-figures in Figure 2 illustrate the empirical PDFs of AMZN and MA stock’s cancellation times calculated separately for the four groups of limit order volumes. Once again, the empirical distributions show weak dependence on the limit order volumes, reinforcing our modeling assumption of independence from price and volume.
Based on the independence assumption, we fit the empirical histograms of total limit order cancellation times submitted on the stock market for ten stocks (NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, JNJ) using the PMF (3). The fitting is performed using the Maximum Likelihood Estimator (MLE) method [37]. Data for every stock is joined from 21 trading day of August 2020. Only limit orders up to the ten levels of prices are included in this MLE calculation. Table 1 provides the calculated λ and q values for each stock from the joint empirical histograms.
The mean values of parameters for these ten stocks are λ = 0.3 and q = 1.5 . Figure 3 demonstrates the close fitting PMFs for all ten stocks. The thin color lines represent PMFs for individual stocks, while the thick black line represents the PMF with mean values of parameters.

4. Live Limit Orders and Order Disbalance

The analysis of limit order cancellation times revealed intriguing empirical properties with remarkably close values of parameters for different stocks. This suggests the assumption that the observed probability mass function (PMF) might be stable over time, which can be valuable in constructing simplified order flow and disbalance models. To achieve this objective, we reconstruct the order disbalance time series from the data in the message files. We adopt an alternative approach to form disbalance time series to explore more aspects of potential memory effects in empirical order disbalance time series.
As mentioned in Section 3, the LOBSTER data’s message files contain sufficient information about limit orders submitted to the exchange, including their prices, volumes, event times of submission, cancellation, and full execution. With this information, we can construct a list of live limit orders at any discrete event step. We use the notation L O { i 1 , i 2 , v i 1 , i 2 } to denote a limit order, where i 1 represents the order submission event time, i 2 is the order cancellation or full execution time, and v i 1 , i 2 is the volume (positive for buy limit orders and negative for sell limit orders). To simplify the modeling approach, we ignore the indexing of price levels, as it is not crucial for the investigation of the time series. We include limit orders up to the tenth level of prices on both the buy and sell sides. Using this notation, we can rewrite the order disbalance as follows:
X ( j ) = i 1 j < i 2 v i 1 , i 2 = i = 1 j Y ( i ) .
Here, the first sum is over all the live limit orders, including all the limit order volumes v i 1 , i 2 submitted before event j and waiting for cancellation or execution. A sequence of limit order submissions of length N generates a series of order disbalance X ( j ) of length 2 N since each submission is paired with a cancellation or execution event.
This modified definition of order disbalance series provides a slightly simplified version compared to the previous approach used in [25], which relied on the data from the orderbook.csv file. By excluding order book events related to the partial execution of orders, we can focus on exploring the statistical properties of the time series with a simplified model. The mean squared displacement (MSD) and Hurst exponents are evaluated for these empirical time series, considering two levels of random (reshuffled) time series. The first level involves reshuffling only the empirical sequence of volumes v i 1 , i 2 , yielding new series X R v ( j ) and Y R v ( j ) . This reshuffling destroys the correlation contained in the limit order submission sequence. The second level involves additional complete reshuffling of all increments Y R ( j ) = R a n d o m [ Y ( i ) ] , leading to the memory-less time series X R ( j ) , where the anti-correlation arising from the limit order cancellation events is destroyed as well.
Table 2 presents the MSD and Hurst exponents calculated for the order disbalance X ( j ) of ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. The Hurst exponents for different series are listed and evaluated using AVE and Higuchi’s method. The empirical analysis shows minor changes in scaling parameters compared to the results in [25].
Our previous exploration of the order flow from the perspective of FLSM or ARFIMA models [25] has sparked new questions and raised concerns about the applicability of these models. Notably, while considering the time series X ( j ) as an accumulated ARFIMA(0,d,0) process provided valuable insights, it was observed that empirical order disbalance time series exhibit strict boundedness. This discrepancy indicates the existence of diffusion reversion mechanisms that fall outside the scope of the ARFIMA model.
Another critical challenge that emerged during our analysis is related to the auto-codifference of empirical Y ( j ) time series [38]. The order disbalance increments Y ( j ) demonstrate clear auto-dependence, as shown in Figure 4. However, fitting the empirical auto-codifference with the expected theoretical form D t η derived for fractional L’evy noise [39] presented difficulties. Qualitatively, the asymptotic auto-codifference of the ARFIMA(0,d,0) process only matched the empirical data for memory parameter values in the region of d 0.375 . However, within this region, the accumulated ARFIMA process exhibited strong unbounded behavior, which is inconsistent with the empirical order disbalance time series.
To gain a more specific empirical perspective, we investigated the potential application of the FLSM approach to the order disbalance series, assuming that the distribution of limit order volumes for the most liquid stocks followed a stable L’evy distribution with parameter 2 > α > 1 [25]. However, this assumption turned out to be very approximate due to the distinctive resonance structure in the PDF of volumes. This observation led us to reevaluate the limit order flow from a different angle.
One specific issue arising from the FLSM perspective was related to the sample mean squared displacement (MSD), which has an exponent λ M S D = 2 d + 1 for large samples and lags [26]. However, these values of the parameter d defined by the empirical λ M S D in Table 2 range from d = 0.11 for AMZN to d = 0.07 for JNJ. These values contradict the empirical analysis in [25], which suggested d = H A V H A V R 0.3 for all stocks.
To address limitations of the previous approach and gain deeper insights into the auto-dependence of increments Y ( j ) , we introduced a modified order disbalance definition, Equation (7). The pairing of limit order increments with opposite signs due to cancellation times assumed to follow a q-exponential PMF provides a new quantitative model contributing to the observed auto-dependence. This new mechanism of auto-dependence is fundamentally different from the fractionally integrated increments in the ARFIMA0,d,0 series. Thus, the simplification introduced here is helpful for the more precise interpretation of memory effects in the order disbalance time series. In Figure 5, we visualize the results of empirical analysis presented in Table 2.
The auto-dependence in the Y ( j ) series can also have other origins; for example, some dependence can arise from the original sequence of the limit order volumes v i 1 , i 2 . Such memory effects in market order flow have been investigated in [14,23,40], and competing interpretations have been provided. To evaluate the possible contribution of the auto-dependence in the limit order volume sequence, we use the random reshuffling procedure of the limit order volumes to obtain X R v ( j ) and Y R v ( j ) series with zero volume correlation. The evaluated Hurst exponents of the X R v ( j ) series, see Table 2 and Figure 5, are slightly shifted to the side of smaller values. Notably, time series X R v ( j ) remained strictly bounded, in contrast to the unbounded behavior observed in X R ( j ) , suggesting that the diffusion of the order disbalance series is self-reverted due to every limit order being canceled or executed. The observed auto-dependence in Y ( j ) and its origin from both the cancellation times and the sequence of limit order volumes raise intriguing questions about the underlying mechanisms of market dynamics. Further theoretical consideration and empirical studies may help uncover the nature of the long-range memory in order disbalance and contribute to a more comprehensive understanding of market order flow.

5. Time Series of Limit Order Submissions

It is essential to investigate the statistical properties of time series that comprise only a sequence of limit order submissions to the market, denoted as X L ( j ) ,
X L ( j ) = i = 1 j v i = i = 1 j Y L ( i ) ,
where v i represents the volume of the submitted limit order, and order cancellation or execution is not included in the series. In this case, we can confidently consider the series from the perspective of FLSM, as the order flow remains uninterrupted by cancellation events. The analysis results are provided in Table 3.
The first method used to evaluate the parameter d is based on the assumption that the series X L ( j ) exhibits FLSM-like behavior, with the memory parameter d M S D derived from the sample mean squared displacement (MSD) exponent λ M S D = 2 d M S D + 1 , as referenced in [25,26]. The second method, d C D , involves calculating the sample auto-codifference C D ( t ) with lag t of series Y L ( j ) , as defined in Equation (9). It is worth noting that this method is sensitive to the evaluation of parameter α due to its reliance on the asymptotic form of auto-codifference for fractional L’evy noise, as given in Equation (10), leading to the definition d C D = γ / α + 1 1 / α [38].
C D ( t ) = ln j = 1 + t j = N i Y L ( j ) + j = 1 j = N t i Y L ( j ) N j = 1 + t j = N i ( Y L ( j ) Y L ( j t ) ) ,
where i denotes imaginary units and N is the length of the series. Note that this method is sensitive to the evaluation of parameter α , as we use the asymptotic form
C D ( t ) t γ = t α H α ,
of auto-codifference for the fractional Lévy noise, see [38]. The third method, d H , relies on the relation used in [25], where d H = H A V H A V R , assuming that the time series X L ( j ) is fractional L’evy stable motion-like.
Evaluation of memory parameter d using three different methods and results in the Table 3 support the idea that the limit order series X L ( j ) is FLSM-like. Fluctuations of memory parameters between methods are considerably smaller than fluctuations between different stocks. The major problem in this consideration remains the assumption of the limit order volume distribution according to the Lévy stable distribution. We fit the α parameter of this distribution to only the tail part v i > 35 of the empirical histogram. Results provided in the Table 3 show very stable values for all stocks α 1.8 .

6. Artificial Order Disbalance Time Series

We propose an artificial order disbalance time series model that captures the main observations of this study. The model involves two random sequences: (a) A sequence of limit order volumes generated as ARFIMA{0,d,0}{ α , N , v max }, where d is the memory parameter, α is the stability index, N is the length of the sequence, and v max defines the maximum possible absolute value of the volume, selected from the observed empirical data. (b) A corresponding sequence of limit order cancellation times with the same length N generated using the probability mass function (PMF) P λ , q ( d s ) ( k ) defined by Equation (3).
With these two independent sequences, we can calculate the model sequence of events X M ( j ) = i = 1 i = j Y M ( i ) defined by v i 1 , i 2 m o d , which includes cancellation and execution events. This generated random sequence represents the artificial analog of order disbalance time series introduced in Equation (7), and it is used for the empirical analysis. We choose the artificial model parameters aiming to reproduce the empirical data: α = 1.8 ; N = 200000 ; v m a x = 1000 ; λ = 0.42 ; q = 1.5 .
We compare results generated by the artificial series Y M ( i ) and X M ( j ) with those from the empirical series. In Table 4, we present the MSD and Hurst exponents, together with the memory parameter d C D , for the artificially generated series and empirical series of stocks MA, NFLX, and TSLA.
The results in the Table 4 are averaged over five realizations of artificial series and five daily empirical series. The model results with d = 0.2 closely match the series of stock MA, while the model results with d = 0.3 are similar to the empirical series of NFLX and TSLA. Thus, we conclude that the empirically established q-exponential nature of the limit order cancellation times helps to reconstruct the properties of the order disbalance time series X ( j ) . Such reconstructions are crucial for interpreting order disbalance time series from the perspective of FLSM.
In Figure 6, we demonstrate the application of sample auto-codifference [38], as defined in Equation (9), for the empirical and artificial model time series. The left sub-figure shows the NFLX auto-codifference of the limit order flow Y L ( i ) for the first five trading days of August 2020, along with the best fit by the asymptotic curve from Equation (10). The fluctuations in defined parameters γ = { 0.30 , 0.25 , 0.33 , 0.18 , 0.30 } and d C D = { 0.28 , 0.31 , 0.26 , 0.34 , 0.28 } are considerable for the daily series, but the average d is close to 0.3 . The right sub-figure compares three auto-codifference curves, averaged over five sample series related to NFLX data. The red curve represents the auto-codifference of the empirical order disbalance increment series Y ( j ) , the black curve of the corresponding order disbalance artificial model series with d = 0.3 , and the green curve of the artificial limit order flow series Y L ( i ) . The similar behavior of all three curves and good correspondence between the empirical and synthetic series indicate the usefulness of auto-codifference in the research of persistence in financial and other social systems.
An intriguing result is that the auto-codifference of two very different processes in terms of self-similarity exhibits the expected behavior. Positively correlated fractional L’evy noise-like series Y L ( i ) display a similar auto-codifference as series Y ( i ) that exhibit anti-persistence from a self-similarity perspective ( H < < 0.5 ). For example, our preliminary investigation of empirical series based only on the limit order signs shows that the persistence of limit order flow disappears when cancellation and execution of orders are included in the series.
In conclusion, the artificial order disbalance time series model provides valuable insights into the persistence and memory properties of the series. The comparison with empirical data demonstrates the usefulness of the model and supports the conclusion that the q-exponential nature of limit order cancellation times contributes to the observed persistence in order disbalance time series. The application of auto-codifference further enhances our understanding of the self-similarity behavior in these financial systems, opening up new avenues for research in this domain.

7. Discussion and Conclusions

In this study, we have delved into the statistical properties of limit order cancellation times in financial markets to better understand the peculiarities of order disbalance time series from the perspective of fractional L’evy stable motion (FLSM). Our previous investigation of order disbalance time series within the framework of FLSM yielded contradictory conclusions [25]. However, empirical time series often exhibit specific characteristics that necessitate careful consideration during empirical analysis. To address the question of why order disbalance time series in financial markets are strictly bounded, we have focused on the statistical properties of limit order cancellation times, treating them as discrete events.
To this end, we have introduced the concept of a discrete q-exponential distribution, presented in Equation (3), as a q-extension of the geometric distribution, based on the theoretical foundations of generalized Tsallis statistics [41]. This distribution allows for a better fit of empirical limit order cancellation times, revealing their weak sensitivity to order sizes and price levels. Remarkably, the parameters of the fitted discrete q-exponential PMF, λ = 0.3 , and q = 1.5 , have proven consistent across ten stocks and trading days analyzed. Building on this unique statistical property of cancellation times, we model and empirically investigate limit order flow and order disbalance time series.
The clear distinction between the series of limit order flow X L ( j ) and the series of order disbalance X ( j ) , which includes order cancellations, is essential in this research. Limit order flow series display persistence and remain unbounded, making them FLSM-like. On the other hand, order disbalance series, which includes order cancellations and executions, is bounded and exhibits anti-persistence. It is important to acknowledge that limit order flow in financial markets serves as a prime example of time series requiring thorough empirical analysis to validate the use of econometric methods for time series analysis.
By combining fractional L’evy stable limit order flow with the q-exponential cancellation time distribution, we propose a relatively straightforward model of order disbalance in financial markets. This model also serves as an illustrative example of the broader approach to modeling opinion dynamics in various social systems. Our research highlights the significance of social system modeling to ensure the proper utilization of formal mathematical methods.
In summary, our study contributes to a better understanding of order disbalance time series and their memory effects in financial markets. The incorporation of the discrete q-exponential distribution for modeling cancellation times provides valuable insights into the persistence of the order disbalance time series and helps address the question of their boundedness. Furthermore, the combination of FLSM and the q-exponential distribution proves to be a promising approach for modeling social systems, which can be explored further in future research.
In conclusion, the statistical properties of limit order cancellation times and their impact on order disbalance time series have shed light on the dynamics of financial markets. This study not only enhances our understanding of complex financial systems but also highlights the importance of empirical analysis when applying mathematical methods to social system modeling. By bridging the gap between theory and empirical observations, we contribute to the development of more accurate models and deeper insights into the behavior of financial markets and social systems as a whole.

Funding

This research received no external funding.

Data Availability Statement

Restrictions apply to the availability of these data. Data was obtained from LOBSTER: high-frequency, easy-to-use and latest limit order book data for research at https://lobsterdata.com/ (accessed on 20 July 2023), and are available from the authors/at https://lobsterdata.com/contact.php (accessed on 20 July 2023) with the permission of the LOBSTER data provider founded by a group of financial econometricians affiliated with Humboldt Universität zu Berlin, Germany and the University of Vienna, Austria.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ARFIMAAuto–regressive fractionally integrated moving average
AVEAbsolute Value estimator
FBMFractional Brownian motion
FGNFractional Gaussian noise
FLSMfractional Lèvy stable motion
MSDMean squared displacement
PDFProbability density function
PMFProbability mass function

References

  1. Baillie, R.T.; Bollerslev, T.; Mikkelsen, H.O. Fractionally integrated generalized autoregressive conditional heteroskedasticity. J. Econom. 1996, 74, 3–30. [Google Scholar] [CrossRef]
  2. Engle, R.; Patton, A. What good is a volatility model? Quant. Financ. 2001, 1, 237–245. [Google Scholar] [CrossRef]
  3. Plerou, V.; Gopikrishnan, P.; Gabaix, X.; Amaral, L.; Stanley, H. Price fluctuations, market activity and trading volume. Quant. Financ. 2001, 1, 262–269. [Google Scholar] [CrossRef]
  4. Gabaix, X.; Gopikrishnan, P.; Plerou, V.; Stanley, H.E. A theory of power law distributions in financial market fluctuations. Nature 2003, 423, 267–270. [Google Scholar] [CrossRef] [PubMed]
  5. Rangarajan, G.; Ding, M. (Eds.) Processes with Long-Range Correlations: Theory and Applications, Lecture Notes in Physics; Springer: Berlin/Heidelberg, Germany, 2003; Volume 621, p. XVIII, 398. [Google Scholar]
  6. Ding, Z.; Granger, C.W.J.; Engle, R.F. A long memory property of stock market returns and a new model. J. Empir. Financ. 1993, 1, 83–106. [Google Scholar] [CrossRef]
  7. Bollerslev, T.H.-O.; Mikkelsen, H.O. Modeling and pricing long-memory in stock market volatility. J. Econom. 1996, 73, 151–184. [Google Scholar]
  8. Giraitis, L.; Leipus, R.; Surgailis, D. ARCH() models and long memory. In Handbook of Financial Time Series; Anderson, T.G., Davis, R.A., Kreis, J., Mikosh, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2009; pp. 71–84. [Google Scholar] [CrossRef]
  9. Conrad, C. Non-negativity conditions for the hyperbolic GARCH model. J. Econom. 2010, 157, 441–457. [Google Scholar] [CrossRef] [Green Version]
  10. Arouri, M.E.H.; Hammoudeh, S.; Lahiani, A.; Nguyen, D.K. Long memory and structural breaks in modeling the return and volatility dynamics of precious metals. Q. Rev. Econ. Financ. 2012, 52, 207–218. [Google Scholar] [CrossRef] [Green Version]
  11. Tayefi, M.; Ramanathan, T.V. An overview of FIGARCH and related time series models. Austrian J. Stat. 2012, 41, 175–196. [Google Scholar] [CrossRef]
  12. Kercheval, A.N.; Zhang, Y. Modelling high-frequency limit order book dynamics with support vector machines. Quant. Financ. 2015, 15, 1315–1329. [Google Scholar] [CrossRef]
  13. Kumar, I.; Dogra, K.; Utreja, C.; Yadav, P. A Comparative Study of Supervised Machine Learning Algorithms for Stock Market Trend Prediction. In Proceedings of the 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India, 20–21 April 2018. [Google Scholar] [CrossRef]
  14. Zaznov, I.; Kunkel, J.; Dufour, A.; Badii, A. Predicting Stock Price Changes Based on the Limit Order Book: A Survey. Mathematics 2022, 10, 1234. [Google Scholar] [CrossRef]
  15. Kazakevicius, R.; Kononovicius, A.; Kaulakys, B.; Gontis, V. Understanding the Nature of the Long-Range Memory Phenomenon in Socioeconomic Systems. Entropy 2021, 23, 1125. [Google Scholar] [CrossRef]
  16. Kononovicius, A.; Gontis, V. Three state herding model of the financial markets. EPL 2013, 101, 28001. [Google Scholar] [CrossRef] [Green Version]
  17. Gontis, V.; Kononovicius, A. Consentaneous agent-based and stochastic model of the financial markets. PLoS ONE 2014, 9, e102201. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Gontis, V.; Kononovicius, A. Burst and inter-burst duration statistics as empirical test of long-range memory in the financial markets. Phys. A 2017, 483, 266–272. [Google Scholar] [CrossRef] [Green Version]
  19. Gontis, V.; Kononovicius, A. The consentaneous model of the financial markets exhibiting spurious nature of long-range memory. Phys. A 2018, 505, 1075–1083. [Google Scholar] [CrossRef] [Green Version]
  20. Gontis, V.; Havlin, S.; Kononovicius, A.; Podobnik, B.; Stanley, H.E. Stochastic model of financial markets reproducing scaling and memory in volatility return intervals. Phys. A 2016, 462, 1091–1102. [Google Scholar] [CrossRef] [Green Version]
  21. De, A.; Valera, I.; Ganguly, N.; Bhattacharya, S.; Gomez-Rodriguez, M. Learning and Forecasting Opinion Dynamics in Social Networks. In Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, Barcelona, Spain, 5–10 December 2016; Lee, D.D., Sugiyama, M., von Luxburg, U., Guyon, I., Garnett, R., Eds.; pp. 397–405. [Google Scholar]
  22. Gontis, V.; Kononovicius, A. Spurious memory in non-equilibrium stochastic models of imitative behavior. Entropy 2017, 19, 387. [Google Scholar] [CrossRef] [Green Version]
  23. Lillo, F.; Mike, S.; Farmer, J.D. Theory for long memory in supply and demand. Phys. Rev. E 2005, 71, 066122. [Google Scholar] [CrossRef] [Green Version]
  24. Sato, Y.; Kanazawa, K. Exact solution to a generalised Lillo-Mike-Farmer model with heterogeneous order-splitting strategies. arXiv 2023, arXiv:2306.13378. [Google Scholar] [CrossRef]
  25. Gontis, V. Order flow in the financial markets from the perspective of the Fractional Lévy stable motion. Commun. Nonlinear Sci. Numer. Simul. 2022, 105, 106087. [Google Scholar] [CrossRef]
  26. Burnecki, K.; Weron, A. Fractional Levy stable motion can model subdiffusive dynamics. Phys. Rev. E 2010, 82, 021130. [Google Scholar] [CrossRef] [Green Version]
  27. Burnecki, K.; Weron, A. Algorithms for testing of fractional dynamics: A practical guide to ARFIMA modelling. J. Stat. Mech. 2014, 2014, P10036. [Google Scholar] [CrossRef]
  28. Burnecki, K.; Sikora, G. Identification and validation of stable ARFIMA processes with application to UMTS data. Chaos Solitons Fractals 2017, 102, 456–466. [Google Scholar] [CrossRef]
  29. Huang, R.; Polak, T. LOBSTER: The limit Order Book Reconstructor; Discussion Paper School of Business and Economics; Technical Report; Humboldt Universitat zu Berlin: Berlin, Germany, 2011. [Google Scholar]
  30. Tsallis, C. Nonadditive entropy and nonextensive statistical mechanics—An overview after 20 years. Braz. J. Phys. 2009, 39, 337–356. [Google Scholar] [CrossRef]
  31. Tsallis, C. Economics and finance: Q-Statistical stylized features galore. Entropy 2017, 19, 457. [Google Scholar] [CrossRef] [Green Version]
  32. Stosic, D.; Stosic, D.; Stosic, T. Nonextensive triplets in stock market indices. Phys. A Stat. Mech. Appl. 2019, 525, 192–198. [Google Scholar] [CrossRef] [Green Version]
  33. Bercher, J.F.; Vignat, C. A new look at q-exponential distributions via excess statistics. Phys. A Stat. Mech. Appl. 2008, 387, 5422–5432. [Google Scholar] [CrossRef] [Green Version]
  34. Matsuzoe, H.; Ohara, A. Geometry for q-Exponential Families. In Recent Progress in Differential Geometry and Its Related Fields; World Scientific: Singapore, 2011; pp. 55–71. [Google Scholar] [CrossRef]
  35. Yalcin, F.; Eryilmaz, S. q-geometric and q-binomial distributions of order k. J. Comput. Appl. Math. 2014, 271, 31–38. [Google Scholar] [CrossRef]
  36. Nekoukhou, V.; Alamatsaz, M.H.; Bidram, H. A Discrete Analog of the Generalized Exponential Distribution. Commun. Stat. Theory Methods 2012, 41, 2000–2013. [Google Scholar] [CrossRef]
  37. Shalizi, C.R. Maximum Likelihood Estimation for q-Exponential (Tsallis) Distributions. arXiv 2007, arXiv:math/0701854. [Google Scholar]
  38. Wyłomańska, A.; Chechkin, A.; Gajda, J.; Sokolov, I.M. Codifference as a practical tool to measure interdependence. Phys. A Stat. Mech. Appl. 2015, 421, 412–429. [Google Scholar] [CrossRef] [Green Version]
  39. Astrauskas, A.; Lévy, J.B.; Taqqu, M.S. The asymptotic dependence structure of the linear fractional Lévy motion. Liet. Mat. Rink. 1991, 31, 3–28. [Google Scholar]
  40. Tóth, B.; Palit, I.; Lillo, F.; Farmer, J.D. Why is equity order flow so persistent? J. Econ. Dyn. Control 2015, 51, 218–239. [Google Scholar]
  41. Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
Figure 1. Examples of the cancellation time histograms (PDFs) for the AMZN and MA stocks. Empirical PDFs for the four price levels are calculated for the joint period of 21 trading days. The plots are red for the first level, blue for the second, green for the third, and black for the fourth.
Figure 1. Examples of the cancellation time histograms (PDFs) for the AMZN and MA stocks. Empirical PDFs for the four price levels are calculated for the joint period of 21 trading days. The plots are red for the first level, blue for the second, green for the third, and black for the fourth.
Fractalfract 07 00581 g001
Figure 2. Examples of cancellation time histograms (PDFs) for the AMZN and MA stocks. Empirical PDFs for the four intervals of limit order volumes are calculated for the joint period of 21 trading days. (red) v i 3 ; (blue) 3 < v i 23 ; (green) 23 < v i 123 ; (black) 123 < v i 623 .
Figure 2. Examples of cancellation time histograms (PDFs) for the AMZN and MA stocks. Empirical PDFs for the four intervals of limit order volumes are calculated for the joint period of 21 trading days. (red) v i 3 ; (blue) 3 < v i 23 ; (green) 23 < v i 123 ; (black) 123 < v i 623 .
Fractalfract 07 00581 g002
Figure 3. Fitted PMFs (3) of the joined one-month cancellation times for the ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. The thin color lines represent PMFs for the individual stocks, and the thick black line represents the PMF with mean values of parameters λ = 0.3 and q = 1.5 . Data for each MLE calculation is joined from 21 trading day of August 2020.
Figure 3. Fitted PMFs (3) of the joined one-month cancellation times for the ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. The thin color lines represent PMFs for the individual stocks, and the thick black line represents the PMF with mean values of parameters λ = 0.3 and q = 1.5 . Data for each MLE calculation is joined from 21 trading day of August 2020.
Fractalfract 07 00581 g003
Figure 4. Auto-codifference, (9), from a single empirical trajectory Y ( j ) of AMZN stock traded on 3 August 2020, blue line. Black line, the fit of empirical auto-codifference by 0.388 t 0.254 . Red line, auto-codifference calculated from a single trajectory of fractional Lévy noise with parameters d = 0.375 and α = 2.0 .
Figure 4. Auto-codifference, (9), from a single empirical trajectory Y ( j ) of AMZN stock traded on 3 August 2020, blue line. Black line, the fit of empirical auto-codifference by 0.388 t 0.254 . Red line, auto-codifference calculated from a single trajectory of fractional Lévy noise with parameters d = 0.375 and α = 2.0 .
Fractalfract 07 00581 g004
Figure 5. Comparison of exponents calculated for the empirical and randomized time series. All rows have 10 values corresponding to the stocks investigated in this contribution. Scaling parameters are labeled as defined in the text and Table 2.
Figure 5. Comparison of exponents calculated for the empirical and randomized time series. All rows have 10 values corresponding to the stocks investigated in this contribution. Scaling parameters are labeled as defined in the text and Table 2.
Fractalfract 07 00581 g005
Figure 6. Sample auto-codifference of the NFLX time series, Equation (9). In the left sub-figure auto-codifference of the empirical time series Y L ( i ) , the collar curves represent five daily series, and th eblack lines are the best fit by C τ γ . In the right sub-figure: red represents the auto-codifference averaged over five daily empirical NFLX time series Y ( i ) ; black represents the auto-codifference averaged over five realizations of model series Y ( i ) ; and green represents the auto-codifference averaged over five realizations of model limit volume series Y L ( i ) .
Figure 6. Sample auto-codifference of the NFLX time series, Equation (9). In the left sub-figure auto-codifference of the empirical time series Y L ( i ) , the collar curves represent five daily series, and th eblack lines are the best fit by C τ γ . In the right sub-figure: red represents the auto-codifference averaged over five daily empirical NFLX time series Y ( i ) ; black represents the auto-codifference averaged over five realizations of model series Y ( i ) ; and green represents the auto-codifference averaged over five realizations of model limit volume series Y L ( i ) .
Fractalfract 07 00581 g006
Table 1. Parameters of PMF (3) { λ , q } calculated using MLE for the cancellation times of ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Data for each MLE calculation is joined from 21 trading day of August 2020.
Table 1. Parameters of PMF (3) { λ , q } calculated using MLE for the cancellation times of ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Data for each MLE calculation is joined from 21 trading day of August 2020.
Exp.NVDAHDAMZNNFLXMALLYTSLAADBEVJNJ
λ 0.38 0.22 0.41 0.42 0.17 0.15 0.4 0.5 0.19 0.14
q 1.51 1.48 1.52 1.5 1.45 1.46 1.54 1.52 1.5 1.47
Table 2. The MSD and Hurst’s exponents calculated using the same equations as in [25] for the order disbalance X ( j ) , Equation (7), of the ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Evaluated exponents are listed in the first column as follows: λ M S D is the exponent of sample MSD; λ R v is the exponent for the series X R v ( j ) with a randomized sequence of volumes; H A V is H of series X ( j ) evaluated using AVE; H A V R v is the same for the series X R v ( j ) with a randomized sequence of volumes; H A V R is the same for the series X R ( j ) with a randomized sequence of Y ( j ) ; H H i g is H of series X ( j ) evaluated using Higuchi’s method; H H i g R v is the same for the series X R v ( j ) with a randomized sequence of volumes; H H i g R is the same for the series X R ( j ) with a randomized sequence of Y ( j ) .
Table 2. The MSD and Hurst’s exponents calculated using the same equations as in [25] for the order disbalance X ( j ) , Equation (7), of the ten stocks NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Evaluated exponents are listed in the first column as follows: λ M S D is the exponent of sample MSD; λ R v is the exponent for the series X R v ( j ) with a randomized sequence of volumes; H A V is H of series X ( j ) evaluated using AVE; H A V R v is the same for the series X R v ( j ) with a randomized sequence of volumes; H A V R is the same for the series X R ( j ) with a randomized sequence of Y ( j ) ; H H i g is H of series X ( j ) evaluated using Higuchi’s method; H H i g R v is the same for the series X R v ( j ) with a randomized sequence of volumes; H H i g R is the same for the series X R ( j ) with a randomized sequence of Y ( j ) .
Exp.NVDA  HDAMZNNFLX MA LLYTSLAADBE  V JNJ
λ M S D 0.82 1.01 0.79 0.86 0.92 1.05 0.89 0.88 1.09 1.10
λ R v 0.90 0.93 0.90 0.88 0.99 0.94 0.90 0.86 0.91 0.94
H A V 0.32 0.21 0.19 0.21 0.17 0.2 0.26 0.26 0.30 0.31
H A V R v 0.22 0.16 0.18 0.18 018 0.18 0.22 0.18 0.22 0.21
H A V R 0.54 0.49 0.54 0.51 0.48 0.51 0.50 0.50 0.50 0.49
H H i g 0.33 0.22 0.22 0.24 0.2 0.22 0.28 0.28 0.33 0.32
H H i g R v 0.22 0.18 0.20 0.18 0.18 0.18 0.22 0.29 0.23 0.21
H H i g R 0.54 0.49 0.50 0.51 0.49 0.50 0.51 0.50 0.53 0.49
Table 3. Evaluated memory parameter d for the limit order flow time series X L ( j ) , as defined in Equation (8), for ten stocks: NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Three different methods are used for evaluation, listed in the first column: d M S D is evaluated from the exponent of sample MSD, λ M S D = 2 d M S D + 1 ; d C D is evaluated from the sample auto-codifference, as defined in Equation (9); d H is evaluated as H A V E H A V E R from the sample series X L ( j ) and X L R ( j ) using AVE. The last line presents the empirical values of the stability parameter α obtained by fitting the tails of volume histograms with the L’evy stable distribution.
Table 3. Evaluated memory parameter d for the limit order flow time series X L ( j ) , as defined in Equation (8), for ten stocks: NVDA, HD, AMZN, NFLX, MA, LLY, TSLA, ADBE, V, and JNJ. Three different methods are used for evaluation, listed in the first column: d M S D is evaluated from the exponent of sample MSD, λ M S D = 2 d M S D + 1 ; d C D is evaluated from the sample auto-codifference, as defined in Equation (9); d H is evaluated as H A V E H A V E R from the sample series X L ( j ) and X L R ( j ) using AVE. The last line presents the empirical values of the stability parameter α obtained by fitting the tails of volume histograms with the L’evy stable distribution.
Exp.NVDA  HDAMZNNFLX MA LLYTSLAADBE  V  JNJ
d M S D 0.30 0.18 0.32 0.29 0.18 0.14 0.28 0.28 0.20 0.20
d C D 0.29 0.23 0.30 0.29 0.19 0.20 0.34 0.30 0.26 0.25
d H 0.32 0.21 0.38 0.32 0.21 0.15 0.35 0.34 0.20 0.19
α 1.80 1.80 1.79 1.80 1.80 1.80 1.80 1.79 1.79 1.79
Table 4. Sample series evaluated MSD and Hurst exponents, along with memory parameter d C D , for the artificially generated series and the empirical series of stocks MA, NFLX, and TSLA. λ ( X ) represents the sample MSD exponent evaluated for the series X ( j ) ; λ ( X L ) is the same for the series of limit volumes; H ( X ) is the Hurst exponent evaluated using AVE for the series X ( j ) ; H ( X R ) is the same for the reshuffled series X R ( J ) ; H ( X L ) is the same for the series of limit volumes X L ( j ) ; H ( X L R ) is the same for the reshuffled limit volume series X L R ( j ) ; and d C D is the memory parameter evaluated from the sample auto-codifference of series Y ( i ) .
Table 4. Sample series evaluated MSD and Hurst exponents, along with memory parameter d C D , for the artificially generated series and the empirical series of stocks MA, NFLX, and TSLA. λ ( X ) represents the sample MSD exponent evaluated for the series X ( j ) ; λ ( X L ) is the same for the series of limit volumes; H ( X ) is the Hurst exponent evaluated using AVE for the series X ( j ) ; H ( X R ) is the same for the reshuffled series X R ( J ) ; H ( X L ) is the same for the series of limit volumes X L ( j ) ; H ( X L R ) is the same for the reshuffled limit volume series X L R ( j ) ; and d C D is the memory parameter evaluated from the sample auto-codifference of series Y ( i ) .
Series λ ( X ) λ ( X v ) H ( X ) H ( X R ) H ( X v ) H ( X LR ) d CD
Model d = 0.2 0.97 1.36 0.20 0.55 0.72 0.54 0.18
MA 0.92 1.37 0.17 0.48 0.71 0.50 0.19
Model d = 0.3 1.00 1.60 0.24 0.54 0.83 0.55 0.22
NFLX 0.86 1.58 0.21 0.51 0.83 0.51 0.29
TSLA 0.89 1.56 0.26 0.50 0.87 0.52 0.34
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gontis, V. Discrete q-Exponential Limit Order Cancellation Time Distribution. Fractal Fract. 2023, 7, 581. https://doi.org/10.3390/fractalfract7080581

AMA Style

Gontis V. Discrete q-Exponential Limit Order Cancellation Time Distribution. Fractal and Fractional. 2023; 7(8):581. https://doi.org/10.3390/fractalfract7080581

Chicago/Turabian Style

Gontis, Vygintas. 2023. "Discrete q-Exponential Limit Order Cancellation Time Distribution" Fractal and Fractional 7, no. 8: 581. https://doi.org/10.3390/fractalfract7080581

APA Style

Gontis, V. (2023). Discrete q-Exponential Limit Order Cancellation Time Distribution. Fractal and Fractional, 7(8), 581. https://doi.org/10.3390/fractalfract7080581

Article Metrics

Back to TopTop