1. Introduction
The issue of structural change has long been a pivotal research topic in statistics. Extensive empirical evidence from various domains, including finance, meteorology, medicine, and engineering, indicates that structural changes are ubiquitous in time series data. Particularly, the detection of variance change points has become a prominent research focus, garnering significant attention. In statistical applications, variance is frequently associated with risk. Over the past five decades, scholars have conducted extensive and in-depth studies on the identification and localization of variance change points, as evidenced by the relevant literature [
1,
2,
3,
4,
5,
6].
In the existing literature on change-point detection, it is frequently assumed that only the parameter under scrutiny undergoes a structural change while other parameters remain invariant. However, this assumption often fails to reflect real-world complexities. For example, Cavaliere [
7] examined the unit root test using least squares estimation and specifically analyzed the impact of heteroscedasticity on test outcomes. The results indicated that variance change points can substantially diminish the power of the unit root test, leading to biased conclusions. Furthermore, Dette et al. [
8] questioned the assumption of stationarity. They contended that concentrating exclusively on the stability of the parameter under examination while neglecting changes in other parameters is overly restrictive. Consequently, they integrated both variance change points and correlation coefficient change points into their analysis. These studies underscore the necessity of accounting for variance change points when other parameters are not constant. This paper primarily addresses the challenge of testing for variance change points in the presence of mean change points.
Among the existing research literature on variance change points and mean change points, most studies have primarily focused on scenarios where either variance change points or mean change points exist independently in time series data. However, relatively limited attention has been devoted to cases where both the variance and the mean of the model undergo changes. Long et al. [
9] integrated the quartile information of auxiliary variables with ranked set sampling (RSS) and extreme ranked set sampling (ERSS), and compared these approaches to the traditional simple random sampling (SRS) method. They demonstrated that the estimators derived from RSS and ERSS exhibited higher efficiency compared to those from SRS, thus enriching the research on ratio estimation. However, relatively limited attention has been devoted to cases where both the variance and the mean of the model undergo changes. Antoch et al. [
10] studied the application of the CUSUM test in the context of an average change-point model with independent errors. Bai [
11] investigated the application of the CUSUM test to the average change-point model within the context of a linear process. Inclan et al. [
12] performed a CUSUM test to detect variance change points in a model with independent normal errors. Gombay et al. [
13] conducted CUSUM tests for variance change-point models for weakly dependent errors. Some methods assume that other variables remain constant when monitoring a change point in specific variables; however, this assumption is often unrealistic in practical applications. Real-world data typically display intricate and dynamic characteristics, and both variance and mean can change concurrently as a result of multiple influencing factors. Therefore, the current methodologies exhibit significant limitations, because many approaches employed two separate statistical metrics to monitor these change points individually, which not only increased the complexity and computational burden of the monitoring process but may also result in inaccurate detection due to the interdependence between the two statistical metrics. Pitarakis [
14] applied the least squares method to estimate the variance and mean change points in regression models, focusing primarily on the consistency of estimation without addressing the convergence rate of change-point estimates or the influence of mean change points on variance change-point estimation. Yuan Fang [
15] et al. utilized cumulative sum statistics for the simultaneous estimation of variance and mean change points in independent sequences, providing a more practical approach compared to Pitarakis [
14]. Wang Huimin [
16] et al. extended this methodology to dependent cases, achieving the same convergence rate as Yuan Fang [
16] for independent sequences and validating the effectiveness of their change-point estimation through numerical simulations. Jin [
17,
18] et al. introduced an enhanced cumulative sum test based on residual sequences to detect variance change points in autoregressive processes with mean change points, effectively mitigating the impact of mean change points on variance change-point detection.
The aforementioned research primarily focuses on offline testing issues, specifically the detection of change points within static historical datasets. However, in practical applications, a continuous stream of new data is generated. To determine whether the new data can be adequately described by the existing model or if structural change points exist, it is crucial to establish online monitoring mechanisms. Aue et al. [
19] conducted a comprehensive analysis and investigation of the monitoring statistics for the mean change points in the RCA(1) time series model, and derived the asymptotic distribution of these statistics. Liu et al. [
20] proposed two monitoring statistics based on the cumulative sum of residuals (CUSUM) for detecting change points in mean vector of multivariate time series. Additionally, they introduced a window width parameter to construct CUSUM statistics for real-time online monitoring of mean change points. Horvath et al. [
21] and Aue et al. [
22] provided the moving sum (MOSUM) methodology for the online monitoring of change points within location models. To address the limitation of the traditional cumulative sum of squares test being relatively insensitive to variance changes in the later period, Qi and Tian [
23] introduced a monitoring procedure based on the cumulative sum of residual squares statistics. This method aimed to detect variance change points within the model and extend its applicability from independent and identically distributed errors to scenarios involving correlated errors. Zhao [
24] suggested a ratio test statistic for variance change-point detection within linear processes. Qin and You [
25] introduced a novel test statistic, which addressed the issue of low power in the ratio test for variance change points under certain conditions. Chen et al. [
26] introduced two ratio-type statistics designed for sequentially monitoring variance and mean change points in long-memory time series. Although these statistics effectively monitor variance or mean changes individually, they require two separate monitoring procedures to detect both types of change points, thereby increasing computational complexity and resource consumption. Consequently, this paper addresses the challenge of simultaneously monitoring variance and mean change points in an online setting without assuming that variance and mean change points must occur concurrently. This approach enhances the accuracy of capturing the dynamic characteristics of time series data.
This paper has contributions in the following three aspects. First, to address the limitation of the traditional sliding window method where the critical value fluctuates with the window length, a fixed-window sliding monitoring approach is proposed. This method can avoid the impact caused by the window length parameter, resulting in a consistent monitoring environment. It effectively mitigates the influence of window length variations on critical value settings and significantly improves the robustness and reliability of change point detection. Second, a modified ratio-type test statistic is constructed to simultaneously monitor for variance change points and mean change points. Compared to the conventional practice of separately constructing variance and mean test statistics for monitoring change points, this approach offers greater efficiency and simplicity. The asymptotic distribution and consistency of this statistic are also rigorously derived. Finally, an improved mixed change-point monitoring procedure is introduced. Simulation results demonstrate that this method can accurately distinguish between variance change points and mean change points.
4. Monte Carlo Simulation
This section assesses the effectiveness of value-type statistics in detecting structural change points via Monte Carlo numerical simulations. The data generation process is described as follows:
- (1)
When the mean change point precedes the variance change point,
- (2)
When the variance change point precedes the mean change point,
Without loss of generality, let the significance level be denoted as
, the monitoring sample size be
, the uncontaminated sample size be
, the window width be
, the position of the mean change point be
, the mean jump amplitude be
, the position of the variance change point be
, and the ratio of the standard deviations be
.
Table 1 provides the critical values for the test statistic
. It is evident from
Table 1 that the sample size of uncontaminated data has a relatively minor impact on the critical values and can be neglected. In contrast, both the sample size and the fixed window size significantly influence these critical values. Specifically, ① as the sample size increases, critical values decrease; ② as the fixed window size grows, critical values also decrease.
Table 2 illustrates the empirical significance level of the test statistic
at the 0.05 significance level, defined as the probability of rejecting the null hypothesis after 2000 repetitions of the experiment under the null hypothesis. It can be observed from the table that the significance level of the statistical quantity
consistently hovers around 0.05. When the monitoring sample size is small and the window width is large, the number of window frames becomes relatively limited, leading to potential distortion. Conversely, when the monitoring sample size is large and the window width is small, the number of window frames increases substantially, resulting in a more conservative outcome. It can be inferred from
Table 2 that empirical size relates to monitoring sample sizes, fixed window size, and uncontaminated sample sizes. To minimize deviations in empirical size, for small sample sizes, smaller window sizes and uncontaminated sample sizes should be used; for large sample sizes, larger window sizes should be employed while keeping smaller uncontaminated samples. This manifests specifically as follows: ① with an increase in monitoring sample size, there is a corresponding decrease in critical value; ② an increase in fixed window size results in a decrease in critical value.
Table 3 and
Table 4 present the empirical power of the statistic
under the hypotheses of mean breaks
and variance breaks
, respectively. The empirical power is defined as the rejection rate obtained from 2000 repeated trials under the alternative hypotheses. The following observations can be made from
Table 3: ① when the window width remains constant and the monitoring sample size increases, the empirical power also increases, because the expanded window length captures more samples, which aligns with the findings of Theorem 2; ② the presence or absence of contamination in the sample size does not significantly affect the empirical power; ③ as anticipated, an increase in the mean jump amplitude leads to a corresponding increase in the empirical power; ④ since a moving window frame was selected in this study, the rejection rate remains insensitive to the position of the change point within the monitoring area. Regardless of whether the change point appears at the front or back of the entire sample, the moving window frame will inevitably cover it. For instance, given the monitoring sample size
, the uncontaminated sample size
, the window width
, the jump amplitude
, and the mean change-point position
, the empirical potential is 0.8424; when the mean change point position
is given, the empirical potential is 0.8185; and when the mean change-point position
is given, the empirical potential is 0.8224. At this juncture, the empirical potential at the mean change-point position
is greater than that at the mean change-point position
; ⑤ additionally, as the change point moves further away, the average run length decreases. This phenomenon may be attributed to the increased sliding number of the moving window frame and the subsequent testing of more samples when the change point is located further back. This conclusion aligns with the common principles governing change-point detection programs.
When only a variance change point is present, the aforementioned patterns can similarly be inferred from
Table 4. For example, as the monitoring sample size increases, the empirical potential also increases; the sample size without contamination has minimal impact on the empirical potential; the rejection rate remains relatively insensitive to the position of the change point; and the average run length decreases when the change point occurs later in the sequence. Additionally, as the ratio of standard deviations increases, so does the empirical potential, indicating that the test statistic
is particularly effective for detecting variance change points. For example, when the monitoring sample size is
, the pollution-free sample size is
, the window width is
, the variance change-point position is
, and the ratio of standard deviations is
, the empirical potential equals 0.7255. When the ratio of standard deviations increases to
, the empirical potential rises to 0.8761. This comparison confirms that the empirical potential is greater at a higher ratio of standard deviations
than at a lower ratio
. Therefore, it can be concluded that the proposed statistics exhibit superior performance in detecting both mean change points and variance change points.
Next, we examine the scenario involving mixed breakpoints. There are three distinct configurations for the window frame in relation to these breakpoints: (1) it covers only the mean breakpoint; (2) it covers only the variance breakpoint; (3) it simultaneously encompasses both the mean and variance breakpoints. To conserve space, the subsequent numerical simulations present results exclusively for a sample size of
and a window width of
. Under the assumption that all other parameters remain constant, by comparing the findings in
Table 5 with those in
Table 3 and
Table 4, it becomes evident that ① the presence of a mean breakpoint induces a downward trend in the empirical power associated with the variance breakpoint. This decline is more pronounced when the magnitude of the mean jump is
. For instance, when the monitoring sample size is
, the mean breakpoint occurs at
with a jump magnitude of
, while the variance breakpoint is located at
, resulting in a standard deviation ratio of
and an empirical power of 0.3328. In contrast, when only the variance breakpoint exists, the empirical power is 0.7682. When all other conditions remain unchanged but the jump magnitude varies
, the empirical power adjusts to 0.5231. ② Under the condition that other factors remain constant, when the ratio of standard deviations is negative, the decrease in empirical potential becomes more pronounced. For instance, with a monitoring sample size of
, the mean change point located at
, a jump amplitude of
, the variance change point also at
, and a ratio of standard deviations of
, the empirical potential is 0.5231. In contrast, when only the variance change point exists, the empirical potential increases to 0.7760. Under identical conditions, when the ratio of standard deviations is
, the empirical potential is 0.7682; however, when only the variance change point exists, it further increases to 0.8834. ③ When the mean change point is positioned further back, and the ratio of standard deviations is
, the empirical potential decreases. This indicates that the presence of a mean change point weakens the test effect for the variance change point. Conversely, when the ratio of standard deviations is
, the empirical potential increases, suggesting that the mean change point enhances the test effect for the variance change point. For example, with a monitoring sample size of
, if the mean change point is located at
, with a jump amplitude of
, and the variance change point is at
, while the ratio of standard deviations is
, the empirical potential is 0.1916. However, in the absence of a mean change point, the empirical potential for only the variance change point is 0.6840. Under otherwise identical conditions, when the ratio of standard deviations is
, the empirical potential rises to 0.8824, compared to 0.8050 when only the variance change point exists.
In conclusion, the concurrent presence of mean change points and variance change points can lead to distorted test outcomes. Therefore, this paper proposes a method for accurately identifying these change points. The detailed results are presented in
Table 6 and
Table 7. From
Table 6, it is evident that the results and patterns after mean removal closely resemble those in
Table 4. However, compared with
Table 4, there remain some minor empirical potentials in
Table 6, primarily due to the imprecise estimation of the positions of mean change points. In contrast to
Table 5, the empirical potential in
Table 6 has significantly increased. For instance, in
Table 5, the position of the mean change point is
, the mean jump amplitude is
, the position of the variance change point is
, and the ratio of standard deviations is
, resulting in an empirical potential of 0.3328. Under otherwise identical conditions, the empirical potential obtained after mean removal is 0.7792. When only variance change points exist, the empirical potential is 0.7682. This indicates that the proposed method effectively mitigates the influence of mean change points on the detection of variance change points.
It can be observed from
Table 7 that after removing the variance and reintroducing the trend, the results and patterns closely resemble those in
Table 3. However, compared to
Table 3, there are still some minor empirical potential differences in
Table 7, primarily attributed to the underestimation of the variance change point. In contrast to
Table 5, the empirical potential has significantly increased. For instance, in
Table 5, the mean change point position is
, the mean jump amplitude is
, the variance change point position is
, and the ratio of the standard deviation is
, resulting in an empirical potential of 0.2647. Under otherwise identical conditions, after removing the variance and reintroducing the trend, the empirical potential increases to 0.8653. When only the mean change point exists, the empirical potential reaches 0.8840. These findings suggest that the proposed method effectively mitigates the impact of variance change points on the detection of mean change points.
By analyzing the aforementioned numerical simulation results, it is evident that the test outcomes become distorted when both mean change points and variance change points coexist. After correcting for the mean, an increased empirical potential value signifies the presence of variance change points; conversely, after correcting for the variance, an elevated empirical potential value indicates the existence of mean change points.
As shown in
Table 2, the empirical size is approximately aligned with the nominal level 5%, which demonstrates that the Type I error is well controlled. Furthermore,
Table 3,
Table 4,
Table 5,
Table 6 and
Table 7 reveal that the empirical power increases as a function of sample size, window width, jump amplitude, and standard deviation. For instance, in
Table 3, when the monitoring sample size is
, the window width is
, the jump amplitude is
, and the mean change point position is
, the empirical power is 0.8863. This indicates that the probability of a Type II error is low. Consequently, the test methodology introduced in this paper exhibits strong control over both Type I and Type II errors.
5. Applications
This section evaluates the effectiveness of the proposed method using the closing stock price data of IBM from 17 May 1961 to 2 November 1962, denoted as dataset
.
Figure 4 presents the scatter plot of the standardized dataset
derived from
, revealing a significant horizontal shift in the standardized sequence of these stock prices. This suggests the presence of a potential mean change point in the sequence.
Figure 5 illustrates the first-order difference plot of the data, indicating volatility in the stock prices, leading us to suspect a variance change point in the sequence. In this analysis, the first 200 data points are treated as unpolluted data, with a window width
. At a 5% significance level, we apply the variable change point monitoring method introduced in this paper, initiating the monitoring from the 201st observation to detect and classify any change points. (1) Dataset
is substituted into variable
, where
,
, to determine the estimated position
of the mean change point. ① The sequence is divided into two segments,
and
. The sample mean of segment
is calculated to be
, while that of segment
is
. The data in each segment are then adjusted by subtracting their respective sample means, resulting in a new merged sequence
. Upon re-monitoring, a change point is detected at the 238th sample. This change point may indicate either a variance change or both a variance and mean change. ② By computation, the sample variance of the first segment is
, and that of the second segment is
. After normalizing the data of both segments by their respective sample variances, we merged them into sequence
for further monitoring. During this monitoring process, no change point was detected at the 238th sample. Consequently, it can be concluded that a variance change point exists at this position. (2) dataset
was substituted into variable
, where
,
, to determine the estimated position
of the mean change point. The sequence was divided into two segments, designated as
and
. Calculations revealed that the sample mean of the first segment was
, while that of the second segment was
. The data from both segments were individually adjusted by subtracting their respective sample means, subsequently merged into a new sequence denoted as
, and re-monitored. It was observed that no change point was detected at the 253rd sample. Consequently, it was concluded that only a mean change point existed at this position. This finding aligns with the results reported in Reference [
31], demonstrating that the method proposed in this paper can not only effectively distinguish between mean change points and variance change points but also identify the type of change points more rapidly and accurately.