Next Article in Journal
Supervised Deep Learning Techniques for Image Description: A Systematic Review
Next Article in Special Issue
Sample Size Calculations in Simple Linear Regression: A New Approach
Previous Article in Journal
Estimating the Number of Communities in Weighted Networks
Previous Article in Special Issue
A Robust and High-Dimensional Clustering Algorithm Based on Feature Weight and Entropy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Change-Point Detection for Multi-Way Tensor-Based Frameworks

1
School of Statistics, Tianjin University of Finance and Economics, Tianjin 300222, China
2
Department of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, Canada
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(4), 552; https://doi.org/10.3390/e25040552
Submission received: 8 February 2023 / Revised: 20 March 2023 / Accepted: 20 March 2023 / Published: 23 March 2023
(This article belongs to the Special Issue Statistical Methods for Modeling High-Dimensional and Complex Data)

Abstract

:
Graph-based change-point detection methods are often applied due to their advantages for using high-dimensional data. Most applications focus on extracting effective information of objects while ignoring their main features. However, in some applications, one may be interested in detecting objects with different features, such as color. Therefore, we propose a general graph-based change-point detection method under the multi-way tensor framework, aimed at detecting objects with different features that change in the distribution of one or more slices. Furthermore, considering that recorded tensor sequences may be vulnerable to natural disturbances, such as lighting in images or videos, we propose an improved method incorporating histogram equalization techniques to improve detection efficiency. Finally, through simulations and real data analysis, we show that the proposed methods achieve higher efficiency in detecting change-points.

1. Introduction

Videos or images are common in real life. Examples include images of traffic lights to regulate traffic, videos recorded by electronic traffic cameras to track vehicles that violate regulations, and webcams in smart agriculture to monitor the ripening stage of vegetables or fruits and automatically estimate the harvest time. Videos or images include huge amounts of information, but not all are useful. Therefore, extracting useful information from videos or a sequence of images, which can be regarded as a change-point detection problem, is important. The authors of [1] proposed a distribution-free, consistent graph-based change-point detection method for recorded videos tracking cell division. After that, a non-Euclidean graph-based change-point test was proposed by [2] as an effective way to reduce random interference, and applied to detecting both landing and departure times in a sequence of bees’ flower visitation data.
Digital videos or images are special cases of multi-way tensors. A r-way tensor can be expressed as X R p 1 × × p r , where R denotes the real number line and p represents the dimension of mode- of X , = 1 , , r . The elements of the tensor X can be accessed using r indices as in x i 1 , , i r , varying i from 1 to p , = 1 , , r . A slice refers to a tensor subarray with one fixed index. For example, X i r = { x · , , · , r } is defined to be the i r -th slice of X along mode r (abbreviated as the i r -th mode-r slice), i r = 1 , , p r . Thus, X contains r different sets of slices. In modern statistical applications, tensors are commonly encountered in many fields. A color image can be represented as a three-way tensor, i.e., R p 1 × p 2 × 3 , where { x · , · , i 3 } , i 3 = 1 , 2 , 3 , are mode-3 slices, the combination of which determines the color of an image. Nevertheless, both [1,2] converted three-way tensors to one-way tensors by taking the average of R , G , B mode-3 slices, which neglects the object’s color information.
In real life, one may also be interested in detecting targets with different colors. For example, detecting changes in the color of traffic lights may help daltonians reduce the limitations of visual impairment and be able to pass through traffic lights normally. Figure 1 shows a sequence of traffic light images containing different colors, such as green ( t = 1 , , 10 ), yellow ( t = 11 , , 14 ), and red ( t = 15 , , 27 ). Traffic light colors are dominated by R , G , B mode-3 slices, not their average. In addition, detecting a change in the color of fruits or vegetables may help monitor their maturity stages and estimate their harvest time, which is important for building automated harvesting systems. For example, the ripening process of tomato fruits can be categorized into three stages: mature green (MG), breaker (BR), and light red (LR) [3,4]. Figure 2 displays 26 frames of these three stages, i.e., MG ( t = 1 , , 8 ) , BR ( t = 9 , , 19 ) , and LR ( t = 20 , , 26 ) , which are extracted from a video that recorded the tomato fruits’ maturation process from MG to LR. However, the pixel values of images or video frames are sensitive to light intensity. Figure 3 presents the same sequence of 26 frames, but the light intensity changes at t = 5 , 10 , 17 , 21 . All these disturbances make change-point detection more challenging.
Inspired by the above examples, we were interested in developing a change-point detection method for the multi-way tensor-based framework, which we present in this paper. Suppose an observation of an object can be represented as a sequence of r-way tensors. The features of an object are dominated by one or a few slices. One may encounter a situation where the distribution of one or some mode- slices suddenly changes, = 1 , , r , say, mode-r slices. If so, taking the average over all mode-r slices may mask changes, such that some features of the object are no longer clearly visible. Following the graph-based change-point detection method in [2], we propose two detection methods based on the multi-way tensor framework. One is directly based on the elements of the slices, and the other is based on the histograms of the slices to reduce the influence of natural disturbances, such as lighting, in the image. Change-points occur when the value of some important dominant slices change suddenly. Without loss of generality, we take a three-way tensor as an example to intuitively illustrate the performance of the proposed method. We show through simulations and real-data analysis that the proposed methods are more capable than the method in [2] regardless of whether the sequence of tensors is affected by natural disturbances.
The main contributions of this study are summarized. First, to the best of our knowledge, few studies have addressed the problem of identifying temporal change-points in sequences of tensors. Therefore, we propose a graph-based change-point detection method to identify temporal changes under the framework of multi-way tensors. Second, we propose maximal and histogram-based edge weights to build graphs. The latter aims to downplay or remove natural distractions, such as lighting, in an image. Third, we conduct simulation studies in different scenarios with equal or unequal variances of the slices, with or without natural disturbances. Our simulation results demonstrate the robustness and efficiency of the proposed methods in change-point detection. Finally, our proposed methods are suitable for, but not limited to, detecting color changes of traffic lights and monitoring maturity stage changes in smart agriculture.

2. Methodology

2.1. Model Setup

Let X t be a sequence of r-way tensors, i.e., X t R p 1 × × p r , where p , = 1 , , r , are positive integers, and t = 1 , , T . X t contains p mode- slices, denoted as X i , t ( ) R p 1 × × p 1 × p + 1 × × p r , where the superscript denotes different modes, = 1 , , r , and the subscript i denotes the i -th slice along mode , i = 1 , , p . For the collection of tensors, assume that a time point t * divides the observations into two segments such that X t in two segments are significantly different, whereby the mode- slices characterize the difference. Then t * is a change-point. For example, the color of a colorful image is dominated by mode-3 slices. A change of any mode-3 slice will result in a change in the color of this image. Generally, for the mode- slices, denote S = { 1 , , p } . Let T t ( ) and K t ( ) be subsets of S for t = 1 , , t * and t = t * + 1 , , T , respectively. Note that K t ( ) = for t = 1 , , t * . Without loss of generality, make = r . Define j = { j 1 , , j r 1 } R r 1 , an index label vector, where j = 1 , , p with = 1 , , r 1 . Denote J = { 1 , , p 1 } × × { 1 , , p r 1 } . Let x j , m , t ( r ) be the elements of X m , t ( r ) at time t, j J and m { 1 , , p r } . Suppose that the object area of these slices is represented as J 1 , t ( r ) J at time t. We use the following model to illustrate the time series with a change-point at time t * :
x j , m , t ( r ) = μ m , 1 ( r ) + σ m , 1 ( r ) ϵ , j J 1 , t ( r ) , m T t ( r ) , m K t ( r ) , μ m , 2 ( r ) + σ m , 2 ( r ) ϵ , j J 1 , t ( r ) , m K t ( r ) ,
where μ m , i ( r ) , σ m , i ( r ) , i = 1 , 2 , are unknown parameters, ϵ is an independently identically distributed (iid) random error, with a mean of zero and a variance of one. The first part in model (1) corresponds to the mode-r slices with no change (initial state). The second part shows significant changes among these mode-r slices for t > t * . For simplicity, we drop the suffix r in (1). Then, model (1) becomes
x j , m , t = μ m , 1 + σ m , 1 ϵ , j J 1 , t , m T t , m K t , μ m , 2 + σ m , 2 ϵ , j J 1 , t , m K t .
If there exists a change-point t * , similar to [2], detecting the change-point of (1) is equivalent to testing the following hypothesis:
H 0 : K t = , t = 1 , , T vs H a : K t = , t = 1 , , t * , K t , t = t * + 1 , , T .
Suppose that a time point t divides the observations into two segments, ‘past’ ( t ) and ‘future’ ( > t ), t = 1 , , T . In light of [2,5], we define the number of edges that connect points from the ‘past’ and ‘future’ segments over a graph G , for any t, and ( t 1 , t 2 ) E ( G ( ω ) ) ,
C t G ( ω ) = ( t 1 , t 2 ) E ( G ( ω ) ) I { I { t 1 > t } I { t 2 > t } } ,
where I { A } is an indicator function that is equal to 1 if A is true, and 0 otherwise; and G is a connected, edge-weighted undirected graph with vertex set { 1 , , T } and edge set E ( G ) . Many ways can be used to construct such a graph, such as minimum spanning tree (MST), minimum distance pairing (MDP), and shortest Hamiltonian path (SHP) [1,2,5]. Nevertheless, not all of them perform well in a graph-based hypothesis test. As is shown in [1], a test based on MST performed poorly in terms of power for a large dimension d. Moreover, Chen and Zhang [5] argue that MDP achieves the desirable property of being truly distribution-free. However, Biswas et al. [6] show that the generalized run test using the SHP is distribution-free and consistent as a dimension d goes to infinity. In our study, we thus use the graph G that is obtained using SHP, denoted as SHP ( ω ) . Small values of C t G ( ω ) are against the null hypothesis H 0 in (3), which is equivalent to constructing the graph that attains the minimum of the sum of edge weights among all of the spanning paths. Therefore, it is key to establish suitable edge weights ω t 1 , t 2 between any two nodes t 1 , t 2 in the construction of graph G . We propose two edge weights, given in (7) or (8) of the following two subsections.
To test the null hypothesis, an SHP-based test was proposed in [1], with the test statistic
S T SHP ( ω ) = 1 T 1 t = 1 T 1 Z t SHP ( ω ) 2 ,
where Z t SHP ( ω ) is the standardization of C t SHP ( ω ) , Z t SHP ( ω ) = ( C t SHP ( ω ) E 0 ) / var 0 , E 0 = 2 t ( T t ) / T ; and var 0 = 2 t ( T t ) { 2 t ( T t ) T } / ( T 3 T 2 ) . We reject the null hypothesis at the significant level α if the statistic S T SHP ( ω ) is larger than the simulated critical value. When rejecting H 0 , the change-point estimate is given by
arg min 1 t T C t SHP ( ω ) / { t ( T t ) } .
Under the null hypothesis, the distribution of S T SHP ( ω ) is distribution-free. The critical values are calculated via the permutation method with 100,000 replications based on various d and significance levels α . The details are given in the Appendix A.

2.2. Maximum Edge Weight

As mentioned earlier, it is possible to encounter a mode-r slice change, such as the first mode-r slice. Taking the average of all mode-r slices may mask the importance of the first slice and thus the variation in important features of the object. For example, when a study focuses on detecting time points of color change in a sequence of images, averaging three mode-3 slices removes the color features of these images. Undoubtedly, the method of [2] will fail to detect it. Inspired by the problem of detecting color changes, we thus propose maximum edge weights to enforce the importance of considering the mode-r slices.
Let E ( G ) be a set of edges of the undirected edge-weighted graph, the nodes of which are { 1 , , T } . The maximum edge weights of nodes t 1 , t 2 are defined as
ω t 1 , t 2 = max m { 1 , , p r } j J ( x j , m , t 1 x j , m , t 2 ) ,
where j J means that j { 1 , , p } for = 1 , , r 1 . The methods based on the maximum edge weights and the non-Euclidean edge weights of [2] are named Maxima-SHP and nonE-SHP, respectively.

2.3. Histogram-Based Edge Weight

Both the nonE-SHP and Maxima-SHP are directly based on the differences between tensors’ elements, which can easily be affected by some natural factors. For example, the pixel values of the video’s frames are sensitive to light intensity, which makes the video’s frames or images either over- or under-exposed, resulting in a narrow range of intensity values of pixels. In such a case, using the method of nonE-SHP or Maxima-SHP directly may fail to detect the underlying true change-points. Histogram equalization is useful in images where the background and foreground are both light and dark, by effectively spreading highly populated intensity values. Motivated by the histogram equalization, we propose a new weight based on the histograms of all elements of the tensors rather than all elements directly. For the tensor X m , t , let f m , h i , t be the empirical probability of occurrence of intensity level h i in the m-th mode-r slice ( m = 1 , , p r ), where h i , i = 1 , , L , denote the intensity levels, and L is the number of possible intensity levels (e.g., L = 256 for an 8-bit image). We propose a histogram-based edge weight of nodes t 1 , t 2 ,
ω t 1 , t 2 = max k = 1 , , L m { 1 , , p r } L i = 1 k ( f m , h i , t 1 f m , h i , t 2 ) .
The weight is in line with the histogram equalization technique that can dilute or remove the effects of disturbances, such as the effects of light intensity on images or videos. The method using the histogram-based edge weight is named HistEq-SHP.

3. Simulations

3.1. Comparative Study of the Performance of NonE-SHP and Maxima-SHP

We generated three-way tensor samples, X t R p 1 × p 2 × 3 , using model (2). Under the null hypothesis, the model’s settings are μ m , 1 = 0 , σ m , 1 = 1 , ϵ iid N ( 0 , 1 ) for m = 1 , 2 , 3 , and d = p 1 p 2 = 5 , 10, 100, 500, 1000, 5000, 10,000, 50,000. We carried out 10,000 simulations with α = 0.05 and T = 20, 40, 60, 80, 100, 200. Table 1 displays the estimated type I errors via nonE-SHP and the proposed Maxima-SHP. All the estimated type I errors are around 0.05, consistent with the significance level α = 0.05.
Additionally, we examined the powers of Maxima-SHP and nonE-SHP under the alternative hypothesis. For simplicity, we also considered three-way tensor samples generated using model (2) with r = 3 . Suppose that t * is the preset change-point, and K t = { 1 , 3 } for t > t * , indicating that the distribution of the first and third mode-3 slices X 1 , t change after t * . When t t * , for the third mode-3 slice μ 3 , 1 = 0.2 , whereas for the first and second mode-3 slices μ 1 , 1 = μ 2 , 1 = 0 . When t > t * , the distribution of the second mode-3 slice remains unchanged, whereas both distributions of the first and third mode-3 slices change, μ 1 , 2 = 0.2 and μ 3 , 2 = 0 . Furthermore, let σ m , 1 = σ m , 2 = 1 , ϵ iid N ( 0 , 1 ) for m = 1 , 2 , 3 . Figure 4 visually shows the generated three-way tensors for t t * and after t * , i.e., t > t * .
We performed the simulations via 10,000 repetitions at the significance level of 0.05. Figure 5 shows the estimated powers of Maxima-SHP and nonE-SHP by setting the change-point at T / 2 . Clearly, as the dimension d increases, the powers of nonE-SHP are all around 0.05, implying that it fails to reject the null hypothesis when the alternative hypothesis is true. However, the powers of Maxima-SHP tend to one for the whole T, empirically showing the strong power consistency. The power consistency of Maxima-SHP is mainly affected by the dimension d. Figure 6 shows similar results as Figure 5 when μ 3 , 1 = μ 1 , 2 = 0.3 and keeping the other values unchanged. We conducted two more simulations by setting the location of change-point at T / 4 . The results are given in the Appendix B.
Additionally, we carried out other simulations under the scenario of unequal variances for different slices. The variance of the second mode-3 slice was equal to 1.5, whereas the other settings were the same as above. Table 2 shows that the estimated type I errors are still around 0.05 under the null hypothesis for both Maxima-SHP and nonE-SHP. Figure 7 gives the estimated powers under the alternative hypothesis. These results indicate that the proposed Maxima-SHP still outperforms the nonE-SHP, whether the variances of the slices are equal or not.

3.2. Comparative Study of the Performance of NonE-SHP and HistEq-SHP

In order to investigate the performance of HistEq-SHP, we first performed change-point detection by generating a sequence of synthetic gray images (r = 2) under different T and d. Figure 8 is an example of T = 40, d = 10,000, r = 2. In Figure 8, we depict images constructed using a normal distribution with a mean of 0.8 and a standard deviation of 0.05 for t = 1 , , 9 (see Figure 8a ), and using a normal distribution with a mean of 0.5 and a standard deviation of 0.05 for t = 10 , , 19 (see Figure 8b). Furthermore, we added an object, the shape of a butterfly, to the images constructed using the normal distribution with a mean of 0.5 and a standard deviation of 0.05 for t = 20 , , 29 (see Figure 8c). Finally, the illustration of the image in Figure 8d was generated on the basis of Figure 8a for t = 30 , , 40 . The time point t = 19 is the change-point. Under various T and d, we calculated the empirical powers based on 10,000 repetitions. We remark that we preprocessed the images using the histogram equalization technique before using the nonE-SHP method. Table 3 displays the powers of HistEq-SHP and nonE-SHP, which shows that the powers of both methods are ones when d = 5000 and 10,000 regardless of T. For d = 50,000, the powers of HistEq-SHP are ones, but those of the nonE-SHP method are close to one, implying the proposed method outperforms the nonE-SHP method for high dimension d.
We further examined the performance of these methods on three-way tensors by extending gray images to color images. As we discussed in the introduction, lighting can easily affect images, making change-point detection even more challenging. We performed another simulation to check the stability of the proposed methods by generating synthetic images under different light intensities. Figure 9 displays four types of images: Figure 9a is generated from a normal distribution with a mean of 0.8 and a standard deviation of 0.05 for three mode-3 slices, and a red object is added; Figure 9b is generated in the same way as in Figure 9a except that the mean of the normal distribution is 0.5; Figure 9c,d are generated in the same way as in Figure 9b and Figure 9a, respectively, except that blue targets replaced the red ones. Note that Figure 9a,d are in the same light intensity, and so are Figure 9b,c. It is obvious that the time change-point is at t * = 19 . Table 4 gives the powers of nonE-SHP and HistEq-SHP. It shows that the powers of nonE-SHP are close to 0.05 in all cases, and, hence, the method failed to detect the real change-point, i.e., where the colors of the object are not the same, and the images receive different light intensities. On the contrary, the powers of HistEq-SHP are all ones. The results demonstrate that although the images are affected by different light intensities, the proposed HistEq-SHP method can successfully identify the true change-point.

4. Case Studies

4.1. Example 1: Color Change in a Traffic Light

Our proposed method has important implications in real life. An application that recognizes the color change of traffic lights may help daltonians reduce the limitations of their visual impairment and allow them to navigate traffic light intersections. We extracted 27 frames per second from a short video (27 s) taken at a traffic light intersection, recording the entire traffic light transition from green to red. We cropped each frame to size 73 × 40 . Figure 1 displays three frames containing different colors of the traffic light, green ( t = 1 , , 10 ), yellow ( t = 11 , , 14 ), and red ( t = 15 , , 27 ). Clearly, based on the hypothesis test in (3), two change-points exist: t * = 10 and t * = 14 . In this case, we aim to detect the moments when the traffic light has changed its color, e.g., t * = 10 and 14.
We transformed the three-way tensor of the R,G,B slices into three vectors and normalized the pixel values. According to Equation (7), we extracted the maximum edge weights of the three color slices to construct SHP ( ω ) . The test statistic S T SHP ( ω ) in Equation (5) was 10.5787, larger than the corresponding critical value 2.1491 ( T = 27 , d = 2920 ), leading to the rejection of the null hypothesis at the significance level 0.05. The estimated location of the change-point was t * = 14 . To further test if other change-points exist, we considered the two segments: { X t , t = 1 , , 13 } (Segment I) and { X t , t = 15 , , 27 } (Segment II). Following the same procedure as above, the second change-point was estimated at t * = 10 in Segment I, whereas there was no change-point in Segment II. The estimated results are given in Figure 10 (upper panel), which agrees with the real situation.
To compare Maxima-SHP with nonE-SHP, we also constructed the SHP ( ω ) via the non-Euclidean weight given by [2]. By following the same procedure, although the ratio cut was minimized at location 19, the statistic S T SHP ( ω ) was 0.6278, less than the critical value, resulting in failing to reject the null hypothesis. The results are given in Figure 10 (lower panel). As expected, the nonE-SHP failed to detect the real change-points in such a scenario because it disregarded the main feature in three slices in constructing its non-Euclidean weights.

4.2. Example 2: Tomato Fruit Color Change

Nowadays, automatic harvesting systems have become a powerful technique for smart agriculture [7]. Monitoring the maturity stages of fruits or vegetables and estimating their harvest time are important steps in building an automatic harvesting system. Based on video recordings of fruit’s or vegetable’s maturation processes, the problem of monitoring the maturity stages and estimating the harvest times of fruit or vegetable can be transformed into a change-point detection problem.
Figure 2 shows three stages of the ripening process of the tomato fruit, MG ( t = 1 , , 8 ) , BR ( t = 9 , , 19 ) , and LR ( t = 20 , , 26 ) , containing 26 frames extracted from a video. Each of them was cropped to size 149 × 149. Obviously, t * = 8 and t * = 19 are considered as the change-points. Applying the change-point detection methods of HistEq-SHP and nonE-SHP, the results are displayed in Figure 11. As expected, both methods successfully detected the real change-points.
However, videos or images are easily affected by light intensity, which makes change-point detection more challenging. Here, we present another example to illustrate that our proposed method is more robust than the nonE-SHP when the recorded videos or frames are recorded in different light intensities. We darkened the frames of t = 6 , , 10 and lightened the frames of t = 18 , , 21 , shown in Figure 3. Note that the change-points are still at t * = 8 and t * = 19 , the detection of which is easily affected by the false change-points t = 5 , t = 10 , t = 17 , and t = 21 , where the light intensity changes. By applying nonE-SHP, three false change-points t * = 5 , t * = 10 , and t * = 17 were detected (see the lower panel of Figure 12). In contrast, the proposed HistEq-SHP detected two change-points at t * = 8 and t * = 19 (see the upper panel of Figure 12), consistent with the real ones. The nonE-SHP failed to detect the real change-points when the videos or frames were recorded in different light intensities. However, our proposed HistEq-SHP remained unaffected by disturbance(s) and successfully detected the real change-points.

5. Conclusions

Nowadays, videos or images are ubiquitously used in real life. This paper proposes a general class of graph-based change-point detection methods for checking whether objects with different characteristics and sequences of tensors are affected by natural disturbances. Simulation studies empirically showed the power consistency of the proposed methods. Furthermore, the proposed methods are shown to detect changes in the color of traffic lights at intersections and monitor changes in fruit ripening stages in smart farming. All numerical studies demonstrated the effectiveness of the proposed methods.

Author Contributions

Conceptualization, Y.W. and S.Q.; methodology, Y.W. and S.Q.; software, S.Q.; validation, S.Q. and G.Z.; formal analysis, S.Q.; investigation, S.Q.; resources, S.Q.; data curation, S.Q. and G.Z.; writing—original draft preparation, S.Q. and G.Z.; writing—review and editing, Y.W. and S.Q.; visualization, G.Z.; supervision, Y.W. and S.Q.; project administration, S.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Tianyuan Fund for Mathematics, National Natural Science Foundation of China (Grant No. 12226333), the National Natural Science Foundation of China (Grant No.12201454), and the Natural Science and Engineering Research Council of Canada (RGPIN-2017-05720).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Simulated Critical Values of Both Maxima-SHP and HistEq-SHP

Appendix A.1. Simulated Critical Values of Maxima-SHP

Under the null hypothesis, the distribution of S T SHP ( ω ) defined in (5) is distribution-free. The critical values are obtained from simulations via the permutation method with 100,000 replications based on various d and significance levels α . Table A1 lists the estimated critical values under the null hypothesis. We generated data using model (2) with K t = and T = 20 , r = 3 , μ m , 1 = 0 , σ m , 1 = 1 , ϵ iid N ( 0 , 1 ) for m = 1 , 2 , 3 . The significance levels mainly indicate changes in the critical values. There is little difference among the c α s in response to changing the dimension d and the length of tensor sequences T. Table A2Table A4 present the estimated c α by letting T = 60 , 100 , 200 , respectively.
Table A1. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 20 .
Table A1. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 20 .
d10%5%2.5%1%0.5%0.1%
101.74912.11632.48782.99043.37184.2241
501.75182.13212.49722.98713.37724.2879
1001.74912.12032.49902.99923.37524.2060
5001.75062.12542.50202.97993.33754.1512
10001.75372.11942.50273.00563.35624.2720
50001.75242.12002.48592.97233.34884.2147
10,0001.76582.13072.50273.00023.38754.2840
50,0001.75552.12692.49262.98293.36034.2390
Table A2. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 60 .
Table A2. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 60 .
d10%5%2.50%1%0.50%0.10%
101.77262.16022.57443.11983.56294.5688
501.76302.16172.57173.14583.58214.5475
1001.77902.18142.58773.15143.60004.6520
5001.76822.15642.55403.11613.52314.3641
10001.77442.16522.57853.13123.52264.6107
50001.78212.18582.59483.15663.58534.4944
10,0001.76072.14712.54833.09433.53044.4988
50,0001.76522.15522.55693.12533.54564.5816
Table A3. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 100.
Table A3. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 100.
d10%5%2.50%1%0.50%0.10%
101.76372.16592.55943.10313.51634.4318
501.76462.16122.54913.08393.50534.4586
1001.76812.16622.55093.10263.53704.5026
5001.77892.17912.57953.13613.55554.5226
10001.76672.17002.58233.13723.54144.5186
50001.77002.15962.56643.07973.50404.5572
10,0001.76132.15262.57113.12163.56114.5922
50,0001.76572.16852.56093.10983.50594.4325
Table A4. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 200.
Table A4. The estimated critical values ( c α ) of the Maxima-SHP based on different d and significance levels α for T = 200.
d10%5%2.50%1%0.50%0.10%
101.75842.15432.54253.09153.46454.5578
501.74762.12842.52393.05323.46794.4469
1001.75132.14042.54143.06833.50844.5517
5001.75262.14202.52453.02763.43494.3104
10001.75752.13402.53593.06813.47544.4134
50001.75782.14572.54283.09053.51534.5126
10,0001.75982.14892.54463.08793.48474.4418
50,0001.74762.13362.53433.05673.44724.4326

Appendix A.2. Simulated Critical Values for the HistEq-SHP

According to the intensity levels of the 8-bit image, let L equal 256. We replaced the weight in S T SHP ( ω ) with Equation (8) to obtain the critical values ( c α ) the same way. Table A5 lists the estimated critical values with T = 40 , d = 5000, 10,000, 50,000. Significant differences of c α are only evident among different α values. The critical values are slightly different as the value of d changes. Other settings of the critical values with T = 100 , d = 5000, 10,000, 50,000 are presented in Table A6, in which the same phenomena as in Table A5 can be observed.
Table A5. The estimated critical values ( c α ) of the HistEq-SHP based on different d and significance levels α for T = 40 .
Table A5. The estimated critical values ( c α ) of the HistEq-SHP based on different d and significance levels α for T = 40 .
d10%5%2.5%1%0.5%0.1%
50001.78572.17542.55713.11913.51164.6093
10,0001.77752.17042.56833.12373.54654.4774
50,0001.76802.15912.54333.08333.48944.4746
Table A6. The estimated critical values ( c α ) of the HistEq-SHP based on different d and significance levels α for T = 100.
Table A6. The estimated critical values ( c α ) of the HistEq-SHP based on different d and significance levels α for T = 100.
d10%5%2.50%1%0.50%0.10%
50001.78952.20132.61253.17433.64174.6668
10,0001.78272.18412.60333.15703.55954.5371
50,0001.77422.17552.59143.14283.59684.5675

Appendix B. Empirical Powers of Maxima-SHP and NonE-SHP

We examined the powers of Maxima-SHP and nonE-SHP under the alternative hypothesis by setting the change-point at T / 4 . Figure A1 and Figure A2 show similar results as Figure 5 and Figure 6, respectively.
Figure A1. Empirical powers of both Maxima-SHP and nonE-SHP based on 10,000 simulations with t * = T / 4 , T = 40 , 100 , 200 , μ 3 , 1 = 0.2 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.2 , μ 3 , 2 = 0 for t > t * .
Figure A1. Empirical powers of both Maxima-SHP and nonE-SHP based on 10,000 simulations with t * = T / 4 , T = 40 , 100 , 200 , μ 3 , 1 = 0.2 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.2 , μ 3 , 2 = 0 for t > t * .
Entropy 25 00552 g0a1
Figure A2. Empirical powers of both Maxima-SHP and nonE-SHP based on 10,000 simulations with t * = T / 4 , T = 40 , 100 , 200 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Figure A2. Empirical powers of both Maxima-SHP and nonE-SHP based on 10,000 simulations with t * = T / 4 , T = 40 , 100 , 200 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Entropy 25 00552 g0a2

References

  1. Shi, X.; Wu, Y.; Rao, C.R. Consistent and powerful graph-based change-point test for high-dimensional data. Proc. Natl. Acad. Sci. USA 2017, 114, 3873–3878. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Shi, X.; Wu, Y.; Rao, C.R. Consistent and powerful non-Euclidean graph-based change-point test with applications to segmenting random interfered video data. Proc. Natl. Acad. Sci. USA 2018, 115, 5914–5919. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Gonzalez, C.; Ré, M.D.; Sossi, M.L.; Valle, E.M.; Boggio, S.B. Tomato cv. ‘Micro-Tom’ as a model system to study postharvest chilling tolerance. Sci. Hortic. 2015, 253, 412–421. [Google Scholar] [CrossRef]
  4. Reissig, G.N.; Posso, D.A.; Borella, J.; da Silveira, R.V.D.; Rombaldi, C.V.; Bacarin, M.A. High MT-sHSP23.6 expression and moderate water deficit influence the antioxidant system in ‘Micro-Tom’ tomato fruit under hypoxia. Sci. Hortic. 2020, 242, 127–136. [Google Scholar] [CrossRef]
  5. Chen, H.; Zhang, N. Graph-Based Change-Point Detection. Ann. Stat. 2015, 43, 139–176. [Google Scholar] [CrossRef]
  6. Biswas, M.; Mukhopadhyay, M.; Ghosh, A.K. A distribution-free two-sample run test applicable to high-dimensional data. Biometrika 2014, 101, 913–926. [Google Scholar] [CrossRef]
  7. Fujinaga, T.; Yasukawa, S.; Ishii, K. Tomato growth state map for the automation of monitoring and harvesting. J. Robot. Mechatronics 2020, 32, 1279–1291. [Google Scholar] [CrossRef]
Figure 1. Three different colors of traffic lights: green (left panel, t = 1 , , 10 ), yellow (middle panel, t = 11 , , 14 ), and red (right panel, t = 15 , , 27 ).
Figure 1. Three different colors of traffic lights: green (left panel, t = 1 , , 10 ), yellow (middle panel, t = 11 , , 14 ), and red (right panel, t = 15 , , 27 ).
Entropy 25 00552 g001
Figure 2. Three ripening stages: MG (left panel, t = 1 , , 8 ), BR (middle panel, t = 9 , , 19 ), and LR (right panel, t = 20 , , 26 ).
Figure 2. Three ripening stages: MG (left panel, t = 1 , , 8 ), BR (middle panel, t = 9 , , 19 ), and LR (right panel, t = 20 , , 26 ).
Entropy 25 00552 g002
Figure 3. Three ripening stages where the light intensity changes at t = 5 , 10 , 17 , 21 : (a) t = 1 , , 5 , (b) t = 6 , , 8 , (c) t = 9 , 10 , (d) t = 11 , , 17 , (e) t = 18 , 19 , (f) t = 20 , 21 , (g) t = 22 , , 26 .
Figure 3. Three ripening stages where the light intensity changes at t = 5 , 10 , 17 , 21 : (a) t = 1 , , 5 , (b) t = 6 , , 8 , (c) t = 9 , 10 , (d) t = 11 , , 17 , (e) t = 18 , 19 , (f) t = 20 , 21 , (g) t = 22 , , 26 .
Entropy 25 00552 g003
Figure 4. Illustration of the generated three-way tensors for t t * and t > t * in the middle and on the right, respectively.
Figure 4. Illustration of the generated three-way tensors for t t * and t > t * in the middle and on the right, respectively.
Entropy 25 00552 g004
Figure 5. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations by setting equal slice variance; t * = T / 2 , T = 40 , 100 , 200 , μ 3 , 1 = 0.2 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.2 , μ 3 , 2 = 0 for t > t * .
Figure 5. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations by setting equal slice variance; t * = T / 2 , T = 40 , 100 , 200 , μ 3 , 1 = 0.2 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.2 , μ 3 , 2 = 0 for t > t * .
Entropy 25 00552 g005
Figure 6. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations by setting t * = T / 2 , T = 40 , 100 , 200 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Figure 6. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations by setting t * = T / 2 , T = 40 , 100 , 200 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Entropy 25 00552 g006
Figure 7. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations with unequal variances of slices by setting t * = T / 2 , T = 40 , 100 , 200 , σ 2 , 1 = σ 2 , 2 = 1.5 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Figure 7. Estimated powers of Maxima-SHP and nonE-SHP over 10,000 simulations with unequal variances of slices by setting t * = T / 2 , T = 40 , 100 , 200 , σ 2 , 1 = σ 2 , 2 = 1.5 , μ 3 , 1 = 0.3 , μ 1 , 1 = μ 2 , 1 = 0 for t t * ; and K t = { 1 , 3 } , μ 1 , 2 = 0.3 , μ 3 , 2 = 0 for t > t * .
Entropy 25 00552 g007
Figure 8. Illustrations of a sequence of synthetic gray images with T = 40, d = 10,000, r = 2 : (a) t = 1 , , 9 ; (b) t = 10 , , 19 ; (c) t = 20 , , 29 ; (d) t = 30 , , 40 .
Figure 8. Illustrations of a sequence of synthetic gray images with T = 40, d = 10,000, r = 2 : (a) t = 1 , , 9 ; (b) t = 10 , , 19 ; (c) t = 20 , , 29 ; (d) t = 30 , , 40 .
Entropy 25 00552 g008
Figure 9. Illustrations of a sequence of synthetic color images with T = 40, d = 10,000, r = 3: (a) t = 1 , , 9 ; (b) t = 10 , , 19 ; (c) t = 20 , , 29 ; (d) t = 30 , , 40 . (a,d) are in the same light intensity, and so are (b,c).
Figure 9. Illustrations of a sequence of synthetic color images with T = 40, d = 10,000, r = 3: (a) t = 1 , , 9 ; (b) t = 10 , , 19 ; (c) t = 20 , , 29 ; (d) t = 30 , , 40 . (a,d) are in the same light intensity, and so are (b,c).
Entropy 25 00552 g009
Figure 10. The change-point estimation using Maxima-SHP (upper panel) and nonE-SHP (lower panel).
Figure 10. The change-point estimation using Maxima-SHP (upper panel) and nonE-SHP (lower panel).
Entropy 25 00552 g010
Figure 11. Change-point detection via HistEq-SHP (upper panel) and nonE-SHP (lower panel).
Figure 11. Change-point detection via HistEq-SHP (upper panel) and nonE-SHP (lower panel).
Entropy 25 00552 g011
Figure 12. Change-points detected by HistEq-SHP (upper panel) and nonE-SHP (lower panel), light intensity changes at t = 5 , 10 , 17 , 21 .
Figure 12. Change-points detected by HistEq-SHP (upper panel) and nonE-SHP (lower panel), light intensity changes at t = 5 , 10 , 17 , 21 .
Entropy 25 00552 g012
Table 1. Comparison of type I error rates using nonE-SHP and Maxima-SHP based on various T and d.
Table 1. Comparison of type I error rates using nonE-SHP and Maxima-SHP based on various T and d.
TMethodd
10501005001000500010,00050,000
20Maxima-SHP0.05190.05050.04930.04950.05240.04950.05070.0513
nonE-SHP0.04460.05060.05020.05100.05350.04940.04960.0463
40Maxima-SHP0.05120.04950.05190.05000.05610.05030.04600.0531
nonE-SHP0.05290.04980.04840.04960.04780.05300.05020.0530
60Maxima-SHP0.05070.04910.04790.05080.04830.04770.04970.0497
nonE-SHP0.04870.04910.04890.05180.04600.05040.05070.0469
80Maxima-SHP0.04870.04950.05080.04790.05650.04680.05050.0514
nonE-SHP0.05220.05130.04930.05380.05220.04800.05190.0500
100Maxima-SHP0.04590.04980.04650.04760.05170.04810.04840.0484
nonE-SHP0.05220.05060.05210.04580.04970.04560.04960.0483
200Maxima-SHP0.04560.05070.04800.05140.05100.05130.05000.0540
nonE-SHP0.04760.05140.04730.04890.04660.05190.04340.0503
Table 2. Comparison of type I error rates of nonE-SHP and Maxima-SHP with unequal variances of slices, based on various T and d.
Table 2. Comparison of type I error rates of nonE-SHP and Maxima-SHP with unequal variances of slices, based on various T and d.
TMethodd
10501005001000500010,00050,000
20Maxima-SHP0.05320.04790.05370.04840.04640.05200.05160.0497
nonE-SHP0.04890.04680.05330.04710.05110.05250.04940.0468
40Maxima-SHP0.04750.05190.05010.04730.05160.04640.05330.0525
nonE-SHP0.04900.04950.05420.04780.05290.05000.04980.0527
60Maxima-SHP0.05180.05310.04960.04920.04760.05080.05320.0521
nonE-SHP0.04690.04710.04560.05140.05380.04640.05650.0487
80Maxima-SHP0.04850.04930.04620.04550.05300.04720.05040.0521
nonE-SHP0.04980.04760.04460.04870.05010.04810.04660.0504
100Maxima-SHP0.04950.04720.05140.05080.05340.04940.05300.0503
nonE-SHP0.05050.04960.04830.04410.05130.05040.04770.0534
200Maxima-SHP0.04890.05270.05370.05020.05210.04790.05430.0487
nonE-SHP0.04610.05280.05160.05230.05150.05100.05010.0462
Table 3. Comparison of the powers of nonE-SHP and HistEq-SHP based on gray images with various T and d.
Table 3. Comparison of the powers of nonE-SHP and HistEq-SHP based on gray images with various T and d.
TMethodd
500010,00050,000
40nonE-SHP1.00001.00000.9110
HistEq-SHP1.00001.00001.0000
100nonE-SHP1.00001.00000.9991
HistEq-SHP1.00001.00001.0000
Table 4. Comparison of the powers of nonE-SHP and HistEq-SHP based on color images with various T and d.
Table 4. Comparison of the powers of nonE-SHP and HistEq-SHP based on color images with various T and d.
TMethodd
500010,00050,000
40nonE-SHP0.05030.04850.0499
HistEq-SHP1.00001.00001.0000
100nonE-SHP0.05270.05070.0494
HistEq-SHP1.00001.00001.0000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qin, S.; Zhou, G.; Wu, Y. Change-Point Detection for Multi-Way Tensor-Based Frameworks. Entropy 2023, 25, 552. https://doi.org/10.3390/e25040552

AMA Style

Qin S, Zhou G, Wu Y. Change-Point Detection for Multi-Way Tensor-Based Frameworks. Entropy. 2023; 25(4):552. https://doi.org/10.3390/e25040552

Chicago/Turabian Style

Qin, Shanshan, Ge Zhou, and Yuehua Wu. 2023. "Change-Point Detection for Multi-Way Tensor-Based Frameworks" Entropy 25, no. 4: 552. https://doi.org/10.3390/e25040552

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop