1. Introduction
Consider a second-order stationary sequence of stochastic processes
defined on a probability space
, having zero mean and covariance function
. For a given functional sample
consider the model:
where the function
is deterministic, but unobserved. Our main aim is to test the hypothesis:
with emphasis on the case of change-point detection, which corresponds to a piecewise-constant function
g with respect to the first argument.
This model covers a broad range of real-world problems, such as climate change detection, image analysis, the analysis of medical treatments (especially magnetic resonance images of brain activity), and speech recognition. Besides, the change-point detection model (
1) can be used for knot selection in spline smoothing, as well as for detecting trend changes in functional time series analysis.
There is a vast literature on testing for change-points or structural changes in a sequence of independent random variables/vectors. We refer to Csörgő and Horváth [
1], Brodsky and Darkhovsky [
2], Basseville and Nikiforov [
3], and Chen and Gupta [
4] for accounts of various techniques.
Within the functional data analysis literature, change-point detection has largely focused on the one change-point problem. In Berkes et al. [
5], a cumulative sum (CUSUM) test was proposed for independent functional data by using projections of the sample onto some principal components of covariance
. Later, the problem was studied in Aue et al. [
6], where its asymptotic properties were developed. This test was extended to weakly dependent functional data and epidemic changes by Aston and Kirch [
7]. Aue et al. [
8] proposed a fully functional method for finding a change in the mean without losing information due to dimension reduction. T. Harris, Bo Li, and J. D. Tucker [
9] proposed a multiple change-point isolation method for detecting multiple changes in the mean and covariance of a functional process.
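The projection-based CUSUM idea underlying this line of work can be sketched in a few lines. The following is a minimal illustration, not the exact procedure of [5]: it estimates the covariance from a discretely observed sample on a grid, projects onto the d leading eigenvectors, and combines eigenvalue-normalised CUSUM statistics of the scores. The function names are ours, and grid quadrature weights are omitted for brevity.

```python
import numpy as np

def cusum_statistic(scores):
    """Classical CUSUM functional of a 1-D score sequence:
    max_k |S_k - (k/n) S_n| / sqrt(n) for partial sums S_k."""
    n = len(scores)
    s = np.cumsum(scores)
    k = np.arange(1, n + 1)
    return np.max(np.abs(s - (k / n) * s[-1])) / np.sqrt(n)

def functional_cusum(sample, d=3):
    """Project a functional sample (n curves x p grid points) onto its
    d leading estimated principal components and combine the per-score
    CUSUM statistics, normalised by the estimated eigenvalues."""
    n, p = sample.shape
    centred = sample - sample.mean(axis=0)
    cov = centred.T @ centred / n            # empirical covariance on the grid
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:d]    # d leading eigenpairs
    lam, v = eigvals[order], eigvecs[:, order]
    scores = centred @ v                     # projection scores, shape (n, d)
    return sum(cusum_statistic(scores[:, j]) ** 2 / lam[j] for j in range(d))
```

A sample with a mid-sample mean shift yields a much larger statistic than a homogeneous one, which is the basis for rejecting the null hypothesis at large values.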
The methodology we propose is based on some measures of variation of the process:
where
.
Since this process is infinite-dimensional, we use the projection technique to reduce the dimension. To this aim, we assume that is mean-square continuous and jointly measurable and that has finite trace: . In this case, is also an -valued random element, where is a Hilbert space of Lebesgue square-integrable functions on endowed with the inner product and the norm .
In the case where the number of change-points is known to be no bigger than m, our test statistics are constructed from -variation (see the definition below) of the processes , where runs through a finite set of possibly random directions in . In particular, consists of estimated principal components. If the number of change-points is unknown, we consider the p-variation of the processes and estimate the possible number of change-points.
The paper is organized as follows. In
Section 2,
-sum and
-CUSUM processes are defined and their asymptotic behavior is considered in a framework of the
space. The results presented in this section are used to derive the asymptotic distributions of the test statistics presented in
Section 3.
Section 4 is devoted to simulation studies of the proposed test algorithms.
Section 5 contains a case study. Finally,
Section 6 is devoted to the proofs of our main theoretical results.
2. -Sum Process and Its Asymptotics
Let
be the set of all probability measures on (
. For any
and
Q-integrable function
f,
As usual,
is a set of measurable functions on
, which are square-integrable for the measure
Q, and
is an associated Hilbert space endowed with the inner product:
and corresponding distance
,
. We abbreviate
to
and
to
for Lebesgue measure
. We use the norm
and the distance
for the elements
. On the set
, we use the inner product:
and the corresponding distance:
For two given sets
, we consider the
-sum process:
where
,
is a uniform probability on the interval
and
. A natural framework for stochastic process
is the space
, where
. Recall for a class
that
is a Banach space of all uniformly bounded real-valued functions
on
endowed with the uniform norm:
Given a pseudometric d on , is the set of all that are uniformly d-continuous. The set is a separable subspace of if and only if is totally bounded. The pseudometric space is totally bounded if is finite for every , where is the minimal number of open balls of d-radius that are necessary to cover .
It is worth noting that the process
is continuous when
is endowed with the metric
. Indeed,
since
for every
If both sets
and
are totally bounded, then the process
is uniformly continuous so that
takes values in the subspace
.
Next, we specify the set
. To this aim, we recall some definitions. For a function
, a positive number
, and an integer
, the
-variation of
f on the interval
is
where the supremum is taken over all partitions
of the interval
. We abbreviate
. If
, then we say that
f has finite
p-variation and
is the set of all such functions. The set
,
, is a (non-separable) Banach space with the norm:
The embedding
is continuous and
For more information on the space
, we refer to [
10].
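For discretely observed functions, the p-variation over the sample grid can be computed exactly by a quadratic-time dynamic programme, since for p ≥ 1 the supremum over partitions is attained on a subsequence of the observation points. A minimal sketch (the function name is ours):

```python
def p_variation(x, p):
    """p-variation of a discrete path x[0..n-1]: the supremum over all
    partitions t_0 < ... < t_k of sum_j |x(t_j) - x(t_{j-1})|^p.  For
    p >= 1 the supremum is attained on a subsequence of sample points,
    so an O(n^2) dynamic programme suffices: best[j] is the maximal
    p-variation over partitions ending at index j."""
    n = len(x)
    best = [0.0] * n
    for j in range(1, n):
        best[j] = max(best[i] + abs(x[j] - x[i]) ** p for i in range(j))
    return best[-1]
```

For a monotone path and p > 1 the optimal partition keeps only the endpoints (merging increments increases |a + b|^p beyond |a|^p + |b|^p), whereas for p = 1 the result is the usual total variation.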
The limiting zero mean Gaussian process
is defined via covariance:
where
is the covariance operator corresponding to the kernel
. The function
is positive definite:
for all
,
, and
. Indeed, if we denote by
the isonormal Gaussian process on the Hilbert space
, we see that
hence,
and (
3) follows. This justifies the existence of the process
.
Throughout, we shall exploit the following.
Assumption 1. Random processes are i.i.d. mean square continuous, jointly measurable, with mean zero and covariance γ such that .
For the model (1), we consider the null hypothesis and two possible alternatives: and . In both alternatives, the function is responsible for the configuration of a drift within the sample, whereas the function determines the magnitude of the drift. Our main theoretical results are Theorems 1 and 3, which are proven in
Section 6.
Theorem 1. Let the random processes be defined by , where satisfy Assumption 1. Assume that, for some , the set is bounded and the set satisfies . Then, there exists a version of a Gaussian process on such that its restriction on is continuous and the following hold: . If , then the alternative corresponds to the presence of a signal in noise. In this case, . Therefore, the use of this theorem for testing a signal in noise is meaningful provided .
As a corollary, Theorem 1 combined with the continuous mapping theorem gives the following result.
Theorem 2. Assume that conditions of Theorem 1 are satisfied. Then, the following hold:
Proof. Since both (2a) and (2b) are by-products of Theorem 1 and continuous mappings, we need to prove only (2c). First, we observe that
by (2a). Consider
We have
By the Hölder inequality,
Since
, we deduce
Hence,
and this completes the proof of (2c). □
Next, we consider
-sum process
defined by
where
. Its limiting zero mean Gaussian process
is defined via covariance:
The existence of the Gaussian process
can be justified in the same way as that of
above. Just notice that
where
.
Theorem 3. Assume that the conditions of Theorem 1 are satisfied. Then, there exists a version of the Gaussian process on such that its restriction on is continuous and the following hold:
- (3a)
- (3b)
Under alternative , where
We see that the limit distribution of the -sum process separates the null and alternative hypotheses provided . As a corollary, Theorem 3 combined with the continuous mapping theorem gives the following results.
Theorem 4. Assume that the conditions of Theorem 1 are satisfied. Then, the following hold:
Proof. Both (4a) and (4b) are by-products of Theorem 3 and continuous mappings, whereas the proof of (4c) follows the lines of the proof of Theorem 2 (2c). □
3. Test Statistics
Several useful test statistics can be obtained from the -sum process by considering concrete examples of sets and .
Throughout this section, we assume that the sample
follows the model (
1) and
satisfies Assumption 1.
By
, we denote the covariance operator of
Y:
. Recall
According to Mercer’s theorem, the covariance
has then the following singular-value decomposition:
where
are all the decreasingly ordered positive eigenvalues of
and
are the associated eigenfunctions of
such that
and
m is the smallest integer such that, when
,
. If
, then all the eigenvalues are positive, and in this case,
. Note that
Besides, we shall assume the following.
Assumption 2. The eigenvalues satisfy, for some , . In statistical analysis, the eigenvalues and eigenfunctions of Γ are replaced by their estimated versions. Noting that, for each k, , one estimates Γ by , where . We denote the eigenvalues and eigenfunctions of by and , respectively. In order to ensure that may be viewed as an estimator of rather than of , we assume in what follows that the signs are such that . Note that and . The use of the estimated eigenfunctions and eigenvalues in the test statistics is justified by the following result. For a Hilbert–Schmidt operator T on , we denote by its Hilbert–Schmidt norm.
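The estimation step above (empirical covariance, its leading eigenpairs, and the sign convention that the inner product of each estimated eigenfunction with its population counterpart is nonnegative) can be sketched for curves observed on a regular grid. This is our own minimal illustration: the grid spacing dt acts as a quadrature weight so that the discrete eigenvalues approximate the L2 ones, and the function names are assumptions.

```python
import numpy as np

def estimated_eigenpairs(sample, d, dt=1.0):
    """Leading d eigenvalues/eigenfunctions of the empirical covariance
    operator of a sample (n curves x p grid points).  Multiplying the
    grid covariance by dt approximates the L2 integral operator; the
    eigenvectors are rescaled to unit L2 norm."""
    centred = sample - sample.mean(axis=0)
    cov = centred.T @ centred / len(sample)
    eigvals, eigvecs = np.linalg.eigh(cov * dt)
    order = np.argsort(eigvals)[::-1][:d]
    return eigvals[order], eigvecs[:, order] / np.sqrt(dt)

def align_signs(phi_hat, phi, dt=1.0):
    """Flip each estimated eigenfunction so that <phi_hat_k, phi_k> >= 0,
    matching the sign convention used for the estimators in the text."""
    signs = np.sign(np.sum(phi_hat * phi, axis=0) * dt)
    signs[signs == 0] = 1.0
    return phi_hat * signs
```

In practice the population eigenfunctions are unknown, so the sign convention is a theoretical device: it fixes the otherwise arbitrary sign of each estimated eigenfunction for the asymptotic statements.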
Lemma 1. Assume that Assumption 1 holds. Then, under , .
Proof. First, we observe that
where
It is well known that
as
. By the moment inequality for sums of independent random variables, we deduce
for both
. This yields
. Next, we have
as
due to assumption
. This completes the proof. □
Lemma 2. Assume that Assumptions 1 and 2 hold for some finite d and that . Then, under , as well as under : for each , where .
Proof. If the null hypothesis is satisfied, then
and the asymptotic results for the eigenvalues and eigenfunctions of
are well known (see, e.g., [
11]). Under alternative
, the results follow from Lemma 1 and Lemmas 2.2 and 2.3 in [
11]. □
Next, we consider separately the test statistics for at most one, at most m, and for an unknown number of change-points.
3.1. Testing at Most One Change-Point
This statistic is designed for at most one change-point alternative. Its limiting distribution is established in the following theorem.
Theorem 5. Let random functional sample be defined by where satisfies Assumptions 1 and 2. Then,
- (a)
Under , it holds thatwhere are independent standard Brownian bridge processes; - (b)
Under , it holds thatwhere - (c)
Under , it holds that
Proof. Consider the sets
Observing that
and
is a bounded set in
, we complete the proof by applying Theorem 3. □
Based on this result, we construct the testing procedure in a classical way. Choose for a given
,
such that
According to Theorem 5, the test:
will have asymptotic level
. Under the alternative
, we have
when
Hence, if
and
as
, then the test (
20) is asymptotically consistent.
Let us note that, due to the independence of Brownian bridges
, we have
This yields
Hence,
is the
-quantile of the distribution of
. This observation simplifies the calculations of critical values
.
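Since the limiting statistic is a supremum involving d independent standard Brownian bridges, the critical values can be approximated by direct Monte Carlo simulation, with each bridge obtained from a simulated Brownian path via B(t) = W(t) − tW(1). The following is a sketch with our own defaults for the grid and replication counts, not the exact computation used in the paper.

```python
import numpy as np

def critical_value(d, alpha=0.05, n_grid=1000, n_sim=2000, seed=0):
    """Monte Carlo estimate of the (1 - alpha)-quantile of
    sup_t sum_{k=1}^d B_k(t)^2 for independent standard Brownian
    bridges B_1, ..., B_d on [0, 1]."""
    rng = np.random.default_rng(seed)
    t = np.linspace(0.0, 1.0, n_grid + 1)
    sups = np.empty(n_sim)
    for s in range(n_sim):
        incr = rng.standard_normal((d, n_grid)) / np.sqrt(n_grid)
        w = np.concatenate([np.zeros((d, 1)), np.cumsum(incr, axis=1)], axis=1)
        bridge = w - t * w[:, -1:]      # B(t) = W(t) - t W(1)
        sups[s] = np.max(np.sum(bridge ** 2, axis=0))
    return float(np.quantile(sups, 1.0 - alpha))
```

For d = 1, the estimate should be close to 1.358² ≈ 1.84, the square of the 95% quantile of the Kolmogorov distribution of sup|B(t)|.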
In particular, if there is
such that
then we have one change-point model:
In this case,
Figure 1 below shows generated density functions of
and
for
,
where
for a fixed
k.
Let us observe that the test statistic tends to infinity when . On the other hand, with larger d, the series approximation of improves and leads to higher testing power. The following result establishes the asymptotic distribution of as .
Theorem 6. Let the random functional sample be defined by , where satisfies Assumption 1. Then, under , where .
Proof. By Theorem 5, the proof reduces to
It is known that
Since Brownian bridges
are independent, we have
and
This proves (
24). □
When
d is large, the test (
20) becomes
and has asymptotic level
as
n and
d tend to infinity.
The dependence on
d of critical values of the tests (
20) and (
25) is shown in
Figure 2. A comparison was made for asymptotic level
. From
Figure 2, we see that the critical values in (
25) are smaller than those in (
20). This means that a Type I error is more likely with the test (25) than with the test (20). This is confirmed by simulations.
If the eigenfunctions
are unknown, we use the statistics:
Theorem 7. Let random functional sample be defined by , where satisfies Assumptions 1 and 2. Then:
- (a)
Under ,where are independent standard Brownian bridge processes; - (b)
Under , if , it holds thatwhere . - (c)
Under , if , it holds that
Proof. The result follows from Theorem 5 if we show that
On the set
and for
such that
, simple algebra gives
where
and
by the law of large numbers. Lemma 2 concludes the proof. □
Test (
20) now becomes
and has asymptotic level
by Theorem 7.
3.2. Testing at Most m Change-Points
For
, let
be a set of all partitions
of the set
such that
. Next, consider for fixed integers
d,
and real
,
The statistics
are designed for testing at most
m change-points in a sample.
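Maximising over all partitions with at most m interior breakpoints is the combinatorial step shared by statistics of this type, and it admits an O(mn²) dynamic programme. The sketch below is our own generic version: the interval score is an arbitrary placeholder standing in for the paper's summands, and the function names are assumptions.

```python
def best_partition(score, n, m):
    """Maximise sum_j score(t_{j-1}, t_j) over partitions
    0 = t_0 < ... < t_k = n with at most m interior breakpoints.
    score(i, j) is any interval score on [i, j).  Returns the best
    value and the sorted list of interior breakpoints."""
    neg = float("-inf")
    # dp[k][j]: best score covering [0, j] with exactly k intervals
    dp = [[neg] * (n + 1) for _ in range(m + 2)]
    back = [[0] * (n + 1) for _ in range(m + 2)]
    for j in range(1, n + 1):
        dp[1][j] = score(0, j)
    for k in range(2, m + 2):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                cand = dp[k - 1][i] + score(i, j)
                if cand > dp[k][j]:
                    dp[k][j] = cand
                    back[k][j] = i
    k_best = max(range(1, m + 2), key=lambda k: dp[k][n])
    cuts, j = [], n
    for k in range(k_best, 1, -1):
        j = back[k][j]
        cuts.append(j)
    return dp[k_best][n], sorted(cuts)
```

As a usage example, scoring each interval by minus its within-segment sum of squares recovers the obvious breakpoint of a piecewise-constant sequence.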
Theorem 8. Let the random sample be as in Theorem 1. Then:
- (a)
Under ,where are independent standard Brownian bridges. - (b)
Under ,where is as defined in Theorem 2. - (c)
Proof. For
and
, set
It is easy to check that
. Since
we have
and the results follow from Theorem 2. □
In particular, if there is
such that
then (
1) corresponds to the so-called changed-segment model. In this case, we have
.
Figure 3 shows the generated density functions of
and
for different values of
,
, and
. The numbers
were sampled from the uniform distribution on
.
With the estimated eigenvalues and eigenfunctions, we define
Theorem 9. Let the functional sample be defined by where satisfies Assumptions 1 and 2. Then:
- (a)
Under ,where are independent standard Brownian bridges. - (b)
Under ,where is as defined in Theorem 2. - (c)
Proof. This goes along the lines of the proof of Theorem 7. □
According to Theorems 8 and 9, the tests:
respectively, will have asymptotic level
, if
is such that
3.3. Testing an Unknown Number of Change-Points
Next, consider for fixed integers
d as above and real
,
The statistics
are designed for testing an unknown number of change-points in a sample.
Theorem 10. Let random sample be as in Theorem 1. Then:
- (a)
Under ,where are independent standard Brownian bridges. - (b)
Under where is as defined in Theorem 1. - (c)
Proof. For
, set
It is easy to check that
. Since
we have
and both statements (a) and (b) follow from Theorem 1. □
With the estimated eigenvalues and eigenfunctions, we define:
Theorem 11. Let random sample be as in Theorem 1. Then:
- (a)
Under ,where are independent standard Brownian bridges. - (b)
Under where is as defined in Theorem 1. - (c)
Proof. This goes along the lines of the proof of Theorem 7. □
According to Theorems 10 and 11, the tests:
respectively, will have asymptotic level
, if
is such that
The quantiles of the distribution function of
were estimated in [
12].
5. Application to Brain Activity Data
This section demonstrates the performance of the proposed test on real data. The data were collected during a long-term study on voluntary alcohol-consuming rats following chronic alcohol experience. The data consist of two sets: neurophysiological activity from two brain centers (the dorsal and ventral striatum) and data from the lickometer device. The lickometer was used to monitor drinking bouts. During a single trial, two locations of the brain were monitored for each rat. Rats were given two drinking bottles, one with alcohol and the other with water, and at any time they could freely choose what to drink. Electrodes were attached to the brains, and neurophysiological data were sampled at 1 kHz. The goal of this study was not to confirm or reject the original findings, but to show the advantages of the functional approach for change-point detection. For this reason, the data are well suited to illustrate the behavior of the test in real-world settings.
In our analysis, we took the first alcohol-drinking event, which lasted around 27 s, and included 10 s before and 10 s after the event, giving a total of 47 s. The time series was divided into segments of 100 ms, each containing 100 data points.
All segments were smoothed into functions using 50 B-spline basis functions. The overall functional sample contained 470 functions
. The functional sample was separated into sub-samples
,
. For each sub-sample
, two statistics were calculated (
(
35) and
(
31),
).
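The smoothing step (100-point segments fitted with 50 B-spline basis functions) can be reproduced with SciPy's least-squares spline fit. This is a sketch under our own choices of cubic splines and uniformly spaced knots; the knot vector is built so that the spline space has exactly n_basis dimensions.

```python
import numpy as np
from scipy.interpolate import make_lsq_spline

def smooth_segment(y, n_basis=50, degree=3):
    """Least-squares B-spline fit of one segment observed on a regular
    grid, rescaled to [0, 1].  The spline space dimension equals
    n_interior_knots + degree + 1, so n_interior = n_basis - degree - 1."""
    x = np.linspace(0.0, 1.0, len(y))
    n_interior = n_basis - degree - 1
    interior = np.linspace(0.0, 1.0, n_interior + 2)[1:-1]
    knots = np.concatenate([[0.0] * (degree + 1), interior, [1.0] * (degree + 1)])
    return make_lsq_spline(x, y, knots, k=degree)
```

With 50 basis functions over 100 observations, the fit is close to interpolation for smooth signals while still averaging out high-frequency noise.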
The results are visualized in
Figure 11. We can see that tests with statistics
and
strongly rejected the null hypothesis from around 2 s onward after the rat started to consume the alcohol, which suggests that changes in brain activity can be observed. However, the changes appear to occur only in the CPu brain region. Interestingly, the statistic
has much larger volatility compared to the unrestricted
in the Nacc brain region before the drinking event and lower volatility just after the drinking event started. However, it is not fully clear if this is the expected behavior or a Type I error.
Finally, the locations of the restricted (
)
p-variation partition points nearly matched the beginning and the end of the drinking period. In
Figure 11, the gray vertical dashed lines indicate the actual beginning and the actual end of the drinking period measured by the lickometer and the black vertical lines indicate the location of the partitions calculated from the functional sample
. The first partition is located at 10.5 s and the second partition point at 38.4 s, which aligns well with the data collected from the lickometer.
The test with a restricted partition count showed weaker statistical power, but it helped determine the locations of the change-points.