On the Height of One-Dimensional Random Walk

Mohamed Abdelkader

doi:10.3390/math11214513

Department of Statistics and Operations Research, Faculty of Sciences, King Saud University, Riyadh 11451, Saudi Arabia

Mathematics2023, 11(21), 4513;https://doi.org/10.3390/math11214513

This article belongs to the Special Issue Advances of Applied Probability and Statistics

Version Notes

Order Reprints

Abstract

Consider the one-dimensional random walk

X_{n}

: as it evolves (at each unit of time), it either increases by one with probability p or resets to 0 with probability

1 - p

. In the present paper, we analyze the law of the height statistics

H_{n}

, corresponding to our model

X_{n}

. Also, we prove that the limiting distribution of the walk

X_{n}

is a shifted geometric distribution with parameter

1 - p

and find the closed forms of the mean and the variance of

X_{n}

using the probability-generating function.

Keywords:

height; return time; random walk

MSC:

60C05; 60F99; 60E05; 60G40

1. Introduction

Let

{(X_{n})}_{n \in N}

be a discrete random walk with one dimension defined as follows: The walk starts from the origin at Time 0. After one unit of time, the process

X_{n}

shifts by one positive unit with probability

p \in (0, 1)

or resets to 0 with probability

1 - p

. We provide three examples of the evolution of our random walk until time n = 10:

0 \overset{p}{⟶} 1 \overset{p}{⟶} 2 \overset{1 - p}{⟶} 0 \overset{p}{⟶} 1 \overset{p}{⟶} 2 \overset{p}{⟶} 3 \overset{p}{⟶} 4 \overset{p}{⟶} 5 \overset{p}{⟶} 6 \overset{p}{⟶} 7,

0 \overset{p}{⟶} 1 \overset{p}{⟶} 2 \overset{p}{⟶} 3 \overset{1 - p}{⟶} 0 \overset{p}{⟶} 1 \overset{p}{⟶} 2 \overset{p}{⟶} 3 \overset{p}{⟶} 4 \overset{1 - p}{⟶} 0 \overset{p}{⟶} 1,

0 \overset{p}{⟶} 1 \overset{1 - p}{⟶} 0 \overset{1 - p}{⟶} 0 \overset{1 - p}{⟶} 0 \overset{1 - p}{⟶} 0 \overset{p}{⟶} 1 \overset{p}{⟶} 2 \overset{1 - p}{⟶} 0 \overset{1 - p}{⟶} 0 \overset{1 - p}{⟶} 0,

In the above examples, the height of the preceding walks are equal to 7, 4, and 2, respectively.

In this work, we are interested in analyzing the height statistics, denoted by

H_{n}

, of the random walk

X_{n}

. Our analysis of the height is based on the combinatorial analysis of the coefficient

a_{(n, r, k)} = Card (I_{n, r, k})

, representing the number of ways to choose r distinct integers

n_{1}, \dots, n_{r}

satisfying the following conditions:

I_{n, r, k} = {1 \leq n_{1} < \dots < n_{r} \leq n} \cap {max (n_{i + 1} - n_{i}) \leq k, 1 \leq i \leq r},

such that

n_{1} = 1

and

n_{r} = n

and based on the probability distribution of the return time, denoted by

N_{n}

, of the random walk

X_{n}

, given by

P (N_{n} = r) = p^{n} {(\frac{1 - p}{p})}^{r + 1} \sum_{s = 1}^{n - r - 1} (\binom{n - 1 - s}{r}) p^{s} .

Our contribution in this current paper is finding a closed form of the distribution of the height statistics

H_{n}

using a combinatorial analysis of the coefficient

a_{(n, r, k)}

and the distribution of the return time

N_{n}

of the random walk

X_{n}

. The closed form of the probability distribution of

H_{n}

is given by:

P (H_{n} \leq k) = p^{2 n} \sum_{r = 1}^{n} \sum_{s = 1}^{n - r - 1} p^{s} a_{(n, r, k)} {(\frac{1 - p}{p})}^{2 r + 1} (\binom{n - 1 - s}{r}) I_{I_{n, r, k}} .

Furthermore, we study the statistical properties of the random walk

X_{n}

, like the mean, the variance, and the limiting distribution of

X_{n}

. Precisely, we prove that the limiting distribution of the random walk

X_{n}

is a shifted geometric distribution with parameter

1 - p

, and we give the closed forms of the mean and the variance of

X_{n}

.

This analysis of the height statistics is very important, and it is applicable to many aspects of renewable energy. For example, electricity today plays a very important role in daily activities and is very essential for transport, education, healthcare, and many other sectors. For this reason, controlling electricity consumption is necessary and is performed by estimating the maximum amount of electricity consumption. Electricity consumption is estimated via statistical methods such as time series models [1,2], regression models [3], and ARIMA models [4]. Furthermore, this maximum amount is similar to the height of the electricity consumption in a given period of time.

In the literature, the statistical properties of the height statistics are studied in one dimension via the kernel method and singularity analysis (see [5,6]). For example, we can mention the distribution of the ranked heights of the excursions of a Brownian bridge, investigated by Pitman and Yor in [7]. Similarly, Csaki and Hu analyzed the asymptotic properties of ranked heights in Brownian excursions in [8]. Also, Csaki and Hu analyzed the lengths and heights of random walk excursions in [9]. Furthermore, Katzenbeisser and Panny studied the maximal height of simple random walks, which were revisited in [10]. In addition, Banderier and Nicodème [11] studied the height of discrete bridges/meanders/excursions for bounded discrete walks. Also, Aguech, Althagafi, and Banderier in [12] analyzed the height of walks with resets and the Moran model.

This paper is organized as follows. In Section 2, we introduce our model in detail and define the return time and the height statistics, denoted by

H_{n}

, of our random walk

X_{n}

. In Section 3, we present our main result concerning the distribution of the height statistics of the random walk

X_{n}

. In Section 4, we use the R program to find all possibilities of the integers

n_{1}, \dots, n_{r}

satisfying the conditions defined in Equation (4) and compute the combinatorial coefficient for different values of n, r, and k. In Section 5, we prove that the limiting distribution of the random walk

X_{n}

is a shifted geometric distribution with parameter

1 - p

. Also, we use the probability-generating function of the random walk

X_{n}

to obtain their mean and variance. In Section 6, we present some conclusions concerning our results and some perspectives.

2. Definitions and Presentation of the Model

In this section, we define an elegant tool called the probability-generating function, which plays an important role in finding the mean and variance of the random walk. Next, we present our model: a one-dimensional random walk. Finally, we finish this section by providing definitions of some statistics like the return time and the height.

Let U be a discrete random variable with distribution

P (U = r) = p_{r}

,

r \in N

. The probability-generating function, denoted by G, of the variable U is defined by:

G_{U} (u) = E (u^{U}) = \sum_{r = 0}^{\infty} u^{r} p_{r},

for all

u \in R

such that

| u | < 1

.

The probability-generating functions constitute an elegant tool to study the statistical characteristics of a random walk. Precisely, the probability density functions associated with discrete stochastic processes and their moments can be obtained from the derivatives of the probability-generating function. In fact, we can obtain the closed forms of the mean and the variance of the process if we derive the probability-generating function, at

u = 1

. For more details, see [6,13,14].

Furthermore, we introduce the following important equations, which are related to the mean and variance of U and

G_{U} (u)

:

E (U) = \frac{\partial G_{U} (u)}{\partial u} |_{u = 1} and V a r (U) = \frac{\partial^{2} G_{U} (u)}{\partial^{2} u} |_{u = 1} + E (U) - E {(U)}^{2} .

(1)

Consider the one-dimensional random walk

X_{n}

. It starts from 0 at Time 0 (i.e.,

X_{0} = 0

), parameterized by a probability

p \in (0, 1)

. It is given by the following system:

\begin{matrix} X_{n + 1} & = \{\begin{matrix} X_{n} + 1 & with probability p, \\ 0 & with probability 1 - p, \end{matrix} \end{matrix}

(2)

where

p \in (0, 1)

. We denote by the statistics

N_{n}

the number of return times of the random walk

X_{n}

to 0 up to time n and

H_{n}

the height of the random walk

X_{n}

:

\begin{matrix} N_{n} & = Card \{1 \leq k \leq n, such that, X_{k} = 0\}, \\ H_{n} & = max (X_{1}, X_{2}, \dots, X_{n}) . \end{matrix}

3. Main Result

The goal of this section is to obtain the distribution of

H_{n}

. To reach this goal, we apply at first a very important result concerning the distribution of the return time

N_{n}

of the random walk

X_{n}

(see Theorem 3 in [15]). For the second setup, we analyze the joint distribution of

(H_{n}, N_{n})

using the conditional probability and the marginal distribution of the return time

N_{n}

. Finally, we deduce the marginal distribution of

H_{n}

.

Now, we present a very important result concerning the distribution of the return time,

N_{n}

, of the random walk

X_{n}

.

Lemma 1

([15]). The exact distribution of

N_{n}

is given by

P (N_{n} = r) = p^{n} {(\frac{1 - p}{p})}^{r + 1} \sum_{s = 1}^{n - r - 1} (\binom{n - 1 - s}{r}) p^{s} .

Consider the following event

\{H_{n} \leq k | N_{n} = r\}

representing the height statistics

H_{n}

, bounded by k, given that the return time

N_{n}

equals r of the random walk

X_{n}

:

\begin{matrix} \{H_{n} \leq k | N_{n} = r\} & = ⋃_{I_{n, r, k}} {G_{1} = n_{1}, \dots, G_{r} = n_{r} - n_{r - 1}, X_{n_{r} + 1} \dots X_{n} \neq 0}, \end{matrix}

(3)

where

G_{i}

are i.i.d. geometric random variables with parameter

1 - p

and

I_{n, r, k} = {1 \leq n_{1} < \dots < n_{r} \leq n} \cap {max (n_{i + 1} - n_{i}) \leq k, 1 \leq i \leq r},

(4)

such that

n_{0} = 1

and

n_{r + 1} = n

. We define the combinatorial coefficient

a_{n, r, k}

:

a_{(n, r, k)} = Card (I_{n, r, k}),

(5)

representing the number of ways to choose r distinct integers

n_{1}, \dots, n_{r}

satisfying the conditions in Equation (4).

Remark 1.

The combinatorial coefficient

a_{n, r, k}

depends on the parameters n, r, and k, where n represents the length of the random walk

X_{n}

and a k integer less than n.

We present a closed form of the combinatorial coefficient

a_{n, r, k}

in the next lemma.

Lemma 2.

The coefficient

a_{n, r, k}

is given by

a_{n, r, k} = [z^{n}] {(\sum_{j = 1}^{k} z^{j})}^{r + 1} = [z^{n - r - 1}] {(\sum_{j = 0}^{k - 1} z^{j})}^{r + 1},

(6)

where

[z^{m}] G (z)

stands for the coefficient of

z^{m}

in the power series

G (z)

.

Proof.

For all

(n_{1}, \dots, n_{r}) \in I_{n, r, k}

, let

k_{1} = n_{1}, \dots k_{i} = n_{i + 1} - n_{i}

and

k_{r + 1} = n - n_{r}

. It is obvious that identifying

(k_{1}, \dots, k_{r + 1})

is equivalent to identifying

(n_{1}, \dots, n_{r})

, and then:

a_{n, r, k} = \{(k_{1}, \dots, k_{r + 1}) \in {\{1, \dots, k\}}^{r + 1}, such that, \sum_{i = 1}^{r + 1} k_{i} = n\} . = [z^{n}] {(\sum_{j = 1}^{k} z^{j})}^{r + 1} .

□

Remark 2.

From Equation (6), the combinatorial coefficient

a_{n, r, k}

is the coefficient of

z^{n - r - 1}

in the power series

G (z)

.

Next, we give some results about the height

H_{n}

of the random walk

X_{n}

. It represents the maximal height attained by the walk

X_{n}

, in all of the past from 1 to n. This means that, for all n and for all

k \leq n

, the values of

X_{n}

are between 0 and k. For this purpose, firstly, we compute the joint distribution of the discrete return time

N_{n}

and the height

H_{n}

of the random walk

X_{n}

. Secondly, for all

k \in {0, \dots, n - 1}

and for all

r \in {1, \dots, n - 1}

, we find the conditional probability of the height

H_{n}

bounded by the integer k given that the return time

N_{n}

equals r. Furthermore, we determine the probability of the intersection between the events

{H_{n} \leq k}

and

{N_{n} = r}

. Finally, we deduce the marginal distribution of

H_{n}

.

The next theorem leads to the conditional probability that the height

H_{n}

of the random walk

X_{n}

is bounded by k given that the return time

N_{n}

equals r.

Theorem 1.

The conditional distribution of

H_{n}

, given

N_{n} = r

, is given by

\begin{matrix} P (H_{n} \leq k | N_{n} = r) & = \sum_{I_{n, r, k}} P (G_{1} = n_{1}, G_{2} = n_{2} - n_{1}, \dots, G_{r} = n_{r} - n_{r - 1}, X_{n_{r} + 1} \dots X_{n} \neq 0) \\ = a_{(n, r, k)} p^{r} {(1 - p)}^{n - r}, \end{matrix}

Proof.

Using (3), we have

\begin{matrix} P (H_{n} \leq k | N_{n} = r) = & \sum_{I_{n, r, k}} P ({X_{1} \neq 0, X_{2} \neq 0, \dots, X_{n_{1} - 1} \neq 0, X_{n_{1}} = 0} \\ \cap {X_{n_{1} + 1} \neq 0, \dots, X_{n_{2} - 1} \neq 0, X_{n_{2}} = 0} \\ ⋮ \\ \cap {X_{n_{r - 1} + 1} \neq 0, \dots, X_{n_{r} - 1} \neq 0, X_{n_{r}} = 0} \\ \cap {X_{n_{r} + 1} \neq 0, \dots, X_{n_{r} - 1} \neq 0, X_{n} \neq 0}), \end{matrix}

where

n_{r + 1} = n

,

n_{1} = 1

, and

i \in {0, 1, \dots, r}

.

Using the conditional probability, we obtain

\begin{matrix} P (H_{n} \leq k | N_{n} = r) = & \sum_{I_{n, r, k}} P [{X_{n} \neq 0, X_{2} \neq 0, \dots, X_{n_{r} + 1} \neq 0} | {X_{n_{r}} = 0}] \\ \times P [{X_{n_{r}} = 0, X_{n_{r} - 1} \neq 0, \dots, X_{n_{r - 1} - 1} \neq 0, X_{n_{r - 1} + 1} \neq 0} | {X_{n_{r - 1}} = 0}] \\ \times P [{X_{n_{r - 1}} = 0, X_{n_{r - 1} - 1} \neq 0, \dots, X_{n_{r - 2} + 1} \neq 0} | {X_{n_{r - 2}} = 0}] \\ ⋮ \\ \times P [{X_{n_{2}} = 0, \dots, X_{n_{2} - 1} \neq 0, X_{n_{1} + 1} \neq 0} | {X_{n_{1}} = 0}] \\ \times P [{X_{n_{1}} = 0, \dots, X_{n_{1} - 1} \neq 0, X_{2} \neq 0}] . \end{matrix}

Thus,

\begin{matrix} P (H_{n} \leq k | N_{n} = r) = & \sum_{I_{n, r, k}} p^{n - n_{r}} (1 - p) p^{n_{r} - n_{r - 1} - 1} \dots (1 - p) p^{n_{2} - n_{1} - 1} (1 - p) p^{n_{1}} \\ = & a_{(n, r, k)} {(1 - p)}^{r} p^{n - r} . \end{matrix}

□

From Theorem 1, we deduce the joint distribution of the following events

{N_{n} = r}

and

{H_{n} \leq k}

.

Corollary 1.

The joint distribution of

(N_{n}, H_{n})

satisfies the following relation:

P (H_{n} \leq k, N_{n} = r) = a_{(n, r, k)} p^{2 n} {(\frac{1 - p}{p})}^{2 r + 1} \sum_{s = 1}^{n - r - 1} (\binom{n - 1 - s}{r}) p^{s},

(7)

where

a_{(n, r, k)}

is defined in Lemma 1.

Proof.

One has

P (H_{n} \leq k, N_{n} = r) = P (H_{n} \leq k | N_{n} = r) P (N_{n} = r),

Applying Lemma 1 and Theorem 1, we obtain

P (H_{n} \leq k, N_{n} = r) = a_{(n, r, k)} p^{2 n} {(\frac{1 - p}{p})}^{2 r + 1} \sum_{s = 1}^{n - r - 1} (\binom{n - 1 - s}{r}) p^{s},

where

a_{(n, r, k)}

is defined in Lemma 1. □

We deduce here some information about the distribution of

H_{n}

. By summing over r in Equation (7), we obtain the marginal distribution of

H_{n}

, as follows:

Corollary 2.

The probability distribution of the height statistics

H_{n}

of the random walk

X_{n}

is given by the following equation:

P (H_{n} \leq k) = p^{2 n} \sum_{r = 1}^{n} \sum_{s = 1}^{n - r - 1} p^{s} a_{(n, r, k)} {(\frac{1 - p}{p})}^{2 r + 1} (\binom{n - 1 - s}{r}) I_{I_{n, r, k}},

where

I_{n, r, k}

is defined in Equation (4).

4. Simulation of the Combinatorial Coefficient $a_{n, r, k}$

In this section, we use the R program to compute the combinatorial coefficient

a_{(n, r, k)}

for different values of n, r, and k. In the first case, we find the value of the coefficient

a_{n, r, k}

and count all the possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 7

,

k \in {2, 3}

and

r \in {4, 5, 6}

. In the second case, we determine the possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 5

,

k \in {2, 3}

and

r \in {3, 4}

. Also, we list the values of the combinatorial coefficient

a_{n, r, k}

for different values of n (4, 5, 6, and 7), r (2, 3, 4, and 5), and k (2, 3, 4, 5, and 6).

In Table 1, we find all the possibilities of the integers

n_{1}, \dots, n_{r}

under the conditions defined in Equation (4) for different values of r and k when n equals 7 and compute the corresponding combinatorial coefficient

a_{n, r, k}

. Precisely, in the first case, when n and k are fixed at 7 and 2, respectively, and the number r takes values of 4, 5, and 6, then the combinatorial coefficient

a_{n, r, k}

takes the values 17, 12, and 7. This means that, when r increases, then the coefficient

a_{n, r, k}

decreases. In the second case, if n and k equal 7 and 2 and r increases from 2 to 3, then the coefficient

a_{n, r, k}

increases from 12 to 18, respectively. This means that the coefficient

a_{n, r, k}

increases when k increases. Also, from Table 1, we observe that

a_{n, r, k}

is fixed at 7 when r is near n and k byat least 2 (

r = 6

and

k \geq 2

).

Table 1. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 7

.

Table 2 lists all the possibilities of the integers

n_{1}, \dots, n_{r}

under the conditions defined in Equation (4) for different values of r and k when n equals 5. Also, we deduce the value of the combinatorial coefficient

a_{n, r, k}

for each list. Furthermore, Table 2 shows two cases of the increasing of the combinatorial coefficient

a_{n, r, k}

, which depends on the parameters r (the number of integers

n_{1}, \dots, n_{r}

) and h (the bound of the height

H_{n}

of the random walk

X_{n}

). Precisely, in the first case, when n and k are fixed at 5 and 2 and r increases from 3 to 4, then the combinatorial coefficient

a_{n, r, k}

decreases from 8 to 5. This means that the coefficient

a_{n, r, k}

decreases when r increases. In the second case, if n and r are equal to 5 and 3 and k increases from 2 to 3, then the coefficient

a_{n, r, k}

increases from 8 to 10, respectively. This means the coefficient

a_{n, r, k}

increases when k increases for fixed n and r.

Table 2. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 5

.

Table 3 shows that the combinatorial coefficient

a_{n, r, k}

depends on the three parameters n, r, and K. Precisely, this coefficient is increasing or decreasing if the parameters n, r, and p change. From Table 3, we distinguish three cases concerning the computation of

a_{n, r, k}

:

Table 3. The combinatorial coefficient

a_{(n, r, k)}

for different values of n.

In the first case, when n is increasing, r and k are fixed, then we observe that the coefficient

a_{n, r, k}

increases. For example, if n takes values of 4, 5, 6, and 7 and r and k equal 2 and 4, then

a_{n, r, k}

takes values of 6, 10, 14, and 16, respectively. Sometimes, this increasing of

a_{n, r, k}

is very quick, and it takes values of 4, 5, 15, and 32 when n takes values of 4, 5, 6, and 7 and r and k equal 4 and 3, respectively. But, sometimes,

a_{n, r, k}

decreases under the same conditions. For example,

a_{n, r, k}

takes values of 5, 5, 3, and 1 when n equals 4, 5, 6, and 7 and r and k equal 2.

In the second case, the combinatorial coefficient

a_{n, r, k}

decreases or increases when n and k are fixed but r increases. This means that there exists a maximal coefficient

a_{n, r, k}

for special values of n, r, and k. Firstly, if n equals 5, k equals 3, and r takes values of 2, 3, and 4, then the coefficient

a_{n, r, k}

increases from 9 to 10 and decreases to 5, respectively. Secondly, the coefficient

a_{n, r, k}

increases from 3 to 11 and decreases to 6 when n equals 6, k equals 2, and r takes values of 2, 3, 4, and 5. Finally, the coefficient

a_{n, r, k}

increases from 16 to 35 and decreases to 21 when n equals 7, k equals 4, and r takes values of 2, 3, 4, and 5.

In the third case, the combinatorial coefficient

a_{n, r, k}

increases when n and r are fixed, but k increases. This means that

a_{n, r, k}

and k are proportionally related. Firstly, if n equals 5, r equals 3, but k takes values of 1, 2, 3, and 4, then the coefficient

a_{n, r, k}

equals 1, 8, 10, and 10, respectively. Secondly, if n equals 6, r equals 4, but k takes values of 2, 3, and 4, then the coefficient

a_{n, r, k}

equals 11, 15, and 10, respectively. Finally, if n equals 7, r equals 3, but k takes values of 2, 3, 4, 5, and 6, then the coefficient

a_{n, r, k}

equals 6, 25, 33, 35, and 35, respectively.

Furthermore, Table 3 shows that there exists a maximal combinatorial coefficient

a_{n, r, k}

for special values of n, r, and k. Firstly,

a_{n, r, k}

equals 6 when

n = 4

,

r = 2

, and k increases from 3 to n. Secondly,

a_{n, r, k}

equals 10 when

n = 5

,

r = 2

, and k increases from 4 to n or

r = 3

and k increases from 3 to n. Next,

a_{n, r, k}

equals 20 when

n = 6

,

r = 3

, and k increases from 4 to n. Finally,

a_{n, r, k}

equals 35 when

n = 7

,

r = 3

, and k increases from 5 to n or

r = 4

and k increases from 4 to n.

Finally, we observe a very nice property of the combinatorial coefficient

a_{n, r, k}

. This property depends on the parity of n and the length of the random walk

X_{n}

. Precisely, we mention that, if n is an even number, r equals

(n / 2)

, and k takes any value from

r + 1

to n, then

a_{n, r, k}

is maximal. For example, when

n = 4

,

r = 2

, and

3 \leq k \leq 4

or

n = 6

,

r = 3

, and

4 \leq k \leq 6

, the combinatorial coefficient

a_{n, r, k}

is maximal and equals 10 and 20, respectively. But, if n is an odd number, r equals

\frac{n - 1}{2}

and k takes any value from

r + 2

to n or r equals

\frac{n + 1}{2}

and k takes any value from r to n, then

a_{n, r, k}

is maximal. For the first example, when

n = 5

,

r = 2

, and

4 \leq k \leq 5

or

r = 3

and

3 \leq k \leq 5

, the combinatorial coefficient

a_{n, r, k}

is maximal and equals 10. For the second example, when

n = 7

,

r = 3

, and

5 \leq k \leq 7

or

r = 4

and

4 \leq k \leq 7

, the combinatorial coefficient

a_{n, r, k}

is maximal and equals 35.

We perform the computation of the combinatorial coefficient

a_{n, r, k}

for different values of the parameters n, r, and k by the following setups:

First setup:

1.: We fix the three parameters n, r, and k;
2.: we initialize the combinatorial coefficient $a_{n, r, k}$ to 0;
3.: We fix the integers $(n_{1}, \dots, n_{r})$ to (1, …, r), then we guarantee that the difference between two consecutive integers is less than k;
4.: We change $n_{r}$ by a value from $r + 1$ to n, and we stop if $n_{r} - n_{r - 1} > k$ ;
2.: When $n_{r} - n_{r - 1} \leq k$ , then $a_{n, r, k} \leftarrow a_{n, r, k} + 1$ .

Second setup:

1.: We start with the integers $(n_{1}, \dots, n_{r - 2}, n_{r}, n_{r + 1})$ , which equal (1, …, $r - 2$ , r, $r + 1$ ) such that the difference between two consecutive integers is less than or equal to k;
2.: We change $n_{r + 1}$ by a value from $r + 2$ to n, and we stop if $n_{r + 1} - n_{r} > k$ ;
3.: When $n_{r} - n_{r - 1} \leq k$ , then $a_{n, r, k} \leftarrow a_{n, r, k} + 1$ ;
4.: $a_{n, r, k} \leftarrow a_{n, r, k} + 1$ .

Third setup:

1.: We repeat the same procedure from the first and second setups;
2.: The last choice of the integers $(n_{1}, \dots, n_{r})$ is $(1, 1 + k, 2 + k, \dots, n)$ , then we guarantee that the difference between two consecutive integers is less than k;
3.: $a_{n, r, k} \leftarrow a_{n, r, k} + 1$ . If $n_{2} - n_{1} > k$ , we stop the procedure in the third setup.

Final setup:

1.: We repeat the preceding setups for $n_{1}$ from 2 to an integer c such that $c - 1 = k$ ;
2.: $a_{n, r, k} \leftarrow a_{n, r, k} + 1$ .

5. Distribution of the Random Walk $X_{n}$

In this section, we analyze some statistical properties like the limiting distribution, the mean, and the variance of the random walk

X_{n}

using a very nice tool called the probability-generating function. Firstly, we find the relation between the probabilities of the random walk

X_{.}

at two consecutive times n and

n + 1

using the conditional probability. Secondly, we determine a recursive equation between

f_{n} (x)

and

f_{n + 1} (x)

, where

f_{n} (x) = E [x^{X_{n}}]

represents the probability-generating function of

X_{n}

. Next, we use

f_{n} (x)

to prove that the random walk

X_{n}

converges to a shifted geometric distribution with parameter

1 - p

asymptotically. Also, we derive

f_{n} (x)

to obtain the mean and the variance of the random walk

X_{n}

. Start by the definition of the probability mass function of

X_{n}

. Denote, for all

r \in {0, \dots, n + 1}

,

P_{n + 1} (r) = P (X_{n + 1} = r) .

(8)

The following lemma presents the recursion of the probabilities.

Lemma 3.

For all

n \geq 0

, we have

\begin{matrix} P_{n + 1} (r) & = \{\begin{matrix} p P_{n} (r - 1), & if r \geq 1, \\ (1 - p), & if r = 0 . \end{matrix} \end{matrix}

Proof.

This proof is based on the utility of the conditional probability that the Moran walk

X_{.}

equals r at time

n + 1

given that it equals l at time n, then:

1.: For $r \geq 1$ , we have

$P_{n + 1} (r) = P (X_{n + 1} = r | X_{n} = r - 1) \times P (X_{n} = r - 1) = p P_{n} (r - 1),$
2.: For $r = 0$ , we have

$\begin{matrix} P_{n + 1} (r) = & \sum_{l = 0}^{n} P (X_{n + 1} = r, X_{n} = l) \\ = & \sum_{l = 1}^{n} P (X_{n + 1} = r | X_{n} = l) \times P (X_{n} = l) \\ = & (1 - p) \sum_{l = 0}^{n} P_{n} (l) = 1 - p . \end{matrix}$

□

Next, we define the sequence of polynomials

f_{n} (x)

(for

n \in N

) by the fact that the coefficient of

x^{r}

in

f_{n} (x)

is the probability that, at time n, the position of the process

X_{.}

is at level r, that is

f_{n} (x) = E (x^{X_{n}}) = \sum_{r = 0}^{n} x^{r} P_{n} (r) .

(9)

From Equation (9) and Lemma 3, we deduce a recursive equation relating

f_{n + 1} (x)

,

f_{0} (x)

, and

f_{n} (x)

. It is presented in the next proposition.

Proposition 1.

For all

x \in R

, the explicit expression of the sequence of polynomials

f_{n} (x)

satisfies the following recurrence:

f_{n + 1} (x) = (1 - p) + p x f_{n} (x),

(10)

with the initial condition

f_{0} (x) = P_{0} (0) = 1

.

Proof.

Using Equation (9) and for all

n \geq 1

, the function

f_{n + 1} (x)

can be developed as:

f_{n + 1} (x) = P_{n + 1} (0) + \sum_{r = 1}^{n + 1} x^{r} P_{n + 1} (r) .

Due to Lemma 3, we have:

\sum_{r = 1}^{n + 1} x^{r} P_{n + 1} (r) = p x f_{n} (x) .

□

Now, we use Equation (10) to show that the random walk

X_{n}

converges to a shifted geometric distribution with parameter

1 - p

asymptotically. It is introduced in the next theorem.

Theorem 2.

The limiting distribution of the process

X_{n}

converges to a shifted geometric distribution with parameter

1 - p

, with a probability-generating function given by the following: for all

n \geq 0

,

f_{n} (x) = E (x^{X_{n}}) = {(p x)}^{n} + (1 - p) \frac{1 - {(p x)}^{n}}{1 - p x},

(11)

for all

x \in R

, such that

| 1 - p x | < 1

.

Proof.

Iterating the recursive equation defined in (10) n times, we obtain

f_{n} (x) = E (x^{X_{n}}) = {(p x)}^{n} + (1 - p) \frac{1 - {(p x)}^{n}}{1 - p x},

and passing to the limit of

f_{n} (x)

, then we have

lim_{n \to \infty} f_{n} (x) = lim_{n \to \infty} [{(p x)}^{n} + (1 - p) \frac{1 - {(p x)}^{n}}{1 - p x},] = \frac{1 - p}{1 - p x};

this is exactly the generating function of a shifted geometric distribution with parameter

1 - p

. □

To derive the probability-generating function

f_{n} (x)

given in Theorem 2, we deduce the closed expressions of the mean and the variance of the random walk

X_{n}

.

Corollary 3.

The mean and the variance of the random walk

X_{n}

are given by

E (X_{n}) = \frac{p - p^{n + 1}}{1 - p},

(12)

V a r (X_{n}) = \frac{p}{{(1 - p)}^{2}} (1 - p^{n} \{p^{n + 1} + (1 + 2 n) (1 - p)\}) .

Proof.

The first derivative of

f_{n} (x)

defined in Equation (11) with respect to x:

\begin{matrix} \frac{\partial f_{n} (x)}{\partial x} = n p^{n} x^{n - 1} - (1 - p) \frac{n p^{n} x^{n - 1}}{1 - p x} + (1 - p) p \frac{1 - p^{n} x^{n}}{{(1 - p x)}^{2}}, \end{matrix}

(13)

evaluating at

u = 1

,

\frac{\partial f_{n} (x)}{\partial x} |_{x = 1} = \frac{p - p^{n + 1}}{1 - p} .

Using Equation (1), we obtain

E (X_{n}) = \frac{\partial f_{n} (x)}{\partial x} |_{x = 1} = \frac{p - p^{n + 1}}{1 - p} .

To derive the variance of

X_{n}

, we need to define the following sequences of functions:

\begin{matrix} K_{n} (x) = & \frac{x^{n - 1}}{1 - p x}, \\ L_{n} (x) = & \frac{1 - p^{n} x^{n}}{{(1 - p x)}^{2}} . \end{matrix}

Observe that the first and second derivatives of

f_{n} (x)

are given by

\begin{matrix} \frac{\partial f_{n} (x)}{\partial x} = & n p^{n} x^{n - 1} - n p^{n} (1 - p) K_{n} (x) + p (1 - p) L_{n} (x), \\ \frac{\partial^{2} f_{n} (x)}{\partial u^{x}} = & n (n - 1) p^{n} x^{n - 2} - n p^{n} (1 - p) \frac{\partial K_{n} (x)}{\partial x} + p (1 - p) \frac{\partial L_{n} (x)}{\partial x} . \end{matrix}

(14)

The first derivative of

K_{n} (x)

and

L_{n} (x)

with respect to x at

x = 1

is given by

\frac{\partial K_{n} (x)}{\partial x} |_{x = 1} = \frac{(n - 1) x^{n - 2}}{1 - p x} + \frac{p x^{n - 1}}{{(1 - p x)}^{2}} |_{x = 1} = \frac{(n - 1)}{1 - p} + \frac{p}{{(1 - p)}^{2}},

(15)

\frac{\partial L_{n} (u)}{\partial x} |_{x = 1} = \frac{- n p^{n} x^{n - 1}}{{(1 - p x)}^{2}} + \frac{2 p (1 - p^{n} x^{n})}{{(1 - p x)}^{3}} |_{x = 1} = \frac{- n p^{n}}{{(1 - p)}^{2}} + \frac{2 p (1 - p^{n})}{{(1 - p)}^{3}} .

(16)

Combining Equations (14)–(16), we obtain

\begin{matrix} \frac{\partial^{2} f_{n} (x)}{\partial u^{x}} |_{x = 1} = & - \frac{2 n p^{n + 1}}{1 - p} + \frac{2 p^{2} (1 - p^{n})}{{(1 - p)}^{2}} . \end{matrix}

(17)

Applying Equations (1), (12), and (17), we obtain

\begin{matrix} V a r (X_{n}) = & - \frac{2 n p^{n + 1}}{1 - p} + \frac{2 p^{2} (1 - p^{n})}{{(1 - p)}^{2}} + \frac{p - p^{n + 1}}{1 - p} - {(\frac{p - p^{n + 1}}{1 - p})}^{2} . \end{matrix}

□

6. Conclusions and Perspectives

In this current paper, we stated our main result concerning the height of the random walk

X_{n}

. Precisely, we found the joint distribution between the height and the return time statistics. This is given by the following formula:

P (H_{n} \leq k, N_{n} = r) = a_{(n, r, k)} {(1 - p)}^{r} p^{n - r} P (N_{n} = r),

where

a_{(n, r, k)}

is a combinatorial coefficient. Also, we analyzed this coefficient numerically using the R program and took some properties:

1.: If n increases and r and k are fixed, then the combinatorial coefficient $a_{(n, r, k)}$ increases;
2.: The combinatorial coefficient $a_{n, r, k}$ decreases or increases when n and k are fixed, but r increases. This means that there exists a maximal coefficient $a_{n, r, k}$ for special values of n, r, and k;
3.: The combinatorial coefficient $a_{n, r, k}$ increases when n and r are fixed, but k increases. This means that $a_{n, r, k}$ and k are proportionally related.

Also, we observe from Table 3 a very nice property of the combinatorial coefficient

a_{n, r, k}

. This property depends on the parity of n and special values of r and k. Precisely, we mention that, if n is an even number, r equals

(n / 2)

, and k takes any value from

r + 1

to n, then

a_{n, r, k}

is maximal. But, if n is an odd number, r equals

\frac{n - 1}{2}

, and k takes any value from

r + 2

to n or r equals

\frac{n + 1}{2}

and k takes any value from r to n, then

a_{n, r, k}

is maximal.

Furthermore, we studied the statistical properties of the random walk

X_{n}

like the limit distribution, the mean, and the variance. Firstly, we found the closed form of the probability-generating function of the random walk

X_{n}

from the recursive equation defined in Equation (10). Next, we proved that the limiting distribution of

X_{n}

is a shifted geometric distribution with parameter

1 - p

. Finally, we derived the probability-generating function of

X_{n}

to obtain the closed forms of the mean and the variance of

X_{n}

.

In the next work, we plan to work on the following questions:

1.: Can we find a closed form of the probability-generating function of the height?
2.: Can we explicitly calculate the mean and variance of the height statistics using the probability-generating function of $H_{n}$ ?

Funding

We thank the Deputyship for Research and Innovation, the “Ministry of Education”, in Saudi Arabia for funding this research (IFKSUOR3-331-1).

Data Availability Statement

The random samples were generated using the RStudio-2023.09.0 program.

Acknowledgments

The author extends appreciation to the Deputyship for Research and Innovation, the “Ministry of Education”, in Saudi Arabia for funding this research (IFKSUOR3-331-1).

Conflicts of Interest

The author declares no conflict of interest.

References

Sarkodie, S.A. Estimating Ghana’s electricity consumption by 2030: An ARIMA forecast. Energy Sources Part B Econ. Plan. Policy 2017, 12, 936–944. [Google Scholar] [CrossRef]
Chavez, S.G.; Bernat, J.X.; Coalla, H.L. Forecasting of energy production and consumption in Asturias (northern Spain). Energy 1999, 24, 183–198. [Google Scholar] [CrossRef]
Kankal, M.; Akpınar, A.; Kömürcü, M.; Özşahin, I. Modeling and forecasting of Turkey’s energy. Consumption using socio-economic and demographic variables. Appl. Energy 2011, 88, 1927–1939. [Google Scholar] [CrossRef]
Koutroumanidis, T.; Ioannou, K.; Arabatzis, G. Predicting fuelwood prices in Greece with the use of ARIMA models, artificial neural networks and a hybrid ARIMA–ANN model. Energy Policy 2009, 37, 3627–3634. [Google Scholar] [CrossRef]
Banderier, C.; Flajolet, P. Basic analytic combinatorics of directed lattice paths. Theor. Sci. 2002, 281, 37–80. [Google Scholar] [CrossRef]
Flajolet, P.; Sedgewick, R. Analytic Combinatorics; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
Jim Pitman, J.; Yor, M. On the distribution of ranked heights of excursions of a Brownian bridge. Ann. Probab. 2001, 29, 361–384. [Google Scholar] [CrossRef]
Csaki, E.; Hu, Y. Asymptotic properties of ranked heights in Brownian excursions. J. Theor. Probab. 2001, 14, 77–96. [Google Scholar] [CrossRef]
Csaki, E.; Hu, Y. Lengths and heights of random walk excursions. In Discrete Mathematics and Theoretical Computer Science; Discrete Random Walks: Paris, France, 2003; pp. 45–52. [Google Scholar]
Katzenbeisser, W.; Panny, W. The maximal height of simple random walks revisited. J. Satistical Plan. Inference 2002, 101, 149–161. [Google Scholar] [CrossRef]
Banderier, C.; Nicodème, P. Bounded discrete walks. In Discrete Mathematics and Theoretical Computer Science; Discrete Random Walks: Paris, France, 2010; pp. 35–48. [Google Scholar] [CrossRef]
Althagafi, A.; Aguech, R.; Banderier, C. Height of walks with resets and the Moran model. Sémin. Lothar. Comb. 2023; submitted. [Google Scholar]
Gao, K.; Yan, X.; Peng, R.; Xing, L. Economic design of a linear consecutively connected system considering cost and signal loss. IEEE Trans. Syst. Man Cybern. Syst. 2021, 51, 5116–5128. [Google Scholar] [CrossRef]
Gao, K.; Peng, R.; Qu, C.L.; Xing, L.; Wang, S.; Wu, F. Linear system design with application in wireless sensor networks. J. Ind. Inf. Integr. 2022, 27, 100279. [Google Scholar] [CrossRef]
Aguech, R.; Abdelkader, M. Two-Dimensional Moran Model: Final Altitude and Number of Resets. Mathematics 2023, 11, 3774. [Google Scholar] [CrossRef]

Table 1. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 7

.

Table 1. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 7

.

$r = 4, k = 2$				$r = 5, k = 2$					$r = 5, k = 3$					$r = 6, k = 2$
$n_{1}$	$n_{2}$	$n_{3}$	$n_{4}$	$n_{1}$	$n_{2}$	$n_{3}$	$n_{4}$	$n_{5}$	$n_{1}$	$n_{2}$	$n_{3}$	$n_{4}$	$n_{5}$	$n_{1}$	$n_{2}$	$n_{3}$	$n_{4}$	$n_{5}$	$n_{6}$
1	2	3	5	1	2	3	4	5	1	2	3	4	5	1	2	3	4	5	6
1	2	4	6	1	2	3	4	6	1	2	3	4	6	1	2	3	4	5	7
1	3	4	5	1	2	3	5	6	1	2	3	4	7	1	2	3	4	6	7
1	3	4	6	1	2	3	5	7	1	2	3	5	6	1	2	3	5	6	7
1	3	5	6	1	3	4	5	6	1	2	3	5	7	1	2	4	5	6	7
1	3	5	7	1	3	4	5	7	1	2	3	6	7	1	3	4	5	6	7
2	3	4	5	1	3	4	6	7	1	2	4	5	6	2	3	4	5	6	7
2	3	4	6	2	3	4	5	6	1	2	4	5	7
2	3	5	6	2	3	4	5	7	1	2	4	6	7
2	3	5	7	2	3	4	6	7	1	3	4	5	6
2	4	5	6	2	4	5	6	7	1	3	4	5	7
2	4	5	7	3	4	5	6	7	1	3	4	6	7
2	4	6	7						1	4	5	6	7
3	4	5	6						2	3	4	5	6
3	4	5	7						2	3	4	5	7
3	4	6	7						2	3	4	6	7
3	5	6	7						2	3	5	6	7
									2	4	5	6	7

Table 2. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 5

.

Table 2. All possibilities of the integers

n_{1}, \dots, n_{r}

for

n = 5

.

$r = 3, k = 3$			$r = 3, k = 2$			$r = 4, k = 2$
$n_{1}$	$n_{2}$	$n_{3}$	$n_{1}$	$n_{2}$	$n_{3}$	$n_{1}$	$n_{2}$	$n_{3}$	$n_{4}$
1	2	3	1	2	3	1	2	3	4
1	2	4	1	2	4	1	2	3	5
1	2	5	1	3	4	1	2	4	5
1	3	4	1	3	5	1	3	4	5
1	3	5	2	3	4	2	3	4	5
1	4	5	2	3	5
2	3	4	2	4	5
2	3	5	3	4	5
2	4	5
3	4	5

Table 3. The combinatorial coefficient

a_{(n, r, k)}

for different values of n.

Table 3. The combinatorial coefficient

a_{(n, r, k)}

for different values of n.

n	r	k	$a_{(n, r, k)}$	n	r	k	$a_{(n, r, k)}$
4	2	2	5	6	3	5	20
4	2	3	6	6	4	2	11
4	2	4	6	6	4	3	15
4	3	1	2	6	4	4	15
4	3	2	4	6	5	2	6
4	3	3	4	7	2	2	1
5	2	2	5	7	2	3	9
5	2	3	9	7	2	4	16
5	2	4	10	7	2	5	20
5	2	5	10	7	2	6	21
5	3	1	1	7	3	2	6
5	3	2	8	7	3	3	25
5	3	3	10	7	3	4	33
5	3	4	10	7	3	5	35
5	4	1	2	7	3	6	35
5	4	2	5	7	4	2	18
5	4	3	5	7	4	3	32
6	2	2	3	7	4	4	35
6	2	3	10	7	4	5	35
6	2	4	14	7	5	2	17
6	2	5	15	7	5	3	21
6	3	2	8	7	5	4	21
6	3	3	18	7	5	5	21
6	3	4	20	7	5	6	21

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

On the Height of One-Dimensional Random Walk

Abstract

1. Introduction

2. Definitions and Presentation of the Model

3. Main Result

4. Simulation of the Combinatorial Coefficient $a_{n, r, k}$

5. Distribution of the Random Walk $X_{n}$

6. Conclusions and Perspectives

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

n	r	k	$a_{(n, r, k)}$	n	r	k	$a_{(n, r, k)}$
4	2	2	5	6	3	5	20
4	2	3	6	6	4	2	11
4	2	4	6	6	4	3	15
4	3	1	2	6	4	4	15
4	3	2	4	6	5	2	6
4	3	3	4	7	2	2	1
5	2	2	5	7	2	3	9
5	2	3	9	7	2	4	16
5	2	4	10	7	2	5	20
5	2	5	10	7	2	6	21
5	3	1	1	7	3	2	6
5	3	2	8	7	3	3	25
5	3	3	10	7	3	4	33
5	3	4	10	7	3	5	35
5	4	1	2	7	3	6	35
5	4	2	5	7	4	2	18
5	4	3	5	7	4	3	32
6	2	2	3	7	4	4	35
6	2	3	10	7	4	5	35
6	2	4	14	7	5	2	17
6	2	5	15	7	5	3	21
6	3	2	8	7	5	4	21
6	3	3	18	7	5	5	21
6	3	4	20	7	5	6	21

n	r	k	$a_{(n, r, k)}$	n	r	k	$a_{(n, r, k)}$
4	2	2	5	6	3	5	20
4	2	3	6	6	4	2	11
4	2	4	6	6	4	3	15
4	3	1	2	6	4	4	15
4	3	2	4	6	5	2	6
4	3	3	4	7	2	2	1
5	2	2	5	7	2	3	9
5	2	3	9	7	2	4	16
5	2	4	10	7	2	5	20
5	2	5	10	7	2	6	21
5	3	1	1	7	3	2	6
5	3	2	8	7	3	3	25
5	3	3	10	7	3	4	33
5	3	4	10	7	3	5	35
5	4	1	2	7	3	6	35
5	4	2	5	7	4	2	18
5	4	3	5	7	4	3	32
6	2	2	3	7	4	4	35
6	2	3	10	7	4	5	35
6	2	4	14	7	5	2	17
6	2	5	15	7	5	3	21
6	3	2	8	7	5	4	21
6	3	3	18	7	5	5	21
6	3	4	20	7	5	6	21

On the Height of One-Dimensional Random Walk

Abstract

1. Introduction

2. Definitions and Presentation of the Model

3. Main Result

4. Simulation of the Combinatorial Coefficient a n , r , k

5. Distribution of the Random Walk X n

6. Conclusions and Perspectives

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

4. Simulation of the Combinatorial Coefficient $a_{n, r, k}$

5. Distribution of the Random Walk $X_{n}$

n	r	k	$a_{(n, r, k)}$	n	r	k	$a_{(n, r, k)}$
4	2	2	5	6	3	5	20
4	2	3	6	6	4	2	11
4	2	4	6	6	4	3	15
4	3	1	2	6	4	4	15
4	3	2	4	6	5	2	6
4	3	3	4	7	2	2	1
5	2	2	5	7	2	3	9
5	2	3	9	7	2	4	16
5	2	4	10	7	2	5	20
5	2	5	10	7	2	6	21
5	3	1	1	7	3	2	6
5	3	2	8	7	3	3	25
5	3	3	10	7	3	4	33
5	3	4	10	7	3	5	35
5	4	1	2	7	3	6	35
5	4	2	5	7	4	2	18
5	4	3	5	7	4	3	32
6	2	2	3	7	4	4	35
6	2	3	10	7	4	5	35
6	2	4	14	7	5	2	17
6	2	5	15	7	5	3	21
6	3	2	8	7	5	4	21
6	3	3	18	7	5	5	21
6	3	4	20	7	5	6	21