1. Introduction
Let
be a discrete random walk with one dimension defined as follows: The walk starts from the origin at Time 0. After one unit of time, the process
shifts by one positive unit with probability
or resets to 0 with probability
. We provide three examples of the evolution of our random walk until time
n = 10:
In the above examples, the height of the preceding walks are equal to 7, 4, and 2, respectively.
In this work, we are interested in analyzing the height statistics, denoted by
, of the random walk
. Our analysis of the height is based on the combinatorial analysis of the coefficient
, representing the number of ways to choose
r distinct integers
satisfying the following conditions:
such that
and
and based on the probability distribution of the return time, denoted by
, of the random walk
, given by
Our contribution in this current paper is finding a closed form of the distribution of the height statistics
using a combinatorial analysis of the coefficient
and the distribution of the return time
of the random walk
. The closed form of the probability distribution of
is given by:
Furthermore, we study the statistical properties of the random walk , like the mean, the variance, and the limiting distribution of . Precisely, we prove that the limiting distribution of the random walk is a shifted geometric distribution with parameter , and we give the closed forms of the mean and the variance of .
This analysis of the height statistics is very important, and it is applicable to many aspects of renewable energy. For example, electricity today plays a very important role in daily activities and is very essential for transport, education, healthcare, and many other sectors. For this reason, controlling electricity consumption is necessary and is performed by estimating the maximum amount of electricity consumption. Electricity consumption is estimated via statistical methods such as time series models [
1,
2], regression models [
3], and ARIMA models [
4]. Furthermore, this maximum amount is similar to the height of the electricity consumption in a given period of time.
In the literature, the statistical properties of the height statistics are studied in one dimension via the kernel method and singularity analysis (see [
5,
6]). For example, we can mention the distribution of the ranked heights of the excursions of a Brownian bridge, investigated by Pitman and Yor in [
7]. Similarly, Csaki and Hu analyzed the asymptotic properties of ranked heights in Brownian excursions in [
8]. Also, Csaki and Hu analyzed the lengths and heights of random walk excursions in [
9]. Furthermore, Katzenbeisser and Panny studied the maximal height of simple random walks, which were revisited in [
10]. In addition, Banderier and Nicodème [
11] studied the height of discrete bridges/meanders/excursions for bounded discrete walks. Also, Aguech, Althagafi, and Banderier in [
12] analyzed the height of walks with resets and the Moran model.
This paper is organized as follows. In
Section 2, we introduce our model in detail and define the return time and the height statistics, denoted by
, of our random walk
. In
Section 3, we present our main result concerning the distribution of the height statistics of the random walk
. In
Section 4, we use the R program to find all possibilities of the integers
satisfying the conditions defined in Equation (
4) and compute the combinatorial coefficient for different values of
n,
r, and
k. In
Section 5, we prove that the limiting distribution of the random walk
is a shifted geometric distribution with parameter
. Also, we use the probability-generating function of the random walk
to obtain their mean and variance. In
Section 6, we present some conclusions concerning our results and some perspectives.
3. Main Result
The goal of this section is to obtain the distribution of
. To reach this goal, we apply at first a very important result concerning the distribution of the return time
of the random walk
(see Theorem 3 in [
15]). For the second setup, we analyze the joint distribution of
using the conditional probability and the marginal distribution of the return time
. Finally, we deduce the marginal distribution of
.
Now, we present a very important result concerning the distribution of the return time, , of the random walk .
Lemma 1 ([
15]).
The exact distribution of is given by Consider the following event
representing the height statistics
, bounded by
k, given that the return time
equals
r of the random walk
:
where
are i.i.d. geometric random variables with parameter
and
such that
and
. We define the combinatorial coefficient
:
representing the number of ways to choose
r distinct integers
satisfying the conditions in Equation (
4).
Remark 1. The combinatorial coefficient depends on the parameters n, r, and k, where n represents the length of the random walk and a k integer less than n.
We present a closed form of the combinatorial coefficient in the next lemma.
Lemma 2. The coefficient is given bywhere stands for the coefficient of in the power series . Proof. For all
, let
and
. It is obvious that identifying
is equivalent to identifying
, and then:
□
Remark 2. From Equation (6), the combinatorial coefficient is the coefficient of in the power series . Next, we give some results about the height of the random walk . It represents the maximal height attained by the walk , in all of the past from 1 to n. This means that, for all n and for all , the values of are between 0 and k. For this purpose, firstly, we compute the joint distribution of the discrete return time and the height of the random walk . Secondly, for all and for all , we find the conditional probability of the height bounded by the integer k given that the return time equals r. Furthermore, we determine the probability of the intersection between the events and . Finally, we deduce the marginal distribution of .
The next theorem leads to the conditional probability that the height of the random walk is bounded by k given that the return time equals r.
Theorem 1. The conditional distribution of , given , is given by Proof. Using (
3), we have
where
,
, and
.
Using the conditional probability, we obtain
□
From Theorem 1, we deduce the joint distribution of the following events and .
Corollary 1. The joint distribution of satisfies the following relation:where is defined in Lemma 1. Proof. Applying Lemma 1 and Theorem 1, we obtain
where
is defined in Lemma 1. □
We deduce here some information about the distribution of
. By summing over
r in Equation (
7), we obtain the marginal distribution of
, as follows:
Corollary 2. The probability distribution of the height statistics of the random walk is given by the following equation:where is defined in Equation (4). 4. Simulation of the Combinatorial Coefficient
In this section, we use the R program to compute the combinatorial coefficient for different values of n, r, and k. In the first case, we find the value of the coefficient and count all the possibilities of the integers for , and . In the second case, we determine the possibilities of the integers for , and . Also, we list the values of the combinatorial coefficient for different values of n (4, 5, 6, and 7), r (2, 3, 4, and 5), and k (2, 3, 4, 5, and 6).
In
Table 1, we find all the possibilities of the integers
under the conditions defined in Equation (
4) for different values of
r and
k when
n equals 7 and compute the corresponding combinatorial coefficient
. Precisely, in the first case, when
n and
k are fixed at 7 and 2, respectively, and the number
r takes values of 4, 5, and 6, then the combinatorial coefficient
takes the values 17, 12, and 7. This means that, when
r increases, then the coefficient
decreases. In the second case, if
n and
k equal 7 and 2 and
r increases from 2 to 3, then the coefficient
increases from 12 to 18, respectively. This means that the coefficient
increases when
k increases. Also, from
Table 1, we observe that
is fixed at 7 when
r is near
n and
k byat least 2 (
and
).
Table 2 lists all the possibilities of the integers
under the conditions defined in Equation (
4) for different values of
r and
k when
n equals 5. Also, we deduce the value of the combinatorial coefficient
for each list. Furthermore,
Table 2 shows two cases of the increasing of the combinatorial coefficient
, which depends on the parameters
r (the number of integers
) and
h (the bound of the height
of the random walk
). Precisely, in the first case, when
n and
k are fixed at 5 and 2 and
r increases from 3 to 4, then the combinatorial coefficient
decreases from 8 to 5. This means that the coefficient
decreases when
r increases. In the second case, if
n and
r are equal to 5 and 3 and
k increases from 2 to 3, then the coefficient
increases from 8 to 10, respectively. This means the coefficient
increases when
k increases for fixed
n and
r.
Table 3 shows that the combinatorial coefficient
depends on the three parameters
n,
r, and
K. Precisely, this coefficient is increasing or decreasing if the parameters
n,
r, and
p change. From
Table 3, we distinguish three cases concerning the computation of
:
In the first case, when n is increasing, r and k are fixed, then we observe that the coefficient increases. For example, if n takes values of 4, 5, 6, and 7 and r and k equal 2 and 4, then takes values of 6, 10, 14, and 16, respectively. Sometimes, this increasing of is very quick, and it takes values of 4, 5, 15, and 32 when n takes values of 4, 5, 6, and 7 and r and k equal 4 and 3, respectively. But, sometimes, decreases under the same conditions. For example, takes values of 5, 5, 3, and 1 when n equals 4, 5, 6, and 7 and r and k equal 2.
In the second case, the combinatorial coefficient decreases or increases when n and k are fixed but r increases. This means that there exists a maximal coefficient for special values of n, r, and k. Firstly, if n equals 5, k equals 3, and r takes values of 2, 3, and 4, then the coefficient increases from 9 to 10 and decreases to 5, respectively. Secondly, the coefficient increases from 3 to 11 and decreases to 6 when n equals 6, k equals 2, and r takes values of 2, 3, 4, and 5. Finally, the coefficient increases from 16 to 35 and decreases to 21 when n equals 7, k equals 4, and r takes values of 2, 3, 4, and 5.
In the third case, the combinatorial coefficient increases when n and r are fixed, but k increases. This means that and k are proportionally related. Firstly, if n equals 5, r equals 3, but k takes values of 1, 2, 3, and 4, then the coefficient equals 1, 8, 10, and 10, respectively. Secondly, if n equals 6, r equals 4, but k takes values of 2, 3, and 4, then the coefficient equals 11, 15, and 10, respectively. Finally, if n equals 7, r equals 3, but k takes values of 2, 3, 4, 5, and 6, then the coefficient equals 6, 25, 33, 35, and 35, respectively.
Furthermore,
Table 3 shows that there exists a maximal combinatorial coefficient
for special values of
n,
r, and
k. Firstly,
equals 6 when
,
, and
k increases from 3 to
n. Secondly,
equals 10 when
,
, and
k increases from 4 to
n or
and
k increases from 3 to
n. Next,
equals 20 when
,
, and
k increases from 4 to
n. Finally,
equals 35 when
,
, and
k increases from 5 to
n or
and
k increases from 4 to
n.
Finally, we observe a very nice property of the combinatorial coefficient . This property depends on the parity of n and the length of the random walk . Precisely, we mention that, if n is an even number, r equals , and k takes any value from to n, then is maximal. For example, when , , and or , , and , the combinatorial coefficient is maximal and equals 10 and 20, respectively. But, if n is an odd number, r equals and k takes any value from to n or r equals and k takes any value from r to n, then is maximal. For the first example, when , , and or and , the combinatorial coefficient is maximal and equals 10. For the second example, when , , and or and , the combinatorial coefficient is maximal and equals 35.
We perform the computation of the combinatorial coefficient for different values of the parameters n, r, and k by the following setups:
First setup:
- 1.
We fix the three parameters n, r, and k;
- 2.
we initialize the combinatorial coefficient to 0;
- 3.
We fix the integers to (1, …, r), then we guarantee that the difference between two consecutive integers is less than k;
- 4.
We change by a value from to n, and we stop if ;
- 2.
When , then .
Second setup:
- 1.
We start with the integers , which equal (1, …, , r, ) such that the difference between two consecutive integers is less than or equal to k;
- 2.
We change by a value from to n, and we stop if ;
- 3.
When , then ;
- 4.
.
Third setup:
- 1.
We repeat the same procedure from the first and second setups;
- 2.
The last choice of the integers is , then we guarantee that the difference between two consecutive integers is less than k;
- 3.
. If , we stop the procedure in the third setup.
Final setup:
- 1.
We repeat the preceding setups for from 2 to an integer c such that ;
- 2.
.