Bridging Extremes: The Invertible Bimodal Gumbel Distribution

Otiniano, Cira G.; Silva, Eduarda B.; Matsushita, Raul Y.; Silva, Alan

doi:10.3390/e25121598

Open AccessArticle

Bridging Extremes: The Invertible Bimodal Gumbel Distribution

¹

Department of Statistics, University of Brasília, Brasília 70910-900, Brazil

²

Institute of Mathematics and Statistics, São Paulo University, São Paulo 05508-220, Brazil

^*

Author to whom correspondence should be addressed.

^†

The authors contributed equally to this work.

Entropy 2023, 25(12), 1598; https://doi.org/10.3390/e25121598

Submission received: 25 October 2023 / Revised: 16 November 2023 / Accepted: 24 November 2023 / Published: 29 November 2023

(This article belongs to the Special Issue Stochastic Models and Statistical Inference: Analysis and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

This paper introduces a novel three-parameter invertible bimodal Gumbel distribution, addressing the need for a versatile statistical tool capable of simultaneously modeling maximum and minimum extremes in various fields such as hydrology, meteorology, finance, and insurance. Unlike previous bimodal Gumbel distributions available in the literature, our proposed model features a simple closed-form cumulative distribution function, enhancing its computational attractiveness and applicability. This paper elucidates the behavior and advantages of the invertible bimodal Gumbel distribution through detailed mathematical formulations, graphical illustrations, and exploration of distributional characteristics. We illustrate using financial data to estimate Value at Risk (VaR) from our suggested model, considering maximum and minimum blocks simultaneously.

Keywords:

Gumbel distribution; bimodality; extreme value theory; value at risk

1. Introduction

Bimodal heavy-tailed distributions are powerful analytical tools for capturing the complex nature of phenomena subject to extreme events in hydrology, meteorology, insurance, reliability, and finance, among other disciplines. Two main features characterize it. Firstly, it exhibits bimodality, meaning it has two distinct peaks or modes, indicating the presence of two prominent regimes within the overall data set. Secondly, it has heavy tails, which means a higher likelihood of occurrences of extreme values than with light tails distributions.

Bimodal heavy-tailed distributions are related to René Thom’s catastrophe theory, focusing on systems characterized by sudden, dramatic changes and a propensity for extreme events, e.g., [1,2]. Catastrophe theory deals with situations where small parameter changes can lead to abrupt shifts in the system’s state. This concept aligns with bimodal distributions, where a system may switch between two states or regimes. The heavy-tailed aspect of these distributions reflects the likelihood of rare, extreme events, mirroring the focus of catastrophe theory on significant, sudden changes. Both concepts encapsulate the unpredictability and uncertainty inherent in the systems they describe. Catastrophe theory provides a mathematical framework for understanding these dynamics, which can manifest statistically as bimodal, heavy-tailed distributions. This connection is especially relevant in economics and finance, where catastrophic shifts and heavy-tailed distributions are frequently observed. Essentially, the interplay between these concepts helps us to understand and model systems where small inputs or changes can lead to significant, unpredictable, and often extreme outputs or shifts.

Unlike standard bimodal distributions, these heavy-tailed versions give significant weight to extreme events, allowing for a more accurate representation of systems where outliers or “black swans” play a critical role. For example, one can perform inference over tails of financial returns by fitting an appropriate limiting distribution over data that exceeds a fixed threshold, where the dual peaks in such a distribution can indicate states or types of behavior within the system [3]. In addition, typically, a bimodal distribution exhibits higher entropy compared to an unimodal distribution because two distinct modes add complexity and unpredictability to the system. In this way, the concept of entropy dovetails nicely with its inherent complexities, providing a quantitative lens through which to assess and strategize based on this kind of data.

Meanwhile, under certain conditions, statistics of extreme events are described by theoretical distributions. For example, the Gumbel, also known as the extreme value or Fisher–Tippet type I distribution, is a limiting distribution for the maximum (or the minimum) of a sufficiently large simple random sample. This result arises from the Fisher–Tippett–Gnedenko theorem, which states that the normalized maximum of a sequence of such random samples converges to one of three types of extreme value distribution: Gumbel, Fréchet, or Weibull. The Gumbel case is suitable for some typical families of populational distributions, such as logistic, Gaussian, and gamma.

Nevertheless, practical situations demand more, and therefore we find several generalizations of Gumbel to make it more flexible, for example, the two-component extreme value distribution or mixture of two Gumbel distributions [4], the exponentiated Gumbel [5], the transmuted extreme value [6], the generalized Gumbel [7], the generalized three-parameter Gumbel [8], the beta-Gumbel [9], the Kumaraswamy-Gumbel [10], and the exponentiated generalized Gumbel [11]. However, some lead to non-identifiable models because of observationally equivalent parameterizations [12]. There are other closely related models such as the exponentiated Gumbel Type-2 [13], the Kumaraswamy generalized exponentiated Gumbel type-2 [14], the bimodal generalized extreme value (GEV) [15], and a bimodal Gumbel distribution applied to environmental data [16]. However, the disadvantage of the latter model is that its cumulated distribution function does not have a simple closed form.

In this work, we put forward an invertible bimodal Gumbel distribution whose cumulated distribution function has a simple closed form, making it more attractive for computational procedures and more flexible in applications (Section 2). Our suggested distribution allows us to model both maximum and minimum simultaneously, while the classical Gumbel distribution describes only one of the extremes (maximum or minimum).

After discussing the maximum likelihood estimation of the parameters from simulated data (Section 3), we illustrate our approach using two financial data sets to estimate the value at risk (VaR) in Section 4. As we are interested in studying the probability distribution’s tails, we perform the block maxima technique among the available tools to find the appropriate cutoff. Instead of the usual power law distribution [3,17,18], we suggest a bimodal Gumbel distribution as a candidate model to describe the tail behavior of financial returns.

2. Main Results

Also known as type I extreme value distribution, the Gumbel distribution is one of the limit distributions of normalized maximum (or minimum) statistics [19], belonging to the class of the GEV distribution [20]. We denote the Gumbel random variable Y with a location parameter

μ \in R

and a scale parameter

σ > 0

as

Y \sim G (\cdot; μ, σ)

. The forms of its probability density function (PDF) and cumulative distribution function (CDF) are, respectively,

\begin{matrix} g (y; μ, σ) = \frac{1}{σ} exp \{- (\frac{y - μ}{σ}) - exp [- (\frac{y - μ}{σ})]\}, \forall y \in R \end{matrix}

(1)

and

\begin{matrix} G (y; σ, μ) & = exp [- exp (- \frac{y - μ}{σ})], \forall y \in R . \end{matrix}

(2)

Let us introduce our suggested generalization of the Gumbel distribution left open by [15] in the following way. Considering the transformation,

\begin{matrix} T_{μ, δ} (x) = x {| x |}^{δ} + μ, x \in R, δ > - 1, μ \in R, \end{matrix}

(3)

after plugging it into (1) and (2), we obtain the invertible bimodal Gumbel distribution X with CDF and PDF given by, respectively,

\begin{matrix} F_{IBG} (x; μ, σ, δ) & = & G (T_{μ, δ} (x); σ) \\ = & exp \{- exp [- \frac{(x | x |^{δ}) + μ}{σ}]\}, x \in R \end{matrix}

(4)

and

\begin{matrix} f_{IBG} (x; μ, σ, δ) & = & \frac{1}{σ} {(δ + 1) | x |}^{δ} exp \{- (\frac{(x | x |^{δ}) + μ}{σ}) - exp [- \frac{(x | x |^{δ}) + μ}{σ}]\}, \end{matrix}

(5)

where

δ > 0

and

μ \in R

are shape parameters and

σ > 0

is a scale parameter.

We shall denote it as

X \sim F_{IBG} (\cdot; μ, σ, δ)

throughout this paper.

To illustrate the role of its parameters, Figure 1 depicts the effect of the shape parameter

δ

. When

δ = 0

, the model (5) reduces to the unimodal Gumbel (1). The density becomes bimodal for

δ > 0

, and the modes’ separation rises as

δ

increases. Figure 2 contrasts the PDF shapes with negative and positive values of

μ

, illustrating its role as a location parameter (left) or shape parameter (right). Finally, Figure 3 shows that

σ

remains the scale parameter.

Some Distributional Characteristics

Modes 1.

Straightforwardly from the concept of the modes of

X \sim_{G B} (\cdot; μ, σ, δ)

, one can find that they are the solution of the differential equation

\begin{matrix} \frac{T_{μ, δ}^{″} (x)}{{[T_{μ, δ}^{'} (x)]}^{2}} = \frac{1}{σ} - \frac{e^{- \frac{T_{μ, δ} (x)}{σ}}}{σ}, \end{matrix}

(6)

where

\begin{matrix} T_{μ, δ}^{'} (x) = (δ + 1) {| x |}^{δ} \end{matrix}

(7)

and

\begin{matrix} T_{μ, δ}^{″} (x) = sign (x) (δ + 1) δ {| x |}^{δ - 1}, \end{matrix}

(8)

with

sign (x) = x / | x |

as the sign function.

Moments 2.

We can write down the kth moment of X as

\begin{matrix} E (X^{k}) & = & \int_{- \infty}^{\infty} x^{k} f_{IBG} (x) d x \\ = & \int_{- \infty}^{\infty} x^{k} g (T_{μ, δ} (x), 0, μ) T_{μ, δ}^{'} (x) d x . \end{matrix}

(9)

By substitution

y = T (x)

and taking the inverse function

T^{- 1} (y) = {s g n (| y - μ |) | y - μ |}^{\frac{1}{δ + 1}}

, we can express (9) in terms of a unimodal Gumbel Y, as defined in (2) as

\begin{matrix} E (X^{k}) = E ({| Y - μ |}^{\frac{k}{δ + 1}} . I_{[μ, + \infty)}) + {(- 1)}^{k} E ({| Y - μ |}^{\frac{k}{δ + 1}} . I_{(- \infty, μ)}), \end{matrix}

(10)

where

I_{A}

is the indicator function of an event A.

Moment-Generating Type Function 3.

The moment-generating function (MGF) encapsulates information about the distributional moments, being a helpful tool to characterize an IBG random variable X. For our convenience, however, we consider its power transformation

X^{δ + 1}

and derive its MGF shown in (13) as follows. From its definition,

\begin{matrix} φ_{X} (t) = E (e^{t X^{δ + 1}}) & = & \int_{- \infty}^{\infty} \frac{e^{t x^{1 + δ}}}{σ} e^{\frac{- T (x)}{σ}} e^{- e^{\frac{- T (x)}{σ}}} T^{'} (x) d x . \end{matrix}

(11)

Considering the substitution

y = e^{\frac{- T (x)}{σ}}

and the expression

T^{- 1} (ln y^{- σ}) = s g n (| ln y^{- σ} - μ |) | ln y^{- σ} {- μ |}^{\frac{1}{δ + 1}}

, we can rewrite the integral (11) as being

\begin{matrix} φ_{X} (t) & = & \int_{0}^{e^{\frac{μ}{σ}}} e^{t (- σ ln y - μ)} e^{- y} d y + \int_{e^{\frac{μ}{σ}}}^{\infty} e^{{(- 1)}^{1 + δ} t (μ + σ ln y)} e^{- y} d y \\ = & e^{- t μ} \int_{0}^{e^{\frac{μ}{σ}}} y^{- t σ} e^{- y} d y + e^{{(- 1)}^{1 + δ} (t μ)} \int_{e^{\frac{μ}{σ}}}^{\infty} y^{t σ {(- 1)}^{1 + δ}} e^{- y} d y, \end{matrix}

(12)

that is,

\begin{matrix} φ_{X} (t) = e^{- t μ} Γ (1 - t σ; e^{\frac{μ}{σ}}) + e^{t μ {(- 1)}^{δ + 1}} γ (1 + {(- 1)}^{δ + 1} t σ; e^{\frac{μ}{σ}}), \end{matrix}

(13)

where

\begin{matrix} γ (a; x) = \int_{x}^{\infty} t^{a - 1} e^{- t} d t and Γ (a; x) = \int_{0}^{x} t^{a - 1} e^{- t} d t, \end{matrix}

(14)

are the upper and lower incomplete Gamma functions. Now, we can retrieve the moments of

X^{δ + 1}

by taking derivatives of the cumulant-generating type function,

C_{X} = ln φ_{X} (t)

. As usual, from the expansion

ln z = (z - 1) - {(z - 1)}^{2} / 2 + {(z - 1)}^{3} / 3 - \dots

, we find

\begin{matrix} C_{X} (t) = (E (e^{t X^{δ + 1}}) - 1) - \frac{{(E (e^{t X^{δ + 1}}) - 1)}^{2}}{2} + \frac{{(E (e^{t X^{δ + 1}}) - 1)}^{3}}{3} - \dots \end{matrix}

(15)

Thus, for example, we get the first two moments of

X^{δ + 1}

by taking the derivatives

\begin{matrix} \frac{d}{d t} C_{X} (t) ∣_{t = 0} & = & E (X^{δ + 1}) \\ \frac{d^{2}}{d t^{2}} C_{X} (t) ∣_{t = 0} & = & E (X^{2 (δ + 1)}) - {[E (X^{δ + 1})]}^{2} . . \end{matrix}

(16)

To benchmark our result, if

δ = 0

, from (13) and (16), we find

E (X) = σ γ - μ

and

V a r (X) = σ^{2} π^{2} / 6

, because

Γ (a; x) + γ (a; x) = Γ (a)

, where

Ψ (x) = d ln (Γ (x)) / d x = Γ^{'} (x) / Γ (x)

,

Ψ (1) = γ

is the Euler’s constant, and

Ψ^{'} (1) = \frac{π^{2}}{6}

. Thus, we get the mean and variance of the basic Gumbel model as expected, which confirms the accuracy of (13).

Quantiles 4.

Sampling by the inverse transform is a basic method with which to generate a pseudo-random variate of X, based on its quantile function of

F_{I B G}

. While the bimodal Gumbel model introduced previously by [16] does not provide a simple way to perform this method, our suggested model (5) yields a simple expression for the quantile function. Since X is an absolutely continuous random variable, denoting the cumulative probability as the standard uniform random variable

F_{I B G} (x_{q}; μ, σ, δ) = q \sim U [0, 1]

, we obtain the random quantile function as

\begin{matrix} X_{q} = F_{I B G}^{- 1} (q) & = & \{\begin{matrix} {(- σ ln (- ln (q)) - μ)}^{\frac{1}{1 + δ}}, & q > e^{- e^{- \frac{μ}{σ}}}, \\ - {(μ + σ ln (- ln (q)))}^{\frac{1}{1 + δ}}, & q < e^{- e^{- \frac{μ}{σ}}} . \end{matrix} \end{matrix}

(17)

Entropy 5.

The differential entropy of the bimodal Gumbel distribution

X \sim F_{I B G} (\cdot; μ, σ, δ) = G (Y (T_{μ, δ} (.), σ))

, where

Y \sim G (., σ)

denotes the basic Gumbel distribution, is given by

\begin{matrix} H (X) & = & - \int_{- \infty}^{\infty} f_{I B G} (x; μ, σ, δ) ln [f_{I B G} (x; μ, σ, δ)] d x \\ = & \int_{- \infty}^{\infty} g (T_{μ, δ} (x); σ) \{exp [- \frac{T_{μ, δ} (x)}{σ}] + \frac{T_{μ, δ} (x)}{σ} + ln (\frac{T_{μ, δ}^{'} (x)}{σ})\} \frac{T_{μ, δ}^{'} (x)}{σ} d x \\ = & \int_{- \infty}^{\infty} g (T_{μ, δ} (x); σ) exp [- \frac{T_{μ, δ} (x)}{σ}] \frac{T_{μ, δ}^{'} (x)}{σ} d x + \int_{- \infty}^{\infty} g (T_{μ, δ} (x); σ) \frac{T_{μ, δ} (x)}{σ} \frac{T_{μ, δ}^{'} (x)}{σ} d x \\ + & \int_{- \infty}^{\infty} g (T_{μ, δ} (x); σ) ln (\frac{T_{μ, δ}^{'} (x)}{σ}) \frac{T_{μ, δ}^{'} (x)}{σ} d x, \end{matrix}

(18)

where g is the PDF of Y, as defined in (2). By substituting

y = T (x)

in (18), we obtain

\begin{matrix} H (X) & = & \frac{1}{σ} \int_{- \infty}^{\infty} exp [- \frac{y}{σ}] g (y) d y + \frac{1}{σ^{2}} \int_{- \infty}^{\infty} y g (y) d y \\ + & \frac{1}{σ} \int_{- \infty}^{\infty} ln (\frac{δ + 1}{σ}) g (y) d y + \frac{1}{σ} \int_{- \infty}^{\infty} ln ({| y - μ |}^{\frac{δ}{δ + 1}}) d y \\ = & 1 + \frac{γ}{σ} + ln {(\frac{δ + 1}{σ})}^{\frac{1}{σ}} + ln ({E | Y - μ |}^{\frac{δ}{δ + 1}}) . \end{matrix}

(19)

3. Parameter Estimation

This section discusses the maximum likelihood (ML) estimation method to estimate the vector parameters

Θ = (μ, σ, δ)

. Let

x_{1}, \dots, x_{n}

be realizations independent copies of a random variable with PDF as defined in (5). The log-likelihood function is

\begin{matrix} l (Θ; x_{1}, x_{2}, \dots, x_{n}) & = & \sum_{i = 1}^{n} ln f (x_{i}; Θ) \\ = & n ln (δ + 1) - n ln σ + δ \sum_{i = 1}^{n} ln | x_{i} | - \frac{\sum_{i = 1}^{n} x_{i} {| x_{i} |}^{δ} + μ}{σ} \\ - & \sum_{i = 1}^{n} e^{- \frac{x_{i} {| x_{i} |}^{δ} + μ}{σ}} . \end{matrix}

(20)

This log-likelihood function is well-defined across the entire parameter space and is continuous and differentiable for the vector parameters. Additionally, the family of distributions

F_{IBG}

is identifiable, meaning different parameters should lead to distinct probability distributions, ensuring a unique maximum for the likelihood function.

Ahmad et al. (2010) [21] showed the identifiability of the finite mixture of Gumbel distributions; in particular, the family of a Gumbel component

F_{G} = {G : G = G (., μ, σ) as (2)}

is identifiable. Based on this, we have that the IBG family,

F_{IBG} = {F_{IBG} : F_{IBG} (., μ, σ, δ) as (4)}

, is identifiable. It must be proven that

\begin{matrix} F_{IBG} (x; μ_{1}, σ_{1}, δ_{1}) = F_{IBG} (x; μ_{2}, σ_{2}, δ_{2}) if and only if μ_{1} = μ_{2}, σ_{1} = σ_{2}, δ_{1} = δ_{2} . \end{matrix}

Indeed, from (4)

\begin{matrix} exp \{- exp [- \frac{{x | x |}^{δ_{1}} + μ_{1}}{σ_{1}}]\} = exp \{- exp [\frac{{x | x |}^{δ_{2}} + μ_{2}}{σ_{2}}]\} . \end{matrix}

(21)

Since

F_{G}

is identifiable, then

μ_{1} = μ_{2}

and

σ_{1} = σ_{2}

. Thus, the Equation (21) is valid if and only if

\begin{matrix} {| x |}^{δ_{1}} - {| x |}^{δ_{2}} = 0, \end{matrix}

which only happens when

δ_{1} = δ_{2}

, for any

x \in R

.

The ML estimates

\hat{μ}, \hat{σ}, \hat{δ}

are the solution of the system of likelihood equations

\begin{matrix} \frac{\partial l (Θ; x)}{\partial μ} & = & - \frac{n}{\hat{σ}} + \frac{1}{\hat{σ}} \sum_{i = 1}^{n} e^{- \frac{x_{i} {| x_{i} |}^{\hat{δ}} + \hat{μ}}{\hat{σ}}} = 0; \end{matrix}

(22)

\begin{matrix} \frac{\partial l (Θ; x)}{\partial σ} = - \frac{n}{\hat{σ}} - \sum_{i = 1}^{n} {\hat{σ}}^{- 2} (x_{i} | x_{i} |^{\hat{δ}} + \hat{μ}) - \sum_{i = 1}^{n} {\hat{σ}}^{- 2} (x_{i} | x_{i} |^{\hat{δ}} + \hat{μ}) e^{- (x_{i} {| x_{i} |}^{\hat{δ}} + \hat{μ}) {\hat{σ}}^{- 1}} = 0; \end{matrix}

(23)

\begin{matrix} \frac{\partial l (Θ; x)}{\partial δ} & = \frac{n}{\hat{δ} + 1} + \sum_{i = 1}^{n} ln | x_{i} | - \frac{1}{\hat{σ}} \sum_{i = 1}^{n} x_{i} {| x_{i} |}^{\hat{δ}} ln \frac{\sum_{k = 1}^{n} x_{k} {| x_{k} |}^{\hat{δ}}}{\hat{σ}} \\ + e^{\frac{\hat{μ}}{\hat{σ}}} \sum_{i = 1}^{n} e^{- \frac{x_{i} {| x_{i} |}^{\hat{δ}}}{\hat{σ}}} (\frac{\sum_{j = 1}^{n} x_{j} {| x_{j} |}^{\hat{δ}}}{\hat{σ}}) ln \frac{\sum_{k = 1}^{n} x_{k} {| x_{k} |}^{\hat{δ}}}{\hat{σ}} = 0 . \end{matrix}

(24)

After algebraic manipulations, we get the unique closed-form solution for estimating

μ

,

\begin{matrix} \hat{μ} = \hat{σ} ln \frac{\sum_{i = 1}^{n} e^{- \frac{x_{i} {| x_{i} |}^{\hat{δ}}}{\hat{σ}}}}{n} . \end{matrix}

(25)

However, the estimates

\hat{σ}

and

\hat{δ}

must be obtained numerically.

Numerical Performance of ML Estimates

Now, we perform a Monte Carlo study to assess the performance of maximum likelihood estimators

\hat{μ}

,

\hat{σ}

and

\hat{δ}

in terms of their means, mean squared errors (MSE), biases, and standard errors (SE). We defined a set of 9 parameter vectors,

θ_{1}, \dots, θ_{9}

, with

μ \in {- 1, 0, 1}

,

δ \in {0, 2, 4}

, and

σ = 1

, for three sampling scenarios:

n = 50, 100

, and 1000. For each of the 27 combinations between parameters and sample scenarios, we took 100 Monte Carlo replications using the software R (version 3.4.1) to get the empirical sampling distributions of

\hat{μ}

,

\hat{σ}

, and

\hat{δ}

. We generate the Monte Carlo variates of

X \sim F_{G B} (\cdot; μ, σ, δ)

through the inverse transform method with the quantile Function (17).

Table 1, Table 2 and Table 3 depict the empirical expected values, bias, MSE, and SE of the ML estimators of the IBG model. Figure 4, Figure 5 and Figure 6 illustrate the empirical behavior of the MSE vs n. Overall, the MSE decreases as the sample size increases, confirming the optimal properties of ML estimators from the statistical inference theory. In this study, we did not face numerical problems in estimating these parameters.

4. Application

We use two financial data sets taken from https://finance.yahoo.com (accessed on 1 December 2021) to illustrate the applicability of the invertible bimodal Gumbel model. The first is the daily stock prices of Petrobras (PETR4), quoted in US dollars, from 1 March 2000 to 10 January 2021, totaling 5465 observations. The other is the daily exchange rate of the Brazilian real against the US dollar (USD/BRL) from 12 January 2003 to 15 October 2021, totaling 4223 data points. We aim to get the value-at-risk (VaR) of these data, a common measure of financial risk. It denotes the maximum loss incurred on a portfolio over a specific time horizon with a given confidence level

1 - α

[22]. It is expressed in probabilistic terms as

\begin{matrix} {VaR}_{α} (X_{t}) = inf {x \in R : F (x) \geq α}, \end{matrix}

(26)

where

F (x)

is the cumulative distribution function (CDF) of a real random variable

X_{t}

observed at time

t \in {0, 1, 2, \dots}

, and

0 < α < 1

is a small prespecified probability. Particularly, in our study, the time horizon comprises the totality of data in each discrete time series. Moreover, as

X_{t}

is an absolutely continuous random variable with an invertible CDF, we can write

\begin{matrix} x_{α} & = & {VaR}_{α} (X_{t}) = F^{- 1} (1 - α), \end{matrix}

where

F^{- 1}

denotes the inverse function of F, and

x_{α}

is the α—quantile of

X_{t}

. As usual,

X_{t}

means the log return of prices, that is,

\begin{matrix} X_{t} = ln P_{t} - ln P_{t - 1}, \end{matrix}

(27)

where

P_{t}

is a price at time t.

Table 4 summarizes descriptive statistics for PETR4 and USD/BRL returns. The return averages are close to zero, and the proximity between the absolute values of the first and third quartiles indicates the possible symmetry of the data, except for the possible extreme values suggested by the maximum (PETR4) and minimum (USD/BRL) statistics. Indeed, Figure 7 and Figure 8 depict extreme returns, some of them due to the COVID-19 event. In the critical period of the pandemic, Petrobras shares plummeted

57 %

due to low demand for petroleum products (Figure 7). As for the exchange rate USD/BRL, the effect was the opposite: the American dollar became more expensive than the Brazilian real because of various political and economic reasons. Furthermore, in these data sets, we observe extreme positive (PETR4) and negative (USD/BRL) values that stand out significantly from the rest of the observations.

Now, we perform the block maxima and minima method to extract extreme values from our data. Let

X_{1}, \dots X_{n}

be a random sample of log returns following

X \sim F_{I B G}

. Based on its realized values

{x_{t}}_{t = 1}^{n}

, we organize it into T non-overlapping sub-samples of length N, where T means the integer part of

n / N

, resulting in T data blocks of size N. We choose N to cover natural periods (e.g., a week or month) so that the new sub-sample is IID. Now, we take the maximum and the minimum over each N-history. We define the jth sub-sample of maximum and minimum as

\begin{matrix} M_{j} = max {x_{(j - 1) N + 1}, \dots, x_{j N}}, \end{matrix}

(28)

and

\begin{matrix} m_{j} = min {x_{(j - 1) N + 1}, \dots, x_{j N}}, \end{matrix}

(29)

for

j = 1, \dots, T

. This results in a new sample of size

2 T

, consisting of maxima and minima,

\begin{matrix} {Y_{t}}_{t = 1}^{2 T} = {m_{t}, M_{t}}_{t = 1}^{T} . \end{matrix}

(30)

For our case,

N = 15

is a block length providing IID sub-samples based on the Ljung–Box test for serial independence with a significance level of

5 %

. The left panels of Figure 9 and Figure 10 depict the series of extreme PETR4 and USD/BRL returns extracted from the blocks, while the left ones show the bimodal form of their distributions.

Thus, we fit these empirical distributions of extreme returns using our suggested invertible bimodal Gumbel model,

F_{I B G} (x; θ)

, with parameters

μ

,

σ

, and

δ

. Table 5 shows their maximum likelihood estimates. Figure 11 depicts the fitted model against the corresponding distribution of the extracted extreme returns, indicating that the IBG is suitable for simultaneously describing minima and maxima extreme returns.

Finally, Table 6 presents the estimated VaR for

α = 10 %

,

5 %

, and

1 %

. As we are dealing with the logarithmic returns, to make these VaR values more understandable, we may consider that the maximum return of the stock is

exp VaR - 1

. Thus, for example, over a 15-day period, we do not expect a return greater than

7.4 %

for PETR4 and

2.6 %

for USD/BRL with a confidence of

90 %

.

5. Conclusions

This paper introduced and examined the IBG distribution as an extension of the classical Gumbel distribution. We addressed the limitations of the unimodal Gumbel by proposing a model capable of simultaneously representing both maximum and minimum extremes, enhancing its applicability and versatility. The mathematical formulations accompanied by illustrative figures elucidate the characteristics and behavior of the proposed distribution, emphasizing its advantages in terms of computational efficiency and flexibility. We presented its distributional properties, including mode, moment-generating functions, and entropy. In our illustration, we performed the maximum and minimum blocks technique to obtain serial independent data for the VaR estimation through the ML method, offering a novel perspective on modeling extremes through the lens of the invertible bimodal Gumbel distribution.

Author Contributions

Conceptualization, C.G.O. and E.B.S.; methodology, C.G.O.; software, E.B.S. and A.S.; validation, E.B.S., R.Y.M. and A.S.; formal analysis, C.G.O.; investigation, C.G.O.; resources, E.B.S.; data curation, C.G.O.; writing—original draft preparation, C.G.O.; writing—review and editing, R.Y.M.; visualization, A.S.; supervision, R.Y.M.; project administration, C.G.O.; funding acquisition, C.G.O. and R.Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financed in part by the Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior—Brazil (CAPES)—Finance Code 001, Fundacao de Apoio a Pesquisa do Distrito Federal (FAPDF)—Finance Code 00193-00001860/2023-17, and National Council for Scientific and Technological Development (CNPq)—Finance Code 311548/2022-9.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

https://finance.yahoo.com, accessed on 1 December 2021.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Aubin, D. Forms of explanation in the catastrophe theory of René Thom: Topology, morphogenesis, and structuralism. In Growing Explanations: Historical Perspectives on Recent Science; Wise, M.N., Smith, B.H., Weintraub, E.R., Eds.; Duke University Press: New York, NY, USA, 2004; pp. 95–130. [Google Scholar]
Ebeling, W.; Schimansky-Geier, L. Stochastic dynamics of a bistable reaction system. Phys. A Stat. Mech. Its Appl. 1979, 98, 587–600. [Google Scholar] [CrossRef]
Smirnov, V.; Ma, Z.; Volchenkov, D. Invited article by M. Gidea Extreme events and emergency scales. Commun. Nonlinear Sci. Numer. Simul. 2020, 90, 105350. [Google Scholar] [CrossRef] [PubMed]
Rossi, F.; Fiorentino, M.; Versace, P. Two-component extreme value distribution for flood frequency analysis. Water Resour. Res. 1984, 20, 847–856. [Google Scholar] [CrossRef]
Nadarajah, S. The exponentiated Gumbel distribution with climate application. Environmetrics 2006, 17, 13–23. [Google Scholar] [CrossRef]
Aryal, G.R.; Tsokos, C.P. On the transmuted extreme value distribution with application. Nonlinear Anal. Theory Methods Appl. 2009, 71, 401–407. [Google Scholar] [CrossRef]
Cooray, K. Generalized Gumbel distribution. J. Appl. Stat. 2010, 37, 171–179. [Google Scholar] [CrossRef]
Jeong, B.Y.; Murshed, M.S.; Am Seo, Y.; Park, J.S. A three-parameter kappa distribution with hydrologic application: A generalized Gumbel distribution. Stoch. Environ. Res. Isk Assess. 2014, 8, 2063–2074. [Google Scholar] [CrossRef]
Nadarajah, S.; Kotz, S. The beta Gumbel distribution. Math. Probl. Eng. 2004, 4, 323–332. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Nadarajah, S.; Ortega, E.M.M. The Kumaraswamy Gumbel distribution. Stat. Methods Appl. 2012, 21, 139–168. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Ortega, E.M.M.; Da Cunha, D.C.C. The exponentiated generalized class of distributions. J. Data Sci. 2013, 11, 1–27. [Google Scholar] [CrossRef]
Pinheiro, E.C.; Ferrari, S.L.P. A comparative review of generalizations of the Gumbel extreme value distribution with an application to wind speed data. J. Stat. Comput. Simul. 2016, 86, 2241–2261. [Google Scholar] [CrossRef]
Okorie, I.E.; Akpanta, A.C.; Ohakwe, J. The Exponentiated Gumbel Type-2 Distribution: Properties and Application. Int. J. Math. Math. Sci. 2016, 2016, 5898356. [Google Scholar] [CrossRef]
Okorie, I.E.; Akpanta, A.C.; Ohakwe, J.; Chikezie, D.C.; Obi, E.O. The Kumaraswamy G Exponentiated Gumbel type-2 distribution. Afr. Stat. 2017, 12, 1367–1396. [Google Scholar]
Otiniano, C.E.G.; Sousa, B.; Vila, R.; Bourguignon, M. A Bimodal Model for Extremes Data. Environ. Ecol. Stat. 2023, 30, 261–288. [Google Scholar] [CrossRef]
Otiniano, C.E.G.; Vila, R.; Brom, P.C.; Bourguignon, M. On the bimodal Gumbel model with application to environmental data. Austrian J. Stat. 2023, 52, 45–65. [Google Scholar] [CrossRef]
Maia, A.; Matsushita, R.; Da Silva, S. Earnings distributions of scalable vs. non-scalable occupations. Phys. A 2020, 560, 125192. [Google Scholar]
Newman, M.E.J. Power laws, Pareto distributions and Zipf’s law. Contemp. Phys. 2005, 46, 323–351. [Google Scholar] [CrossRef]
Gumbel, E.J. Les valeurs extremes des distributions statistiques. Ann. L’Institut Henri Poincaré 1935, 5, 115–158. [Google Scholar]
Jenkinson, A.F. The frequency distribution of the annual maximum (or minimum) values of meteorological elements. Q. J. R. Meteorol. Soc. 1995, 81, 158–171. [Google Scholar] [CrossRef]
Ahmad, K.E.; Jaheen, Z.; Modhesh, A.A. Estimation of a Discriminant Function Based on Small Sample Size from a Mixture of Two Gumbel Distributions. Commun. Stat. Simul. Comput. 2010, 39, 713–725. [Google Scholar] [CrossRef]
Longin, F.M. From value at risk to stress testing: The extreme value approach. J. Bank. Financ. 2000, 24, 1097–1130. [Google Scholar] [CrossRef]

Figure 1. Density

f_{IBG} (x; 0, 1, δ)

, with

δ

ranging from −0.5 to 4. Bimodal distributions appear when

δ > 0

, and we have the unimodal Gumbel if

δ = 0

.

Figure 1. Density

f_{IBG} (x; 0, 1, δ)

, with

δ

ranging from −0.5 to 4. Bimodal distributions appear when

δ > 0

, and we have the unimodal Gumbel if

δ = 0

.

Figure 2. Density

f_{IBG} (\cdot; μ, 1, δ)

, with

μ

ranging from − 0.5 to 2,

δ = 0

or 1, and

μ

acting as a location (a) or shape parameter (b).

Figure 2. Density

f_{IBG} (\cdot; μ, 1, δ)

, with

μ

ranging from − 0.5 to 2,

δ = 0

or 1, and

μ

acting as a location (a) or shape parameter (b).

Figure 3. Density

f_{IBG} (\cdot; 0, σ, δ)

, with

σ

ranging from 1 to 4 and

δ = 0

(unimodal, (a)) or 2 (bimodal, (b)). In both cases,

σ

represents the scale parameter.

Figure 3. Density

f_{IBG} (\cdot; 0, σ, δ)

, with

σ

ranging from 1 to 4 and

δ = 0

(unimodal, (a)) or 2 (bimodal, (b)). In both cases,

σ

represents the scale parameter.

Figure 4. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{1}

,

{\hat{θ}}_{2}

, and

{\hat{θ}}_{3}

.

Figure 4. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{1}

,

{\hat{θ}}_{2}

, and

{\hat{θ}}_{3}

.

Figure 5. MSE behavior of

{\hat{θ}}_{4}, {\hat{θ}}_{5}

and

{\hat{θ}}_{6}

. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{4}

,

{\hat{θ}}_{5}

, and

{\hat{θ}}_{6}

.

Figure 5. MSE behavior of

{\hat{θ}}_{4}, {\hat{θ}}_{5}

and

{\hat{θ}}_{6}

. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{4}

,

{\hat{θ}}_{5}

, and

{\hat{θ}}_{6}

.

Figure 6. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{7}

,

{\hat{θ}}_{8}

, and

{\hat{θ}}_{9}

.

Figure 6. MSE from Monte Carlo replications of samples with n ranging from 50 to 1000, for

{\hat{θ}}_{7}

,

{\hat{θ}}_{8}

, and

{\hat{θ}}_{9}

.

Figure 7. PETR4 Prices (left) and PETR4 log returns (right).

Figure 8. USD/BRL Prices (left) and USD/BRL log returns (right).

Figure 9. PETR4 sub-sample

{Y_{t}}_{t = 1}^{70}

: Extremes obtained from the blocks (left) and histogram (right).

Figure 9. PETR4 sub-sample

{Y_{t}}_{t = 1}^{70}

: Extremes obtained from the blocks (left) and histogram (right).

Figure 10. USD/BRL sub-sample

{Y_{t}}_{t = 1}^{70}

: Extremes obtained from the blocks (left) and histogram (right).

Figure 10. USD/BRL sub-sample

{Y_{t}}_{t = 1}^{70}

: Extremes obtained from the blocks (left) and histogram (right).

Figure 11. Histogram versus fitted distribution: PETR4 (left) and USD/BRL (right).

Table 1. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 50

.

Table 1. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 50

.

	$θ$	True	Mean	Bias	MSE	SE
	$μ$	−1	−1.00952	−0.00952	0.02163	0.1475
$θ_{1}$	$σ$	1	0.97492	−0.02507	0.02765	0.1653
	$δ$	0	0.01114	0.01114	0.01554	0.1248
	$μ$	−1	−1.03142	−0.03142	0.03419	0.1831
$θ_{2}$	$σ$	1	1.02366	0.02366	0.02592	0.1600
	$δ$	2	2.05863	0.05863	0.12062	0.3440
	$μ$	−1	−0.99788	0.00211	0.02748	0.1666
$θ_{3}$	$σ$	1	0.96513	−0.03486	0.02763	0.1634
	$δ$	4	3.91971	−0.08028	0.34339	0.5834
	$μ$	0	0.00145	0.00145	0.02113	0.1461
$θ_{4}$	$σ$	1	0.99965	−0.00034	0.01646	0.1289
	$δ$	0	0.03047	0.03047	0.01316	0.1111
	$μ$	0	0.00339	0.00339	0.02363	0.1544
$θ_{5}$	$σ$	1	1.00572	0.00572	0.01401	0.1188
	$δ$	2	2.14749	0.14749	0.14933	0.3589
	$μ$	0	−0.02841	−0.02841	0.02129	0.1438
$θ_{6}$	$σ$	1	0.97091	−0.02908	0.01374	0.1141
	$δ$	4	4.00093	0.00093	0.26488	0.5172
	$μ$	1	0.95417	−0.04582	0.02859	0.1636
$θ_{7}$	$σ$	1	0.98530	−0.01469	0.01918	0.1384
	$δ$	0	0.00560	0.00560	0.01242	0.1118
	$μ$	1	0.96207	−0.03792	0.02218	0.1447
$θ_{8}$	$σ$	1	1.04081	0.04081	0.02338	0.1481
	$δ$	2	2.10659	0.10659	0.17084	0.4013
	$μ$	1	0.93990	−0.06009	0.05954	0.2376
$θ_{9}$	$σ$	1	0.97762	−0.02237	0.12103	0.3489
	$δ$	4	3.96419	−0.03580	0.91265	0.9594

Table 2. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 100

.

Table 2. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 100

.

	$θ$	True	Mean	Bias	MSE	SE
	$μ$	−1	−0.98025	0.01974	0.01280	0.1120
$θ_{1}$	$σ$	1	0.98180	−0.01819	0.01146	0.1060
	$δ$	0	−0.00898	−0.00898	0.00780	0.0883
	$μ$	−1	−0.99734	0.00265	0.01532	0.1243
$θ_{2}$	$σ$	1	0.99027	−0.00972	0.01188	0.1091
	$δ$	2	2.01504	0.01504	0.06759	0.2608
	$μ$	−1	−0.99937	0.00062	0.01558	0.1254
$θ_{3}$	$σ$	1	0.97695	−0.02304	0.01185	0.1069
	$δ$	4	3.9113	−0.08860	0.13284	0.3553
$θ_{4}$	$μ$	0	−0.02087	−0.02087	0.01232	0.1095
	$σ$	1	0.99485	−0.00514	0.00724	0.0854
	$δ$	0	0.02235	0.02235	0.00722	0.0824
	$μ$	0	−0.01413	−0.01413	0.01182	0.1083
$θ_{5}$	$σ$	1	0.98941	−0.01058	0.00694	0.0830
	$δ$	2	2.06691	0.06691	0.07194	0.2610
	$μ$	0	−0.00765	−0.00765	0.01211	0.1103
$θ_{6}$	$σ$	1	0.99801	−0.00198	0.00713	0.0848
	$δ$	4	4.05904	0.05904	0.16391	0.4025
	$μ$	1	0.94706	−0.05293	0.01350	0.1039
$θ_{7}$	$σ$	1	0.98666	−0.01333	0.00964	0.0977
	$δ$	0	−0.02825	−0.0282	0.00588	0.0717
	$μ$	1	0.96052	−0.03947	0.014831	0.1157
$θ_{8}$	$σ$	1	1.01628	0.01628	0.01126	0.1053
	$δ$	2	1.99650	−0.00349	0.06319	0.2526
	$μ$	1	0.98981	−0.01018	0.01102	0.1050
$θ_{9}$	$σ$	1	0.99397	−0.00602	0.00709	0.0844
	$δ$	4	4.02093	0.02093	0.11929	0.3464

Table 3. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 1000

.

Table 3. Means, biases, mean squared errors (MSE), and standard errors (SE) of the estimated parameters from 100 Monte Carlo replications of samples with

n = 1000

.

	$θ$	True	Mean	Bias	MSE	SE
	$μ$	−1	−0.99837	0.00162	0.00149	0.0388
$θ_{1}$	$σ$	1	0.99413	−0.00586	0.00104	0.0319
	$δ$	0	−0.00209	−0.00209	0.00072	0.0269
	$μ$	−1	−0.99906	0.00093	0.00130	0.0362
$θ_{2}$	$σ$	1	1.00507	0.00507	0.00088	0.0293
	$δ$	2	2.01457	0.01457	0.00540	0.0724
	$μ$	−1	−0.99417	0.00582	0.00110	0.0328
$θ_{3}$	$σ$	1	0.99574	−0.00425	0.00091	0.03016
	$δ$	4	3.99378	−0.00621	0.01433	0.1201
	$μ$	0	−0.00232	−0.00232	0.00134	0.0367
$θ_{4}$	$σ$	1	1.00226	0.00226	0.00069	0.02643
	$δ$	0	0.00303	0.00303	0.00076	0.0277
	$μ$	0	−0.00322	−0.00322	0.00094	0.0307
$θ_{5}$	$σ$	1	0.99633	−0.00366	0.00062	0.0248
	$δ$	2	2.01715	0.01715	0.00587	0.0751
	$μ$	0	−0.00824	−0.00824	0.00114	0.0330
$θ_{6}$	$σ$	1	1.00379	0.00379	0.00065	0.0254
	$δ$	4	4.01962	0.01962	0.01519	0.1223
	$μ$	1	0.99310	−0.00689	0.00136	0.0365
$θ_{7}$	$σ$	1	0.99649	−0.00350	0.00107	0.0327
	$δ$	0	−0.00116	−0.00116	0.00080	0.0284
	$μ$	1	0.95857	−0.04142	0.00253	0.0286
$θ_{8}$	$σ$	1	0.99516	−0.00483	0.00104	0.0320
	$δ$	2	1.96530	−0.03469	0.00929	0.0904
	$μ$	1	0.99638	−0.00361	0.00133	0.0365
$θ_{9}$	$σ$	1	1.00135	0.00135	0.00108	0.0330
	$δ$	4	4.00819	0.00819	0.01701	0.1308

Table 4. Descriptive statistics.

Stock	Minimum	1st Quartile	Median	Mean	3rd Quartile	Maximum
PETR4	−0.3523667	−0.0137843	0.0000000	0.0003036	0.0138190	0.7203695
USD/BRL	−0.3148314	−0.0056746	0.0000000	0.0001515	0.0060800	0.0966945

Table 5. ML estimates of

θ = (μ, σ, δ)

and their respective standard error (SE).

Table 5. ML estimates of

θ = (μ, σ, δ)

and their respective standard error (SE).

Stock	$\hat{μ}$	$\hat{σ}$	$\hat{δ}$
PETR4	0.000009962	0.000089877	1.246255323
SE	0.0000071	0.0000003	0.0000083
USD/BRL	0.000016486	0.000099908	1.31295954
SE	0.0000007	0.0000001	0.0000027

Table 6. Estimated

{VaR}_{α}

from (17).

Table 6. Estimated

{VaR}_{α}

from (17).

Stock	10%	5%	1%
PETR4	0.07166153	0.07835825	0.09003984
USD/BRL	0.02594245	0.0295429	0.03612925

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Otiniano, C.G.; Silva, E.B.; Matsushita, R.Y.; Silva, A. Bridging Extremes: The Invertible Bimodal Gumbel Distribution. Entropy 2023, 25, 1598. https://doi.org/10.3390/e25121598

AMA Style

Otiniano CG, Silva EB, Matsushita RY, Silva A. Bridging Extremes: The Invertible Bimodal Gumbel Distribution. Entropy. 2023; 25(12):1598. https://doi.org/10.3390/e25121598

Chicago/Turabian Style

Otiniano, Cira G., Eduarda B. Silva, Raul Y. Matsushita, and Alan Silva. 2023. "Bridging Extremes: The Invertible Bimodal Gumbel Distribution" Entropy 25, no. 12: 1598. https://doi.org/10.3390/e25121598

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bridging Extremes: The Invertible Bimodal Gumbel Distribution

Abstract

1. Introduction

2. Main Results

Some Distributional Characteristics

3. Parameter Estimation

Numerical Performance of ML Estimates

4. Application

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI