Next Article in Journal
Firm Risk and Tax Avoidance in Vietnam: Do Good Board Characteristics Interfere Effectively?
Previous Article in Journal
Dataset Analysis of Pandemic Risks and Risk Management Prospects Based on Management and Marketing in Conditions of COVID-19 Recession
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dependency Modeling Approach of Cause-Related Mortality and Longevity Risks: HIV/AIDS

1
African Centre of Excellence in Data Science (ACEDS), College of Business and Economics, University of Rwanda, Kigali P.O. Box 4285, Rwanda
2
Department of Mathematics, Physics, and Computing, School of Science and Aerospace Studies, Moi University, Eldoret P.O. Box 3900, Kenya
3
Department of Mathematics, College of Natural Sciences, Makerere University, Kampala P.O. Box 7062, Uganda
4
Department of Applied Statistics, School of Economics, University of Rwanda, Kigali P.O. Box 4285, Rwanda
*
Author to whom correspondence should be addressed.
Risks 2023, 11(2), 38; https://doi.org/10.3390/risks11020038
Submission received: 5 December 2022 / Revised: 27 January 2023 / Accepted: 28 January 2023 / Published: 9 February 2023

Abstract

:
Disaggregation of mortality by cause has advanced the development of life tables for life insurance and pension purposes. However, the assumption that the causes of death are independent is a challenge in reality. Furthermore, models that determine relationships among causes of death such as HIV/AIDS and their impact on mortality and longevity risks seem trivial or inflexible. To address these problems, we aim to determine and build an appropriate copula dependence model for HIV/AIDS against other causes of death in the presence of age, gender, and time. A bivariate copula model is proposed to capture the dependence structure of HIV/AIDS on life expectancy. This approach allows the fitting of flexible and interpretable bivariate copulas for a two-dimensional case. The dataset was derived from the World Health Organization database that constituted annualized death numbers, causes, age, gender, and years (2000 to 2019). Using Kendall’s tau and Pearson linear coefficient values, the survival Joe copulas proved to be a suitable model. The contribution and implication of this research are the quantification of the impact of HIV/AIDS on a life table, and, thus, the establishment of an alternative to the subjective actuarial judgment approach.

1. Introduction

Causes of death data in insurance have grown to be considered a critical input in estimating mortality and longevity risks in actuarial studies as used in epidemiology and biostatistics (Stracke and Heinen 2020). In particular, the pricing of life insurance and pension contracts requires mortality and longevity projections in developing life tables and thus, cause-based life tables have recently become invaluable (Murphy et al. 2021). Life tables can be classified based on model formulation, that is, using the aggregate death or the disaggregate causes of death mortality. The functional approach between the two is based on the risk studied and the underlying life insurance benefit. However, causes of death mortality models are constrained by the assumption of independence (Kjærgaard et al. 2019). Studies have demonstrated that there exists inherent dependency among causes, and thus reality may not hold (Kaishev et al. 2007; Chiang 1968).
Life insurance pays in the case of natural or accidental deaths. Natural deaths encompass the majority of causes; however, in some instances, deaths linked to suicide are excluded. In the United States, the Accidental Death and Dismemberment Policy (AD&D) and the Accidental Death Benefits (ADB) (Richmond 2009; Mueller 2004) pay claims due to accidental causes of deaths and injuries as defined in the policy. Such policies also act as riders to the main life insurance product to increase the final benefit payout. Additionally, a critical illness insurance policy takes effect on the occurrence of a defined list of diseases. These benefits would require different pricing life tables, such as the multiple decrement life tables. According to Beltrán-Sánchez et al. (2008), these mortality tables are referred to as the cause-deleted life tables or Associated Single Decrement Life Tables (ASDLT).
The impact of certain causes of death on a mortality life table has lacked a quantified approach, and thus actuarial judgments have relied on subjective principles. The conservatism approach employed by actuaries has continued to be the main technique in determining the extent of the impact of unquantified risks (Strauss et al. 2005). The aims of this approach are to alleviate basis risk in pricing, reserving, and valuation of life products. If the actuary is optimistic then the premiums or contributions will become less in the earlier years but will be increased in the latter years, alternatively, if the actuary is pessimistic then the premiums or contributions of the benefit will be higher than normal at the onset and reduce in the future. Life tables are affected by subjectiveness, for instance, mortality and longevity provisions are adjusted based on judgment of the prevailing mortality assumptions. In developing life tables in Kenya, a conservatism approach is considered in accounting for HIV/AIDS’ impact on the life table. To minimize this risk of subjective judgment by actuaries, a dependence model would thus be suggested to enable and enhance improvement in understanding the impact of deaths due to HIV/AIDS on a lifetable.
HIV/AIDS has shown a tremendous decline in being a leading cause of death for the past 20 years not only in entire Sub-Saharan Africa but more specifically in Kenya. Bett et al. (2022) have shown that for both males and females across all ages, HIV/AIDS has consistently portrayed a decreasing trend in the future. Measurement of the exact gain or loss to a life table is still to the best of our knowledge unquantified. The earliest Kenyan life tables were the A1949-52 mortality tables derived from the United Kingdom (UK) and Wales due to a similar mortality experience in Kenya as in the UK then. The KE 2001–2003 Tables for Assured Lives were published in 2011 and have been widely used, but have failed to capture the immense improvement in life expectancy despite the adjustment to HIV/AIDS. Model life tables include UN (1958, 1982), Coale and Demeny (1966, 1983), INDEPTH AIDS, decremented model life tables, and Weiss 1973 (Obscure anthropological model life tables), which have also been employed by the United Nations to estimate the extent of mortality on life tables and all rely heavily on assumptions. The interactions of causes of death with the aggregate deaths would be a key study to establish the correlation structure of specific causes of death with other causes, and this can be achieved by using copulas. Cause-eliminated life tables are created in the framework of the competing risks model where the relationships among competing events are assessed and measured in relation to other events of interest. Therefore, studying the dependence structure of HIV/AIDS using copulas would aid in understanding the future evolution of mortality and longevity risk and, consequently, the dependence structure. Additionally, Kaishev et al. (2007) noted in their paper that extensions of the copulas approach to newer causes such as HIV/AIDS would be relevant.
There are two main types of mortality and longevity models in actuarial literature, that is, extrapolative and explanatory approaches. The former method has been shown to be cautionable as demonstrated by Bengtsson and Keilman (2019) on the trends in life expectancy among women in Denmark and the Netherlands. Life expectancy from 1970 was overestimated as compared to other European countries because the extrapolation model failed to foresee trends, cause approach is thus preferred. Cause of death mortality models, for instance, vector error correction models (VECM), were developed to track the long-term association of the causes of death based on co-integrating relationships (Alai et al. 2013; Arnold and Sherris 2015). Co-integration (Engle and Granger 1987) is the long-term linear combination of integrated time series based on deterministic or stochastic trends. Modeling the multivariate dependence modeling structure takes the short-term correlation into account and ensures that the time dependence of the correlation is identified. For co-integration, the relationship must be longer to ensure validity in the co-integration relationship; however, the lack of sufficient long-term data hampers the robustness of the model.
The differentiation of correlation and co-integration is based on the time frame of the data, and thus co-integration is more suitable for long-term relationships while correlation is suitable for short-term ones. Copulas and co-integration techniques have been compared in financial stocks by Stander et al. (2013). It was found that the copula approach was more suitable for the short-term asset allocation approach as compared to the long-term co-integration approach.
Therefore, the purpose of this study is to determine the copula structure that exists between key causes of death that influence mortality and longevity risks and apply it to a cause-eliminated life table, particularly HIV/AIDS. This is because instead of only checking correlation, a copula structure would enable the illustration of profound relationships among covariates that can be measured.
This paper has been divided into four parts. The next section deals with copulas as applicable in mortality modeling, Section 3 will be methods and materials, followed by results and discussions in Section 4, and finally, the conclusion and future work in Section 5.

2. Copulas in Mortality Modelling

There exist three types of copulas: fundamental, implicit, and explicit (McNeil et al. 2015). The fundamental case involves the independence copula, the implicit case is the Gaussian and the Student-t copula, and the explicit type is the Archimedean copula family that includes the Frank, Gumbel, and Clayton copulas (Kaishev et al. 2007; Li and Lu 2018; Nelsen 2007) and others; all of these are applied based on the structure of the data. Exploration of bivariate distributions between the aggregate death rate and the individual causes would therefore enable us to obtain key relationships of interest.
Mortality data, specifically, shows that the causes of death have complex dependent structures that are non-symmetric, meaning that they are not linear in structure. Therefore, they may fail to be captured by the independence or the implicit multivariate normal copulas (Gaussian), and thus a flexible dependent model would suffice to understand the inherent complexities among the causes of death. Additionally, the challenge with the implicit Gaussian approach is that it fails to account for asymmetrical, extreme, and heavy-tailed distributions.
On the other hand, the explicit Archimedean copulas are rigid in nature and fail to allow differing dependency structures among variables. The hierarchical Archimedean copula model (Li and Lu 2018) was applied in life insurance; however, the complexities of the model have continued to render the model less applicable in practice. Due to the different complex marginals among individual variables and the non-symmetric dependencies of the individual pairs, a flexible copula structure is thus needed; consequently, the multivariate copulas that include rotated, survival, and tawn copulas have been developed to enhance flexibility with minimized complexities (Bedford and Cooke 2002; Aas et al. 2009). They are constructed from bivariate (pair) copulas and scaled to incorporate more variables of interest. They are feasible and applicable in reality because they allow the use of the conditioning approach.

2.1. Copula Definition

According to McNeil et al. (2015), d -dimensional copula is a distribution function on 0 , 1 d with a standard uniform marginal distribution. Let C u = C ( u 1 , , u d ) be the multivariate distribution functions that are copulas. Mapped as C : [ 0 , 1 ] d [ 0 , 1 ] with the following three conditions:
(1)
C ( u 1 , , u d ) is increasing in each u i
(2)
C 1 , , 1 , u i , 1 = u i such that i 1 , , d , u i [ 0 , 1 ]
(3)
  ( a 1 , , a d ) , ( b 1 , , b d )   0 , 1 d with a i b i we have the following:
i 1 = 1 2 i d = 1 2 1 i 1 + + i d C u 1 i 1 , , u d i d 0 ,
where u j 1 =   a j and u j 2 =   b j     j 1 , , d .
The third condition is the rectangle inequality that ensures a non-negative probability, that is, P a 1 U 1 b 1 , , a d U d b d 0 .

2.2. Sklar Theorem

A key result involving copulas is Sklar’s theorem (Sklar 1973), which states that any multivariate joint distribution can be written in terms of univariate marginal distribution functions and a copula, which describes the dependence structure between the two variables. The Sklar theorem enables a connection to exist between the multivariate distribution functions and the univariate margins. The theorem of Copulas is achieved and holds based on Sklar (1959) and it states that a multivariate distribution function F can be written in the form of a copula function such that if F ( x 1 , x 2 , , x d ) is a joint multivariate distribution function with univariate marginal distributions functions F 1 x 1 , F 2 x 2 , …, F d x d then there exists a copula function C ( u 1 , , u d ) such that:
F ( x 1 , x 2 , , x d ) = C ( F 1 x 1 , F 2 x 2 , …, F d x d ) if each F i is continuous then C is unique.
Frechet bounds (Fréchet 1951) explains limits of both the maximum and minimum bounds in a copula function are given by:
m a x 0 , u + v 1 C ( u , v ) m i n ( u , v ) .
Let T 1 , , T n , 0 T j < ω , j = 1 , , n represent the future lifetime random variable with a maximum time limit, ω , due to cause of death j for n total causes. Further, let min ( T 1 , , T n ) represent the actual random lifetime. Thus, the joint distributions that represent the lifetime distribution function will be:
F t 1 , , t n = P r { T 1 t 1 , , T n t n }
and the joint survival function:
S t 1 , , t n = P r { T 1 > t 1 , , T n > t n } .
These random variables will be considered dependent and non-defective such that
Pr T j < ω = 1 .

2.3. Complete Cause Elimination

The impact of a complete cause elimination and its quantification will be of interest, similar to the procedure of Elandt-Johnson (1976), who demonstrated cause elimination by considering the marginal distribution of the individual cause of death. Assuming that we remove cause i the resultant survival function becomes:
F t 1 , , t i 1 , t i + 1 , , t n = P r { T 1 t 1 , , T i 1 t i 1 , T i + 1 t i + 1 , , T n t n } where i is removed and letting,
S i t = S t 1 , , t i 1 , t i + 1 , , t n = P r { T 1 > t 1 , , T i 1 > t i 1 , T i + 1 > t i + 1 , , , T n > t n } .
The measurement and effect of S i t which is after the removal of cause i would thus form our objective function to obtain. Therefore, a copula approach would suffice to perform this task. The effect on the survival function can be further grouped into two; that is, the crude and net survival functions.

2.4. Crude Survival Function

It is also called the cumulative incidence function and is usually estimated from the observed mortality data because S t 1 , , t n = P r { T 1 > t 1 , , T n > t n } = S t are assumed to be mutually exclusive, as pointed out by Bryant and Dignam (2004).

2.5. Net Survival Function

Denoting S i t = P r { T i > t } as marginal survival function due to cause i alone associated to the multivariate joint survival function. The individual S i t are the values of interest. The complement would end up being:
F i t = 1 S i t .
Thus, in order to achieve the estimates of the net survival function on the basis of the crude survival functions n non-linear differential equations will have to be solved. Since we would be working without the assumption of independence among the causes, therefore, we would require to set up a dependence model achieved with an appropriate copula structure. The i-th cause survival function S x i t will thus be removed from the multivariate joint survival function, achieving a net survival function. The resultant copula for age x would end up being of the form:
S x t , , t = C ( S x 1 t , , S x i 1 t , S x i + 1 t , , S x n t ) .
The specific families’ input parameter description and implementation are shown and demonstrated by Nelsen (2007) in Table 1.
Other flexible copulas include vine copulas. For instance, the 2-dimensional vine copulas that are formed from the bivariate distributions are obtained as below:
f x 1 , x 2 = C 1 , 2 ( F 1 ( x 1 ) , F 2 ( x 2 ) ) · f x 1 · f x 2
f 2 / 1 ( x 2 / x 1 ) = C 1 , 2 ( F 1 ( x 1 ) , F 2 ( x 2 ) ) · f x 2
where the joint, marginal, and conditional distributions are given as Table 2.

2.6. Dependence Measures

Considering the main rank correlation measure, that is, Kendall’s tau, it is defined as the probability of concordance less the probability of discordance (Nelsen 2007). Mathematically, for two random variables X and Y with the following pair of observations x 1 , y 1 x 2 , y 2 , , ( x n , y n ) from the random vector X , Y Kendall’s tau is described as:
τ ^ = 2 n n 1 i < j s g n ( x i x j ) s g n ( y i y j ) .
For purposes of comparison, a Pearson linear correlation coefficient (Schober et al. 2018) will be utilized as a reference and comparability value.

3. Methods and Materials

3.1. Data Source

The data set used contains individual causes of death for each given gender, age, and year derived from the World Health Organization database that has collated 131 causes of death from 2000 to 2019 for Kenya. It is noted that the aggregate mortality rate is a time series that can be decomposed into individual causes of death, and thus the 131 causes of death are similar time series data for this given period. Time series can either be stationary or non-stationary. Weak or strict stationarity can easily be modeled to forecast the future; however, this is not the case for non-stationary series, in which the majority of mortality data fall into. The considered data will be for the years 2000 to 2019 for both males and females at least age 20 and above and will be used to develop an appropriate copulas dependence model. Additionally, we will partition the causes of death into three categories—all-causes (A), HIV/AIDS causes (HA), and non-HIV/AIDS (NHA) causes—and develop an appropriate copula dependence model based on the last two categories.

3.2. Fitting the Multivariate Dependence Model

The copula structure will be determined by the VineCopula package in R that attempts to ascertain the copula structure that exist between the two variables hence the pair bivariate copula case. To determine the best-fitting copula, the Akaike information criterion (AIC), Bayesian information criterion (BIC), and log likelihood measures will be comparatively selected. The optimal choice will be based on a minimal AIC and BIC and a maximum log likelihood. The maximum likelihood estimation method will be implemented with the aide of the VineCopula BiCopula analysis in R (Bedford and Cooke 2002; Dissmann et al. 2013).

3.3. Multiple Decrement Model

Let D i x , t describe the cause-specific deaths at age x and time t or simply D i , such that D H A represents deaths for HIV/AIDS, D N H A represents deaths for non-HIV/AIDS, and D A represents deaths for all causes.
We will make use of the study of Fergany (1971) in the construction of an abridged life table where, (x, x + n) will be the age interval for a person aged x. nmx to be the age-specific central death rate, nqx will represent the probability of death, lx the number of lives aged x, ndx the number of persons dead in the interval n, nLx number of person-years lived in the interval, and Tx, future lifetime at age x.

3.4. Life Expectancy

Life expectancy will be our key variable of measurement. It is a measurement that determines the average length of time lived by a person aged x. It is denoted by E T x for the continuous case and E K x for the discrete case, where T x and K x represent the random remaining lifetime at age x. Mathematically, they are expressed as:
e x o = E T x = 0 t · f x ( t ) d t
applying integration by substitution,
= 0 1 F x ( t ) d t
= 0 S x ( t ) d t
e x = E K x = k = 1 k · P ( K x = k )
= k = 1 S x ( k ) .
Such that the discrete and the continuous case may be summarized as below
K x T x K x + 1 .

4. Results and Discussion

The following section presents the results of the study, commencing with the exploratory analysis of HIV/AIDS and non-HIV/AIDS-related deaths based on time and age for males and females. Subsequently, estimation and modeling of the marginal distribution of each cause of death of the fitted bivariate copula model will follow. Finally, the application of this model to a mortality life table in terms of the gain and loss scenario of life expectancy rates will be discussed.

4.1. All Causes of Death Distribution with and without HIV/AIDS in Terms of Time

Figure 1 and Figure 2 display the elimination of HIV/AIDS causes of death from the aggregate causes of death over time for both males and females. It is evident that there has been a consistent reduction in mortality rates across the years for both males and females. The fitted abline() indicates a steep declining trend for both HIV/AIDS and non-HIV/AIDS causes of death. This result could suggest the existence of a positive correlation between these two variables. It is interesting to note a higher variability for the non-HIV/AIDS female rates.

4.2. All Causes of Death Distribution with and without HIV/AIDS in Terms of Age

The distribution based on age for males and females is presented in Figure 3 and Figure 4. Firstly, the death rates show a positive correlation with age, that is, the death rates increase with age, which confirms the assumption of the human mortality pattern. Secondly, all causes death rates without HIV/AIDS are lower than all causes to a large extent across all ages, both for males and females; however, we observe there is a significant difference for the younger ages (20 to 50 years) as compared to the older ages. Lastly, it is again seen that the females display a greater difference between the two causes of death. This result explains the impact of HIV/AIDS on the younger population as compared to the older population and also in terms of gender because it seems females are adversely affected by HIV/AIDS as compared to males.

4.3. Joint Distribution for Non-HIV/AIDS against HIV/AIDS Death Rates for Males and Females

Figure 5 presents the joint distribution of HIV/AIDS and non-HIV/AIDS death rates for males and females. It is a bivariate pair plot of these two causes aged over 20 years running from 2000 to 2019. Despite limited data points, we can observe a positive correlation between these two causes of death within each gender. It is, however, evident that the relationship between this pair is asymmetric and non-linear in structure. Additionally, the female case is observed to be more erratic as compared to its male counterpart. The correlation structure observed by these exploratory results suggests a positive relationship between these two categorized causes of death with both time, age, and gender; however, it is critical to note that we have treated these two categorized causes of death as independent of each other. In reality, this is not the case. To overcome this challenge, a dependency model will first be implemented because a trivial correlation procedure will not fully explain this phenomenon. Therefore, applying a copula structure that caters to dependent causes of death may help better understand the dependency level of the specific causes of death based on age, gender, and time.

4.4. Marginal Distributions for HIV/AIDS and Non-HIV/AIDS for Males and Females

Prior to determining the copula structure, it is prudent first to determine the individual univariate distributions of the two variables. This is because a copula structure is basically a joint multivariate representation of two marginal distributions and their copula. In our case, the two univariate distributions are the non-HIV/AIDS and HIV/AIDS causes of death. Determination of these two distributions is not a trivial matter, and thus a visual and estimation approach will be undertaken. Using the fitdistrplus package in R (Delignette-Muller and Dutang 2015), we achieve both objectives. The output will entail the Cullen and Frey graphs together with the fitted marginals using the maximum likelihood estimation method. The procedure observes from the data the level of kurtosis and skewness and then estimates possible theoretical univariate distributions. For instance, the normal, uniform, exponential, logistic, beta, lognormal, and gamma distributions are potential marginal (univariate distribution) selections. As shown in Figure 6, the estimated marginal distribution for HIV/AIDS for males tends to be a uniform or beta distribution. Similarly, for females, the same is confirmed for HIV/AIDS, as seen in Figure 7. On the other hand, Figure 8 and Figure 9 show the estimated marginal distributions for non-HIV/AIDS for both males and females. As demonstrated by these two graphs, the uniform and beta distributions gave the closest fits for the males and the uniform, beta, and normal distributions for the females.

4.5. Bivariate Copula Analysis

Using the proposed marginals for males (uniform and beta) and females (uniform, beta, and normal), a bivariate copula analysis was performed using the vineCopula package in R, which entailed fitting all the proposed bivariate copula structures as listed in Table 1. The results for the males and females were thus presented in Table 3. The rotated Joe copula (180 degrees; survival Joe) was observed to be the fitted copula structure for the males and females. The optimal selection is seen to be the minimal AIC and BIC and maximum log-likelihood as expected. The parameters achieved for males are Survival Joe (par = 4.77, tau = 0.66) while those for females are Survival Joe (par = 3.29, tau = 0.55).
The observed correlation structure for the males is higher than that of the females; this is presented by a Kendall’s tau of 0.67 against 0.55, respectively. This lower dependency level confirms the evidence of the erratic nature of the relationship between HIV/AIDS and non-HIV/AIDS for females, as seen in Figure 4 and Figure 5. Additionally, the selected marginal distributions are also identified as ideal based on the results in Table 4 and Table 5 for males and females, respectively. It was noted that the beta marginals were suitable for both males and females. Visually, Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 confirm these results after 500 simulations.

4.6. Independence versus Dependence Assumption

These results establish the fact that treating causes of death independently may lead to erroneous conclusions. A rank-based dependence measure would observe that the correlation structure between HIV/AIDS and non-HIV/AIDS is 0.6 and 0.43 for males and females, respectively, and 0.64 and 0.39 based on a Pearson linear correlation coefficient, as shown in Table 4 and Table 5. The proximity implies that these two dependence measures are generally similar. According to Kendall’s tau, the actual measure of concordance between the two causes of death is 0.669 for males and 0.552 for females. Concordant pairs mean that higher rates of one variable are correlated with the higher rates of the other pair, while discordant pairs signify the opposite. Kendall’s tau values close to zero means that the dependence is low or even that the variables are independent; conversely, if they are close to one, then this pair is dependent.

4.7. Application of Cause-Specific Mortality Models to the Sensitivity to Life Expectancy

We apply the findings of this approach to the Kenyan mortality period life table for the year 2019 as an example. The construction of the life table and the cause-eliminated life table will follow the approach of (Arias et al. 2019; Keyfitz et al. 1972; Chiang 1968), and thus the life expectancy quantity will be obtained as given in Section 3 of this study.

4.8. Life Expectancy Gain/Loss Analysis

Scenarios from simulating the obtained copula structure would aid in fitting the two causes. The determined copulas for males and females were applied and adjusted to the constructed table and the life expectancy at age 20, e 20 , observed for all-cause mortality, non-HIV/AIDS cause mortality assuming independence, and non-HIV/AIDS cause mortality allowing dependence. The results are given in Table 6 and Table 7.

4.9. All-Cause Mortality

It is evident that the life expectancy for all-cause mortality has had a consistent increase across the years 2000, 2010, and 2010, as expected for males and females. This is due to the general decline in mortality attributed across the years from 2000 to 2019.

4.10. Non-HIV/AIDS Mortality (Assuming Independence and Allowing Dependence)

Eliminating HIV/AIDS from the causes of death in the early years, such as the year 2000, has the greatest impact because it was during this year that mortality rates were at their highest and life expectancies were at their lowest for males and females. This scenario is evident for both the independence and dependence cases; however, there exist significant differences between the two approaches across years, age, and gender. Based on years, there is a consistent reduction in losses across the years probably due to the fading effect of risks posed by HIV/AIDS. With regard to age, life expectancy losses are experienced in younger years as compared to older ages, as corroborated by Nall et al. (2019). This could be due to two reasons. Fewer data is available at advanced ages, and HIV/AIDS risk is highly prevalent among the young. Based on gender, females experienced the highest losses as compared to males due to the fact that prevalent due to HIV/AIDS is highest among the female population (Sia et al. 2014). Actuaries make adjustments to the life table based on historical data, and if they are to assume independence, then one may overestimate or underestimate the effect of HIV/AIDS on the life table. Using copulas, one may make adjustments to the actual correlation structure of the causes of death and thus quantify its change to the life tables

4.11. Limitation of the Study

The study used 20-year historical mortality data, which is generally insufficient in the majority of mortality studies; however, the dynamic nature of mortality due to mortality shocks would enable this approach to be plausible and be an alternative approach. Furthermore, the application of this model to life tables did not consider smoothing of the mortality rates, which is a common practice in life table construction.

5. Conclusions and Further Work

The aim of the research was to determine an appropriate bivariate copula structure that would fit HIV/AIDS cause of death and non-HIV/AIDS cause of death for both males and females aged 20 and above. It was determined that the Survival Joe copulas were suitable to quantify the dependence structure of HIV/AIDS and non-HIV/AIDS for females and males, respectively. The impact of the causes of death on a life table and specifically life expectancy was measured by the use of this copula where indolence and dependence scenarios were simulated. This dependence model is an advance over the correlation approach because it gives a deeper understanding of the relationship between variables, specifically causes of death. Actuaries may alternatively implement the copula approach so as to quantify the inherent risk and as a way to understand the mortality and longevity risk caused by HIV/AIDS or a specific cause of death.
Further work would entail incorporating trends that would be based on the future development of the cause of death and even extending the two-dimensional case where two or more cases are observed.

Author Contributions

Conceptualization, N.B., J.K. and D.R.; methodology, N.B., J.K. and D.R.; software, N.B.; validation, N.B., J.K. and D.R.; formal analysis, N.B.; investigation, N.B.; resources, J.K. and D.R.; data curation, N.B.; writing—original draft N.B.; writing—review J.K. and D.R.; visualization, N.B.; supervision, J.K. and D.R.; project administration, N.B.; funding acquisition, N.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the African Center of Excellence in Data Science (ACE-DS) and the University of Rwanda (grant number = ESC 91).

Data Availability Statement

The data that support the findings of this study are available from https://www.who.int/data/gho/data/themes/mortality-and-global-health-estimates/ghe-leading-causes-of-death accessed 1 December 2021.

Acknowledgments

Nicholas Bett wishes to thank the African Center of Excellence in Data Science (ACE-DS) for funding this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Aas, Kjersti, Claudia Czado, Arnoldo Frigessi, and Henrik Bakken. 2009. Pair-Copula Constructions of Multiple Dependence. Insurance: Mathematics and Economics 44: 182–98. [Google Scholar] [CrossRef]
  2. Alai, Daniel H., Severine Gaille, and Michael Sherris. 2013. Modelling Cause-of-Death Mortality and the Impact of Cause-Elimination. UNSW Australian School of Business Research Paper. Sydney: UNSW Australian School of Business Research. [Google Scholar]
  3. Arias, Elizabeth, Jiaquan Xu, and Kenneth D. Kochanek. 2019. United States Life Tables, 2016. Available online: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_04-508.pdf (accessed on 20 September 2022).
  4. Arnold, Séverine, and Michael Sherris. 2015. Causes-of-Death Mortality: What Do We Know on Their Dependence? North American Actuarial Journal 19: 116–28. [Google Scholar] [CrossRef]
  5. Bedford, Tim, and Roger M. Cooke. 2002. Vines—A New Graphical Model for Dependent Random Variables. The Annals of Statistics 30: 1031–68. [Google Scholar] [CrossRef]
  6. Beltrán-Sánchez, Hiram, Samuel H. Preston, and Vladimir Canudas-Romo. 2008. An Integrated Approach to Cause-of-Death Analysis: Cause-Deleted Life Tables and Decompositions of Life Expectancy. Demographic Research 19: 1323. [Google Scholar] [CrossRef]
  7. Bengtsson, Tommy, and Nico Keilman. 2019. Old and New Perspectives on Mortality Forecasting. Berlin: Springer Nature. [Google Scholar]
  8. Bett, Nicholas, Juma Kasozi, and Daniel Ruturwa. 2022. Temporal Clustering of the Causes of Death for Mortality Modelling. Risks 10: 99. [Google Scholar] [CrossRef]
  9. Bryant, John, and James J. Dignam. 2004. Semiparametric Models for Cumulative Incidence Functions. Biometrics 60: 182–90. [Google Scholar] [CrossRef]
  10. Chiang, Chin Long. 1968. The Life Table and Its Construction. Introduction to Stochastic Processes in Biostatistics, 198–214. [Google Scholar]
  11. Delignette-Muller, Marie Laure, and Christophe Dutang. 2015. Fitdistrplus: An R Package for Fitting Distributions. Journal of Statistical Software 64: 1–34. [Google Scholar] [CrossRef]
  12. Dissmann, Jeffrey, Eike C. Brechmann, Claudia Czado, and Dorota Kurowicka. 2013. Selecting and Estimating Regular Vine Copulae and Application to Financial Returns. Computational Statistics & Data Analysis 59: 52–69. [Google Scholar]
  13. Elandt-Johnson, Regina C. 1976. Conditional Failure Time Distributions under Competing Risk Theory with Dependent Failure Times and Proportional Hazard Rates. Scandinavian Actuarial Journal 1976: 37–51. [Google Scholar] [CrossRef]
  14. Engle, Robert F., and Clive W. J. Granger. 1987. Co-Integration and Error Correction: Representation, Estimation, and Testing. Econometrica: Journal of the Econometric Society 1987: 251–76. [Google Scholar] [CrossRef]
  15. Fergany, Nader. 1971. On the Human Survivorship Function…-Google Scholar. Available online: https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Fergany+%281971.+%22On+the+Human+Survivorship+Function+and+Life+Table+Construction%2C%22+Demography8%283%29%3A331-334%29&btnG= (accessed on 20 September 2022).
  16. Fréchet, Maurice. 1951. Sur Les Tableaux de Corrélation Dont Les Marges Sont Données. Annal Université Lyon, 3^e serie, Sciences, Section A 14: 53–77. [Google Scholar]
  17. Kaishev, Vladimir K., Dimitrina S. Dimitrova, and Steven Haberman. 2007. Modelling the Joint Distribution of Competing Risks Survival Times Using Copula Functions. Insurance: Mathematics and Economics 41: 339–61. [Google Scholar] [CrossRef]
  18. Keyfitz, Nathan, Samuel H. Preston, and Robert Schoen. 1972. Inferring Probabilities from Rates: Extension to Multiple Decrement. Scandinavian Actuarial Journal 1972: 1–13. [Google Scholar] [CrossRef]
  19. Kjærgaard, Søren, Yunus Emre Ergemen, Malene Kallestrup-Lamb, Jim Oeppen, and Rune Lindahl-Jacobsen. 2019. Forecasting Causes of Death by Using Compositional Data Analysis: The Case of Cancer Deaths. Journal of the Royal Statistical Society: Series C (Applied Statistics) 68: 1351–70. [Google Scholar] [CrossRef]
  20. Li, Hong, and Yang Lu. 2018. Modeling Cause-of-Death Mortality Using Hierarchical Archimedean Copula. Scandinavian Actuarial Journal 2019: 1–26. [Google Scholar] [CrossRef]
  21. McNeil, Alexander J., Rüdiger Frey, and Paul Embrechts. 2015. Quantitative Risk Management: Concepts, Techniques and Tools-Revised Edition. Princeton: Princeton University Press. [Google Scholar]
  22. Mueller, Steve. 2004. Strategically Managing AD&D Risk. Risk Management 51: 40. [Google Scholar]
  23. Murphy, Sherry L., Kenneth D. Kochanek, Jiaquan Xu, and Elizabeth Arias. 2021. Mortality in the United States, 2020. Available online: https://stacks.cdc.gov/view/cdc/112079 (accessed on 20 September 2022).
  24. Nall, Allison, Tiffany Chenneville, Lindsey M. Rodriguez, and Jennifer L. O’Brien. 2019. Factors Affecting HIV Testing among Youth in Kenya. International Journal of Environmental Research and Public Health 16: 1450. [Google Scholar] [CrossRef]
  25. Nelsen, Roger B. 2007. An Introduction to Copulas. Berlin: Springer Science & Business Media. [Google Scholar]
  26. Richmond, Douglas R. 2009. Drugs, Sex, and Accidental Death Insurance. Tort Trial & Insurance Practice Law Journal 2009: 57–96. [Google Scholar]
  27. Schober, Patrick, Christa Boer, and Lothar A. Schwarte. 2018. Correlation Coefficients: Appropriate Use and Interpretation. Anesthesia & Analgesia 126: 1763–68. [Google Scholar] [CrossRef]
  28. Sia, Drissa, Yentéma Onadja, Arijit Nandi, Anne Foro, and Timothy Brewer. 2014. What Lies behind Gender Inequalities in HIV/AIDS in Sub-Saharan African Countries: Evidence from Kenya, Lesotho and Tanzania. Health Policy and Planning 29: 938–49. [Google Scholar] [CrossRef] [PubMed]
  29. Sklar, Abe. 1973. Random Variables, Joint Distribution Functions, and Copulas. Kybernetika 9: 449–60. [Google Scholar]
  30. Sklar, M. 1959. Fonctions de Repartition an Dimensions et Leurs Marges. Publications de l’Institut de statistique de l’Université de Paris 8: 229–31. [Google Scholar]
  31. Stander, Y. S., D. J. Marais, and Ilse Botha. 2013. A Comparison of Cointegration and Copula Asset Allocation Approaches. Studies in Economics and Econometrics 37: 1–28. [Google Scholar] [CrossRef]
  32. Stracke, Andrea, and Winfried Heinen. 2020. SOA—Influenza Pandemic: The Impact on an Insured Lives Life Insurance Portfolio. Available online: https://www.soa.org/library/newsletters/the-actuary-magazine/2006/june/pub-influenza-the-impact-on-an-insured-lives-life-insurance-portfolio (accessed on 20 September 2022).
  33. Strauss, David J., Pierre J. Vachon, and Robert M. Shavelle. 2005. Estimation of Future Mortality Rates and Life Expectancy in Chronic Medical Conditions. Journal of Insurance Medicine 37: 20–34. [Google Scholar]
Figure 1. HIV/AIDS and non-HIV/AIDS distribution for males over time given by the black line (observed trend plot) and the blue line (fitted abline).
Figure 1. HIV/AIDS and non-HIV/AIDS distribution for males over time given by the black line (observed trend plot) and the blue line (fitted abline).
Risks 11 00038 g001
Figure 2. HIV/AIDS and non-HIV/AIDS distribution for females over time given by the black line (observed trend plot) and the blue line (fitted abline).
Figure 2. HIV/AIDS and non-HIV/AIDS distribution for females over time given by the black line (observed trend plot) and the blue line (fitted abline).
Risks 11 00038 g002
Figure 3. All causes and non-HIV/AIDS cause of death rates distribution for males against age.
Figure 3. All causes and non-HIV/AIDS cause of death rates distribution for males against age.
Risks 11 00038 g003
Figure 4. All causes and non-HIV/AIDS cause of death rates distribution for females against age.
Figure 4. All causes and non-HIV/AIDS cause of death rates distribution for females against age.
Risks 11 00038 g004
Figure 5. Joint distribution for non-HIV/AIDS and HIV/AIDS for males and females.
Figure 5. Joint distribution for non-HIV/AIDS and HIV/AIDS for males and females.
Risks 11 00038 g005
Figure 6. Estimated marginal distribution for HIV/AIDS for males.
Figure 6. Estimated marginal distribution for HIV/AIDS for males.
Risks 11 00038 g006
Figure 7. Estimated marginal distribution for HIV/AIDS for females.
Figure 7. Estimated marginal distribution for HIV/AIDS for females.
Risks 11 00038 g007
Figure 8. Estimated marginal distribution for non-HIV/AIDS for males.
Figure 8. Estimated marginal distribution for non-HIV/AIDS for males.
Risks 11 00038 g008
Figure 9. Estimated marginal distribution for non-HIV/AIDS for females.
Figure 9. Estimated marginal distribution for non-HIV/AIDS for females.
Risks 11 00038 g009
Figure 10. Uniform marginal distribution for males.
Figure 10. Uniform marginal distribution for males.
Risks 11 00038 g010
Figure 11. Beta marginal distribution for males.
Figure 11. Beta marginal distribution for males.
Risks 11 00038 g011
Figure 12. Uniform marginal distribution for females.
Figure 12. Uniform marginal distribution for females.
Risks 11 00038 g012
Figure 13. Beta marginal distribution for females.
Figure 13. Beta marginal distribution for females.
Risks 11 00038 g013
Figure 14. Normal marginal distribution for females.
Figure 14. Normal marginal distribution for females.
Risks 11 00038 g014
Table 1. Bivariate copula structures.
Table 1. Bivariate copula structures.
Family NameCopula Structure
Independence copula C u 1 , u 2 = u 1 · u 2 = u 1 u 2
Gaussian copula C u 1 , u 2 = Φ p Φ 1 u 1 , Φ 1 u 2   w h e r e   Φ p is the C.D.F of the bivariate normal distribution with N ( 0 , 1 ) and correlation ρ ( 1,1 )
Student t copula (t-copula) C u 1 , u 2 = t p , v ( t v 1 u 1 , t v 1 u 2 ) where t is the C.D.F of the bivariate t-distribution with mean 0 and degrees of freedom ν > 2 , with correlation ρ ( 1,1 )
Clayton copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = 1 θ t θ 1 such that θ ( 0 , )
Gumbel copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = ( l o g ( t ) ) θ such that θ ( 1 , )
Frank copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = log exp θ t 1 exp θ 1 such that θ R \{0}
Joe copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = ( l o g ( 1 ( 1 t ) θ ) such that θ [ 1 , )
Clayton-Gumbel = BB1 copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = ( t θ 1 ) δ such that θ > 0 , δ 1
Joe-Gumbel = BB6 copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = ( l o g [ 1 ( 1 t ) θ ] ) δ such that θ 1 , δ 1
Joe-Clayton = BB7 copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = ( 1 ( 1 t ) θ ) δ 1 such that θ 1 , δ 0
Joe-Frank = BB8 copula C u 1 , u 2 = φ 1 ( φ u 1 + φ u 2 ) where φ t = log 1 ( 1 δ t ) θ 1 ( 1 δ ) θ such that θ 1 , δ ( 0 , 1 ]
90 degrees rotated copulas C 90 u 1 , u 2 = u 2 C 1 u 1 , u 2 with c 90 u 1 , u 2 =  c 1 u 1 , u 2
180 degrees rotated copulas C 180 u 1 , u 2 = u 1 + u 2 1 + C 1 u 1 , 1 u 2 with c 180 u 1 , u 2 =  c 1 u 1 , 1 u 2
270 degrees rotated copulas C 270 u 1 , u 2 = u 1 C u 1 , 1 u 2 with c 270 u 1 , u 2 =  c u 1 , 1 u 2
Survival Copulas C ^ u 1 , u 2 = u 1 + u 2 1 + C 1 u 1 , 1 u 2 = P [ U 1 > u 1 , U 2 > u 2 ]
Tawn type 1 copula C u 1 , u 2 = e x p [ l o g ( u 1 , u 2 ) A ( l o g u 1 l o g ( u 1 , , u 2 ) ) ] with A w = 1 θ + ϕ w + θ w 2 + ϕ w 2 where w [ 0 , 1 ] , 0 ≤ ϕ 1 , 0 ≤ ϕ 2 , θ [ 0 , ]
Tawn type 2 copula C u 1 , u 2 = e x p [ l o g ( u 1 , u 2 ) A ( l o g u 1 l o g ( u 1 , , u 2 ) ) ] with A w = 1 ϕ 1 1 w + ( 1 ϕ 2 ) w + [ ( ϕ 1 w ) 1 θ + ( ϕ 2 ( 1 w ) ) 1 θ ] θ where w [ 0 , 1 ] , 0 ≤ ϕ 1 , ϕ 2 1 , θ [ 0 , ]
Table 2. P.D.F and C.D.F of the joint, marginal, and conditional distribution.
Table 2. P.D.F and C.D.F of the joint, marginal, and conditional distribution.
Distribution P . D . F C . D . F
Joint f ( x 1 , , x d ) F ( x 1 , , x d )
Marginal f x i , i = 1 , , d F x i , i = 1 , , d
Conditional f i j x i x j   f o r   i j F i j x i x j   f o r   i j
Table 3. Model selection results for males and females.
Table 3. Model selection results for males and females.
MalesFemales
Family Code and NamelogLikAICBIClogLikAICBIC
0 = independence copula000000
1 = Gaussian copula7.8413.69−12.693.54−5.08−4.08
2 = Student t copula (t-copula)7.78−11.56−9.573.48−2.96−0.96
3 = Clayton copula14.62−27.23−26.238.68−15.36−14.36
4 = Gumbel copula4.6−7.21−6.211.41−0.820.17
5 = Frank copula7.73−13.45−12.452.97−3.93−2.94
6 = Joe copula2.03−2.05−1.060.161.692.68
7 = BB1 copula14.61−25.22−23.238.67−13.35−11.36
8 = BB6 copula4.6−5.2−3.211.411.183.17
9 = BB7 copula14.7−25.4−23.418.68−13.36−11.37
10 = BB8 copula5.58−7.16−5.172.13−0.251.74
13 = rotated Clayton copula (180 degrees; survival Clayton)2.86−3.73−2.730.630.731.73
14 = rotated Gumbel copula (180 degrees; survival Gumbel)10.83−19.67−18.675.87−9.74−8.74
16 = rotated Joe copula (180 degrees; survival Joe’)14.72−27.43−26.449.13−16.25−15.26
17 = rotated BB1 copula (180 degrees; survival BB1)10.83−17.66−15.665.86−7.73−5.74
18 = rotated BB6 copula (180 degrees; survival BB6)14.72−25.43−23.449.12−14.25−12.26
19 = rotated BB7 copula (180 degrees; survival BB7)14.72−25.43−23.449.13−14.25−12.26
20 = rotated BB8 copula (180 degrees; survival BB8)14.72−25.43−23.449.13−14.25−12.26
104 = Tawn type 1 copula4.2−4.39−2.41.870.252.24
114 = rotated Tawn type 1 copula (180 degrees)10.1−16.2−14.214.56−5.13−3.14
204 = Tawn type 2 copula4.11−4.21−2.220.992.034.02
214 = rotated Tawn type 2 copula (180 degrees)10.6−17.21−15.227.51−11.03−9.03
Table 4. Correlation structure of the bivariate pair HIVAIDS and non-HIVAIDS for males.
Table 4. Correlation structure of the bivariate pair HIVAIDS and non-HIVAIDS for males.
MarginalsPearson Linear Correlation CoefficientKendall’s Tau = 0.66
Assuming Independence0.64176820.6
Beta0.72152730.6687808
Uniform 0.83250420.6503343
Table 5. Correlation structure of the bivariate pair HIVAIDS and non-HIVAIDS for females.
Table 5. Correlation structure of the bivariate pair HIVAIDS and non-HIVAIDS for females.
MarginalsPearson Linear Correlation CoefficientKendall’s Tau = 0.55
Assuming Independence0.38798910.4315789
Beta0.59337270.5522082
Uniform 0.71420360.5313874
Normal0.73610250.54
Table 6. Life expectancy gain/loss analysis for males.
Table 6. Life expectancy gain/loss analysis for males.
201920102000
Age (x)ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/
(Loss)
ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/
(Loss)
ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/(Loss)
2044.044.944.6−0.340.442.041.0−1.036.141.837.4−4.4
2543.144.043.7−0.339.441.140.1−1.035.240.936.4−4.5
3039.540.340.1−0.235.937.536.5−1.031.837.233.0−4.2
3535.135.835.6−0.231.733.132.3−0.828.232.829.4−3.4
4030.831.531.4−0.127.929.028.5−0.525.428.826.5−2.3
4526.927.427.4024.425.225.0−0.223.025.424.0−1.4
5023.123.523.60.121.321.721.80.120.622.121.4−0.7
5519.619.820.00.218.218.518.60.117.718.718.5−0.2
6016.216.416.60.215.115.315.50.214.815.315.50.2
6513.213.213.50.312.312.412.60.212.012.312.60.3
7010.310.410.60.29.79.79.90.29.59.710.00.3
757.87.88.00.27.47.47.60.27.37.47.60.2
805.55.55.60.15.25.25.40.25.35.35.50.2
85+3.13.13.20.13.13.13.103.13.13.20.1
Table 7. Life expectancy gain/loss analysis for females.
Table 7. Life expectancy gain/loss analysis for females.
201920102000
Age (x)ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/
(Loss)
ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/
(Loss)
ex(All Cause)ex(Non-HIVAIDS)
Independence
ex(Non-HIVAIDS)
Dependence
Gain/(Loss)
2048.048.748.90.243.646.544.7−2.237.845.639.1−6.5
2547.147.748.00.342.745.543.8−1.736.944.738.3−6.4
3043.444.044.30.339.342.040.4−1.634.441.235.7−5.5
3539.039.639.90.335.537.836.6−1.231.837.333.0−4.3
4034.835.235.60.432.133.733.1−0.629.433.330.4−2.9
4530.731.131.50.428.829.829.7−0.126.729.527.6−1.9
5026.827.027.50.525.526.126.30.224.026.024.8−1.2
5522.923.023.50.522.022.422.70.321.122.421.9−0.5
6019.119.219.60.418.518.719.10.418.018.818.6−0.2
6515.415.415.90.515.015.115.50.414.815.315.30
7011.912.012.40.411.711.812.10.311.611.912.10.2
758.98.99.20.38.78.89.10.38.88.99.10.2
806.16.16.30.26.06.06.30.36.16.26.40.2
85+3.43.43.50.13.43.43.50.13.53.53.60.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bett, N.; Kasozi, J.; Ruturwa, D. Dependency Modeling Approach of Cause-Related Mortality and Longevity Risks: HIV/AIDS. Risks 2023, 11, 38. https://doi.org/10.3390/risks11020038

AMA Style

Bett N, Kasozi J, Ruturwa D. Dependency Modeling Approach of Cause-Related Mortality and Longevity Risks: HIV/AIDS. Risks. 2023; 11(2):38. https://doi.org/10.3390/risks11020038

Chicago/Turabian Style

Bett, Nicholas, Juma Kasozi, and Daniel Ruturwa. 2023. "Dependency Modeling Approach of Cause-Related Mortality and Longevity Risks: HIV/AIDS" Risks 11, no. 2: 38. https://doi.org/10.3390/risks11020038

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop