Article

Return Based Risk Measures for Non-Normally Distributed Returns: An Alternative Modelling Approach

by Eyden Samunderu 1,* and Yvonne T. Murahwa 2
1 International School of Management (ISM), Otto Hahn Str., 19, 44227 Dortmund, Germany
2 Independent Researcher, 16 Cheshire Road, Mt Pleasant, Harare P.O. Box MP163, Zimbabwe
* Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2021, 14(11), 540; https://doi.org/10.3390/jrfm14110540
Submission received: 30 June 2021 / Revised: 29 October 2021 / Accepted: 1 November 2021 / Published: 10 November 2021
(This article belongs to the Special Issue Risk and Financial Consequences)

Abstract
Developments in the world of finance have led the authors to assess the adequacy of using the normal distribution assumptions alone in measuring risk. Cushioning against risk has always created a plethora of complexities and challenges; hence, this paper attempts to analyse the statistical properties of various risk measures in a non-normal distribution and provide a financial blueprint on how to manage risk. We argue that relying on the old assumption of normality alone in a distribution is no longer adequate and has led to the use of models that do not give accurate risk measures. Our empirical design first examined an overview of the use of returns in measuring risk and an assessment of the current financial environment. As an alternative to conventional measures, our paper employs a mosaic of risk techniques in order to demonstrate that there is no one universal risk measure. The next step involved looking at the current risk proxy measures adopted, such as the Gaussian-based value at risk (VaR) measure. Furthermore, the authors analysed multiple alternative approaches that do not take into account the normality assumption, such as other variations of VaR, as well as econometric models that can be used in risk measurement and forecasting. Value at risk (VaR) is a widely used measure of financial risk, which provides a way of quantifying and managing the risk of a portfolio. Arguably, VaR represents the most important tool for evaluating market risk as one of the several threats to the global financial system. Upon carrying out an extensive literature review, a data set was applied which was composed of three main asset classes: bonds, equities and hedge funds. The first part was to determine to what extent returns are not normally distributed. After testing the hypothesis, it was found that the majority of returns are not normally distributed but instead exhibit skewness and kurtosis greater than or less than three. The study then applied various VaR methods to measure risk in order to determine the most efficient ones. Different timelines were used to carry out stressed value at risk calculations, and it was seen that during periods of crisis, the volatility of asset returns was higher. The steps that followed examined the relationships of the variables through correlation tests and time series analysis, which led to the forecasting of the returns. It was noted that these methods could not be used in isolation. We adopted a mosaic of all the methods, from the VaR measures to studying the behaviour and relation of assets with each other. Furthermore, we also examined the environment as a whole and then applied forecasting models to accurately value returns; this gave a much more accurate and relevant risk measure compared to the initial assumption of normality.

1. Introduction

Financial markets have always been prone to exogenous shocks, and as a result, the element of risk is the dominant factor when examining and observing global patterns of the financial world. Historical crises, such as the 2007–2008 global financial meltdown, have had a massive impact on how firms construct their portfolios in an effort to mitigate risk. It has become prudent to analyse current risk measures and find better ways of measuring risk to allow for more accurate risk management and decision making. Thus, the increased volatility of financial markets over the last decade has induced researchers, policy makers and practitioners to develop and design more sophisticated risk management tools. Value at risk (VaR) has become the standard proxy used by financial analysts to measure and quantify risk. In mathematical terms, VaR is calculated as follows:
$$ VaR_{\alpha} = \alpha \, \sigma \, W $$
where α reflects the selected confidence level, σ the standard deviation of the portfolio returns and W the initial portfolio (Jorion 2003).
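For illustration, a minimal Python sketch of this parametric (variance–covariance) calculation is given below; the return series, random seed and portfolio value are hypothetical placeholders rather than data from this study.

```python
import numpy as np
from scipy import stats

def parametric_var(returns, portfolio_value, confidence=0.95):
    """One-period Gaussian (variance-covariance) VaR: quantile * sigma * W."""
    sigma = np.std(returns, ddof=1)           # sample standard deviation of returns
    z = stats.norm.ppf(confidence)            # standard normal quantile for the confidence level
    return z * sigma * portfolio_value        # loss threshold in currency units

# Hypothetical daily return series and a 1,000,000 portfolio
rng = np.random.default_rng(0)
daily_returns = rng.normal(0.0005, 0.012, size=1000)
print(parametric_var(daily_returns, 1_000_000, confidence=0.95))
```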
“Risk measurement is the foundation of risk management and hence of vital importance in any financial institution and investor” (Goodfellow and Salm 2016, p. 80). The universe of risk measures is large, and so, there is no single measure of investment risk that is correct for all purposes or for all investors. This differs based on investor needs and requirements. Risk tends to be measured in relation to price changes, which can take various forms such as relative, absolute, or log price changes. This price change is what this paper bases the risk measures on.
Conditional value at risk (CVaR), also known as Mean Excess Loss, Mean Shortfall or Tail VaR, is an alternative to VaR that attempts to deal with its predecessor’s shortcomings. It measures the losses beyond the VaR point and has better coherence with the properties of a robust risk measure. It is also referred to as expected tail loss and satisfies the properties that define a coherent risk measure. Therefore, it can be argued that it is an ideal alternative that overcomes the shortfalls of VaR because it represents the loss that is expected beyond the loss given by the VaR. CVaR is a coherent risk measure that exhibits the following properties: translation equivariance, positive homogeneity, convexity and monotonicity (Pflug 2000).
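To make the distinction concrete, the following sketch contrasts historical-simulation VaR with CVaR (the average loss beyond the VaR point) on an illustrative heavy-tailed return series; the numbers are hypothetical and not drawn from the study’s data set.

```python
import numpy as np

def historical_var_cvar(returns, confidence=0.95):
    """Historical-simulation VaR and CVaR (expected tail loss), both as positive loss figures."""
    losses = -np.asarray(returns)                    # express returns as losses
    var = np.quantile(losses, confidence)            # loss exceeded (1 - confidence) of the time
    cvar = losses[losses >= var].mean()              # average loss beyond the VaR point
    return var, cvar

rng = np.random.default_rng(1)
fat_tailed_returns = rng.standard_t(df=4, size=5000) * 0.01   # illustrative heavy-tailed returns
var_95, cvar_95 = historical_var_cvar(fat_tailed_returns)
print(f"VaR(95%) = {var_95:.4f}, CVaR(95%) = {cvar_95:.4f}")   # CVaR is at least as large as VaR
```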
VaR is such a prevalent method to estimate risk, partly due to the regulatory framework created by the Basel committee on Banking Supervision during the 1990s, which forces the banks to calculate risk adjusted measures of capital adequacy based on VaR for their portfolios. This requirement was put into place in order to mitigate banks from taking on too much financial risk (Basel Committee on Banking Supervision 2009).
A VaR estimate corresponds to a specific critical value of a portfolio’s potential one-day profit and loss probability distribution. Given their function both as internal risk management tools and as potential regulatory measures of risk exposure, it is important to quantify the accuracy of an institution’s VaR estimates. However, it has some undesirable mathematical characteristics, such as a lack of subadditivity and convexity.
VaR also tends to be popular due to its ease of use and implementation by average investors. However, emerging studies reveal that most return data are not homoscedastic but instead exhibit heteroscedasticity in the variance. This calls for the use of the autoregressive conditional heteroscedasticity (ARCH) model and the generalized autoregressive conditional heteroscedasticity (GARCH) model to capture volatility that changes over time, since these models are built around conditional heteroscedasticity (Orhan and Köksal 2011).
This study initially looks at the normality/non-normality of most of the data of returns. If this is affirmed, incorporating non-normality in risk calculations should therefore provide a better way of understanding and measuring the downside risk that is in an asset. The rest of the paper assesses models that consider this and gives a glimpse into the many options available that can be used to assess risk in non-normal distributions.

2. Literature Review

Risk measurement has evolved over time, and it began with the introduction of probabilities. The discovery and estimation of probabilities paved the way towards the first step in risk quantification. Evidence from Marrison (2002) alludes to the fact that risk is unavoidable in the financial industry, and so the best way to deal with it is to manage how it is measured and how it can provide the greatest risk-adjusted return whilst minimizing the risk exposure of the asset. In their study, Lovreta and López Pascual (2020) examined the impact of sovereign default risk by using a vector autoregressive (VAR) framework and applying Granger-causality tests to illustrate the implications of parameter instability between banks and sovereign default risk. The findings revealed that structural dependence in the financial system extends between banks and sovereign default risk volatility. The findings further revealed that structural changes are present in the short-run dynamic relations between the sovereign and banking sector default risk during the period under analysis.
A large strand of risk measurement has focused on risk aggregation concerning the risk implied by an aggregate financial position (Francq and Zakoïan 2018). Risk aggregation with dependence uncertainty commonly refers to the sum of individual risks with known marginal distributions and an unspecified dependence structure. In their study, Francq and Zakoïan (2018) investigated both market risk and estimation risk in portfolios by observing individual returns that follow a semi-parametric multivariate dynamic model with a time-varying asset composition. They developed asymptotic theory used to estimate conditional VaR and proposed an alternative risk testing approach called Filtered Historical Simulation (FHS). They concluded that by neglecting the estimation risk, practitioners might wrongly believe risk is controlled at a given level.
The universe of risk measures is large, and so there is no single measure of investment risk that is correct for all purposes or for all investors. Thus, the vast number of financial risk measures in the literature can be broadly subsumed into two categories: 1. risk as the magnitude of deviations from a target (risk of first kind), and 2. risk as necessary capital with respect to necessary premium (risk of the second kind). This differs based on investor needs and requirements. Risk tends to be measured in relation to price changes, which can take various forms, such as relative, absolute or log price changes. This price change is what this paper bases the risk measures on. The analysis is of various asset returns across different classes.
Generally, there are three specific measures used to measure the risk of individual assets, namely standard deviation, beta and duration. The volatility of asset prices is captured by the standard deviation approach, beta tackles market risk and portfolio risk measures, whilst duration measures the sensitivity of debt security prices to changes in interest rates. As the topic suggests, this paper focuses on the risk measures that are determined by the returns of the assets, and so it first assesses standard deviation, its downfalls and what alternatives can be used to make the risk measure more accurate and reliable. It is also important to note that whilst standard deviation may be the appropriate measure of risk for high-volume markets (for example, Germany, the UK and France), it is not the most reliable risk measurement in terms of volatility when considering low-volume markets. It is also imperative to note that extreme external financial shocks lead to sharp jumps in stock prices and, subsequently, in volatility patterns. Wang et al. (2020) adopted four Jump-GARCH models in order to forecast the jump diffusion volatility, which is used as a risk factor. The authors (Wang et al. 2020) considered both linear and nonlinear effects and the VaR of financial institutions by using vector quantile regressions. Evidence from the investigation revealed three interesting findings, which showed that when observing the volatility process of bank stock prices, the jump diffusion GARCH model is better than the continuous diffusion GARCH model. The jump behaviour of bank stock was seen to be heterogeneous due to the difference in the sensitivity to abnormal information shocks. The performance of support vector regression was seen to be better than that of parametric quantile regression and nonparametric quantile regression. These findings yet again show that there is no “one size fits all” risk measurement model; thus, observing and measuring risk on asset returns requires a much more bespoke approach. Tang and Su (2017) conducted an analysis of the impact of financial events on systematic risk by observing the Chinese banking system. The authors criticize the methodological use of Contingent Claims Analysis (CCA) as a theoretical model to measure systematic risk (or the beta of an asset) due to its strict theoretical assumptions and single source of risk information, as well as the fact that it limits the asset value volatility of the macro sector to being a stationary stochastic process. Instead, they proposed an alternative model by relaxing the assumptions of CCA, replacing pure diffusion with jump diffusion, and introducing a macro-jump CCA that acts as a proxy to predict early warning effects of financial events.
Liu et al. (2020) examined the estimation of conditional volatility models and proposed a model-averaging estimator for conditional volatility under a framework of zero conditional mean. The three most popular univariate conditional volatility models are the autoregressive conditional heteroskedasticity (ARCH) model of Engle (1982), the generalized ARCH (GARCH) model of Bollerslev (1986) and the exponential GARCH (EGARCH) model of Nelson (1990). Using Monte Carlo experiments, Liu et al. (2020) showed that their model-averaging forecast approach leads to better forecast accuracy than other commonly used methods.
The Gaussian distribution introduced the assumptions of a normal distribution, which inevitably ushered in the mean variance framework; commonly used measures of risk are modelled around the normal distribution. However, it has been seen that most distributions are in fact not normally distributed, and continuing to ignore this fact by maintaining the assumptions used in the current risk measures distorts outcomes and leaves the financial markets unprepared to deal with and manage the degree of risk impact.
On the contrary, most returns tend to be not normally distributed but instead have heavy or fat tails that tend to be skewed. As highlighted by the recent 2007/2008 financial crisis, there are serious deficiencies concerning the risk models used. Most of these stem from the assumptions used in inputting the models; therefore, there is a model risk that is prevalent, and it is desirable to find a model that has a lower model risk level. Risk management models will need to be improved, putting greater emphasis on stress tests, scenario analysis and the use of forecasting. Bhowmik and Wang (2020) conducted a comprehensive literature review in order to ascertain different risk models ranging across GARCH family-based models and used stock market returns as a “barometer” and an “alarm for economic and financial activities”. Therefore, in order to prevent market uncertainty and mitigate risk in the stock markets, it is fundamental that the volatility of the stock market index returns is effectively measured. This means that financial market volatility is mainly reflected in the deviation of the expected future value of assets, indicating the possibility that volatility represents the uncertainty of the future price of an asset. This uncertainty is usually characterized by variance or standard deviation. However, it is imperative to note that measuring stock market volatility is a complex process for researchers because volatility tends to cluster. In fact, volatility is seen to be a permanent behaviour of the stock market around the globe. The presence of volatility in stock prices makes it possible for risk-seeking investors to earn abnormal profits. Kumar and Biswal (2019) employed GARCH econometric models in their study, and the results confirmed the presence of volatility clustering and advantageous effects that affect the future of stock markets. When using GARCH family models to analyse and forecast return volatility, the selection of input variables for forecasting is paramount, as essential conditions will be given for the method to have a stationary solution and perfect matching (Nelson 1990).
Dixit and Agrawal (2019) also suggested that the P-GARCH model is the most suitable to predict and forecast stock market volatility for the Bombay Stock Exchange (BSE) and National Stock Exchange (NSE) of India. The authors also highlighted the fact that volatility in financial markets is reflected because of uncertainty in the price, return, unexpected events and non-constant variance, which can be measured through the GARCH models that will give insight for investment decisions.
Variance measures the average squared deviation from the expected value, but this measure does not really give the full picture or description of the risk. In addition, proponents of this approach, such as Bodie et al. (2001), suggest that even though the returns are not normally distributed, if returns of a larger portfolio are used, it begins to resemble a normal distribution as it grows. However, in the case of stock prices, these cannot be negative, and so, to say that the normal distribution will accurately represent this distribution is a stretch.
The 2007/2008 financial crisis revealed that most risk managers are moving away from the historical treatment of risk but instead are laying more emphasis on scenario analysis and stress testing (Crouchy et al. 2014). Stress testing is a risk management tool used to evaluate the potential impact on portfolio values of unlikely, although plausible events or movements in a set of financial variables (Lopez 2005). They are designed to explore the tails of the distribution of losses beyond the threshold (typically, 99%) used in the VaR analysis. For now, it is worth noting that the reason for them moving more to this approach is that they want to focus on looking at the outcome of a given adverse stress/scenario on their company or on a portfolio.
Volatility as a risk measure is conditional on past information and, by definition, the distribution must be symmetric, and thus not capable of capturing any asymmetric aspect of risk. This makes this risk measure more useful during normal market conditions, but less helpful during crises when the market does not behave according to the normality assumptions (Billio et al. 2016). However, the relevance of volatility forecasting for financial risk management has drawn a significant level of interest in the literature. This volatility forecastability tends to vary with horizon, and different horizons are relevant in different applications. The estimation of volatility is a key input for calculating VaR because VaR directly depends on the expected volatility, time horizon and confidence interval for the continuous returns under analysis (Lelasi et al. 2021).
The VaR framework can be defined as a risk-measuring framework developed by financial market professionals as a means of measuring and comparing risk inherent in different markets. It describes the tail loss that can occur over a given time period resulting from its exposure to market risk, at a given confidence level. “It is the loss in market value of an asset over a given time period that is exceeded with a given probability” (Baker and Filbeck 2015). In other words, at a given time, an investor wants to know what he may stand to lose at a set confidence level, and ideally, this is what VaR aims to answer. Thus, VaR models were sanctioned for determining market risk capital requirements for large banks and international banking authorities through the 1996 Market Risk Amendment Basel Accord. Spurred on by these developments, VaR has become a standard measure of financial market risk used by financial and even non-financial firms. VaR forecasts are often used as a testing ground when fitting alternative models for representing the dynamic evolution of time series of financial returns.
The downside of using VaR as a risk management tool is that it is most accurate for the evaluation of very short periods, such as a one-day period, and becomes more inaccurate the longer the time period (Krause 2003). Another limitation of VaR is that it does not give any further details on the losses beyond this VaR point. This is a limiting factor because such extreme risk, which is unaccounted for, can even result in the failure of the business or portfolio. VaR also assumes that historical correlations remain unchanged, even in times of market stress, which is an incorrect assumption that can be shown in the data analysis (Baker and Filbeck 2015). VaR usually underestimates ex post volatility. Several studies have tested the performance of VaR in volatility forecasts and tested its limits against ex-post benchmarks (Nieto and Ruiz 2016; Barns et al. 2017). These studies concluded that simpler volatility specifications produce better VaR forecasts. For example, the popular VaR methodology based on a normality assumption on the conditional distribution of returns often leads to the underestimation of financial losses, as fat tails are not accounted for.
Jorion (2003) reiterates how VaR is an incomplete measure and may need to be used in conjunction with other risk measures to make it more accurate given the shortcomings highlighted in the above discussions. The prominent downside of using VaR measurements is that it is most suitable and accurate for the evaluation of very short periods, such as one-day periods. It operates under the assumption that returns are normally distributed, and so it becomes a more inaccurate measure as the period lengthens. Since it works under “normal market conditions”, it can be misleading for measures with a longer time span, which do not exhibit normal conditions during abnormal market periods. Other factors, such as volatility estimates, can also give inaccurate data as they are subject to human bias, further making this proxy measure inaccurate.
As a decision tool, VaR lacks certain desirable mathematical properties, such as convexity and subadditivity, and it gives only the point at which the value at risk lies, failing to provide further details beyond this VaR point. Properties of risk measures can be formulated in terms of preference structures induced by dominance relations (see Fishburn 1980). This is detrimental because such extreme risk, which is not accounted for, can have a severe impact on the business or lead to portfolio failure.
Stress testing is the second most commonly used risk management tool together with VaR, which is preferably used when assessing the investment strategy of a portfolio, to determine portfolio risk. It can also be used in determining hedging strategies, which will reduce portfolio losses (Van Dyk 2016). Thus, stress tests conducted in the context of a risk model can provide a useful alternative or complement to the current ad hoc methods of stress testing (Alexander and Sheedy 2008).
Areas that may have been neglected in stress testing, and which may have contributed to the recent crisis, include the failure to incorporate risks such as securitization risk in the stress testing process. According to Van Dyk (2016), the stress tests that were carried out were unable to recognize the risk inherent in structured products. These risks would differ across asset classes, such as between equities and bonds.

2.1. Suggested Alternatives for Non-Normal Distributions—Unconditional Distributions

There are two ways that modelling methods can be classified: as unconditional distributions, which are time independent, and conditional distributions, which are time dependent. Those that are classified as unconditional are those used under the normality assumption, and they assume that distributions of returns are independent of each other and tend to be independent of their past data. These include different variations of VaR. The conditional distribution approach is a newer modelling method designed to cater for non-normal distributions and address the shortcomings of the normal distribution to accurately model returns and risk. This method disputes the idea that returns are identical and independent, and models used in this category include GARCH and stochastic volatility, which are time-dependent processes. A benefit of this approach is that the models account for volatility clustering, which is a frequently observed phenomenon among return series. Hansen and Lunde (2005) made a comparison of ARCH-type models in terms of their ability to describe variance. They tested different volatility models in order to ascertain a better and more robust description of financial time series. In total, 330 GARCH-type models were compared using DM–USD exchange rate data and daily IBM returns. Findings revealed that there was no evidence to support the notion that GARCH is inferior to other models, and the models that performed better were the ones that accommodated a leverage effect. This argument is consistent with Kumar and Biswal (2019). This view is also mirrored in the work of Kilai et al. (2018), who acknowledge the shortcomings of GARCH-normal because it underestimates risk.
Conditional value at risk (CVaR)/expected tail loss/expected shortfall (ES) are alternatives to VaR and try to deal with their predecessor’s shortcomings. These measure the losses beyond the VaR and have better coherence with the aforementioned desired properties of a good risk measure, two of which are convexity and monotonicity, which are there to ensure a desirable optimum exists (Baker and Filbeck 2015). CVaR modelling is cast in terms of the regression quantile function—the inverse of the conditional distribution function. In principle, VaR and CVaR measure different properties of the distribution. VaR is a quantile and CVaR is a conditional tail expectation. The two values only coincide if the tail is cut off. However, the problem of choosing between VaR and CVaR, especially in financial risk management, has been a popular basis for discussion. The choice is affected by differences in mathematical properties, the stability of statistical estimation, the simplicity of optimization procedures, acceptance by regulators, etc.
In their early work, Rockafellar and Uryasev (2000) focussed on minimizing CVaR, using a technique for portfolio optimization which calculated VaR and optimized CVaR simultaneously. This technique is fundamental for investment companies, brokerage firms, mutual funds, etc., and uses a combination of analytical or scenario-based methods. Several case studies showed that risk optimization with the CVaR performance function and constraints can be carried out for large portfolios and a large number of scenarios with relatively small computational resources. It is also possible to dynamically hedge CVaR with options, because CVaR takes into account the average loss exceeding VaR. CVaR is considered a coherent risk measure (Acerbi and Tasche 2002) and has superior mathematical properties compared to VaR. Because CVaR is a so-called “coherent measure”, the CVaR of a portfolio is, for example, a continuous and convex function with respect to positions in instruments, whereas the VaR may even be a discontinuous function (Sarykalin et al. 2014). The theory of coherence requires a measure of risk to satisfy four mathematical criteria: (1) translation invariance, (2) positive homogeneity, (3) monotonicity and (4) subadditivity (Artzner et al. 1999; Sheppard 2013). Risk management conducted using CVaR functions can be performed efficiently, and CVaR can be optimized and constrained with convex and linear programming methods. Overall, CVaR provides an adequate picture of the risks reflected in extreme tails, which is a very important property if the extreme tail losses are correctly estimated. Broadly speaking, problems of risk management with VaR and CVaR can be classified as falling under the heading of stochastic optimization.
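Because CVaR can be optimized with linear programming, a minimal sketch of the Rockafellar and Uryasev (2000) formulation is shown below, assuming SciPy's linear-programming solver; the scenario matrix and asset count are purely illustrative and not the study's data.

```python
import numpy as np
from scipy.optimize import linprog

def min_cvar_weights(scenarios, beta=0.95):
    """Minimise portfolio CVaR_beta over long-only, fully invested weights
    using the Rockafellar-Uryasev linear-programming formulation."""
    m, n = scenarios.shape                                  # m return scenarios, n assets
    # Decision vector: [w_1..w_n, alpha, u_1..u_m]
    c = np.concatenate([np.zeros(n), [1.0], np.full(m, 1.0 / ((1.0 - beta) * m))])
    # u_i >= -r_i.w - alpha   <=>   -r_i.w - alpha - u_i <= 0
    A_ub = np.hstack([-scenarios, -np.ones((m, 1)), -np.eye(m)])
    b_ub = np.zeros(m)
    A_eq = np.concatenate([np.ones(n), [0.0], np.zeros(m)]).reshape(1, -1)  # weights sum to one
    b_eq = np.array([1.0])
    bounds = [(0, None)] * n + [(None, None)] + [(0, None)] * m
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds, method="highs")
    return res.x[:n], res.fun                               # optimal weights, minimised CVaR

rng = np.random.default_rng(2)
scenario_returns = rng.normal(0.0005, 0.01, size=(2000, 4))  # purely illustrative scenarios
weights, cvar = min_cvar_weights(scenario_returns, beta=0.95)
print(weights.round(3), round(cvar, 4))
```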
By using this measure, it is possible to handle extreme events and to measure risk more precisely and more easily. Instead of focusing only on what is likely to happen, this is achieved through the examination of conditional expectations, hence the name. Expected shortfall has been one of the methods proposed as an alternative to VaR. For instance, when imposing limits on traders, a firm relying on VaR will assume too much risk because it holds less capital to cover it, which then puts the company at risk (Danielsson et al. 2005).
As the economic financial meltdown of 2008–2009 demonstrated, VaR fares poorly during periods of market distress (Chen 2014). Basel 2.5 addressed some of the shortcomings of VaR and added stressed VaR (SVaR). SVaR is viewed as a response to the pro-cyclicality of classical VaR. This is “a forward-looking measure of portfolio risk that attempts to quantify extreme tail risk calculated over a long time horizon” (Berner 2012). Its aim is to provide a wider perspective of various inherent risks such as market and credit risk, as well as gap risks and jumps. This is because it analyses a time-period that experienced extremes and takes into account the sudden changes. In short, stressed VaR is used to obtain an idea of the possible losses likely to occur given worse market conditions. SVaR corrects various deficits of the ordinary VaR in times of market stress and it employs the Gaussian (normal) probability formalism in a completely different way than VaR and is designed to account for tail risk and collective behaviour (Billio et al. 2016). The two main properties of SVaR are “fat tail volatilities” that account for outlier events in the risk factors and stressed correlations between risk factors that account for collective market participant behaviour in stressed markets (Dash 2012).
Lichtner (2019) emphasized that what is fundamental to any SVaR model is the choice of the return-type model for each risk factor. In his study, he proposed a generalized return model. This is because of the sensitive nature of SVaR numbers to the chosen return-type model, and researchers have to make prudent choices on which return type to adopt. The findings reveal interesting, different SVaR dynamics for each return type such as absolute, relative, shifted relative, log, etc.
Under the Basel III framework, it is a regulatory requirement that SVaR be calculated for banks and financial institutions (European Banking Authority 2012). The stated benefit of using it is that it counteracts the pro-cyclicality of the ordinary VaR. Applying this to asset classes or portfolios will have positive benefits over the generic VaR. The European Banking Authority (2012) stated that historical data covering a continuous 12 months must be used or, if less, the data must cover the period during which the portfolio was under stress. There is no weighting of historical data when determining the relevant historical period or calibrating the stressed VaR model, because weighting the data in a stressed period would not give a true reflection of the potential stressed losses that could occur for an asset or portfolio. It is assumed that a stressed VaR can, in exceptional cases, be smaller than VaR, and so this is tested as the authors compare some VaR variations. In the Basel framework, it is also recommended that there should be ongoing monitoring of stressed VaR relative to VaR. The ratio between the two measures identified at the beginning of the monitoring period should be used as a reference for continued monitoring. Should the ratio decrease significantly, this signals a potential need to review the stress period, and should the ratio of stressed VaR to VaR fall below one, this should be treated as a warning signal that may warrant a review of the stressed period.
Marginal VaR (MVaR) is another variation of VaR and captures the additional amount of risk contributed by new investments. This allows portfolio managers to understand the impact of adding or removing positions in their portfolios. Volatility measures the uncertainty of the return of an asset when it is considered in isolation; when the asset belongs to a portfolio, what is of critical importance is its contribution to the portfolio risk. The effect of small changes in a part of the portfolio on the portfolio VaR is measured by marginal value at risk (MVaR).
Incremental value at risk (IVaR) is yet another approach adding onto the traditional VaR approach. It has become a standard tool for making portfolio-hedging decisions, in particular, hedging and speculating with options and reducing the risk in a risk return analysis (Mina 2002). In theory, IVaR is a metric that measures the contribution, in terms of relative VaR, of a position or a group of positions with respect to the total risk of a pre-existent portfolio. Its aim is to calculate the worst-case scenario most likely to occur for a portfolio in a particular given period. Investors can therefore determine which investment to undertake based on how it affects the portfolio, choosing the one with the smallest loss impact. More recently, Jain and Chakrabarty (2020) made further improvements upon the performance of a managed portfolio by proposing the use of MVaR in order to ascertain the desirability of assets for inclusion in a managed portfolio. They empirically conducted the study with S&P BSE 100 as the benchmark index with at least five different optimization problems. Thus, in order to capture the effect of the change in dollar exposure of assets on the overall portfolio risk, the use of VaR of the individual assets is not adequate, and therefore, MVaR provides a suitable fit. This means that MVaR is more amenable when it comes to identifying additional portfolio risk (Jain and Chakrabarty 2020).
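Under delta-normal assumptions, marginal VaR can be approximated from the covariance of each asset with the portfolio; the following sketch is a hypothetical illustration rather than the authors' exact procedure.

```python
import numpy as np
from scipy import stats

def marginal_var(asset_returns, weights, confidence=0.95):
    """Delta-normal marginal VaR: z * cov(asset, portfolio) / sigma_portfolio,
    i.e., the change in portfolio VaR per unit of additional exposure to each asset."""
    cov = np.cov(asset_returns, rowvar=False)       # asset covariance matrix
    port_sigma = np.sqrt(weights @ cov @ weights)   # portfolio volatility
    z = stats.norm.ppf(confidence)
    return z * (cov @ weights) / port_sigma

rng = np.random.default_rng(3)
cov_true = np.array([[1.0e-4, 2.0e-5, 1.0e-5],
                     [2.0e-5, 2.0e-4, 3.0e-5],
                     [1.0e-5, 3.0e-5, 1.5e-4]])       # hypothetical covariance structure
returns = rng.multivariate_normal([0.0004, 0.0003, 0.0005], cov_true, size=1500)
weights = np.array([0.5, 0.3, 0.2])
print(marginal_var(returns, weights))                 # per-asset marginal contributions to VaR
```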
Lower Partial Moment (LPM) is a measure of downside risk, calculated as the average of squared deviations below a given target return. The LPM can also supplement the VaR, applying specifically to the left tail of the distribution beyond the VaR. It can be the average across all negative deviations from the VaR or the variance of the negative deviations from a pre-defined VaR. It is, therefore, a method that places heavier weight on larger deviations from the current VaR in comparison to smaller deviations (Goodfellow and Salm 2016). An investor can select or set the risk-free rates, and by selecting the degree of the moment, one can specify the risk levels that suit the portfolio or risk needs. Goodfellow and Salm (2016) compared three different risk measures based on the same stock return data, including the portfolio variance as in the seminal work of Markowitz (1952) and the VaR based on the t copula. They concluded that the normal assumption substantially underestimates the risk faced by an organization, and therefore, they discredited the use of risk measures based on the normal assumption.
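A small sketch of an LPM calculation of a chosen order below a target return is given below; the target, order and return series are illustrative assumptions, and conventions differ on how the average is normalized.

```python
import numpy as np

def lower_partial_moment(returns, target=0.0, order=2):
    """Lower partial moment: average of (target - r)^order, counting only returns below the target.
    Here the average is taken over all observations; some conventions divide only by the
    number of below-target observations."""
    downside = np.clip(target - np.asarray(returns), 0.0, None)
    return np.mean(downside ** order)

rng = np.random.default_rng(4)
monthly_returns = rng.normal(0.005, 0.04, size=240)   # hypothetical monthly return series
print(lower_partial_moment(monthly_returns, target=0.0, order=2))   # a downside "semi-variance"
```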
Multifactor and Proxy Models are another option. The difference between multifactor models and arbitrage pricing models is that the latter limit themselves to historic data, whilst the former expand the data and include other macro-economic factors in the model, such as ratios and market capitalization. Their assumptions are that market prices usually fluctuate for no specific reason. They also assume that the returns of a stock are related to its riskiness; if a stock has high returns over a period of time, it must be riskier than one with lower returns.
Fama and French (1992) found that if markets are reasonably efficient in the long term, then the market capitalization and book to price ratios were good stand-ins or proxies for risk measures. These types of multifactor models will thus do better in comparison to conventional asset pricing models to explain the differences in returns. The Capital Asset Pricing Model (CAPM) is based on the idea that not all risks should affect asset prices. The CAPM gives us insights about what kind of risk is related to return.

2.2. Suggested Alternatives for Non-Normal Distributions—Conditional Distributions

Serial correlation analysis of asset return time series—via the autocorrelation function (ACF) and partial autocorrelation function (PACF)—is used to inform volatility forecasts and falls under the conditional distribution approach. The aim of time series analysis is to decompose the data to find a trend or seasonal component. This is carried out to assess and predict, and serial correlation analysis is one such methodology. Serial correlation, also known as autocorrelation, occurs when the returns in a given period depend on those of the previous period, and it describes the relationship between observations of the same variable over successive periods. A zero correlation means the observations are independent. If otherwise, they do not evolve as a random process but rather are related to their prior values (Napper 2008).
Having asset returns that are serially correlated tends to distort an asset class’s actual risk, as serial correlation improperly smoothens the asset class’s volatility and thereby reduces risk estimates derived from time series characteristics (J.P. Morgan Asset Management 2009). A positive autocorrelation will lead to a VaR that underestimates the actual volatility of the asset, whereas a negative autocorrelation can result in an overstated volatility (Baker and Filbeck 2015).
The Copula (t and Gaussian distribution) is a more mathematical approach to risk measurement which is under the Integrated Risk Management (IRM) framework, measuring the error of normal assumption and therefore adjusting it to fit the non-normality aspects of a distribution (Embrechts et al. 2001). It takes into account events such as shocks and, under this methodology, what is known as fatal shock models, which occur and result in the destruction of the component. Other types are non-fatal shock models, whereby there is a chance of surviving the shock. Shocks can result from natural catastrophes or underlying economic events.
This method allows for the construction of models that measure risk which go beyond the standard ones at the level of dependence. This can be used to stress test different products and portfolios for extreme correlations and for the measurement of dependence. This methodology also addresses the limitations in linear correlation. However, the linear correlation tends to be preferred because of its simplicity and assumes normality. The concept of Copulas is rather abstract, in that it does not capture the joint movements in extreme values. This is where it would be prudent to consider using the next method to take into account the joint movement aspects of the non-normal distribution. This can be achieved by looking at correlation from a portfolio context whilst taking into account the variations that may evolve given different periods and under differing market conditions.
Financial markets are dynamic and constantly changing, and so it is critical to obtain data that are accurate and relevant to the period. In their BIS article, Loretan and English (2000a) suggested that when determining the appropriate time interval to use, risk managers should consider the periods of relatively high or low volatility. This is because these periods will have relevant information on the true and underlying asset relationship. If volatility and correlation are not adequately assessed, it could cause problems for risk managers when they want to stress test or look at worst-case scenarios. The thinking behind this is that higher correlations between assets generally come with added increased volatility.
This can be performed by grouping a period of high volatility and assessing the historic correlation, and this can be applied to any future high volatility scenario projections. A time under normal conditions must also be assessed to see how the correlation of volatiles is under this. It would be good to assess the correlations in these periods and to determine whether or not they increase or decrease with increased volatility. Thus, the main challenge is how to model risk to accommodate the time-varying nature of asset return volatilities (Loretan and English 2000b).
The main reasons for forecasting can be classified as being for asset allocation, for risk management, and for taking bets on future volatility. When looking at risk management, risk is measured to find the potential losses that an asset or portfolio may incur in the future (Reider 2009). To ensure this is possible, there will be a need to have estimates of the future correlations of the assets, as was described above in looking at correlations in risk measures. Following the seminal work of Box and Jenkins (1970), Autoregressive Moving Average (ARMA) models have become the standard tool for modelling and forecasting univariate time series. The ARMA framework can also be extended in order to allow for the possibility of heteroscedasticity. It is also necessary to obtain estimates of the future volatilities of the assets, and this is where the ARMA and generalized autoregressive conditional heteroskedasticity (GARCH) family of models come into play. They come in various methods and hybrids, but their intention is to forecast volatility. Volatility is not the same as risk. When it is interpreted as uncertainty, it becomes a key input to many investment decisions and portfolio creations. Thus, a good forecast of the volatility of asset prices over the investment-holding period is a good starting point for assessing investment risk. However, as a concept, volatility is simple and intuitive. It measures variability or dispersion about a central tendency; the greater the deviation, the greater the volatility. Furthermore, volatility forecasts are sensitive to the specification of the volatility model, and it is paramount to establish a balance between capturing the salient features of the data and overfitting the data.
Value at risk forecasting with ARMA models is ideal in times of increased volatility. As previously introduced, autoregressive and moving average components can be used in time series analysis, and stemming from this are the models on which this paper mostly focusses for the risk forecasts of the asset classes. These are the ARMA model, which is a combination of the autoregressive model and the moving average, the GARCH model and the exponential generalized autoregressive conditional heteroscedasticity (EGARCH) model. These models are designed to deal with common characteristics of financial time series, such as thick tails, and can be used to forecast the returns. EGARCH models capture the most important stylized features of stock volatility, mainly volatility clustering, the negative correlation between volatility and returns and lognormality, under certain specifications (e.g., Bollerslev 1986).
While ARCH was developed to model the changing volatility of inflation series, the model and its later extensions were quickly adopted for use in modelling conditional volatility of financial returns. The GARCH technique is a more sensitive way to measure risk in a distribution. GARCH is useful for non-normal distributions to analyse an asset’s risk as well as market risk.
It integrates the theory of dynamic volatilities and that is used to describe variance in terms of what is currently observable (Engle 2004). This method takes the weighted average of past squared errors to form a weighted variance. By using the weights, with the most weight going towards the most recent information, it gives more influence to current data.
GARCH was developed to tackle the belief and fact that returns tend to be unpredictable and that they have fat tails. The main advantage of using GARCH models is that they capture both the heavy tails of a return series and the volatility clustering (Mabrouk 2016). According to the model, where there is high volatility, it is likely to remain high, and where there is low volatility, it will remain low, given the fact that the periods will be time limited. GARCH adds onto the ARCH model, whose process produces extremes beyond what would naturally be expected from a normal distribution, since the extreme values during the high volatility period are larger than could be anticipated from a constant volatility process (Engle 2004).
In using the GARCH approach, the analysis uses the exponentially weighted moving average (EWMA), which solves the problem of weights, whereby the recent return has more influence on the variance than that from the previous month. EWMA as a measure of volatility uses a smoothing parameter called Lambda, which is there to ensure that the variance that is calculated is more biased towards recent data. The weights exponentially decline at a rate throughout time.
The use of EWMA in this model is more fitting because it incorporates external shocks better than equally weighted moving averages. These estimates are much more efficient than the standard simple moving averages because they provide a more realistic measure of current volatility. However, there are some downsides to the model, such as the use of EWMA. For example, it assumes that the trend will continue in the long term, and so the forecasts determined become increasingly inaccurate as the forecast horizon expands. However, for scenarios with medium volatility of volatility, there is very little penalty for using EWMA regardless of the volatility generating process. The robust structure of EWMA appears to contribute to its greater forecasting accuracy than flexible GARCH. Brooks (2014) shared the logic that GARCH models are designed to capture the volatility clustering effects in the returns. In particular, he mentioned the Exponential GARCH model (EGARCH), about which he says on the plus side, its application ensures that the conditional variance will always be positive.
The stochastic volatility model (SV) is a model that is similar to the GARCH model but differs in that it is parameter driven by latent shocks, whereas GARCH is observation driven (Diebold and Rudebusch 2012). This makes it easier to analyse. The logic behind this model is that it considers issues such as time deformation (the idea that calendar time does not have to move in tandem with financial markets). Calendar time is constant and predictable unlike economic time, which may slow down or speed up. SV model forecasts are noticeably more accurate than GARCH in scenarios with very high volatility of volatility and a stochastic volatility generating process.
Another variation is the realized volatility. This method differs from the GARCH and SV approach, which looked at conditional expectations of discrete time squared returns. Realised volatility looks at realizations and not expectations. This realized volatility facilitates superior risk measurement, which can translate into superior risk management. By using RV, the model is based on continuous time, evolving along the way to allow for more accurate volatility forecasts. It allows for jumps and can be used to improve GARCH models and SV models.

3. Methodology

Our research design attempted to answer the research questions proposed earlier in this study. In view of the research problem, the proposed study applies a quantitative research design, drawing the analysis from multiple risk model testing methods. Through the testing methods derived from the GARCH family, we ascertain that the traditional assumptions of normality in a distribution do not assist financial decision makers in determining the appropriate models that give accurate risk measures. The normal distribution assumption is based on forecasting the future price changes of assets and basing these forecasts on the belief that prices follow a random walk. A random walk is described as a movement of variables or numbers that do not follow a specific order, and two types of random walk theories can be identified: the first is based on single asset prices and the other on the fundamental model of asset price dynamics. To give our research design more richness, we also employed measures of dispersion in order to ascertain the distance between values within the distribution (range, interquartile range). Overall, three major ways of measuring risk are discussed, namely the variance–covariance approach, historical data and Monte Carlo simulations.
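To complement the variance–covariance and historical approaches sketched earlier, a minimal Monte Carlo VaR example (with hypothetical drift and volatility parameters, not estimates from this study) could look as follows.

```python
import numpy as np

def monte_carlo_var(mu, sigma, portfolio_value, horizon_days=1,
                    confidence=0.95, n_sims=100_000, seed=0):
    """Simulate end-of-horizon portfolio values under a simple lognormal price model
    and read the VaR off the simulated loss distribution."""
    rng = np.random.default_rng(seed)
    drift = (mu - 0.5 * sigma ** 2) * horizon_days
    shocks = sigma * np.sqrt(horizon_days) * rng.standard_normal(n_sims)
    simulated_values = portfolio_value * np.exp(drift + shocks)
    losses = portfolio_value - simulated_values
    return np.quantile(losses, confidence)

# Hypothetical daily drift and volatility for a 1,000,000 portfolio
print(monte_carlo_var(mu=0.0003, sigma=0.015, portfolio_value=1_000_000))
```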
Choice of Data sets
Our research study adopted an empirical approach by examining a data set that represents the following asset classes: equities (138), hedge funds (13) and bonds (+20). All the data used in this study were obtained from Bloomberg. There was a diverse range from different classifications, sectors, industry classes, asset types, geographic regions and so forth. However, from this initial data pool, there had to be a selection of only those assets that would exist over the selected test time period, as is explained in the time period cut off section that follows.
Backtesting VaR Models
Several backtests were conducted, but the discussion on these methods is by no means exhaustive, since in this context, it is impossible to go through the variety of different methods and their applications. VaR models are only useful if they predict future risks accurately. In order to evaluate the quality of the estimates, the models should always be backtested with appropriate methods. However, a thorough discussion on every backtest method is beyond the scope of this study.
Unconditional coverage was adopted to statistically examine whether the frequency of exceptions over the time interval is in line with the selected confidence level. This was carried out to determine when portfolio losses exceeded VaR estimates.
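A minimal sketch of such an unconditional coverage check, using the Kupiec proportion-of-failures likelihood-ratio test with illustrative inputs, might look as follows.

```python
import numpy as np
from scipy import stats

def kupiec_pof_test(violations, total_obs, confidence=0.95):
    """Kupiec proportion-of-failures (unconditional coverage) likelihood-ratio test.
    Requires 0 < violations < total_obs."""
    p = 1.0 - confidence                        # expected exception rate under the null
    x, T = violations, total_obs
    pi_hat = x / T                              # observed exception rate
    log_l0 = (T - x) * np.log(1.0 - p) + x * np.log(p)
    log_l1 = (T - x) * np.log(1.0 - pi_hat) + x * np.log(pi_hat)
    lr_uc = 2.0 * (log_l1 - log_l0)             # asymptotically chi-squared with 1 df
    p_value = 1.0 - stats.chi2.cdf(lr_uc, df=1)
    return lr_uc, p_value

# e.g., 18 exceptions observed over 252 trading days against a 95% VaR
print(kupiec_pof_test(violations=18, total_obs=252, confidence=0.95))
```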
Conditional coverage tests were also conducted, examining the clustering of exceptions in order to assess not only the frequency of VaR violations but also the times at which they occur. For this part, both Christoffersen’s (1996) interval forecast test and the mixed Kupiec test by Haas (2001) were conducted. As discussed earlier, the choice of parameters in VaR calculations is not arbitrary whenever backtesting is involved. To construct a solid view on the validity of the model, a relatively low confidence level was used. According to Jorion (2003), a confidence level of 95% is well-suited to backtesting purposes. We argue that with this approach, it is possible to observe enough VaR violations within a time period. Since the software used a 95% confidence level by default, this level was adopted; the authors could otherwise also have opted to use levels of 90% and 99% and test each one individually. However, we do acknowledge that having more than one level of confidence in the backtesting process makes the testing more effective. The total number of trading days (observations) was 252, which was sufficient to produce some statistically significant backtests and was also in line with the Basel backtesting framework.
As discussed earlier, single backtests can never be enough to evaluate the goodness of a VaR model. If one test yields a decent outcome, the result should always be confirmed by another test (Haas 2001).
Various tests were conducted in order to address the research question of how to measure risk in a non-normal world. The data were clustered into generic groups based on certain characteristics, and a summary of each group was established whilst accommodating exogenous factors. The cut-off point for the data was 1995: any asset whose data did not extend back to this date was filtered out. This was carried out to ensure homogeneity, which allowed for better comparisons. The returns were calculated as simple returns rather than via the lognormal method, which is time additive and assumes a normal distribution. The skew of a lognormal distribution is positive, unlike market returns, which tend to be negatively skewed; this assumption first had to be verified in the normality tests. Associating log returns with fat-tail assumptions can also be problematic.
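As an illustration of the two return conventions, a short pandas sketch with a purely hypothetical price series is given below.

```python
import numpy as np
import pandas as pd

prices = pd.Series([100.0, 102.0, 101.0, 105.0, 103.0])     # hypothetical price series
simple_returns = prices.pct_change().dropna()                # (P_t - P_{t-1}) / P_{t-1}
log_returns = np.log(prices / prices.shift(1)).dropna()      # ln(P_t / P_{t-1}), time additive
print(pd.DataFrame({"simple": simple_returns, "log": log_returns}))
```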

3.1. Time Period

The timeline used in this analysis was a 21-year period ranging from 1995 to 2016. This time encompassed various periods of stress, ranging from the Mexican peso crisis to the EU debt crisis. These periods are very important, as they reflect the stress periods over which the returns can be observed in order to arrive at the most accurate risk measures for such situations. As mentioned in the literature review sections, calculating risk is not only based on models; one also has to take into account historic events and study trends, business cycles and other issues, so as to take a more comprehensive approach. The current period of volatility resulting from the COVID-19 effect has not been factored into the study, given that the available data are inadequate, as they are still being collected; this may be investigated further in future studies to assess COVID-19’s impact. Iuga and Mihalciuc (2020), however, analysed the impact of global shocks and highlighted the global effects of COVID-19 by developing two regression models over the time frame from 2001 to 2020 Q2. Because the data are still unfolding, we limited our analysis by not putting much focus on the effect of the COVID-19 pandemic. Nevertheless, we do acknowledge the adverse effect the pandemic has had on economies and financial markets at large.

3.2. Tests

The first test carried out was to test the non-normality (skewness, kurtosis) of the data in order to see if there was non-normality present in the returns. If this was the case, then the next step was to determine the type of non-normality in the observed returns. To assess the non-normality or normality of the data sets, the Jarque–Bera test was used, which is based on the classical measures of skewness and kurtosis (third and fourth moments of a distribution). As these measures are based on moments of the data, this test has a zero breakdown value. This test is a goodness-of-fit test of whether sample data have skewness and kurtosis that match a normal distribution. Once the distributions were determined, it was easier to select the most appropriate measure or set of measures based on the literature gathered on the various options that can be used to accurately measure risk for a given data set with a given distribution. For any distribution F with finite central moments $\mu_k$ ($k \le 3$), the skewness is defined as follows:
$$ \gamma_1(F) = \mu_3(F) / \mu_2(F)^{3/2} $$
For any distribution F with finite central moments $\mu_k$ ($k \le 4$), the kurtosis is defined as follows:
$$ \gamma_2(F) = \mu_4(F) / \mu_2(F)^{2} $$
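A minimal sketch of these normality diagnostics, assuming SciPy and an illustrative heavy-tailed return series rather than the study's data, is shown below.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
returns = rng.standard_t(df=5, size=2500) * 0.01       # illustrative heavy-tailed return series

skewness = stats.skew(returns)                          # third standardised moment
excess_kurtosis = stats.kurtosis(returns)               # zero for a normal distribution
jb_stat, jb_pvalue = stats.jarque_bera(returns)         # joint test on skewness and kurtosis

print(f"skewness = {skewness:.3f}, excess kurtosis = {excess_kurtosis:.3f}, "
      f"Jarque-Bera p-value = {jb_pvalue:.4f}")
# A small p-value rejects the null hypothesis that the returns are normally distributed.
```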
After carrying out the normality tests, the next step was to conduct the risk measure (VaR) tests. The VaR measure concept condenses the entire distribution of the portfolio returns into a single number that investors have found useful and is easily interpreted as a measure of market risk. The VaR is essentially a p-percentage quantile of the conditional distribution of portfolio returns. We selected two assets from each asset class that represent different characteristics to make our study more comprehensive and inclusive of different assets. The first part examined the various ways that directly improve the standard VaR by using its hybrids. Thereafter, the analysis focused on the application of other models to conduct volatility forecasts through the use of modelling methods suitable for conditional distributions, which are time dependent. The results were compared with those from the other variation of VaR, which is the CVaR. Theory suggests that the CVaR focusses on downside or tail risk, and so this measure is likely to be higher than that of standard VaR. Risk managers have a plethora of volatility measures to choose from when calculating VaR measures. Time series models of volatility range from exponentially smoothed and simple autoregressive models, over single-shock GARCH models, to two-shock stochastic volatility models. The benchmark measure provided in Morgan Guaranty Trust Company’s (1996) RiskMetrics sets the conditional mean constant and specifies the variance as an exponential filter:
\sigma_t^2 = (1 - \lambda)\,\varepsilon_{t-1}^2 + \lambda\,\sigma_{t-1}^2
In the GARCH family, the most widely used form is GARCH (1, 1), and it has numerous extensions. In a simple GARCH model, the conditional (squared) volatility is modelled as a function of the previous squared shock and the previous conditional variance.
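As an illustration only (the study itself used the NumXL add-in), a GARCH(1,1) can be fitted along the following lines with the Python "arch" package; the simulated percent returns stand in for an actual asset series.

import numpy as np
from arch import arch_model

rng = np.random.default_rng(1)
pct_returns = 100 * rng.normal(0, 0.05, 264)   # simulated monthly returns in percent

garch = arch_model(pct_returns, mean="Constant", vol="GARCH", p=1, q=1)
fit = garch.fit(disp="off")
print(fit.params)   # mu, omega, alpha[1] (ARCH term), beta[1] (GARCH term)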
When applying the SVaR, according to the Basel III framework, this can be calculated on top of the VaR with a 99% confidence interval. Applying this to a more practical scenario, however, involves the selection of a stress period. For this analysis, the non-stress period covered monthly returns from January 2003 to December 2005 (36 months) and the stressed period covered the financial crisis from August 2007 to December 2009 (29 months). In order to broaden the scope of the analysis, two sets of data were used for the equities over the same periods: one consisted of monthly returns, similar to what was applied to the bonds and hedge funds, and the other consisted of daily returns. This was carried out to observe whether there was much difference when SVaR and VaR are calculated using monthly versus daily returns. Another dimension of this analysis applied to the bonds: the normal-period VaR and stressed VaR were compared between one asset that is normally distributed and one that is not. The aim was to visualize the extent of the impact that the distribution of a data set has on the results of the test conducted.
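A hedged sketch of this comparison is shown below: VaR and stressed VaR are computed by historical simulation at the 99% level over the two windows named in the text. The return series here is simulated; only the window dates follow the study.

import numpy as np
import pandas as pd

rng = np.random.default_rng(2)
idx = pd.date_range("1995-01-31", "2016-12-31", freq="M")
returns = pd.Series(rng.normal(0.005, 0.04, len(idx)), index=idx)   # placeholder monthly returns

def hist_var(r, level=0.99):
    # Historical-simulation VaR: the (1 - level) quantile of the return distribution
    return np.quantile(r.dropna(), 1 - level)

var_normal = hist_var(returns.loc["2003-01":"2005-12"])   # non-stress window (36 months)
svar = hist_var(returns.loc["2007-08":"2009-12"])         # financial-crisis window (29 months)
print(f"VaR(99%) normal period = {var_normal:.2%}, stressed VaR(99%) = {svar:.2%}")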
In practice, Beta is typically estimated using parametric estimators such as ordinary least squares (OLS) because OLS is the best linear unbiased estimator. Additionally, monthly, quarterly or annual returns data are very often employed (see Kamara et al. 2018), because they are more stable and more likely to be approximately normally distributed. However, outliers, a very common problem in real asset returns, may seriously affect the performance of parametric estimators compared with non-parametric estimators.
The test for autocorrelation was another critical test undertaken because it helps in detecting dependence in returns over time. This was achieved by plotting the ACF and PACF to observe whether any correlation in an asset's returns existed. This further helps in the application of models such as the GARCH to better forecast volatility. If one examines empirical time series, it is easy to observe that large fluctuations in asset prices are more often followed by large ones, while small fluctuations are more likely to be followed by small ones. This stylized fact is known as volatility clustering. In order to view these properties and determine whether there were more large fluctuations than would be expected from a purely random process, we looked at the autocorrelations of the returns.
The autocorrelation function C(x_t, x_{t+τ}) is defined as follows:
C(x_t, x_{t+\tau}) = \frac{\langle (x_t - \langle x_t \rangle)(x_{t+\tau} - \langle x_{t+\tau} \rangle) \rangle}{\sqrt{\left(\langle x_t^2 \rangle - \langle x_t \rangle^2\right)\left(\langle x_{t+\tau}^2 \rangle - \langle x_{t+\tau} \rangle^2\right)}}
where ⟨x⟩ denotes the expectation value of the variable x.
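The lag-τ autocorrelation above can be computed directly, as in the following sketch; the series is simulated and the function is an illustrative implementation rather than the study's own code.

import numpy as np

def autocorr(x, tau):
    # C(x_t, x_{t+tau}): covariance of the series with its lagged copy,
    # normalised by the product of the two standard deviations
    a, b = x[:-tau], x[tau:]
    num = np.mean((a - a.mean()) * (b - b.mean()))
    den = np.sqrt((np.mean(a**2) - a.mean()**2) * (np.mean(b**2) - b.mean()**2))
    return num / den

x = np.random.default_rng(3).normal(size=500)                # placeholder return series
print([round(autocorr(x, lag), 3) for lag in range(1, 7)])   # first six lags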
After this, autocorrelation tests were carried out for each asset with itself and from a portfolio perspective. The most commonly used correlation test is Pearson's test, although it assumes normality. The hypothesis here was that the correlation within a portfolio of assets differs between normal periods and stress periods. The assumption is that correlations between asset classes tend to increase during high-volatility periods, unlike during normal market conditions. In a portfolio context, this could mean that the benefit of diversification may be nullified in high-stress conditions, leading to understated risk.
The next step was to assess the time series of the data to observe how the series moved and thus determine whether there was volatility clustering and what type of variance exists in each asset. In financial markets, prices of stocks and commodities fluctuate over time, producing financial time series that are of great interest to both practitioners and theoreticians when making inferences and predictions. The aim was to determine whether the volatility displayed was homoskedastic (constant variance) or heteroskedastic (time-varying variance). This helped to determine whether the data behaved as is expected in financial markets, which are usually heteroskedastic. A characteristic inherent in a robust model is that it should accurately capture the volatility clustering that exists in the time series (Lok 2015).
Regression analysis was adopted to predict future volatility. In the process of finding the ideal VaR forecaster, the first step was to calculate the AIC of each asset. The AIC indicates the best model to use in predicting volatility. The final step was to forecast risk using the three models, namely the ARMA, GARCH and EGARCH models, for each of the asset classes to see how well each model forecasts volatility for a given asset class.
In carrying out this process, other software packages such as R were considered, but due to the complex nature of the methods, the authors opted to use the Excel add-in NumXL to carry out the computations, as it is their main statistical analysis tool. Using this package, the model parameters were estimated and calibrated to find the most efficient parameters for each model. The forecasts were then plotted over a 12-month time horizon to assess the most likely volatility movements.
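As a hedged illustration of this forecasting step (the original computations were done in NumXL), a 12-month-ahead volatility forecast from a fitted GARCH-type model might look as follows; the data are simulated.

import numpy as np
from arch import arch_model

r = 100 * np.random.default_rng(11).normal(0, 0.05, 264)   # simulated percent returns

res = arch_model(r, vol="GARCH", p=1, q=1).fit(disp="off")
forecast = res.forecast(horizon=12)
vol_path = np.sqrt(forecast.variance.iloc[-1])              # forecast volatility, months 1-12
print(vol_path)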

4. Results

4.1. Jarque Bera Test—Non-Normality Test

Despite the remarkable qualities of the Shapiro–Wilk test, we adopted the Jarque Bera (JB) test, which can be applied to various general models, such as nonlinear regressions and conditional heteroskedasticity models.
When using the JB test, the aim is to test the null hypothesis of normality. A p-value greater than 0.05 leads to a failure to reject the null hypothesis, i.e., the data are taken to be normally distributed. If the p-value is less than 0.05, the null hypothesis is rejected, leading to the conclusion that the data are not normally distributed. This was applied to the data below for all three asset classes using this formula:
JB = \frac{n}{6}\left(S^{2} + \frac{K^{2}}{4}\right) \sim \chi^{2}_{\nu = 2}
where
  • S = the sample skew;
  • K = the sample excess kurtosis;
  • N = the number of non-missing values in the sample;
  • JB = the test statistic; JB has an asymptotic chi-square distribution.
In our analysis, we first derived the asymptotic distribution of the sample skewness and kurtosis coefficients of the model's standardized residuals. Afterwards, we constructed an asymptotic χ² test of normality based on these results.
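A minimal sketch of this decision rule, assuming a simulated fat-tailed sample in place of the study's indices, is given below; the manual statistic follows the formula above and is compared with SciPy's built-in test.

import numpy as np
from scipy import stats

r = np.random.default_rng(4).standard_t(df=4, size=264) * 0.05   # fat-tailed placeholder returns

n = r.size
S = stats.skew(r)
K = stats.kurtosis(r)                        # sample excess kurtosis
jb_manual = (n / 6.0) * (S**2 + K**2 / 4.0)  # JB statistic as defined above

jb_stat, p_value = stats.jarque_bera(r)
reject_normality = p_value < 0.05            # reject H0 (normality) when p < 0.05
print(jb_manual, jb_stat, reject_normality)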

4.1.1. Hedge Fund Indices

Upon carrying out the tests on the hedge fund asset class, a summary of the findings from the Jarque Bera test is shown in Appendix A. All 11 funds have returns that are not normally distributed. The skewness shows that the returns are mostly negatively skewed or close to zero. This confirms the previous hypothesis on the distribution of returns, which stated that most returns exhibit a negative skew and tend not to be normally distributed. The kurtosis of the distributions is positive, indicating that the data sets have fat tails. One particular asset with a low skewness of 0.0002 is the Dow Jones Credit Suisse Long/Short Equity Hedge Fund index, which would make one assume the distribution is normal, but its kurtosis is 4. This means that even though its skew is close to zero, it is leptokurtic, indicating a clustering of the distribution around the mean; the distribution has a higher peak and fat tails, since most of the returns are clustered around the mean. The HFRMI Index and the Dow Jones Credit Suisse Dedicated Short Bias Hedge Fund index have a kurtosis of less than 3, of 0.83 and 1.53, respectively, indicating that the distributions are platykurtic, meaning that they have lighter tails.

4.1.2. Equity Indices

The represented equity indices in Appendix B were selected to cover various diversifications, such as different geographic locations, and were either price weighted or market weighted. There were also stocks that represent small capitalization, mid-capitalization as well as large capitalization. This selection of indices ensured that the selection was broader and covered a wider range of the indices and so it was more representative of the markets as a whole. In developing these classifications, the authors also applied the World Economic Situation and Prospects (WESP), which classifies all countries into three segments, which are developed economies, economies in transition and developing economies.
Twenty-three indices were used to carry out this test. The results show that (be it capital weighted, all index weighted or price weighted) the distributions are not normally distributed, as indicated by an N. The only anomaly observed regarding normality, with a Y representing a yes, was with the four Japanese stock indices: the TPX500 Index (high capitalization), TPXSM Index (small capitalization), TPXM400 Index (mid capitalization) and the TPX100 Index (high liquidity), for which the null hypothesis is not rejected, implying that the distributions are normally distributed. This only occurs in 4 of the 23 indices observed.
For the remaining indices, a large number of different observations are made, but overall it can be seen that they do not have a normal distribution. The kurtosis is mixed, with some indices being leptokurtic and others platykurtic; regarding the differences in tails, some are heavy, implying high tail risk, and some are light, implying less risk in comparison to the heavy tails. The skewness results also differ among the indices classified as not normally distributed. Some exhibit an approximately symmetric distribution (skewness between minus 0.5 and 0.5), and some are moderately skewed (between 0.5 and 1 or between minus 0.5 and minus 1). Others are highly skewed, as shown by the indices in the data set with a skewness greater than plus or minus 1, such as the BGSMDC index. Out of these 19 indices, only 4 have a negative skew, whilst the rest are positively skewed. Generally, a negative skew implies a long left tail, which indicates a higher chance of extreme negative outcomes, while a positive skew implies a lesser likelihood of poor outcomes.

4.1.3. Bond Indices

There were around 60 bond indices, but the cut-off was made to include only those that cover the test period, since most of them were more recently issued bonds. Appendix C has the summary of the results. The results from this data set indicate that all but one of the indices are not normally distributed, which supports the non-normality argument. Kurtosis is above 3 for almost all of the indices, indicating the heavy presence of fat tails, and almost all of the skewness results indicate a strong negative skew. The JPM Global Aggregate Bond index is the only one that has a normal distribution; its skewness and kurtosis are close to zero, indicating potential symmetry.
Since the initial tests proved the non-normality of the majority of asset returns, the analysis was narrowed down to adequately analyse the three asset classes, and two assets from each class were selected. This was to ensure that there was diversity in the results and that they were representative of the general population. Table 1 below summarizes the chosen assets and the descriptions used in the analysis.
The two indices selected to represent the hedge funds are the HFRI Fund of Funds Composite Index (HFRI) and the Dow Jones Credit Suisse Event Driven Hedge Fund Index (DJCS). They each look at a hedge fund where one is a composite whilst the other is specific (Hedge Fund Research Inc. 2017). The HFRI is a hedge fund that invests with multiple managers through funds or managed accounts. There is, therefore, a diversified portfolio, since the idea behind it is to lower risk significantly by gaining the advantage of investing with more than one investment manager.
Equity Index: In selecting the equity indices to use, the authors chose two based on variant geographic locations. The first one is an equity index from Finland called the OMX HELSINKI INDEX (HEX). This is an all-share index which reflects the current status and changes in the stock market. The other index examined is the Botswana Domestic Companies Index (BGSMDC), which tracks companies that are traded in Botswana. The intention behind these choices was to obtain diverse representation: the HEX represents a stock market in the developed world and the BGSMDC one in the developing world, on two different continents, namely Europe and Africa. The classification of developed and developing nations is given according to the World Economic Situation and Prospects (WESP), which classifies all countries of the world into three broad categories, namely developing economies, economies in transition and developed economies.
Bond Index: The Merrill Lynch US High Yield Master II Index (H0A0) is the index that represents the corporate side of bonds because it is normally used as a benchmark index for high yield corporate bonds. This differs from the other bond index, which is the JP Morgan Government Bond Index-Emerging Markets (GBI-EM). This covers comprehensive emerging market debt benchmarks that track local currency bonds issued by emerging market governments.

4.2. Summary Statistics on Asset Class

In the first part of the analysis, an overview of the three asset classes was carried out to test for normality using the Jarque Bera test. This information is then used for these asset classes to estimate, model and forecast volatility, with the Excel add-in aiding the calculations. In this section, the authors give an overview of initial observations, which are further substantiated using other models as the analysis continues.
The first step is the summary in Table 2 below, which was generated in NumXL for the assets under study.

4.3. Standard Deviation

Regarding the equities, the HEX has a higher standard deviation than the BGSMDC, at 7.57% versus 4.29%, showing that it has higher volatility. The hedge funds have a much lower standard deviation than the equities, at 1.65% and 1.80% for the HFRI and the DJCS, respectively. For the bonds, the GBI-EM is lower than the H0A0, since the former is based on government issues that tend to have lower risk, whilst the latter holds corporate bonds, which carry additional risks such as credit and default risk and are thus justifiably riskier. Overall, it can be seen that equities carry higher risks than the other asset classes, making them a riskier asset class characterized by high volatility.

4.4. Skew

The next result on the distribution of the asset returns is from the third central moment, which is skewness. Four out of the six assets have a negative skew. The negative skew supports the assumption that distributions of returns tend to be negatively skewed, and the presence of skewness shows that the distributions are non-normal. However, the government bond index GBI-EM and the Botswana BGSMDC have positive skews of 0.1 and 1.9, respectively. The GBI-EM is normally distributed, and its skew is highly insignificant. For the BGSMDC, the positive skew needs to be investigated further. The asset has a positive mean, and a combination of a positive mean and positive skew implies that investing in this asset not only results in positive expected returns but also positive surprises on the upside. Investments of this kind are highly unstable, usually only in the short term until the market catches on. This result is significant given that this is a stock index in a growing market with high growth potential.

4.5. Excess Kurtosis

The fourth central moment measures the peakedness of a distribution. In Table 2, the NumXL statistics output is reported as excess kurtosis instead of kurtosis, where excess kurtosis is defined as kurtosis minus 3; a normal distribution should ideally have an excess kurtosis of 0. These tests further demonstrate that the returns are not only non-normal and skewed but also possess long tails to the left. The negative skew in the returns implies frequent small gains and a few extreme losses. If the excess kurtosis is above 0, the distribution is leptokurtic, signalling the presence of fat tails in the data set. Fat tails are considered undesirable because they imply additional risk.
In the tests, the returns from the equity and hedge fund assets all have excess kurtosis greater than zero, implying high levels of tail risk. For the bonds, however, a different result is seen. The H0A0 index behaves similarly to the equities and hedge funds, although its kurtosis is unusually high. This is because its returns have one of the widest ranges, from −18% to 11%; the index represents high-yield corporate bonds, and with high reward comes high risk, hence the fat tails. The government bond tracking index, by comparison, has a kurtosis of 0.6 and was concluded to be normally distributed; its kurtosis is close to zero, reflecting the small risk exposure this asset has in comparison to its peers, which all showed non-normality of returns.

4.6. White Noise and ARCH Effect

White noise and the ARCH effect are further explored in the volatility forecasting section. As an overview, the ARCH effect gives details on a distribution's fat tails and excess kurtosis. All the assets with an excess kurtosis greater than 0 are assumed to be serially correlated and to have fat tails, further affirming the non-normality of the returns. White noise is defined as a series with zero mean, constant variance and no serial correlation. When applying this concept to autoregressive and moving average models, these models address departures from the white noise assumption, according to Katchova (2013). Ideally, for the data to be applied to the model, they must first have the white noise characteristic. Its relevance is shown in the section on ACF and PACF.

4.7. VaR Tests

Given that the data sets exhibited non-normal distributions, with the exception of the GBI-EM bond index, the next step was to take an example from each asset class to first calculate the VaR and then compare it with the CVaR. This comparison sought to show how VaR underestimates risk and to determine whether CVaR is a better method. The calculations for all the VaR and CVaR figures were made at the 95%, 99% and 99.9% confidence levels. Below, Figure 1 shows the graphs plotted for the two equity indices under analysis.
Figure 1 illustrates that the conditional VaR has a higher risk measure, which indicates how these equities have a much higher downside risk impact when measuring risk. Thus, for equities, it would be ideal to use the CVaR to capture this very aspect.
The hedge fund results in Figure 2 are similar to those of the equities. However, it can be seen that the hedge fund measurements for both the VaR and CVaR are generally lower than those of the equities, showing that this asset class generally carries lower risk relative to equities.
The results of the analysis of the bonds are depicted in Figure 3. From previous tests, it was noted that the sovereign bond index GBI-EM is normally distributed, and so one would expect that the ideal measure would be the standard VaR, because the distribution is assumed to have little tail risk and its risk forecasts can be based on Gaussian assumptions. The H0A0 index, which is not normally distributed, tends to have thicker tails and thus high tail risk; to cater to the downside risk inherent in the tails, this index would be best suited to the CVaR measure (see Figure 3).
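For illustration, the sketch below computes historical VaR and CVaR at the three confidence levels used in this section for a simulated fat-tailed series; it is not the study's own calculation, but it reproduces the pattern that CVaR gives the larger (more negative) risk measure.

import numpy as np

r = np.random.default_rng(5).standard_t(df=3, size=2500) * 0.01   # simulated fat-tailed returns

for level in (0.95, 0.99, 0.999):
    var = np.quantile(r, 1 - level)    # VaR: lower-tail quantile of the returns
    cvar = r[r <= var].mean()          # CVaR: average loss beyond the VaR threshold
    print(f"{level:.1%}: VaR = {var:.2%}, CVaR = {cvar:.2%}")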

4.8. Stressed VaR versus VaR

In this case, the stressed VaR was calculated at a 99% confidence interval, as this is a regulatory requirement under the Basel III framework for banks and financial institutions. SVaR is calculated by computing the risk measure over a period of financial stress, and these tests were then used as robustness checks. The stressed VaR must cover a period exceeding 12 months, and this was taken into account. The normal period was from January 2003 to December 2005 and the stressed period covered the financial crisis period from August 2007 to December 2009.
The results in Figure 4 above give a visual depiction of what is observed. The first analysis looks at the equities data. Because the data set also included the daily prices, the authors decided to calculate the normal and stressed VaR for both indices using the daily prices as well as the monthly prices to compare and see if there is an impact based on the time used in the calculation.
Firstly, by comparing the VaR and SVaR for both scenarios, it can be observed that the stressed VaR has a higher risk component in comparison to the normal period. The next step was to look at the different time horizons. When using daily returns, both the VaR and SVaR are lower than the values computed from monthly returns. A reason for this could be that daily data capture smaller incremental changes, whereas data observed over larger intervals show higher deviations between the starting price and the next price, leading to larger return movements.
The next step was to look at the hedge fund data observed over a monthly period. This also displays similar results to the equities, showing that the stressed value at risk has a higher risk measure in comparison to the VaR in the normal period.
In this scenario, the authors compared the VaR observed in Figure 5 over the total time (1995–2016) to a normal period. The previous VaR readings were −7% and −12% for the HFRI and DJCS, respectively, whilst the normal time VaR in this section was −1% and −2%, respectively, showing that it is much lower than the 1995–2016 period.
The final step in this analysis was to look at the stressed VaR for the bonds in Figure 6. Firstly, it can be observed that the stressed VaR is higher than the normal-period VaR, similar to the other asset classes' results. To make the analysis more comprehensive, another angle is used to observe the data in Figure 6. For the H0A0, depicted in blue, the SVaR is significantly higher than the VaR. However, for the GBI-EM, which is normally distributed, the readings for the VaR and SVaR are very similar, at −3.57% and −3.08%, respectively, a negligible difference. This shows that in the case of a normal distribution without fat tails, the returns are generally uniform even in stressed periods.

4.9. Volatility Analysis/Time Series Analysis

In this analysis, the aim is to calculate the weighted moving average (WMA) and the exponentially weighted moving average (EWMA), which describes the volatility of the returns. The weighted moving average examines the stationarity of the series; in this case, a 12-month equally weighted moving average is calculated, and the forecast horizon is the current period. By analysing the time series, it is possible to take note of outliers, seasonal or cyclical trends and other patterns, all of which help in the prediction of future volatility in the asset returns. In addition, by looking at the time series, it is easier to identify the impact of the stress periods described earlier in this paper. The WMA is calculated using the formula below:
\mathrm{wma}_t^k = \frac{\sum_{i=1}^{k} x_{t-i}\, w_i}{\sum_{i=1}^{k} w_i}
where:
  • w_i = the weight of the i-th data point in the moving/rolling window;
  • k = the size of the moving/rolling window;
  • x_t = the value of the time series at time t.
The EWMA is also calculated, and it shows the volatility over time of the asset and is calculated as follows:
\sigma_t^2 = \lambda\,\sigma_{t-1}^2 + (1 - \lambda)\, x_{t-1}^2
where:
  • x_t = the value of the time series at time t;
  • λ = the smoothing parameter (i.e., a non-negative constant between 0 and 1).
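The two filters can be reproduced along the following lines; the 12-month window matches the text, while the smoothing value λ = 0.94 and the simulated series are illustrative assumptions.

import numpy as np
import pandas as pd

rng = np.random.default_rng(6)
r = pd.Series(rng.normal(0, 0.04, 264))      # placeholder monthly returns

wma = r.rolling(window=12).mean()            # equal-weighted moving average, w_i = 1/12

lam = 0.94                                   # assumed smoothing parameter
ewma_var = np.zeros(len(r))
ewma_var[0] = r.var()
for t in range(1, len(r)):
    ewma_var[t] = lam * ewma_var[t - 1] + (1 - lam) * r.iloc[t - 1] ** 2
ewma_vol = np.sqrt(ewma_var)                 # sigma_t, the EWMA volatility series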
The HEX equity index in Figure 7 tracks stocks in an EU nation and it can be seen how volatile the returns were during the 2000 bubble and 2008 financial crisis. These markets were directly affected by the US economy situation, as their businesses and markets are interlinked. Generally, the trend, though volatile, is centred on the mean, signifying a stationary time series.
The time series in Figure 8 is different from the other equity returns, as it generally exhibits stable volatility centred on the mean, apart from a few spikes during the 1997 Asian crisis, which could reflect the exposure of Asian investors to this African market. It was not really affected by the tech bubble, implying that most of its investors were probably not in the tech industry. The WMA is generally more stable and smoothed out.
Figure 9 reveals that the HFRI hedge fund has a unique characteristic: it is structured around many fund managers and so tends to be more diversified, and its volatility is therefore centred around the mean and relatively stable. Clustering and large spikes in the returns line appear during the crisis periods.
The DJCS in Figure 10 is also generally stable, since it is a hedge fund and experiences spikes during the stress period as it is based on liquid securities. This means that any change in the financial markets is reflected in the returns. It is noted that the WMA is relatively stable over time.
Figure 11 shows that the returns and the volatility are generally stable and only clustered around the stress period. There is little volatility because of the nature of bond pricing and its general predictability in comparison to equities. The notable high volatility is as a result of the overall financial markets being disrupted during periods of stress.
The GBI-EM in Figure 12 tracks the emerging markets’ government bonds; by virtue of the debt being sovereign, it is generally stable, and the returns and volatility (EWMA) only spike in the stress period. The WMA is generally smooth and centred around the mean.
Overall, with the exception of the GBI-EM index (it seems to show a steady and constant volatility), most of the time series exhibit no constant volatility, which is shown by the separate periods of high volatility and low volatility. This is consistent with heteroskedasticity or conditional heteroskedasticity. This means that it is ideal to use more advanced forecasting financial modelling tools on it because it behaves in a way that is expected of financial markets that tend to exhibit periods of high volatility and low volatility.
In time series analysis, it is necessary to pay attention to the order of the model, and this is where tests such as the serial correlation test are required. The focus here is on serial correlation, and this test is conducted with correlograms. The test is also critical in identifying the model order used in models such as ARMA.
The ACF and PACF are used to detect the form of the time series process. For forecasting models to be applied, the series should be a stationary process, which is a desirable trait. This can be observed from the correlograms below (see Figure 13).
The log returns do not exhibit strong interdependency, though lag order 1 shows marginal significance. To shed more light on this, it is critical to assess whether the time series exhibits white noise, which implies no serial correlation. As shown in Table 2, the white noise test was conducted and it showed that both the HEX and BGSMDC tested false, implying that there is no white noise and so the series is serially correlated.
For the hedge fund plots (Figure 14), a pattern similar to that of the equities shows that the returns do not exhibit strong interdependency: only lag order 1 is significant for the ACF, although the PACF for the HFRI has three significant lags. The statistics summary in Table 2 shows that the white noise test also returned false for both assets, which means that the returns of each asset are serially correlated.
Figure 15 illustrates the bond correlograms. The corporate bond index, H0A0, has one significant ACF lag, and its PACF has four significant lags, namely lags 1, 2, 3 and 13. This result clearly shows that the asset exhibits serial correlation. To affirm this, Table 2 also shows a false result for white noise, further confirming that it is serially correlated.
However, when looking at the GBI-EM index, there are only a few significant lags for the ACF (7 and 13) and PACF (7 and 12). This asset was already affirmed to be normally distributed, and the white noise test returned a true result, implying that the series is not serially correlated.
After looking at the correlation of the returns, it is also necessary to look at the squared values to assess whether there is correlation amongst them. This assists in decisions on the best model to use. Details on the ARCH effect are summarized in Table 2, which shows that for the bonds, the HFRI hedge fund and the HEX equity, the ARCH effect is true, meaning that the squared values are correlated. The opposite holds for the BGSMDC and DJCS, which tested false.
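The white noise and ARCH-effect checks described above can be approximated with standard diagnostics, as in the sketch below (a Ljung-Box test for serial correlation and Engle's LM test for the ARCH effect); the return series is simulated and the 12-lag choice is an assumption.

import numpy as np
from statsmodels.stats.diagnostic import acorr_ljungbox, het_arch

r = np.random.default_rng(7).normal(0, 0.04, 264)      # placeholder return series

lb = acorr_ljungbox(r, lags=[12], return_df=True)      # H0: no serial correlation (white noise)
lm_stat, lm_pvalue, _, _ = het_arch(r, nlags=12)       # H0: no ARCH effect in the squared values

serially_correlated = lb["lb_pvalue"].iloc[0] < 0.05
arch_effect = lm_pvalue < 0.05
print(serially_correlated, arch_effect)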

4.10. Correlation Test—Portfolio

In this test, the authors carried out correlation tests between two time periods for each asset. For the purpose of uniformity, the same parameters applied in the stressed value at risk test are applied here as well.
By assessing Table 3, various pieces of information can be deduced, firstly by looking at the correlation of assets within the same asset class. The equity indices BGSMDC and HEX are positively correlated under normal conditions, moving together with a coefficient of 0.05; during the stress period, this relationship is reversed and the correlation becomes negative at −0.1. This means that under normal market conditions they move together, albeit only weakly, and during stress they move apart.
The next class is the hedge funds, which both have a strong positive correlation that only strengthens during stress periods. In normal conditions, the correlation is 0.85, and it is 0.96 in a stressed scenario. For the bonds, it can be seen that the correlation coefficient is fairly stable and positive at 0.69 in normal conditions and 0.61 in the stressed period.
By making combinations using all assets, it can be seen that the most drastic changes in correlation were observed between the HEX-BGSMDC equity pair and the HEX equity-GBI-EM bond pair, in which the correlations changed sign. However, for both pairs, the initial coefficients were close to zero, so they did not have a strong relationship to start with; even during the stress period, the coefficients remained close to zero, with the BGSMDC-HEX correlation moving from a positive 0.05 to a negative 0.1 and the HEX-GBI-EM correlation moving from a negative 0.01 to a positive 0.19.
Overall, the results from this test are inconclusive, showing that the behaviour of assets does not rely on the class but on how they relate on an individual level with each other. In coming up with a portfolio, one would have to test all the assets, as was carried out in this section, and observe how each asset relates to the other so that the best risk measures can be applied to cater for periods when the normal volatility of the assets shifts in stress periods.
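A sketch of such a correlation check is given below: the same pairwise (Pearson) correlations are computed over a normal window and a stressed window and compared. The DataFrame, its column names and the simulated values are placeholders rather than the study's data.

import numpy as np
import pandas as pd

rng = np.random.default_rng(8)
idx = pd.date_range("1995-01-31", periods=264, freq="M")
data = pd.DataFrame(rng.normal(0, 0.04, (264, 4)),
                    index=idx, columns=["HEX", "BGSMDC", "HFRI", "DJCS"])   # illustrative assets

corr_normal = data.loc["2003-01":"2005-12"].corr()     # Pearson correlation, normal window
corr_stressed = data.loc["2007-08":"2009-12"].corr()   # Pearson correlation, stressed window
print((corr_stressed - corr_normal).round(2))          # shift in co-movement under stress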

4.11. ARMA, GARCH, EGARCH Model Parameter

4.11.1. ARMA Parameters

The authors first used the ARMA model to forecast future risk. A few assumptions made in coming up with the final forecast should be noted. The order used in the simulation was ARMA (1, 1). Using NumXL, the first step was to obtain initial parameters; the authors then calibrated these to find the optimal parameters, all within the program.
μ is the long-run mean, φ1 is the first coefficient of the autoregressive (AR) component (first-order correlation), θ1 is the first coefficient of the moving average (MA) component and, finally, σ is the standard deviation of the residuals (this measure looks at the dispersion of points and measures the accuracy of the variable being investigated).
For all six assets, these parameters were calculated to ascertain the final ARMA forecast over a period of 12 months, and the results are detailed in Table 4 below.
Here, it can be observed that the standard deviation of the residuals is highest for the equities asset class. For the optimal coefficients for the autoregressive process and moving average process, the calibration process selected the ideal figures to input into the best ARMA model for each asset.

4.11.2. GARCH Parameters

In forecasting with GARCH, there were also some model parameters that were calculated and calibrated via the Excel spreadsheet. These are the ones input into the equation to come up with the volatility forecast. µ is the long run mean, α0 is the constant in the conditional volatility equation, α1 is the first coefficient of the ARCH component and β1 is the first coefficient of the GARCH component. All of this is summarized in Table 5 below.
The GARCH model takes the ARMA process a step forward and factors in the conditional volatility element as well as the other GARCH components. The first coefficient of the ARCH component across all assets has a parameter close to zero.

4.11.3. EGARCH Parameters

Forecasting with the EGARCH model of the 1, 1 order has the parameters shown in Table 6 below.
The µ is the long-run mean; α0 is a constant in the conditional volatility equation; α1 is the first coefficient of the ARCH component; γ1 is the first leverage coefficient; and β1 is the first coefficient of the GARCH component. The EGARCH has “the constant volatility embedded in the leverage coefficient which if included in a model of volatility it amplifies the volatility” (Engle and Siriwardane 2014).
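An EGARCH(1,1) with a leverage term can be fitted as in the following sketch with the Python "arch" package (the study itself used NumXL); the simulated percent returns are a stand-in for one of the six assets.

import numpy as np
from arch import arch_model

r = 100 * np.random.default_rng(9).normal(0, 0.05, 264)   # simulated percent returns

egarch = arch_model(r, mean="Constant", vol="EGARCH", p=1, o=1, q=1)
res = egarch.fit(disp="off")
print(res.params)   # mu, omega, alpha[1], gamma[1] (leverage), beta[1]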

4.11.4. ARMA, GARCH, EGARCH Model Forecasts

The models used are intended to capture the autocorrelation of squared returns and the reversion of volatility to the mean. These models take non-normal components, such as excess kurtosis, into account and factor them into the forecasts. The NumXL ARCH/GARCH modelling wizard automated the steps used to construct such a model: guessing initial parameters, parameter validation, goodness-of-fit testing and residuals diagnosis. Using these parameters, the final volatility forecasts were made with the three models for the two equity assets, and Figure 16 shows the 12-month forecasts.

4.11.5. Equities Model Forecast

The first analysis is for the two equity assets, and Figure 16 gives the three forecasts made using all three models: ARMA, GARCH and EGARCH.
First, the HEX equity index and BGSMDC equity index and their 12-month volatility forecast based on the previously mentioned parameters are observed in Figure 16. It can be seen that the EGARCH has the lowest volatility forecast, followed by ARMA, and the highest is the GARCH model. Slight increases are anticipated in the first period, then they slowly taper off to become stable. This indicates that the nearer the forecast, the higher the level of accuracy. That is why the authors limited this to a 12-month forecast, as only current and relevant information affected the volatility forecast.

4.11.6. Hedge Fund Model Forecasts

For the hedge funds, the results are similar to the equities, with the EGARCH recording the lowest volatility relative to the GARCH and ARMA. However, the GARCH forecast is lower than the ARMA forecast for the HFRI (see Figure 17). This could be down to differences in asset characteristics. The GARCH for both has a steady upward increase, whilst the ARMA indicates a steady, almost flat increase for the DJCS, in contrast to the HFRI composite index, which increases in the first period and then tapers off. This shows higher initial anticipated volatility for the composite and lower volatility for the event-driven fund, which had already anticipated the event shocks in its historic data.

4.11.7. Bond Model Forecasts

The bond forecasts have varying results from both the equity and hedge funds. It can be observed that the corporate bonds HOAO forecast has a higher volatility using the ARMA model but is similar to all other assets with the lowest EGARCH forecast. On the other hand, the sovereign GBI_EM bond forecast has higher volatility in the first period, falling sharply after that, with the lowest forecasts for the rest of the periods. In this situation, the GARCH has the highest volatility of all three (See Figure 18).
After carrying out the tests for the three asset classes, the best model needs to be selected. This cannot easily be determined from the forecast tables alone, and since the parameters used for the three models are not directly comparable across assets, the Akaike Information Criterion (AIC) test is considered. The AIC is a method used to measure the goodness of fit of a model; the model with the lowest value is the best one to use. It can therefore be used to determine which of the three methods is ideal in order to obtain the best volatility forecast. Below is a table showing the results of the three tests for the six assets. In general, the AIC is defined as:
AIC = 2k − 2 × ln(L)
where:
  • k is the number of model parameters.
  • ln(L) is the log-likelihood function for the statistical model.
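The model-selection step can be sketched as below: the three candidate models are fitted to one (simulated) return series and the lowest AIC is taken as the preferred model, mirroring the procedure described in the text. ARMA is fitted with statsmodels and the GARCH-family models with the "arch" package; this is an illustrative sketch, not the study's NumXL workflow.

import numpy as np
from arch import arch_model
from statsmodels.tsa.arima.model import ARIMA

r = 100 * np.random.default_rng(10).normal(0, 0.05, 264)   # placeholder percent returns

arma_fit = ARIMA(r, order=(1, 0, 1)).fit()
garch_fit = arch_model(r, vol="GARCH", p=1, q=1).fit(disp="off")
egarch_fit = arch_model(r, vol="EGARCH", p=1, o=1, q=1).fit(disp="off")

aic = {"ARMA": arma_fit.aic, "GARCH": garch_fit.aic, "EGARCH": egarch_fit.aic}
best_model = min(aic, key=aic.get)   # lowest AIC indicates the preferred model
print(aic, best_model)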
Looking at the above results overall, it is clear that the asset classes with the lowest AIC are generally the hedge funds and bonds, whilst equity has the highest. This helps to conclude that these models are most suitable for hedge funds and bonds and less so for equities. However, it must be noted that the AIC figure for the EGARCH of the BGSMDC is unavailable for comparison (see Table 7).
When analysing based on the model type, it can be seen that EGARCH is the most accurate model to use because for hedge funds and bonds, it has the lowest AIC for the three methods per asset. To analyse this further, the next step of assessing the volatility forecast for each asset critiques this discovery and seeks to validate it or otherwise.

5. Discussion

5.1. Non-Normality Test Results

The normality tests indicated that for the hedge funds, all 11 were not normally distributed; for equities, 19 out of 23 assets were not normally distributed, with the exception of the Japanese indices that exhibited normality. When testing the bonds, it was also established that 9 out of the 10 data sets were not normally distributed, with the exception of the JP Morgan Government Bond Index-Emerging Markets (GBI-EM), which covers comprehensive emerging market debt benchmarks that track local currency bonds issued by emerging market governments.
The normality tests also yielded an analysis on various factors, and this was performed on the six selected assets. One was the standard deviation analysis, where it was discovered that hedge funds tend to have a lower standard deviation than equities, implying that equities are riskier than hedge funds. These are just generalized assumptions based on the initial analysis. For bonds, the level of risk differs based on the underlying asset. Corporate bonds have a higher standard deviation than sovereign bonds. The summary on the skewness was that most of the assets are negatively skewed with long tails, showing that they have a high downside risk. The results for the kurtosis showed that most of the distributions are leptokurtic. Another element in the summary was the ARCH effect, which gives details on the distribution’s fat tails and excess kurtosis, and all of this was carried out to further affirm the non-normality of the returns. Once the argument on the type of distribution was resolved, the next step was to assess the risk measures relevant to this distribution.
VaR versus CVaR
After carrying out the analysis, it can be said that the CVaR is an extension of VaR that gives the total amount of loss given a loss event. If one is using VaR, according to Kidd (2012), the main question would be how often their portfolio could lose at least X amount of euros, whilst one using the CVaR method is concerned with answering the question of how much it could lose when its portfolio’s losses exceed X amount of euros.
From the tests undertaken in the analysis, it can be seen that CVaR is a better measure for downside risk than VaR, as it yields higher risk measures and is more sensitive to the shape of the loss tail in non-normal distributions. From an asset class perspective, the equities generally have a higher risk measure than the hedge funds. However, this is merely a generalization, because what drives risk in an asset is not only its class but also what the asset is composed of. When drawing conclusions, it is critical to factor in other items such as the composition of the asset, its liquidity features, the industry, the geographic location and all other risk factors so that one can obtain an even more accurate measurement. This means combining this analysis asset by asset with the information in Figure 3, which describes the global risk landscape according to the World Economic Forum survey (World Economic Forum 2017).

5.2. VaR versus Stressed VaR

The results from the stressed VaR analysis show that when using different time horizons to compute the VaR and SVaR, it was observed that the returns are lower for the daily-observed data in comparison to the monthly observations. This shows that one needs to be aware of the reason for carrying out the tests so that they can use the correct data time horizons to obtain accurate results and not end up understating or overstating the risk measure. If it is for daily use, it would be ideal to use a shorter time to calculate, and if it is to observe an asset over a longer period, then the monthly time horizon could be adequate. All of this is situational, and so other factors should be taken into account together with this information.
Another interesting observation made was when looking at the bonds whereby the one asset was normally distributed and the other was non-normal. It was observed that the normally distributed GBI-EM had VaR and SVaR measures that were close together, unlike what was observed in the other asset classes. This therefore showed that the normal distribution, due to it not having the presence of fat tails, had less risk observed and so the returns were generally uniform, even in stressed time periods. This means that the standard VaR would be a good measure for this, but for all other data that is non-normal, it is critical to isolate the stress period and calculate its risks in comparison to a normal period. This makes SVaR a good measure to use in volatile periods because it captures the market risk inherent at that time. By knowing these measures, it can aid in improving the risk measures, in particular when forecasting data whereby the expectation is a downward one. This will enable the investor to assess if they have taken on excessive risk which may lead to their bankruptcy should another stress period occur. This then can be adjusted to suitable risk levels that do not threaten the loss of the assets.

5.3. Time Series Analysis

The time series analysis is useful in helping to identify the way a security moves over time. Looking at past trends can aid in anticipating the types of risks that affect that particular stock. For instance, one can refer to the World Economic Forum Global Risks Perception Survey 2016 indicated above, which identifies highly probable risks such as cyber-attacks and natural disasters. Using this information, one can look at the asset and its underlying exposure and so determine the likelihood of added or reduced risk. For instance, if an index tracks agriculture companies based in an area with an increasing number of natural disasters affecting the industry, profitability is likely to be affected and lower returns will be forecast. If an investor had calculated a VaR up to a certain point, it could now be adjusted based on anticipated risks identified by analysing the time series.
It is also worth noting that when looking at a time series, it is essential to look at the underlying asset, as it can also give information on how the time series behaves. For instance, the HFRI hedge fund index, which is a fund that is managed by a diverse array of fund managers, is likely to exhibit a time series with a few extreme fluctuations, indicating that there is less volatility, probably resulting from the diversification aspect.
By using this information in conjunction with information obtained from the variations of VaR previously observed, one can decide whether the risk figure must be adjusted to accommodate the added risk based on the historic behaviour of the time series. The time series analysis also has a plot of the WMA; the authors are able to visually determine the stationarity of the time series. The EWMA plot shows the volatility of the asset over time, and it is easy to visually observe the periods of volatility clustering. The EWMA shows the periods with large returns that are clustered and very distinct from those observed periods of small returns, which may also be clustered. Not only does the volatility spike up during a crisis or stress period, but also it eventually drops back to approximately the same level of volatility as before the crisis, which is an indication of stationarity. This is a vital property which affirms the suitability of the use of advanced forecasting models in the time series volatility forecasting. A stationary time series is a time series that is mean reverting, meaning that it moves close to the mean.

5.4. Correlation Test—Single Asset

In time series analysis, the correlogram is one of the tools necessary for identifying the model to use in the volatility forecasts. Correlogram analysis examines the temporal dependency within the sample data and focuses on the empirical auto-covariance, auto-correlation and related statistical tests. It is key to discovering the interdependency of the observations.
Upon carrying out the tests, it can be seen that five of the six assets exhibit serial correlation. These results further affirm the general hypothesis that the returns of several asset classes tend to be serially correlated. For instance, it is commonly expected in hedge funds because they usually hold illiquid assets which can be hard to price and so for the fund managers to determine the prices, they rely on past information to derive the value. This therefore has an impact on the ideal method to use to forecast volatility in the next section.
The analysis after this discovery also looked at the squared values to assess whether there is correlation amongst them. The results showed that, in addition to the correlation in the returns, the ARCH effect is true for both bonds, the HFRI hedge fund and the HEX equity, meaning that their squared values are correlated; the opposite holds for the BGSMDC and DJCS, which recorded false results in the tests. Overall, correlation can be assumed and a GARCH model may be used.

5.5. Correlation Test-Portfolio Context

After carrying out all the tests and looking at the relationships between assets of the same class and across different combinations, it can be concluded that there is no particular pattern in how asset classes interact with each other in normal versus stressed periods. Therefore, for each portfolio, a portfolio manager needs to generate a similar correlation matrix to assess which assets are correlated and how they behave with each other during stress periods and under normal periods. In the example used in the analysis, selecting the two hedge funds would be a good example because, whether in a normal or stressed period, they seem to maintain a correlation that is positive and close in value. This would allow an asset manager to keep the portfolio at the same risk levels if that is the chosen investment approach.
If there were a need to diversify and hedge, the portfolio manager would pick the assets with the lowest correlation or those that move in opposite directions, such as the two equity assets that switch correlations from being negatively correlated to being positively correlated during times of stress. Regardless of what the change in the correlation is, it is essential for the investor to know which direction volatility of the assets is likely to move. This will allow one to anticipate which changes may occur depending on how the assets relate to each other and how it affects the level of risk at a given time or when planning for future risk. Failure to correctly anticipate market risk may overstate the amount of diversification in a portfolio that can lead to an investor taking on excessive risk.

5.6. Volatility Forecast (ARMA, GARCH and EGARCH)

Beg and Anwar (2014) view the arrival of new information as a key driver of volatility in asset returns. According to the authors, the constant release of new information results in financial assets prices showing volatility, which in turn affects the financial risk analysis and risk management strategies. To consider this factor, models such as the Generalized Autoregressive Conditional Heteroscedasticity model can be used to analyse financial data.
Upon carrying out the studies, it can be said that the use of these volatility forecasts (ARMA, GARCH, EGARCH) is useful for analysing an asset's risk as well as market risk under non-normal distributions. This is because these models take various parameters into account, such as the leverage coefficient used in calculating volatility with the EGARCH model, which captures additional risk factors and incorporates them into the model. The model lag orders to consider can also be determined by looking at the ACF and PACF plots, as was carried out. These plots are useful in determining the best lag order to use for each individual asset. In our case, most of the plots showed an ACF with one significant lag followed by decay, and thus the authors applied the order (1, 1). However, in further use of these models, it is critical to look at other elements of the data, such as stationarity, as all of this helps to determine whether the models are suitable for the chosen data set.
In the data analysis section, after the parameters were calculated and defined for all three models, the next step was to plot the 12-month forecasts using each model. It was generally observed that the GARCH model produced the highest volatility of the three for each asset, whilst the EGARCH produced the lowest. The main advantage of GARCH models is that they jointly capture heavy tails and volatility clustering (Jansky and Rippel 2011). This demonstrates a characteristic of the data sets being observed (except for the GBI-EM bond index, which is normally distributed and so its plot was very different from the other two, with a high EGARCH forecast at period one that sharply declined).
Overall, when one examines the literature regarding the use of these models, it can be seen that they are vastly expanding to cater to different elements within the data set. Thus, it can be concluded that when carrying out a risk forecast, any of these models can be applied, but it all depends on the asset characteristics and what risk factors to consider.

5.7. AIC Test for Best Model

The Akaike Information Criterion (AIC) was used to measure the goodness of fit of each of the three models. The model with the lowest value is the best one to use, and in the test, the EGARCH was identified as the best fit because it had the lowest AIC overall for the assets examined (see Table 7).
The results from this test therefore give a guideline on the best method, out of all the methods considered, to forecast risk for these non-normally distributed assets (with the exception of the normally distributed JP Morgan Government Bond Index-Emerging Markets (GBI-EM)). Hedge funds and bonds seemed best suited, with the lowest AIC among the asset classes. When looking at the best method overall, the EGARCH reigns superior, with the lowest AIC of the three methods per asset. Further tests to see which is better between GARCH and EGARCH could be carried out, as suggested by Malmsten (2004), who considers tests of symmetric against asymmetric models. These go beyond the scope of this paper, but it is worth noting that various other tests can be used to determine the validity of model choices apart from the AIC used in this study.
Results summary
VaR has become a popular method for measuring risk and, due to its simplicity, became an overnight sensation; however, its main distributional assumption of normality led to it being discredited, with more investors moving away from it and focusing on its hybrids that take the fat tails into account. Although value at risk has been a measure used in the recent past by most investors when measuring risk exposure, it was inadequate for calculating how much would be lost, and this usually led to some investors underestimating risk measures such that, when a crisis occurred, most of them suffered to the brink of collapse and bankruptcy. This led to the creation of other variations of this risk measure that specifically focus on downside risk, including the conditional value at risk and the stressed value at risk, amongst many others. These were tested, and it was seen that the CVaR gave a higher risk measure than the standard VaR, as it took the existence of heavy tails into account; it is therefore a better measure of risk and would shield the asset holder from overexposure of their asset or portfolio that could lead to bankruptcy. Another measure considered was the stressed value at risk, which showed that volatility differs during stressed periods; measuring it separately gives a better picture of how risky an asset can become in periods of high volatility.
The next step involved analysing the time series, and this was a critical step in the research because it showed how the assets behave and whether any patterns are prevalent in their behaviour over time. Observing a time series gave information on issues such as stationarity, which allows certain econometric models to be used to establish volatility forecasts. After this, a correlation test was performed for each asset to observe whether an asset's returns are correlated with their own past values. This proved to be true, and so, based on these data, it can be concluded that price dependence is a major theme in the distribution of returns, with long periods of price dependence. It was also observed that price jumps exist, all of which is key to obtaining a more realistic risk measure.
The next test was to compare volatility forecasts over the time horizon using the three selected methods, namely ARMA, GARCH and its extension EGARCH. These methods were ideal to use because financial time series returns are skewed and fat tailed, and these models are geared to take this into account. They were able to capture the volatility clustering and took changes in trends into account, all of which was factored into the parameters that the authors calibrated using NumXL in Excel to obtain the best combination of parameter values. A further benefit of these models is that, depending on the characteristics of the data being observed, there are hybrids of each method to accommodate them. GARCH, for instance, has many of these, one of which is EGARCH, which was used here; others include FIGARCH, HYGARCH and so forth. These model extensions are very relevant for the conditional volatility of financial returns.
However, when using these methods, one must be aware of the limits of statistics, as no single method can be relied upon for fully accurate measures. The data sets used in this paper were chosen to offer a broad representation of various assets and market segments, such as an index from a less developed economy, a composite index and a specialized index, but this does not mean that all risks will be captured in asset pricing or in portfolio risk allocation. Risks that are not yet well understood can still drive returns sharply up or down, resulting in price jumps. The ideal approach for an investor is therefore to combine the risk measures with other elements: scenario analysis, which considers macroeconomic variables; stress testing, which probes just how risky an asset can become; and forecasting with risk models. Combining all of these elements gives the best risk measure possible.

6. Conclusions and Future Research

This paper made numerous attempts to elucidate the complexity of measuring risk. To obtain the most accurate risk measures possible, we recommend that researchers incorporate historical data and adopt scenario analysis for a more robust outcome. This calls for observing how assets behave and how they respond to external shocks as well as to normal periods. It also means examining how an asset's individual returns are autocorrelated, which reveals how much impact historical data have on the future pricing of the asset. The findings also confirm several stylized facts about financial markets, such as fat tails in asset return distributions, the absence of significant linear autocorrelation in raw returns, volatility clustering and aggregational normality.
Incorporating non-normality into risk calculations provides a better way of understanding and measuring the downside risk in an asset. The models analysed in this paper take this into account, and the results obtained reflect it. All of the methods explored in the analysis shed light on the many options available for assessing risk under non-normal distributions. One key finding is that when the methods are used together rather than in isolation, they complement each other by addressing one another's shortcomings, giving measures that are as close to accurate as possible. For instance, VaR can be used as the standard measure and then adjusted based on information obtained from the time series. It must be noted, however, that no single risk measurement framework can ensure that portfolios and assets are immune to unexpected risk exposure; a mosaic of methods only brings us close to this.
To summarize the study and the tests conducted: the first step tested the assumption that returns tend to be non-normal, as opposed to the general assumption of normality. The findings clearly showed that the majority of the returns are not normally distributed but are instead skewed and exhibit excess kurtosis. It is therefore important, when assessing risk, to first test the asset's historical data for normality. In some cases a distribution may be found to be normal, as was the case for the bond index GBI-EM, in which case a basic risk measure such as VaR may be adequate to some degree.
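As an illustration of this first step, the following sketch applies the Jarque-Bera test (together with skewness and excess kurtosis) to a synthetic monthly return series using scipy; the series and the 5% significance threshold are assumptions for demonstration rather than the paper's data.

```python
import numpy as np
from scipy import stats

# Synthetic fat-tailed monthly return series standing in for an asset's history
rng = np.random.default_rng(3)
rets = rng.standard_t(df=4, size=252) * 0.03

skew = stats.skew(rets)
excess_kurt = stats.kurtosis(rets)            # Fisher definition: 0 for a normal distribution
jb_stat, jb_p = stats.jarque_bera(rets)

print(f"Skewness: {skew:.3f}, excess kurtosis: {excess_kurt:.3f}")
print(f"Jarque-Bera statistic: {jb_stat:.2f}, p-value: {jb_p:.4g}")
print("Normally distributed (5% level)?", "Y" if jb_p > 0.05 else "N")
```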
Finally, to obtain the most accurate risk measures possible, the best approach is not simply to move away from the use of historical data. Rather, striking a balance yields better risk measures: historical data are used in a time series analysis to assess how the asset behaves and how it responds to shocks and to normal periods. This can also include examining how an asset's individual returns are correlated with each other (autocorrelation) to see how much impact past data have on the future pricing of the asset. Since all of the models discussed have both merits and weaknesses, the risk measures calculated under the non-normal assumptions presented in this paper will come closest to accurately reflecting market conditions.

7. Research Limitations

The ARCH and GARCH model family has evolved considerably since its introduction, such that a multitude of new variants has emerged (Bollerslev et al. 2010). This limited how many variants this study could examine, given the volume of computation required, and makes picking the best model for forecasting the volatility of all types of financial data somewhat cumbersome. Another limitation was the availability of more recent data with which to adequately analyse the impact that COVID-19 has had over the past year and more.
A further limitation in providing universally applicable results was inadequate computing power to dissect all available asset classes; instead, three relatively common ones were selected. Even within these three asset classes, only two indices from each were used for volatility forecasting. For instance, the authors had 60 bond indices available but analysed only two. It is hoped that future studies can extend the tests used here to other asset classes such as foreign exchange, futures and other derivatives.
Future Research
Future research will implement different asset classes as well as newer GARCH models for forecasting risk, in the continued search for the most accurate risk measures. The authors are currently gathering data from the COVID-19 era to further study its impact on risk measures; it would be interesting to assess both the short-term and long-term effects of the pandemic. This will allow a clearer picture of similar future events and of ways to mitigate their negative financial consequences. Once the data are fully available, a follow-up paper will assess this new variable.
Another area the authors wish to investigate is cryptocurrencies, treated as an asset class rather than as a digital currency. It would be interesting to assess their price movements and behaviour, how they affect the performance of other assets, and the types of risk involved in holding them.


Author Contributions

Conceptualization, E.S. and Y.T.M.; methodology, Y.T.M.; software, Y.T.M.; validation, E.S. and Y.T.M.; formal analysis, Y.T.M.; investigation, E.S. and Y.T.M.; resources, E.S. and Y.T.M.; data curation, E.S. and Y.T.M.; writing—original draft preparation, E.S. and Y.T.M.; writing—review and editing, E.S.; visualization, E.S.; supervision, E.S.; project administration, E.S.; funding acquisition, not applicable. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable; the study did not involve humans or animals.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Jarque Bera Test on Hedge Fund Data 1995–2016.
ASSET CLASS: HEDGE FUNDS | PERIOD: 1995–2015 | FREQUENCY: MONTHLY DATA
No. | Index | Skewness | Kurtosis | Jarque Bera Test Stat | p-Value | Normal? (Y/N)
1HFRIMI Index0.55350.83021.0532.68105 × 10−5N
2HFRI Fund of Funds Composite Index−0.69134.362230.299.8651× 10−51N
3HFRI Relative Value (Total) Index−2.773615.9003119.60N
4Dow Jones Credit Suisse Equity Market Neutral Hedge Fund Index−14.5592227.5115787030N
5Dow Jones Credit Suisse Convertible Arbitrage Hedge Fund Index−2.791718.6424165.50N
6Dow Jones Credit Suisse Fixed Income Arbitrage Hedge Fund Index−4.783437.000160650N
7Dow Jones Credit Suisse Event Driven Hedge Fund Index−2.12109.9121278.72.1944 × 10−278N
8HFRI Merger Arbitrage Index−1.42115.903472.212.8841 × 10−103N
9Dow Jones Credit Suisse Event Driven Distressed Hedge Fund Index−2.187512.0241800.80N
10Dow Jones Credit Suisse Dedicated Short Bias Hedge Fund Index0.75341.57652.2844.43371 × 10−12N
11Dow Jones Credit Suisse Long/Short Equity Hedge Fund Index0.00024.015177.293.18098 × 10−39N
Note: N—not normal. Source: authors.

Appendix B

Table A2. Jarque Bera Test on Global Equity Indices Data 1995–2016.
ASSET CLASS: GLOBAL EQUITY INDICES | PERIOD: 1995–2015 | FREQUENCY: MONTHLY DATA
No. | Index | Skewness | Kurtosis | Jarque Bera Test Stat | p-Value | Normal? (Y/N)
1HEX Index0.13342.033745.57590.0000000001N
2OBXP Index−0.95272.7752122.76470.0000000000N
3WIG20 Index0.69144.3217223.05500.0000000000N
4SWIG80 Index0.56291.808849.17230.0000000000N
5BUX Index−0.08293.2649115.77620.0000000000N
6XU100 Index1.39296.6046556.64240.0000000000N
7MOSENEW Index0.52022.205164.40310.0000000000N
8BGSMDC Index2.429412.97252078.85160.0000000000N
9SEMDEX Index0.09093.5010133.14470.0000000000N
10TPX500 Index−0.21490.49624.66780.0969191846Y
11TPXSM Index−0.0077−0.07320.06060.9701371267Y
12TPXM400 Index−0.19090.40653.37010.1854371291Y
13TPX100 Index−0.18520.48764.06200.1312014021Y
14HSI Index−0.02502.214852.96160.0000000000N
15HSCEI Index0.77123.8577186.27290.0000000000N
16SHCOMP Index0.22981.336121.54390.0000209803N
17AS30 Index−0.77680.915535.09200.0000000240N
18AS52 Index11.3502127.9268182,873.05440.0000000000N
19NZSE Index−0.63981.100430.85800.0000001992N
20CSEALL Index0.47011.169524.39180.0000050511N
21SET Index0.17382.643477.00790.0000000000N
22FBMKLCI Index0.61505.8711389.81910.0000000000N
23PCOMP Index0.19034.5279223.67480.0000000000N
Source: authors.

Appendix C

Table A3. Jarque Bera Test on Global Bond Indices Data 1995–2016.
ASSET CLASS: GLOBAL BOND INDICES | PERIOD: 1995–2015 | FREQUENCY: MONTHLY DATA
Index | Skewness | Kurtosis | Jarque Bera Test Stat | p-Value | Normal? (Y/N)
DGOVTRR Index Val LOC U−2.8746010823.2598466314.81160N
PRR Index Val LOC U−3.0017495224.0122976738.95630N
Market Value ex Cash LOC U−2.411374217.2190243517.29050N
EMGBTRR Index Val LOC U−3.1138554926.6877498261.22390N
PRR Index Val LOC U−3.2671475927.6549718882.43880N
Market Value ex Cash LOC U−1.8024626416.4806113130.66620N
H0A0TRR Index Val LOC U−1.250727689.90518861148.07045 × 10−250N
PRR Index Val LOC U−1.3611182110.2472321236.57973 × 10−269N
Market Value ex Cash LOC U−0.627493316.1434245432.483211.22 × 10−94N
R0A0TRR Index Val LOC U−0.481913645.0908619295.304217.51 × 10−65N
PRR Index Val LOC U−0.975061086.9261038569.512792.1 × 10−124N
Market Value ex Cash LOC U1.0327998386.9753815582.149143.9 × 10−127N
R0O0TRR Index Val LOC U−14.8031114232.9907606,773.140N
PRR Index Val LOC U−3.2926032129.0060729731.88870N
R0U0TRR Index Val LOC U0.2636328664.1566215193.110631.17 × 10−42N
PRR Index Val LOC U−0.168589328.2990509758.86731.6 × 10−165N
Market Value ex Cash LOC U1.35060243810.1424141211.81577.2 × 10−264N
R0M0TRR Index Val LOC U0.39937948411.2033831387.69194.6 × 10−302N
PRR Index Val LOC U0.04861577510.7891261280.56158.5 × 10−279N
Market Value ex Cash LOC U0.9172484115.3501174351.880493.89 × 10−77N
R0C0TRR Index Val LOC U−1.6572837227.4766988425.50810N
PRR Index Val LOC U−1.9200259630.73552110553.6010N
Market Value ex Cash LOC U0.997400863.4861319177.455842.92 × 10−39N
R010TRR Index Val LOC U−3.1224636616.6423973475.65350N
PRR Index Val LOC U−3.3178008118.3873094203.36770N
Market Value ex Cash LOC U1.1043723960.263749654.6354631.37 × 10−12N
Source: authors.

References

  1. Acerbi, Carlo, and Dirk Tasche. 2002. On the coherence of expected shortfall. Journal of Banking and Finance 26: 1487–503. [Google Scholar] [CrossRef] [Green Version]
  2. Alexander, Carol, and Elizabeth Sheedy. 2008. Developing stress testing framework based on market risk models. Journal of Banking and Finance 32: 2220–36. [Google Scholar] [CrossRef]
3. Artzner, Philippe, Freddy Delbaen, Jean Marc Eber, and David Heath. 1999. Coherent measures of risk. Mathematical Finance 9: 203–28. [Google Scholar] [CrossRef]
  4. Basel Committee on Banking Supervision. 2009. Principles for Sound Stress Testing Practices and Supervision: January, 2009. Basel: Bank for International Settlements (BIS). [Google Scholar]
  5. Baker, Kent, and Greg Filbeck. 2015. Investment Risk Management, Financial Markets and Investment Series. Oxford: Oxford University Press. [Google Scholar]
6. Bams, Dennis, Gildas Blanchard, and Thorsten Lehnert. 2017. Volatility measures and value at risk. International Journal of Forecasting 33: 848–63. [Google Scholar]
  7. Beg, Alam Rabiul, and Sajid Anwar. 2014. Detecting Volatility Persistence in GARCH Models in the Presence of the Leverage Effect. Quantitative Finance 14: 2205–13. [Google Scholar] [CrossRef]
  8. Berner, Richard. 2012. Stress VaR and systematic risk indicators, private sector application of how to measure systematic interconnectedness. In IMF Conference on Operationalizing Systematic Risk Monitoring. New York: Morgan Stanley, May 28. [Google Scholar]
  9. Bhowmik, Roni, and Shouyang Wang. 2020. Stock market volatility and return analysis: A systematic literature review. Entropy 22: 522. [Google Scholar] [CrossRef]
  10. Billio, Monica, Lorenzo Frattarolo, and Pelizzon Loriana. 2016. Hedge Fund Tail Risk: An Investigation in Stressed Markets. The Journal of Alternative Investments Spring 18: 109–24. [Google Scholar] [CrossRef]
  11. Bodie, Zvi, Alex Kane, and Allan Marcus. 2001. Finance Investments, 5th ed. London: McGraw-Hill. [Google Scholar]
  12. Bollerslev, Tim. 1986. Generalized autoregressive conditional heteroscedasticity. Journal of Econometrics 31: 307–27. [Google Scholar] [CrossRef] [Green Version]
  13. Bollerslev, Tim, Jeffrey Russell, and Mark Watson. 2010. Volatility and Time Series Econometrics. Oxford: Oxford University Press. [Google Scholar]
  14. Box, George Pelham, and Gwilym Jenkins. 1976. Time Series Analysis, Forecasting and Control. San Francisco: Holden Day. [Google Scholar]
15. Brooks, Chris. 2014. Introductory Econometrics for Finance, 3rd ed. Cambridge: Cambridge University Press. [Google Scholar]
  16. Chen, James Ming. 2014. Measuring market risk under Basel accords: VaR, stressed VaR and expected shortfall. The IEB International Journal of Finance 8: 184–201. [Google Scholar]
  17. Christoffersen, Peter. 1996. Evaluating interval forecasts. International Economic Review 39: 841–62. [Google Scholar] [CrossRef]
18. Crouhy, Michel, Dan Galai, and Robert Mark. 2014. The Essentials of Risk Management, 2nd ed. London: McGraw-Hill. [Google Scholar]
19. Iuga, Iulia Cristina, and Anastasia Mihalciuc. 2020. Major Crises of the XXIst century and impact on economic growth. Sustainability 12: 9373. [Google Scholar] [CrossRef]
20. Danielsson, Jon, Bjørn Jorgensen, Gennady Samorodnitsky, Mandira Sarma, and Casper G. De Vries. 2005. Subadditivity Re-Examined: The Case for Value at Risk. New York: Cornell University Operations Research and Industrial Engineering. [Google Scholar]
  21. Dash, Jan. 2012. Stressed Value at Risk. Paper presented at IEEE Conference on Computational Intelligence for Financial Engineering and Economics (CIFERr), Shenzhen, China, May 4–5. [Google Scholar]
  22. Diebold, Francis Xavier, and Glen David Rudebusch. 2012. Yield Curve Modelling and Forecasting. The dynamic Nelson-Siegel approach. Princeton: Princeton University Press. [Google Scholar]
  23. Dixit, Jitenda Kumar, and Vivek Agrawal. 2019. Foresight for stock market volatility: A study in the Indian perspective. Foresight 22: 1–13. [Google Scholar] [CrossRef]
24. Embrechts, Paul, Filip Lindskog, and Alexander McNeil. 2001. Modelling Dependence with Copulas and Applications to Risk Management. September 10. Available online: https://people.math.ethz.ch/~embrecht/ftp/copchapter.pdf (accessed on 11 August 2017).
  25. Engle, Robert F. 1982. Autoregressive conditional heteroskedasticity with estimates of the variance of the United Kingdom inflation. Econometrica 50: 987–1007. [Google Scholar] [CrossRef]
  26. Engle, Robert. 2004. Risk and Volatility: Econometric models and financial practice. American Economic Review 94: 405–20. [Google Scholar] [CrossRef] [Green Version]
  27. Engle, Robert, and Emil Siriwardane. 2014. Structural GARCH: The Volatility-Leverage Connection. Available online: https://www.ecb.europa.eu/events/pdf/conferences/140623/Engle-and-Siriwardane_Structural-GARCH-the-volatility-leverage-connection.pdf?e24ef9f36b79ea29f0bb4f6e5a8fabf3 (accessed on 27 July 2017).
  28. European Banking Authority. 2012. EBA Guidelines on Stressed Value at Risk (Stressed VaR) EBA/GL/2012/2. London: EBA, May 16. [Google Scholar]
29. Fama, Eugene, and Kenneth French. 1992. The cross-section of expected stock returns. Journal of Finance 47: 427–65. [Google Scholar] [CrossRef]
  30. Fishburn, Peter Charles. 1980. Stochastic dominants and moments of distributions. Mathematics of Operations Research 5: 94–100. [Google Scholar] [CrossRef]
  31. Francq, Christian, and Jean-Michel Zakoïan. 2018. Estimation risk for the VaR of portfolios driven by semi-parametric multivariate models. Journal of Econometrics 205: 381–401. [Google Scholar] [CrossRef]
32. Goodfellow, Christiane, and Christian Salm. 2016. Risky Risk Measures: A note on underestimating financial risk under the normal assumption. Copernican Journal of Finance & Accounting 5: 85–108. [Google Scholar]
33. Haas, Marcus. 2001. New methods in backtesting. In Financial Engineering. Bonn: Research Center Caesar. [Google Scholar]
  34. Hansen, Peter Robert, and Asger Lunde. 2005. A forecast comparison of volatility models: Does anything beat a GARCH (1, 1)? Journal of Applied Econometrics 20: 873–89. [Google Scholar] [CrossRef] [Green Version]
  35. Hedge Fund Research Inc. 2017. HFRI Indices–Index Descriptions. Available online: https://www.hedgefundresearch.com/hfri-indices-index-descriptions (accessed on 22 August 2017).
  36. J.P. Morgan Asset Management. 2009. Non-normality of Market Returns, a Framework for Asset Allocation Decision Making. Available online: https://jai.pm-research.com/content/12/3/8 (accessed on 14 June 2021).
  37. Jain, Shrey, and Siddhartha Chakrabarty. 2020. Does Marginal VaR lead to improved performance of managed portfolios? A study of S&P BSE 100 and S&P BSE 200. Asia Pacific Financial Markets 27: 291–323. [Google Scholar] [CrossRef]
  38. Jansky, Ivo, and Milan Rippel. 2011. Value at Risk Forecasting with ARMA-GARCH Family of Models in Times of Increased Volatility. IES Working Paper: 27/2011. Prague: Institute of Economic Studies, Faculty of Social Sciences, Charles University. [Google Scholar]
  39. Jorion, Philippe. 2003. Financial Risk Manager Handbook, 2nd ed. Hoboken: GARP, Wiley & Sons. [Google Scholar]
  40. Kamara, Avraham, Robert Korajczyk, Xiaoxia Lou, and Ronnie Sadka. 2018. Short-horizon beta or long-horizon alpha? Journal of Portfolio Management 45: 96–105. [Google Scholar] [CrossRef]
  41. Katchova, Ani. 2013. Time Series ARIMA Models: Econometrics Academy. Available online: https://sites.google.com/site/econometricsacademy/econometrics-models/time-series-arima-models (accessed on 4 September 2017).
  42. Kidd, D. 2012. Value at Risk and Conditional Value at Risk. Investment and Risk Performance. CFA Institute. Available online: http://www.cfapubs.org/doi/pdf/10.2469/irpn.v2012.n1.6 (accessed on 16 August 2020).
  43. Kilai, Mutua, Anthony Gichuhi Waititu, and Anthony Wanjoya. 2018. Modelling Kenyan foreign exchange risk using asymmetry GARCH models and Extreme Value Theory approaches. International Journal of Data Science and Analysis 4: 38–45. [Google Scholar] [CrossRef] [Green Version]
  44. Krause, Andreas. 2003. Exploring the limitations of Value at Risk: How good is it in practice? Journal of Risk Finance 4: 19–28. [Google Scholar] [CrossRef] [Green Version]
  45. Kumar, Arya, and Saroj Kantar Biswal. 2019. Impulsive clustering and leverage effect of emerging stock market with special reference to Brazil, India, Indonesia and Pakistan. Journal of Advanced Research in Dynamic Control System 11: 33–37. [Google Scholar] [CrossRef]
  46. Lelasi, Federica, Paolo Capelli, and Angeloantonio Russo. 2021. Forecasting volatility by integrating risk with environmental, social and governance risk. Corporate Social Responsibility and Environmental Management 28: 1483–95. [Google Scholar] [CrossRef]
  47. Lichtner, Mark. 2019. How to choose the return model for market risk? Getting towards a right magnitude of stressed VaR. Quantitative Finance 19: 1391–407. [Google Scholar] [CrossRef]
  48. Liu, Qingfeng, Qingsong Yao, and Guoqing Zhao. 2020. Model averaging estimation for conditional volatility models with an application to stock market volatility forecast. Journal of Forecasting 39: 841–63. [Google Scholar] [CrossRef]
  49. Lok, Hsiao Yen. 2015. Different Methods of Back testing VaR and ES. Ph.D. dissertation, Actuarial Research Centre, Heriot Watt University, Edinburgh, UK. Available online: https://www.actuaries.org.uk/system/files/field/document/HY%20Lok%20PARTY%20Jan%202015.pdf (accessed on 15 February 2021).
  50. Lopez, Jose. 2005. Stress tests: Useful complements to financial risk models. FRBSF Economic Letter 2005: 119–24. [Google Scholar]
  51. Loretan, Mico, and William English. 2000a. Evaluating “Correlation Breakdowns” during the Periods of Market Volatility. Basel: Bank of International Settlements (BIS), Available online: http://www.bis.org/publ/confer08k.pdf (accessed on 18 September 2017).
  52. Loretan, Mico, and William English. 2000b. III Special feature: Evaluating changes in correlations during periods of high market volatility: Bank for International Settlements (BIS). Quarterly Review 2000: 29–36. [Google Scholar]
  53. Lovreta, Lidija, and Joaquin López Pascual. 2020. Structural breaks in the interaction between bank and sovereign default risk. SERIEs 2020. [Google Scholar] [CrossRef]
  54. Mabrouk, Samir. 2016. Forecasting daily conditional volatility and h-step-ahead short and long Value-at-Risk accuracy: Evidence from financial data. The Journal of Finance and Data Science 2: 136–51. [Google Scholar] [CrossRef] [Green Version]
  55. Malmsten, Hans. 2004. Evaluating Exponential GARCH Models: SSE/EFI Working Paper Series in Economics and Finance, No. 564. Stockholm: Stockholm School of Economics. [Google Scholar]
  56. Markowitz, Harry. 1952. Portfolio selection. Journal of Finance 7: 71–91. [Google Scholar] [CrossRef]
57. Marrison, Christopher. 2002. The Fundamentals of Risk Measurement. New York: McGraw-Hill. [Google Scholar]
  58. Mina, Jorge. 2002. Measuring bets with Relative Value at Risk. Derivatives Week, Learning Curve 2002: 14–5. [Google Scholar]
59. J.P. Morgan/Reuters. 1996. RiskMetrics Technical Document, 4th ed. New York: Morgan Guaranty Trust Company. Available online: https://my.liuc.it/MatSup/2006/F85555/rmtd.pdf (accessed on 16 July 2021).
  60. Napper, Kerry. 2008. Refining Risk Measures. MLC Investment Management. White Paper—Serial Correlation 1: 2–13. [Google Scholar]
  61. Nelson, Daniel. 1990. ARCH models as diffusion approximations. Journal of Econometrics 45: 7–38. [Google Scholar] [CrossRef]
  62. Nieto, Maria Rosa, and Esther Ruiz. 2016. Frontiers in VaR forecasting and back testing. International Journal of Forecasting 32: 475–501. [Google Scholar] [CrossRef]
63. Orhan, Mehmet, and Bülent Köksal. 2011. A comparison of GARCH models for VaR estimation. Expert Systems with Applications 39: 3582–92. [Google Scholar]
  64. Pflug, Georg Ch. 2000. Some remarks on the Value at Risk and the Conditional Value at Risk. In Probabilistic Constrained Optimization. Nonconvex Optimization and Its Applications 49. Edited by Uryasev Stanislav. Boston: Springer. [Google Scholar] [CrossRef]
  65. Reider, Rob. 2009. Volatility Forecasting 1: GARCH Models: October 2009. Available online: cims.nyu.edu/~almgren/timeseries/Vol_Forecast1.pdf (accessed on 10 June 2017).
  66. Rockafellar, Ralph Tyrell, and Stanislav Uryasev. 2000. Optimization of conditional value at risk. Journal of Risk 2: 21–42. [Google Scholar] [CrossRef] [Green Version]
  67. Sarykalin, Sergey, Gaia Serraino, and Stanislav Uryasev. 2014. Value at Risk vs. Conditional Value at Risk in Risk Management and Optimization. INFORMS Tutorials in Operations Research 2008: 270–94. [Google Scholar] [CrossRef] [Green Version]
68. Sheppard, Kevin. 2013. Financial Econometrics Notes. Oxford: University of Oxford. Available online: http://www.kevinsheppard.com/images/b/bb/Financial_Econometrics_2013-2014.pdf (accessed on 10 June 2017).
  69. Tang, Wang, and Fang Su. 2017. Analysis of the impact of extreme financial events on systematic risk. A case study of China’s banking sector. Economic Research Journal 52: 17–33. [Google Scholar]
  70. Van Dyk, Francois. 2016. Stress Testing 101: The method, recent developments, transparency concerns and some recommendations. The SA Financial Markets Journal. Available online: http://financialmarketsjournal.co.za/oldsite/16thedition/printedarticles/stresstesting.htm (accessed on 7 March 2017).
  71. Wang, Zhouwei, Qicheng Zhao, Min Zhu, and Tao Pang. 2020. Jump Aggregation, Volatility Prediction, and Nonlinear Estimation of Banks’ Sustainability Risk. Sustainability 12: 8849. [Google Scholar] [CrossRef]
  72. World Economic Forum. 2017. The Global Risks Report 2017, 12th ed. Zurich: World Economic Forum. [Google Scholar]
Figure 1. VaR versus CVaR for HEX and BGSMDC Equity Asset Index. Source: authors.
Figure 2. VaR vs. CVaR-HFRI and DJCS hedge fund.
Figure 3. VaR and CVaR for HOAO and GBI-EM Bond Index Asset Class.
Figure 4. VaR versus SVaR equities. Source: authors.
Figure 5. VaR versus SVaR hedge fund index.
Figure 6. VaR versus SVaR bond index. Source: authors.
Figure 7. HEX monthly time series analysis, equity. Source: authors.
Figure 8. BGSMDC monthly time series analysis, equity. Source: authors.
Figure 9. HFRI monthly time series analysis, hedge fund. Source: authors.
Figure 10. DJCS monthly time series analysis, hedge fund. Source: authors.
Figure 11. H0AO monthly time series analysis, bonds. Source: authors.
Figure 12. GBI-EM monthly time series analysis, bond index. Source: authors.
Figure 13. ACF and PACF plots for equities. Source: authors.
Figure 14. ACF and PACF plot for hedge funds. Source: authors.
Figure 15. ACF and PACF plot for bonds. Source: authors.
Figure 16. Equities model forecasts (12 months). Source: authors.
Figure 17. Hedge fund model forecasts (12 months). Source: authors.
Figure 18. Bonds model forecasts (12 months). Source: authors.
Table 1. Summary of asset class representatives.
ASSET CLASS | NAME | ABBREVIATION
Hedge Fund | HFRI Fund of Funds Composite Index | HFRI
Hedge Fund | Dow Jones Credit Suisse Event Driven Hedge Fund Index | DJCS
Equity Index | OMX Helsinki All-Share Index | HEX
Equity Index | The Botswana Domestic Companies Index | BGSMDC
Bond Index | JPMorgan Government Bond Index-Emerging Markets | GBI-EM
Bond Index | The Merrill Lynch US High Yield Master II Index | H0A0
Source: authors.
Table 2. Summary statistics. Source: authors.
EQUITY INDEX | HEX | BGSMDC
Average | 0.60% | 1.35%
Standard Deviation | 7.57% | 4.29%
Skew | −0.31 | 1.9
Excess Kurtosis | 2.11 | 9.6
Test (p-value, significant?) | HEX | BGSMDC
White Noise? | 1.00%, FALSE | 0.00%, FALSE
Normally Distributed? | 0.00%, FALSE | 0.00%, FALSE
ARCH Effect? | 0.00%, TRUE | 64.11%, FALSE
HEDGE FUND INDEX | HFRI | DJCS
Average | 0.44% | 0.68%
Standard Deviation | 1.65% | 1.80%
Skew | −0.85 | −2.32
Excess Kurtosis | 4.7 | 11.65
Test (p-value, significant?) | HFRI | DJCS
White Noise? | 0.00%, FALSE | 0.00%, FALSE
Normally Distributed? | 0.00%, FALSE | 0.00%, FALSE
ARCH Effect? | 0.05%, TRUE | 2.24 × 10−306, FALSE
BOND INDEX | GBI-EM | H0A0
Average | 0.45% | 0.62%
Standard Deviation | 1.64% | 2.57%
Skew | 0.1 | −1.65
Excess Kurtosis | 0.68 | 12.01
Test (p-value, significant?) | GBI-EM | H0A0
White Noise? | 53.18%, TRUE | 3.05 × 10−6, FALSE
Normally Distributed? | 0.00%, TRUE | 0, FALSE
ARCH Effect? | 98.43%, TRUE | 1.77 × 10−15, TRUE
Table 3. Correlation matrix for all assets. Source: authors.
CORRELATION 2003–2005
 | BGSMDC | HEX | HFRI | DJCS | GBI-EM | H0A0
BGSMDC | 1 | | | | |
HEX | 0.05 | 1 | | | |
HFRI | −0.10 | 0.47 | 1 | | |
DJCS | −0.23 | 0.31 | 0.85 | 1 | |
GBI-EM | −0.14 | −0.01 | 0.37 | 0.37 | 1 |
H0A0 | −0.34 | 0.31 | 0.39 | 0.29 | 0.69 | 1
CORRELATION 2007–2009
 | BGSMDC | HEX | HFRI | DJCS | GBI-EM | H0A0
BGSMDC | 1 | | | | |
HEX | −0.10 | 1 | | | |
HFRI | −0.22 | 0.64 | 1 | | |
DJCS | −0.06 | 0.67 | 0.96 | 1 | |
GBI-EM | −0.08 | 0.19 | 0.18 | 0.15 | 1 |
H0A0 | −0.22 | 0.52 | 0.59 | 0.55 | 0.61 | 1
Table 4. ARMA parameters. Source: authors.
ARMA Model Parameters
Param | HFRI (Hedge Fund) | DJCS (Hedge Fund) | HEX (Equity) | BGSMDC (Equity) | GBI-EM (Bonds) | H0A0 (Bonds)
μ | 0.0043 | 0.0068 | 0.006 | 0.0135 | 0.0047 | 0.0088
φ1 | 0.5232 | 0.6602 | −0.1408 | 0.4726 | −0.3639 | −0.0947
θ1 | −0.1931 | −0.3308 | 0.4 | −0.0176 | 0.5243 | 0.4652
σ | 0.0153 | 0.0165 | 0.0731 | 0.0382 | 0.0164 | 0.0241
Table 5. GARCH parameters. Source: authors.
GARCH Model Parameters
Param | HFRI (Hedge Fund) | DJCS (Hedge Fund) | HEX (Equity) | BGSMDC (Equity) | GBI-EM (Bonds) | H0A0 (Bonds)
μ | 0.0043 | 0.0116 | 0.0095 | 0.0125 | 0.0038 | 0.0066
α0 | 0 | 0.0001 | 0.0001 | 0.001 | 0.000 | 0.0001
α1 | 0.1533 | 0.4621 | 0.1643 | 0.6781 | 0.0836 | 0.2496
β1 | 0.8143 | 0.4417 | 0.8181 | 0 | 0.8204 | 0.6705
Table 6. EGARCH parameters. Source: authors.
EGARCH Model Parameters
Param | HFRI (Hedge Fund) | DJCS (Hedge Fund) | HEX (Equity) | BGSMDC (Equity) | GBI-EM (Bonds) | H0A0 (Bonds)
μ | 0 | 0.01 | 0.01 | 0.01 | 0 | 0.01
α0 | −0.45 | −3.58 | −0.53 | −3.36 | −7.9 | −0.55
α1 | 0.28 | 0.15 | 0.35 | −0.48 | 0.23 | 0.1
γ1 | 0.209106 | −3.04519 | −0.10349 | 0.198544 | 1.066853 | −1.99898
β1 | 0.971952 | 0.584361 | 0.952573 | 0.568996 | 0.064752 | 0.941026
Table 7. AIC model goodness of fit test for ARMA, GARCH and EGARCH. Source: authors.
AIC Model Goodness of Fit Test
Model | HFRI (Hedge Fund) | DJCS (Hedge Fund) | HEX (Equity) | BGSMDC (Equity) | GBI-EM (Bonds) | H0A0 (Bonds)
ARMA | −1445.59 | −1404.57 | −614.82 | −953.59 | −1407.94 | −1192.77
GARCH | −1457.21 | −1391.03 | −655.40 | −938.08 | −1404.11 | −1276.66
EGARCH | −1459.71 | −1422.17 | −647.43 | | −1410.05 | −1290.07
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
