1. Introduction
In a broad sense, “risk” is understood as a possible danger of any unfavorable outcome. It is inherent in all spheres of human and social activity. In economics, risk is usually interpreted as the probability of a person or an organization losing part of its resources, income shortfall, or additional costs because of implementation of certain production and financial policies [
1].
Changes in the stability of the economic situation and the external and internal environment of any organization’s activity should be assessed in a timely manner to avoid these adverse outcomes. To do this, it is necessary to be able to quantitatively assess the risk of their occurrence and form recommendations to reduce it [
2].
The solution to these issues is constantly becoming more complicated due to the increasing efficiency of functioning of the created systems, with a simultaneous tendency to their complication, and the growth of various challenges in the economy and society. Since the occurrence of an unfavorable outcome is explicitly or implicitly reduced to financial losses, the problem of risk analysis is most acute in the economy. It is necessary to build adequate models and effective methods to solve the problems of risk monitoring [
3,
4,
5].
In the framework of monitoring, the system state is assessed, controlled, and managed depending on the impact of certain factors. Without preventive and protective measures, over time the system has a growing conditional probability of an unfavorable event: , where A is an unfavorable event and is the state of the system at the k-th moment of time.
In economics, risk analysis is often associated with certain difficulties. These include the presence of several risk factors, their interconnectedness (correlation), non-stationarity of processes, and small data samples. Therefore, risk models in the economy should consider these features.
The aim of the research is to propose and test on real data a model of multidimensional risk that satisfies the main features of processes in the economy.
2. Materials and Methods
Let us consider a model of multidimensional risk [
6,
7]. We have a set of risk factors in the form of a random vector with correlated components
, which here we understand as the main socio-economic indicators affecting the safety of the system.
In this case, under the violation of security, we understand the decrease in the level of socio-economic development of the system. We believe that these risk factors can be correlated (dependent) and their simultaneous manifestation is possible. In fact, we represent the state of the system under study as a vector in the Euclidean space . Instead of identifying specific hazardous situations, we will define geometric regions of unfavorable outcomes. These areas may look arbitrary depending on the specific problem and are determined based on available a priori information. In economics, usually undesirable events are deviations of risk factor values from some acceptable areas characterizing the safe state of the system under study. Therefore, we will use the concept of undesirable events as large and unlikely deviations of a random variable from the best values in terms of safety.
To apply the multifactor risk model, it is necessary to define the boundaries of acceptable values (threshold levels) of risk factors, according to which a considerable safe state of the system is calculated. Threshold levels can interpreted as values of risk factors; the output of at least one of the risk factors beyond them leads to the emergence of unfavorable outcomes in the form of violation of economic security. Since risk is in a certain sense a conditional category, its dynamics—improvement or deterioration of the economic situation and the impact of risk factors on it—are important in the first place.
Determination of threshold levels of indicators is difficult due to the large volume of data. In this regard, the values of thresholds are determined based on the analysis of the median and standard deviation for the period under study, which made it possible to consider the dispersion of values of indicators relative to the average and to determine the values, which will signal the manifestation of a crisis.
We define the area of safe values for each risk factor as ; , are the left and right boundaries of acceptable values, limiting the area of favorable outcomes, determined on the basis of expert assessments, . Accordingly, the areas of unfavorable outcomes are complements of sets , i.e., .
If some boundaries are one-sided, then , when only the right boundary is given, and , when only the left boundary is given. Let us assign the numbers of all risk factors to three sets of indices: (the factors have both limits of permissible values); (the factors have only the left limit of permissible values); J(+) (the factors have only the right limit of permissible values).
Then we define the conditional safest state of the system
. In the case of bilateral boundaries of acceptable values
; for unilateral boundaries, the values are set based on available historical data of the risk factors and further (a priori) information provided by e.g., experts [
7].
The region of unfavorable outcomes
D for all risk factors consists of two sets
. The set
forms the situations when at least one of the risk factors
was in the region of unfavorable outcomes
, i.e.,
. However, situations of large deviations also can occur when all factors
Xj are in the admissible area
. To account for such situations, we introduce a set:
where
and
.
As risk, we will understand the probability of an unfavorable outcome
D:
From (1), we see that the situations
and
. We will interpret the value
P(
D) as the probability of an unfavorable outcome occurring at the next point in time (in our case, in the next year). The estimation of the probability of entering the risk area
P(
D) is based on the Monte Carlo statistical testing method [
8,
9].
First, we consider the random vector to be Gaussian: (1) the use of the normal distribution law relies on the central limit theorem; (2) the analyzed sample is small, which does not allow us to estimate the distribution law of the random vector.
Second, from the available data sample
we define the mean vector
and the covariance matrix
. The matrix
contains the results of observations of
risk factors
for
time periods. These completely describe the distribution law
of the data according to following formula:
Third, we repeatedly generate new observations
with the spreading law
. This procedure is described in [
7]. As a result, the probability estimate
will be equal to the frequency
, where
is the total number of generated observations
,
is the number of outcomes when the generated observation
.
Let us estimate the contribution to the total risk of its components
Xj. Absolute and relative change in the probability of unfavorable outcome of the multidimensional system due to the addition of factor
Xj we determine by the following formulas:
where
,
is the area of unfavorable outcomes
D after exclusion of risk factor
Xj.
The contribution of the correlation between the indicators
Xj to the overall risk is calculated as
where all components
of the random vector
are mutually independent and have the same distributions as
Xj.
For risk analysis, it may also be useful to estimate the univariate risk for each factor as .
The disadvantage of the model described above is that it is static and does not consider the non-stationarity of risk factors.
3. Results
The behavior of risk factors can be represented as a set of mutually dependent time series. Risk analysis in the economy using this model is often complicated by non-stationarity (presence of trends) of indicators.
The trend in the risk factor
Xj leads to the fact that it consists of the sum of two independent components
, where
is the trend, which is understood as some continuous quasi-deterministic component from time, and
is a random component [
10,
11]. Then the variance of the risk factor will be equal to the sum of two variances
. As a result, when estimating risk by formula (1) in the procedure of generating observations
, the dispersions of risk factors
with trends will be overestimated by the values
. This leads to significant distortions of covariance matrices and, as a result, to overestimation of risk.
We will consider the trends of risk factors as follows. Since for operational risk monitoring the number of observations Δ in each analyzed interval should be small (about 5–15 observations), it is sufficient to consider two models—linear and parabolic trends. At each analyzed interval we determine the model parameters and risk as follows.
For each
j-th risk factor
Xj, we estimate the conditional mathematical expectation as a parabolic regression equation from time
and check the statistical significance of the coefficient
. If it is statistically significant, then further on this interval we consider the risk factor
as a trend
. Otherwise, instead of (2) we consider the paired linear regression
and check the statistical significance of the coefficient
bk;j. If it is significant, then further on this interval, we consider the risk factor
as a trend
. Otherwise, we return to the static variant
describing the risk factor as
.
Next, based on models (2)–(4), we calculate the estimates of the risk , errors and error covariance matrix . After that, we estimate at any i-th moment of the interval the probability of an unfavorable outcome P(D).
Note that the most accurate regression estimates correspond to the middle of the range of values of the explanatory variable, so it is desirable to take the time moment i for risk assessment in the middle of the interval. If the problem of risk forecasting is solved, the right boundary of the interval can be increased by considering the error covariance matrix unchanged for the forecast periods, and instead of the actual values of risk factors we should use their forecasts according to the regression Equations (2)–(4).
Consideration on model data has shown that the dynamic variant of the model of multidimensional risk considers the presence of trends in risk factors and allows us to estimate risk correctly. For example,
Figure 1 shows the results of risk assessment for two correlated risk factors with trends. The dynamic case considered the presence of trends in risk factors, while the static case did not.
The dynamic model of multivariate risk showed almost identical estimation results with the model in the absence of trends (
Figure 1a). Some decrease in risk with the increase in the volume Δ of the sample analyzed shows the convergence of sample estimates to the theoretical result at
. As can be seen from
Figure 1b, in the presence of trends in risk factors, the static variant of the risk analysis model leads to a shift in
estimates towards overestimation of the probability of an unfavorable outcome: at
.
4. Discussion
Let us further monitor the risk of deterioration of the socio-economic condition of the Russian Federation according to annual data. The risk factors are given in
Table 1. The information source is the database of the Federal State Statistics Service for the period from 2000 to 2023. Unilateral limits of acceptable values (threshold levels) of indicators are given in
Table 2. The lower thresholds are set for the factors whose increasing values indicate a positive trend, otherwise the upper thresholds are set. The rationale for the risk factors used and their threshold levels is given in [
7].
The risks were calculated on the basis of multivariate static and dynamic risk models described above. The statistical significance of in (5) and in (6) was checked using Student’s t-criterion. Due to the small size of the analyzed samples, the significance level λ = 0.1 was used.
The results of monitoring the decline in the level of socio-economic development of Russia according to annual data for static (Δ = 10) and dynamic (Δ = 8) variants of risk assessment are shown in
Figure 2 and
Figure 3. The dynamic variant allowed the use of a smaller data sample size.
According to the obtained results, the most significant economic shocks of the Russian Federation, noted by many authors, are clearly traceable. Thus, the crisis of 1998, caused by default on foreign debts, led to instability of the economic situation up to 2006. The global financial crisis of 2008 led to a deep recession of the Russian economy in 2009 and to a drop in industrial production () and a sharp increase in unemployment (). Unlike dynamic estimation, the static one, due to the specifics of its implementation, is more tied to Δ intervals, so the 2008 crisis overlaps with the next surge of economic instability and represents a single period of fluctuation in the probability of unfavorable outcome from 2006 to 2019.
Figure 3 shows two spikes—in 2013 and 2017. The difficult geopolitical situation in 2013, the fall in oil prices, and the lingering effects of the 2008 crisis led to economic stagnation. Despite the absence of a strong economic downturn this year, the signs of the crisis manifested themselves through falling household incomes (
) and rising unemployment (
). The 2014 sanctions hardly led to an increase in risks, which can be explained by the state’s use of reserves to buy socio-economic problems. In 2017, the country was still in a state of prolonged economic crisis caused by the long-term effects of sanctions, falling oil prices and the ruble exchange rate, as well as rising inflation. The COVID-19 pandemic led to a sharp decline in economic activity in 2020, affecting industrial output (
) and consolidated budget revenues (
). The impact of increased sanctions pressure in 2022 on the selected risk factors was not revealed up to 2023 (end of dataset).
5. Conclusions
The model of multidimensional risk oriented on the peculiarities of processes in the economy is proposed. They usually include the presence of several correlated risk factors that can occur simultaneously, non-stationarity of processes, and small data samples. The proposed model is based on the representation of the investigated economic system or phenomenon in the form of multidimensional non-stationary processes, which at each moment of time are considered as Gaussian random vectors. In the article, the probability of an unfavorable outcome—a decrease in the level of socio-economic development—is understood as the risk value. This model is supplemented with a dynamic variant, in which risk factors are locally described in the form of parabolic or linear trends. This allowed us to expand the scope of application of this approach.
The monitoring of the risk of decline in the level of socio-economic development of Russia on an annual basis from 2000 to 2023 was carried out. The results of monitoring showed that the proposed multivariate risk model was generally sensitive to all the most significant economic shocks and adequately responded to them. The model was tested on synthetic data simulating the presence of trends in risk factors.
It can be noted that the dynamic variant of risk monitoring is more sensitive to short-term negative changes in indicators. Less dependence on the interval Δ allows the association of the risk with the actual values of a particular period, facilitating the interpretation and perception of the results of the analysis. The static variant, in its turn, is less sensitive, but allows an assessment of the situation under conditions of a large number of economic shocks in a short period.
In the future, we plan to continue the research in the direction of studying the influence of individual risk factors and their correlation on the reduction in the level of socio-economic development.