Next Article in Journal
A New Simplified Weak Second-Order Scheme for Solving Stochastic Differential Equations with Jumps
Next Article in Special Issue
Smooth Break Detection and De-Trending in Unit Root Testing
Previous Article in Journal
Iterative Variable Selection for High-Dimensional Data: Prediction of Pathological Response in Triple-Negative Breast Cancer
Previous Article in Special Issue
Non-Linear Interdependencies between International Stock Markets: The Polish and Spanish Case
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mean Squared Variance Portfolio: A Mixed-Integer Linear Programming Formulation

by
Francisco Fernández-Navarro
1,*,
Luisa Martínez-Nieto
2,
Mariano Carbonero-Ruz
1 and
Teresa Montero-Romero
2
1
Department of Quantitative Methods, Universidad Loyola Andalucía, 41704 Sevilla, Spain
2
Department of Financial Economics and Accounting, Universidad Loyola Andalucía, 41704 Sevilla, Spain
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(3), 223; https://doi.org/10.3390/math9030223
Submission received: 18 December 2020 / Revised: 19 January 2021 / Accepted: 20 January 2021 / Published: 23 January 2021
(This article belongs to the Special Issue Application of Mathematical Methods in Financial Economics)

Abstract

:
The mean-variance (MV) portfolio is typically formulated as a quadratic programming (QP) problem that linearly combines the conflicting objectives of minimizing the risk and maximizing the expected return through a risk aversion profile parameter. In this formulation, the two objectives are expressed in different units, an issue that could definitely hamper obtaining a more competitive set of portfolio weights. For example, a modification in the scale in which returns are expressed (by one or percent) in the MV portfolio, implies a modification in the solution of the problem. Motivated by this issue, a novel mean squared variance (MSV) portfolio is proposed in this paper. The associated optimization problem of the proposed strategy is very similar to the Markowitz optimization, with the exception of the portfolio mean, which is presented in squared form in our formulation. The resulting portfolio model is a non-convex QP problem, which has been reformulated as a mixed-integer linear programming (MILP) problem. The reformulation of the initial non-convex QP problem into an MILP allows for future researchers and practitioners to obtain the global solution of the problem via the use of current state-of-the-art MILP solvers. Additionally, a novel purely data-driven method for determining the optimal value of the hyper-parameter that is associated with the MV and MSV approaches is also proposed in this paper. The MSV portfolio has been empirically tested on eight portfolio time series problems with three different estimation windows (composing a total of 24 datasets), showing very competitive performance in most of the problems.

1. Introduction

Markowitz (1952) [1,2] proposed the well-known mean-variance (MV) portfolio model under the assumption that a rational investor aims at maximizing returns and minimizing risks. Mathematically speaking, the MV portfolio framework is a bi-objective optimization problem, where an efficient frontier is composed by all combination assets that are not dominated by any other in expected return and risk simultaneously. The two goals that are associated with the framework can be addressed separately as two independent optimization problems giving rise to the global maximum return (GMR) [3] and global minimum variance (GMV) [4,5,6] portfolios.
The MV portfolio has been widely implemented in the scientific community and accepted by professionals [7,8,9]. The main advantages of the approach are its ease of use, since the theory presents the concept of return and risk in a in a straightforward fashion and the ease in which the optimization problem is formulated. Unfortunately, the MV portfolio has yet to be proven competitive in out-of-sample validation [10,11,12]. Thus, the main limitations of the theory are as follows:
  • High concentration: MV-inspired portfolios are highly concentrated on a few securities with the "best: features [13,14]. Assets with either high expected returns or low expected variance will be overweighted, in this way losing the power of diversification that the theory is supposed to ensure [15].
  • Instability: MV portfolios tend to drastically re-allocate resources when the asset features change slightly, regardless of transaction costs or data inaccuracy [16,17]. This mainly occurs because MV portfolios do not take estimation inaccuracy into account and concentrate on assets with "good" features.
  • Sensitivity to input errors: because the MV portfolio gives excessive weight to assets with high expected returns, the resulting portfolios are highly sensitive to errors in the input data [18,19,20].
The optimization problem that is associated with the MV portfolio can be formulated in several ways, depending on how the risk aversion parameter is incorporated into the equations. The main formulations for the optimization problem can be summarized, as follows:
  • Maximize the expected return for a specified risk: the first possible formulation includes the maximization of the portfolio mean in the objective function of the problem and the maximum level of risk that an investor is able to assume as a constraint of the problem.
  • Minimize the risk for a pre-determined expected return: the second alternative tries to minimize the risk and introduces the minimum level of the mean return as a constraint of the optimization.
  • Minimize the risk and maximize the expected return combining both of the objectives through a user-defined risk aversion parameter.
The above problems could have linear or non-linear constraints, equality, and inequality constraints. Furthermore, the three described optimization problems are mathematically equivalent and the solutions are called mean-variance (MV) efficient. Hence, the efficient points in the return-risk plot are called the efficient frontier.
One of the potential limitations of the third formulation of the problem is that it combines two objectives that are measured in different units. In fact, the variance is a second-order moment statistic and the mean is a first-order moment one. Consequently, a modification in the scale in which returns are expressed (both by one or percent, for example), differently affects them, accordingly changing the solution of the problem. Furthermore, the most widely used performance measure in the quantitative finance literature to compare portfolio strategies is the Sharpe ratio, which computes the quotient between the return of the portfolio with respect to the standard deviation of the portfolio’s excess return [21,22,23]. As can be seen, the Sharpe ratio assesses the two objectives in the same unit level, unlike the third formulation of the optimization problem, which evaluates the objectives in different units [24].
Motivated by this fact, a novel portfolio strategy, which is denoted as mean squared variance (MSV), which calculates the two objectives of the minimization function of the problem in the same unit is proposed in this paper. Specifically, the mean return of the portfolio is expressed in squared form in the proposed strategy, aiming to provide a straightforward manner of comparing the two objectives in the same unit. The resulting portfolio model is a non-convex quadratic programming (QP) problem, which has been reformulated as a mixed-integer linear programming (MILP) problem to reach, in this way, the global solution of the problem.
The reformulation of the baseline QP problem as an MILP is done via the adaptation of the QP problem as a linear complementary problem, the use of binary variables and big-M constraints to model the complementary constraints. The estimation of the upper bounds on the dual variables was borrowed from the work of [25] for the case of a standard QP problem. The reformulation of the initial non-convex QP problem into an MILP has the advantage of ensuring that the global solution is obtained via the use of current state-of-the-art MILP solvers.
Finally, an innovative purely data-driven method for determining the optimal value of the hyper-parameter that is associated with the MV and MSV strategies (the risk aversion parameter, λ ) is also proposed in this paper. The hyper-parameter optimization is based on a Bayesian approach for the global optimization of λ [26]. The approach will allow the manuscript authors to show that the proposed formulation of the MV problem helps the global optimization procedure achieve a more competitive set of portfolio weights (in terms of the Sharpe ratio).
The rest of the paper is organized, as follows. Section 2 describes the foundations of the proposed mean squared variance (MSV) strategy and its reformulation as a mixed-integer linear programming (MILP) optimization problem. Section 3 details the experimental framework that was developed for the empirical comparison of the state-of-the-art portfolio strategies implemented. Section 4 reports the statistical results obtained by the different strategies and Section 5 presents the conclusions that were reached in the study.

2. The Proposed Method

The aim of this section is to describe the mathematical formulation (Section 2.1) and foundations (Section 2.2) of the proposed method, named the mean squared variance (MSV) portfolio, as well as to provide the necessary details to reformulate the model as a mixed-integer linear programming (MILP) optimization problem (Section 2.3).

2.1. Mathematical Formulation of the Model

The proposed method determines, for a portfolio consisting of N assets ( n = 1 , , N ), the optimal weights of the portfolio’s value invested in each asset, w 1 , , w N 0 ( n = 1 N w n = 1 ), while using, as inputs of the problem, the expected returns ( μ n ), risks ( σ n ) and covariances between assets ( σ n m ). As previously mentioned, the associated optimization problem of the model is very similar to the Markowitz optimization, except in the part of the portfolio mean, which is presented in squared form in our formulation. Thus, the MSV portfolio is defined as:
min w 1 , , w N λ n , m = 1 N w n w m σ n m ( 1 λ ) n = 1 N w n μ n 2 . s . t . n = 1 N w n = 1 . w 1 , , w N 0 ,
where the term n = 1 N w n μ n is the portfolio mean, n , m = 1 N w n w m σ n m , ( σ n n = σ n 2 ) is the portfolio variance and λ [ 0 , 1 ] is a hyper-parameter of the problem that weights the relative importance of the risk with respect to the mean squared return. The model can also be formulated in matrix form as follows:
min w R N λ w T Σ w ( 1 λ ) w T μ μ T w . s . t . 1 T w = 1 . w 0 ,
where w = ( w 1 w N ) T R N is the vector with the optimal weights of the portfolio, Σ R N × N is the covariance matrix of the asset’s returns, μ = ( μ 1 μ N ) T R N is the vector with the expected returns, and 0 and 1 are N-dimensional vectors with zeros and ones in all the rows, respectively. For ease of reference, the proposed MSV portfolio is expressed as:
min w R N 1 2 w T H w . s . t . 1 T w = 1 . w 0 ,
with H = λ Σ ( 1 λ ) R R N × N , and R = μ μ T and the constant 1 / 2 has been incorporated in the objective function in order to facilitate its derivation.

2.2. Main Foundations of the Model

In this section, the optimal portfolios of the MSV and MV models are analytically compared. The GMV and GMR portfolios are also included in the study for reference. The four optimization problems share the same feasible region and, therefore, for their analysis, it will suffice to compare their objective functions:
F MSV ( w | λ ) = λ w T Σ w ( 1 λ ) w T μ μ T w . F MV ( w | λ ) = λ w T Σ w ( 1 λ ) w T μ . F GMV ( w ) = w T Σ w . F GMR ( w ) = w T μ .
It is straightforward to realize that, when only the expected return is taken into account ( λ = 0 ), the MSV, MV, and GMR models provide the same optimal solution. Additionally, when only risk is taken into account ( λ = 1 ), MSV, MV, and GMV achieve the same portfolio. The analysis in intermediate situations, when both return and risk ( 0 < λ < 1 ) are considered, is not so immediate. This is pursued assuming that 0 < w T μ < 1 and, therefore, the expected return for any portfolio is positive and lower that 100%. Under these conditions, w T μ μ T w = ( w T μ ) 2 < w T μ , so F MV ( w | λ ) F MSV ( w | λ ) .
In any case, Section 4 shows how the mathematical formulation of the MSV problem helps the model to achieve significantly better out-of-sample Sharpe ratios than the ones that were reported by the traditional MV method.

2.3. Mixed-Integer Linear Programming Reformulation

The H matrix in the QP problem that is associated to the model (see Equation (3)) is the difference between two positively weighted matrices ( Σ and R ). In this context, positivity of H can not be assured and the QP problem associated may be a non-convex QP problem. Due to this fact, the authors detail below how the model could be reformulated as an MILP optimization problem. The reformulation will allow for us to yield the global optimum in each portfolio problem using the current state-of-the-art MILP solvers.
The first step in the reformulation is the introduction of the Lagrange multipliers κ R for the equality constraint and δ R N for the non-negativity constraints (no short sales restrictions). The KKT conditions for the problem after introducing those multipliers are given by:
H w + 1 κ δ = 0 . w T δ = 0 . 1 T w = 1 . w , δ 0 .
In this way, the KKT points of the optimization problem are denoted as:
Ω KKT = { ( w , κ , δ ) R 2 N + 1 : ( w , κ , δ )   satisfy   Equations   in   ( 4 ) }
In the proposed portfolio problem, the KKT conditions are first-order necessary conditions for the optimal solutions of the system, since the feasible set of QP is a polyhedron. Because of that, the KKT conditions can be incorporated into the optimization problem as redundant constraints, in this way obtaining the following equivalent formulation of the problem:
min w R N ( 1 / 2 ) w T H w . s . t . H w + 1 κ δ = 0 . w T δ = 0 . 1 T w = 1 . w , δ 0 .
Next, the KKT conditions are used in order to linearize the objective function, as suggested in [27]. First, the equality constraint H w = δ 1 κ is used to express the objective function as:
( 1 / 2 ) w T H w = ( 1 / 2 ) w T ( δ 1 κ ) = ( 1 / 2 ) w T δ ( 1 / 2 ) w T 1 κ ,
and, second, when considering that w T δ = 0 and 1 T w = 1 , the final objective function, in linear form, is defined as ( 1 / 2 ) κ , equivalent to κ .
As a result of the previously mentioned transformations, the optimization problem (3) is equivalent to the following problem with a linear (instead of quadratic) objective:
min w R N κ . s . t . H w + 1 κ δ = 0 . w T δ = 0 . 1 T w = 1 . w , δ 0 .
and, while taking into account that the δ and w variables are complementary to each other (complementary constraint), the problem can be formulated as:
min w R N κ . s . t . H w + 1 κ δ = 0 N . 1 T w = 1 . 0 w δ 0 ,
where w δ denotes perpendicularity between vectors w and δ , and it impedes the optimization problem as a linear problem. Note that problem (9) is equivalent to 2 N linear programs obtained by making each possible assignment for the complementary variables: either δ n = 0 or w n = 0 for each n = 1 , , N ; hence, it is not possible for the optimization problem to have a finite optimal solution that is not attained. Therefore, problem (9) can be reformulated as an MILP while assuming that the upper bounds for w and δ exist and they are defined as vectors u , v R N with entries u n and v n :
min w R N κ . s . t . H w + 1 κ δ = 0 . 1 T w = 1 . 0 w n z n u n . 0 δ n ( 1 z n ) v n . z n { 0 , 1 } ,
for n = 1 , , N . The obvious drawback of this formulation is that in order to find u n and v n , the upper bounds of w n and δ n should be known a priori. The upper bound, u R N , for the primal variables, w R N , is u = 1 and it can be computed (while assuming that the feasible set of QP is non-empty and bounded) by setting:
u n = max { w n : 1 T w = 1 , w n 0 } ,
for every n = 1 , , N . The calculation of the upper bounds, v R N , on the dual variables, δ R N , was borrowed from the research work of [25], which sets those bounds (for a standard quadratic program) as:
v s 1 ,
with
s = 2 N ( | | H | | , + | | f | | ) ,
where | | f | | = 0 in the proposed formulation as f represents the vector that is associated with the linear part of the optimization problem (non-existent in the MSV strategy).

3. Experimental Framework

The aim of this section is to fully define the datasets and methods used for comparison purposes, the performance metrics, and the hypothesis tests that were implemented in the present empirical study.

3.1. Out-Of-Sample Empirical Validation and Portfolio Problems Selected

The empirical validation implemented is inspired by the rolling window out-of-sample validation method [28]. Therefore, given a time series with T months of securities returns, an estimation window length, M, is selected and, in each month t, starting from t = M + 1 , the data in the previous M months are employed to compute the optimal weights that are associated with each strategy. The return in month t is computed while using the previously computed weights.
The optimization process is repeated Q times by adding, in each iteration, the return of the next period in the time series and removing from the estimation window the earliest return. In this way, the outcome of this empirical validation, for each strategy and time series tested, is a set of Q out-of-sample returns, r 1 , , r Q . In the experimental framework, the estimation window length, M, is set to 60, 120, and 240, as recommended in [10], M { 60 , 120 , 240 } . Additionally, the validation process is repeated 36 times, Q = 36 .
Table 1 details the properties of the selected time series portfolio problems, for each portfolio problem, its ID, name (Dataset), the estimation window length (M), the number of total months considered (T) and the number of assets (N). The datasets considered were obtained from Kenneth French’s web site at Dartmouth University (http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/). It is important to mention that these portfolio problems have been widely used in the community for validating novel portfolio strategies [10,29,30]. The date of the last monthly excess returns considered for all the problems is 1 August 2020.
Each dataset contained monthly returns in excess of the one-month Treasury bill rate. All portfolios are constructed on June of each year with NYSE, AMEX, and NASDAQ stocks (real-world portfolio datasets). The industry portfolio holds one portfolio of each of five or ten different groups of sectors that formed by the SIC code (consumer durable and non-durables, manufacturing, oil and gas, business equipment, transmission, retail, healthcare, utilities, and others, etc.).

3.2. Strategies Implemented

The proposed strategy (MSV) is compared, in the experimental study, to the baseline MV strategy and its corresponding parts: the GMR and GMV strategies. The proposed strategy, MSV, and the well-known MV both include an additional hyper-parameter, λ , which defines the importance of the variance term with respect to the mean. The range of values of the λ hyper-parameter has been set to λ [ 0 , 1 ] .

3.3. Performance Measures

In the experimentation, two well-known performance measures are computed for each strategy in each time series portfolio problem. Specifically:
  • The out-of-sample mean returns (MR):
    MR = ( 1 / Q ) q = 1 Q r q .
  • The out-of-sample Sharpe ratio (SR), defined as the sample mean of out-of-sample excess returns, MR, divided by their corresponding sample standard deviation:
    SR = MR ( 1 / Q ) q = 1 Q ( r q MR ) 2 .

3.4. Hyper-Parameter Optimization

In this research, the hyper-parameters of each strategy have been globally optimized while using the data available in the estimation window part in order to adapt the model to the behavior of the market in each dataset. Thus, the last year of the estimation window data was used in an internal validation process to evaluate the performance, in terms of out-of-sample Sharpe ratio, which is associated with the specific value of the hyper-parameter being explored, whereas the previous years were employed to compute the optimal weights for this λ value.
The hyper-parameter optimization was implemented through a Bayesian process. Bayesian optimization is an algorithm employed in global optimization to minimize a certain objective function, treated as a black box, by varying the value of its independent variables. The algorithm itself relies on an internal Gaussian process that approximates the objective function and it is trained by subsequent evaluations of the true objective. The approximated model is used for optimization to reduce computational costs and for its robust nature in addressing stochastic noise in function evaluations.

3.5. Statistical Hypothesis Testing

Statistical hypothesis testing was used in order to provide statistical support in the evaluation of the reported results. It is important to clarify that a statistical analysis through parametric tests could lead to mistaken conclusions in this study, since a previous evaluation of the MR and SR values that were provided by the strategies implemented resulted in rejecting the normality and equality of the variance hypothesis. Furthermore, as noted by Demšar [31], the independence condition is not truly verified in a rolling window out-of-sample validation. For these reasons, two non-parametric Friedman tests (with the ranking of MR and SR of the strategies as the test variables) were carried out to assess the statistical significance of the rank differences. Consequently, a non-parametric Holm post hoc test was used to ascertain which strategies were distinctive among the multiple comparisons performed [31].

4. Results

In this section, the different experimental analyses that were carried out with the proposed portfolio model are detailed. In particular, the goals of the section are as follows:
  • to compare the out-of-sample performance of the MSV portfolio with the performance provided by state-of-the-art MV-based strategies (Section 4.1); and,
  • to analyse the diversification levels produced by the proposed MSV portfolio and the MV portfolio in problems with different dimensions (Section 4.2).

4.1. Performance Analysis

In this subsection, the performance results of the MSV strategy are compared to those that are provided by the standard MV strategy and its corresponding individual components (the GMV and GMR strategies). Table 2 shows the MR and SR out-of-sample performance results for each time series and the implemented strategies. For the strategies requiring an internal hyper-parameter optimization (such as the MV or MSV), the validation process is repeated 10 times in order to evaluate the robustness of the stochastic optimization procedure (Section 3.4) and, consequently, the performance reported for those strategies in each dataset is the mean and standard deviation of the MR and SR performance achieved in the 10 repetitions considered for the time series being analyzed (Table 2). Based on the reported MR and SR for each technique and dataset, the ranking of each strategy in each dataset and measure (R = 1 for the best performing strategy and R = 4 for the worst one) is obtained. In addition, Table 2 presents the mean MR and SR over the 24 datasets that were considered for each strategy ( M R ¯ and S R ¯ ) and their corresponding mean rankings ( R ¯ M R and R ¯ S R ).
From a purely descriptive point of view, it can be concluded that the MSV strategy obtained the best results in 12 datasets and the second-best results in nine problems when the MR measure is considered. With regard to the SR measure, the MSV portfolio achieved the best results in 10 time series and the second-best results in 12 datasets. Furthermore, the MSV portfolio yields the best mean ( M R ¯ = 0.8731 , S R ¯ = 0.1607 ) and ranking ( R ¯ M R = 1.7500 , R ¯ S R = 1.7083 ) in MR and SR, respectively. The second-best mean ranking is obtained by the baseline MV method in the two performance measures ( R ¯ M R = 2.4792 , R ¯ S R = 2.5417 ). The third most competitive method appears to be the GMR strategy that achieved the second-best mean MR ( M R ¯ = 0.8426 ) and the third-best mean MR and SR ranking ( R ¯ M R = 2.5208 , R ¯ S R = 2.7292 ). Finally, the worst performing portfolio was the GMV one, which reported the worst mean MR and SR ( M R ¯ = 0.6934 , S R ¯ = 0.1369 ) and worst mean MR and SR ranking ( R ¯ M R = 3.2500 , R ¯ S R = 3.0208 ).
The competitive performance of the proposed portfolio can also be graphically seen in Figure 1, in which the cumulative monthly returns of the four implemented strategies during the out-of-sample validation process for the five Industry Portfolio problems ( M = 60 , ID = 1) are reported. The MSV strategy seems to be an appealing approach for portfolio optimization, in performance terms, as can be seen in Figure 1. Another aspect that is important to stress is that the MSV portfolio is less sensitive to the length of the estimation windows than its comparison strategies, reporting competitive results for different values of M (unlike, for example, the GMR strategy that usually only achieves competitive results for small values of M). In a similar fashion, the MSV strategy also achieves competitive results for different values of N (problems with a different number of assets).
In order to determine the statistical significance of the rank differences reported for each strategy in the different problems, a non-parametric Friedman test [32] with the ranking of M R and S R of the techniques that were implemented as the test variables has been carried out. The test shows that the effect of the strategy used for portfolio optimization is statistically significant at a significance level of 5%, as the confidence interval is C 0 = ( 0 , F 0.05 = 2.7374 ) and the F-distribution statistical values are F = 6.6840 C 0 for MR and F = 5.4109 C 0 for SR. Consequently, the null hypothesis stating that all strategies perform equally in mean ranking is rejected.
Based on this rejection, the non-parametric Holm test was implemented to compare all of the strategies to the proposed MSV method (which was used as the control method). Table 3 shows the results of the Holm test for the MR and SR performance measures. From the results of this test, it can be concluded that MSV obtains a significantly better MR ranking than all of the remaining strategies for α = 0.10 and statistically outperforms the GMV strategy for α = 0.05 . While using the Sharpe ratio, SR, as the test variable, the MSV portfolio achieves significantly better results than the comparison methods for α = 0.10 and α = 0.05 .
As can be seen, the MSV model achieve significantly better out-of-sample results (in MR and SR) that its MV counterpart. Thus, it is assumed that the current formulation of the model helps in the tuning of the risk aversion profile parameter and, consequently, it helps in obtaining a final competitive out-of-sample performance.

4.2. Diversification Analysis

An additional experimental study was performed in order to graphically illustrate the level of portfolio concentration of the MSV portfolio in contrast to the MV portfolio [4,13]. In this empirical study, the diversification levels of the MSV and MV portfolios in different types of problems were compared according to their w T w values, a mathematical proxy for assessing the degree of concentration in the portfolio weights [12,33,34,35]. Thus, Figure 2 shows the boxplots of diversification of the optimal weights of the MV and MSV portfolios for λ { 0.2 , 0.4 , 0.6 , 0.8 } in problems with different numbers of assets ( N { 7 , 8 , 9 , 10 } ). For each value of N, the boxplot of each strategy is computed according to 30 randomly generated portfolio problems with the same number of assets and λ values. In this specific experiment, the randomly generated expected returns are within the range [ 0.5 , 1.0 ] .
The MSV tends to produce more concentrated portfolios than its MV counterpart when the expected return of the portfolio is within the range [ 0.5 , 1.0 ] , as can be seen in Figure 2. In cases where the randomly generated expected returns are within the range [ 0.0 , 0.5 ] , the results (with respect to the diversification level) are just the opposite. As expected, the difference in concentrations is reduced when λ is close to one, since both of the portfolios converge to the GMV portfolio when λ = 1 .

5. Conclusions

In this paper, an alternative to the traditional mean-variance (MV) strategy, named the mean squared variance (MSV) portfolio, is proposed. The novel proposed portfolio model overcomes the original limitation of the traditional model that expresses the two conflicting objectives in different units by including the expected return term in squared form. The proposed portfolio model is presented as a non-convex QP problem, which has been reformulated as a mixed-integer linear programming (MILP) problem to be globally solved by state-of-the-art MILP solvers.
Besides, a novel data-driven approach that is based on a Bayesian approach to estimate the optimal value of the hyper-parameter associated with the MV and MSV strategies is also proposed in this paper. The aim is to show that the formulation of the MSV problem helps the global optimization procedure achieve a more competitive set of portfolio weights (in terms of the Sharpe ratio).
The proposed portfolio model was empirically tested on 8 portfolio problems with three different estimation windows (24 datasets were considered), reporting in most of the problems a very competitive performance of both the mean return and Sharpe ratio. In this research work, it is assumed that the current formulation of the model helps the tuning of the risk aversion profile parameter and, consequently, it helps in obtaining a final competitive performance.

Author Contributions

Conceptualization, F.F.-N. and M.C.-R.; Methodology, F.F.-N. and M.C.-R.; Software, F.F.-N.; Validation, F.F.-N.; Formal analysis, F.F.-N. and L.M.-N.; Data curation, L.M.-N. and T.M.-R.; Writing—original draft preparation, F.F.-N.; Writing—review and editing, L.M.-N., T.M.-R. and M.C.-R. All authors have read and agreed to the published version of the manuscript.

Funding

The research work of F.F.N. is funded by the Spanish Ministry of Science under Project ENE2017-88889-C2-1-R.

Institutional Review Board Statement

Not applicable

Informed Consent Statement

Not applicable

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BMBook-to-market
GMRGlobal maximum return
GMVGlobal minimum variance
IInvestment
MILPMixed-integer linear programming
MRMean return
MSVMean squared variance
MVMean-variance
OPOperating profitability
QPQuadratic programming
SRSharpe ratio

References

  1. Markowitz, H. Portfolio selection. J. Financ. 1952, 7, 77–91. [Google Scholar]
  2. Markowitz, H. Mean–variance approximations to expected utility. Eur. J. Oper. Res. 2014, 234, 346–355. [Google Scholar] [CrossRef]
  3. Zhou, R.; Palomar, D.P. Understanding the Quintile Portfolio. IEEE Trans. Signal Process. 2020, 68, 4030–4040. [Google Scholar] [CrossRef]
  4. Coqueret, G. Diversified minimum-variance portfolios. Ann. Financ. 2015, 11, 221–241. [Google Scholar] [CrossRef]
  5. Maillet, B.; Tokpavi, S.; Vaucher, B. Global minimum variance portfolio optimisation under some model risk: A robust regression-based approach. Eur. J. Oper. Res. 2015, 244, 289–299. [Google Scholar] [CrossRef]
  6. Bodnar, T.; Parolya, N.; Schmid, W. Estimation of the global minimum variance portfolio in high dimensions. Eur. J. Oper. Res. 2018, 266, 371–390. [Google Scholar] [CrossRef] [Green Version]
  7. Lim, A.E.; Zhou, X.Y. Mean-variance portfolio selection with random parameters in a complete market. Math. Oper. Res. 2002, 27, 101–120. [Google Scholar] [CrossRef] [Green Version]
  8. Yin, G.; Zhou, X.Y. Markowitz’s mean-variance portfolio selection with regime switching: From discrete-time models to their continuous-time limits. IEEE Trans. Autom. Control 2004, 49, 349–360. [Google Scholar] [CrossRef]
  9. Usta, I.; Kantar, Y.M. Mean-variance-skewness-entropy measures: A multi-objective approach for portfolio selection. Entropy 2011, 13, 117–133. [Google Scholar] [CrossRef]
  10. DeMiguel, V.; Garlappi, L.; Uppal, R. Optimal versus naive diversification: How inefficient is the 1/N portfolio strategy? Rev. Financ. Stud. 2009, 22, 1915–1953. [Google Scholar] [CrossRef] [Green Version]
  11. Tu, J.; Zhou, G. Markowitz meets Talmud: A combination of sophisticated and naive diversification strategies. J. Financ. Econ. 2011, 99, 204–215. [Google Scholar] [CrossRef] [Green Version]
  12. Li, H.; Huang, Q.; Wu, B. Improving the naive diversification: An enhanced indexation approach. Financ. Res. Lett. 2020, 101661. [Google Scholar] [CrossRef]
  13. Schmidt, A.B. Managing portfolio diversity within the mean variance theory. Ann. Oper. Res. 2019, 282, 315–329. [Google Scholar] [CrossRef]
  14. Cai, H.; Schmidt, A.B. Comparing mean–variance portfolios and equal-weight portfolios for major US equity indexes. J. Asset Manag. 2020, 21, 326–332. [Google Scholar] [CrossRef]
  15. Bird, R.; Tippett, M. Note—naive diversification and portfolio risk—A note. Manag. Sci. 1986, 32, 244–251. [Google Scholar] [CrossRef] [Green Version]
  16. Schreiner, J. Portfolio revision: A turnover-constrained approach. Financ. Manag. 1980, 9, 67–75. [Google Scholar] [CrossRef]
  17. Kourtis, A. A Stability Approach to Mean-Variance Optimization. Financ. Rev. 2015, 50, 301–330. [Google Scholar] [CrossRef]
  18. Best, M.J.; Grauer, R.R. On the sensitivity of mean-variance-efficient portfolios to changes in asset means: Some analytical and computational results. Rev. Financ. Stud. 1991, 4, 315–342. [Google Scholar] [CrossRef] [Green Version]
  19. Chopra, V.K.; Ziemba, W.T. The effect of errors in means, variances, and covariances on optimal portfolio choice. In Handbook of the Fundamentals of Financial Decision Making: Part I; World Scientific: London, UK, 2013; pp. 365–373. [Google Scholar]
  20. Palczewski, A.; Palczewski, J. Theoretical and empirical estimates of mean–variance portfolio sensitivity. Eur. J. Oper. Res. 2014, 234, 402–410. [Google Scholar] [CrossRef] [Green Version]
  21. Sharpe, W.F. Capital asset prices: A theory of market equilibrium under conditions of risk. J. Financ. 1964, 19, 425–442. [Google Scholar]
  22. Sharpe, W.F. Portfolio Theory and Capital Markets; McGraw-Hill College: New York, NY, USA, 1970. [Google Scholar]
  23. Bailey, D.H.; Lopez de Prado, M. The Sharpe ratio efficient frontier. J. Risk 2012, 15, 3–44. [Google Scholar] [CrossRef] [Green Version]
  24. Sharpe, W.F. The sharpe ratio. J. Portf. Manag. 1994, 21, 49–58. [Google Scholar] [CrossRef]
  25. Xia, W.; Vera, J.C.; Zuluaga, L.F. Globally solving nonconvex quadratic programs via linear integer programming techniques. INFORMS J. Comput. 2020, 32, 40–56. [Google Scholar] [CrossRef]
  26. Mockus, J. Bayesian Approach to Global Optimization: Theory and Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; Volume 37. [Google Scholar]
  27. Giannessi, F.; Tomasin, E. Nonconvex quadratic programs, linear complementarity problems, and integer linear programs. In Proceedings of the IFIP Technical Conference on Optimization Techniques, Rome, Italy, 7–11 May 1973; Springer: Berlin/Heidelberg, Germany, 1973; pp. 437–449. [Google Scholar]
  28. Bergmeir, C.; Benítez, J.M. On the use of cross-validation for time series predictor evaluation. Inf. Sci. 2012, 191, 192–213. [Google Scholar] [CrossRef]
  29. Black, A.J.; McMillan, D.G. Non-linear predictability of value and growth stocks and economic activity. J. Bus. Financ. Account. 2004, 31, 439–474. [Google Scholar] [CrossRef]
  30. Swinkels, L.; Tjong-A-Tjoe, L. Can mutual funds time investment styles? J. Asset Manag. 2007, 8, 123–132. [Google Scholar] [CrossRef]
  31. Demšar, J. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 2006, 7, 1–30. [Google Scholar]
  32. Friedman, M. A comparison of alternative tests of significance for the problem of m rankings. Ann. Math. Stat. 1940, 11, 86–92. [Google Scholar] [CrossRef]
  33. Bender, J.; Briand, R.; Nielsen, F.; Stefek, D. Portfolio of risk premia: A new approach to diversification. J. Portf. Manag. 2010, 36, 17–25. [Google Scholar] [CrossRef]
  34. Liesiö, J.; Xu, P.; Kuosmanen, T. Portfolio Diversification based on Stochastic Dominance under Incomplete Probability Information. Eur. J. Oper. Res. 2020, 286, 755–768. [Google Scholar] [CrossRef]
  35. Cesarone, F.; Scozzari, A.; Tardella, F. An optimization–diversification approach to portfolio selection. J. Glob. Optim. 2020, 76, 245–265. [Google Scholar] [CrossRef]
Figure 1. Cumulative monthly returns plot of the four portfolios implemented from 1 September 2017–1 August 2020.
Figure 1. Cumulative monthly returns plot of the four portfolios implemented from 1 September 2017–1 August 2020.
Mathematics 09 00223 g001
Figure 2. The boxplots of diversification for the MV and MSV strategies for N { 7 , 8 , 9 , 10 } and λ { 0.2 , 0.4 , 0.6 , 0.8 } .
Figure 2. The boxplots of diversification for the MV and MSV strategies for N { 7 , 8 , 9 , 10 } and λ { 0.2 , 0.4 , 0.6 , 0.8 } .
Mathematics 09 00223 g002
Table 1. Characteristics of the benchmark datasets.
Table 1. Characteristics of the benchmark datasets.
ID.DatasetMTN
Industry Portfolios
{ 1 , 2 , 3 } 5 Industry Portfolios { 60 , 120 , 240 } { 96 , 156 , 276 } 5
{ 4 , 5 , 6 } 10 Industry Portfolios { 60 , 120 , 240 } { 96 , 156 , 276 } 10
Emerging Market Factors
{ 7 , 8 , 9 } 6 Emerging Market Portfolios Formed on BM and OP { 60 , 120 , 240 } { 96 , 156 , 276 } 6
{ 10 , 11 , 12 } 6 Emerging Market Portfolios Formed on Size and BM { 60 , 120 , 240 } { 96 , 156 , 276 } 6
{ 13 , 14 , 15 } 6 Emerging Market Portfolios Formed on Size and OP { 60 , 120 , 240 } { 96 , 156 , 276 } 6
Bivariate sorts on Size, BM and I
{ 16 , 17 , 18 } 6 Portfolios Formed on Size and BM { 60 , 120 , 240 } { 96 , 156 , 276 } 6
{ 19 , 20 , 21 } 6 Portfolios Formed on Size and I { 60 , 120 , 240 } { 96 , 156 , 276 } 6
Bivariate sorts on Size, BM and I
{ 22 , 23 , 24 } 25 European Portfolios Formed on Size and BM { 60 , 120 , 240 } { 96 , 156 , 276 } 25
BM: Book-to-Market; OP: Operating Profitability; I: Investment.
Table 2. Statistical results of the strategies implemented: the mean returns (MR) and Sharpe ratio (SR) out-of-sample performance of each strategy and dataset (MR and SR), mean MR ( M R ¯ ), mean MR ranking ( R ¯ M R ), mean SR ( S R ¯ ) and mean SR ranking ( R ¯ S R ). The datasets are denoted by their IDs. The best results are in bold face and second-best results in italics.
Table 2. Statistical results of the strategies implemented: the mean returns (MR) and Sharpe ratio (SR) out-of-sample performance of each strategy and dataset (MR and SR), mean MR ( M R ¯ ), mean MR ranking ( R ¯ M R ), mean SR ( S R ¯ ) and mean SR ranking ( R ¯ S R ). The datasets are denoted by their IDs. The best results are in bold face and second-best results in italics.
MR MetricSR Metric
IDGMVGMRMVMSVGMVGMRMVMSVID
10.9521 1.6197 1.1717 0.0065 1.8161 0.0000 0.1330 0.2243 0.1646 0.0015 0.2339 0.0000 1
21.4004 1.8981 1.8977 0.0003 1.8981 0.0000 0.2825 0.3408 0.3391 0.0008 0.3408 0.0000 2
31.30980.9897 1.3146 0.0013 1.3478 0.0034 0.26980.1748 0.2705 0.0008 0.2775 0.0021 3
40.6058 2.1011 1.4175 0.0008 1.2533 0.0338 0.1411 0.3493 0.2426 0.0005 0.2965 0.0098 4
50.6324 1.9033 1.2299 0.0012 1.1355 0.0004 0.1541 0.3077 0.2174 0.0013 0.2062 0.0003 5
60.7234 1.2039 0.8252 0.0021 1.0470 0.0866 0.17520.2079 0.2093 0.0007 0.2084 0.0124 6
70.54040.5456 0.5477 0.0008 0.5577 0.0013 0.1016 0.0960 0.1016 0.0003 0.1028 0.0005 7
8 0.5473 0.4494 0.5342 0.0008 0.4494 0.0000 0.1027 0.0791 0.0959 0.0002 0.0791 0.0000 8
9 0.5118 −0.1094 0.1042 0.0118 0.0021 0.0089 0.0960 −0.0175 0.0166 0.0019 0.0021 0.0017 9
100.18980.6411 0.6524 0.0419 0.6848 0.0321 0.03240.1277 0.1295 0.0072 0.1351 0.0057 10
110.4162 0.6039 0.5962 0.0051 0.6005 0.0041 0.0742 0.1268 0.1249 0.0010 0.1270 0.0007 11
120.3150 0.1772 0.1772 0.0000 0.1775 0.0012 0.0561 0.0306 0.0306 0.0000 0.0309 0.0002 12
130.1505 0.4075 0.1259 0.0031 0.2530 0.0810 0.0262 0.0747 0.0220 0.0010 0.0456 0.0266 13
140.2519 0.3078 0.2679 0.0014 0.2756 0.0294 0.0444 0.0614 0.0474 0.0008 0.0515 0.0091 14
150.1601 0.3181 0.1640 0.0321 0.1919 0.0705 0.0283 0.0561 0.0291 0.0087 0.0339 0.0124 15
161.1048 1.7043 1.5803 0.1945 1.7876 0.0000 0.2017 0.3252 0.2992 0.0413 0.3288 0.0000 16
171.41311.4092 1.4394 0.0187 1.4656 0.0021 0.25340.2600 0.2682 0.0022 0.2639 0.0007 17
181.05820.1845 1.1500 0.0018 1.2829 0.3904 0.2032 0.0276 0.2013 0.0004 0.2280 0.0085 18
191.02941.4941 1.4982 0.1002 1.5056 0.0196 0.19480.2271 0.2445 0.0106 0.2621 0.0023 19
201.0063 1.0665 1.0790 0.0008 1.0067 0.0009 0.19080.1721 0.1930 0.0001 0.1912 0.0003 20
21 0.7662 0.0917 0.7658 0.0009 0.7664 0.0007 0.14880.0134 0.1497 0.0002 0.1490 0.0002 21
220.43910.5806 0.5843 0.0091 0.5823 0.0064 0.1011 0.1019 0.0468 0.0015 0.1048 0.0023 22
230.71580.8431 0.8613 0.0148 0.8641 0.0217 0.1801 0.1530 0.1548 0.0027 0.1577 0.0031 23
24 0.4016 −0.2092 0.1046 0.0113 0.0032 0.0283 0.0953 −0.0347 0.0324 0.0023 0.0008 0.0009 24
M R ¯ 0.6934 0.8426 0.8196 0.8731 0.13690.1452 0.1472 0.1607 S R ¯
R ¯ M R 3.25002.5208 2.4792 1.7500 3.02082.7292 2.5417 1.7083 R ¯ S R
Table 3. Statistical results for the Holm test for α = 0.10 and α = 0.05 using the mean squared variance (MSV) strategy as the control method: mean MR and SR ranking of the strategies implemented ( R ¯ M R and R ¯ S R ), z-statistics and p-values of the Holm tests for the MR and SR analysis and adjusted α Holm values ( α 0.10 and α 0.05 ). The best results are in bold face and second-best results in italics.
Table 3. Statistical results for the Holm test for α = 0.10 and α = 0.05 using the mean squared variance (MSV) strategy as the control method: mean MR and SR ranking of the strategies implemented ( R ¯ M R and R ¯ S R ), z-statistics and p-values of the Holm tests for the MR and SR analysis and adjusted α Holm values ( α 0.10 and α 0.05 ). The best results are in bold face and second-best results in italics.
MR Analysis
Method R ¯ M R z-statisticp-value α 0.10 α 0.05
GMV , 3.25004.02491 × 10 4 0.03330.0167
GMR 2.52082.06830.03860.05000.0250
MV 2.4792 1.95660.05040.10000.0500
MSV 1.7500 ----
SR Analysis
Method R ¯ S R z-statisticp-value α 0.10 α 0.05
GMV , 3.02083.52184 × 10 4 0.03330.0167
GMR , 2.72922.73940.00620.05000.0250
MV , 2.5417 2.23620.02530.10000.0500
MSV 1.7083 ----
•: Statistical difference with α = 0.10; ∘: Statistical difference with α = 0.05.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fernández-Navarro, F.; Martínez-Nieto, L.; Carbonero-Ruz, M.; Montero-Romero, T. Mean Squared Variance Portfolio: A Mixed-Integer Linear Programming Formulation. Mathematics 2021, 9, 223. https://doi.org/10.3390/math9030223

AMA Style

Fernández-Navarro F, Martínez-Nieto L, Carbonero-Ruz M, Montero-Romero T. Mean Squared Variance Portfolio: A Mixed-Integer Linear Programming Formulation. Mathematics. 2021; 9(3):223. https://doi.org/10.3390/math9030223

Chicago/Turabian Style

Fernández-Navarro, Francisco, Luisa Martínez-Nieto, Mariano Carbonero-Ruz, and Teresa Montero-Romero. 2021. "Mean Squared Variance Portfolio: A Mixed-Integer Linear Programming Formulation" Mathematics 9, no. 3: 223. https://doi.org/10.3390/math9030223

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop