Model and Data-Driven System Portfolio Selection Based on Value and Risk

Wei, Hechuan; Xia, Boyuan; Yang, Zhiwei; Zhou, Zhexuan

doi:10.3390/app9081657


by Hechuan Wei ¹, Boyuan Xia ², Zhiwei Yang ²,* and Zhexuan Zhou ²

¹ College of Information Science and Engineering, Northeastern University, Shenyang 110004, China

² College of Systems Engineering, National University of Defense Technology, Changsha 410073, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2019, 9(8), 1657; https://doi.org/10.3390/app9081657

Submission received: 15 February 2019 / Revised: 17 April 2019 / Accepted: 18 April 2019 / Published: 22 April 2019

(This article belongs to the Special Issue Applied Sciences Based on and Related to Computer and Control)



Featured Application

This work has the potential to be applied to weapon system portfolio selection and to other project portfolio selection problems in the context of research and development.

Abstract

System portfolio selection is a trade-off analysis and decision-making process that treats multiple systems as a whole in order to achieve overall performance from the perspective of a System of Systems (SoS). To avoid the subjectivity of traditional models that depend on expert experience, a model and data-driven approach to system portfolio selection is proposed. Two criteria, value and risk, are used to indicate the quality of system portfolios. A capability gap model is employed to determine the value of system portfolios, with the weight information determined by correlation analysis. The risk is represented by the remaining useful life (RUL), which is predicted by analyzing time series of system operational data. Based on the value and risk, an optimization model is then proposed. Finally, a case with 100 candidate systems is studied under an anti-missile scenario. By utilizing the Non-dominated Sorting Differential Evolution (NSDE) algorithm, a Pareto set with 200 individuals is obtained. Some characteristics of the Pareto set are analyzed by discussing the selection frequency of systems and the association rules. The case study indicates that the proposed model and data-driven approach is feasible and effective for system portfolio selection.

Keywords:

model and data-driven; system portfolio selection; value and risk; capability gap; remaining useful life; genetic algorithm

1. Introduction

Joint operations have become the main trend of modern warfare. The construction of a “system of systems (SoS)” is not only a goal but also a basic guideline for long-term weapon and equipment development. System portfolio selection is a widely used concept in weapon SoS construction, where a key step is evaluation [1]. Traditional evaluation models rely heavily on subjective judgment, which makes assessment results inaccurate and unconvincing to some extent. With the rise of data science, making decisions according to real data has become an effective way to compensate for the low accuracy and the implementation difficulty of relying on expert experience. Therefore, combining data-driven methods with model-based approaches is a new trend in solving system portfolio selection problems.

Markowitz first proposed portfolio theory in 1952, opening a new era of applying mathematical approaches to resource allocation problems [2,3]. In the field of management science and operations research, portfolio theory is widely used in project research and development (R&D) [4], supplier selection [5], material selection, etc. With the development of SoS science, portfolio theory has become increasingly popular in the field of weapon SoS construction, where the optimal system portfolio is selected by evaluating candidate system portfolios through model-based methods. So far, little research has measured weapon system portfolios without subjective criteria. Typical measurements in the literature, such as benefit-risk analysis [6], cost-efficiency analysis [7], and requirement-satisfaction analysis [8], are inaccurate and unconvincing to some extent because they usually require too much expert experience.

Motivated by the issues mentioned above, data-driven methods are combined with traditional model-based approaches to improve the accuracy and credibility of evaluation results on system portfolios by reducing the dependence on human expertise. Data-driven methods complement model-based approaches rather than replace them, because data without models cannot build the bridge that connects scheme variable inputs to evaluation outputs.

In civilian fields, portfolio selection theory has mainly been studied and applied to project portfolio problems [9]. From the perspective of modeling, scenario-based models are frequently used to describe the boundary of possible cases, based on which decision-makers evaluate and select the best-matched system portfolio [10,11,12,13]. Robust models are also widely studied and applied in project portfolio problems to overcome the difficulty of determining the probabilities of future scenarios, aiming to select a system portfolio that performs well in almost all possible situations [14,15,16]. For the evaluation and trade-off of project portfolios, various methods have been proposed and studied, such as risk analysis methods [17], value evaluation methods [18], cost-efficiency methods [19], fuzzy assessment methods [4], preference-based methods [20], game theory, interactive decision methods [21], etc. What these methods have in common is that they determine the value and risk of a system portfolio to indicate, in abstract terms, what decision-makers do and do not expect. With regard to portfolio planning and optimization, the goal is to select the optimal project portfolio by analyzing and comparing candidate project portfolios. Mixed integer models [22], multi-objective optimization [23], and hybrid and dynamic planning are the most popular optimization methods. In addition, genetic algorithms [23], Monte Carlo simulation [24] and Lagrangian relaxation methods are also widely used in the solving process when facing a large solution space and specific constraints.

In military fields, most methods for system portfolio selection are based on specific evaluation models, where the most commonly investigated techniques include multiple objective analysis, multiple criteria analysis [25], value analysis [26], cost-efficiency analysis [27], expert judgment [27], Monte Carlo techniques, risk analysis, etc. In detail, Yang et al. [26] formulate the weapon system portfolio problem as a mixed integer non-linear optimization model and solve it with an adaptive immune genetic algorithm. Greiner et al. [28] summarize the challenges faced by the Department of Defense (DoD) in determining weapon system value during portfolio selection processes. Cheng et al. [29] use combat networks and operation loops to analyze strategies for the weapon system portfolio selection problem, where operational capability evaluation indexes of weapon systems are constructed. Zhou et al. [30] deal with weapon system portfolio selection problems based on fuzzy clustering, with the maximum deviation method applied to rank all candidates by calculating the weight of each weapon system. Kangaspunta et al. [27] use the cost-efficiency method to decide on the acquisition and maintenance of military equipment, aiming to build long-term capabilities for future military conflicts. Li et al. [31] adopt a network-based method to formulate and analyze the weapon system portfolio architecting problem by embedding different types of systems into a network. Zhou et al. [32] study the portfolio planning problem oriented to evolving capability requirements with a capability-based approach from the perspective of operations research. Huang et al. [33] regard weapon system portfolio selection as a constrained combinatorial optimization problem and use a decision-making method based on a self-adaptive memetic algorithm to maximize the expected damage to hostile targets.

In both civilian and military fields, model-based portfolio selection methods have been studied in depth. As the requirements for more accurate and valid approaches increase, the data-driven idea becomes appropriate for portfolio selection. In this paper, we focus mainly on system portfolio evaluation, where a key part is determining the criteria that influence the evaluation result of an object. Herein, two criteria, value and risk, are used to evaluate system portfolios: the value criterion is decided according to the capability gaps of system portfolios, and the risk criterion is decided by the remaining useful life (RUL) of systems. Based on the two criteria, the optimization aims to obtain the system portfolio with the maximal value and minimal risk within a given cost limit. To increase credibility and practicability, the weight information in value evaluation and the RUL are both decided according to simulation data instead of expert experience.

The remaining parts of the paper are structured as follows. In the second section, the capability gap-based value decision method and the RUL-based risk decision method are studied. In the third section, a case is examined to verify the utility and effectiveness of the proposed methods and models. Then, the results are discussed by analyzing the frequency of being selected and the association rules.

2. Materials and Methods

2.1. Capability Gap Based Value Decision

2.1.1. Weight Decision Based on Correlation Analysis

Weight information determines the importance of criteria and thus influences the final evaluation results of system portfolios. Typically, criteria weights are determined according to expert experience, an approach that is criticized for its subjectivity and infeasibility. Correlation analysis is a quantitative method for measuring the correlations between independent and dependent variables and can therefore be used to indicate the weights of independent variables.

The maximal information coefficient (MIC) is used to measure correlations among all kinds of data. Compared to other measurements, the MIC is acknowledged to be more sensitive in identifying correlations among variables, whether the relations are linear or non-linear (cubic, exponential, sinusoidal, parabolic, etc.).

The correlation between variables X and Y can be indicated by the mutual information, as Equation (1) shows.

I (X, Y) = \sum_{x, y} p (x, y) \log \frac{p (x, y)}{p (x) p (y)},

(1)

Because the information entropy of a discrete random variable is denoted as $H (p) = - \sum_{i = 1}^{n} p_{i} \log p_{i}$, where $\sum_{i = 1}^{n} p_{i} = 1$, $I (X, Y)$ can be shown to be equivalent to Equation (2).

\begin{aligned} I (X, Y) & = H (X) + H (Y) - H (X, Y) \\ & = H (X) - H (X | Y) \\ & = H (Y) - H (Y | X) \end{aligned},

(2)

With respect to the MIC, it is assumed that $D = X \times Y \subset ℝ^{2}$ is the variable space. $D$ is divided into a grid $G$ of size $x \times y$, and the distribution of the data $D$ on grid $G$ is represented by $D |_{G}$. Multiple divisions are executed on $D$, and $I (D |_{G})$ is calculated under each division. The MIC is then obtained by taking the maximal $I (D |_{G})$ value over all possible division schemes.

Here, a grid that divides each axis into two parts is taken as an example, as Figure 1 shows. All data points are divided into four areas: top left, top right, bottom left, and bottom right. The numbers of data points belonging to the four areas are 1, 4, 4, and 1, so the normalized data point frequencies in the four areas are 0.1, 0.4, 0.4, and 0.1. Herein, $X$ has two values, left and right, and $Y$ has two values, up and down. The joint probabilities of the data in the four areas are $p (X = \text{left}, Y = \text{up}) = 0.1$, $p (X = \text{right}, Y = \text{up}) = 0.4$, $p (X = \text{left}, Y = \text{down}) = 0.4$, and $p (X = \text{right}, Y = \text{down}) = 0.1$. Therefore, the marginal probabilities of $X$ and $Y$ are $p (X = \text{left}) = 0.5$, $p (X = \text{right}) = 0.5$, $p (Y = \text{up}) = 0.5$ and $p (Y = \text{down}) = 0.5$. According to the mutual information formula introduced above, the mutual information of $X$ and $Y$ is obtained as Equation (3).

\begin{aligned} I (X, Y) & = p (X = \text{left}, Y = \text{up}) \log \frac{p (X = \text{left}, Y = \text{up})}{p (X = \text{left}) p (Y = \text{up})} \\ & + p (X = \text{right}, Y = \text{up}) \log \frac{p (X = \text{right}, Y = \text{up})}{p (X = \text{right}) p (Y = \text{up})} \\ & + p (X = \text{left}, Y = \text{down}) \log \frac{p (X = \text{left}, Y = \text{down})}{p (X = \text{left}) p (Y = \text{down})} \\ & + p (X = \text{right}, Y = \text{down}) \log \frac{p (X = \text{right}, Y = \text{down})}{p (X = \text{right}) p (Y = \text{down})} \\ & = 0.1 \times \log \frac{0.1}{0.5 \times 0.5} + 0.4 \times \log \frac{0.4}{0.5 \times 0.5} + 0.4 \times \log \frac{0.4}{0.5 \times 0.5} + 0.1 \times \log \frac{0.1}{0.5 \times 0.5} \end{aligned},

(3)

By using a traversal algorithm, the maximal value over all possible division schemes can be found as $I^{*} (D, x, y) = \max I (D |_{G})$. Then, the characteristic matrix $M (D)_{x, y}$ is constructed by normalizing $I^{*} (D, x, y)$ as $M (D)_{x, y} = I^{*} (D, x, y) / \log \min \{x, y\}$, whereupon the MIC can be obtained according to $\mathrm{MIC} (D) = \max_{x, y < B (n)} \{M (D)_{x, y}\}$.

Finally, for the relation $y = R (x)$, $x = [x_{1}, x_{2}, \dots, x_{n}]$, the weight information of the elements in $x$ can be obtained by calculating the MIC between each independent variable in $[x_{1}, x_{2}, \dots, x_{n}]$ and the dependent variable $y$.
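As a minimal sketch of the calculation in Equations (1) to (3), the following Python snippet evaluates the mutual information of the four-area grid example and the corresponding normalized entry of the characteristic matrix for that single grid. The function names and the base-2 logarithm are assumptions made for illustration, since the text does not fix the logarithm base. A full MIC computation would additionally search over all admissible grids, which is not attempted here.

```python
import math


def mutual_information(joint, base=2.0):
    """Mutual information I(X, Y) of a joint probability table.

    joint[i][j] is p(X = i, Y = j); the base-2 logarithm is an assumption.
    """
    px = [sum(row) for row in joint]            # marginal distribution of X
    py = [sum(col) for col in zip(*joint)]      # marginal distribution of Y
    total = 0.0
    for i, row in enumerate(joint):
        for j, pxy in enumerate(row):
            if pxy > 0.0:                       # 0 * log(0) is treated as 0
                total += pxy * math.log(pxy / (px[i] * py[j]), base)
    return total


def normalized_entry(joint, base=2.0):
    """Characteristic-matrix entry for one fixed grid: I / log(min{x, y})."""
    x, y = len(joint), len(joint[0])
    return mutual_information(joint, base) / math.log(min(x, y), base)


# Grid example from the text: rows are X = left/right, columns are Y = up/down.
grid = [[0.1, 0.4],
        [0.4, 0.1]]
mi = mutual_information(grid)
print(round(mi, 4))                      # about 0.2781 bits
print(round(normalized_entry(grid), 4))  # same value, since log2(min{2, 2}) = 1
```

With base-2 logarithms the normalizer for a 2 × 2 grid equals 1, so the normalized entry coincides with the mutual information itself.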

2.1.2. Value Model Construction

Value is a measurement to denote the importance degree of an object. In the military field, a frequently used measurement for denoting value is the capability gap, meaning the gap between system capabilities and capability requirements. Before calculating the capability gap, it’s necessary to obtain system portfolio capabilities, which are the combination of those of component systems.

First, let $C R = \{c r_{1}, \dots, c r_{k}\}$ be the set of capability requirements proposed by stakeholders, where $c r_{j}$ denotes the value of the $j$th capability requirement. $C (s_{i}) = (c_{i 1}, \dots, c_{i k})$ represents the capability values of the system $s_{i}$, where $c_{i j}$ denotes the capability value of $s_{i}$ on the $j$th capability. $P C (x_{i}) = (p c_{1} (x_{i}), \dots, p c_{k} (x_{i}))$ denotes the capability values of the system portfolio corresponding to scheme $x_{i}$, where $p c_{j} (x_{i})$ is the $j$th capability value.

In the combination process, one case that must be considered is that certain systems in a system portfolio may provide the same capability. Combining capabilities of this kind is a key procedure. Four rules are introduced as follows to support this combination.

Assume there are $n$ systems $s_{i}, i = 1, \dots, n$ in a system portfolio that provide the same capability $t$, with capability values $c_{i t}, i = 1, \dots, n$.

(1): Additive rule: the combined capability value is $\sum_{i = 1}^{n} c_{i t}$ . E.g., assuming that 3 transportation systems operate at the same time with freight volumes of 5 t, 6 t and 7 t respectively, the portfolio of the 3 systems can provide a freight capability of 18 t.
(2): Maximal rule: the combined capability value is $\max \{c_{i t}\}, i = 1, \dots, n$ . E.g., assuming that there are three bridges over a river that can bear weights of 100 t, 120 t and 130 t respectively, only objects of no more than 130 t can cross the river, because an object can only use one bridge at a time.
(3): Minimal rule: the combined capability value is $\min \{c_{i t}\}, i = 1, \dots, n$ . E.g., assuming that there are 3 oil pipelines connected in series with oil flows of 5 t/hour, 6 t/hour and 3 t/hour, the maximal oil flow of the 3 pipelines is 3 t/hour.
(4): Average rule: the combined capability value is $\frac{1}{n} \sum_{i = 1}^{n} c_{i t}$ . E.g., assuming there are 3 forecast systems with correct-prediction probabilities of 50%, 60%, and 70%, the overall correct-prediction probability is 60%.
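The four rules above can be sketched as a single helper function. This is only an illustration: the function name `combine` and the string labels for the rules are hypothetical and do not come from the paper.

```python
def combine(values, rule):
    """Combine the capability values of systems that share one capability."""
    if not values:
        raise ValueError("at least one capability value is required")
    if rule == "additive":
        return sum(values)
    if rule == "maximal":
        return max(values)
    if rule == "minimal":
        return min(values)
    if rule == "average":
        return sum(values) / len(values)
    raise ValueError(f"unknown rule: {rule}")


# The four examples given in the text.
print(combine([5, 6, 7], "additive"))        # freight volume: 18
print(combine([100, 120, 130], "maximal"))   # bridge load: 130
print(combine([5, 6, 3], "minimal"))         # pipeline flow: 3
print(combine([50, 60, 70], "average"))      # prediction probability: 60.0
```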

In addition, capabilities can be classified into benefit and cost types. The relation between a single capability of a solution and the corresponding capability requirement is defined based on three premises: (1) When a capability of a solution is missing or worse than the inferior value of the corresponding capability requirement interval, the solution cannot meet the capability requirement at all, and its capability gap is supposed to be 1. (2) When a solution provides a capability whose value falls within the interval of the corresponding capability requirement, its capability gap should be a number in $[0, 1]$. In particular, when the solution’s capability value falls exactly in the middle of the capability requirement interval, the capability gap is supposed to be 0.5, which means the capability meets the capability requirement by half. (3) When a capability of a solution equals or is better than the superior value of the corresponding capability requirement interval, the capability fully meets the corresponding capability requirement, and the capability gap is supposed to be 0.

A concrete case is given in Figure 2, reflecting the relation between a capability and the corresponding capability requirement interval $[l, u]$. The four special points discussed above are marked by circles. To model these relations mathematically in a general form, a piecewise-linear formula is defined to fit the lines in the figure, as shown in Equation (4).

G (c_{i}, c r_{i}) = \begin{cases} \begin{cases} - c_{i} / (2 a) + 1, & 0 \leq c_{i} < a \\ (- c_{i} + u) / (2 b), & a \leq c_{i} < u \\ 0, & u \leq c_{i} \end{cases} & \text{benefit type} \\ \begin{cases} 1, & c_{i} = 0 \\ 0, & 0 < c_{i} < l \\ (c_{i} - l) / (2 b), & l \leq c_{i} < a \\ (c_{i} + worst - 2 a) / (2 \times (worst - a)), & a \leq c_{i} < worst \end{cases} & \text{cost type} \end{cases},

(4)

where $G (c_{i}, c r_{i})$ represents the capability gap between the $i$th capability and the corresponding capability requirement interval $[l, u]$. Two auxiliary notations $a$ and $b$ are defined to simplify the formula, where $a = (l + u)/2$ and $b = (u - l)/2$. The notation worst denotes the requirement threshold of a cost-type capability: the capability gap is judged to be 1 if a cost-type capability value exceeds worst.
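A minimal Python sketch of Equation (4) is given here to make the branches concrete. The function name `capability_gap` and its argument names are hypothetical. The branch that returns 1 once a cost-type value reaches worst follows the sentence above rather than a written branch of the equation, and the requirement intervals in the usage lines are read off from the arithmetic of Equation (15), so both should be treated as assumptions.

```python
def capability_gap(c, l, u, kind="benefit", worst=None):
    """Capability gap of value c against the requirement interval [l, u]."""
    a = (l + u) / 2.0   # midpoint of the requirement interval
    b = (u - l) / 2.0   # half-width of the requirement interval
    if kind == "benefit":
        if c < a:
            return 1.0 - c / (2.0 * a)
        if c < u:
            return (u - c) / (2.0 * b)
        return 0.0      # requirement fully met
    if kind == "cost":
        if worst is None:
            raise ValueError("cost-type capabilities need a worst threshold")
        if c == 0:
            return 1.0  # capability is missing
        if c < l:
            return 0.0  # better than the requirement interval
        if c < a:
            return (c - l) / (2.0 * b)
        if c < worst:
            return (c + worst - 2.0 * a) / (2.0 * (worst - a))
        return 1.0      # at or beyond the worst threshold (from the text)
    raise ValueError(f"unknown capability type: {kind}")


# Values taken from the worked example in Section 3.2.2 (intervals assumed).
print(round(capability_gap(150.0713, 100, 250), 3))                  # 0.571
print(round(capability_gap(218.758, 150, 300), 3))                   # 0.514
print(round(capability_gap(18.44034, 10, 30, "cost", 34.6853), 3))   # 0.422
```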

Then, the capability gaps of all capabilities are aggregated to compute the total capability gap of a system portfolio as $\sum_{i = 1}^{k} w_{i} G (c_{i}, c r_{i})$, where $w_{i}$ denotes the weight of the $i$th capability and is determined according to the method in Section 2.1.1. According to the capability gaps, the value is then defined as Equation (5), which indicates that the bigger the total capability gap, the smaller the value of a system portfolio.

V (x_{j}) = \frac{\sum_{i = 1}^{k} w_{i} (1 - G (c_{i}, c r_{i}))}{k}, \sum_{i = 1}^{k} w_{i} = 1,

(5)
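Equation (5) can be sketched in a few lines of Python. The function name `portfolio_value` is hypothetical; the weights and gaps in the usage lines are the ones reported later in the case study (Sections 3.2.1 and 3.2.2), and the resulting number is simply the arithmetic of Equation (5) applied to them, not a figure stated in the paper.

```python
def portfolio_value(weights, gaps):
    """Value of a portfolio per Equation (5): sum of w_i * (1 - G_i), divided by k."""
    if len(weights) != len(gaps):
        raise ValueError("weights and gaps must have the same length")
    if abs(sum(weights) - 1.0) > 1e-6:
        raise ValueError("weights must sum to 1")
    k = len(gaps)
    return sum(w * (1.0 - g) for w, g in zip(weights, gaps)) / k


weights = [0.276, 0.250, 0.174, 0.300]   # detection, communication, striking, decision time
gaps = [0.0, 0.571, 0.514, 0.422]        # capability gaps of the example portfolio
value = portfolio_value(weights, gaps)
print(round(value, 4))
```

A portfolio with no capability gaps at all would reach the upper bound of $1/k$ under this definition, because the weights sum to 1 and the result is divided by $k$.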

General “capability-based” evaluation methods are usually based on hierarchical structures, such as a tree in which the capabilities of all systems at the bottom level are aggregated upward into an integrated value. In contrast, the capability gap is a criterion denoting the gap between capabilities and capability requirements. For the general methods, an inevitable defect is that an extremely high capability pulls up the integrated capability value, which is unreasonable. The capability gap mitigates the effects of extreme capability values. For example, even an infinitely high benefit-type capability can only result in a corresponding capability gap of 0, instead of an unreasonably extreme value.

2.2. Risk Decision Based on RUL Prediction

General risk assessment methods follow the steps of risk factor identification, risk analysis, risk assessment and risk management. However, for weapon systems, decision-makers mainly focus on the availability and stability of systems in the operation process. Therefore, the RUL, which denotes when a system is predicted to fail, is used as the weapon system risk criterion. The longer the RUL, the smaller the probability that a risk event will happen in the operational process. In addition, the RUL prediction method based on similarities of degradation characteristics has been shown to perform better than other classical methods when there are sufficient historical samples. Therefore, this paper adopts degradation characteristic similarities to predict the RUL and thereby indicate system portfolio risks.

The key resource for predicting the RUL is operational data, which is typically obtained as time series from embedded sensors. Specifically, the degradation data is analyzed to support the construction of the RUL prediction model, and the main steps can be summarized as degradation track phase space construction, track matching, and RUL prediction. Detailed procedures are given in Figure 3.

2.2.1. Feature Extraction Based on Variation Coefficient

When multiple sensors are used to monitor the health of a system, a key step in predicting health states is selecting features from the multidimensional time series data, because only parameters without commonality can be treated as attribute features. Parameter screening is therefore the prerequisite for ensuring the non-commonality of parameters. Intuitively, practitioners should first deal with the correlations among parameters to make the transformed parameters independent.

In this paper, the variation coefficient is used to select the parameter features. Similar to the standard deviation and variance in statistics, the variation coefficient describes the dispersion degree of observations. It is calculated as the ratio of the standard deviation to the mean, as Equation (6) shows, and is therefore dimensionless.

V C = (\text{Standard Deviation} / \text{Mean}) \times 100 \% .

(6)

Variables with larger variation coefficients show more obvious features and therefore are more suitable for describing characteristics of data.
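The screening step can be sketched as follows. The sensor names and readings are made up for illustration, the function names are hypothetical, and the use of the sample standard deviation (`statistics.stdev`) is an assumption, because the text does not say whether the sample or the population form is intended.

```python
import statistics


def variation_coefficient(samples):
    """Variation coefficient in percent: standard deviation / mean * 100."""
    mean = statistics.fmean(samples)
    if mean == 0:
        raise ValueError("variation coefficient is undefined for a zero mean")
    return statistics.stdev(samples) / mean * 100.0


def rank_features(readings):
    """Return sensor names sorted by decreasing variation coefficient."""
    vc = {name: variation_coefficient(vals) for name, vals in readings.items()}
    return sorted(vc, key=vc.get, reverse=True)


# Hypothetical sensor readings (not data from the paper).
readings = {
    "sensor_a": [10.0, 10.0, 10.0, 10.0],   # constant signal, carries no feature
    "sensor_b": [8.0, 10.0, 12.0, 10.0],
    "sensor_c": [9.5, 10.0, 10.5, 10.0],
}
ranking = rank_features(readings)
print(ranking)
```

Sensors at the front of the ranking vary the most relative to their mean and would be kept as features under the criterion described above.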

2.2.2. Reconstruction of Degradation Track Phase Space

According to Takens’ theorem, the underlying dynamics of a system can be studied by constructing a phase space that preserves the topological properties of the original system. Given the nonlinear characteristics of degradation, the time delay embedding theorem is often applied to construct a high-dimensional phase space.

Assume that the time series is $X = (x_{1}, x_{2}, \dots, x_{N})$; then the points in the phase space can be expressed as row vectors of the following form.

X_{i} = (x_{i - (d - 1) τ}, x_{i - (d - 2) τ}, \dots, x_{i - τ}, x_{i}), i = 1 + (d - 1) τ, 2 + (d - 1) τ, \dots, N,

(7)

where $d$ is the embedded dimension of the phase space and $τ$ is the time delay.
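The construction in Equation (7) can be sketched as a short Python function. The name `delay_embed` is hypothetical, and the code uses zero-based list indices, so the first phase-space point corresponds to $i = 1 + (d - 1) τ$ in the notation of the text.

```python
def delay_embed(series, d, tau=1):
    """Phase-space points X_i = (x_{i-(d-1)tau}, ..., x_{i-tau}, x_i) per Equation (7)."""
    if d < 1 or tau < 1:
        raise ValueError("d and tau must be positive integers")
    start = (d - 1) * tau   # zero-based index of the first point with full history
    return [
        [series[i - (d - 1 - m) * tau] for m in range(d)]
        for i in range(start, len(series))
    ]


print(delay_embed([1, 2, 3, 4, 5], d=3, tau=1))
# [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
print(delay_embed([1, 2, 3, 4, 5], d=2, tau=2))
# [[1, 3], [2, 4], [3, 5]]
```

A series of length $N$ yields $N - (d - 1) τ$ phase-space points, which is why short series combined with large $d$ or $τ$ leave few points for matching.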

The embedded dimension $d$ is the key parameter in phase space construction, and its value is determined by a non-subjective algorithm. Let $X_{i} (d)$ be a point in the $d$-dimensional phase space. If the point $X_{n (i, d)}$ nearest to $X_{i} (d)$ exists, the following relation can be deduced.

‖ X_{i} (d) - X_{n (i, d)} (d) ‖ = \min_{j = 1 + (d - 1) τ, \dots, N, j \neq i} {‖ X_{i} (d) - X_{j} (d) ‖}_{\infty},

(8)

For all points in the $d$ and $d + 1$ dimensional phase spaces, the definitions of $a (i, d)$, $E (d)$, $E_{1} (d)$, $E^{*} (d)$, and $E_{2} (d)$ are given respectively.

\begin{matrix} a (i, d) = \frac{‖ X_{i} (d + 1) - X_{n (i, d + 1)} (d + 1) ‖}{‖ X_{i} (d) - X_{n (i, d)} (d) ‖} \\ E (d) = \frac{1}{N - d τ} \sum_{i = 1 + (d - 1) τ}^{N - τ} a (i, d) \\ E_{1} (d) = \frac{E (d + 1)}{E (d)} \\ E^{*} (d) = \frac{1}{N - d τ} \sum_{i = 1}^{N - τ} | x_{i + d τ} - x_{n (i, d) + d τ} | \\ E_{2} (d) = \frac{E^{*} (d + 1)}{E^{*} (d)} \end{matrix},

(9)

Then the value of the embedded dimension $d$ can be determined by finding the smallest dimension that makes the $d$ and $d + 1$ dimensional phase spaces topologically equivalent. Topological equivalence here means that the nearest neighbors in the $d$-dimensional space remain the closest in the $d + 1$ dimensional space, so that $E_{1} (d)$ tends to a stable value when the $d$ and $d + 1$ dimensional phase spaces are topologically equivalent. In practice, however, it is difficult to find the smallest $d$ that makes $E_{1} (d)$ stable. That is why $E^{*} (d)$ and $E_{2} (d)$ are defined. If $x$ is a random time series, $E_{2} (d)$ stays close to 1 for all $d$; otherwise, $E_{2} (d)$ varies with $d$. By combining $E_{1} (d)$ and $E_{2} (d)$, the embedded dimension of the degradation track phase space can be determined. After the construction of the phase space, the time series can be transformed into a track in the phase space. By referring to the degradation tracks transformed from historical data and comparing the similarity of the new track with the reference tracks in the phase space, the RUL can be predicted.

2.2.3. System Portfolio Risk Determination Based on RUL

Assume that $Z = \{Z_{1}, Z_{2}, \dots, Z_{l_{1}}\}$ represents the degradation reference track and $Y = \{Y_{1}, Y_{2}, \dots, Y_{l_{2}}\}$, $l_{1} > l_{2}$, is the current degradation track. The normalized cross correlation (NCC) is introduced to measure the similarity between tracks. The specific definition of the NCC is given as follows:

s_{Y Z} (i) = \frac{(Y - \bar{Y}) \times (Z_{i} - {\bar{Z}}_{i})^{'}}{{‖ Y - \bar{Y} ‖}_{2} {‖ Z_{i} - {\bar{Z}}_{i} ‖}_{2}},

(10)

where $Z_{i} = \{Z_{i}, Z_{i + 1}, \dots, Z_{i + l_{2} - 1}\}$ and $Y$ are of the same length, and $\bar{Y}$ and ${\bar{Z}}_{i}$ are the mean vectors of $Y$ and $Z_{i}$ respectively.
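The matching step built on Equation (10) can be sketched as a sliding-window search. As a simplifying assumption, the sketch treats each track as a flat sequence of scalars rather than a sequence of $d$-dimensional phase-space points; the function names `ncc` and `best_match` and the sample data are hypothetical.

```python
import math


def ncc(y, z):
    """Normalized cross correlation of two equally long sequences (Equation (10))."""
    if len(y) != len(z):
        raise ValueError("sequences must have the same length")
    mean_y = sum(y) / len(y)
    mean_z = sum(z) / len(z)
    dy = [v - mean_y for v in y]   # Y minus its mean
    dz = [v - mean_z for v in z]   # Z_i minus its mean
    denom = math.sqrt(sum(v * v for v in dy)) * math.sqrt(sum(v * v for v in dz))
    if denom == 0.0:
        return 0.0                 # a constant segment has no defined shape
    return sum(p * q for p, q in zip(dy, dz)) / denom


def best_match(current, reference):
    """Slide the current track along the reference; return (start index, NCC)."""
    window = len(current)
    scores = [ncc(current, reference[i:i + window])
              for i in range(len(reference) - window + 1)]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


reference = [0, 1, 4, 9, 16, 25, 36]   # hypothetical reference degradation track
current = [9, 16, 25]                  # hypothetical current degradation track
print(best_match(current, reference))
```

Because the NCC removes the mean and the scale of each segment, it compares only the shape of the tracks, so segments from different degradation stages can still score close to 1.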

Taking the $j$th time series $x^{j} = \{x_{1}^{j}, x_{2}^{j}, \dots, x_{N_{j}}^{j}\}$ as an example, the time points of the time series are denoted as $t^{j} = \{t_{1}^{j}, t_{2}^{j}, \dots, t_{N_{j}}^{j}\}$. Apparently, $t_{N_{j}}^{j}$ is the running life of the $j$th degradation process. The time delay $τ$ is taken as 1 and the embedded dimension as $d$ for all degradation track phase spaces ($d$ is no less than the maximal embedded dimension of all phase spaces), so that the original time series of each degradation process is transformed into a track in the phase space. Denoting the reference track for the $j$th process as $Z_{j} = {[X_{d}^{j}, X_{d + 1}^{j}, \dots, X_{N_{j}}^{j}]}^{T}$, the points in the phase space are expressed as $X_{i}^{j} = [x_{i - (d - 1)}^{j}, x_{i - (d - 2)}^{j}, \dots, x_{i}^{j}]$.

Assume the current degradation time series is $y = \{y_{1}, y_{2}, \dots, y_{c}\}$. Similarly, it is transformed into an incomplete degradation track denoted as $Y = [Y_{d}, Y_{1 + d}, \dots, Y_{c}]$, with $Y_{i} = [y_{i - (d - 1)}, y_{i - (d - 2)}, \dots, y_{i - 1}, y_{i}]$.

When performing track matching, a track subset $Y^{k} = [Y_{k - l + 1}, \dots, Y_{k}]$ with a time window length of $l$ is selected in the current phase space. The NCC between the current track $Y^{k}$ and the reference track $Z_{j}$ is then modified as follows.

s_{k}^{j} = s_{Y Z} + (1 - \frac{| t_{k} - t_{k}^{j} |}{t_{k}}),

(11)

If $s_{k}^{j}$ reaches its maximum at the time point $T_{k}^{j}$, the sub-track at time $T_{k}^{j}$ is regarded as the best match with respect to the degradation reference track $Z_{j}$. This result also reflects the influence of the track shape on the degradation stage. Therefore, the RUL given by the part of the $j$th degradation reference track that is most similar to the current track can be estimated as $L_{k}^{j} = t_{N_{j}}^{j} - T_{k}^{j}$.

On the basis of the best track matching result above, the weight of the remaining life calculated according to the $j$th degradation reference track is given as $w_{k}^{j} = s_{k}^{j} (T_{k}^{j}) / (\sum_{j = 1}^{M} s_{k}^{j} (T_{k}^{j}))$, where $M$ is the total number of degradation processes. The remaining life of system $k$ is then estimated as Equation (12).

R U L_{k} = \sum_{j = 1}^{M} w_{k}^{j} L_{k}^{j},

(12)
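Equations (11) and (12) can be sketched as follows. The function names and the sample numbers are hypothetical. The reading of $t_{k}$ as the current running time of the monitored system and of $t_{k}^{j}$ as the time of the candidate position on the $j$th reference track is an interpretation, since the text does not define these two symbols explicitly.

```python
def modified_similarity(ncc_score, t_current, t_reference):
    """Equation (11): NCC plus a term that rewards closeness in running time."""
    return ncc_score + (1.0 - abs(t_current - t_reference) / t_current)


def estimate_rul(matches):
    """Equation (12): similarity-weighted average of per-reference remaining lives.

    matches is a list of (similarity, reference_life, matched_time) tuples,
    one per reference degradation process.
    """
    total = sum(s for s, _, _ in matches)
    if total <= 0:
        raise ValueError("similarities must sum to a positive number")
    return sum((s / total) * (life - t_match) for s, life, t_match in matches)


# Hypothetical best matches against two reference degradation processes.
matches = [
    (1.5, 200.0, 120.0),   # weight 0.75, remaining life 80
    (0.5, 180.0, 140.0),   # weight 0.25, remaining life 40
]
rul = estimate_rul(matches)
print(rul)
```

References that match the current track more closely contribute more to the estimate, so a single poorly matching reference has limited influence on the predicted RUL.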

Then, the RUL is transformed into risk according to the joint probability. A risk event is identified when any system in a system portfolio fails. Therefore, for a system portfolio $x_{i} = (s_{1}, s_{2}, \dots, s_{n})$, the risk is calculated as Equation (13) shows.

R (x_{i}) = 1 - \prod_{j = 1}^{n} [{(1 - \frac{1}{R U L_{j}})}^{x_{i j}}],

(13)
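A minimal Python sketch of Equation (13) follows. The function name `portfolio_risk` and the sample values are hypothetical, and the check that every RUL is at least 1 is an added assumption so that each term $1 - 1 / RUL_{j}$ stays a valid probability.

```python
def portfolio_risk(selection, rul):
    """Risk per Equation (13): 1 minus the product of (1 - 1/RUL_j) over selected systems."""
    if len(selection) != len(rul):
        raise ValueError("selection and rul must have the same length")
    survive = 1.0
    for chosen, life in zip(selection, rul):
        if life < 1:
            raise ValueError("RUL values are assumed to be at least 1")
        if chosen:                     # x_ij = 1: the system is in the portfolio
            survive *= 1.0 - 1.0 / life
    return 1.0 - survive


# Hypothetical portfolio: systems 1 and 3 are selected.
risk = portfolio_risk([1, 0, 1], [10.0, 5.0, 20.0])
print(round(risk, 3))   # 0.145
```

Adding a system can never lower the risk under this definition, which is what puts the risk objective in tension with the value objective.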

In conclusion, the RUL is an important criterion for weapon system research because a longer RUL reduces the failure risk of systems in combat. The availability of each component system is the basis for guaranteeing the normal operation of system portfolios. An operation activity will be disrupted or may even fail if any system of a system portfolio malfunctions during the operation process, which creates the demand for systems with long RUL.

2.3. System Portfolio Optimization

As discussed above, the paper tries to solve the system portfolio problem considering system values and risks. The corresponding notation is as follows. Let $S = \{s_{1}, \dots, s_{m}\}$ denote the set of alternative systems, indicating that $m$ systems can be selected for a system portfolio. $x_{i} = (x_{i 1}, \dots, x_{i m}), x_{i} \in \{0, 1\}^{m}$ is one of the system portfolio schemes, where $x_{i j} \in \{0, 1\}$, with $x_{i j} = 0$ denoting that the $j$th system in $S$ is not selected in scheme $x_{i}$ and $x_{i j} = 1$ denoting that the $j$th system in $S$ is selected in scheme $x_{i}$. All possible schemes compose the solution space $X$. Let $E = (e_{1}, \dots, e_{m})$ represent the costs of the systems, where $e_{i}$ is the cost of system $s_{i}$, and let $B$ denote the total budget.

Then, the system portfolio optimization model can be formulated as follows.

\begin{aligned} \max \quad & V (x_{j}) = \frac{\sum_{i = 1}^{k} w_{i} (1 - G (c_{i}, c r_{i}))}{k} \\ \min \quad & R (x_{j}) = 1 - \prod_{i = 1}^{m} \left[ {\left(1 - \frac{1}{R U L_{i}}\right)}^{x_{j i}} \right] \\ \text{s.t.} \quad & \begin{cases} \sum_{i = 1}^{m} x_{j i} \times e_{i} \leq B \\ x_{j} = (x_{j 1}, \dots, x_{j m}), x_{j} \in \{0, 1\}^{m} \end{cases} \end{aligned},

(14)

where $V (x_{j})$ and $R (x_{j})$ are the value and risk of the system portfolio scheme $x_{j}$, and $G (c_{i}, c r_{i})$ represents the capability gap on the $i$th capability. In addition, the cost of scheme $x_{j}$ must be within the budget limitation.

The optimization is a multi-objective problem whose 2 objectives conflict with each other. Therefore, an optimal system portfolio with the best performance in both objectives does not exist. The target is to obtain the Pareto optimal solutions of the 2 objectives, also known as non-dominated solutions.
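The notion of non-dominated solutions can be illustrated with a brute-force filter over a handful of candidate portfolios. This sketch is not the NSDE algorithm used in the case study; it only shows the dominance test for the two objectives under the budget constraint. The function names, the dictionary keys and the sample portfolios are hypothetical.

```python
def dominates(p, q):
    """True if p is no worse than q in value and risk and strictly better in one."""
    no_worse = p["value"] >= q["value"] and p["risk"] <= q["risk"]
    better = p["value"] > q["value"] or p["risk"] < q["risk"]
    return no_worse and better


def pareto_front(portfolios, budget):
    """Keep the feasible portfolios that no other feasible portfolio dominates."""
    feasible = [p for p in portfolios if p["cost"] <= budget]
    return [p for p in feasible
            if not any(dominates(q, p) for q in feasible)]


# Hypothetical candidates: maximize value, minimize risk, keep cost within budget.
candidates = [
    {"name": "A", "value": 0.8, "risk": 0.3, "cost": 50},
    {"name": "B", "value": 0.6, "risk": 0.1, "cost": 40},
    {"name": "C", "value": 0.5, "risk": 0.2, "cost": 30},    # dominated by B
    {"name": "D", "value": 0.9, "risk": 0.2, "cost": 120},   # exceeds the budget
]
front = pareto_front(candidates, budget=100)
print([p["name"] for p in front])   # ['A', 'B']
```

Exhaustive filtering of this kind is only workable for small candidate sets, which is why an evolutionary algorithm is needed once the number of alternative systems grows.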

3. Results

3.1. Background Description

It is assumed that the problem is to select an optimal system portfolio under an anti-missile scenario, where an object suffers saturation missile attacks. The objective is to select a system portfolio from 100 alternative systems ($2^{100} - 1$ candidate system portfolios in total) under the budget limitation so as to maximize the system portfolio value and simultaneously minimize the overall risk.

According to the OODA (observe, orient, decide, act) operation process, the capabilities discussed in the paper are $c 1$ detection range, $c 2$ communication range, $c 3$ striking range and $c 4$ decision time, where the former three capabilities are of benefit type and the last one is of cost type. For a specific operation scenario, the capability requirements and combination rules are shown in Table 1.

In addition, the capabilities of the systems are generated by a Monte Carlo simulation according to truncated normal distribution functions. The histogram of the generated data is shown in Figure 4. The worst value of the cost-type capability decision time is 34.6853, which is used in the value calculation according to Section 2.1.2.

3.2. Value Calculation

3.2.1. Weight Determination

The weight information is deduced by correlation analysis of simulations run in “Command: Modern Air/Naval Operations”, a military simulator for modern conflicts.

The independent variables are the four capabilities, that is, detection range, communication range, striking range, and decision time. The dependent variable is the number of intercepted missiles. Running the simulation automatically 10,000 times generates 10,000 sets of data. The maximal information coefficient (MIC) between each capability and the dependent variable is then computed, as Table 2 shows. After normalization, the weights of the four capabilities are 0.276, 0.250, 0.174, and 0.300.
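The normalization step can be checked with a few lines of Python. This is a minimal sketch that assumes the weights are simply the MIC values of Table 2 divided by their sum; the function name `normalize` and the dictionary keys are illustrative.

```python
# MIC between each capability and the intercepted missile number (Table 2).
mic = {
    "detection_range": 0.854,
    "communication_range": 0.772,
    "striking_range": 0.537,
    "decision_time": 0.928,
}


def normalize(scores):
    """Scale non-negative scores so that they sum to 1."""
    total = sum(scores.values())
    return {name: value / total for name, value in scores.items()}


weights = normalize(mic)
rounded = {name: round(w, 3) for name, w in weights.items()}
print(rounded)
```

Under this assumption the rounded weights agree with the values 0.276, 0.250, 0.174, and 0.300 reported above.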

3.2.2. Value Calculation

Because it is impractical to calculate the values of all $2^{100} - 1$ candidate system portfolios, one example of the value calculation process is presented. Assume that a system portfolio SP1 has five component systems, S1, S2, S3, S4, and S5, with the capability information shown in Table 3.

According to the capability combination rules in Table 1, the combined capabilities of system portfolio SP1 are shown in the last row (SP1) of Table 3.
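The combination step can be reproduced from Tables 1 and 3. The sketch below assumes the rules "Maximal", "Addition", and "Average" map to `max`, `sum`, and the arithmetic mean; the helper `combine` is an illustrative name.

```python
from statistics import mean

# Capabilities of the five component systems (Table 3), in the order
# detection range, communication range, striking range, decision time.
systems = {
    "S1": (128.8502018, 45.00221559, 41.93626171, 22.55909356),
    "S2": (205.1604693, 36.35948504, 50.01541323, 16.53053513),
    "S3": (156.9600287, 27.11329846, 43.28357399, 16.46184426),
    "S4": (152.7120943, 26.19071935, 53.57115127, 21.31900886),
    "S5": (111.5170147, 15.40553804, 29.95161145, 15.33123079),
}

# One combination rule per capability (Table 1):
# maximal, addition, addition, average.
rules = (max, sum, sum, mean)


def combine(capability_rows, rules):
    """Apply each rule to the corresponding capability column."""
    columns = list(zip(*capability_rows))
    return tuple(rule(column) for rule, column in zip(rules, columns))


sp1 = combine(systems.values(), rules)
print([round(v, 4) for v in sp1])
```

The result matches the SP1 row of Table 3 to the precision printed there.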

Then, according to Equation (4), the capability gaps of four combined capabilities are calculated as Equation (15) shows. The value of any system portfolio can be calculated based on the same steps.

\begin{array}{l} G(c_{1}, cr_{1}) = 0 \\ G(c_{2}, cr_{2}) = \left(\frac{-150.0713}{100 + 250}\right) + 1 = 0.571 \\ G(c_{3}, cr_{3}) = \left(\frac{-218.758}{150 + 300}\right) + 1 = 0.514 \\ G(c_{4}, cr_{4}) = \frac{18.44034 - 10}{30 - 10} = 0.422 \end{array},

(15)
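As a check, the arithmetic of Equation (15) can be repeated and the gaps inserted into the value objective of Equation (14) with the weights from Section 3.2.1. This sketch copies the expressions of Equation (15) literally and does not restate the general gap function of Equation (4); the function name `portfolio_value` is illustrative, and the resulting value of SP1 is not reported in the paper.

```python
# Combined capabilities of SP1 (last row of Table 3).
c2, c3, c4 = 150.0713, 218.758, 18.44034

# Capability gaps, written exactly as the arithmetic in Equation (15).
gaps = [
    0.0,                    # detection range: reported as zero gap
    -c2 / (100 + 250) + 1,  # communication range
    -c3 / (150 + 300) + 1,  # striking range
    (c4 - 10) / (30 - 10),  # decision time (cost type)
]

# Capability weights from Section 3.2.1.
weights = [0.276, 0.250, 0.174, 0.300]


def portfolio_value(weights, gaps):
    """Value objective of Equation (14): weighted closed gaps divided by k."""
    k = len(gaps)
    return sum(w * (1 - g) for w, g in zip(weights, gaps)) / k


print([round(g, 3) for g in gaps])
print(round(portfolio_value(weights, gaps), 4))
```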

3.3. Risk Determination Based on RUL Prediction

In the case study, the key component of weapon systems, the turbine engine, is taken as an example for analyzing risks. The data is derived from experiments conducted with the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) software, as shown in Figure 5.

C-MAPSS simulates the operation of a turbine engine of the 90,000-pound thrust class and records monitoring signals. Based on the principles of thermodynamics, two failure modes are designed: high-pressure compressor degradation and fan degradation. The main functional modules and connections are shown in Figure 6. The simulation runs with the following settings:

(1): The simulation data contains time series of 21 variables and is divided into a training set and a testing set. Each multivariate time series corresponds to a specific engine, so the data can be regarded as generated by the engines of different systems.
(2): The initial wear condition of each engine may differ and there are manufacturing variations; both are considered normal and are not treated as causes of engine failure.
(3): Three operational setting parameters have a substantial impact on engine performance.
(4): The data contains noise.
(5): The engines operate normally at the start and begin to degrade at some point in the time series. In the training set, the cumulative degradation continues to grow until it reaches or exceeds the preset threshold. In the testing set, the time series terminates at some point before the engine fails.

As a result, 100 degradation tracks are obtained in the training set and 100 tracks before failure in the testing set. The training data is used to establish the RUL prediction model of engines, and the testing set is used to test the feasibility of the model.

The monitoring data is shown in the scatter plots in Figure 7. Each plot visualizes the 100 degradation tracks of one variable in the training set. The engine code, the operation cycle and the 3 operational setting parameters are not shown in the figure.

Because constant variables cannot reflect the evolution of engine degradation, variables 1, 5, 6, 10, 16, 18, and 19 are not regarded as feature variables. Moreover, the tracks exhibit inconsistent trends in variables 9 and 14, so these two variables are inadequate for describing the degradation process.

Then, the variation coefficients of the remaining 12 variables are calculated from the degradation data; the results are shown in Table 4. After eliminating the variables with small variation coefficients, variables 3, 4, 11, 15, 17, 20, and 21 are chosen as the base variables that represent the engine degradation characteristics.
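The elimination rule can be illustrated with the values of Table 4. The paper does not state the cut-off it uses, so the threshold of 0.2 below is a hypothetical choice that happens to separate the two groups; `select_features` is an illustrative name.

```python
# Variation coefficients of the 12 remaining sensor variables (Table 4).
vc = {
    2: 0.0567, 3: 0.2760, 4: 0.5110, 7: 0.1239, 8: 0.0020, 11: 0.4573,
    12: 0.1117, 13: 0.0020, 15: 0.3408, 17: 0.2907, 20: 0.3519, 21: 0.3514,
}

# Hypothetical cut-off: the paper only says that variables with small
# variation coefficients are eliminated.
THRESHOLD = 0.2


def select_features(vc, threshold):
    """Keep the sensors whose variation coefficient exceeds the threshold."""
    return sorted(sensor for sensor, value in vc.items() if value > threshold)


selected = select_features(vc, THRESHOLD)
print(selected)
```

With this threshold the selection coincides with the seven base variables listed above; any cut-off between 0.1239 and 0.2760 gives the same result.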

According to the RUL prediction method, the remaining life of the engines in the testing set can be estimated by matching the testing data with the reference tracks. Then, the risks can be obtained. The results are shown in Table 5.
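The risk values in Table 5 coincide with 1/RUL (for example, 1/112 ≈ 0.008929 for S1), and the risk objective of Equation (14) combines them multiplicatively. The following is a minimal sketch of that calculation, assuming the product runs over the selected systems; the function names are illustrative and the three-system portfolio is an arbitrary example, not a result from the paper.

```python
from math import prod

# Predicted RUL of three systems (Table 5).
rul = {"S9": 137, "S25": 203, "S50": 195}


def system_risk(rul_value):
    """Risk of a single system, taken as the reciprocal of its RUL."""
    return 1.0 / rul_value


def portfolio_risk(selected, rul):
    """Risk objective of Equation (14): 1 - prod(1 - 1/RUL_i) over selected systems."""
    return 1.0 - prod(1.0 - system_risk(rul[name]) for name in selected)


print(round(system_risk(rul["S25"]), 6))
print(round(portfolio_risk(["S9", "S25", "S50"], rul), 6))
```

Because every selected system multiplies in a factor below 1, adding a system can only increase the portfolio risk, which is why the risk objective conflicts with the value objective.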

3.4. Portfolio Selection Results Analysis

In total, there are $2^{100} - 1$ possible schemes, which is far too many to enumerate. Thus, a heuristic algorithm is needed to solve the problem. Because the objectives are to maximize the value and minimize the risk of system portfolios, a multi-objective algorithm is employed. The non-dominated sorting genetic algorithm (NSGA) is a widely used multi-objective algorithm that performs well at retaining elites in the offspring. The differential evolution (DE) operator, in turn, is good at preserving population diversity. Thus, the paper uses the non-dominated sorting differential evolution (NSDE) algorithm, which combines the advantages of NSGA and DE, to solve the system portfolio optimization problem. The parameters are set as follows: the population size is Pop = 100, the number of iterations is Gen = 1000, the mutation probability is 0.01, and the crossover probability is 0.2.

Because of the randomness inherent in all genetic algorithms, the results generated by NSDE fluctuate from run to run. A typical way to improve the quality of the result is to run the algorithm multiple times and then select the best individuals by comparing the results. In this case, the program is run 10 times to generate 10 Pareto results, each containing 200 individuals, shown in (a) of Figure 8. Then, the 10 sets of Pareto results are combined to obtain the best 200 individuals among them, as shown in (b) of Figure 8.
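The merging of several runs can be sketched as pooling all individuals and keeping only those that no other individual dominates. This is an illustrative sketch, not the paper's implementation: the (value, risk) pairs are toy numbers, the function names are assumptions, and the truncation of the merged front to 200 individuals is omitted.

```python
def dominates(a, b):
    """True if a is no worse than b on both objectives and better on one.

    Each point is a (value, risk) pair: value is maximized, risk is minimized.
    """
    return a[0] >= b[0] and a[1] <= b[1] and (a[0] > b[0] or a[1] < b[1])


def non_dominated(points):
    """Return the points that are not dominated by any other point."""
    unique = set(points)
    return sorted(p for p in unique if not any(dominates(q, p) for q in unique))


def merge_runs(runs):
    """Pool the fronts of several runs and keep the non-dominated points."""
    pooled = [p for run in runs for p in run]
    return non_dominated(pooled)


# Toy fronts from two runs; these numbers are not taken from the paper.
run_a = [(0.10, 0.010), (0.15, 0.020), (0.20, 0.040)]
run_b = [(0.12, 0.010), (0.15, 0.025), (0.22, 0.040)]
merged = merge_runs([run_a, run_b])
print(merged)
```

The pairwise comparison is quadratic in the number of pooled points, which is acceptable for the 10 × 200 individuals considered here.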

In detail, the 200 individuals of the best Pareto set are shown in the system option diagram in Figure 9. The rectangular area is divided into 100 × 200 rectangles according to the number of system candidates and the number of non-dominated weapon system portfolios. Each rectangle indicates whether a system candidate is selected by a portfolio in the Pareto set. If weapon system i is selected by the jth non-dominated system portfolio, the rectangular block in the ith row and jth column is colored black; otherwise, it is left blank.

From Figure 10, it can be seen that some systems are frequently selected in the Pareto set, whereas others are seldom or never selected. To compare the importance of different systems, the number of times each system is selected in the Pareto set is counted. Systems S6, S9, S15, S16, S22, S25, S32, S39, S50, S52, S65, S69, S70, S71, S72, S73, S75, S79, S83, S90, S98, and S99 are selected by at least one system portfolio in the Pareto set. In addition, systems S9, S25, and S50 are particularly important, given how often they are selected in the Pareto set. Further, the ranking of systems by selection count is: S25 > S9 > S50 > S39 > S69 > S75 > S83 > S22 > S71 > S32 > S73 > S16 > S98 > S72 > S15 > S6 > S65 > S99 > S70 > S79 > S52 > S90, which to some extent indicates the importance of the selected systems. The remaining systems are never selected in the Pareto set and can be neglected in the system portfolio selection process.
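The counting behind this ranking can be sketched as follows, treating each portfolio in the Pareto set as a 0/1 vector over the candidate systems. The four-system, three-portfolio Pareto set is a toy example rather than the paper's data, and `selection_frequency` is an illustrative name.

```python
from collections import Counter


def selection_frequency(pareto_set):
    """Count in how many portfolios each system is selected.

    Each portfolio is a 0/1 sequence; position i holds 1 if system S(i+1)
    is part of the portfolio.
    """
    counts = Counter()
    for portfolio in pareto_set:
        for index, selected in enumerate(portfolio, start=1):
            if selected:
                counts[f"S{index}"] += 1
    return counts


# Toy Pareto set: 4 candidate systems, 3 non-dominated portfolios.
toy = [(1, 0, 1, 0), (1, 0, 0, 0), (1, 0, 1, 0)]
freq = selection_frequency(toy)

# Systems ordered from most to least frequently selected.
ranking = [name for name, _ in freq.most_common()]
# Systems that appear in no portfolio of the Pareto set.
never_selected = [f"S{i}" for i in range(1, 5) if f"S{i}" not in freq]
print(ranking, never_selected)
```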

Deeper analysis reveals that some systems tend to be selected together. Therefore, the Apriori frequent item set mining algorithm is applied to identify association rules, shown in Table 6. The support of a rule is the fraction of all portfolios in the Pareto set in which its items appear together, that is, the probability of their appearing simultaneously. The confidence of the rule “A $\to$ B” is the ratio $support(A \cup B) / support(A)$, that is, the probability that $B$ also appears when $A$ appears.
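The two measures can be computed directly from their definitions. In the sketch below the 200 portfolios are synthetic and reduced to systems S9 and S25; the counts (123 portfolios containing both, 141 containing S9, 162 containing S25) are back-calculated from the support and confidence values in Table 6 and are not reported in the paper. The function names are illustrative, and the Apriori search itself is not shown.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """support(A and B together) / support(A) for the rule A -> B."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Synthetic Pareto set of 200 portfolios restricted to S9 and S25.
transactions = (
    [frozenset({"S9", "S25"})] * 123   # both systems selected
    + [frozenset({"S9"})] * 18         # only S9 selected
    + [frozenset({"S25"})] * 39        # only S25 selected
    + [frozenset()] * 20               # neither selected
)

print(round(support(transactions, {"S9", "S25"}), 3))
print(round(confidence(transactions, {"S9"}, {"S25"}), 4))
print(round(confidence(transactions, {"S25"}, {"S9"}), 4))
```

Note that support is symmetric in A and B while confidence is not, which is why “9 $\to$ 25” and “25 $\to$ 9” share a support of 0.615 in Table 6 but have different confidences.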

In Table 6, the association rules are ranked by support and by confidence, respectively. First, according to the ranking by support, “S9 $\to$ S25” is the most frequent rule, which means that the two systems tend to be selected together. In addition, whenever system S75 is selected, system S25 is also selected, according to the first rule in the ranking by confidence. By referring to the association rules, decision-makers can gain a deeper understanding of the system portfolios.

4. Discussion

The paper shows the feasibility of replacing subjective expert judgment with knowledge obtained from data. First, the weights of the capabilities are determined by analyzing the correlations between the capabilities and the number of intercepted missiles, based on operation simulation data. Then, for the risk criterion, the paper determines the risk by mining information from system operation data. The data-driven methods are only components of the model-based approach, aiming to increase the accuracy and credibility of the results.

In the case study, 100 system candidates are optimized in an anti-missile scenario. Automatically simulating the operation scenario 10,000 times generates 10,000 simulation results, from which the maximal information coefficients between the four capabilities and the intercepted missile quantity are calculated and used as the capability weights. The result indicates quantitatively that decision time has the biggest impact on the intercepted missile quantity. In addition, running the C-MAPSS simulator 200 times generates 200 groups of system operation data, from which the system risks are obtained through RUL prediction.

In the system portfolio optimization, considering the huge number of $2^{100} - 1$ candidate system portfolios, the NSDE algorithm is applied to solve the optimization problem. To improve the quality of the result as far as possible, 10 Pareto sets are obtained by running NSDE 10 times, and 200 non-dominated individuals are retained by comparing the 10 Pareto sets. However, this set cannot be proved to be the true Pareto set, because of the enormous number of candidate system portfolios and the randomness of genetic algorithms. Further analysis of the generated Pareto set shows that 22 systems are selected at least once, and 16 association rules are mined. These characteristics can help decision-makers gain a deeper understanding of the system portfolios.

In conclusion, system portfolio selection is a mainstream trend of future equipment development. Compared with traditional system portfolio decision and optimization methods, the proposed model and data-driven approach avoids excessive dependence on subjective expert experience in the evaluation and decision process. Traditionally, determining these parameters requires cumbersome processes of organizing experts, collecting expert opinions, analyzing expert scores, etc., which are time-consuming, laborious, and more likely to be questioned. The model and data-driven approach, on the one hand, makes use of models that have been proved effective and, on the other hand, determines the required parameter values in the models through data analysis. Therefore, it supports more efficient, more credible, and more practical evaluation and decisions in system portfolio selection and other application fields.

Author Contributions

H.W. wrote the main parts of the paper; B.X. and Z.Z. designed and performed the simulations in the case study; Z.Y. and Z.Z. analyzed the result.

Funding

The work was funded by the National Natural Science Foundation of China, Grant 71690233 and 71571186.

Acknowledgments

We thank the C-MAPSS developers and the National Aeronautics and Space Administration (NASA) for providing the engine simulation data inputs.

Conflicts of Interest

The authors declare no conflict of interest.


Figure 1. Division for variable X and Y.

Figure 2. The gap function curve: (a) Benefit type capability, (b) cost type capability.

Figure 3. RUL prediction procedures.

Figure 4. Histogram of generated capability data.

Figure 5. The interface of the simulation platform.

Figure 6. The functional modules and their connection.

Figure 7. Scatter plots of the time series in terms of the different monitoring signals.

Figure 8. Results: (a) results of the 10 runs; and (b) the 200 best individuals.

Figure 9. System option diagram.

Figure 10. Selection frequency of each system in the Pareto set.

Table 1. Capabilities requirements.

Capabilities	c1 (km)	c2 (km)	c3 (km)	c4 (s)
CR	[100,200]	[100,250]	[150,300]	[10,30]
Rules	Maximal	Addition	Addition	Average

Table 2. Obtained maximal information coefficient (MIC) of input and output.

MIC	Intercepted Missile Number
Detection range	0.854
Communication range	0.772
Striking range	0.537
Decision time	0.928

Table 3. Capabilities of 5 systems.

Systems	Detection Range	Communication Range	Striking Range	Decision Time
S1	128.8502018	45.00221559	41.93626171	22.55909356
S2	205.1604693	36.35948504	50.01541323	16.53053513
S3	156.9600287	27.11329846	43.28357399	16.46184426
S4	152.7120943	26.19071935	53.57115127	21.31900886
S5	111.5170147	15.40553804	29.95161145	15.33123079
SP1	205.1604693	150.0713	218.758	18.44034

Table 4. Variation coefficients (VC) of the remaining 12 variables.

Variable	Sensor 2	Sensor 3	Sensor 4	Sensor 7	Sensor 8	Sensor 11	Sensor 12	Sensor 13	Sensor 15	Sensor 17	Sensor 20	Sensor 21
VC	0.0567	0.2760	0.5110	0.1239	0.0020	0.4573	0.1117	0.0020	0.3408	0.2907	0.3519	0.3514

Table 5. The prediction results of the remaining life of different weapon systems.

S	RUL	Risk	S	RUL	Risk	S	RUL	Risk	S	RUL	Risk
S1	112	0.008929	S26	197	0.005076	S51	114	0.008772	S76	76	0.013158
S2	104	0.009615	S27	162	0.006173	S52	72	0.013889	S77	78	0.012821
S3	97	0.010309	S28	82	0.012195	S53	78	0.012821	S78	199	0.005025
S4	92	0.01087	S29	75	0.013333	S54	187	0.005348	S79	167	0.005988
S5	133	0.007519	S30	83	0.012048	S55	189	0.005291	S80	109	0.009174
S6	105	0.009524	S31	75	0.013333	S56	75	0.013333	S81	75	0.013333
S7	81	0.012346	S32	150	0.006667	S57	76	0.013158	S82	75	0.013333
S8	61	0.016393	S33	121	0.008264	S58	75	0.013333	S83	202	0.00495
S9	137	0.007299	S34	75	0.013333	S59	190	0.005263	S84	74	0.013514
S10	66	0.015152	S35	74	0.013514	S60	87	0.011494	S85	162	0.006173
S11	107	0.009346	S36	75	0.013333	S61	72	0.013889	S86	156	0.00641
S12	107	0.009346	S37	99	0.010101	S62	68	0.014706	S87	203	0.004926
S13	79	0.012658	S38	76	0.013158	S63	67	0.014925	S88	126	0.007937
S14	136	0.007353	S39	208	0.004808	S64	74	0.013514	S89	79	0.012658
S15	197	0.005076	S40	74	0.013514	S65	200	0.005	S90	79	0.012658
S16	188	0.005319	S41	94	0.010638	S66	74	0.013514	S91	72	0.013889
S17	79	0.012658	S42	73	0.013699	S67	200	0.005	S92	75	0.013333
S18	101	0.009901	S43	79	0.012658	S68	75	0.013333	S93	69	0.014493
S19	171	0.005848	S44	185	0.005405	S69	197	0.005076	S94	79	0.012658
S20	75	0.013333	S45	78	0.012821	S70	87	0.011494	S95	187	0.005348
S21	156	0.00641	S46	82	0.012195	S71	202	0.00495	S96	189	0.005291
S22	182	0.005495	S47	119	0.008403	S72	130	0.007692	S97	176	0.005682
S23	182	0.005495	S48	163	0.006135	S73	190	0.005263	S98	121	0.008264
S24	75	0.013333	S49	28	0.035714	S74	117	0.008547	S99	189	0.005291
S25	203	0.004926	S50	195	0.005128	S75	195	0.005128	S100	73	0.013699

Table 6. The association rules mined by frequent item set mining.

Ranking by Support			Ranking by Confidence
Association Rules	Support	Confidence	Association Rules	Support	Confidence
9 $\to$ 25	0.615	0.8723	75 $\to$ 25	0.255	1.0000
25 $\to$ 9	0.615	0.7593	[9,39] $\to$ 25	0.28	0.9032
50 $\to$ 25	0.57	0.8906	[9,50] $\to$ 25	0.465	0.9029
25 $\to$ 50	0.57	0.7037	50 $\to$ 25	0.57	0.8906
50 $\to$ 9	0.515	0.8047	9 $\to$ 25	0.615	0.8723
9 $\to$ 50	0.515	0.7305	[25,50] $\to$ 9	0.465	0.8158
[9,50] $\to$ 25	0.465	0.9029	50 $\to$ 9	0.515	0.8047
[25,50] $\to$ 9	0.465	0.8158	[25,39] $\to$ 9	0.28	0.7887
[9,25] $\to$ 50	0.465	0.7561	25 $\to$ 9	0.615	0.7593
50 $\to$ [9,25]	0.465	0.7266	[9,25] $\to$ 50	0.465	0.7561
9 $\to$ [25,50]	0.465	0.6596	69 $\to$ 50	0.25	0.7353
[9,39] $\to$ 25	0.28	0.9032	9 $\to$ 50	0.515	0.7305
[25,39] $\to$ 9	0.28	0.7887	50 $\to$ [9,25]	0.465	0.7266
39 $\to$ [9,25]	0.28	0.6222	25 $\to$ 50	0.57	0.7037
75 $\to$ 25	0.255	1.0000	9 $\to$ [25,50]	0.465	0.6596
69 $\to$ 50	0.25	0.7353	39 $\to$ [9,25]	0.28	0.6222

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
