Copper Price Prediction Using Support Vector Regression Technique

Astudillo, Gabriel; Carrasco, Raúl; Fernández-Campusano, Christian; Chacón, Máx

doi:10.3390/app10196648


¹ Escuela de Ingeniería Informática, Universidad de Valparaíso, Valparaíso 2362905, Chile

² Departamento de Ingeniería Informática, Universidad de Santiago de Chile, Santiago 9170124, Chile

³ Facultad de Ingeniería, Ciencia y Tecnología, Universidad Bernardo O’Higgins, Santiago 8370993, Chile

⁴ Departamento de Ingenierías Multidisciplinares, Universidad de Santiago de Chile, Santiago 9170124, Chile

⁵ Department of Architecture and Computer Technology, University of the Basque Country UPV/EHU, 20018 Donostia-San Sebastián, Spain

* Authors to whom correspondence should be addressed.

Appl. Sci. 2020, 10(19), 6648; https://doi.org/10.3390/app10196648

Submission received: 3 August 2020 / Revised: 6 September 2020 / Accepted: 19 September 2020 / Published: 23 September 2020

(This article belongs to the Section Computing and Artificial Intelligence)


Abstract

Predicting copper price is essential for making decisions that can affect companies and governments dependent on the copper mining industry. Copper prices follow a time series that is nonlinear and non-stationary, and that has periods that change as a result of potential growth, cyclical fluctuation and errors. Sometimes, the trend and cyclical components together are referred to as a trend-cycle. In order to make predictions, it is necessary to consider the different characteristics of a trend-cycle. In this paper, we study a copper price prediction method using support vector regression (SVR). This work explores the potential of the SVR with external recurrences to make predictions at 5, 10, 15, 20 and 30 days into the future in the copper closing price at the London Metal Exchange. The best model for each forecast interval is selected using a grid search and balanced cross-validation. In experiments on real data sets, our results indicate that the parameters ($C$, $\varepsilon$, $\gamma$) of the support vector regression model do not differ between the different prediction intervals. Additionally, the number of preceding values used to make the estimates does not vary according to the predicted interval. Results show that the support vector regression model has a lower prediction error and is more robust. Our results show that the presented model is able to predict copper price volatilities near reality, as the root-mean-square error (RMSE) was equal to or less than 2.2% for prediction periods of 5 and 10 days.

Keywords: copper price; prediction; support vector regression

1. Introduction

Copper is one of the first metal products to be listed on the world’s main foreign exchange markets: The London Metal Exchange (LME), Commodity Exchange Market of New York (COMEX) and Shanghai Futures Exchange (SHFE). Copper price is determined by the supply and demand dynamics on the metal exchanges, especially the London Metal Exchange. Although it may be strongly influenced by the currency exchange rate and the investment flow, the factors that can cause fluctuations in volatile prices are partially associated with changes in the activity of the economic cycle [1].

There are many reasons for wanting to predict the price of copper. Like other natural elements (e.g., silver), copper has high electrical and thermal conductivity; in addition, it is cheaper than silver and more resistant to corrosion. Therefore, copper is the preferred metal for electrical and electronic applications, both domestically and for more general industrial uses. Given the importance of the construction and telecommunications sectors in a modern economy, changes in copper price can be perceived as an early indicator of global economic performance and have a significant impact on the performance of related companies [2].

With regard to the copper market, any variation in its demand translates entirely into price fluctuations. Market participants see it as an early sign of changes in global production. Thus, it affects mining companies in their investment plans, traders, investors, agents involved in the copper mining business and governments dependent on the copper mining industry. To illustrate this, we can consider the particular case of Chile. As the world’s leading copper producer and exporter, Chile produced an estimated 5.6 million metric tons of copper in 2019 [3]. The Chilean government made copper the main point of reference for the country’s structural budget rule introduced in 2000, trying to reduce the exposure of fluctuations in Chile’s gross domestic product (GDP) to the oscillation of the price of copper [4].

Several current studies include copper (and other metals) among the products of interest when evaluating prediction methods to improve price forecasts, such as [5,6,7,8,9]. Our study applies a copper price prediction technique using support vector regression (SVR) [10,11,12]. This work explores the potential of the SVR with external recurrences to make predictions at 5, 10, 15, 20 and 30 days into the future in the copper closing price at the London Metal Exchange. The best model for each forecast interval is selected using a grid search and balanced cross-validation.

The paper is organized as follows: Section 2 presents the related works. Section 3 presents the proposed model. The data used and the methodology are presented in Section 4 and Section 5, respectively. Section 6 shows the results and analysis. Section 7 shows the discussion. Finally, Section 8 presents the conclusions.

2. Related Works

In the general case of the financial time series, the support vector machine (SVM) and SVR methods are widely used for making forecasts [13]. Basically, when the SVM method extends to nonlinear regression problems, it is called SVR [10,11,12]. The SVR method belonging to the field of data statistics was first proposed by Vapnik et al. [10] at the end of the twentieth century. A characteristic of this method is that it solves the problems of “high dimensionality” and “overlearning” to a certain degree. Furthermore, it achieves a significant effect in solving the problem of small samples. Consequently, support vector regression is used to solve the prediction problems of nonlinear data in engineering areas.

Several works are relevant here. Kim [14] presents daily forecasts (using SVR) of the trend of change in the Korean Composite Stock Price Index (KOSPI), using 2928 days of data between January 1989 and December 1998. In another work, Kao et al. [15] use SVR to predict the stock index of the São Paulo State Stock Exchange (BOVESPA), the Shanghai Stock Exchange Composite (SSEC) and the Dow Jones, using data from April 2006 to April 2010. Similarly, Kazem et al. [16] predict the market prices of Microsoft, Intel and the National Bank shares using a set of data from November 2007 to November 2011. In these works, the prediction is always one day ahead and depends on a certain amount of past data ($l$), i.e., if $\hat{p}_{t+1}$ is the predicted price at $t + 1$, then $\hat{p}_{t+1} = f(p_{t}, p_{t-1}, \dots, p_{t-l})$.

Additionally, we must consider the work of Patel et al. [17], which proposes to make predictions 10, 15 and 30 days in advance using a two-stage system based on SVR, neural networks and random forests, which are trained with nine technical indexes. Among these indicators are the stochastic index %K, which compares the closing price at a particular time with the price range during a given period, and %D, which is a moving average of %K. In addition, it uses the relative strength index (RSI), which indicates the price change rate and the average data rate, among others. The authors use historical data from January 2003 to December 2012 of the CNX Nifty and S&P BSE Sensex stock indexes.

On the other hand, several studies include copper and other metals as products of interest in the evaluations of the prediction to improve the forecasts of price. Such studies employ different methods and mathematical models such as autoregressive integrated moving average (ARIMA) models combined with wavelets [18], meta-heuristics models [19,20], neural networks models [2,21] and hybrid models [5,6,7,8,9]. The Fourier transform [22] is used to analyze the variability of the prices of various metals. In addition, there are works in the literature that study the relationships of commodity and asset price models, such as the case of oil prices and their effects on copper and silver prices [23].

3. Support Vector Regression Model

Given a data set of $N$ elements $\{(X_{i}, y_{i})\}_{i=1}^{N}$, where $X_{i}$ is the $i$-th element in a space of $n$ dimensions, $X_{i} = [x_{1,i}, \dots, x_{n,i}] \in \mathbb{R}^{n}$, and $y_{i} \in \mathbb{R}$ is the actual value for $X_{i}$, a nonlinear function is defined as $\phi : \mathbb{R}^{n} \to \mathbb{R}^{n_{h}}$. This function maps the input data $X_{i}$ into a high-dimensional space $\mathbb{R}^{n_{h}}$, called the feature space, which is determined by the nonlinear transformation $\phi$. In this high-dimensional space, there exists a linear function $f$ that relates the input data $X_{i}$ to the output $y_{i}$. That linear function, the SVR function, is presented in Equation (1),

$$f(X) = W^{T} \cdot \phi(X) + b \tag{1}$$

where $f(X)$ represents the predicted values, $W \in \mathbb{R}^{n_{h}}$ and $b \in \mathbb{R}$. The SVR minimizes the regularized empirical risk shown in Equation (2),

$$R_{reg}(f) = C \sum_{i=1}^{N} \Theta_{\varepsilon}(y_{i} - f(X_{i})) + \frac{1}{2} \lVert W \rVert^{2} \tag{2}$$

where $\Theta_{\varepsilon}(y_{i} - f(X_{i}))$ is a cost function. In the case of the $\varepsilon$-SVR, an $\varepsilon$-insensitive loss function is used [10,24], defined in Equation (3),

$$\Theta_{\varepsilon}(y - f(X)) = \begin{cases} |y - f(X)| - \varepsilon & \text{if } |y - f(X)| \geq \varepsilon \\ 0 & \text{otherwise} \end{cases} \tag{3}$$
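As an illustration of Equation (3), the following minimal Python sketch evaluates the ε-insensitive loss and the cost term of Equation (2). The function names `eps_insensitive_loss` and `empirical_cost` are hypothetical and chosen only for this example; they are not part of the R implementation used in this work.

```python
def eps_insensitive_loss(y, f_x, eps):
    """Equation (3): zero inside the eps-tube, linear outside it."""
    residual = abs(y - f_x)
    return residual - eps if residual >= eps else 0.0


def empirical_cost(ys, preds, eps, C):
    """Cost term of Equation (2): C times the summed eps-insensitive losses."""
    return C * sum(eps_insensitive_loss(y, p, eps) for y, p in zip(ys, preds))


# A prediction inside the tube costs nothing; one outside costs |error| - eps.
inside = eps_insensitive_loss(1.0, 1.05, eps=0.1)
outside = eps_insensitive_loss(2.0, 2.5, eps=0.1)
```

Errors smaller than ε do not contribute to the cost, which is why only a subset of the training points ends up shaping the fitted function.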

$\Theta_{\varepsilon}$ is used in the feature space $\mathbb{R}^{n_{h}}$ to find a function that fits the training data with a deviation less than or equal to $\varepsilon$ (see Figure 1a). Minimizing the training error under the $\varepsilon$-insensitive loss function leads to the problem given by Equation (4) [11,25],

$$\min_{W, b, \xi^{*}, \xi} R_{reg}(W, \xi^{*}, \xi) = \frac{1}{2} W^{T} W + C \sum_{i=1}^{N} (\xi_{i}^{*} + \xi_{i}) \tag{4}$$

subject to the restrictions (for all $i = 1, \dots, N$):

$$\begin{aligned} y_{i} - W^{T} \phi(X_{i}) - b &\leq \varepsilon + \xi_{i}^{*} \\ -y_{i} + W^{T} \phi(X_{i}) + b &\leq \varepsilon + \xi_{i} \\ \xi_{i}^{*} &\geq 0 \\ \xi_{i} &\geq 0 \end{aligned} \tag{5}$$

Equation (4) penalizes the training errors between $f(X)$ and $y$ through the $\varepsilon$-insensitive function (Figure 1b). The parameter $C$ determines the trade-off between the complexity of the model, expressed by the vector $W$, and the points that fulfill the condition $|f(X) - y| \geq \varepsilon$ in Equation (3). If $C \to \infty$, the model has a small margin and fits the training data closely. If $C \to 0$, the model has a large margin and is therefore smoother. Finally, $\xi_{i}^{*}$ represents the training errors greater than $\varepsilon$ and $\xi_{i}$ the errors less than $-\varepsilon$ (see Figure 1a).

To solve this regression problem, we can replace the inner product of Equation (1) with a kernel function $K(\cdot, \cdot)$. This makes it possible to perform the operation in a higher-dimensional space using the low-dimensional input data, without knowing the transformation $\phi$ [26], as shown in Equation (6). This is called the kernel trick.

$$f(X) = \sum_{i=1}^{N} (\beta_{i}^{*} - \beta_{i}) \cdot K(X_{i}, X) + b \tag{6}$$

The parameters $\beta_{i}^{*}$ and $\beta_{i}$ are the Lagrange multipliers associated with the quadratic optimization problem. Several types of functions can be used as the kernel [27], but in this work, we use the Gaussian radial basis function (RBF) [28]:

$$K(X_{i}, X) = \exp(-\gamma \lVert X_{i} - X \rVert^{2}) \tag{7}$$
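Equations (6) and (7) can be sketched in a few lines of Python. This is only an illustrative evaluation of an already trained model: the names `rbf_kernel` and `svr_predict` are hypothetical, and the coefficients passed in (standing for the differences of the Lagrange multipliers) are assumed to come from a solver that is not shown here.

```python
import math


def rbf_kernel(x_i, x, gamma):
    """Equation (7): Gaussian RBF kernel between two vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x_i, x))
    return math.exp(-gamma * sq_dist)


def svr_predict(x, support_vectors, dual_coefs, b, gamma):
    """Equation (6): dual_coefs[i] plays the role of the multiplier difference."""
    weighted = sum(
        coef * rbf_kernel(sv, x, gamma)
        for sv, coef in zip(support_vectors, dual_coefs)
    )
    return weighted + b


# Toy values chosen by hand, not taken from the fitted copper model.
value = svr_predict([0.0], [[0.0], [1.0]], [2.0, -1.0], b=0.5, gamma=1.0)
```

The prediction never requires the mapping into the feature space explicitly; only kernel evaluations between the input and the stored training points are needed.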

The parameter $\gamma$ of the kernel function, the regularization constant $C$ and the $\varepsilon$ of the loss function are the design parameters of the SVR. Furthermore, they are obtained from a data set that is different from the training data.

4. Data Description

There are three major exchanges where copper is traded: LME, COMEX and SHFE. Similar to [6,7], we use the price of copper given by LME, which is widely considered as a reference index for world prices of this metal [29]. The time series used in this research has 2971 daily observations of copper prices in US dollars per metric ton from 2 January 2006 until 2 January 2018, as shown in Figure 2. (These data were obtained through a trial subscription on www.lme.com in January 2018. Currently, a paid membership is required to get updated data.) Similar time ranges of copper price have been used in [6,7,20].

5. Methodology

The methodology used in this research contains four steps. Step 1 is focused on preparing the data. Steps 2 and 3 explain the training and prediction stages. Finally, step 4 details the performance measures used.

5.1. Step 1: Data Pre-Processing

First, the data are normalized to the range from 0 to 1 with the min–max normalization (MMN) method [30]. Then, a suitable range has to be determined for the SVR hyperparameter $\varepsilon$, which sets the tolerance margin within which training errors are not penalized. For this, it is necessary to know the level of noise $N$ of the time series. The noise has a mean $\bar{N} = 3.66 \times 10^{-4}$, a root-mean-square value $RMS_{N} = 0.0680$ and a range of $[-0.246, 0.205]$. These characteristics allow us to define a conservative interval $\varepsilon \in [0, 0.3]$.

Finally, the series of Figure 2 is divided into two series, $S_{a}$ and $S_{b}$, each one with 50% of the data, which are used alternately for training and evaluation.
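The pre-processing step can be sketched as follows. This is a minimal illustration in Python, not the R code used in this work; the function names are hypothetical, and the split is assumed to be a contiguous chronological cut at the midpoint, with the older half playing the role of $S_{a}$.

```python
def min_max_normalize(series):
    """Scale a series linearly so that its minimum maps to 0 and its maximum to 1."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("a constant series cannot be min-max scaled")
    return [(v - lo) / (hi - lo) for v in series]


def split_halves(series):
    """Assumed chronological split: first half (older) and second half (newer)."""
    mid = len(series) // 2
    return series[:mid], series[mid:]


# Toy prices, not LME data.
prices = [2.0, 4.0, 6.0, 3.0]
scaled = min_max_normalize(prices)
s_a, s_b = split_halves(scaled)
```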

5.2. Step 2: Parameters Adjust and Training

In the case of time series prediction with SVR, it is assumed that the actual value $y_{t}$ is a function of its previous $L$ values $\vec{x}_{t} = [y_{t-1}, \dots, y_{t-L}]$ and of the SVR hyperparameters $\vec{w} = [C, \varepsilon, \gamma]$. Hence, the model has four parameters: $L$, the number of prior values used to predict the actual value, and the three hyperparameters of the SVR. The range of each one is shown in Table 1.
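To make the role of $L$ concrete, the sketch below builds the input vectors and targets from a series. It is a minimal Python illustration; `make_lagged_dataset` is a hypothetical helper name, not a function of the libraries used in this work.

```python
def make_lagged_dataset(series, L):
    """Return (X, y) where each row of X is [y_{t-1}, ..., y_{t-L}] and y holds y_t."""
    X, y = [], []
    for t in range(L, len(series)):
        X.append([series[t - k] for k in range(1, L + 1)])
        y.append(series[t])
    return X, y


# With L = 3, the first usable target is the fourth observation.
X, y = make_lagged_dataset([1, 2, 3, 4, 5], L=3)
```

Each row lists the most recent value first, matching the ordering of the vector defined above.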

The $i$-th SVR model ($M_{i}$) is defined by the set of parameters $Q_{i} = \{L_{i}, C_{i}, \varepsilon_{i}, \gamma_{i}\}$. To adjust these parameters, a grid search is performed, following the recommendation of Hsu et al. [31]; its computational design is shown in Algorithm 1. For all combinations of parameters $Q_{i}$, the model $M_{i}$ is trained with the series $S_{a}$ and tested with the series $S_{b}$, and vice versa. Then, for each training and testing, the set of parameters $Q_{i}$ with the least mean squared error (MSE) between the predicted and the original data is selected. This training/testing process is carried out using the balanced cross-validation method proposed by McCarthy [32].

Algorithm 1: Algorithm design for grid search to find the best testing models.
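The grid search with the two halves swapped can be sketched in Python as follows. This is an illustration under stated assumptions, not a reproduction of Algorithm 1: the model `fit_blend` is a deliberately simple stand-in (not an SVR) so that the example runs with the standard library only, the grid values are invented, and models are scored on one-step-ahead predictions for brevity.

```python
from itertools import product


def lagged(series, L):
    """Rows [y_{t-1}, ..., y_{t-L}] paired with targets y_t."""
    X = [[series[t - k] for k in range(1, L + 1)] for t in range(L, len(series))]
    y = [series[t] for t in range(L, len(series))]
    return X, y


def mse(actual, predicted):
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def fit_blend(X, y, alpha):
    """Stand-in model, NOT an SVR: blends the latest value with the training mean."""
    mean_y = sum(y) / len(y)
    return lambda x: alpha * x[0] + (1.0 - alpha) * mean_y


def grid_search(train, test, grid, fit):
    """Return (best_params, best_mse) over every combination in `grid`."""
    names = list(grid)
    best_params, best_err = None, float("inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        # L shapes the data set; the remaining entries go to the model.
        hyper = {k: v for k, v in params.items() if k != "L"}
        X_tr, y_tr = lagged(train, params["L"])
        X_te, y_te = lagged(test, params["L"])
        model = fit(X_tr, y_tr, **hyper)
        err = mse(y_te, [model(x) for x in X_te])
        if err < best_err:
            best_params, best_err = params, err
    return best_params, best_err


# Toy series standing in for the two halves S_a and S_b.
s_a = [k / 10 for k in range(20)]
s_b = [k / 10 for k in range(20, 40)]
grid = {"L": [1, 2], "alpha": [0.0, 0.5, 1.0]}

best_ab = grid_search(s_a, s_b, grid, fit_blend)  # train on S_a, test on S_b
best_ba = grid_search(s_b, s_a, grid, fit_blend)  # and vice versa
```

Replacing `fit_blend` with a function that trains an SVR for given values of $C$, $\varepsilon$ and $\gamma$ would turn the same loop into a search over the four parameters described above.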

5.3. Step 3: Prediction

To make the prediction at $p_{j}$ days, $M_{i}$ (with its best set of parameters $Q_{i}$) takes a vector of $L_{i}$ past values, which includes previously predicted values where applicable. Let $\hat{y}_{t+p_{j}}$ be the value to predict $p_{j}$ days ahead and $\vec{x}_{t+p_{j}}$ the vector that contains the previous $L_{i}$ values used in the prediction. Then, $\hat{y}_{t+p_{j}} = M_{i}(\vec{x}_{t+p_{j}})$, with $\vec{x}_{t+p_{j}}$ given by Equation (8).

$$\vec{x}_{t+p_{j}} = [\underbrace{\hat{y}_{t+(p_{j}-1)}, \hat{y}_{t+(p_{j}-2)}, \dots, \hat{y}_{t+1}}_{p_{j} > 1};\ \underbrace{y_{t}, y_{t-1}, \dots, y_{t-(L_{i}-p_{j})}}_{L_{i} \geq p_{j}}] \tag{8}$$
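The recursion of Equation (8) can be sketched as a sliding window in which each new prediction displaces the oldest value. The Python function below is a minimal illustration; `recursive_forecast` is a hypothetical name, and `model` is assumed to be any callable that maps a vector of $L$ values (newest first) to a one-step prediction.

```python
def recursive_forecast(model, history, L, p):
    """Predict p steps ahead, feeding earlier predictions back as inputs."""
    if p < 1:
        raise ValueError("p must be at least 1")
    if len(history) < L:
        raise ValueError("need at least L observed values")
    # Newest first: [y_t, y_{t-1}, ..., y_{t-L+1}]
    window = list(history[-L:][::-1])
    prediction = None
    for _ in range(p):
        prediction = model(window)
        # The new prediction enters at the front; the oldest value drops out.
        window = [prediction] + window[:-1]
    return prediction


# Stand-in one-step model (not an SVR): the mean of the window.
mean_model = lambda w: sum(w) / len(w)
two_ahead = recursive_forecast(mean_model, [1.0, 2.0, 3.0], L=3, p=2)
```

When $p$ exceeds $L$, the window consists only of predicted values, which corresponds to the case in which the second group of terms in Equation (8) is empty.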

5.4. Step 4: Performance Measures

For each prediction interval, the effectiveness of the prediction model will be determined through performance measures such as MSE and RMSE. These performance measures have been used in previous prediction work [33,34,35]. Furthermore, the correlation coefficient between the predicted value and the actual value will be used. The computational design of step 3 and step 4 is shown in Algorithm 2.

Algorithm 2: Algorithm design for the prediction and performance measures steps.
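The performance measures can be computed as in the following Python sketch. It is given only as an illustration with hypothetical function names; the Pearson coefficient is shown, and a Spearman coefficient would apply the same formula to the ranks of the values.

```python
import math


def mse(actual, predicted):
    """Mean squared error between two equally long sequences."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root-mean-square error: the square root of the MSE."""
    return math.sqrt(mse(actual, predicted))


def pearson(a, b):
    """Pearson correlation coefficient between two sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


# Toy values, not results from Table 2.
actual = [1.0, 2.0, 3.0]
predicted = [1.0, 2.0, 5.0]
error = rmse(actual, predicted)
```

Because the series is normalized to the range from 0 to 1, an RMSE computed this way can be read directly as a fraction of the price range.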

For the implementation of the training and adjustment system, R, version 3.4.4 was used with the library e1071 [36] for the basic training functions and the library doParallel [37] for parallelizing the search of parameters.

Shapiro–Wilk normality tests will be used to (1) select the correlation coefficient (Pearson or Spearman) between the predicted values and the real values of the time series and (2) select the method of comparison of the means (Wilcoxon rank-sum test or two-sample t-test) of the errors for the different prediction time horizons. For all tests, a p-value < 0.05 is considered significant.

6. Results and Analysis

In the experiments, the models were explored over a grid, choosing the best one (lowest MSE) for each prediction interval $p$. Table 2 shows the parameters of the best SVR, according to the MSE, for each prediction interval $p$ and each training and test set. Additionally, the correlation index $\rho$ between the real data and the predicted data and the root-mean-square error (RMSE) are shown. In Table 2, $R \overset{p}{\to} T$ means that the SVR is trained on the set $R \in \{S_{a}, S_{b}\}$ and tested on the set $T \in \{S_{a}, S_{b}\}$, with $R \neq T$, where $p$ is the prediction interval, with $p \in \{5, 10, 15, 20, 25, 30\}$.

It is interesting to note that the number of previous values ($L$) is independent of the prediction interval, and that the parameters of the SVR remain practically unchanged. The fit of the five-day prediction of the $S_{a} \to S_{b}$ time series for the 2017 period is shown in Figure 3.

The best prediction capacity is obtained when training with the series $S_{a}$, which is chronologically the oldest. The training could have been enhanced by the level of noise of this series. The RMS value of the noise of series $S_{a}$ is $RMS_{N_{a}} = 0.0848$, which is higher than that of the series $S_{b}$, $RMS_{N_{b}} = 0.0318$. For example, for a five-day prediction, training with $S_{a}$ gives $MSE = 0.0003$, whereas training with $S_{b}$ gives $MSE = 0.0012$. Furthermore, the dispersion of the MSE is smaller than when training on the most recent part of the series (see Figure 4a).

In Figure 4a, we show the distribution of the MSE obtained in the predictions summarized in Table 2, which allows us to evaluate the prediction capabilities statistically. In addition, Figure 4b shows the confidence intervals for the mean MSE of the sample obtained from the simulation, built at a 5% significance level, that is, at 95% confidence.

Table 3 presents the differences of means of the MSE between groups in the lower triangle, and the symbols indicating significance in the upper triangle. The hypothesis test of mean differences for the MSE is applied to each pair of groups obtained in the simulation for the different prediction time intervals and the two time series $S_{a}$ and $S_{b}$. Furthermore, the average difference of the MSE can be visually appreciated according to the 95% confidence interval (see Figure 4b). For the comparison of means of the MSE between groups, the Wilcoxon rank-sum test was used because, according to the Shapiro–Wilk test, the groups do not fit a normal distribution, with p-value $\leq 1.73 \times 10^{-6}$ for all groups.

7. Discussion

As a result of the search for training parameters for the SVR models and the search for the best prediction models for different forecasting intervals, it was determined that the number of previous values of the time series that are needed ($L = 3$ for the best models) is independent of the prediction interval. Similar values have been used in the literature on similar time series, but without justification based on a parameter search that minimizes the prediction error. For example, a value of $L = 2$ is used in [8,9], $L = 3$ in [5] and $L = 5$ in [15].

Our results show that the presented model is able to predict copper price volatilities near reality (see Figure 3). Similar results are obtained compared to other works, such as:

In [9], an analysis of the dynamics of real prices for the main industrial metals is presented. Using monthly data, the authors estimated linear and threshold autoregressive models. For the nonlinear models, they assumed that the dynamics of metal prices depend on their deviation from the recursive mean. We use a monthly prediction (30 days) to compare the best RMSE value obtained in both works. Our RMSE ($0.033$) is similar to the RMSE ($0.046$) obtained in that work.
In [8], the authors use time series models to predict the prices of Shanghai copper futures. This work introduces the application of X12-ARIMA-GARCH family models in futures price analysis and forecasting. To compare their results with ours, we use a short prediction period (5 and 10 days) and compare the RMSE obtained. In the same period, our RMSE values ($0.017$ and $0.018$) are similar to the RMSE values ($0.018$ and $0.022$) in that work.
In [5], a hybrid model is proposed to provide an accurate model for predictions of copper prices. The proposed model combines the adaptive neuro-fuzzy inference system and genetic algorithm. Our work presents an RMSE ($RMSE = 0.033$) similar to that of the GA-ANFIS method ($RMSE = 0.0813$) presented in that work. This is due to the granularity of the training and prediction data. It is interesting to note that in that work, a method based on SVM is shown whose error ($RMSE = 0.1027$) is high compared to our work. The difference is that in our work, a regression was used with an exhaustive search of its parameters.
In [20], a Bat algorithm was used to predict the copper price volatility. The copper price was estimated using time series and Bat algorithms. The time series function used in that work is similar to ours. Under those conditions, the prediction error is $RMSE = 0.132$. With the method proposed in this work, a maximum error of $RMSE = 0.08$ is achieved (see Table 2).

Finally, we can observe in Figure 4a that there are significant differences in the MSE, at 95% confidence, between $S_{a} \to S_{b}$ and $S_{b} \to S_{a}$ in each of the prediction intervals of 5, 10, 15, 20, 25 and 30 days. In contrast, for the prediction $S_{a} \to S_{b}$, there are no significant differences in the MSE between the prediction intervals of 5, 10, 15, 20, 25 and 30 days. This shows the robustness of the prediction in the short and medium term, since the 30-day prediction (considered here as medium term) shows no loss of performance relative to the five-day prediction. This is useful for the decision-making process of mining companies and traders. On the other hand, to support the decision-making process of investors and the government, it is necessary to have reliable long-term forecasts.

8. Conclusions

In this work, we presented the construction of an SVR-based model that predicts the closing price of copper on the London Metal Exchange, with an RMSE equal to or less than 2.2% for prediction periods of 5 and 10 days. The method consists of finding the best model through a grid search, wherein each model is trained and tested using balanced cross-validation. For the training process, only the closing prices of the series are used. The results indicate that the SVR model can be used regardless of the number of days of the prediction, and that this can be done with only three actual values. Additionally, we observed that more current data negatively impact the MSE. This phenomenon must be studied in the future, but there is a signal that it can be explained through the level of noise and the amount of data of the training time series.

The importance of price prediction will depend on the interest of the agents and their objectives in short, medium and long-term prediction periods. Our work aims for short-term predictions of 5 days, 10, ..., up to 30 days. These predictions will interest brokers and investors who seek to take advantage of the periodic variations with active portfolio management. For medium-term predictions, such as monthly and annual predictions, governments may be more interested in their national budget, as is the case in Chile, which is an economy whose income and tax revenues come from copper mining. In the long-term (more than one year), investors and mining companies will be more interested in their long-term investment plans, such as the process of improvement, expansion, a search of new deposits that give value to their investments, or institutional or private investors with long-term investment horizons with buy and hold investment strategies.

For future work, it is necessary to apply the method to other stock index time series, for example, Standard & Poor’s 500 (S&P 500), Dow Jones, National Association of Securities Dealers Automated Quotation (NASDAQ) and BOVESPA. Furthermore, we can apply the method to other commodities like gold, silver, Brent crude oil and corn to determine if it is possible to make forecasts with an error margin similar to the one found in this work.

Author Contributions

Conceptualization, G.A., R.C. and M.C.; formal analysis, G.A.; funding acquisition, M.C.; investigation, G.A. and R.C.; methodology, G.A., R.C., C.F.-C. and M.C.; project administration, M.C.; software, G.A. and R.C.; supervision, M.C.; validation, C.F.-C.; visualization, G.A. and R.C.; writing—original draft, G.A. and R.C.; writing—review and editing, C.F.-C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by Fondecyt project #1181659.

Acknowledgments

We are grateful to the Universidad de Santiago de Chile (USACH) and to Fondecyt project #1181659 for partial funding. Furthermore, the authors thank Paul Soper for the style correction of the language. Finally, the authors thank the referees for their valuable comments, which greatly improved the content and readability of the work.

Conflicts of Interest

The authors declare that there is no conflict of interest in the publication of this paper.

Abbreviations

The following abbreviations are used in this manuscript:

ARIMA	Autoregressive integrated moving average
BOVESPA	São Paulo State Stock Exchange
COMEX	Commodity Exchange Market of New York
GDP	Gross domestic product
KOSPI	Korean Composite Stock Price Index
LME	London Metal Exchange
MMN	Min–max normalization
MSE	Mean squared error
NASDAQ	National Association of Securities Dealers Automated Quotation
RBF	Gaussian radial basis function
RMSE	Root-mean-square error
RSI	Relative strength index
SHFE	Shanghai Futures Exchange
S&P 500	Standard & Poor’s 500
SSEC	Shanghai Stock Exchange Composite
SVM	Support vector machine
SVR	Support vector regression

References

1. Oglend, A.; Asche, F. Cyclical non-stationarity in commodity prices. Empir. Econ. 2016, 51, 1465–1479.
2. Lasheras, F.S.; de Cos Juez, F.J.; Sánchez, A.S.; Krzemień, A.; Fernández, P.R. Forecasting the COMEX copper spot price by means of neural networks and ARIMA models. Resour. Policy 2015, 45, 37–43.
3. Ebert, L.; Menza, T.L. Chile, copper and resource revenue: A holistic approach to assessing commodity dependence. Resour. Policy 2015, 43, 101–111.
4. Spilimbergo, A. Copper and the Chilean Economy, 1960–1998. Policy Reform 2002, 5, 115–126.
5. Alameer, Z.; Abd Elaziz, M.; Ewees, A.A.; Ye, H.; Jianhua, Z. Forecasting copper prices using hybrid adaptive neuro-fuzzy inference system and genetic algorithms. Nat. Resour. Res. 2019, 28, 1385–1401.
6. Hu, Y.; Ni, J.; Wen, L. A hybrid deep learning approach by integrating LSTM-ANN networks with GARCH model for copper price volatility prediction. Phys. A Stat. Mech. Appl. 2020, 557, 124907.
7. García, D.; Kristjanpoller, W. An adaptive forecasting approach for copper price volatility through hybrid and non-hybrid models. Appl. Soft Comput. 2019, 74, 466–478.
8. Wang, L.; Zhang, Z. Research on Shanghai Copper Futures Price Forecast Based on X12-ARIMA-GARCH Family Models. In Proceedings of the 2020 International Conference on Computer Information and Big Data Applications (CIBDA), Guiyang, China, 17–19 April 2020; pp. 304–308.
9. Rubaszek, M.; Karolak, Z.; Kwas, M. Mean-reversion, non-linearities and the dynamics of industrial metal prices. A forecasting perspective. Resour. Policy 2020, 65, 101538.
10. Vapnik, V. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995; p. 188.
11. Drucker, H.; Burges, C.J.; Kaufman, L.; Smola, A.; Vapnik, V. Support vector regression machines. Adv. Neural Inf. Process. Syst. 1997, 9, 155–161.
12. Vapnik, V.; Golowich, S.E.; Smola, A.J. Support Vector Method for Function Approximation, Regression Estimation and Signal Processing. In Advances in Neural Information Processing Systems 9; Mozer, M.C., Jordan, M.I., Petsche, T., Eds.; MIT Press: Cambridge, MA, USA, 1997; pp. 281–287.
13. Jaramillo, J.A.; Velásquez, J.D.; Franco, C.J. Research in Financial Time Series Forecasting with SVM: Contributions from Literature. IEEE Latin Am. Trans. 2017, 15, 145–153.
14. Kim, K.J. Financial time series forecasting using support vector machines. Neurocomputing 2003, 55, 307–319.
15. Kao, L.J.; Chiu, C.C.; Lu, C.J.; Chang, C.H. A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis. Support Syst. 2013, 54, 1228–1244.
16. Kazem, A.; Sharifi, E.; Hussain, F.K.; Saberi, M.; Hussain, O.K. Support vector regression with chaos-based firefly algorithm for stock market price forecasting. Appl. Soft Comput. J. 2013, 13, 947–958.
17. Patel, J.; Shah, S.; Thakkar, P.; Kotecha, K. Predicting stock market index using fusion of machine learning techniques. Expert Syst. Appl. 2015, 42, 2162–2172.
18. Kriechbaumer, T.; Angus, A.; Parsons, D.; Rivas Casado, M. An improved wavelet-ARIMA approach for forecasting metal prices. Resour. Policy 2014, 39, 32–41.
19. Seguel, F.; Carrasco, R.; Adasme, P.; Alfaro, M.; Soto, I. A Meta-heuristic Approach for Copper Price Forecasting. In Information and Knowledge Management in Complex Systems; IFIP Advances in Information and Communication Technology; Liu, K., Nakata, K., Li, W., Galarreta, D., Eds.; Springer International Publishing: Toulouse, France, 2015; Volume 449, pp. 156–165.
20. Dehghani, H.; Bogdanovic, D. Copper price estimation using bat algorithm. Resour. Policy 2018.
21. Carrasco, R.; Vargas, M.; Soto, I.; Fuentealba, D.; Banguera, L.; Fuertes, G. Chaotic time series for copper’s price forecast: Neural networks and the discovery of knowledge for big data. In Digitalisation, Innovation, and Transformation; Liu, K., Nakata, K., Li, W., Baranauskas, C., Eds.; Springer: Cham, Switzerland; London, UK, 2018; Volume 527, pp. 278–288.
22. Khalifa, A.; Miao, H.; Ramchander, S. Return distributions and volatility forecasting in metal futures markets: Evidence from gold, silver, and copper. J. Futures Mark. 2011, 31, 55–80.
23. Fernandez-Perez, A.; Fuertes, A.M.; Miffre, J. Harvesting Commodity Styles: An Integrated Framework. In Proceedings of the INFINITI Conference on International Finance, València, Spain, 11–12 June 2017.
24. Cristianini, N.; Shawe-Taylor, J. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods; Cambridge University Press: Cambridge, UK, 2000.
25. Shokri, S.; Sadeghi, M.T.; Marvast, M.A.; Narasimhan, S. Improvement of the prediction performance of a soft sensor model based on support vector regression for production of ultra-low sulfur diesel. Pet. Sci. 2015, 12, 177–188.
26. Wu, C.H.; Ho, J.M.; Lee, D.T. Travel-time prediction with support vector regression. IEEE Trans. Intell. Transp. Syst. 2004, 5, 276–281.
27. Yeh, C.Y.; Huang, C.W.; Lee, S.J. A multiple-kernel support vector regression approach for stock market price forecasting. Expert Syst. Appl. 2011, 38, 2177–2186.
28. Vert, J.P.; Tsuda, K.; Schölkopf, B. A primer on kernel methods. Kernel Methods Comput. Biol. 2004, 47, 35–70.
29. Watkins, C.; McAleer, M. Econometric modelling of non-ferrous metal prices. J. Econ. Surv. 2004, 18, 651–701.
30. Singh, D.; Singh, B. Investigating the impact of data normalization on classification performance. Appl. Soft Comput. J. 2019.
31. Hsu, C.W.; Chang, C.C.; Lin, C.J. A Practical Guide to Support Vector Classification; Technical Report; National Taiwan University: Taipei, Taiwan, 2003.
32. McCarthy, P.J. The Use of Balanced Half-Sample Replication in Cross-Validation Studies. J. Am. Stat. Assoc. 1976, 71, 596–604.
33. Atsalakis, G.S. Using computational intelligence to forecast carbon prices. Appl. Soft Comput. 2016, 43, 107–116.
Henríquez, J.; Kristjanpoller, W. A combined Independent Component Analysis–Neural Network model for forecasting exchange rate variation. Appl. Soft Comput. 2019, 83, 105654. [Google Scholar] [CrossRef]
Kim, H.Y.; Won, C.H. Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models. Expert Syst. Appl. 2018, 103, 25–37. [Google Scholar] [CrossRef]
Meyer, D.; Dimitriadou, E.; Hornik, K.; Weingessel, A.; Leisch, F.; Chang, C.C.; Lin, C.C. e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien. R Package Version 1.7-3. 2019. Available online: https://CRAN.R-project.org/package=e1071 (accessed on 26 November 2019).
Analytics, R.; Weston, S. doParallel: Foreach Parallel Adaptor for the Parallel Package, R Package Version 1.0.14; 2014. Available online: https://CRAN.R-project.org/package=doParallel (accessed on 2 August 2019).

Figure 1. Schematic diagram: (a) support vector regression (SVR), and (b) SVR using an ε-insensitive function.

Figure 2. Evolution of the time series of the copper closing price at the London Metal Exchange (LME), from January 2006 to January 2018.

Figure 3. Example of a five-day prediction in 2017. The solid line is the original series. The dotted line is the prediction.

Figure 4. MSE. (a) Distribution of MSE for type and interval. (b) 95% family-wise confidence level.

Table 1. Ranges for the grid of the parameters L, C, ε and γ for the radial and linear kernel.

Parameter	Range
L	$1, 2, \dots, 10$
C	$2^{- 8}, 2^{- 7}, \dots, 2^{12}$
$ε$	$0.01, 0.02, \dots, 0.30$
$γ$	$2^{- 8}, 2^{- 7}, \dots, 2^{12}$
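
The grid in Table 1 can be enumerated as in the following minimal Python sketch, which only counts the candidate parameter combinations and does not train any model. The variable and function names are illustrative, and the sketch assumes that the linear kernel simply ignores γ (a linear kernel has no γ parameter); the source does not state how the linear grid was built.

```python
from itertools import product

# Parameter ranges as listed in Table 1.
L_values = list(range(1, 11))                            # 1, 2, ..., 10
C_values = [2.0 ** k for k in range(-8, 13)]             # 2^-8, ..., 2^12
eps_values = [round(0.01 * k, 2) for k in range(1, 31)]  # 0.01, ..., 0.30
gamma_values = [2.0 ** k for k in range(-8, 13)]         # 2^-8, ..., 2^12


def parameter_grid(kernel):
    """Yield every parameter combination for the given kernel.

    Assumption: gamma is searched only for the radial kernel.
    """
    if kernel == "radial":
        return product(L_values, C_values, eps_values, gamma_values)
    if kernel == "linear":
        return product(L_values, C_values, eps_values)
    raise ValueError(f"unknown kernel: {kernel}")


# Count the candidate models per kernel.
n_radial = sum(1 for _ in parameter_grid("radial"))
n_linear = sum(1 for _ in parameter_grid("linear"))
print(n_radial, n_linear)
```

Under these assumptions the radial grid has 10 × 21 × 30 × 21 combinations and the linear grid 10 × 21 × 30, which gives a sense of why the search benefits from parallel execution.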

Table 2. Parameter values of the best model for each prediction interval p, according to the training and test sets.

p	$S_{a} \overset{p}{\to} S_{b}$							$S_{b} \overset{p}{\to} S_{a}$
	L	C	$ε$	$γ$	$ρ$	MSE	RMSE	L	C	$ε$	$γ$	$ρ$	MSE	RMSE
5	3	32	0.11	$2^{- 7}$	0.9582	0.0003130	0.01769181	4	64	0.11	$2^{- 7}$	0.9471	0.0011858	0.03443545
10	3	32	0.11	$2^{- 7}$	0.9297	0.0004816	0.02194539	4	64	0.11	$2^{- 7}$	0.9130	0.0019877	0.04458363
15	3	32	0.12	$2^{- 7}$	0.8898	0.0006191	0.02488172	4	64	0.11	$2^{- 7}$	0.8974	0.0028632	0.05350888
20	3	32	0.12	$2^{- 7}$	0.8670	0.0009036	0.03005994	4	64	0.11	$2^{- 7}$	0.8194	0.0037151	0.06095162
25	3	32	0.11	$2^{- 7}$	0.8577	0.0012025	0.03467708	4	64	0.11	$2^{- 7}$	0.8115	0.0041516	0.06443291
30	3	32	0.12	$2^{- 7}$	0.8458	0.0010909	0.03302878	4	64	0.11	$2^{- 7}$	0.8110	0.0056090	0.07489326
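
As a consistency check on Table 2, the RMSE column should be the square root of the MSE column. The short Python sketch below recomputes it from the reported values; the tolerance is an assumption chosen to absorb the rounding of the MSE to seven decimals, and the names are illustrative.

```python
import math

# (p, MSE, RMSE) exactly as reported in Table 2.
sa_to_sb = [
    (5, 0.0003130, 0.01769181),
    (10, 0.0004816, 0.02194539),
    (15, 0.0006191, 0.02488172),
    (20, 0.0009036, 0.03005994),
    (25, 0.0012025, 0.03467708),
    (30, 0.0010909, 0.03302878),
]
sb_to_sa = [
    (5, 0.0011858, 0.03443545),
    (10, 0.0019877, 0.04458363),
    (15, 0.0028632, 0.05350888),
    (20, 0.0037151, 0.06095162),
    (25, 0.0041516, 0.06443291),
    (30, 0.0056090, 0.07489326),
]


def max_rmse_gap(rows):
    """Largest absolute difference between sqrt(MSE) and the reported RMSE."""
    return max(abs(math.sqrt(mse) - rmse) for _, mse, rmse in rows)


TOLERANCE = 5e-6  # assumed allowance for rounding in the reported MSE
gap_ab = max_rmse_gap(sa_to_sb)
gap_ba = max_rmse_gap(sb_to_sa)
print(gap_ab < TOLERANCE, gap_ba < TOLERANCE)
```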

Table 3. Differences in the mean of the mean squared error (MSE) between groups (lower triangle) and p-values of the Wilcoxon rank-sum test (upper triangle), marked with significance symbols (., *, **, *** indicate statistical significance at the 90%, 95%, 99% and 99.9% levels, respectively).

	$S_{a} \overset{5}{\to} S_{b}$	$S_{a} \overset{10}{\to} S_{b}$	$S_{a} \overset{15}{\to} S_{b}$	$S_{a} \overset{20}{\to} S_{b}$	$S_{a} \overset{25}{\to} S_{b}$	$S_{a} \overset{30}{\to} S_{b}$	$S_{b} \overset{5}{\to} S_{a}$	$S_{b} \overset{10}{\to} S_{a}$	$S_{b} \overset{15}{\to} S_{a}$	$S_{b} \overset{20}{\to} S_{a}$	$S_{b} \overset{25}{\to} S_{a}$	$S_{b} \overset{30}{\to} S_{a}$
$S_{a} \overset{5}{\to} S_{b}$		$\overset{}{0.999906}$	$\overset{}{0.989178}$	$\overset{}{0.558148}$	$\dot{0.085022}$	$\overset{}{0.301065}$	$\overset{*}{0.014258}$	$\overset{* * *}{8.12 \times 10^{- 8}}$	$\overset{* * *}{4.54 \times 10^{- 13}}$	$\overset{* * *}{4.38 \times 10^{- 13}}$	$\overset{* * *}{4.38 \times 10^{- 13}}$	$\overset{* * *}{3.22 \times 10^{- 13}}$
$S_{a} \overset{10}{\to} S_{b}$	0.000197		$\overset{}{0.999997}$	$\overset{}{0.943909}$	$\overset{}{0.412853}$	$\overset{}{0.723442}$	$\overset{}{0.430887}$	$\overset{* * *}{0.000130}$	$\overset{* * *}{2.13 \times 10^{- 10}}$	$\overset{* * *}{4.34 \times 10^{- 13}}$	$\overset{* * *}{4.21 \times 10^{- 13}}$	$\overset{* * *}{3.22 \times 10^{- 13}}$
$S_{a} \overset{15}{\to} S_{b}$	0.000376	0.000179		$\overset{}{0.998649}$	$\overset{}{0.800757}$	$\overset{}{0.950626}$	$\overset{}{0.936813}$	$\overset{* *}{0.008315}$	$\overset{* * *}{1.63 \times 10^{- 7}}$	$\overset{* * *}{6.78 \times 10^{- 12}}$	$\overset{* * *}{9.95 \times 10^{- 13}}$	$\overset{* * *}{3.94 \times 10^{- 13}}$
$S_{a} \overset{20}{\to} S_{b}$	0.000761	0.000563	0.000384		$\overset{}{0.999169}$	$\overset{}{0.999984}$	$\overset{}{1.000000}$	$\overset{}{0.345577}$	$\overset{* * *}{0.000211}$	$\overset{* * *}{4.80 \times 10^{- 8}}$	$\overset{* * *}{3.44 \times 10^{- 9}}$	$\overset{* * *}{4.37 \times 10^{- 13}}$
$S_{a} \overset{25}{\to} S_{b}$	0.001172	0.000974	0.000795	0.000411		$\overset{}{1.000000}$	$\overset{}{0.999583}$	$\overset{}{0.977224}$	$\overset{*}{0.029091}$	$\overset{* * *}{4.56 \times 10^{- 5}}$	$\overset{* * *}{3.70 \times 10^{- 6}}$	$\overset{* * *}{4.41 \times 10^{- 13}}$
$S_{a} \overset{30}{\to} S_{b}$	0.001051	0.000853	0.000674	0.000290	−0.000121		$\overset{}{0.999999}$	$\overset{}{0.939274}$	$\overset{*}{0.023743}$	$\overset{* * *}{4.85 \times 10^{- 5}}$	$\overset{* * *}{4.17 \times 10^{- 6}}$	$\overset{* * *}{4.60 \times 10^{- 13}}$
$S_{b} \overset{5}{\to} S_{a}$	0.000855	0.000658	0.000479	0.000094	−0.000317	−0.000195		$\dot{0.098575}$	$\overset{* * *}{1.07 \times 10^{- 6}}$	$\overset{* * *}{1.71 \times 10^{- 11}}$	$\overset{* * *}{2.26 \times 10^{- 12}}$	$\overset{* * *}{3.65 \times 10^{- 13}}$
$S_{b} \overset{10}{\to} S_{a}$	0.001710	0.001513	0.001334	0.000950	0.000539	0.000660	0.000855		$\overset{}{0.213924}$	$\overset{* * *}{0.000317}$	$\overset{* * *}{2.35 \times 10^{- 5}}$	$\overset{* * *}{4.32 \times 10^{- 13}}$
$S_{b} \overset{15}{\to} S_{a}$	0.002680	0.002483	0.002304	0.001919	0.001509	0.001630	0.001825	0.000970		$\overset{}{0.744952}$	$\overset{}{0.271314}$	$\overset{* * *}{1.23 \times 10^{- 7}}$
$S_{b} \overset{20}{\to} S_{a}$	0.003480	0.003283	0.003104	0.002720	0.002309	0.002430	0.002625	0.001770	0.000800		$\overset{}{0.999790}$	$\overset{* *}{0.002119}$
$S_{b} \overset{25}{\to} S_{a}$	0.003845	0.003647	0.003468	0.003084	0.002673	0.002794	0.002990	0.002134	0.001165	0.000365		$\dot{0.053965}$
$S_{b} \overset{30}{\to} S_{a}$	0.005533	0.005336	0.005157	0.004773	0.004362	0.004483	0.004678	0.003823	0.002853	0.002053	0.001688

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

