1. Introduction
Containerization has provided a favorable environment for international trade, and specialization and technological advances have simplified international shipping [
1]. The China containerized freight index (CCFI) was released by the Shanghai Shipping Exchange in 1998 to objectively reflect changes in Chinese container freight rates. As the number of shipping containers continues to increase, the containerized freight index in China has a significant impact on the world economy [
2]. In recent years, an increasing number of scholars have focused on predicting the containerized freight index [
3]. Forecasting models that can accurately and stably predict the containerized freight index are becoming increasingly important and have considerable economic significance [
4].
A traditional econometric model is a theoretical structure that describes the relationships between relevant economic variables. Jeon et al. [
5] employed system dynamics to reflect the supply-side and demand-side drivers of the market and proposed a one-step prediction method by utilizing the cyclical system dynamics approach. Koyuncu and Tavacıoğlu [
6] applied Holt–Winter smoothing and the Seasonal Autoregressive Integrated Moving Average (SARIMA) to forecast the Shanghai Containerized Freight Index (SCFI), which revealed that the SARIMA model has excellent prediction performance in short-term forecasting. Kawasaki et al. [
7] discussed the application of the SARIMA and Vector Autoregression (VAR) models in predicting container trade volume. Hirata and Matsuda [
8] used the SARIMA and Long Short-Term Memory (LSTM) methods to predict SCFI, which demonstrated that the performance of the SARIMA model is better than that of LSTM in predicting cargo volumes along the western and eastern routes of Japan. However, limited data for building models and unrealistic assumptions regarding future conditions have led to the poor performance of traditional economic models in short-term forecasting.
Artificial intelligence models are built on mathematical algorithms that analyze data and learn from data patterns to generate prediction models. In recent years, deep learning models have been widely applied to time-series forecasting. Shankar et al. [
9] used LSTM to predict container throughput. Kim and Choi [
10] used Gate Recurrent Units (GRUs) and LSTM to construct a prediction model for CCFI, which illustrated the improvement of the artificial intelligence models compared to the Autoregressive Integrated Moving Average (ARIMA) in terms of prediction accuracy. Dasari and Bhukya [
11] analyzed the capability of Convolutional Neural Networks (CNNs) and CNN-LSTM in automatic feature extraction, which demonstrated that CNNs have a unique advantage in feature extraction. Khaksar Manshad et al. [
12] proposed a novel time-series link prediction method based on evolutionary computation and irregular cellular learning automata. Swathi et al. [
13] observed the satisfactory performance of LSTM model-based sentiment analysis in predicting time-series data. Xiao et al. [
14] used LSTM and ensemble learning technology to predict the China Coastal Bulk Coal Freight Index. Liang et al. [
15] transformed time-series data into network structures, selected relevant features with the Maximum Relevance, Minimum Redundancy (mRMR) method and applied Support Vector Machine (SVM), Deep Neural Network (DNN), and LSTM models to improve prediction accuracy. However, training artificial intelligence models requires a substantial amount of data for learning, and the quality and quantity of the data have a significant impact on performance.
Numerous models have been employed in the prediction field, and the decomposition ensemble prediction model is one of the most commonly used. Because the containerized freight index has strong nonlinearity and volatility, some scholars have proposed the use of data decomposition technology to decompose complex time series, which has the potential to enhance both the accuracy and stability of predictions. There are many data decomposition methods, such as wavelet packet decomposition [
16], wavelet soft-threshold denoising [
17], Ensemble Empirical Mode Decomposition (EEMD) [
18], Complementary Ensemble Empirical Mode Decomposition (CEEMD) [
19], Improved Complete Ensemble Empirical Mode Decomposition (ICEEMD) [
20], Variational Mode Decomposition (VMD) [
21], and Discrete Fourier Transform [
22]. Specifically, Bai et al. [
23] decomposed raw stock prices using Multivariate Empirical Mode Decomposition (MEMD) and the Neighborhood Rough Set (NRS). Rezaei et al. [
24] discussed Empirical Mode Decomposition (EMD) and CEEMD performance, and Liu et al. [
25] analyzed the results of Interval Discrete Wavelet Transform (IDWT), Interval Empirical Mode Decomposition (IEMD), and Interval VMD (IVMD) under different models. Nguyen and Phan [
26] applied EEMD to decompose time-series data to achieve excellent performance. Yang et al. [
27] introduced VMD and Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) to the prediction model, which illustrated that data preprocessing technology has the benefits of denoising. Jaseena and Kovoor [
28] applied the discrete wavelet transform algorithm to decompose raw data, which proved that Discrete Wavelet Transformation (DWT) is effective in enhancing the prediction model. Li et al. [
29] considered the robustness of the decomposition algorithm to improve the accuracy of the prediction model. Da Silva et al. [
30] presented an ensemble learning model that incorporates variational mode decomposition and singular spectrum analysis. Fang [
31] applied the EMD method to a backpropagation (BP) neural network and ARIMA and used EMD-BP and EMD-ARIMA models to predict the containerized freight index. Chen et al. [
32] decomposed the CCFI into multiple subseries (IMFs) using the EMD method and predicted each IMF separately using the gray method and the Autoregressive Moving Average (ARMA) method, exploring the benefits of gray wave forecasting methods using periodic fluctuations.
There have also been many research achievements in predictors and ensemble schemes. For example, Tian and Chen [
33] used a reinforced LSTM model to predict decomposed modes, and a modified sparrow search algorithm was applied to optimize the hyperparameters of an LSTM model. Zhao et al. [
34] proposed a hybrid deep learning model based on CNN, VMD, and GRUs and validated the model using wind power. Niu et al. [
35] proposed a novel decomposition ensemble model and compared its accuracy with that of simple Recurrent Neural Networks (RNNs), GRUs, and LSTM, which indicated that the EEMD-RNN model can improve accuracy for time-series forecasting. Altan et al. [
36] used LSTM to predict subseries, and prediction results were integrated using Multi-objective Gray Wolf Optimization (MOGWO), which indicated that the ensemble scheme is able to capture the inherent features of the time series and achieve more accurate prediction results than individual prediction models. Joseph et al. [
37] predicted subseries using a bi-directional LSTM network. Luo and Zhang [
38] demonstrated that Bi-directional LSTM (BiLSTM) can analyze the intrinsic characteristics of longer temporal-frequency information, and the attention mechanism can assign more reasonable weights to parameters, which can effectively enhance the accuracy of prediction results. Yang et al. [
39] proved that recursive empirical mode decomposition and LSTM perform well in time-series data. Yu et al. [
40] presented a new model based on LSTM and a time-series cross-correlation network, which achieved excellent prediction results in a wind power field. Yang et al. [
41] employed Extreme Learning Machine (ELM) series models to establish a novel forecasting model and conducted experimental simulations, showing that the ensemble model can achieve excellent performance. Wang et al. [
42] determined VMD parameters using Kullback–Leibler divergence and applied an LSTM-based attention module to predict stock prices. Hao et al. [
43] introduced an advanced prediction model for air pollutant levels that integrates time-series reconstruction, sub-model simulation, weight optimization, and combination techniques to improve both prediction accuracy and stability in environmental management.
By examining the pertinent literature on prediction models and the containerized freight index, these studies show that the decomposition ensemble model has many advantages in containerized freight index prediction, and an increasing number of scholars are focusing on it. However, many problems still exist in forecasting the Containerized Freight Index (CFI). This subsection summarizes some of the limitations of CFI forecasting.
From the above literature analysis, most of the studies directly use the same type of predictor without focusing on the diversity of the prediction models. In theory, predictor diversity improves the robustness of the prediction model, allowing it to remain stable in continuous operation. Hence, this paper suggests introducing a novel model library to bolster the stability of the prediction model, building upon existing frameworks.
Most previous studies selected the current optimal predictors for each subseries, but these predictors do not guarantee that the prediction results are optimal on a global level. Thus, this paper introduces the concept of valid predictors to enhance the overall predictive capability of the model.
To address the gaps identified above, this paper introduces a novel intelligent prediction model. This model is designed to enhance the prediction performance of the CFI by adopting a new approach to adaptive model selection for subseries. The proposed model comprises the following four key modules: adaptive data preprocessing, model library, adaptive model selection, and multi-objective ensemble. First, an adaptive data preprocessing module is introduced based on Sequential VMD (SVMD) technology to mitigate the impact of fluctuations and noise in the original data series. Next, a new model library module is constructed, which consists of four prediction models, namely Outlier Robust Extreme Learning Machine (ORELM), BP, the Group Method of Data Handling (GMDH), and the Adaptive-network-based Fuzzy Inference System (ANFIS), to offer a diversity of predictors for one subseries. These four predictors have good nonlinear fitting abilities and can ensure the prediction accuracy of the proposed model. This study introduces an adaptive model selection module that can efficiently select the appropriate predictor to improve integral prediction effectiveness. Finally, the multi-objective artificial vulture optimization algorithm is introduced into the multi-objective ensemble module, which enhances the accuracy and stability of the prediction model.
In this study, four experiments and one discussion are designed and conducted on four different datasets (Southeast Asia Freight Index, Dalian Containerized Freight Index, SCFI, and CCFI) to verify the superiority of the proposed model. At the same time, eleven benchmark models are built to confirm the superiority of this prediction model. The experimental results indicate that this model significantly outperforms the comparison models across all datasets, with MAPE values of 0.545154, 0.572040, 0.536753, and 0.114937%. This demonstrates the effectiveness of the model selection module in adapting to different datasets. Therefore, the proposed prediction model can serve as a powerful tool for the forecasting of container freight indices. The contributions and innovations of the proposed novel intelligent prediction model are summarized as follows:
Developing a creative prediction model for CFI. Although many studies have applied the decomposition ensemble framework to predict the CFI, the processing of decomposed subseries is inadequate and needs further improvement. Therefore, to fill existing gaps, this study innovatively proposes a prediction model for CFI that incorporates the following four modules: adaptive data preprocessing, model library, adaptive model selection, and multi-objective ensemble. In addition, because the adaptive selection strategy is applied, the number of predictors per subseries is not limited, which supports effective integral prediction.
Devising an innovative adaptive data preprocessing module. It is clear from the analysis of previous studies that VMD is widely used in the decomposition ensemble model, but this study innovatively applies SVMD technology to the decomposition ensemble model for the CFI. Compared with VMD technology, SVMD can adaptively decompose the raw data and determine the number of decomposition layers (K), which can effectively ensure the objectivity of the decomposition process.
Establishing a model library to predict the decomposed subseries. Unlike most previous studies, which did not focus on the variety of predictors for subseries prediction, a new model library module is constructed, incorporating the following four models: ORELM, BP, GMDH, and ANFIS. It can not only greatly improve the diversity of predictors but also effectively advance the nonlinear fitting ability of the prediction model.
Introducing the idea of model validity to adaptive model selection. The adaptive model selection module is designed to obtain valid predictors for each subseries in this study. In this module, Least Absolute Shrinkage and Selection Operator (LASSO) technology is introduced to conduct adaptive model selection, selecting the valid predictors for the subseries from the perspective of advancing the effectiveness of integral forecasting. In contrast to scholars who choose the best predictor for one subseries locally, the number of predictors is not restricted, which means that for one subseries, multiple models may be selected, and it is common that no model is chosen. Thus, adaptive model selection chooses predictors that can effectively enhance the prediction performance from the perspective of the prediction model rather than using the greedy idea to select the optimal current predictor.
Designing the multi-objective ensemble module based on the Multi-objective Artificial Vulture Optimization Algorithm (MOAVOA). The multi-objective ensemble is proposed to integrate the prediction values of subseries, which can concurrently fulfill the requirements of both prediction accuracy and stability. By introducing archive grid and leader selection mechanisms, the search ability of MOAVOA is greatly boosted, which can effectively advance the prediction accuracy and robustness of the prediction model.
The remainder of this paper is organized as follows:
Section 2 describes the preliminary methods.
Section 3 introduces the proposed model.
Section 4 presents the empirical results and verifies the contribution of this study.
Section 5 discusses the results, and
Section 6 summarizes the whole paper.
3. Proposed Method
This study establishes a new intelligent prediction model based on an adaptive selection strategy for CFI, incorporating the following four steps: (1) adaptive data preprocessing, (2) construction of the model library and subseries prediction, (3) adaptive model selection, and (4) multi-objective ensemble (
Figure 1).
Step 1: Adaptive data preprocessing.
SVMD has been successfully applied in many fields and does not require the number of decomposed modes to be preset. Therefore, in this study, SVMD was selected to decompose the CFI. Multiple modes can be obtained from the raw data through the adaptive data preprocessing module, which significantly reduces the impact of data noise on prediction accuracy.
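The sequential extraction loop behind this module can be illustrated with a toy decomposition. The sketch below peels one band-limited mode at a time off the residual via FFT masks; it is only a stand-in for SVMD (the function name and band choices are illustrative assumptions, not the paper's algorithm), but it shows the "extract, subtract, repeat" pattern and the exact-reconstruction property the modes must satisfy.

```python
import numpy as np

def sequential_lowpass_modes(signal, n_modes=3):
    """Toy sequential decomposition: peel off modes one at a time,
    from the lowest frequency band upward, via FFT band masks.
    Illustrates the sequential-extraction idea behind SVMD only;
    this is NOT the SVMD algorithm itself."""
    n = len(signal)
    freqs = np.fft.rfftfreq(n)
    # Split the frequency axis into n_modes contiguous bands.
    edges = np.linspace(0, freqs[-1] + 1e-12, n_modes + 1)
    residual = signal.astype(float).copy()
    modes = []
    for k in range(n_modes):
        spectrum = np.fft.rfft(residual)
        mask = (freqs >= edges[k]) & (freqs < edges[k + 1])
        mode = np.fft.irfft(spectrum * mask, n=n)
        modes.append(mode)
        residual = residual - mode   # sequential: remove the extracted mode
    modes[-1] += residual            # fold any leftover into the last mode
    return modes

t = np.linspace(0, 1, 256, endpoint=False)
x = np.sin(2 * np.pi * 3 * t) + 0.5 * np.sin(2 * np.pi * 60 * t)
modes = sequential_lowpass_modes(x, n_modes=3)
print(np.allclose(sum(modes), x))  # modes reconstruct the raw series
```

By construction, the residual is folded back in, so the modes always sum exactly to the raw series; a real SVMD additionally adapts each mode's center frequency and bandwidth.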
Step 2: Construction of the model library and subseries prediction.
Owing to the strong nonlinearity of the containerized freight index, it is challenging for the single-predictor model to predict CFI trends; thus, the model library module is introduced into the prediction model. The model library contains the following four predictors: ORELM, BP, GMDH, and ANFIS. These predictors have a strong nonlinear fitting ability, and each predictor has unique characteristics, which gives the proposed model a high prediction accuracy.
Step 3: Adaptive model selection.
Because different models have different prediction abilities for different subseries, an adaptive model selection module is essential. The LASSO feature selection algorithm achieves feature dimensionality reduction by introducing an L1-norm penalty term. In this study, the LASSO algorithm is used for adaptive model selection, which can effectively extract key impact factors and select the valid predictors for each subseries.
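As a hedged sketch of this idea, the snippet below runs a minimal coordinate-descent LASSO over the outputs of two hypothetical sub-predictors for one subseries; predictors whose coefficients survive the L1 penalty are kept as "valid". The predictor labels, data, and penalty strength are illustrative assumptions, and the paper's actual selection is performed from the perspective of integral forecasting rather than per subseries.

```python
import numpy as np

def lasso_cd(X, y, alpha=0.05, n_iter=200):
    """Minimal LASSO via cyclic coordinate descent:
    minimize (1/2n)||y - Xw||^2 + alpha * ||w||_1."""
    n, p = X.shape
    w = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            r = y - X @ w + X[:, j] * w[j]        # partial residual
            rho = X[:, j] @ r / n
            w[j] = np.sign(rho) * max(abs(rho) - alpha, 0.0) / col_sq[j]
    return w

rng = np.random.default_rng(0)
t = np.linspace(0, 4 * np.pi, 200)
subseries = np.sin(t)                             # actual subseries values
preds = np.column_stack([
    subseries + 0.01 * rng.standard_normal(200),  # accurate predictor
    rng.standard_normal(200),                     # uninformative predictor
])
w = lasso_cd(preds, subseries)
# Hypothetical labels: nonzero coefficients mark the "valid" predictors.
valid = [name for name, wj in zip(["ORELM", "BP"], w) if abs(wj) > 0.01]
print(valid)
```

The L1 penalty shrinks the uninformative predictor's coefficient to zero, so it is dropped; with this mechanism a subseries can keep several predictors, or none at all, exactly as described above.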
Step 4: Multi-objective ensemble.
The multi-objective ensemble is an important step in the prediction model that combines the prediction values of multiple subseries into a final prediction result. As different subseries prediction values may have different proportions in the final result, weights are added to different subseries prediction values. Because the MOAVOA has a strong ability to combine predictors, it is used to estimate the predictor weights in this study.
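A minimal sketch of this weight-estimation step follows, with random search standing in for the MOAVOA (which the source does not specify in code): each candidate weight vector is scored on two objectives, RMSE for accuracy and the standard deviation of the errors for stability, and only Pareto-nondominated candidates are retained. All data, the predictor outputs, and the compromise rule are illustrative assumptions.

```python
import numpy as np

def pareto_front(points):
    """Indices of non-dominated points (both objectives minimized)."""
    keep = []
    for i, p in enumerate(points):
        dominated = any(
            (q[0] <= p[0] and q[1] <= p[1]) and (q[0] < p[0] or q[1] < p[1])
            for j, q in enumerate(points) if j != i
        )
        if not dominated:
            keep.append(i)
    return keep

def ensemble_objectives(weights, sub_preds, actual):
    """Two objectives for a candidate weight vector:
    accuracy (RMSE) and stability (std of the errors)."""
    err = actual - sub_preds @ weights
    return np.sqrt(np.mean(err ** 2)), np.std(err)

rng = np.random.default_rng(1)
actual = np.sin(np.linspace(0, 6, 120))
# Hypothetical subseries predictions (true values plus noise).
sub_preds = np.column_stack([actual + 0.05 * rng.standard_normal(120)
                             for _ in range(3)])
candidates = rng.random((200, 3))        # random search stands in for MOAVOA
objs = [ensemble_objectives(w, sub_preds, actual) for w in candidates]
front = pareto_front(objs)
# Illustrative compromise rule: smallest sum of the two objectives.
best = candidates[min(front, key=lambda i: sum(objs[i]))]
print(best.shape)
```

A real MOAVOA replaces the random candidate generation with guided vulture-inspired search plus archive-grid and leader-selection mechanisms, but the Pareto bookkeeping is the same.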
4. Empirical Results and Analysis
This section presents the empirical results of the proposed model.
Section 4.1 introduces the data selection and performance metrics for the prediction results.
Section 4.2 describes the experimental design used to demonstrate the experimental objectives. In
Section 4.3, the results of the adaptive model selection module are discussed and analyzed. In
Section 4.4, an experiment is designed to compare a single artificial neural network with the proposed model. In
Section 4.5, an experiment is designed to compare the equal-weight method with the proposed model. In
Section 4.6, an experiment is designed to compare different intelligent ensemble methods with the proposed model. These experiments were performed using MATLAB R2022b on a Windows 10 machine with an Intel Core i5-10210U CPU.
4.1. Data Description and Performance Metrics
The Southeast Asia Freight Index (SEAFI), Dalian Containerized Freight Index (DCFI), SCFI, and CCFI were selected to verify the model proposed in this study. These container freight indices have important applications in measuring the level of container transportation markets worldwide. The Southeast Asia CFI provides key insights into shipping trends in Southeast Asia and aids businesses and policymakers in making informed logistics and trade decisions. The Dalian CFI reflects shipping activities in Northeast China, helping businesses manage costs and optimize supply chains in this strategic area. The Shanghai CFI monitors shipping status throughout Shanghai and serves as a crucial indicator of global shipping trends and economic health. The China CFI offers a comprehensive overview of China’s container shipping market, covering multiple ports and helping businesses engaged in international trade with China forecast shipping expenses and adjust logistics strategies. The data are derived from
https://x.qianzhan.com/xdata/ (accessed on 8 March 2023). The SEAFI dataset has 288 samples, the DCFI dataset has 288 samples, the SCFI dataset has 480 samples, and the CCFI dataset has 960 samples. The datasets were partitioned into three segments, namely training, validation, and testing sets, with an allocation ratio of 12:3:1. The data features and segmentation of all datasets are shown in
Figure 2. Training samples from different datasets were used to train the prediction model, and the empirical results were compared to confirm the advantages of the proposed model. Specifically, SEAFI had 216 training samples, DCFI had 216, SCFI had 360, and CCFI had 720.
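The 12:3:1 allocation above can be reproduced with a small helper (a sketch assuming each dataset length is divisible by 16, as all four are):

```python
def split_12_3_1(n_samples):
    """Partition a dataset of n_samples into training/validation/testing
    counts with the paper's 12:3:1 ratio (n_samples divisible by 16)."""
    unit = n_samples // 16
    return 12 * unit, 3 * unit, 1 * unit

for name, n in [("SEAFI", 288), ("DCFI", 288), ("SCFI", 480), ("CCFI", 960)]:
    print(name, split_12_3_1(n))
# SEAFI and DCFI: (216, 54, 18); SCFI: (360, 90, 30); CCFI: (720, 180, 60)
```

The training counts (216, 216, 360, and 720) match those reported for the four datasets.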
Table 1 presents the descriptive statistics for the four datasets.
Table 1 shows that the descriptive statistics of the datasets differ considerably; the datasets therefore have distinct characteristics, which makes them well suited for testing the robustness of the proposed model.
MAE, RMSE, MAPE, IA, and TIC were chosen to assess model performance because they each offer unique insights and are widely used in predictions [
41]. MAE measures the average error magnitude in the same units as the data, making it easy to understand. RMSE emphasizes larger errors by squaring them, which is useful when large errors require more attention. MAPE provides the error as a percentage, making it easy to compare different datasets. IA measures both the size and direction of errors, with values ranging from 0 to 1, indicating agreement between the observed and predicted values. TIC compares forecasting performances across different scales and balances different error types, which is particularly helpful in economic forecasting. In addition, the stability of the prediction model can be determined by assessing the standard deviation of the prediction results [
52]. A smaller standard deviation suggests that the model exhibits greater robustness and that the prediction results are more reliable. These metrics provide a comprehensive evaluation of model accuracy and error characteristics. The mathematical equations for these metrics are listed in
Table 2.
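For reference, the five metrics can be computed as follows. These are the standard definitions (Willmott's index of agreement for IA, the Theil inequality coefficient for TIC); the authoritative forms are the equations in Table 2, so this is a sketch under that assumption.

```python
import numpy as np

def metrics(y, yhat):
    """MAE, RMSE, MAPE (%), IA (index of agreement), and TIC,
    using their standard textbook definitions."""
    y, yhat = np.asarray(y, float), np.asarray(yhat, float)
    err = y - yhat
    mae = np.mean(np.abs(err))
    rmse = np.sqrt(np.mean(err ** 2))
    mape = 100 * np.mean(np.abs(err / y))
    ia = 1 - np.sum(err ** 2) / np.sum(
        (np.abs(yhat - y.mean()) + np.abs(y - y.mean())) ** 2)
    tic = rmse / (np.sqrt(np.mean(y ** 2)) + np.sqrt(np.mean(yhat ** 2)))
    return {"MAE": mae, "RMSE": rmse, "MAPE": mape, "IA": ia, "TIC": tic}

m = metrics([100, 110, 120], [101, 108, 123])
print(m["MAE"])  # average absolute error in the data's own units
```

A perfect forecast gives MAE = RMSE = TIC = 0 and IA = 1, which is why lower values of the error metrics and higher IA indicate better performance.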
4.2. Experimental Design
Eleven comparative models were constructed to demonstrate the predictive performance of the proposed model, and four comparative experiments were conducted to verify its superiority. Experiment I introduces the selection results of the adaptive model selection module on the different datasets, proving the necessity of this module in the proposed prediction model. In Experiment II, the four sub-predictors (ORELM, BP, GMDH, and ANFIS) were used as comparison models to verify the necessity and effectiveness of the adaptive data preprocessing and multi-objective ensemble modules, showing that the proposed model outperforms a single neural network model. To further demonstrate the superiority of the multi-objective ensemble, Experiment III compares the proposed model with equal-weight ensemble models. In Experiment IV, the proposed model was compared with other multi-objective ensemble methods, showing that it outperforms these as well.
4.3. Experiment I: Results of the Adaptive Model Selection Module
A comprehensive assessment of the results of the adaptive model selection process across different datasets was conducted. The model library, comprising ORELM, BP, GMDH, and ANFIS, showed a robust nonlinear fitting capability. By employing consistent hyperparameters across all the models, the influence of model-specific parameters was minimized, ensuring a fair comparison. The necessity of selecting models individually for each dataset stems from their distinct predictive capabilities. This is further exemplified by the varying contributions of the different dataset modes to the final prediction results, necessitating adaptive model selection for optimal performance. The outcomes of the adaptive model selection module for different datasets are shown in
Table 3.
In the SEAFI dataset, ORELM and ANFIS are effective for mode 1, while ORELM, BP, and ANFIS are suited for mode 2. Interestingly, mode 3 requires no model, indicating that its exclusion may enhance forecasting accuracy. For the DCFI dataset, ORELM performed well for mode 1. BP and ANFIS were preferred for mode 2, and ORELM and ANFIS were suitable for mode 3. The results on the SCFI dataset demonstrate ORELM’s efficacy for mode 1, that of BP and ANFIS for mode 2, and that of a combination of ORELM and GMDH for mode 3. Finally, in the CCFI dataset, ORELM, BP, and ANFIS were effective for mode 1, and the same combination worked well for mode 2, with GMDH being the choice for mode 3. Owing to the use of adaptive model selection techniques, the proposed model achieved better predictive results than the comparison models. These results underscore the importance of adaptive model selection tailored to each dataset’s characteristics, demonstrating that different models and their combinations contribute variably to the prediction accuracy, depending on the data modes.
4.4. Experiment II: Comparison of Single Artificial Neural Network Model and Proposed Model
In this experiment, the comparative outcomes of a single artificial neural network and the proposed model were analyzed. The selected single artificial neural network models were ORELM, BP, GMDH, and ANFIS. The comparative outcomes between the single artificial neural network and the proposed model are displayed in
Table 4. Visualizations of both the single artificial neural network and the proposed model are presented in
Figure 3.
The experimental results compare the performance of the single artificial neural network models (ORELM, BP, GMDH, and ANFIS) with that of the proposed model in predicting containerized freight indices across multiple datasets. The proposed model consistently outperforms the single models, demonstrating significantly lower MAE, RMSE, MAPE, and TIC values. For instance, on the SEAFI dataset, the proposed model achieved an MAE of 27.635876, an RMSE of 32.146729, and an MAPE of 0.545154%. The BP model, which performed the worst, recorded an MAE of 151.781720, an RMSE of 178.337512, and an MAPE of 3.015159%. This stark contrast highlights the superior predictive accuracy and reduced error rates of the proposed model compared with single neural network models. On the other datasets, the proposed model continued to demonstrate superior performance. On the DCFI dataset, it achieved an MAE of 5.122103, an RMSE of 6.506787, and an MAPE of 0.572040%, whereas the ANFIS model performed poorly, with an MAE of 59.555643, an RMSE of 71.965793, and an MAPE of 6.865999%. Similarly, on the SCFI dataset, the proposed model had an MAE of 4.469254, an RMSE of 5.254923, and an MAPE of 0.536753%, significantly outperforming the ANFIS model’s MAE of 34.211761, RMSE of 39.326185, and MAPE of 4.063924%. On the CCFI dataset, the proposed model showed outstanding performance, with an MAE of 0.995431, an RMSE of 1.278544, and an MAPE of 0.114937%, compared with the BP model’s MAE of 10.592830, RMSE of 13.355790, and MAPE of 1.198346%.
These results indicate that the proposed model not only offers better accuracy but also greater reliability across various datasets, making it a superior choice for predicting containerized freight indices. Owing to the use of adaptive data preprocessing techniques, the proposed model achieved better predictive results than the comparison models. This consistent superiority across multiple performance metrics and datasets underscores the robustness and effectiveness of the proposed model for handling complex predictive tasks in freight index forecasting.
4.5. Experiment III: Comparison of Equal-Weight Method and the Proposed Model
In this section, we present a comparative experiment to assess the equal-weight strategy against the proposed model. The following comparison models were chosen: ADP-ORELM-EW, ADP-BP-EW, ADP-GMDH-EW, and ADP-ANFIS-EW, where ADP is an adaptive data preprocessing operation using SVMD and EW is the equal-weight method. All models used the same hyperparameters to mitigate the impact of the model parameters on the prediction outcomes. The comparative outcomes between the simple ensemble model and the proposed model are presented in
Table 5, and a visual presentation of the comparative outcomes is presented in
Figure 4.
The proposed model demonstrates superior prediction performance on the SEAFI, DCFI, SCFI, and CCFI datasets, with MAE values of 27.635876, 5.122103, 4.469254, and 0.995431, respectively. In comparison, ADP-BP-EW records the highest MAE of 113.873704 on SEAFI, ADP-ANFIS-EW has the worst MAE of 10.902471 on DCFI, and ADP-GMDH-EW exhibits the poorest MAE values of 6.653787 and 1.877313 on SCFI and CCFI, respectively. In terms of RMSE, the proposed model achieves 32.146729, 6.506787, 5.254923, and 1.278544 on the SEAFI, DCFI, SCFI, and CCFI datasets, respectively. In contrast, ADP-GMDH-EW shows the poorest RMSE values, reaching 245.850597 on SEAFI, 7.532368 on SCFI, and 2.411816 on CCFI, while ADP-ANFIS-EW records the worst RMSE of 13.981185 on DCFI. For MAPE, the proposed model attained values of 0.545154%, 0.572040%, 0.536753%, and 0.114937% on SEAFI, DCFI, SCFI, and CCFI, respectively. Comparatively, ADP-BP-EW achieved the worst MAPE of 2.219730% on SEAFI, ADP-ANFIS-EW recorded the worst MAPE of 1.224123% on DCFI, and ADP-GMDH-EW showed the poorest MAPE values of 0.784850% and 0.215563% on SCFI and CCFI, respectively.
The experimental results highlight the benefits and advantages of the proposed model, particularly in its ability to significantly outperform alternative models across multiple performance indicators such as MAE, RMSE, and MAPE. The strength of the proposed model lies in its adaptive model selection module and multi-objective ensemble module, which collectively enhance its predictive accuracy and robustness. These components enable the model to select the most suitable models dynamically and optimally combine them, leading to more precise and reliable predictions. This adaptability is crucial for handling the varying characteristics and complexities of different datasets. Owing to the use of multi-objective ensemble techniques, the proposed model achieved better predictive results than the comparison models. Moreover, compared with traditional equal-weight ensemble models, the proposed model shows a significant improvement. This further demonstrates the superiority of multi-objective ensemble methods, providing a more nuanced and effective approach for predictive modeling. Its superior performance across diverse datasets underscores the versatility and potential of the model for broad applicability to various predictive tasks.
4.6. Experiment IV: Comparison of Different Intelligent Ensemble Methods and the Proposed Model
The multi-objective ensemble module plays a vital role in consolidating the outputs of multiple predictors to produce the final prediction value. In this section, a comparison experiment designed to explore the influence of various multi-objective ensemble schemes on the final prediction value is presented. ADP-LASSO-MOGWO, ADP-LASSO-MODA (Multi-objective Dragonfly Algorithm), and ADP-LASSO-MOPSO (Multi-objective Particle Swarm Optimization) were selected for comparison with the proposed model, and the adaptive model selection module was established based on LASSO technology. All models used the same hyperparameters to mitigate the impact of the model parameters on the prediction outcomes.
Because each intelligent optimization scheme has different ensemble capabilities for different datasets, it is necessary to analyze the different ensemble schemes separately for each dataset. The performance of each intelligent optimization scheme is shown in
Table 6. A visualization of each intelligent optimization scheme is shown in
Figure 5.
In the SEAFI dataset, the proposed model excels, with an MAE of 27.635876, an RMSE of 32.146729, and an MAPE of 0.545154%. In contrast, ADP-LASSO-MOGWO performed poorly, with MAE, RMSE, and MAPE values of 29.484178, 34.217756, and 0.583031%, respectively. ADP-LASSO-MODA showed the worst RMSE of 34.409024. For the DCFI dataset, the proposed model demonstrated strong performance, achieving an MAE of 5.122103, RMSE of 6.506787, and MAPE of 0.572040%. Conversely, ADP-LASSO-MODA underperformed, with corresponding values of 5.525988, 7.271350, and 0.612448%, while ADP-LASSO-MOGWO had the highest MAE and MAPE values of 5.568783 and 0.632117%, respectively. On the SCFI dataset, the proposed model again shows superior results, with an MAE of 4.469254, RMSE of 5.254923, and MAPE of 0.536753%. Meanwhile, ADP-LASSO-MODA lags behind, recording corresponding values of 4.868545, 5.737525, and 0.588101%. Finally, on the CCFI dataset, the proposed model achieved outstanding performance, with an MAE of 0.995431, RMSE of 1.278544, and MAPE of 0.114937%. Conversely, ADP-LASSO-MODA yielded less favorable results, with corresponding values of 1.142011, 1.440428, and 0.131289%.
The experimental results indicate that the proposed model significantly outperforms its counterparts in predictive accuracy across multiple datasets. This performance is underscored by lower MAE, RMSE, MAPE, IA, and TIC values, suggesting that the model delivers more precise and reliable forecasts. The experimental results prove that the proposed model exhibits superior predictive capabilities compared to other multi-objective ensemble models. Owing to the use of multi-objective ensemble techniques based on the MOAVOA, the proposed model achieved better predictive results than the comparison models. The benefits of the proposed model include enhanced predictive accuracy, which can lead to better decision making and resource allocation. The robustness and consistency of the model across various datasets also highlight its adaptability and potential applicability to a wide range of predictive tasks. In addition, the model effectively enhanced the overall predictive performance, indicating its ability to efficiently handle complex data patterns.
5. Further Discussion
This section discusses the necessity of the adaptive model selection and multi-objective ensemble modules in the proposed model. To further verify the contribution of each module to the overall model, Equation (35), P = (I_com − I_pro)/I_com, is used to analyze the improvement percentage of point predictions [41]. Here, I_com and I_pro are the statistical indicator values for the comparison and proposed models, respectively. The larger the P value, the better the performance of the proposed prediction model compared to that of the comparison model.
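Assuming the standard definition of the improvement percentage, P = (I_com − I_pro)/I_com, which is consistent with the positive fractional values reported in this section, the computation is straightforward; the sample values below are the SEAFI MAPEs quoted in Experiment II.

```python
def improvement_percentage(i_comparison, i_proposed):
    """P = (I_com - I_pro) / I_com: the fraction by which the proposed
    model improves a comparison model's error metric (assumed standard
    form of Equation (35))."""
    return (i_comparison - i_proposed) / i_comparison

# SEAFI MAPE: BP model 3.015159% vs the proposed model's 0.545154%.
print(improvement_percentage(3.015159, 0.545154))
```

A positive P means the proposed model has the smaller error; P near 1 indicates a large relative improvement, while P near 0 indicates the two models perform similarly.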
Table 7 lists the improvement percentages of the proposed model across the four datasets.
Comparison of the proposed model with the EW models ADP-ORELM-EW, ADP-BP-EW, ADP-GMDH-EW, and ADP-ANFIS-EW shows that the improvement percentages are all positive, which indicates that the proposed model outperforms the models with EW. Specifically, the improvement percentages of the proposed model over ADP-GMDH-EW for RMSE and TIC reach 0.869243 and 0.868656, respectively, on the SEAFI dataset. Therefore, the adaptive model selection module in the proposed model is highly significant for the CFI. According to a comparison of the proposed model with models employing other intelligent optimization algorithms, namely ADP-LASSO-MOGWO, ADP-LASSO-MODA, and ADP-LASSO-MOPSO, the improvement percentages are also all positive, which demonstrates that the performance of the proposed model surpasses that of models using other intelligent optimization algorithms. The differences between the proposed model and these models are relatively small for most indicators and datasets. However, the improvement percentages of the proposed model over ADP-LASSO-MODA on the RMSE and TIC indicators reach 0.105147 and 0.105402, respectively, on the DCFI dataset. Hence, the multi-objective ensemble module in the proposed model is more effective than other intelligent optimization algorithms for the CFI.
Comparative analysis shows that the proposed model consistently outperforms other models, thanks to its effective adaptive model selection and multi-objective ensemble modules. These components significantly improve the forecasting performance of the model, making it superior to models with equal-weight ensembles and those using other intelligent optimization algorithms.