Next Article in Journal
Mesostructural Model for the Fatigue Analysis of Open-Cell Metal Foams
Previous Article in Journal
Robust and Accurate Recognition of Carriage Linear Array Images for Train Fault Detection
Previous Article in Special Issue
Multicriteria Decision Model for Port Evaluation and Ranking: An Analysis of Container Terminals in Latin America and the Caribbean Using PCA-TOPSIS Methodologies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improving Ship Fuel Consumption and Carbon Intensity Prediction Accuracy Based on a Long Short-Term Memory Model with Self-Attention Mechanism

1
Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai 201306, China
2
National Engineering Research Center for Special Equipment and Power Systems of Ships and Marine Engineering, Shanghai 201306, China
3
Logistics Engineering College, Shanghai Maritime University, Shanghai 201306, China
4
COSCO Shipping Technology Co., Ltd., Shanghai 200135, China
5
Merchant Marine College, Shanghai Maritime University, Shanghai 201306, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(18), 8526; https://doi.org/10.3390/app14188526
Submission received: 28 August 2024 / Revised: 16 September 2024 / Accepted: 20 September 2024 / Published: 22 September 2024

Abstract

:
The prediction of fuel consumption and Carbon Intensity Index (CII) of ships is crucial for optimizing decarbonization strategies in the maritime industry. This study proposes a ship fuel consumption prediction model based on the Long Short-Term Memory with Self-Attention Mechanism (SA-LSTM). The model is applied to a container ship of 2400 TEU to predict its hourly fuel consumption, hourly CII, and annual CII rating. Four different feature sets are selected from these data sources and are used as inputs for SA-LSTM and another ten models. The results demonstrate that the SA-LSTM model outperforms the other models in prediction accuracy. Specifically, the Mean Absolute Percentage Error (MAPE) for fuel consumption predictions using the SA-LSTM model is reduced by up to 20% compared to the XGBoost and by up to 12% compared to the LSTM model. Additionally, the SA-LSTM model achieves the highest accuracy in annual CII predictions.

1. Introduction

The shipping industry is a significant source of global greenhouse gases, contributing 2.89% of global carbon emissions [1]. The IMO adopted the 2023 IMO Strategy on Reduction of GHG Emissions from Ships, which sets explicit phased targets for reducing both the total amount and intensity of ship carbon emissions, aiming for near-zero emissions by around 2050. The IMO has introduced a series of short-term, medium-term, and long-term measures to promote emission reductions. A key short-term measure is reducing the carbon emission intensity of ships [2]. From 1 January 2023, ships with a gross tonnage exceeding 5000 engaged in international navigation must comply with the Operational Carbon Intensity Index (CII) rating, which increases by 2% annually until 2026 (BIMCO, 2022). Ships that fail to meet the required ratings will need to make necessary adjustments, which could impact their operations, leasing, and sales [3].
Accurately forecasting ship carbon intensity and its rating is essential for effectively managing and controlling ship CII ratings [4]. The key to predicting a ship’s carbon intensity lies in forecasting its fuel consumption. By converting fuel consumption into carbon emissions and considering the sailing distance and capacity of the ship, the carbon emission intensity can be calculated. Ship carbon intensity can be measured on various time scales, including hourly, daily, and annually, with annual carbon intensity being the critical data for determining the ship’s CII rating.
The fuel consumption of ships is not only closely related to the operating cost of the ship, but it also has a significant connection with the ship’s carbon emissions. Hence, the research on ship fuel consumption prediction has received wide attention [5,6,7,8]. With the increasing availability of data, more researchers are using machine learning techniques to predict ship fuel consumption. Wang et al. (2018) employed the Least Absolute Shrinkage and Selection Operator (LASSO) regression algorithm, which proved particularly effective. The results demonstrated that the LASSO regression model had a superior prediction performance when compared to neural networks and support vector regression [9]. Jeon et al. (2018) investigated the Artificial Neural Network (ANN) model and found that it a had higher prediction accuracy for main engine fuel consumption than polynomial regression and support vector machines [10]. Ren et al. (2022) examined the ridge regression model with four different datasets, including Automatic Identification System (AIS) data, Measurement, Reporting and Verification (MRV) data, and MRV-normalized data. They found that the model based on MRV reports achieved the optimal result [11]. Li et al. (2022) compared the results of various prediction models under different combinations of data source such as nautical logs, meteorological data, and AIS data [12,13,14]. Uyanık et al. (2020) also investigated several fuel consumption prediction models, for example, the kernel ridge regression, Bayesian ridge regression, and Adaboost. They found that the ridge regression model had greater accuracy [15].
However, there are not too many studies that focuses on applying deep learning methods to ship fuel consumption prediction. In 2017, Google put forward the Transformer model based on the self-attention mechanism, which enhanced the accuracy of 11 Natural Language Processing (NLP) tasks [16]. Since then, the self-attention mechanism has been introduced in many fields and scenarios, achieving remarkable research results. For instance, Liu et al. (2022) proposed a prediction model based on Long Short-Term Memory (LSTM) and the spatial attention mechanism, which effectively enhanced the accuracy of power consumption prediction in the raw cement material grinding system [17]. Han et al. (2022) proposed a load classification Long Short-Term Memory with Self-Attention Mechanism (SA-LSTM) model in response to the problems of category imbalance and insufficient performance of the classification model in user load classification [18]. Cai et al. (2024) presented a method based on the self-attention long short-term memory network to investigate the problem of dam deformation prediction, and the results indicated that this approach achieved relatively good prediction effects on the actual dam deformation data [19]. Hu et al. (2024) employed a new model combining the long short-term memory network and the self-attention mechanism to address the accuracy problem of photovoltaic power generation output prediction, especially under the influence of weather conditions [20]. Rao et al. (2023) presented a new type of statistical model called non-linear function-on-function regression and found it is effective for analyzing complicated data through simulations and real-world examples [21].
However, relevant research in the shipping carbon emission reduction field is still rather rare [22,23,24,25,26,27]. Since the prediction of fuel consumption and carbon intensity possesses temporal characteristics [28], the LSTM model can capture the short-term dependency relationship in the sequence, but there may be certain limitations in handling long-term dependency relationships. The self-attention mechanism can effectively capture the long-distance dependency relationship between different positions in the sequence. Introducing the self-attention mechanism increases the model’s ability to model the overall information of the sequence, making it more suitable for processing ship operation data that contains long-term dependencies. Therefore, the SA-LSTM model with the introduction of the self-attention mechanism may have the potential to further improve the accuracy of ship fuel consumption prediction.
The main objectives of this study are to investigate the impact of the self-attention mechanism on the LSTM model and to evaluate the performance of the SA-LSTM model in predicting ship fuel consumption using multi-source heterogeneous data, including AIS data, fuel flow sensor data, meteorological data, and sea condition data.
This study is one of the few studies that investigates the application of the self-attention mechanism and LSTM model in predicting ship fuel consumption. It considerably improved the prediction accuracy as compared to most machine learning methods and traditional LSTM models.
The organization of the paper is as follows: Section 1 introduces the research background of this paper and a review of relevant literature. Section 2 introduces the main data and processing of this research. Section 3 introduces the main methodological approach in this research. Section 4 takes a container ship equipped with multiple sensing devices as an example to comparatively analyze the fuel consumption and carbon intensity prediction results of various methods. Finally, relevant discussions are conducted and the conclusions of this paper are summarized.

2. Data Acquisition and Processing

The principal data examined in this study include ship AIS data, sensor data, and meteorological and sea condition data. The meteorological and sea condition data are collected hourly with a spatio-temporal resolution of 0.25° × 0.25°. AIS and sensor data are processed to match this spatio-temporal resolution through aggregation and interpolation.

2.1. AIS Data Acquisition and Processing

AIS data consist of dynamic and static information. Due to various factors, such as weather, location, and equipment, the update frequency of AIS data varies, leading to significant data gaps in some areas. To address this, linear interpolation is employed to fill in missing data, interpolating longitude, latitude, and speed every five minutes. The ship’s sailing distance is calculated by altering the longitude and latitude of adjacent AIS points in the interpolated data. Given the hourly collection frequency of meteorological and sea condition data, the AIS data are aggregated hourly to maintain consistent time resolution.

2.2. Sensor Data Acquisition and Processing

Real-time data related to ships are acquired through various sensors. The real-time fuel consumption data of the ship’s main engine, auxiliary engine, and boiler are obtained using mass flowmeters. Sensor data also provide information on the ship’s main engine speed and trim. The original units for heavy and light oil data (kg/h) are converted to kg based on each data time interval. Missing values in the fuel consumption data are addressed using linear interpolation, interpolating fuel consumption, speed, and trim data with a one-hour interval.

2.3. Navigational Environment Data Acquisition and Processing

Navigational environment data (included meteorological and sea condition data) are sourced from the European Centre for Medium-Range Weather Forecasts (ECMWF) and the Copernicus Marine Service. These data are collected hourly and downloaded in grid format (0.25° × 0.25°).
Meteorological and sea condition data are matched with the longitude, latitude, and time in AIS. Wind speed and current rate components are combined to determine the actual wind speed, wind direction, current speed, and current direction. The relative direction of the case ship is calculated by combing course and direction data. Three parts of the data are fused based on time in Figure 1.

3. Methodological Approach

3.1. Carbon Intensity Prediction Methodology

This article begins by cleansing and preprocessing ship AIS data, sensor data, meteorological data, and sea condition data, followed by spatio-temporal fusion. Next, it extracts relevant features and examines their correlations. The data are then divided into a training set and a test set based on a 0.75:0.25 time series split. Using the devised SA-LSTM model, the study compares its performance with traditional machine learning models [29,30,31]. The hyperparameters of the different models are optimized using Bayesian search and cross-validation. Figure 2 illustrates the overall methodology for ship CII prediction.

3.2. Ship Fuel Consumption Prediction Model Based on SA-LSTM

Due to the successful application of the SA-LSTM model in previous studies across various fields, this paper considers applying it to predict ship fuel consumption. This model can concurrently consider information from each time step within a sequence and dynamically adjust based on the significance of elements in the sequence, effectively capturing patterns and trends in the time series data. The self-attention mechanism, originating from the Transformer model, weighs elements at different positions in a sequence to determine their importance. The structure of the SA-LSTM model used in this study is depicted in Figure 3.
In the SA-LSTM model, the self-attention mechanism is integrated into the LSTM model to enhance the model’s ability to focus on information at various positions in the sequence, thereby better capturing long-distance dependencies within the sequence. The introduction of the self-attention mechanism involves the following steps:
(1) Input Sequence Representation: The sequence signifying the ship’s operating parameters is presented as X = x 1 , x 2 , x t , where t is the sequence length.
(2) Setting queries, keys, and values: For any given input, first set the weights W q , W k , W v of the query, key, and value, respectively, and transform the input linearly to obtain the corresponding results Q , K , and V , and the expression is:
Q = W q X K = W k X V = W v X
Embedding the input sequence, mapping each parameter, xi, to a vector in a high-dimensional space, E ( X ) = e 1 , e 2 , e t . According to the high-dimensional mapping, it can assist the model in better comprehending the relationships among different parameters.
(3) Self-attention calculation: Evaluates the correlation, S , between the Q and K , and then performs a softmax operation on each row of the attention score matrix, S , to obtain the attention weight matrix, A . The specific calculation formula is as follows:
S ( q , k ) = v tanh ( W q Q + W k K ) A = softmax ( S ( q , k ) )
where v is the learnable vector, tanh is the activation function, and W q and W k are the learnable parameters.
(4) Attention-weighted representation: Use the attention weights to perform a weighted sum of the embedded vectors to obtain the attention-weighted representation vector, C , at each position.
C = A ( Q , K , V ) = softmax ( S ( q , k ) ) V
where C is the attention-weighted representation vector.
(5) Combination with the LSTM model: Input the obtained attention-weighted representation vector, C , into the LSTM model as its input sequence. SA-LSTM can capture the long-term dependency relationships in the sequence through the recursive operation of the time step, t , and the recursive formula is as follows:
i t = σ W i i · x t + b i i + W h i · h t 1 + b h i , f t = σ W i f · x t + b i f + W h f h t 1 + b h f , g t = tanh ( W i g x t + b i g + W h g h t 1 + b h g ) , o t = σ ( W i o x t + b i o + W h o h t 1 + b h o ) , c t = f t + c t 1 + i t · g t , h t = o t · tanh c t ,
where the parameters in Equation (4) are described as follows in the Table 1.
Most previous literature has adopted traditional machine learning methods for relevant research, lacking the application of deep learning models, and the SA-LSTM model has shown excellent performance in studies related to temporal sequence prediction. Therefore, to determine the best fuel consumption prediction model, this study compares traditional machine learning models, LSTM, and SA-LSTM models using several regression evaluation indicators. The effectiveness of the predictive models is assessed using the following metrics: the Mean Absolute Error (MAE), the Mean Square Error (MSE), the Root Mean Square Error (RMSE), and the Mean Absolute Percentage Error (MAPE).

3.3. Carbon Intensity Rating Method

To evaluate the corresponding carbon intensity change yielded by the model, it is essential to further manipulate the fuel consumption data prognosticated by the aforementioned model and to convert it into the carbon intensity, C I I t , up to a particular moment, and the specific computational means is as follows [32]:
C I I t = CF × t = 1 t F C t C × t = 1 t D t
where CF represents the carbon conversion coefficient of fuel oil. Due to heavy oil being the main fuel consumed by ships and the limited availability of data, this paper only considers the fuel consumption of heavy oil, so the value is 3.114; t = 1 t F C t represents the total fuel consumption up to moment t ; C represents the deadweight tonnage of the target ship, and the value in this paper is 35337; and t = 1 t D t represents the total sailing distance of the target ship up to moment t .
According to the formula above, the carbon intensity at each moment of the target ship can be acquired, thereby ascertaining the carbon intensity change in the ship within one year. Additionally, in accordance with the definition of carbon intensity, the mantissa obtained by computing through this approach is the annual carbon intensity value of the target ship.
The CII reference baseline of a specific ship type and the rating boundary are combined to determine the carbon intensity rating of the ship. The formula of baseline is as follows [33,34,35]:
  C I I Req = 1 Z 100 × a C c
where C is the deadweight tonnage (DWT) of the ship; a and c are parameters of different ship types; and Z is the reduction coefficient of CII in different years. The formula of the boundary value, B i , is as follows:
B i = exp ( d i )   C I I Req

4. Results and Discussions

This study takes a container ship of 2400 TEU as an example to illustrate the proposed model. The ship was built in 2019 with a DWT of 35,337 and equipped with one main engine for propulsion and two auxiliary engines for electricity. A dataset of 3723 rows with 22 features associated with the fuel consumption of this ship during the year of 2022 was collected.

4.1. Feature Selection Analysis

To investigate the impact of different feature sets on model performance, this paper constructs five datasets with various feature selection methods including Filter, Wrapper, and Embedded.
(1)
Variance Selection
Features with low variance usually have little effect on final prediction results. In this study, features with variance below a threshold of three were removed, which includes draft, trim, wind wave period, combined wave height, wind wave height, wave height, and flow velocity.
(2)
Correlation Coefficient Selection
The setting of correlation coefficient threshold is generally based on experience and specific data, aiming to remove features with relatively weak correlations while preserving the features as much as possible. In this study, features with correlation coefficient greater than 0.1 were selected. Based on this criterion, six features—wind wave period, wind wave height, wind speed, turning rate, draft, and flow direction—were removed. Figure 4 shows correlation coefficient between features and fuel consumption.
(3)
Recursive feature elimination
Recursive feature elimination (RFE) is a feature selection method including a gradient boosting regression model and cross-validation. Four key features including rot, wind direction, merpm, and main engine power are identified by RFE. The specific result of recursive feature elimination is shown in Figure 5.
It is discernible from the figure that the cross-validation score corresponding to the feature amount of four is the highest. Through outputting the features that are in correspondence with this point, it is ascertained that the steering rate, wind direction, main engine speed, and main engine power have the most significant impact on fuel consumption.
(4)
Feature selection based on LASSO
The LASSO model (L1 = 0.001) and identified and discarded features with zero importance. Results are shown in Figure 6.
Combining filtering, wrapper, and embedding methods, this paper constructed four feature subsets for prediction analysis. The specific outcomes are summarized in Table 2.
Hyperparameters are crucial elements affecting model accuracy. To enhance model accuracy, this study analyzes hyperparameter optimization. Typical approaches include grid search, random search, manual adjustment, and Bayesian search. This study selects Bayesian search due to its global optimization capability based on Bayesian theory, proving more effective than grid or random searches. The optimization results are shown in Figure 7.
The results demonstrate that hyperparameter adjustment significantly improves model performance across diverse data sets. Enhancements in MAE, MSE, RMSE, and MAPE attest to the efficacy of the hyperparameter adjustment process in elevating the prediction accuracy of SA-LSTM.

4.2. The Result of Fuel Consumption Prediction

After three Bayesian searches and cross-validation to optimize hyperparameters, this study evaluated model performance on different data sets. The results are presented in Table 3.
As presented in Table 3, the SA-LSTM model shows the best prediction performance on the feature subset selected through the embedding method, whereas SVR has the highest prediction error. Compared to the pre-embedding state, the four evaluation indicators for the SA-LSTM model on the five data sets decreased noticeably, with MAPE decreasing by up to 12%. The specific results are depicted in Figure 8.
It can be found that adding the self-attention mechanism significantly impacts the SA-LSTM across each data set, reducing MAE by up to 17.5%, MSE by up to 22%, RMSE by up to 13%, and MAPE by up to 12%.

4.3. The Assessment of Carbon Intensity

In Figure 9, the CII of the target ship fluctuates significantly in January, stabilizing within 7 to 8 subsequently. This fluctuation is due to limited initial voyages and relatively large CO2 emissions. As cumulative voyage values increase, CII stabilizes when calculated with deadweight tonnage.
By attaining the optimal state of each model through comparative analysis, we predict the fuel consumption within one year and transform the predictions into carbon intensity. The CII distribution for each model’s predictions is shown in Figure 10.
Figure 10 shows that the CII distribution from the SA-LSTM closely aligns with the actual distribution of the target ship, while LSTM and ANN models show larger discrepancies. To visually compare the fit of each model’s carbon intensity predictions to the actual values, a scatter plot is shown in Figure 11.
Figure 11 reveals that LASSO and SVR models have high dispersion and more outliers, deviating significantly from actual values. The ANN and LSTM models show reduced dispersion but still have numerous outliers. Ensemble models (XGBoost, RF, LGB, and ET) exhibit similar, more concentrated scatter distributions. The scatter points of SA-LSTM are concentrated near the fitting curve, indicating minimal dispersion and prediction deviation.

4.4. The Analysis of Carbon Intensity Rating Result

Given that the case ship is a container ship, the rating boundary is calculated with a = 1984 and c = 0.489; Zhang et al. (2023) showed the detailed calculation process [36]. A comparison of the results of all these models are shown in Table 4. It indicates that the proposed SA-LSTM model has the best performance in predicting the annual CII value of this ship.

5. Conclusions

This paper proposed an SA-LSTM model and compared it with ten other models to predict ship fuel consumption and the annual CII value. The self-attention mechanism is introduced to assign weights to the inputs, transforming them into a new matrix, which is fed into the LSTM for model training and prediction. The accuracy of the SA-LSTM model is validated through comparisons with the standard LSTM and traditional models.
The proposed SA-LSTM could be used to dynamically predict the CII rating of ships, and the ship operators could use this information to dynamically adjust their ship operational and management practices to enhance the CII ratings of their ships. (1) However, ships need to install the required sensors and their management systems to collect data in advance, which may be expensive and time-consuming. In practice, both data of history and real-time data should be prepared in order to take advantage of this algorithm.
One limitation of this study is that the proposed SA-LSTM model is simply tested by one case ship, due to the difficulty in collecting real-time fuel consumption data from ships, which may affect the generalization of the results. Additionally, the meteorological and sea condition data were obtained from ECMWF and Copernicus rather than from the ship’s sensors, potentially causing a discrepancy in environmental characteristics that could influence model accuracy.
Future research may focus on improving the generalization of the proposed model by training on data from more ships with different types and sizes. It is also necessary to integrate more real-time operational and environmental datasets and the latest techniques to further improve the performance of the proposed model.

Author Contributions

Conceptualization, Z.W.; methodology, Z.W. and W.L.; software, T.L.; validation, T.L.; formal analysis, Z.W. and T.L.; investigation, Z.W.; resources, Z.W., Y.H.; data curation, T.L.; writing—original draft preparation, T.L.; writing—review and editing, Z.W., T.L., Y.H.; visualization, C.Z.; supervision, Z.W.; project administration, Y.H.; funding acquisition, Z.W., C.Z. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by PRC Ministry of Science and Technology-Subtopics of National Key Research and Development Program (2022YFB4301400) and the Shanghai Municipal Science and Technology Program: “Shanghai Ship Intelligent Operation and Maintenance and Energy Efficiency Monitoring Engineering Technology Research Center” (20DZ2252300).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets presented in this article are not readily available because confidentiality of data. Requests to access the datasets should be directed to Tianye Lu.

Conflicts of Interest

Author Yi Han was employed by the company COSCO Shipping Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. MEPC.328(76). Amendments to the Annex of the Protocol of 1997 to Amend the International Convention for the Prevention of Pollution from Ships, 1973, as Modified by the Protocol of 1978 Relating Thereto. 2021 Revisez MARPOL Annex VI. 2021. Available online: https://wwwcdn.imo.org/localresources/en/KnowledgeCentre/IndexofIMOResolutions/MEPCDocuments/MEPC.328(76).pdf (accessed on 8 December 2022).
  2. Chuah, L.F.; Mokhtar, K.; Ruslan, S.M.M.; Bakar, A.A.; Abdullah, M.A.; Osman, N.H.; Bokhari, A.; Mubashir, M.; Show, P.L. Implementation of the energy efficiency existing ship index and carbon intensity indicator on domestic ship for marine environmental protection. Environ. Res. 2023, 222, 115348. [Google Scholar] [CrossRef] [PubMed]
  3. Oldendorff. CII Is Not the Answer, What Do We Do Now? 2022. Available online: https://oldendorff-website-assets.s3.amazonaws.com/assets/downloads/Oldendorff-EMISSIONS.pdf (accessed on 8 January 2023).
  4. Hoffmann, M. The Impact of ‘Fouling Idling’on Ship Performance and Carbon Intensity Indicator (CII). 2022. Available online: https://selektope.com/wp-content/uploads/2022/06/HullPIC-2022_ITech-conference-paper-.pdf (accessed on 4 January 2023).
  5. Chen, Z.S.; Lam, J.S.L.; Xiao, Z. Prediction of harbour vessel fuel consumption based on machine learning approach. Ocean Eng. 2023, 278, 114483. [Google Scholar] [CrossRef]
  6. Martić, I.; Degiuli, N.; Grlj, C.G. Prediction of Added Resistance of Container Ships in Regular Head Waves Using an Artificial Neural Network. J. Mar. Sci. Eng. 2023, 11, 1293. [Google Scholar] [CrossRef]
  7. Chen, Z.S.; Lam, J.S.L.; Xiao, Z. Prediction of harbour vessel emissions based on machine learning approach. Transp. Res. Part D Transp. Environ. 2024, 131, 104214. [Google Scholar] [CrossRef]
  8. Su, M.; Su, Z.Q.; Cao, S.L.; Park, K.S.; Bae, S.H. Fuel Consumption Prediction and Optimization Model for Pure Car/Truck Transport Ships. J. Mar. Sci. Eng. 2023, 11, 1231. [Google Scholar] [CrossRef]
  9. Wang, S.; Ji, B.; Zhao, J.; Liu, W.; Xu, T. Predicting ship fuel consumption based on LASSO regression. Transp. Res. Part D Transp. Environ. 2018, 65, 817–824. [Google Scholar] [CrossRef]
  10. Jeon, M.; Noh, Y.; Shin, Y.; Lim, O.K.; Lee, I.; Cho, D. Prediction of ship fuel consumption by using an artificial neural network. J. Mech. Sci. Technol. 2018, 32, 5785–5796. [Google Scholar] [CrossRef]
  11. Ren, F.; Wang, S.; Liu, Y.; Han, Y. Container Ship Carbon and Fuel Estimation in Voyages Utilizing Meteorological Data with Data Fusion and Machine Learning Techniques. Math. Probl. Eng. 2022, 2022, 4773395. [Google Scholar] [CrossRef]
  12. Li, X.; Du, Y.; Chen, Y.; Nguyen, S.; Zhang, W.; Schönborn, A.; Sun, Z. Data fusion and machine learning for ship fuel efficiency modeling: Part I—Voyage report data and meteorological data. Commun. Transp. Res. 2022, 2, 100074. [Google Scholar] [CrossRef]
  13. Du, Y.; Chen, Y.; Li, X.; Schönborn, A.; Sun, Z. Data fusion and machine learning for ship fuel efficiency modeling: Part II—Voyage report data, AIS data and meteorological data. Commun. Transp. Res. 2022, 2, 100073. [Google Scholar] [CrossRef]
  14. Du, Y.; Chen, Y.; Li, X.; Schönborn, A.; Sun, Z. Data fusion and machine learning for ship fuel efficiency modeling: Part III—Sensor data and meteorological data. Commun. Transp. Res. 2022, 2, 100072. [Google Scholar] [CrossRef]
  15. Uyanık, T.; Karatuğ, Ç.; Arslanoğlu, Y. Machine learning approach to ship fuel consumption: A case of container vessel. Transp. Res. Part D Transp. Environ. 2020, 84, 102389. [Google Scholar] [CrossRef]
  16. Waswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.; Kaiser, L.; Polosukhin, I. Attention is all you need. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
  17. Liu, G.; Wang, K.; Hao, X.; Zhang, Z.; Zhao, Y.; Xu, Q. SA-LSTMs: A new advance prediction method of energy consumption in cement raw materials grinding system. Energy 2022, 241, 122768. [Google Scholar] [CrossRef]
  18. Han, D.; Wang, S.; Hua, Y.; Bai, H.; Guo, H.; Huang, Y. A Load Classification Method Based on SA-LSTM Network Considering Category Imbalance Processing. In Proceedings of the 2022 2nd International Conference on Intelligent Technology and Embedded Systems (ICITES), Chengdu, China, 23–26 September 2022; pp. 109–115. [Google Scholar]
  19. Cai, S.; Gao, H.; Zhang, J.; Peng, M. A self-attention-LSTM method for dam deformation prediction based on CEEMDAN optimization. Appl. Soft Comput. 2024, 159, 111615. [Google Scholar] [CrossRef]
  20. Hu, Z.; Gao, Y.; Ji, S.; Mae, M.; Imaizumi, T. Improved multistep ahead photovoltaic power prediction model based on LSTM and self-attention with weather forecast data. Appl. Energy 2024, 359, 122709. [Google Scholar] [CrossRef]
  21. Rao, A.R.; Reimherr, M. Modern non-linear function-on-function regression. Stat. Comput. 2023, 33, 130. [Google Scholar] [CrossRef]
  22. Yuan, Z.; Liu, J.; Liu, Y.; Yuan, Y.; Zhang, Q.; Li, Z. Fitting analysis of inland ship fuel consumption considering navigation status and environmental factors. IEEE Access 2020, 8, 187441–187454. [Google Scholar] [CrossRef]
  23. Yuan, Z.; Liu, J.; Zhang, Q.; Liu, Y.; Yuan, Y.; Li, Z. Prediction and optimisation of fuel consumption for inland ships considering real-time status and environmental factors. Ocean Eng. 2021, 221, 108530. [Google Scholar] [CrossRef]
  24. Kalajdžić, M.; Vasilev, M.; Momčilović, N. Power reduction considerations for bulk carriers with respect to novel energy efficiency regulations. Brodogr. Teor. i Praksa Brodogr. i Pomor. Teh. 2022, 73, 79–92. [Google Scholar] [CrossRef]
  25. Wang, S.; Psaraftis, H.N.; Qi, J. Paradox of international maritime organization’s carbon intensity indicator. Commun. Transp. Res. 2021, 1, 100005. [Google Scholar] [CrossRef]
  26. Elkafas, A.G.; Rivarolo, M.; Massardo, A.F. Environmental economic analysis of speed reduction measure onboard container ships. Environ. Sci. Pollut. Res. 2023, 30, 59645–59659. [Google Scholar] [CrossRef] [PubMed]
  27. Dewan, M.H.; Godina, R. Effective Training of Seafarers on Energy Efficient Operations of Ships in the Maritime Industry. Procedia Comput. Sci. 2023, 217, 1688–1698. [Google Scholar] [CrossRef]
  28. Li, X.Y.; Zuo, Y.; Jiang, J.H. Application of Regression Analysis Using Broad Learning System for Time-Series Forecast of Ship Fuel Consumption. Sustainability 2023, 15, 380. [Google Scholar] [CrossRef]
  29. Yildiz, B. Prediction of residual resistance of a trimaran vessel by using an artificial neural network. Brodogr. Teor. i Praksa Brodogr. i Pomor. Teh. 2022, 73, 127–140. [Google Scholar] [CrossRef]
  30. Onur, Y.; Murat, B.; Mustafa, S. Comparative study of machine learning techniques to predict fuel consumption of a marine diesel engine. Ocean Eng. 2023, 286, 115505. [Google Scholar]
  31. Xie, X.; Sun, B.; Li, X.; Olsson, T.; Maleki, N.; Ahlgren, F. Fuel Consumption Prediction Models Based on Machine Learning and Mathematical Methods. J. Mar. Sci. Eng. 2023, 11, 738. [Google Scholar] [CrossRef]
  32. Bayraktar, M.; Yuksel, O. A scenario-based assessment of the energy efficiency existing ship index (EEXI) and carbon intensity indicator (CII) regulations. Ocean Eng. 2023, 278, 114295. [Google Scholar] [CrossRef]
  33. Gianni, M.; Pietra, A.; Coraddu, A.; Taccani, R. Impact of SOFC Power Generation Plant on Carbon Intensity Index (CII) Calculation for Cruise Ships. J. Mar. Sci. Eng. 2022, 10, 1478. [Google Scholar] [CrossRef]
  34. Rauca, L.; Batrinca, G. Impact of Carbon Intensity Indicator on the Vessels’ Operation and Analysis of Onboard Operational Measures. Sustainability 2023, 15, 11387. [Google Scholar] [CrossRef]
  35. Sun, L.; Wang, X.; Lu, Y.; Hu, Z. Assessment of ship speed, operational carbon intensity indicator penalty and charterer profit of time charter ships. Heliyon 2023, 9, e20719. [Google Scholar] [CrossRef]
  36. Zhang, C.; Lu, T.; Wang, Z.; Zeng, X. Research on Carbon Intensity Prediction Method for Ships Based on Sensors and Meteorological Data. J. Mar. Sci. Eng. 2023, 11, 2249. [Google Scholar] [CrossRef]
Figure 1. The case of data fusion based on time.
Figure 1. The case of data fusion based on time.
Applsci 14 08526 g001
Figure 2. The methodology of the ship carbon intensity prediction model.
Figure 2. The methodology of the ship carbon intensity prediction model.
Applsci 14 08526 g002
Figure 3. General structure of the SA−LSTM model.
Figure 3. General structure of the SA−LSTM model.
Applsci 14 08526 g003
Figure 4. The correlation coefficient between features and fuel consumption.
Figure 4. The correlation coefficient between features and fuel consumption.
Applsci 14 08526 g004
Figure 5. The result of recursive feature elimination.
Figure 5. The result of recursive feature elimination.
Applsci 14 08526 g005
Figure 6. The result feature selecting based on the LASSO model.
Figure 6. The result feature selecting based on the LASSO model.
Applsci 14 08526 g006
Figure 7. Analysis of hyperparameter optimization of SA-LSTM.
Figure 7. Analysis of hyperparameter optimization of SA-LSTM.
Applsci 14 08526 g007
Figure 8. The percentage of error reduction of the SA-LSTM model.
Figure 8. The percentage of error reduction of the SA-LSTM model.
Applsci 14 08526 g008
Figure 9. The true carbon intensity distribution of the case ship.
Figure 9. The true carbon intensity distribution of the case ship.
Applsci 14 08526 g009
Figure 10. CII assessment results of different models.
Figure 10. CII assessment results of different models.
Applsci 14 08526 g010
Figure 11. The fitting degree between the CII evaluated by models and the actual value.
Figure 11. The fitting degree between the CII evaluated by models and the actual value.
Applsci 14 08526 g011
Table 1. Interpretation of the parameters in Equation (4).
Table 1. Interpretation of the parameters in Equation (4).
Parameters Interpretation
i t input gate
f t forget gate
g t cell state
o t output gate
σ activation function
W i i the weight matrix from x t to the input gate i t
x t the input at the current time step
b i i the bias of x t to the input gate i t
W h i the weight matrix from h t 1 to the input gate i t
h t 1 the hidden state at the previous time step
b h i the bias of h t 1 to the input gate i t
W i f the weight matrix from x t to the forget gate f t
b i f the bias of x t to the forget gate f t
W h f the weight matrix from h t 1 to the forget gate f t
b h f the bias of h t 1 to the forget gate f t
tanh activation function
W i g the weight matrix from input x t to the candidate memory cell
b i g the bias of the candidate memory cell
W h g the weight matrix from h t 1 to the candidate memory cell
b h g the bias from h t 1 to the candidate memory cell
W i o the weight matrix from input x t to the output gate o t
b i o the bias of x t to the output gate o t
W h o the weight matrix from h t 1 to the output gate o t
b h o the bias of from h t 1 to the output gate o t
c t the memory cell state at the current time step
c t 1 the memory cell state at the previous time step
h t hidden state
Table 2. Feature subsets under the results of different feature selecting methods.
Table 2. Feature subsets under the results of different feature selecting methods.
Data SourcesFeatureOrigin Data/(Set1)Variance/(Set2)Correlation Coefficient/(Set3)RFECV/(Set4)LASSO/(Set5)
AISspeed
rot
draught
distance
Meteorological and Sea State Datawind speed
mpts
mpww
mwp
shww
swh
wwh
swell direction
dww
wd
sst
current spped
wind direction
current direction
sensors datamerpm
trim
power
FC
√: indicate selected features
Table 3. The prediction results of models on different data sets.
Table 3. The prediction results of models on different data sets.
ModelData SetsMAEMSERMSEMAPE
XGBoostSet10.41730.28010.52920.3148
Set20.40380.29240.54080.3004
Set30.41670.29380.54200.3156
Set40.41260.28280.53180.3083
Set50.41720.28230.53130.3184
RFSet10.41650.28430.53320.3161
Set20.41310.28930.53280.3098
Set30.41400.28660.53530.3132
Set40.41300.28590.53470.3095
Set50.41460.28460.53350.3156
LGBSet10.41200.28310.53210.3111
Set20.40820.28290.53190.3073
Set30.41200.28770.53640.3119
Set40.41270.28640.53510.3066
Set50.41230.27820.52750.3109
ETSet10.42270.28650.53530.3226
Set20.41990.28610.53490.3152
Set30.41950.28980.53830.3173
Set40.42030.29110.53950.3172
Set50.42230.28570.53450.3205
LASSOSet10.42990.28590.53470.3172
Set20.43010.28590.53470.3177
Set30.43090.28600.53480.3197
Set40.43410.29100.53940.3261
Set50.43140.28620.53500.3207
SVRSet10.51200.49320.70230.5000
Set20.51100.50050.70750.4878
Set30.51420.48920.69940.5091
Set40.47170.39990.63240.3955
Set50.51110.50210.70860.4858
ANNSet10.43560.32850.57310.3434
Set20.42450.29600.54400.3127
Set30.43560.32850.57310.3434
Set40.42380.32080.56640.3278
Set50.42780.29150.53990.3305
ARIMASet10.59580.61420.783751.4353
Set20.59080.60570.778351.4297
Set30.59080.60570.778351.4297
Set40.58240.60320.776751.4199
Set50.57250.60120.775451.4138
Exponential SmoothingSet10.54430.51210.715650.3562
Set20.53430.50010.707250.3357
Set30.53430.50010.707250.3357
Set40.53120.49880.706350.3328
Set50.52380.49570.704150.3255
LSTMSet10.37260.33040.57480.2596
Set20.38850.27490.52430.2891
Set30.37930.30240.54990.2701
Set40.37660.28800.53670.2646
Set50.37200.27380.52330.2633
SA-LSTMSet10.32700.25700.50700.2471
Set20.33470.24920.49920.2541
Set30.34740.28570.53450.2609
Set40.34740.28570.53450.2609
Set50.30670.24050.49040.2428
Table 4. The annual carbon intensity results of the evaluation by models.
Table 4. The annual carbon intensity results of the evaluation by models.
ModelAnnual CIIErrorAccuracyRating Grade
XGBoost7.77760.033899.58%A
RF7.79870.012899.84%A
LGB7.82110.009699.88%A
ET7.82190.010499.87%A
LASSO7.80130.010299.87%A
SVR7.79270.018899.76%A
ANN7.38370.427794.52%A
ARIMA1.37736.434217.63%A
Exponential Smoothing1.47396.337618.86%A
LSTM7.37230.439194.38%A
SA-LSTM7.81300.001699.98%A
True7.81150100%A
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Z.; Lu, T.; Han, Y.; Zhang, C.; Zeng, X.; Li, W. Improving Ship Fuel Consumption and Carbon Intensity Prediction Accuracy Based on a Long Short-Term Memory Model with Self-Attention Mechanism. Appl. Sci. 2024, 14, 8526. https://doi.org/10.3390/app14188526

AMA Style

Wang Z, Lu T, Han Y, Zhang C, Zeng X, Li W. Improving Ship Fuel Consumption and Carbon Intensity Prediction Accuracy Based on a Long Short-Term Memory Model with Self-Attention Mechanism. Applied Sciences. 2024; 14(18):8526. https://doi.org/10.3390/app14188526

Chicago/Turabian Style

Wang, Zhihuan, Tianye Lu, Yi Han, Chunchang Zhang, Xiangming Zeng, and Wei Li. 2024. "Improving Ship Fuel Consumption and Carbon Intensity Prediction Accuracy Based on a Long Short-Term Memory Model with Self-Attention Mechanism" Applied Sciences 14, no. 18: 8526. https://doi.org/10.3390/app14188526

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop