Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel

Uyanık, Tayfun; Yalman, Yunus; Kalenderli, Özcan; Arslanoğlu, Yasin; Terriche, Yacine; Su, Chun-Lien; Guerrero, Josep M.

doi:10.3390/math10224167

Open AccessArticle

Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel

by

Tayfun Uyanık

^1,2,*

,

Yunus Yalman

^2,3

,

Özcan Kalenderli

⁴

,

Yasin Arslanoğlu

¹

,

Yacine Terriche

²,

Chun-Lien Su

^5,*

and

Josep M. Guerrero

²

¹

Maritime Faculty, Istanbul Technical University, Tuzla, İstanbul 34940, Turkey

²

Center for Research on Microgrids, AAU Energy, Aalborg University, 9220 Aalborg, Denmark

³

Faculty of Engineering and Natural Sciences, Ankara Yıldırım Beyazıt University, Çubuk, Ankara 06760, Turkey

⁴

Faculty of Electrical and Electronics Engineering, Istanbul Technical University, Maslak, İstanbul 34469, Turkey

⁵

Department of Electrical Engineering, National Kaohsiung University of Science and Technology, Kaohsiung City 807618, Taiwan

^*

Authors to whom correspondence should be addressed.

Mathematics 2022, 10(22), 4167; https://doi.org/10.3390/math10224167

Submission received: 8 October 2022 / Revised: 1 November 2022 / Accepted: 5 November 2022 / Published: 8 November 2022

(This article belongs to the Special Issue Advanced Statistical Control and Predictive Maintenance Models for Industry 4.0)

Download

Browse Figures

Versions Notes

Abstract

:

In recent years, shipborne emissions have become a growing environmental threat. The International Maritime Organization has implemented various rules and regulations to resolve this concern. The Ship Energy Efficiency Management Plan, Energy Efficiency Design Index, and Energy Efficiency Operational Indicator are examples of guidelines that increase energy efficiency and reduce shipborne emissions. The main engine shaft power (MESP) and fuel consumption (FC) are the critical components used in ship energy efficiency calculations. Errors in ship energy efficiency calculation methodologies are also caused by misinterpretation of these values. This study aims to predict the MESP and FC of a container ship with the help of data-driven methodologies utilizing actual voyage data to assist in the calculation process of the ship’s energy efficiency indexes appropriately. The algorithms’ prediction success was measured using the RMSE, MAE, and R² error metrics. When the simulation results were analyzed, the Deep Neural Network and Bayes algorithms predicted MESP best with 0.000001 and 0.000002 RMSE, 0.000987 and 0.000991 MAE, and 0.999999 R², respectively, while the Multiple-Linear Regression and Kernel Ridge algorithms estimated FC best with 0.000208 and 0.000216 RMSE, 0.001375 and 0.001471 MAE, and 0.999999 R², respectively.

Keywords:

fuel consumption; energy efficiency; machine learning; deep neural network; power prediction

MSC:

68T07

1. Introduction

1.1. Background

The maritime sector has become more consolidated as the volume of global commerce has increased in recent years [1]. As a result, there has been an increase in emissions caused by shipping because of using fossil fuels [2]. Various rules and regulations were enacted by the International Maritime Organization (IMO) to limit emissions in the shipping industry [3,4]. Therefore, several indices such as the Energy Efficiency Design Index (EEDI), Ship Energy Efficiency Management Plan (SEEMP), and Energy Efficiency Operational Indicator (EEOI) have been proposed to determine and enhance the energy efficiency level of the global fleet of marine vessels [5]. In terms of shipping companies, it can be claimed that energy efficiency begins at the design stage [6] and that ship energy efficiency may be boosted through operational ways in addition to numerous design approaches [7,8,9,10]. To improve operational energy efficiency, methods such as cruise route optimization [11], speed optimization [12], alternative fuels [13], electrical power optimization [14,15,16] and other improvements have been recommended in the literature [7,17]. The use of methodologies like SEEMP and EEOI to measure energy efficiency in ships was developed by studies on the relationship between speed and main engine shaft power [18,19]. These methods are regarded as effective, although they do have certain drawbacks. Because of internal, external, and severe circumstances, the connection between speed and propulsion power aboard might diverge from design data. This is due to a number of internal and external variables that affect the vessel’s function [20]. Aside from exterior meteorological elements like wind, wave, and current, numerous situations originating from the ship’s internal dynamics have a significant impact on its efficiency [21]. As a result, in addition to evaluating the energy efficiency of ships under ideal test conditions, it is required to make use of a variety of data collected throughout the voyage [22].

During the voyage, the vessel may need to increase its speed under certain scenarios. In such cases, it will be necessary to create more propulsion power in order to achieve the desired speed values, which demand more fuel. Hence, this process results in consuming more fuel, emitting more exhaust gases into the environment, and decreasing the ship’s energy efficiency [23]. It can be a beneficial approach to evaluate power and fuel consumption at this moment to solve difficulties linked to determining ship energy efficiency in the maritime sector [24]. With the advancement of technology, data-driven methods have expanded their field of application and proven their success in a wide variety of industries [25,26,27,28,29]. Obtaining data from a system in the maritime industry can be described as a challenging process until the last ten years. The development of sensor technology has made it possible to obtain meaningful voyage data from ships. Through sensors in the ship’s engine room, the main engine, and the engine logbook, the shaft power and fuel consumption were evaluated using data-driven approaches in this research. The dataset was analyzed using correlation analysis [30]. Another technique depends on the pair plots to identify the variables most highly correlated with power and fuel consumption during dataset analysis. Further, for predicting shaft power and fuel consumption, some techniques like Multiple Linear, Ridge, Lasso, Kernel Ridge, Elastic Net, Artificial Neural Network, XGradient Boosting, Deep Neural Network, and Bayes algorithms are proposed. To detect the overfitting status of the prediction models, the K-Fold Cross Validation [31] approach was used. The parameters of the algorithm have been optimized to improve the accuracy of the predictions. When compared to other approaches, the Deep Neural Network and Bayes algorithms showed the best prediction performance for the shaft power prediction of the main engine, while the Kernel Ridge and Multiple Linear Regression algorithms showed the best prediction performance for fuel consumption prediction. The contributions of this study are summarized below.

Related studies reviewed in the literature usually only estimate one variable (fuel consumption or power), and this study estimated fuel consumption and main engine power variables separately.
In addition to the four models that are mostly discussed in the studies reviewed in the literature, five different data-driven algorithms are used for fuel consumption and power estimation cases.
The performance values of the algorithms before and after parameter optimization are compared and discussed in detail.
A pair plot was used, in addition to the correlation analysis, to analyze the relationships among the variables in the dataset in more detail.

The main sections of the study are as follows: The material and methodology are in Section 2, and the case study is in Section 3. The simulation findings are reported in Section 4, and the results are evaluated and proposed future research is discussed in the conclusion and discussion.

1.2. Related Works

There are various applications of data-driven methods on ships. Hu et al. argued that estimating ship fuel consumption requires a two-stage strategy. Data collection and processing operations are carried out in the first step, and trim optimization is suggested in the second. Furthermore, trim optimization has been claimed to reduce carbon emissions [32]. According to Vettor and Soares, depending on the route, weather conditions would affect sea wave conditions, which would affect fuel consumption. A 90% success rate was achieved in the container ship fuel consumption estimation study [33]. Zhou et al. estimated fuel consumption using machine learning algorithms. In the fuel consumption estimation study, the ANN, SVR, Lasso, and Random Forest algorithms proposed hyperparameter optimization for optimizing the methods and used this process for four methods. They observed that hyperparameter optimization increased prediction success by 0.0773% to 2.1653% as a result of the simulations [34]. Yan et al. were able to estimate fuel consumption with the Random Forest algorithm with an error of about 7% in their study [35]. Yuan et al. suggested that ship fuel consumption is critical for factors such as energy management, cruise planning, and smart decision-making. In the study, environmental factors, water depth, and various sensor data were used [36]. Tien Anh Tran used machine learning and the Monte Carlo method for fuel consumption estimation. He also performed the estimation process with ANN and Multiple Linear Regression methods [37]. Karagiannidis and Temelis claimed that knowing the actual positions of the hull and propeller parts of the ship would contribute to operational energy efficiency, and they argued that the shaft power and fuel consumption values are important in terms of energy efficiency. In the estimation study, only an Artificial Neural Network model was used [38]. Fan et al., in a literature review, divided the available fuel consumption estimation methods into three classes. In addition, they discussed the factors affecting fuel consumption on board [39]. Ferreira et al. used Decision Tree, Artificial Neural Network, and Random Forest Regression methods for ship propulsion power estimation in their study [40]. In their study, Lee et al. were able to predict ship power using an Artificial Neural Network with an error margin of 3.5% to 4% [29]. According to related studies, power was not estimated in studies that estimated fuel consumption, and fuel consumption was not estimated in studies that estimated power, and both values were estimated in this study. Furthermore, the methods investigated in the literature are primarily concerned with the classical Artificial Neural Network structure, as well as the Random Forest, SVM, Decision Tree, and Multiple-Linear Regression algorithms. In this study, nine different data-driven algorithms, including the Deep Neural Network method, were used to estimate fuel consumption and power.

2. Materials and Methods

Data-driven methods were used to estimate the electrical power and fuel consumption values on a commercial ship in this study. To begin, the dataset is divided into two sections, including training and testing. The training set was used to develop the prediction models, while the test set was used to calculate the algorithms’ prediction success. The data-driven methodologies can be used to calculate the propulsion power and fuel consumption for a ship cruise. Support vector regression (SVR) was used in a study in this area to predict propulsion power more accurately than conventional methods [5,41]. Another study indicated that machine learning approaches outperform the ANN method in specific cases for predicting shaft power onboard ships using AIS data and weather data [42]. For shipping operational optimization, Leifsson et al. combined gray-box and white-box models with ANN. The gray-box model was discussed in this research as having certain advantages over other techniques for a container vessel [43]. Petersen et al. argue that propulsion power plays an important role in ship fuel economy, and they utilize Artificial Neural Networks and statistical models to estimate propulsion power, demonstrating that both techniques provide good results [44].

To investigate the energy efficiency of a container ship, various data-driven models were used to predict shaft power and fuel consumption factors, and Figure 1 depicts the study’s approach. The first 700 days of voyage data from a container ship were gathered for the estimating procedure. These figures were compiled from 75 distinct data sources aboard, including various equipment. Figure 2 shows the findings of the Pearson Correlation Analysis. The study revealed that several factors in the data set had a higher correlation with the shaft power and fuel consumption variables. The correlation matrix was examined, and data with poor correlation with these variables were excluded from the analysis and estimation procedures. To better comprehend the relationship between power and fuel consumption, a pair plot was created, with the highest correlations illustrated in Figure 3. The data set was randomly picked by the computer as training data (66 percent) and test data (33 percent) and separated into two parts after it was processed [45]. The training data was utilized for training the algorithms, while the remaining test data was not used. The outcomes of the algorithm-based prediction method were compared to the actual test data. The estimation was done using data-driven techniques such as Multiple Linear Regression, Ridge Regression, Lasso Regression, Kernel Ridge Regression, Elastic Net, Artificial Neural Network, XGradient Boosting, Deep Neural Network, and Bayesian Regression. Since the expected results were not obtained from the estimation in the first stage of the prediction process, the parameters of the algorithms were changed to increase the algorithm’s performance. To validate the findings and detect overfitting, the K-Fold Cross Validation method was used [46]. The algorithm’s results were then compared using error metrics such as Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R²) [47]. Figure 1 shows the steps done in further detail. This section is organized as follows; Section 2.1 and Section 2.2 explain the data collection and data pre-processing phases. Section 2.3 describes the stages of model development and prediction. Validation and evaluation techniques are introduced in Section 2.4.

2.1. Data Collection

The data collection process is a major challenge for data-driven studies on scientific procedures [48]. The vessel in this analysis has a length of 328 m, a width of 46 m, and a draft of 9.7 m. Propulsion power is provided by the main engine, model 10S90MEC9. This main engine has nine cylinders and is constructed with two strokes. The data was compiled by merging the engine logbook, noon report, and main engine sensors from a cargo ship. Engine power (%), main engine shaft rotating per minute, fuel oil consumption (t/d), main engine shaft torque (kNm), main engine shaft power (kW), temperature values of main engine jacket cooling water, main engine jacket freshwater, thrust pad, scavenging air, cylinder, and other parameters are included in this dataset. Table 1 shows the statistical analysis of a part of the data set. Data was received via three diesel generators, one shaft generator, and one emergency generator. The microgrid of this ship is represented in Figure 2.

2.2. Data Pre-Processing

Correlation Analysis

Understanding how data-driven methods work requires an understanding of the correlation. Thanks to this, it is revealed how the variables in the data set are related to each other. Data-driven methods can also predict the target variable by using these variables’ relationships. Another important factor that draws attention here is the degree of correlation. If the correlation value is close to zero, it is called a low correlation. In other words, it can be said that the two variables affect each other slightly or not at all. Suppose the correlation is between zero and −1 and closer to this value. In that case, a strong negative correlation occurs, meaning an inverse correlation exists between the two related variables in the dataset. The closer the correlation value is to 1, the greater the degree of correlation between the two variables. In this case, as one of the two variables changes, the other will change in parallel with the value of this variable [49].

The relationship between any two variables was determined using correlation analysis, which is a method for analyzing and illustrating the relationship between variables [50]. The Pearson Correlation Coefficient is commonly utilized and calculated in this investigation [45]. A good correlation exists when the coefficient is positive; however, an inverse correlation is observed when the sign is negative. When there is a relationship between two variables, a linear shape will emerge in the pair plot of these variables. If there is no correlation, the pair plot of these two variables will not have a linear shape. The Pearson Correlation Analysis is illustrated in Figure 3.

The data of main engine shaft speed (rpm), main engine scavenging air temperature, main engine thrust pad temperature, and main engine fuel oil consumption form a strong correlation with the power, which can be noticed when the correlation matrix is studied. The pair plot in Figure 4 provides a more detailed examination of the association between these variables.

The power does not vary until the shaft speed is around 35 rpm, as shown in the pair plot. After this value, it can be concluded that power and shaft rpm have a significant connection. The main engine scavenging air temperature and main engine fuel oil consumption statistics form a correlation with the power, as can be seen in Figure 4. Further, up to 48 °C, the main engine thrust pad temperature data has no effect on the power, and beyond that, there is a link between them.

2.3. Model Development and Prediction

2.3.1. Multiple Linear Regression

Multiple Linear Regression is a frequently used algorithm in machine learning applications and is a statistical method that predicts the dependent variable from the independent variables [51]. Equation (1) is used for Multiple Linear Regression [52].

y = a_{0} + a_{1} x_{1} + \dots + a_{n} x_{n}

(1)

where

a_{0}, a_{1}, \dots a_{n}

are coefficients, y is the dependent variable, and

x_{1}, x_{2}, \dots, x_{n}

are independent variables. In this method,

a_{n}

(coefficients) are calculated as

a_{n} = {argmin}_{(a)}^{(\sum_{i = 1}^{n} {(y_{i} - a_{0} - \sum_{j = 1}^{n} a_{j} x_{ij})}^{2})}

(2)

2.3.2. Ridge Regression

The Ridge Regression algorithm (RR) is a method that is generally used for coefficient estimation and sometimes does this according to the least-squares method [53]. In this method, the coefficients (

a_{n}

) are found with the following, Equation (3).

a_{n} = {argmin}_{(a)}^{(\sum_{i = 1}^{n} {(y_{i} - a_{0} - \sum_{j = 1}^{D} a_{j} x_{ij})}^{2} + (µ \sum_{j = 1}^{D} a_{j}^{2}))}

(3)

In this equation,

µ > 0

is a regularisation hyperparameter [54].

2.3.3. Lasso Regression

LASSO emerged as a variable selection method based on the least-squares method [55]. In this method, the least-squares method is used to find the coefficient

(a_{n})

. The equation for finding the coefficient with this method is given below (4).

a_{n} = {argmin}_{(a)}^{(\frac{1}{2} \sum_{i = 1}^{n} {(y_{i} - a_{0} - \sum_{j = 1}^{D} a_{j} x_{ij})}^{2} + (µ \sum_{j = 1}^{D} |a_{j}|))}

(4)

2.3.4. Kernel Ridge Regression

The Kernel Ridge algorithm is an improved version of the Ridge regression method [56]. The equations of this algorithm are given in Equations (5) and (6) below.

F (x) = y = \sum_{i = 1}^{n} Ɛ_{i} K (x, x_{i})

(5)

for this equation, K is the kernel function of the algorithm, and

Ɛ_{i}

is the weight, which is calculated as:

Ɛ_{i} = (K + µ l) y .

(6)

In this equation, the regularization parameter is

µ,

and the identity matrix is

l

,

y = (y_{1}, y_{2}, \dots, y_{n}

)^T.

2.3.5. Elastic Net

In this method, regularization parameters (

µ

) come from LASSO and Ridge algorithms. Hyperparameters (

a

and

{µ l}_{ratio}

) of this algorithm are used in the equations below, Equations (7) and (8) [57].

a = µ_{Ridge} + µ_{LASSO}

(7)

{µ l}_{ratio} = \frac{µ_{LASSO}}{a}

(8)

2.3.6. Bayesian Regression

This method has emerged as a result of applying the Bayesian approach to parameter selection in the linear regression algorithm. In this method, if the error values are in a normal distribution, the model parameters can be obtained by examining the previous situation [58].

2.3.7. Artificial Neural Network

The Artificial Neural Network (ANN) is a popular tool for solving regression and classification issues. During the model construction phase, the human brain system structure is emulated [59]. When looking at the model structure, there are three layers: the input layer, the hidden layer, and the output layer. When the layers are investigated, it is discovered that the information generated in each layer is multiplied by weight coefficient w and sent to the next layer [60]. Figure 5 depicts a typical neural network structure.

2.3.8. X-Gradient Boosting Regression

The XGradient Boosting method, introduced by Chen and Guestrin as an improved form of the gradient boosting algorithm, is a decision-tree-based statistical method [61]. The XGBoost is an effective statistical method that can provide accurate and high-speed solutions to data-driven problems [62]. Due to the high efficiency, speed, and flexibility of this method, its use has increased in recent years [63].

2.3.9. Deep Neural Network

In recent years, the Deep Neural Network approach has helped to popularize data-driven solutions in a variety of sectors [64,65,66,67]. Unlike Artificial Neural Networks, the success rate of this technology has grown as the number of layers has increased [68,69,70,71]. The Deep Neural Network approach, which has gained prominence in applications like image recognition and cyber security, has also demonstrated its effectiveness in regression problems [72,73,74]. Figure 6 depicts a typical Deep Neural Network structure.

2.4. Validation and Evaluation

2.4.1. K-Fold Cross-Validation

K-Fold Cross-Validation was used as the validation method to verify the success of the algorithm in estimating and detecting the overfitting problem [75]. As can be seen in Figure 7, the data set is divided into five equal parts. One of these parts was used for validation, one was used as test data, and the other three were used as training data [76]. The process continues until all data in the data set is processed. The average of the results obtained from the operations performed was taken as the validation score [77].

2.4.2. Error Metrics

In this study, error metrics were used to evaluate the success of machine learning algorithms in the evaluation phase [78]. Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Coefficient of Determination (R²) are error metrics used to evaluate the success of algorithms.

Root Mean Squared Error

One measure of the difference between the real values in the data set and the values predicted by the algorithms is called Root Mean Squared Error (RMSE) [79]. The calculation of the RMSE error metric is given in Equation (9):

RMSE (A, P) = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(A_{i} - P_{i})}^{2}}

(9)

In this equation, A is the actual value, and P is the predicted value.

b.: Mean Absolute Error

The measure of the absolute value of the distance between the real values in the data set and the values predicted by the algorithms is called Mean Absolute Error (MAE) [80]. The calculation of the MAE error metric is shown in Equation (10):

MAE (A, P) = \frac{1}{n} \sum_{i = 1}^{n} |A_{i} - P_{i}|

(10)

where, A is the actual and P is the predicted data.

c.: Coefficient of Determination

Another measure of the distance between predicted values and actual values is called the Coefficient of Determination (R²) [81]. The equation for calculating R² is given below (11):

R^{2} (A, P) = 1 - \frac{\sum_{i = 1}^{n} {(A_{i} - P_{i})}^{2}}{\sum_{i = 1}^{n} {(A_{i} - k)}^{2}}

(11)

In this equation, A is the actual value, P is the predicted value, and k is the mean of the actual values.

2.5. Case Study

In this study, the main engine shaft power and fuel consumption were estimated using Python 3.7.7 and the Spyder 4.1.5 interface in the TensorFlow 2.0 environment. The data set is divided into training and test data in research that uses data-driven methodologies. Depending on the size of the data collection, the ratio between training and test data sets may change because this ratio is commonly employed in research in this field [45,80,82]. To be consistent with the literature, it was decided to utilize this ratio in this study. The computer randomly selected the dataset (700 days) for the calculation of shaft power using machine learning methods and divided it into two parts: training (2/3) and test data (1/3). The computer was taught 467 days of voyage data as training data, and the models assessed the vessel’s shaft power and fuel consumption variables in 233 days of voyage data (test data). To compare algorithm success, three alternative error metric approaches were utilized.

3. Simulation Results

As a result of the predictions, which are utilized as error metrics produced to determine the success of the algorithms, some of the algorithms did not generate the required results at the start of the prediction stage, according to the error metric values (RMSE, MAE, and R²). The failing algorithms’ hyperparameters were tuned using the ‘Grid Search’ approach for the prediction process. Tuned hyperparameters illustrated in Table 2. To define an overfitting condition and validate algorithms, the K-Fold Cross Validation method is applied. One part of the data set was used as test data, one part was used as validation data, and the remaining three parts were used as training data. This procedure was repeated until all of the data in the data set had been processed (5 iterations). The mean MAE error metric values found in all iterations were averaged, and the average validation score was determined when the 5th iteration was completed.

Table 3 and Table 4 exhibit cross-validation findings, whereas Table 5 and Table 6 reveal the error metric values for the primary findings, and Table 7 and Table 8 show the final findings for the case studies. After the power estimation and fuel consumption, the MAE, R² and RMSE error metrics were determined.

In Figure 8 and Figure 9, 30 days of data were randomly picked from 233 days of test data, and the predictions generated by the algorithms were plotted to evaluate the prediction success of machine learning algorithms. Figure 8 and Figure 9 provide comparison graphs of estimated and real power and fuel consumption.

4. Conclusions and Discussion

In maritime industries, data-driven algorithm techniques have been employed in areas such as wind speed, wave height, wind direction, ship detection, wave direction, ship speed, and ship fuel consumption. In this study, nine different data-driven algorithms were effective in estimating the container vessel’s main engine power and fuel consumption. In this study, the methods that were determined to be frequently used in the literature were examined first, then methods other than the classical algorithms that were thought to improve the novelty of the study were added, and finally, a different approach, such as DNN, was tried for this specific case. These methods also enriched and added to the research’s originality. This investigation employed real voyage data, and a feasible approach is proposed for determining the main engine power and fuel consumption variables required in energy efficiency calculations via utilizing the real dataset rather than complicated formulas. For power prediction, simulations revealed that the Deep Neural Network technique outperformed other systems. The Multiple Linear Regression approach, on the other hand, performed better in the situation of fuel consumption. These findings demonstrated that data-driven algorithms could accurately forecast the main engine shaft power and fuel consumption in ships.

Error metrics are a quantified expression of how close the estimates are to the actual values. This way, the prediction successes of the algorithms used can be compared, and studies can be made to develop the models. Three different error metrics were used to determine the success of the models created in this study. The error metric values obtained from the simulations (Table 5 and Table 6) and the effect of the parameter optimization process’s impacts on the models’ performance are discussed below.

When Table 5 and Table 6 are compared to Table 7 and Table 8, it is clear that the algorithms do not produce satisfactory results in the first simulations. Therefore, for simulations related to power estimation, while the Ridge achieved 1.237432 RMSE, 1.512512 MAE, and 0.659222 R² values in the initial simulations, parameter optimization resulted in error metric values of 0.500782 RMSE, 0.517583 MAE, and 0.999965 R². When the Lasso is analyzed, 1.264545 RMSE, 1.127521 MAE, and 0.775724 R² can be obtained as a result of the first simulations, while these values are updated as 0.299883 RMSE, 0.260465 MAE, and 0.999971 R² after parameter optimization. In the first simulations, the error metric values that were 0.009053 RMSE, 0.095750 MAE, and 0.993211 R² in the XGradient Boosting model reached 0.129474 RMSE, 0.114669 MAE, and 0.999871 R². If a comparison is made for the Elastic Net method; It can be said that the values of 1.203749 RMSE, 1.097155 MAE, 0.647199 R² reached 0.082140 RMSE, 0.154781 MAE, 0.999991 R². When the ANN algorithm’s performance values before and after optimization are compared, it can be seen that the values of 0.801357 RMSE, 0.892518 MAE, and 0.703928 R² have been updated to 0.001547 RMSE, 0.001621 MAE, and 0.999992 R². When the simulation results of the DNN algorithm are compared, it can be said that the error metric values of 0.684111 RMSE, 0.827112 MAE, and 0.724955 R² reached 0.000001 RMSE, 0.000987 MAE, and 0.999999 R². Similarly, when the fuel consumption estimation simulation results are examined, the algorithms can be said to have produced more successful results after the parameter tuning process.

When the study is evaluated in terms of limitations, the data set cannot be obtained in real-time due to maritime industry conditions and does not consist of many samples. If the dataset contains a much larger number of samples, more powerful models can be built. Furthermore, the difficulty of obtaining instant data from commercial ships making intercontinental voyages with current maritime technology stands out as a problem that must be solved in the coming years. When this issue is resolved, the use of real-time applications in maritime industries can be expanded. In this way, real-time power and fuel consumption estimation and optimization studies can be performed with data-driven approaches.

Container ships cruise 200–250 days per year on active voyages, and their commercial life varies depending on maintenance conditions but is normally between 30 and 40 years. The information gathered for this study represents around 10% of the ship’s commercial life. As these technologies become more frequently employed, the number and descriptiveness of data sets will grow even more, which is promising for the marine sector. The dataset comprised a variety of situations in which the ship’s propulsion power and fuel consumption were estimated based on the ship’s encounters in these severe conditions, and it was proven that these variables could even be calculated based on the ship’s encounters in these extreme conditions. To improve the model’s reliability and comprehensibility in future studies, the number and types of vessels will be expanded. Furthermore, by applying data-driven methodologies for load prediction in the generators employed onboard, the ship’s electrical load can be accurately examined, and faults in the generators may be averted in advance.

Author Contributions

Conceptualization, Investigation, Methodology, Software, Validation, Formal analysis, Writing—original draft, Review & Editing, T.U. and Y.Y.; Supervision, Validation, Writing—review & Editing, Ö.K., Y.A., Y.T., C.-L.S. and J.M.G. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by The Scientific and Technological Research Council of Turkey BIDEB 2214-A International Doctoral Research Fellowship Programme. The research completed by Chun-Lien Su was funded by the Ministry of Science and Technology of Taiwan under Grant MOST 107-2221-E-992-073-MY3.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

UNCTAD. Review of Maritime Transport 2017; United Nations: New York, NY, USA, 2017. [Google Scholar]
Demirel, Y.K.; Song, S.; Turan, O.; Incecik, A. Practical Added Resistance Diagrams to Predict Fouling Impact on Ship Performance. Ocean Eng. 2019, 186, 106112. [Google Scholar] [CrossRef]
International Maritime Organization(IMO). MEPC 214 63; International Maritime Organization(IMO): London, UK, 2012. [Google Scholar]
IMO. International Management Code for the Safe Operations of Ships and for Pollution Prevention, A741(18), International Safety Management (ISM) Code; IMO: London, UK, 1993. [Google Scholar]
Kim, K.S.; Roh, M. Il Iso 15016:2015-Based Method for Estimating the Fuel Oil Consumption of a Ship. J. Mar. Sci. Eng. 2020, 8, 791. [Google Scholar] [CrossRef]
International Maritime Organization(IMO). Guıdelınes on the Method of Calculatıon of the Attaıned Energy Effıcıency Desıgn Index (EEDI) for New Shıps; International Maritime Organization(IMO): London, UK, 2012. [Google Scholar]
Armstrong, V.N.; Banks, C. Integrated Approach to Vessel Energy Efficiency. Ocean Eng. 2015, 110, 39–48. [Google Scholar] [CrossRef] [Green Version]
Jeon, M.; Noh, Y.; Shin, Y.; Lim, O.K.; Lee, I.; Cho, D. Prediction of Ship Fuel Consumption by Using an Artificial Neural Network. J. Mech. Sci. Technol. 2018, 32, 5785–5796. [Google Scholar] [CrossRef]
Li, S.; Yang, P.; Liu, L.; Chen, L.; Bi, L.; Cui, G.; Zhang, C. Research on Grid-Connected Operation of Novel Variable Speed Constant Frequency (VSCF) Shaft Generator System on Modern Ship. In Proceedings of the 2012 15th International Conference on Electrical Machines and Systems (ICEMS), Sapporo, Japan, 21–24 October 2012; pp. 2–6. [Google Scholar]
Wu, Z.; Xia, X. Tariff-Driven Demand Side Management of Green Ship. Sol. Energy 2018, 170, 991–1000. [Google Scholar] [CrossRef] [Green Version]
Kobayashi, E.; Yoneda, S.; Morita, A. Advanced Route Optimization in Ship Navigation. In Proceedings of the 2014 4th International Conference On Simulation And Modeling Methodologies, Technologies And Applications (SIMULTECH), Vienna, Austria, 28–30 August 2014; pp. 572–577. [Google Scholar] [CrossRef]
Psaraftis, H.N. Speed Optimization vs Speed Reduction: The Choice between Speed Limits and a Bunker Levy. Sustainability 2019, 11, 2249. [Google Scholar] [CrossRef] [Green Version]
Thomson, H.; Corbett, J.J.; Winebrake, J.J. Natural Gas as a Marine Fuel. Energy Policy 2015, 87, 153–167. [Google Scholar] [CrossRef] [Green Version]
Shang, C.; Fu, L.; Bao, X.; Xu, X.; Zhang, Y.; Xiao, H. Energy Optimal Dispatching of Ship’s Integrated Power System Based on Deep Reinforcement Learning. Electr. Power Syst. Res. 2022, 208, 107885. [Google Scholar] [CrossRef]
Musbah, H.; Ali, G.; Aly, H.H.; Little, T.A. Energy Management Using Multi-Criteria Decision Making and Machine Learning Classification Algorithms for Intelligent System. Electr. Power Syst. Res. 2022, 203, 107645. [Google Scholar] [CrossRef]
Rodrigues, T.A.; Neves, G.S.; Gouveia, L.C.S.; Abi-Ramia, M.A.; Fortes, M.Z.; Gomes, S. Impact of Electric Propulsion on the Electric Power Quality of Vessels. Electr. Power Syst. Res. 2018, 155, 350–362. [Google Scholar] [CrossRef]
Hansen, E.K.; Rasmussen, H.B.; Lützen, M. Making Shipping More Carbon-Friendly? Exploring Ship Energy Efficiency Management Plans in Legislation and Practice. Energy Res. Soc. Sci. 2020, 65, 101459. [Google Scholar] [CrossRef]
Psaraftis, H.N.; Kontovas, C.A. Ship Speed Optimization: Concepts, Models and Combined Speed-Routing Scenarios. Transp. Res. Part C Emerg. Technol. 2014, 44, 52–69. [Google Scholar] [CrossRef] [Green Version]
Rahman, S.; Zakaria, N.M.G. Development of an Energy Efficiency Design Index for Inland Vessels of Bangladesh. J. Mar. Sci. Appl. 2020, 19, 275–282. [Google Scholar] [CrossRef]
Perera, L.P.; Mo, B. Ship Speed Power Performance under Relative Wind Profiles in Relation to Sensor Fault Detection. J. Ocean Eng. Sci. 2018, 3, 355–366. [Google Scholar] [CrossRef]
Luo, S.; Ma, N.; Hirakawa, Y. Evaluation of Resistance Increase and Speed Loss of a Ship in Wind and Waves. J. Ocean Eng. Sci. 2016, 1, 212–218. [Google Scholar] [CrossRef] [Green Version]
Kim, S.H.; Roh, M.I.; Oh, M.J.; Park, S.W.; Kim, I.I. Estimation of Ship Operational Efficiency from AIS Data Using Big Data Technology. Int. J. Nav. Archit. Ocean Eng. 2020, 12, 440–454. [Google Scholar] [CrossRef]
Kanellos, F.D.; Anvari-Moghaddam, A.; Guerrero, J.M. A Cost-Effective and Emission-Aware Power Management System for Ships with Integrated Full Electric Propulsion. Electr. Power Syst. Res. 2017, 150, 63–75. [Google Scholar] [CrossRef] [Green Version]
Uyanık, T.; Karatuğ, Ç.; Arslanoğlu, Y. Machine Learning Approach to Ship Fuel Consumption: A Case of Container Vessel. Transp. Res. Part D Transp. Environ. 2020, 84, 102389. [Google Scholar] [CrossRef]
Yalman, Y.; Uyanık, T.; Atlı, İ.; Tan, A.; Bayındır, K.Ç.; Karal, Ö.; Golestan, S.; Guerrero, J.M. Prediction of Voltage Sag Relative Location with Data-Driven. Energies 2022, 15, 6641. [Google Scholar] [CrossRef]
Velasco-Gallego, C.; Lazakis, I. A Real-Time Data-Driven Framework for the Identification of Steady States of Marine Machinery. Appl. Ocean Res. 2022, 121, 103052. [Google Scholar] [CrossRef]
Yuan, J.; Nian, V. Ship Energy Consumption Prediction with Gaussian Process Metamodel. Energy Procedia 2018, 152, 655–660. [Google Scholar] [CrossRef]
Bakar, N.N.A.; Bazmohammadi, N.; Çimen, H.; Uyanik, T.; Vasquez, J.C.; Guerrero, J.M. Data-Driven Ship Berthing Forecasting for Cold Ironing in Maritime Transportation. Appl. Energy 2022, 326, 119947. [Google Scholar] [CrossRef]
Lee, J.B.; Roh, M.I.; Kim, K.S. Prediction of Ship Power Based on Variation in Deep Feed-Forward Neural Network. Int. J. Nav. Archit. Ocean Eng. 2021, 13, 641–649. [Google Scholar] [CrossRef]
Xia, Z.; Song, Y.; Ma, J.; Zhou, L.; Dong, Z. Research on the Pearson Correlation Coefficient Evaluation Method of Analog Signal in the Process of Unit Peak Load Regulation. In Proceedings of the 2017 13th IEEE International Conference on Electronic Measurement & Instruments (ICEMI), Yangzhou, China, 20–22 October 2017; pp. 522–527. [Google Scholar] [CrossRef]
Choi, J.; Gu, B.; Chin, S.; Lee, J.S. Machine Learning Predictive Model Based on National Data for Fatal Accidents of Construction Workers. Autom. Constr. 2020, 110, 102974. [Google Scholar] [CrossRef]
Hu, Z.; Zhou, T.; Zhen, R.; Jin, Y.; Li, X.; Osman, M.T. A Two-Step Strategy for Fuel Consumption Prediction and Optimization of Ocean-Going Ships. Ocean Eng. 2022, 249, 110904. [Google Scholar] [CrossRef]
Vettor, R.; Guedes Soares, C. Reflecting the Uncertainties of Ensemble Weather Forecasts on the Predictions of Ship Fuel Consumption. Ocean Eng. 2022, 250, 111009. [Google Scholar] [CrossRef]
Zhou, T.; Hu, Q.; Hu, Z.; Zhen, R. An Adaptive Hyper Parameter Tuning Model for Ship Fuel Consumption Prediction under Complex Maritime Environments. J. Ocean Eng. Sci. 2022, 7, 255–263. [Google Scholar] [CrossRef]
Yan, R.; Wang, S.; Du, Y. Development of a Two-Stage Ship Fuel Consumption Prediction and Reduction Model for a Dry Bulk Ship. Transp. Res. Part E Logist. Transp. Rev. 2020, 138, 101930. [Google Scholar] [CrossRef]
Yuan, Z.; Liu, J.; Zhang, Q.; Liu, Y.; Yuan, Y.; Li, Z. Prediction and Optimisation of Fuel Consumption for Inland Ships Considering Real-Time Status and Environmental Factors. Ocean Eng. 2021, 221, 108530. [Google Scholar] [CrossRef]
Anh Tran, T. Comparative Analysis on the Fuel Consumption Prediction Model for Bulk Carriers from Ship Launching to Current States Based on Sea Trial Data and Machine Learning Technique. J. Ocean Eng. Sci. 2021, 6, 317–339. [Google Scholar] [CrossRef]
Karagiannidis, P.; Themelis, N. Data-Driven Modelling of Ship Propulsion and the Effect of Data Pre-Processing on the Prediction of Ship Fuel Consumption and Speed Loss. Ocean Eng. 2021, 222, 108616. [Google Scholar] [CrossRef]
Fan, A.; Yang, J.; Yang, L.; Wu, D.; Vladimir, N. A Review of Ship Fuel Consumption Models. Ocean Eng. 2022, 264, 112405. [Google Scholar] [CrossRef]
dos Santos Ferreira, R.; Padilha de Lima, J.V.; Caprace, J.D. Comparative Analysis of Machine Learning Prediction Models of Container Ships Propulsion Power. Ocean Eng. 2022, 255, 111439. [Google Scholar] [CrossRef]
Kim, D.; Lee, S.; Lee, J. Data-Driven Prediction of Vessel Propulsion Power Using Support Vector Regression with Onboard Measurement and Ocean Data. Sensors 2020, 20, 1588. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Liu, M.; Zhou, Q.; Wang, X.; Yu, C.; Kang, M. Voyage Performance Evaluation Based on a Digital Twin Model. IOP Conf. Ser. Mater. Sci. Eng. 2020, 929, 012027. [Google Scholar] [CrossRef]
Leifsson, L.T.; Sævarsdóttir, H.; Sigurdsson, S.T.; Vésteinsson, A. Grey-Box Modeling of an Ocean Vessel for Operational Optimization. Simul. Model. Pract. Theory 2008, 16, 923–932. [Google Scholar] [CrossRef]
Petersen, J.P.; Jacobsen, D.J.; Winther, O. Statistical Modelling for Ship Propulsion Efficiency. J. Mar. Sci. Technol. 2012, 17, 30–39. [Google Scholar] [CrossRef]
Coraddu, A.; Oneto, L.; Baldi, F.; Anguita, D. Vessels Fuel Consumption Forecast and Trim Optimisation: A Data Analytics Perspective. Ocean Eng. 2017, 130, 351–370. [Google Scholar] [CrossRef]
Ling, H.; Qian, C.; Kang, W.; Liang, C.; Chen, H. Combination of Support Vector Machine and K-Fold Cross Validation to Predict Compressive Strength of Concrete in Marine Environment. Constr. Build. Mater. 2019, 206, 355–363. [Google Scholar] [CrossRef]
Davoudi, F.; Freeman, S.A.; Mosher, G.A. Evaluating Machine Learning Performance in Predicting Injury Severity in Agribusiness Industries. Saf. Sci. 2019, 117, 257–262. [Google Scholar] [CrossRef]
Feng, Y.; Duives, D.; Daamen, W.; Hoogendoorn, S. Data Collection Methods for Studying Pedestrian Behaviour: A Systematic Review. Build. Environ. 2020, 187, 107329. [Google Scholar] [CrossRef]
Rani, P.; Tripathy, P.P. Thermal Characteristics of a Flat Plate Solar Collector: Influence of Air Mass Flow Rate and Correlation Analysis among Process Parameters. Sol. Energy 2020, 211, 464–477. [Google Scholar] [CrossRef]
Goossens, W.R.A. Review of the Empirical Correlations for the Drag Coefficient of Rigid Spheres. Powder Technol. 2019, 352, 350–359. [Google Scholar] [CrossRef]
Xie, X.; Wu, T.; Zhu, M.; Jiang, G.; Xu, Y.; Wang, X.; Pu, L. Comparison of Random Forest and Multiple Linear Regression Models for Estimation of Soil Extracellular Enzyme Activities in Agricultural Reclaimed Coastal Saline Land. Ecol. Indic. 2021, 120, 106925. [Google Scholar] [CrossRef]
Kapadia, K.; Abdel-jaber, H.; Thabtah, F.; Hadi, W. Applied Computing and Informatics Sport Analytics for Cricket Game Results Using Machine Learning: An Experimental Study. Appl. Comput. Inform. 2019. [Google Scholar] [CrossRef]
Xiaohong, D.; Huajiang, C.; Bagherzadeh, S.A.; Shayan, M.; Akbari, M. Statistical Estimation the Thermal Conductivity of MWCNTs-SiO2/Water-EG Nanofluid Using the Ridge Regression Method. Phys. A Stat. Mech. İts Appl. 2020, 537, 122782. [Google Scholar] [CrossRef]
Chen, H.; Leclair, J. Optimizing Etching Process Recipe Based on Kernel Ridge Regression. J. Manuf. Process. 2021, 61, 454–460. [Google Scholar] [CrossRef]
Wang, S.; Ji, B.; Zhao, J.; Liu, W.; Xu, T. Predicting Ship Fuel Consumption Based on LASSO Regression. Transp. Res. Part D Transp. Environ. 2018, 65, 817–824. [Google Scholar] [CrossRef]
Moreno-Salinas, D.; Moreno, R.; Pereira, A.; Aranda, J.; de la Cruz, J.M. Modelling of a Surface Marine Vehicle with Kernel Ridge Regression Confidence Machine. Appl. Soft Comput. J. 2019, 76, 237–250. [Google Scholar] [CrossRef]
Zhu, W.; Peng, Y. Elastic Net Regularized Kernel Non-Negative Matrix Factorization Algorithm for Clustering Guided Image Representation. Appl. Soft Comput. J. 2020, 97, 106774. [Google Scholar] [CrossRef]
Baldwin, S.A.; Larson, M.J. An Introduction to Using Bayesian Linear Regression with Clinical Data. Behav. Res. Ther. 2017, 98, 58–75. [Google Scholar] [CrossRef] [PubMed]
Choi, S.; Kim, Y.J. Artificial Neural Network Models for Airport Capacity Prediction. J. Air Transp. Manag. 2021, 97, 102146. [Google Scholar] [CrossRef]
Luíza da Costa, N.; Dias de Lima, M.; Barbosa, R. Evaluation of Feature Selection Methods Based on Artificial Neural Network Weights. Expert Syst. Appl. 2021, 168, 114312. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; Volume 42, pp. 785–794. [Google Scholar]
Mateo, J.; Rius-Peris, J.M.; Maraña-Pérez, A.I.; Valiente-Armero, A.; Torres, A.M. Extreme Gradient Boosting Machine Learning Method for Predicting Medical Treatment in Patients with Acute Bronchiolitis. Biocybern. Biomed. Eng. 2021, 41, 792–801. [Google Scholar] [CrossRef]
Nguyen, H.D.; Truong, G.T.; Shin, M. Development of Extreme Gradient Boosting Model for Prediction of Punching Shear Resistance of r/c Interior Slabs. Eng. Struct. 2021, 235, 112067. [Google Scholar] [CrossRef]
Chebel, E.; Tunc, B. Deep Neural Network Approach for Estimating the Three-Dimensional Human Center of Mass Using Joint Angles. J. Biomech. 2021, 126, 110648. [Google Scholar] [CrossRef]
Wu, Y.; Tan, H.; Peng, J.; Zhang, H.; He, H. Deep Reinforcement Learning of Energy Management with Continuous Control Strategy and Tra Ffi c Information for a Series-Parallel Plug-in Hybrid Electric Bus. Appl. Energy 2019, 247, 454–466. [Google Scholar] [CrossRef]
Glavic, M. Annual Reviews in Control ( Deep ) Reinforcement Learning for Electric Power System Control and Related Problems: A Short Review and Perspectives. Annu. Rev. Control 2019, 48, 22–35. [Google Scholar] [CrossRef]
Yin, L.; Gao, Q.; Zhao, L.; Wang, T. Expandable Deep Learning for Real-Time Economic Generation Dispatch and Control of Three-State Energies Based Future Smart Grids. Energy 2019, 191, 116561. [Google Scholar] [CrossRef]
Yoo, G.R.; Owhadi, H. Deep Regularization and Direct Training of the Inner Layers of Neural Networks with Kernel Flows. Phys. D Nonlinear Phenom. 2021, 426, 132952. [Google Scholar] [CrossRef]
Peer, D.; Stabinger, S.; Rodríguez-Sánchez, A. Conflicting_bundle.Py—A Python Module to Identify Problematic Layers in Deep Neural Networks[Formula Presented]. Softw. Impacts 2021, 7, 100053. [Google Scholar] [CrossRef]
Kong, N.C.L.; Kaneshiro, B.; Yamins, D.L.K.; Norcia, A.M. Time-Resolved Correspondences between Deep Neural Network Layers and EEG Measurements in Object Processing. Vision Res. 2020, 172, 27–45. [Google Scholar] [CrossRef] [PubMed]
ArunKumar, K.E.; Kalaga, D.V.; Kumar, C.M.S.; Kawaji, M.; Brenza, T.M. Forecasting of COVID-19 Using Deep Layer Recurrent Neural Networks (RNNs) with Gated Recurrent Units (GRUs) and Long Short-Term Memory (LSTM) Cells. Chaos Solitons Fractals 2021, 146, 110861. [Google Scholar] [CrossRef] [PubMed]
Patruno, L.; Craighero, F.; Maspero, D.; Graudenzi, A.; Damiani, C. Combining Multi-Target Regression Deep Neural Networks and Kinetic Modeling to Predict Relative Fluxes in Reaction Systems. Inf. Comput. 2021, 1, 104798. [Google Scholar] [CrossRef]
Bose, A.; Hsu, C.-H.; Roy, S.S.; Lee, K.C.; Mohammadi-ivatloo, B.; Abimannan, S. Forecasting Stock Price by Hybrid Model of Cascading Multivariate Adaptive Regression Splines and Deep Neural Network. Comput. Electr. Eng. 2021, 95, 107405. [Google Scholar] [CrossRef]
Das, L.; Sivaram, A.; Venkatasubramanian, V. Hidden Representations in Deep Neural Networks: Part 2. Regression Problems. Comput. Chem. Eng. 2020, 139, 106895. [Google Scholar] [CrossRef]
Olson, L.M.; Qi, M.; Zhang, X.; Zhao, X. Machine Learning Loss given Default for Corporate Debt. J. Empir. Financ. 2021, 64, 144–159. [Google Scholar] [CrossRef]
Maepa, F.; Smith, R.S.; Tessema, A. Support Vector Machine and Artificial Neural Network Modelling of Orogenic Gold Prospectivity Mapping in the Swayze Greenstone Belt, Ontario, Canada. Ore Geol. Rev. 2021, 130, 103968. [Google Scholar] [CrossRef]
Wong, T.T. Parametric Methods for Comparing the Performance of Two Classification Algorithms Evaluated by K-Fold Cross Validation on Multiple Data Sets. Pattern Recognit. 2017, 65, 97–107. [Google Scholar] [CrossRef]
Jackson, E.K.; Roberts, W.; Nelsen, B.; Williams, G.P.; Nelson, E.J.; Ames, D.P. Introductory Overview: Error Metrics for Hydrologic Modelling—A Review of Common Practices and an Open Source Library to Facilitate Use and Adoption. Environ. Model. Softw. 2019, 119, 32–48. [Google Scholar] [CrossRef]
Ćalasan, M.; Abdel Aleem, S.H.E.; Zobaa, A.F. On the Root Mean Square Error (RMSE) Calculation for Parameter Estimation of Photovoltaic Models: A Novel Exact Analytical Solution Based on Lambert W Function. Energy Convers. Manag. 2020, 210, 112716. [Google Scholar] [CrossRef]
Frías-Paredes, L.; Mallor, F.; Gastón-Romeo, M.; León, T. Dynamic Mean Absolute Error as New Measure for Assessing Forecasting Errors. Energy Convers. Manag. 2018, 162, 176–188. [Google Scholar] [CrossRef]
Ueki, M. Testing Conditional Mean through Regression Model Sequence Using Yanai’s Generalized Coefficient of Determination. Comput. Stat. Data Anal. 2021, 158, 107168. [Google Scholar] [CrossRef]
Anguita, D.; Ghio, A.; Oneto, L.; Ridella, S. In-Sample and out-of-Sample Model Selection and Error Estimation for Support Vector Machines. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23, 1390–1406. [Google Scholar] [CrossRef]

Figure 1. The methodology of the study.

Figure 2. A general illustration of the ship’s microgrid.

Figure 3. Pearson correlation matrix of the dataset.

Figure 4. Strongest correlation variables with power and fuel consumption variables.

Figure 5. Typical Artificial Neural Network scheme.

Figure 6. Typical Deep Neural Network scheme.

Figure 7. The K-Fold Cross-Validation process.

Figure 8. Comparison of actual power with predictions.

Figure 9. Comparison of fuel consumption with predictions.

Table 1. Statistical input of the dataset in detail.

	Engine Power (%)	Main Engine Shaft Speed (rpm)	Main Engine Fuel Oil Consumption (t/d)	Main Engine Shaft Torque (kN × m)	Main Engine Shaft Power (kW × 102)	…
Mean	21.153	39.324	53.697	1755.146	115.141	…
Std	19.329	28.473	43.503	1408.702	98.616	…
Min	0	0	0	0	0	…
25%	0	26.05	26.5	978.5	52.51	…
50%	20.589	51.1	53.05	1957	105.3	…
75%	38.794	63.7	89.41	2976.25	198.235	…
Max	62.307	72.6	168.26	4254	318.2	…

Table 2. Tuned hyperparameters.

Method	Hyperparameter
Multiple Linear	None
Ridge	alpha = 0.1, solver = ‘lsqr’, tol = 0.00001
Lasso	alpha = 0.4
Kernel Ridge	None
Xgradient Boosting	loss = “ls”, alpha = 0.3
Elastic Net	alpha = 0.1
Bayes	None
Artificial Neural Network	solver = ‘lbfgs’, alpha = ‘0.00001’, max_iteration = 15,000, activation = ‘relu’, hidden_layer_size = 9, power_t = 0.7, validation_fraction = 0.3, batch_size = 110
Deep Neural Network	Epoch = ‘1500’, optimizer = ‘adam’, activation = ‘relu’, hidden_layer_count = 17

Table 3. K-Fold Cross-Validation scores for power prediction.

Method	Validation Score (MAE)
Method	Iter. 1	Iter. 2	Iter. 3	Iter. 4	Iter. 5	Mean
Multiple Linear	0.012862	0.012559	0.012618	0.012797	0.012841	0.012735
Ridge	0.510485	0.495327	0.483942	0.495371	0.453798	0.487785
Lasso	0.231584	0.214795	0.241357	0.214186	0.221935	0.224771
Kernel Ridge	0.012689	0.013487	0.016741	0.013523	0.014652	0.014218
XG. Boosting	0.084215	0.043156	0.076897	0.032481	0.064215	0.060193
Elastic Net	0.135145	0.127468	0.141544	0.134523	0.126524	0.133041
Bayes	0.001847	0.001429	0.001421	0.001627	0.001712	0.001607
ANN	0.173515	0.194257	0.161526	0.178426	0.145795	0.170703

Table 4. K-Fold Cross-Validation scores for fuel consumption prediction.

Method	Validation Score (MAE)
Method	Iter. 1	Iter. 2	Iter. 3	Iter. 4	Iter. 5	Mean
Multiple Linear	0.003076	0.003255	0.003261	0.003279	0.003214	0.003217
Ridge	0.248713	0.255327	0.243942	0.255371	0.253798	0.251113
Lasso	0.561487	0.574795	0.541357	0.564186	0.551935	0.558752
Kernel Ridge	0.002894	0.003487	0.002741	0.003523	0.002652	0.003059
XG. Boosting	0.001678	0.001556	0.001897	0.001481	0.001215	0.001565
Elastic Net	0.544513	0.537468	0.541544	0.534523	0.526524	0.536914
Bayes	0.003811	0.003429	0.003421	0.003627	0.003711	0.003599
ANN	0.001947	0.001894	0.001875	0.001952	0.001971	0.001927

Table 5. Error metric values for power prediction (primary findings).

Method	RMSE	MAE	R²
Multiple Linear	0.000003	0.000996	0.999999
Ridge	1.237432	1.512512	0.659222
Lasso	1.264545	1.127521	0.775724
Kernel Ridge	0.855976	0.925191	0.999993
XGradient Boosting	0.009053	0.095750	0.993211
Elastic Net	1.203749	1.097155	0.647199
Bayes	0.000002	0.000991	0.999999
Artificial Neural Network	0.801357	0.892518	0.703928
Deep Neural Network	0.684111	0.827112	0.724955

Table 6. Error metric values for fuel consumption prediction (primary findings).

Method	RMSE	MAE	R²
Multiple Linear	0.000208	0.001375	0.999999
Ridge	0.177651	0.421492	0.992594
Lasso	2.452164	1.314547	0.894411
Kernel Ridge	0.000216	0.001471	0.999999
XGradient Boosting	1.591357	1.101458	0.903437
Elastic Net	3.947521	0.924571	0.795421
Bayes	0.000299	0.001743	0.999998
Artificial Neural Network	0.091456	0.051465	0.887452
Deep Neural Network	0.021364	0.031451	0.895415

Table 7. Error metric values for power prediction.

Method	RMSE	MAE	R²
Multiple Linear	0.000003	0.000996	0.999999
Ridge	0.500782	0.517583	0.999965
Lasso	0.299883	0.260465	0.999971
Kernel Ridge	0.000621	0.013221	0.999993
XGradient Boosting	0.129474	0.114669	0.999871
Elastic Net	0.082140	0.154781	0.999991
Bayes	0.000002	0.000991	0.999999
Artificial Neural Network	0.001547	0.001621	0.999992
Deep Neural Network	0.000001	0.000987	0.999999

Table 8. Error metric values for fuel consumption prediction.

Method	RMSE	MAE	R²
Multiple Linear	0.000208	0.001375	0.999999
Ridge	0.001875	0.002494	0.999999
Lasso	1.850958	0.536233	0.999905
Kernel Ridge	0.000216	0.001471	0.999999
XGradient Boosting	0.000274	0.001459	0.999771
Elastic Net	2.384741	0.524532	0.998768
Bayes	0.000299	0.001743	0.999998
Artificial Neural Network	0.003248	0.001745	0.999981
Deep Neural Network	0.000368	0.001674	0.999999

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Uyanık, T.; Yalman, Y.; Kalenderli, Ö.; Arslanoğlu, Y.; Terriche, Y.; Su, C.-L.; Guerrero, J.M. Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel. Mathematics 2022, 10, 4167. https://doi.org/10.3390/math10224167

AMA Style

Uyanık T, Yalman Y, Kalenderli Ö, Arslanoğlu Y, Terriche Y, Su C-L, Guerrero JM. Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel. Mathematics. 2022; 10(22):4167. https://doi.org/10.3390/math10224167

Chicago/Turabian Style

Uyanık, Tayfun, Yunus Yalman, Özcan Kalenderli, Yasin Arslanoğlu, Yacine Terriche, Chun-Lien Su, and Josep M. Guerrero. 2022. "Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel" Mathematics 10, no. 22: 4167. https://doi.org/10.3390/math10224167

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Data-Driven Approach for Estimating Power and Fuel Consumption of Ship: A Case of Container Vessel

Abstract

1. Introduction

1.1. Background

1.2. Related Works

2. Materials and Methods

2.1. Data Collection

2.2. Data Pre-Processing

Correlation Analysis

2.3. Model Development and Prediction

2.3.1. Multiple Linear Regression

2.3.2. Ridge Regression

2.3.3. Lasso Regression

2.3.4. Kernel Ridge Regression

2.3.5. Elastic Net

2.3.6. Bayesian Regression

2.3.7. Artificial Neural Network

2.3.8. X-Gradient Boosting Regression

2.3.9. Deep Neural Network

2.4. Validation and Evaluation

2.4.1. K-Fold Cross-Validation

2.4.2. Error Metrics

2.5. Case Study

3. Simulation Results

4. Conclusions and Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI