1. Introduction
In the maritime transport industry, smart ships and reducing ship greenhouse gas emissions have been actively researched [
1]. Smart ships have emerged as information and communications technology, the internet of things, and big data technologies have advanced. Different from a conventional ship, a smart ship is characterized by its ability to use data collected by sensors installed within the ship to self-navigate or to provide appropriate information to assist in the decisions of crew members operating the ship [
2]. Studies on reducing ship greenhouse gas emissions have mainly focused on reducing ship fuel consumption and eco-friendly ships that do not use oil.
Research on smart ships has been actively pursued, and various methodologies and network architectures that can smoothly collect data from ships in operation have been proposed. This research has included areas such as big data collection systems [
3], cyber security considerations [
4], data management to reduce learning costs [
5], framework structures for index systems [
6], surveys of architectures and applications [
7], and priority items for smart shipping [
8]. Furthermore, various data on actual ships is being collected, which is used in research on reducing ship fuel consumption by deep learning or conventional methods to perform knowledge-free path planning with reinforcement learning [
9], energy-saving auto path planning algorithms [
10], energy-saving management systems for smart ships [
11], power scheduling for saving energy with reinforcement learning [
12], and forecasting ship fuel consumption [
13]. However, the application of ship operation mode classification remains unresearched.
Due to the characteristics of ship operations, the operating state of a ship can be broadly classified as At Sea, Stand By, or In Port. In the At Sea state, all of the devices on the ship are connected and powered, and the load changes in each device are small. Stand By refers to the state in which the ship is entering or exiting a port, and it is characterized by large fluctuations in the total power consumption due to changes in the ship speed and the use of auxiliary devices. Lastly, In Port refers to the operating state in which a ship has entered a port and cargo is being loaded or unloaded from the ship. In this state, fewer auxiliary devices are being powered on the ship, and total power consumption and power fluctuations are low. The power fluctuation characteristics of each ship type are as follows:
Container Ship: During the In Port state, the total power consumption and power fluctuations are large if many reefer containers are being carried.
LNG Ship: Cargo pumps are used when loading or unloading crude oil. Therefore, the ship is under the highest power load during the In Port state.
Bulk Ship: Bulk ships with cargo cranes installed are under a very large power load during the In Port state.
The operating states of ships, which are presently being acquired in large quantities, have not, however, been adequately researched. As such, researchers are faced with the problem of needing to label data manually based on ship voyage information to use the collected ship data. Research on ship state classification models that can classify the operating states of ships is required to overcome this issue.
Prior studies have not addressed the development of algorithms or deep learning-based models to classify the operating states of ships. Autoencoders are used in handwriting recognition [
14], anomaly detection [
15], fault diagnosis [
16], and fraud diagnosis [
17] and produce better results in comparison with existing algorithms. Thus, we selected the autoencoder model for classifying ship data.
However, the performance of the stacked autoencoder model depends on proper control of the components. When utilizing an autoencoder-based classification model, model design considerations include:
Which structure has better performance?
What are the appropriate values for the number of hidden layers and the number of neurons in each layer to achieve better performance?
What size of latent layer is suitable? The size of the latent layer has a significant effect on the performance of the classification model.
In order to address these issues, we conducted comparative experiments on the structure of the classification model, the appropriate values for the number of hidden layers and the number of neurons in each layer, and changes in the size of latent layers. We find the best model to classify the ship’s operating state as either At Sea, Stand By, or In Port using actual ship power load data.
This paper makes the following contributions: First, the structure of the first stacked autoencoder model using actual ship data is presented. Second, the performance change of the model according to the components of the stacked autoencoder was investigated. In particular, since there is no previous study that uses actual ship data, we design and perform performance comparison experiments of classification models according to structural changes of the stacked autoencoder. Third, the experimental results are analyzed.
2. Related Works
In the past several years, many studies have applied autoencoder models to solve practical problems. An autoencoder [
18] is a learning neural network model that approximates input and output values to reconstruct them the same, and the main purpose of the autoencoder is to learn informative representations of data using an unsupervised method. Types of autoencoders include the stacked autoencoder, sparse autoencoder, denoising autoencoder, contractive autoencoder, and variational autoencoder [
19].
The stacked autoencoder can learn efficiently to create robust features from training data. Research has been conducted on the benefits of stacked autoencoders to solve their problems. Ghosh et al. [
20] used a stacked autoencoder model to classify human emotional data and achieved good results in categorizing human emotions with a spectrogram dataset. Ghosh’s approach was able to produce better results compared to traditional methods, which could not distinguish between happy and angry people.
Ambaw et al. [
21] compared different conventional neural networks, support vector machines, and stacked autoencoders on the recognition of continuous phase frequency-shift keying under carrier frequency offset, noisy, and fast-fading channel conditions with the based model. In this study, the three features selected for recognition were the approximate entropy of the received signal, the approximate entropy of the received signal’s phase, and the approximate entropy of the instantaneous frequency of the received signal. It was found that the stacked autoencoder performed better than support vector machines and traditional neural networks; a stacked autoencoder model can give a better accuracy result for most signal to noise ratios.
Singh et al. [
22] proposed a stacked autoencoder model to reduce complexity and processing time for detecting epilepsy. The model classified epileptic data into normal, ictal, and preictal. He selected machine learning algorithms such as Bayes Net, Naïve Bayes, multilayer perceptron, radial basis function neural networks, and the C4.5 decision tree classifier as comparison models. He proved that the stacked autoencoder model had the best performance score with the least processing time.
Law et al. [
23] suggested using a cascade of two types of networks, stacked autoencoders and extreme learning machines, for multi-label classification to enhance a stacked autoencoder’s performance. The proposed model was compared with eleven other algorithms with seven datasets. However, she claimed that the model had promising performance.
Aouedi et al. [
24] introduced a stacked sparse autoencoder model to integrate feature extraction and classification processes. The model uses denoising and a dropout technique to enhance feature extraction performance and prevent overfitting. It was proven that the model produced a better output than conventional models.
Deperlioglu [
25] built a stacked autoencoder classification model for heart sound analysis. Traditional methods use data transforms to get the S1 and S2 segments of heart sounds. Deperlioglu’s novel approach was to utilize only a stacked autoencoder model to get segments of heart sounds for direct classification. This model was compared with conventional algorithms. The proposed model’s performance was similar to prior models. According to this study, a stacked autoencoder can be used in the medical field with efficient and effective classification results.
Gokhale et al. [
26] compared a proposed stacked autoencoder model with seven previously established algorithms using ten datasets to find the key genes for cancer. Traditional gene selection approaches using statistical or feature selection methods have accuracy problems. However, the proposed stacked autoencoder-based framework outperformed conventional methods in this study.
Arafa et al. [
27] introduced a reduced noise-autoencoder for solving the problem of imbalanced data in genomic datasets. Arafa’s approach was able to solve the dimensionality problem with the stacked autoencoder with feature reduction and create new low-dimensional data. In addition, the accuracy score was improved by more than eight percent.
6. Conclusions
Artificial intelligence research using ship data is being actively conducted as the shipping market evolves. However, studies have not been performed on classifying ship operating modes, despite previous studies on using ship data to predict power loads. An SAE has the advantage of being able to perform effective classification by analyzing the features of the input data. Therefore, we conducted a pilot study on deep learning models that can classify ship operating modes using an SAE. Furthermore, experiments were performed to compare the performance according to changes in the structure of the SAE and changes in the size of its latent layer. The key points to be verified through research are as follows:
TPR, FPR, accuracy, and MCC were selected as evaluation metrics to perform experiments that compared the performance according to changes in the structure of the SAE and the size of its latent layer. Even though previous studies have not been conducted in this area, performance comparison among the models was possible according to changes in the structure of the SAE and the size of its latent layer.
In the results of the SAE model comparison experiments, the (5–128) structure with a latent layer size of 9 showed the best operating mode classification performance.
The classification performance for the In Port and At Sea modes used in the experiments was generally excellent; however, the classification performance for the Stand By mode was low. Therefore, more data are needed to improve the prediction performance of the Stand By mode.
Through this pilot study, we found that our VAE-based deep learning model can be used to analyze ship operating modes. Furthermore, a model that can be used as a comparison target group for the classification of the actual operating state of the ship in the future was found. Based on this model structure, it will be possible to develop enhanced models. Although the classification performance for the Stand By mode was limited because of the imbalanced data, it was possible to propose a VAE model structure that can maximize the data classification performance of the In Port and At Sea modes. However, further research is required to address some limitations of this study. First, the issue of handling imbalanced data needs to be studied using real ship data; data-level techniques, algorithm-level methods, and hybrid methodologies can be utilized to find the most appropriate method to improve the classification model. Second, research to compare denoising, sparse, and stacking autoencoder models could be carried out to improve classification performance and establish which autoencoder-based model best interprets the features of real ship data.