Next Article in Journal
Too Hot to Handle: Effects of Water Temperature on the Early Life Stages of Gongolaria barbata (Fucales)
Previous Article in Journal
Net Transport Patterns of Surficial Marine Sediments in the North Aegean Sea, Greece
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Adaptive Multimodal Data Vessel Trajectory Prediction Model Based on a Satellite Automatic Identification System and Environmental Data

1
College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
2
School of Inteligent Manufacturing and Mechanical Engineering, Hunan Institute of Technology, Hengyang 421002, China
3
School of Electrical Engineering, North China University of Water Resources and Electric Power, Zhengzhou 450045, China
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2024, 12(3), 513; https://doi.org/10.3390/jmse12030513
Submission received: 22 February 2024 / Revised: 15 March 2024 / Accepted: 17 March 2024 / Published: 20 March 2024

Abstract

:
Ship trajectory prediction is essential for ensuring safe route planning and to have advanced warning of the dangers at sea. With the development of deep learning, most of the current research has explored advanced prediction methods based on historical spatio-temporal Automatic Identification System (AIS) data. However, environmental factors such as sea wind and visibility also affect ship navigation in real-world maritime shipping. Therefore, developing reliable models utilizing multimodal data, such as AIS and environmental data, is challenging. In this research, we design an adaptive multimodal vessel trajectory data prediction model (termed AMD) based on satellite AIS and environmental data. The AMD model mainly consists of an AIS-based extraction network, an environmental-based extraction network, and a fusion block. In particular, this work considers multimodal data such as historical spatio-temporal information and environmental factors. Time stamps and distances are correlated with AIS and environmental data, and a multilayer perceptron and gated recurrent unit networks are used to design multimodal feature extraction networks. Finally, the fusion block realizes the fusion output of multimodal features to improve the reliability of the AMD model. Several quantitative and qualitative experiments are conducted using real-world AIS and multimodal environmental datasets. Numerous experimental results prove that prediction performance using multimodal data can ensure satisfactory accuracy and reliability while exhibiting a positive impact on improving maritime transport services.

1. Introduction

The development of world trade has promoted the prosperity of the maritime transportation industry [1]. Therefore, improving maritime safety and ship transport efficiency has been a global focus. Trajectory prediction is important for improving marine traffic safety and transportation efficiency [2,3]. Because prediction technology can predict the behavior of ships in advance, it can realize functions such as the early warning of maritime risks and the planning of safe routes [4]. However, the complexity of the maritime transport environment leaves room for improvement in forecasting technology. For example, there are many ships in ports and densely populated routes; in addition, limited monitoring methods in distant oceans lead to late warnings. Therefore, researchers have become eager to explore more accurate and efficient methods of predicting the trajectory of ships [5,6,7].
The Automatic Identification System (AIS) is utilized for condition monitoring and the condition data management of seagoing vessels [8,9]. It is characterized by its wide coverage and large data volume. AIS data record important information, such as a ship’s position, course, and speed during navigation, which is extremely valuable for studying advanced models in terms of predicting ship trajectories [10]. In particular, satellite-based AIS systems can overcome the limitations of limited distances from land-based base stations and can monitor the status of oceangoing vessels [11]. Therefore, AIS data are frequently utilized to study advanced models for forecasting the trajectory of ships [12].
Existing methods for predicting vessel trajectory fall into three categories: traditional [13,14], machine learning [15,16], and deep learning [17,18,19]. Traditional approaches primarily rely on empirical and mathematical models following specific physical laws [20,21]. However, the application scenarios of traditional methods depend on boundary conditions. Machine learning methods can improve prediction accuracy by creating complex mathematical models to simulate ship movements [22,23]. However, machine learning approaches need to collect a considerable amount of labeled data and the establishment of appropriate rules. Deep learning methods can extract features from large-scale historical ship trajectory data via neural networks and are good at processing nonlinear and high-dimensional data [24,25,26]. However, the complexity of maritime application scenarios and the changing traffic environment leave considerable room for improvement in existing methods.
Notably, during sea travel, ships are affected by various environmental factors such as sea wind, visibility, temperature, etc. These environmental factors influence the speed, direction, and navigation path of the ship. Therefore, relying solely on AIS data is insufficient to achieve highly reliable forecasting methodology. The Maritime Traffic Services System contains AIS data, which are used to monitor ship statuses and maritime environmental data. Therefore, how to build a reliable model for forecasting a ship’s trajectory in a multimodal data environment, such as AIS and environmental data, still needs to be studied.
In this study, we aim to design an adaptive multimodal data vessel trajectory prediction model (termed AMD) based on satellite AIS and environmental data. This model considers the spatio-temporal information and environmental factors that affect ship navigation at sea. The AMD model can make reliable predictions under multimodal data to ensure the safe navigation of vessels at sea. First, a hybrid correlation of AIS and environmental data is performed based on the closest period and shortest distance. Then, a multilayer perceptron (MLP) and gated recurrent unit (GRU) networks are applied to the AIS- and environmental-based extraction networks, respectively, and modal prediction information is integrated to obtain the final prediction result. Finally, a comparison with current existing research validates the effectiveness of the AMD model. Typical tests and ablation experiments also show the effectiveness of the AMD model.

2. Related Works

In intelligent maritime transportation, numerous research efforts have been made to improve prediction performance. These methods can automatically extract features from vast amounts of data and achieve teh high-precision prediction of ship trajectories through complex pattern recognition [27]. More details about these three mentioned approaches are given in the following sections.

2.1. Traditional Methods

Traditional methods typically rely on empirical and mathematical models. These methods forecast ship trajectories through simulation or statistical analysis. Traditional methods mainly include simulation, statistical, knowledge-based, and control theory-based methods. For example, Xu et al. [20] designed a high-precision, long-period oil tanker trajectory prediction algorithm, which applies the density-based spatial clustering of applications with noise (DBSCAN) clustering to process AIS data, as well as extracts a series of key points representing critical navigation modes. They then developed, in turn, a novel path search algorithm to select one part of these key points to generate a predicted trajectory to a fixed target. Zhang et al. [21] presented a big data analysis strategy for proactively reducing baseline risk. The technique groups environmental parameters utilizing k-means and DBSCAN big data clustering methods, as well as principal component analysis, and it then predicts chosen ship movement dynamics employing multi-output Gaussian process regression. Xiao et al. [28] introduced a novel model that integrates motion modeling and filtering processes. Srivastava et al. [14] presented a lightweight, short-term forecast model based on linear stationary models for ship trajectory prediction and real-time anomaly detection. They integrated the best-fitting auto-regressive integrated moving average (ARIMA) model with window generator models of different sizes and visualizations for recursive real-time predictions. However, these methods often have data quality and model complexity limitations, thus making it difficult to make highly accurate predictions. For example, simulation methods may have difficulty dealing with significant time interval uncertainties. Statistical methods are based on similarity searches and Bayesian inference. Knowledge-based methods require the extraction, storage, and retrieval of knowledge. These limitations point to the need for more advanced prediction techniques.

2.2. Machine Learning Methods

A growing number of studies are focusing on using machine learning methods to ship trajectories. Zhang et al. [29] designed a position prediction approach to increase ship position forecast accuracy in ship traffic engineering by employing the k-nearest neighbors (KNN) algorithm. Sedagha et al. [22] presented a system framework for online maritime traffic monitoring aimed at the real-time tracking of vessels on waterways and the prediction of their subsequent positions. Through employing a decomposition reconstruction process and adaptive segmented error correction, Wei et al. [30] developed a multi-objective heterogeneous integration approach for predicting ship motion. Xiao et al. [31] designed a machine learning model employing physical information to build a gray box model (GBM) to predict the speed of ships crossing the ocean. Dong et al. [23] introduced a new mathematical data integration prediction (MDIP) model for predicting the maneuvering movement of ships. The MDIP model, proposed using mathematical data integration, exhibits greater generalization ability and opens up new ways for predicting the maneuvering movement of ships. However, machine learning methods require extensive data labeling and exhibit a weak learning ability for the dynamic characteristics of ships. Therefore, the training and prediction of machine learning models still face numerous challenges.

2.3. Deep Learning Methods

Deep learning methods are reasonable at utilizing neural networks to extract high-dimensional features from large amounts of data and have been widely employed. Wu et al. [32] designed a combined convolutional long short-term memory (LSTM) and sequence-to-sequence (Seq2Seq) model for improved accuracy. Chen et al. [24] designed a new framework to obtain and predict ship trajectories more accurately using the bidirectional LSTM (BiLSTM) model. Zhao et al. [26] designed a model for predicting ship trajectories using a graph attention network (GAT) and the LSTM model. Guo et al. [33] proposed a multimodal-data-based approach for predicting ship trajectories by incorporating additional hidden states to define complex modes independently. A context-driven, data-driven framework for predicting ship trajectories was presented by Mehri et al. [34]. This framework first enriched the contextual information of a trajectory, including the evaluation methods for annotating a trajectory, and then applied feature selection techniques to solve high-dimensional problems. Finally, selected factors were used in the context-aware LSTM network. However, current deep learning methods still face numerous challenges. For example, they have difficulty effectively fitting complex environmental changes and uncertainty factors. In this study, we investigate how to develop efficient methods for predicting ship trajectories under complex environmental factors.

3. Preliminaries

3.1. Problem Definition

A multimodal dataset containing AIS and environmental data can be defined as time-ordered tuple sequences S = { T , P , E } containing N samples. The sequence contains a list of time points T = { t 1 , , t N } , t 1 < < t N , a discrete space–time sequence P = { p 1 , , p N } , p 1 < < p N , and a discrete environment sequence E = { e 1 , , e N } , e 1 < < e N .
The problem of neural network-based trajectory prediction is the difficulty in learning spatio-temporal mapping from a multimodal dataset that can predict future motion locations based on previously observed features. We adopted a sliding window to rearrange S into a new data representation. The input sequence X g { x i } i = 1 g = { t i } i = 1 g { p i } i = 1 g { e i } i = 1 g R d × g is represented as a sequence with input length g, the output sequence Y l { y i } i = 1 l = { p i } i = 1 l R d × l is defined as a sequence with output length l, and d represents the data dimension.

3.2. Multilayer Perceptron Network

The MLP network comprises input, hidden, and output layers [35,36]. We define the observation sequence X g ais = { x t ais } t = 1 g in the AIS dataset to represent a trajectory spatio-temporal sequence of length g. The internal hidden state is updated as the elements in the input sequence are read sequentially:
h j = σ j ( h j 1 w j + b j ; θ ) .
An activation function with θ -learnable parameters is represented by σ j . In the j-th hidden layer, b j represents bias, while w j represents weight. Finally, the output layer receives the hidden layer state h j as an input to make sequential predictions.

3.3. GRU Network

Figure 1a shows that the GRU network is less complex than LSTM. This difference is because the GRU network uses fewer gates and does not have a separate internal memory. The GRU relies entirely on hidden states as its memory, thus resulting in a simpler architecture. Therefore—compared to LSTM (as shown in Figure 1b), which has a greater number of gates and states—the GRU has fewer parameters and may have less computational complexity. The GRU algorithm allows for faster convergence and internal weight updates [37]. The gating vector can determine whether the information in the long-term sequence is not cleared or removed over time.
We define the observation sequence X g env = { x t env } t = 1 g in the environmental dataset to represent an ecological mode sequence of length g. The GRU algorithm reads the elements in the sequence one by one and updates the internal hidden state. Specifically, the refresh rate at time t is calculated using the following formula:
z t = σ ( W z x t env + U z h t 1 + b z ) ,
where the input vector at time step t, denoted as x t e x t e n v , is the t-th element of the environmental modal input sequence e x t X g e x t e n v , which experiences a linear transformation. In addition, h t 1 saves the previous time step t 1, and W z and U z are weight matrices.
Update gates determine the amount of past information that is needed to propagate into the future. The following expression calculates the reset gate:
r t = σ ( W r x t env + U r h t 1 + b r ) .
The reset gate stores past relevant information through the following formula:
h t = tanh ( W h x t env + r t U h h t 1 + b h ) .
The input x t env and the previous statement h t 1 change linearly through the weight matrices W h and U h ; then, the corresponding element product r t and hadamard product U h h t 1 are calculated. In addition, h t is calculated using the following formula:
h t = ( 1 z t ) h t 1 + z t h t ,
where z t and h t 1 denote the information retained between time steps.

4. Adaptive Multimodal Data Method

4.1. Overview Architecture

To address the trajectory prediction problem described in the previous section, we propose a new AMD model. Our model is designed to map AIS and environmental multimodal data input sequences to the output sequences of specified lengths. In a given time step s, our model aims to learn the following prediction distribution:
p ( y 1 , , y l | x 1 , , x g ) ,
which represents the Y l probability that g multimodal observation data in a given input sequence X g map to the l target trajectory sequences in the future. The neural network model can directly sample the target sequence once the predictive distribution has been learned.
The task of the proposed AMD model is essentially a regression problem. The purpose is to train the adaptive multimodal data neural network-based model M g , l using AIS X g ais and environmental X g env data. An output sequence Y l of length l is generated, thereby maximizing the conditional probability Equation (6), that is
Y l = M g , l ( X g ais , X g env ) = arg max Y p ( Y l | X g ais , X g env ) ,
where the parameterized function M g , l ( X g ais , X g env ; θ ) can be trained from a given multimodal dataset to find the parameter set θ ^ in the training samples nearest to the mapping function in Equation (7), thereby minimizing the following task-relevant error measure L:
θ ^ = arg min θ 1 N i = 1 N L ( M g , l ( X g ais , X g env ) , Y l i ) ,
where N represents the overall sample count, and ( X g ais , X g env ) and Y l i denote the input and target sequences.
Figure 2 shows that the AMD model mainly contains an AIS-based extraction network, a weather-based extraction network, and a fusion block. First, the observation sequence X g ais is read based on the AIS extraction network, and the temporary prediction value Y l ais is obtained through neural network training. Then, the environment observation sequence X g env is read based on the environment extraction network, and the temporary prediction value Y l env is obtained through neural network learning. The FC is the fully connected layer, which is used to transform the matrix learned via the modal data feature extraction network according to the target sequence dimension. Finally, the fusion block fuses the predicted values Y l ais and Y l env to generate the Y l of the final future trajectory.
We learn the mapping relationship M g , l in Equation (9) through AIS- and environmental-based extraction networks. The initial calculation functions are given in the following formulas:
Y l ais = M ais ( X g ais ; θ ais ) ,
Y l env = M env ( X g env ; θ env ) .
A neural network with parameters θ ais and θ env can be described using Equations (9) and (10). A mapping θ ais is defined between the input sequence X g ais and internal expressions Y l ais = { h t ais } t = 1 g , such that h t ais R q is the output of the MLP’s hidden state. θ env maps the input sequence X g env to the internal expression sequence Y l env = { h t env } t = 1 g , such that h t env R q is the fully connected layer output of the hidden state connection GRU.
Finally, through the fusion block pairs Y l ais and Y l env , the mapping relationship M g , l in Equation (7) is fused and learned, and the following output sequence of length l is finally generated:
Y l = F 2 l ( [ Y l ais , Y l env ] ; θ F ) ,
where Equation (11) is a neural network with θ F as a parameter, which maps the input intermediate prediction value to a series of internal representations Y h = { h t fused } t = 1 2 l , such that h t fused R q is the output after fusion and addition.
In our study, the AIS-based extraction and prediction network utilizes the MLP network, and the environmental-based extraction and prediction network employs the GRU network. The following subsections will provide more details about AIS- and environmental-based extraction networks, as well as fusion blocks.

4.2. AIS-Based Extraction Network

The observation sequence in the AIS data is captured via the neural network input layer based on the AIS extraction network. Afterward, the hidden layer learns the weights and parameters, and the fully connected layer outputs predictions. We employ Equation (9) to map X g ais to the output, that is, the hidden sequence H g ais { h t ais } t = 1 g , h t ais R q , and this is iterated through the following neural network function:
h t ais = MLP ( x t ais , h t 1 ais ; θ ais ) .
In Equation (9), Y l ais is a sequence of length l obtained via the hidden layer output vector { h 1 ais , , h g ais } after another connection operation. The AIS-based extraction network encodes the spatio-temporal information from each element h t ais R q .

4.3. Environmental-Based Extraction Network

The GRU neural network uses a environmental-based extraction network to capture the observation sequence in the environmental data. Afterward, the hidden layer learns the weights and parameters, and the fully connected layer outputs the predictions. We employ the GRU network in Equation (10) to map the X g env to the output, that is, the hidden sequence H g env { h t env } t = 1 g , h t e n v R q , and this is iterated through the following neural network function:
h t env = GRU ( x t env , h t 1 env ; θ env ) .
In Equation (10), Y l env is a sequence of length l obtained after performing another connection operation on the hidden layer output vector { h 1 env , , h g env } . Each element h t envs R q encodes the environmental modal information input from the t-th component of the sequence to the environmental-based extraction network.

4.4. Fusion Block

The fusion block of Equation (11) aims to generate future trajectories given the temporarily predicted values Y l ais and Y l env . Mathematically, the fusion prediction block decomposes the fusion probability into ordered conditions to calculate the conditional probability given the temporary prediction value. Given the prediction sequences based on AIS and environment extraction networks, observations and future sequences are assumed to be conditionally independent as follows:
p ( y j | y 1 , , y j 1 ) = F ( [ Y l ais , Y l env ] , u j ; θ F ) ,
where the future trajectory y j is calculated via F given the temporary prediction values Y l ais and Y l env , the observation sequence for time step j, and u j in the fusion prediction block.

4.5. Data Preprocessing

We elaborated on the data preprocessing process with regard to the association between AIS and the environmental data, as well as with respect to the processing of training data.

4.5.1. Correlating AIS and Environmental Data

This study adopted real-world AIS and environmental data. Each row in the AIS dataset records the ship’s spatio-temporal message, including the timestamp ( t i m e ), the maritime mobile service identity ( M M S I ), longitude ( l o n ), latitude ( l a t ), course over ground ( c o g ), and speed over ground ( s o g ). Each row in the environmental dataset records local weather data, including timestamp ( t i m e ), longitude ( l o n ), latitude ( l a t ), wind ( w i n ), visibility ( v i s ), and temperature ( t e m p ). The original AIS data do not contain environmental factors and need to be effectively correlated. Therefore, we utilized the closest and shortest distance methods to correlate the AIS and environmental information.
Firstly, for each AIS data point, the closest environmental message cluster was selected from the environmental dataset based on the timestamp field.
Secondly, the environmental data nearest to the AIS data were selected from the filtered environmental message clusters.
Finally, the filtered environmental data were combined with the AIS data to form new multimodal original data containing both AIS and environmental data.
The above steps were repeated to form a new multimodal AIS and environmental dataset.

4.5.2. Dataset Normalization

To ensure the quality of data utilization, we preprocessed the correlated multimodal dataset. Normalization can eliminate the differences in data across different dimensions in the original dataset while preserving data characteristics as much as possible. The min–max normalization method was applied to normalize the multimodal datasets, as shown in the following formula:
X * = X min x max x min x ,
where X denotes the original data, max x denotes the maximum value, and min x and X * are the minimum and normalized data, respectively.
To avoid distorting the obtained model due to any changes in the regional extent, when normalizing the longitudinal and latitudinal values, we set the longitudinal max x l o n = 180 and min x l o n = −180, as well as the latitudinal max x l a t = 90 and min x l a t = 0. We did not utilize the maximum and minimum longitudinal and latitudinal values within the data of a given area.
Meanwhile, to obtain multiple continuous space–time sub-trajectory sequence data for each ship, a window sliding method based on timestamps was applied to segment the normalized dataset. The length of the observed sequence for each sub-trajectory is g, and the size of the predicted sequence trajectories is l. The proposed AMD model can set a reasonable data input and output length according to the task requirements.

4.6. Implementation Details

This subsection mainly introduces the detailed configuration of the AMD model. Our model was implemented on the Python 3.6 platform, and all experiments were performed on a PC equipped with an Intel Xeon(R) E5-2650L v3 @ 1.80 GHz CPU with 32 GB of RAM. Multivariate input sequences were first transferred into 64 units for analysis using MLP and GRU networks, and this was followed by a step-by-step learning of the output sequences. Using Adam’s algorithm, the batch size was 64 and the number of epochs was 300. The detailed configuration is shown in Table 1. In addition, before training the network, the dataset was split into 70% training and 30% testing sets.

5. Results and Analysis

5.1. Datasets and Evaluation Metrics

This study mainly used AIS and environmental datasets from the geographical area of the Australian and Argentinian oceans. The AIS dataset was obtained from the China Maritime Safety Administration, and the environmental dataset was obtained from the National Centers for Environmental Information Data. Specifically, we selected the data from 1 April to 30 April 2020 to verify this research method. As shown in Figure 3, we chose the marine around Australia and southern Argentina. The detailed latitudinal and longitudinal range of the selected sea areas is shown in Table 2. We verified the effectiveness of the AMD model using the two datasets. After the AMD model was trained on one of the sea areas, it was not directly transferred to the other sea area for direct use. Data preprocessing was performed for AIS and environmental data using the preprocessing method. The results before and after processing are shown in Table 3.
The AIS dataset of the Australian oceans has three characteristics: (1) in some areas, navigation is intensive, and there are scenarios with high maritime traffic complexity; (2) there are intensive ship activities at sea, and the tracks in the Australian marine area are dense, as shown in Figure 3; and (3) many types of ships exist, as described in the following Section 5.3.2, and the dataset includes eight types of ships, with each ship’s primary business at sea being different. Given these marine areas with dense routes, various ship types, and high complexity, there is an urgent need to improve maritime traffic safety and rationally plan routes for maritime ship navigation to avoid ship collisions.
This study mainly used R M S E , M A P E , and M A E metrics to evaluate the usefulness of the AMD model. An error between the observed and predicted value was calculated to measure the quality of the vessel trajectory prediction task. Specifically, the aforementioned metric formulas were computed as follows:
R M S E = 1 N i = 1 N l a t ^ i l a t i 2 + l o n ^ i l o n i 2 ,
M A P E = 100 % N i = 1 N l a t ^ i l a t i l a t i + l o n ^ i l o n i l o n i ,
where l a t and l o n are the observed values, and l a t ^ and l o n ^ are the predicted values. N denotes the total samples. A smaller error value means a better prediction.
Furthermore, we computed the distance between the predicted and observed values employing the haversine formula. The half-positive vector distance of any two trajectory points, such as p 1 and p 2 , is calculated as follows:
d i s H = 2 R arcsin sin 2 l a t 2 l a t 1 2 + cos l o n 1 cos l o n 2 sin 2 l o n 2 l o n 1 2 ,
where R denotes the Earth’s radius. Using nautical miles ( n m i ) as the distance measurement unit, we define the mean square error M A E for N sample points to evaluate the AMD model performance better:
M A E = 1 N i = 1 N d i s H y ^ i , y i .
A predicted trajectory value is denoted by y ^ i , while a measured (ground truth) trajectory value is denoted by y i .

5.2. Quantitative Analysis

For a detailed comparison, we compared the AMD model against the current work on vessel trajectory prediction. We used the LSTM [38], GRU [39], ANN [40], BLSTM [41], EncDec [42], MPLSTM [43], and ST-Seq2seq [44] models as the baseline models. In particular, LSTM and GRU are variant models of RNN networks, and several other models are effective trajectory prediction methods that were built using neural networks. Each model sets its learning rate for training according to the baseline model. The detailed configuration is shown in Table 4. The models were trained on the training set to obtain trainable models, and these were then evaluated on the test set.
Our experimental results were based on the Australian and Argentina datasets. Table 5 shows that the AMD model was the most effective on various evaluation indicators. Specifically, in terms of R M S E and M A P E metrics, the results of various models in the Australian dataset showed that our model exhibited improvements of at least 55.4% and 57.09%, respectively. These results proved that using AMD modal information with various neural networks to adapt to multimodal AIS and environmental data has obvious advantages in improving vessel trajectory prediction capabilities.
Moreover, when contrasted against the seven comparison models, the AMD model improved, on average, by 75.44% in terms of the R M S E evaluation metric on the Australian dataset. Compared to the benchmark models, the R M S E of the AMD model improved by 45.49% on the Argentina dataset. It is worth noting that the prediction results of the ST-Seq2seq baseline model deviated from other models by more than 100 times. This showed that the ST-Seq2seq model does not perform well on our ocean dataset, which is closely related to the ST-Seq2seq model design goals and dataset.
M A E measures the error between a model’s predicted and actual values. As shown in Table 5, the AMD model improved by an average of 84.39% and 74.66% compared to the baseline comparison model on the Australian and Argentina datasets, respectively. We selected the ship’s trajectory data ( M M S I = 636,092,810) on the Australian dataset and the ship’s ( M M S I = 740,349,000) on the Argentina dataset to draw a schematic trajectory diagram. We set the length of the observation sequence and the target prediction sequence to 9.
To explicitly show the effectiveness of the AMD model, we selected a period of historical trajectory data in the dataset as the test data to draw a trajectory display chart. We defined the known trajectory in the input prediction model as the ‘observation’ trajectories and the actual trajectory of the target predicted trajectory segment as the ‘truth’ trajectories. Among them, the ‘observation’ trajectories are also the same starting point sequence of the different prediction models. As shown in Figure 4, across different datasets, AMD’s prediction was closest to the true trajectory. However, the predicted trajectories of the other models significantly deviated by several nautical miles. This trajectory diagram is consistent with the experimental results. In summary, our AMD model achieved satisfactory performance in vessel trajectory prediction.

5.3. Qualitative Analysis

5.3.1. Comparison with Sub-Extraction Models

As mentioned earlier, the AMD model was based on MLP and GRU networks. To verify the efficacy of the AMD model, we compared it to those from the sub-extraction models constructed from MLP and GRU. AIS and environmental datasets were directly associated with training the sub-model. That is, the input dimension and input step were 10 and 2, respectively.
The AMD model exhibited better prediction across various evaluation indicators when compared to the sub-model, as shown in Table 6. Through comparing the predicted value and the actual value evaluation index M A E , Figure 5 shows that our proposed model performed best on the two datasets used for evaluation. This experimental result proved that simply utilizing a single neural network to train mixed AIS and environmental data cannot achieve prediction results with good accuracy.
We extracted the ship’s trajectory data ( M M S I = 636,017,002) from the Australian dataset and the ship’s trajectory data ( M M S I = 740,349,000) from the Argentina dataset, and we also drew a schematic diagram of the AMD model and sub-model trajectory. The AMD model predicted the trajectory more accurately than the single model, as shown in Figure 6. Therefore, AIS-based and environmental-based multimodal data fusion prediction can improve the reliability of ship trajectory prediction.

5.3.2. Comparison of Different Types of Ships

There are many types of ships at sea, and the navigation characteristics of each ship are different. For example, obvious patterns can be seen in some ship tracks, while others are scattered due to a lack of regularity. The results of the prediction model differ depending on the vessel type. As shown in Table 7, there are eight types of ship data in the Australian dataset.
The data of these eight ship types were separated from the dataset; then, our trained AMD model was used to predict the experimental results for different ship type datasets. As shown in Figure 7, under the R M S E and M A P E evaluation metrics, the ship types of 20 (wing-effect craft) and 40 (high-speed craft) had the best prediction results. This may be related to the satellite AIS receivers exceeding the distance limit. In general, land-based AIS receivers cannot effectively collect ship data due to distance limitations, but satellites can overcome this limitation.
As shown in Figure 8, when the ship type = 30 (fishing boat), the error of the AMD model is the smallest. This is because fishing boats often operate in designated activity areas, and the AMD model can learn certain regular vessel trajectory characteristics more easily. Notably, the scientific magnitudes of the R M S E and M A P E evaluation indicators were very small, and the errors between the different types of ships were not very different. Moreover, the M A E evaluation index showed that the prediction results of the various types of boats hardly differed. The AMD model predicted similar results for different types of ships, thereby proving that the AMD model can be well adapted for the prediction of ship trajectories.

5.4. Ablation Experiments Used to Verify the AMD Model’s Superiority

5.4.1. Comparison of the Prediction Results of Different Network Structures

In contrast to using AIS datasets to study the methods for predicting ship trajectories, this study also combined environmental modal data. Both AIS data and environmental data exhibited temporal sequence features. We compared the AIS and environmental data learning networks with typical networks to verify the superiority of the multimodal datasets. As shown in Table 8, five variant models were set up for comparison. The data training process and evaluation indicators were consistent with the proposed AMD method.
Figure 9 shows the experimental results of the variant model on the two datasets. The multimodal merger prediction network consisting of MLP and GRU networks performed best. The reason for this was that the ship navigation data in the two sea areas were dense, and the spatio-temporal information data were sufficient; thus, the AIS data have linear entity characteristics. Therefore, utilizing MLP works best in the AIS data extraction network. However, the space–time information itself was not directly related. We employed the nearest time and shortest distance techniques to combine the two types of modal data, thereby establishing a correlation between the environmental and spatio-temporal information. It had a weakly nonlinear time series data characteristic. Thus, the GRU network was the best in terms of environmental data extraction.

5.4.2. Comparison of the Prediction Results under Various Combinations of Environmental Factors

This study explicitly considered three environmental factors: wind (including wind speed and direction), visibility, and air temperature. This section details the series of experiments that were set up to verify whether considering sufficient environmental factors is necessary to improve the AMD model’s reliability. Specifically, six comparison experiment groups were created using the control variable method. The AIS data input dimension information remained unchanged and the environmental data factor input type changed. That is, the environmental dataset contains only win factors; only v i s factors; only t e m p factors; w i n and v i s factors; w i n and t e m p factors; and t e m p and v i s factors. The AMD method we designed was trained on these six comparison datasets to obtain a model; then, the prediction results were obtained when the model was applied to the test dataset.
Table 9 shows the prediction results of our model for various comparison datasets. In the RMSE evaluation metric, our method is at least 56.56% better than other combinations of environmental factors (including only v i s factors). According to Figure 10, the AMD model was the most accurate when considering all three environmental factors together. Therefore, from the prediction results, it is reasonable for the experimental results to be different under different combinations of environmental factors because they have different effects on ship navigation. For example, if the wind speed is too high, the ship’s navigation will be disrupted, thus causing it to veer off course or move slowly. When visibility is poor, ships often use safer navigation methods, such as traveling at low speeds. In terms of the M A P E and M A E evaluation metrics, temperature factors are considered as having a better effect than considering wind factors. However, in terms of the R M S E evaluation index, the impact of temperature factors on the model is considerably smaller than other environmental factors (such as wind or visibility). This is reasonable because the effects of temperature factors on ship navigation are relatively small in real-world navigation conditions.
We recorded the navigation route of the ship with M M S I = 538,008,487 in a specific area of Australia. Figure 11 shows that the AMD model-predicted trajectory was closer to the actual value when considering the three environmental factors. In comparison, the effect was significantly worse when one or two factors were considered. In summary, considering as many environmental factors as possible when the aim is to improve the AMD model’s reliability is of positive importance. Therefore, including all three environmental factors in the AMD model can improve its reliability from the perspective of multimodal data.

6. Conclusions

To improve the ability to predict ship trajectories utilizing multimodal datasets, we proposed a ship trajectory prediction model that adapts to satellite multimodal AIS and environmental data. Unlike most existing studies that only consider a ship’s historical spatio-temporal trajectory data, the AMD model also considers the maritime environmental factors that affect ship navigation. Various modalities of data, such as space–time and environmental data, exhibit various effects on ship trajectory prediction. Thus, the multimodal feature extraction network was designed to obtain multimodal data features. Specifically, we associated two types of modal data through the closest timestamp and shortest distance methods, which were then used to construct multimodal datasets. Then, an effective modal data extraction network was designed to obtain multimodal data features, and a weighted sum method was used to fuse multimodal feature prediction output.
The AMD model realized the fusion of multimodal data, and the realized multimodal data features in addition to the spatio-temporal data were needed to ensure the prediction model’s reliability. Numerous experimental results proved that the AMD model improves the RMSE and MAPE metrics by at least 55.4% and 57.09%, respectively, when compared to the advanced baseline model. In addition, the superiority of the AMD model was verified by comparing the prediction performance with critical components, and the prediction results were obtained by considering different ship types. Finally, the effectiveness of the AMD model design was demonstrated through a series of ablation experiments. However, in a real-world maritime traffic service system, various modalities of data related to ship navigation exist, and the performance of existing prediction models can still be improved.
Although this study explored a multimodal data adaptive prediction model, there remain various areas to further improve the prediction performance. Our future work will employ attention mechanisms to improve the AMD model’s ability to learn characteristic elements based on multimodal data. These include the acceleration and deceleration behavior of ships during navigation, abnormal ship behavior in extreme weather environments, etc. Moreover, the navigation characteristics of different types of vessels should be further studied, thereby encompassing their business behaviors and navigation characteristics in diverse maritime scenarios.

Author Contributions

Conceptualization, Y.X. (Ye Xiao) and J.L.; Data curation, Y.X. (Ye Xiao); Formal analysis, Y.X. (Ye Xiao); Funding acquisition, Y.X. (Ye Xiao), Y.H., and J.L.; Investigation, Y.X. (Yi Xiao) and Q.L.; Methodology, Y.X. (Ye Xiao) and Y.H.; Project administration, Y.X. (Ye Xiao) and Y.H.; Resources, Y.H. and Q.L.; Software, Y.X. (Ye Xiao); Supervision, Y.H.; Validation, Y.X. (Ye Xiao), Y.H. and J.L.; Visualization, Y.X. (Yi Xiao); Writing—original draft, Y.X. (Ye Xiao); Writing—review and editing, Y.H., J.L., and Y.X. (Yi Xiao). All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Distinguished Youth Fund Project of Hunan Province (grant no. 2022JJ10018).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding authors on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Tian, X.; Yan, R.; Liu, Y.; Shuaian, W. A smart predict-then-optimize method for targeted and cost-effective maritime transportation. Transport. Res. B-Meth. 2023, 172, 32–52. [Google Scholar] [CrossRef]
  2. Kaklis, D.; Kontopoulos, I.; Varlamis, I.; Emiris, I.Z.; Varelas, T. Trajectory mining and routing: A cross-sectoral approach. J. Mar. Sci. Eng. 2024, 12, 157. [Google Scholar] [CrossRef]
  3. Chondrodima, E.; Pelekis, N.; Pikrakis, A.; Theodoridis, Y. An efficient LSTM neural network-based framework for vessel location forecasting. IEEE Trans. Intell. Transp. Syst. 2023, 24, 4872–4888. [Google Scholar] [CrossRef]
  4. Zhang, M.; Yu, S.R.; Chung, K.S.; Chen, M.L.; Yuan, Z.M. Time-optimal path planning and tracking based on nonlinear model predictive control and its application on automatic berthing. Ocean Eng. 2023, 286, 115228. [Google Scholar] [CrossRef]
  5. Xiao, Y.; Li, X.; Yao, W.; Chen, J.; Hu, Y. Bidirectional Data-Driven Trajectory Prediction for Intelligent Maritime Traffic. IEEE Trans. Intell. Transp. Syst. 2022, 24, 1773–1785. [Google Scholar] [CrossRef]
  6. Jia, H.; Yang, Y.; An, J.; Fu, R. A ship trajectory prediction model based on attention-BILSTM optimized by the whale optimization algorithm. Appl. Sci. 2023, 13, 4907. [Google Scholar] [CrossRef]
  7. Gao, D.; Zhu, Y.; Soares, C.G. Uncertainty modelling and dynamic risk assessment for long-sequence AIS trajectory based on multivariate Gaussian Process. Reliab. Eng. Syst. Saf. 2023, 230, 108963. [Google Scholar] [CrossRef]
  8. Park, J.; Jeong, J.; Park, Y. Ship trajectory prediction based on bi-LSTM using spectral-clustered AIS data. J. Mar. Sci. Eng. 2021, 9, 1037. [Google Scholar] [CrossRef]
  9. Wang, X.; Xiao, Y. A deep learning model for ship trajectory prediction using automatic identification system (AIS) data. Information 2023, 14, 212. [Google Scholar] [CrossRef]
  10. Zhang, X.; Liu, J.; Gong, P.; Chen, C.; Han, B.; Wu, Z. Trajectory prediction of seagoing ships in dynamic traffic scenes via a gated spatio-temporal graph aggregation network. Ocean Eng. 2023, 287, 115886. [Google Scholar] [CrossRef]
  11. Wang, S.; Li, Y.; Xing, H. A novel method for ship trajectory prediction in complex scenarios based on spatio-temporal features extraction of AIS data. Ocean Eng. 2023, 281, 114846. [Google Scholar] [CrossRef]
  12. Zhang, J.; Liu, J.; Hirdaris, S.; Zhang, M.; Tian, W. An interpretable knowledge-based decision support method for ship collision avoidance using AIS data. Reliab. Eng. Syst. Saf. 2023, 230, 108919. [Google Scholar] [CrossRef]
  13. Xu, X.; Wu, B.; Xie, L.; Teixeira, Â.P.; Yan, X. A novel ship speed and heading estimation approach using radar sequential images. IEEE Trans. Intell. Transp. Syst. 2023, 24, 11107–11120. [Google Scholar] [CrossRef]
  14. Srivastava, S.; Kumar, L.; Jeyanthi, R.; Deepa, K.; Aggrawal, V. Framework for ship trajectory forecasting based on linear stationary models using automatic identification system. Procedia Comput. Sci. 2023, 218, 1463–1474. [Google Scholar] [CrossRef]
  15. Xu, J.; Gong, J.; Wang, L.; Li, Y. Integrating k-means clustering and LSTM for enhanced ship heading prediction in oblique stern wave. J. Mar. Sci. Eng. 2023, 11, 2185. [Google Scholar] [CrossRef]
  16. Liu, J.; Zhang, J.; Billah, M.M.; Zhang, T. ABiLSTM based prediction model for AUV trajectory. J. Mar. Sci. Eng. 2023, 11, 1295. [Google Scholar] [CrossRef]
  17. Wu, Y.; Deng, L.; He, W. BwimNet: A novel method for identifying moving vehicles utilizing a modified encoder-decoder architecture. Sensors 2020, 20, 7170. [Google Scholar] [CrossRef] [PubMed]
  18. Wang, Y.; Liu, J.; Liu, R.W.; Wu, W.; Liu, Y. Interval prediction of vessel trajectory based on lower and upper bound estimation and attention-modified LSTM with bayesian optimization. Physica A 2023, 630, 129275. [Google Scholar] [CrossRef]
  19. Zhang, Y.; Han, Z.; Zhou, X.; Li, B.; Zhang, L.; Zhen, E.; Wang, S.; Zhao, Z.; Guo, Z. METO-S2S: A S2S based vessel tra-jectory prediction method with multiple-semantic encoder and type-oriented decoder. Ocean Eng. 2023, 277, 114248. [Google Scholar] [CrossRef]
  20. Xu, X.; Liu, C.; Li, J.; Miao, Y.; Zhao, L. Long-term trajectory prediction for oil tankers via grid-based clustering. J. Mar. Sci. Eng. 2023, 11, 1211. [Google Scholar] [CrossRef]
  21. Zhang, M.; Kujala, P.; Musharraf, M.; Zhang, J.; Hirdaris, S. A machine learning method for the prediction of ship motion trajectories in real operational conditions. Ocean Eng. 2023, 283, 114905. [Google Scholar] [CrossRef]
  22. Sedaghat, A.; Arbabkhah, H.; Jafari Kang, M.; Hamidi, M. Deep learning applications in vessel dead reck-oning to deal with missing automatic identification system data. J. Mar. Sci. Eng. 2024, 12, 152. [Google Scholar] [CrossRef]
  23. Dong, Q.; Wang, N.; Song, J.; Hao, L.; Liu, S.; Han, B.; Qu, K. Math-data integrated prediction model for ship maneuvering motion. Ocean Eng. 2023, 285, 115255. [Google Scholar] [CrossRef]
  24. Chen, X.; Wei, C.; Zhou, G.; Wu, H.; Wang, Z.; Biancardo, S.A. Automatic identification system (AIS) data supported ship trajectory prediction and analysis via a deep learning model. J. Mar. Sci. Eng. 2022, 10, 1314. [Google Scholar] [CrossRef]
  25. Bao, K.; Bi, J.; Gao, M.; Sun, Y.; Zhang, X.; Zhang, W. An improved ship trajectory prediction based on AIS data using MHA-BiGRU. J. Mar. Sci. Eng. 2022, 10, 804. [Google Scholar] [CrossRef]
  26. Zhao, J.; Yan, Z.; Zhou, Z.; Chen, X.; Wu, B.; Wang, S. A ship trajectory prediction method based on GAT and LSTM. Ocean Eng. 2023, 289, 116159. [Google Scholar] [CrossRef]
  27. Wang, S.; Li, Y.; Zhang, Z.; Xing, H. Big data driven vessel trajectory prediction based on sparse mul-ti-graph convolutional hybrid network with spatio-temporal awareness. Ocean Eng. 2023, 287, 115695. [Google Scholar] [CrossRef]
  28. Xiao, Z.; Fu, X.; Zhang, L.; Zhang, W.; Liu, R.W.; Liu, Z.; Goh, R.S.M. Big data driven vessel trajectory and navigating state prediction with adaptive learning, motion modeling and particle filtering techniques. IEEE Trans. Intell. Transp. Syst. 2020, 23, 3696–3709. [Google Scholar] [CrossRef]
  29. Zhang, M.; Huang, L.; Wen, Y.; Zhang, J.; Huang, Y.; Zhu, M. Short-term trajectory prediction of maritime vessel using k-nearest neighbor points. J. Mar. Sci. Eng. 2022, 10, 1939. [Google Scholar] [CrossRef]
  30. Wei, Y.; Chen, Z.; Zhao, C.; Chen, X.; He, J.; Zhang, C. A three-stage multi-objective heterogeneous inte-grated model with decomposition-reconstruction mechanism and adaptive segmentation error correction method for ship motion multi-step prediction. Adv. Eng. Inform. 2023, 56, 101954. [Google Scholar] [CrossRef]
  31. Lang, X.; Wu, D.; Mao, W. Physics-informed machine learning models for ship speed prediction. Expert. Syst. Appl. 2024, 238, 121877. [Google Scholar] [CrossRef]
  32. Wu, W.; Chen, P.; Chen, L.; Mou, J. Ship trajectory prediction: An integrated approach using Con-vLSTM-based sequence-to-sequence model. J. Mar. Sci. Eng. 2023, 11, 1484. [Google Scholar] [CrossRef]
  33. Guo, S.; Zhang, H.; Guo, Y. Toward multimodal vessel trajectory prediction by modeling the distribution of modes. Ocean Eng. 2023, 282, 115020. [Google Scholar] [CrossRef]
  34. Mehri, S.; Alesheikh, A.A.; Basiri, A. A context-aware approach for vessels’ trajectory prediction. Ocean Eng. 2023, 282, 114916. [Google Scholar] [CrossRef]
  35. Ghimire, S.; Nguyen-Huy, T.; Prasad, R.; Deo, R.C.; Casillas-Perez, D.; Salcedo-Sanz, S.; Bhandari, B. Hybrid convolutional neural network-multilayer perceptron model for solar radiation prediction. Cogn. Comput. 2023, 15, 645–671. [Google Scholar] [CrossRef]
  36. Li, H.; Gao, W.; Xie, J.; Yen, G.G. Multiobjective bilevel programming model for multilayer perceptron neural networks. Inform. Sciences 2023, 642, 119031. [Google Scholar] [CrossRef]
  37. Mirzavand Borujeni, S.; Arras, L.; Srinivasan, V.; Samek, W. Explainable sequence-to-sequence GRU neural network for pollution forecasting. Sci. Rep. 2023, 13, 9940. [Google Scholar] [CrossRef] [PubMed]
  38. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural. Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  39. Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
  40. Perera, L.P.; Oliveira, P.; Soares, C.G. Maritime traffic monitoring based on vessel detection, tracking, state estimation, and trajectory prediction. IEEE Trans. Intell. Transp. Syst. 2012, 13, 1188–1200. [Google Scholar] [CrossRef]
  41. Liu, R.W.; Nie, J.; Garg, S.; Xiong, Z.; Zhang, Y.; Hossain, M.S. Data-driven trajectory quality improvement for promoting intelligent vessel traffic services in 6G-enabled maritime IoT systems. IEEE Internet Things J. 2020, 8, 5374–5385. [Google Scholar] [CrossRef]
  42. Capobianco, S.; Millefiori, L.M.; Forti, N.; Braca, P.; Willett, P. Deep learning methods for vessel trajectory prediction based on recurrent neural networks. IEEE Trans. Aerosp. Electron. Syst. 2021, 57, 4329–4346. [Google Scholar] [CrossRef]
  43. Gao, D.W.; Zhu, Y.S.; Zhang, J.F.; He, Y.K.; Yan, K.; Yan, B.R. A novel MP-LSTM method for ship tra-jectory prediction based on AIS data. Ocean Eng. 2021, 228, 108956. [Google Scholar] [CrossRef]
  44. You, L.; Xiao, S.; Peng, Q.; Claramunt, C.; Han, X.; Guan, Z.; Zhang, J. St-seq2seq: A spatio-temporal feature-optimized seq2seq model for short-term vessel trajectory prediction. IEEE Access 2020, 8, 218565–218574. [Google Scholar] [CrossRef]
Figure 1. Structural diagrams of the networks. (a) The GRU. (b) The LSTM.
Figure 1. Structural diagrams of the networks. (a) The GRU. (b) The LSTM.
Jmse 12 00513 g001
Figure 2. The architecture diagram of the adaptive multimodal data model in this research.
Figure 2. The architecture diagram of the adaptive multimodal data model in this research.
Jmse 12 00513 g002
Figure 3. The sea area selected for this study. The red dots in the figure are ship track points. (a) Australia. (b) Argentina.
Figure 3. The sea area selected for this study. The red dots in the figure are ship track points. (a) Australia. (b) Argentina.
Jmse 12 00513 g003
Figure 4. Schematic diagram of the predicted trajectories of ships under different models on the Australian and Argentina datasets. (a) Vessel trajectories ( M M S I = 636,092,810) in the Australian dataset. (b) Vessel trajectories ( M M S I = 740,349,000) in the Argentina dataset.
Figure 4. Schematic diagram of the predicted trajectories of ships under different models on the Australian and Argentina datasets. (a) Vessel trajectories ( M M S I = 636,092,810) in the Australian dataset. (b) Vessel trajectories ( M M S I = 740,349,000) in the Argentina dataset.
Jmse 12 00513 g004
Figure 5. Compare MAE indicators in the AMD model with the sub-extraction model.
Figure 5. Compare MAE indicators in the AMD model with the sub-extraction model.
Jmse 12 00513 g005
Figure 6. Comparison of the predicted trajectories of the AMD model with the sub-extraction model. (a) Vessel trajectories ( M M S I = 636,017,002) in the Australian dataset. (b) Vessel trajectories ( M M S I = 740,349,000) in the Argentina dataset.
Figure 6. Comparison of the predicted trajectories of the AMD model with the sub-extraction model. (a) Vessel trajectories ( M M S I = 636,017,002) in the Australian dataset. (b) Vessel trajectories ( M M S I = 740,349,000) in the Argentina dataset.
Jmse 12 00513 g006
Figure 7. Comparison of the trajectory prediction results of the different types of ships.
Figure 7. Comparison of the trajectory prediction results of the different types of ships.
Jmse 12 00513 g007
Figure 8. Comparison of the errors between predicted values and true values for different types of ship trajectories.
Figure 8. Comparison of the errors between predicted values and true values for different types of ship trajectories.
Jmse 12 00513 g008
Figure 9. Comparison of the prediction performance of the variant models on the two datasets: (a) Australia dataset. (b) Argentina dataset.
Figure 9. Comparison of the prediction performance of the variant models on the two datasets: (a) Australia dataset. (b) Argentina dataset.
Jmse 12 00513 g009
Figure 10. The error between the predicted value and the actual value of the AMD model under different environmental factors.
Figure 10. The error between the predicted value and the actual value of the AMD model under different environmental factors.
Jmse 12 00513 g010
Figure 11. A schematic diagram of the trajectory predicted by the AMD model under different environmental factors.
Figure 11. A schematic diagram of the trajectory predicted by the AMD model under different environmental factors.
Jmse 12 00513 g011
Table 1. Detailed configuration information of the training process.
Table 1. Detailed configuration information of the training process.
NameValue
Learning rate0.001
Batches64
OptimizerAdam
Epochs300
Loss functionMSE
Activationtanh
Table 2. Latitude and longitude area.
Table 2. Latitude and longitude area.
LongitudeLatitude
Min. Max. Min. Max.
Australia105.65178.85−54.92−6.56
Argentina−100.36−30.46−66.91−32.23
Table 3. Preprocessing entries for AIS and environmental datasets.
Table 3. Preprocessing entries for AIS and environmental datasets.
DatasetsAIS Data (Rows)Environmental Data (Rows)
Raw Data Entries Preprocessed Data Entries Ships Raw Data Entries Preprocessed Data Entries
Australia795,132314,8651761,168,140380,066
Argentina1,485,687631,027227872,199217,859
Table 4. Detail parameter table for baseline models.
Table 4. Detail parameter table for baseline models.
ModelLearning RateInput LayerBatch SizeActivation
LSTM [38]0.0011864tanh
GRU [39]0.0011864tanh
ANN [40]0.0011864tanh
BLSTM [41]0.005185000tanh
EncDec [42]0.000118200tanh
MPLSTM [43]0.0011864tanh
ST-Seq2seq [44]0.0011864tanh
Table 5. Comparison of the prediction results of the various models under Australian and Argentina datasets.
Table 5. Comparison of the prediction results of the various models under Australian and Argentina datasets.
Baseline ModelsAustralia DatasetsArgentina Datasets
RMSE ( 10 3 ) MAPE ( 10 3 ) MAE ( nmi ) RMSE ( 10 3 ) MAPE ( 10 3 ) MAE ( nmi )
LSTM [38]0.4931.0113.6480.2620.5351.082
GRU [39]0.3620.4544.0050.3430.8261.663
ANN [40]0.7821.2167.8710.4761.7433.526
BLSTM [41]1.0431.7106.7760.7752.4185.656
EncDec [42]0.9131.8476.9240.4341.2732.687
MPLSTM [43]0.3970.9132.8350.3860.6952.073
ST-Seq2seq [44]121.686200.1641450.61452.652200.713450.236
Our model0.1610.1950.8430.2540.2440.624
Table 6. Comparison of the AMD model with the sub-extraction model.
Table 6. Comparison of the AMD model with the sub-extraction model.
DatasetsModelsRMSE ( 10 4 )MAPE ( 10 4 )
Australia datasetMLP sub-model8.968.32
GRU sub-model4.597.05
Our1.611.95
Argentina datasetMLP sub-model3.528.66
GRU sub-model3.225.83
Our2.542.44
Table 7. The different ship type numbers that correspond to ship types.
Table 7. The different ship type numbers that correspond to ship types.
Types NumberType Names
20Wing in ground (WIG)
30Fishing
40High speed craft (HSC)
50Pilot vessel
60Passenger
70Cargo
80Tanker
90Other type
Table 8. The five variant models composed of typical network structures.
Table 8. The five variant models composed of typical network structures.
Variant 1Variant 2Variant 3Variant 4Variant 5
AIS ex-netgrulstmgrugrubigru
Environmental ex-netgrulstmbigrulstmmlp
Variant modeldgrudlstmgrubigrugrulstmbigrumlp
Table 9. Prediction results of the AMD model under different combinations of environmental factors (Australian dataset).
Table 9. Prediction results of the AMD model under different combinations of environmental factors (Australian dataset).
winvistempwin and viswin and temptemp and viswin, vis and temp
RMSE ( 10 4 )4.613.726.665.134.825.641.61
MAPE ( 10 4 )9.866.522.395.092.752.031.95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, Y.; Hu, Y.; Liu, J.; Xiao, Y.; Liu, Q. An Adaptive Multimodal Data Vessel Trajectory Prediction Model Based on a Satellite Automatic Identification System and Environmental Data. J. Mar. Sci. Eng. 2024, 12, 513. https://doi.org/10.3390/jmse12030513

AMA Style

Xiao Y, Hu Y, Liu J, Xiao Y, Liu Q. An Adaptive Multimodal Data Vessel Trajectory Prediction Model Based on a Satellite Automatic Identification System and Environmental Data. Journal of Marine Science and Engineering. 2024; 12(3):513. https://doi.org/10.3390/jmse12030513

Chicago/Turabian Style

Xiao, Ye, Yupeng Hu, Jizhao Liu, Yi Xiao, and Qianzhen Liu. 2024. "An Adaptive Multimodal Data Vessel Trajectory Prediction Model Based on a Satellite Automatic Identification System and Environmental Data" Journal of Marine Science and Engineering 12, no. 3: 513. https://doi.org/10.3390/jmse12030513

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop