Article

Multi-Attack Intrusion Detection for In-Vehicle CAN-FD Messages

1
State Key Laboratory of Automotive Simulation and Control, Jilin University, Changchun 130025, China
2
College of Automotive Engineering, Jilin University, Changchun 130025, China
3
Strategic Cooperation Department, Mobje Co., Ltd., Changchun 130013, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(11), 3461; https://doi.org/10.3390/s24113461
Submission received: 23 April 2024 / Revised: 24 May 2024 / Accepted: 25 May 2024 / Published: 27 May 2024
(This article belongs to the Section Vehicular Sensing)

Abstract
As an enhanced version of the standard CAN, the Controller Area Network with Flexible Data-Rate (CAN-FD) is vulnerable to attacks due to its lack of information security measures. Anomaly detection is an effective method to prevent such attacks, but detection accuracy needs further improvement. In this paper, we propose a novel intrusion detection model for the CAN-FD bus, comprising two sub-models: the Anomaly Data Detection Model (ADDM) for spotting anomalies and the Anomaly Classification Detection Model (ACDM) for identifying and classifying anomaly types. ADDM employs Long Short-Term Memory (LSTM) layers to capture the long-range dependencies and temporal patterns within CAN-FD frame data, thus identifying frames that deviate from established norms. ACDM is enhanced with an attention mechanism that weights LSTM outputs, further improving the identification of sequence-based relationships and facilitating multi-attack classification. The method is evaluated on two datasets: a real-vehicle dataset including frames designed by us based on known attack patterns, and the CAN-FD Intrusion Dataset developed by the Hacking and Countermeasure Research Lab. Our method offers broader applicability and more refined classification in anomaly detection. Compared with existing advanced LSTM-based and CNN-LSTM-based methods, our method exhibits superior detection performance, achieving improvements in accuracy of 1.44% and 1.01%, respectively.

1. Introduction

Intelligent connected vehicles are equipped with advanced sensors, enabling interaction with the external environment. They involve extensive data and code, providing functionalities for driving assistance and cockpit interaction. With the widespread adoption and increasing sophistication of communication interfaces, external attackers are able to intrude into the in-vehicle network and subsequently target the Electronic Control Units (ECUs) connected to this network. ECUs are critical components within autonomous vehicles, playing an essential role in processing real-time decisions and data exchange [1]. Securing communication among ECUs is pivotal in thwarting potential cyberattacks and safeguarding in-vehicle operational safety.
ECUs in vehicles widely use the Controller Area Network (CAN)/Controller Area Network with Flexible Data-Rate (CAN-FD) bus for communication. The original CAN/CAN-FD bus design does not adequately consider information security [2], lacking sender verification and message authentication in its broadcasting mechanism, which makes in-vehicle networks vulnerable to tampering and injection attacks. As depicted in Figure 1, the increasing connectivity of modern vehicles with external devices [3], such as through OBD-II ports [4], USB ports, Wi-Fi, and Bluetooth, exacerbates this vulnerability [5,6]. Once network attackers gain access to the bus [7], they can manipulate communication, including injecting anomalous frames that transmit various erroneous commands or data to targeted vehicle components. The prevalent types of abnormal messages include Denial of Service (DoS) [8], Fuzzing [9,10], Replay [11], Spoofing [12], Scaling attack [13,14], and Ramp attack [13,14]. Researchers have explored the security issues of in-vehicle networks [15]. Koscher et al. [11] were among the first to show how attackers can gain complete control over vehicle functions by intercepting and reverse-engineering CAN bus data. This unauthorized access can disrupt critical automotive functions, like braking and steering, raising significant safety risks. It can also expose personal data such as vehicle movements and driving preferences. An analysis of automotive attack incidents over the past ten years found that those directly or indirectly caused by the CAN/CAN-FD bus accounted for 13.61% of the total [16]. Therefore, enhancing the security of CAN/CAN-FD systems is essential to protecting in-vehicle networks from increasing cyber threats.
Intrusion Detection Systems (IDS) are crucial for securing in-vehicle networks by effectively identifying and mitigating cyberattacks on CAN/CAN-FD [17,18]. Intrusion detection algorithms are generally classified into three types: periodicity-based, semantic-based, and data-based [19], each varying in their detection principles and scope. WP29 R155 [20], ISO 21434 [21], and Cybersecurity Best Practices for the Safety of Modern Vehicles [22] all emphasize that anomaly detection is an effective and necessary means to prevent vehicle attacks.
Periodicity-based IDS [23,24,25] primarily focus on the temporal characteristics of network communication, particularly the periodicity of messages. These systems detect anomalies by analyzing deviations from expected transmission intervals. Halder et al. [23] proposed an IDS utilizing the clock offset of the transmitting ECU to monitor the in-vehicle network and identify anomalies using the Cumulative Sum method. Olufowobi et al. [24] developed an IDS based on real-time schedulable response time analysis of CAN buses, identifying anomalies by comparing the actual completion time of messages with the forecasts from a constructed message timing model. Ji et al. [25] employed clock drift to discern timing patterns in CAN bus messages for intrusion detection. However, these IDS can miss non-periodic or dynamically timed messages, which could allow irregular attack patterns to go undetected. Such limitations stem from the variable nature of some CAN/CAN-FD messages, and attackers might exploit this by mimicking normal message periodicity, thereby evading detection. Additionally, some attacks that do not alter message timing can remain undetected, reducing the effectiveness of periodicity-based IDS.
Semantic-based IDS [26,27,28] concentrate on analyzing the meaning or logic of message contents to ensure alignment with system operations. These approaches verify that messages adhere to expected formats and values. Narayanan et al. [26] employed a hidden Markov model to identify abnormal states in CAN messages, capturing data via a sliding window and interpreting it as observation sequences. This model calculates probabilities to pinpoint abnormal signals. Guo et al. [27] implemented a ring architecture with multiple sensors, applying edge-based data fusion for anomaly detection. Wasicek et al. [28] introduced a context-aware intrusion detection method for anomalies in automotive control systems. Semantic-based IDS require a thorough understanding of CAN protocols and message behaviors, and system updates may necessitate reconfigurations. These systems might not detect new attack types that deviate from known patterns. Furthermore, semantic-based IDS are typically more complex than other types of IDS, as they need to discern complex content, structure, and contextual meanings.
Data-domain-based IDS [19,29,30,31,32,33] utilize machine learning and data analysis to identify network anomalies by analyzing extensive network traffic. These models are trained to recognize normal behavior patterns and detect deviations, adapting well to dynamic network environments. Wang et al. [29] introduced an IDS utilizing Hierarchical Temporal Memory learning. In their approach, a binary data stream is fed into each data sequence predictor prior to decoding, and the predicted output is then evaluated using an anomaly scoring mechanism. Based on prediction, Taylor et al. [30] developed a neural network with a Long Short-Term Memory (LSTM) [34] layer to predict the next packet. The prediction error, measured with a log-loss function on the predicted output, is used as a signal for detecting anomalies in the sequence. Based on classification, M. D. Hossain et al. [31] developed attack datasets for the CAN system and established an LSTM-based model for anomaly defense. Building on LSTM, Chen et al. [32] introduced causality into the method for anomaly detection, which exhibited commendable performance. Also based on LSTM, Qin et al. [33] introduced an anomaly detection algorithm for the CAN bus system, aimed at reducing false positives and maximizing detection accuracy by evaluating a range of loss functions. However, challenges include difficulty in precisely identifying attack types and implementing specific countermeasures, and the scarcity of real-vehicle data for training robust models. These models often rely on simulated data, limiting their effectiveness in real-life scenarios. Furthermore, there is a continuous need to enhance the accuracy of these data-driven models: False Negatives (FNs) and False Positives (FPs) remain a concern. Improving accuracy is crucial for the reliability of these systems in practical applications.
In this work, we propose a data-driven intrusion detection model comprising two sub-models: the Anomaly Data Detection Model (ADDM) and the Anomaly Classification Detection Model (ACDM). ADDM, based on LSTM, is designed to identify anomalous frames within CAN-FD datasets containing attacks, utilizing a stacked LSTM layer structure that deepens the neural network and allows each layer to further refine information learned from its predecessors. This methodology aids the model in effectively learning and capturing complex features and long-range dependencies inherent in time-series data. The ACDM model enhances stacked LSTM layers with temporal attention [35] to categorize anomaly types in CAN-FD frames. LSTM networks can learn the temporal features of repetitive request patterns observed in DoS and Replay attacks. The attention mechanism assists the model in focusing on the most crucial elements of the current predictive task, such as identifying tampered messages during Spoofing attacks. Furthermore, the attention mechanism dynamically adjusts the weights of input features, enabling the model to automatically adapt its focus on attack characteristics learned from historical data, such as the random or anomalous data patterns in Fuzzing attacks. Our approach surpasses the limitations of frequency-based detection that depends on periodic messaging. Grounded in extensive experiments with real CAN-FD frame data from actual vehicles, our method achieves high accuracy in identifying various types of anomalies without needing extensive prior knowledge, unlike semantic-based IDS.
The main contributions of this paper can be summarized as follows:
  • Our model employs LSTM layers with attention mechanisms to innovatively detect various CAN-FD bus attacks, such as DoS, Fuzzing, Replay, Spoofing, Scaling, and Ramp. This detection approach does not require the CAN-FD communication matrix or extensive prior knowledge, enhancing its broad applicability.
  • As far as we know, we are the first to collect raw CAN-FD bus message data directly from actual vehicles and generate an attack dataset based on it. This dataset reflects diverse attack patterns, offering high adaptability for simulating scenarios across various vehicular environments.
  • To validate our method’s effectiveness, we conduct experiments on two datasets: the real-vehicle dataset and the automotive CAN-FD Intrusion Dataset from the Hacking and Countermeasure Research Lab [36] in Korea. Compared with the State-of-the-Art (SOTA) method, our model provides refined classification capabilities for different types of attacks. Experimental results show that our model improves attack classification accuracy by 1.01% over the SOTA method, effectively identifying the type of attack in anomalous frames.
The rest of the paper is organized as follows: Section 2 introduces the background of this work, including the structure of the CAN-FD frame and types of attacks. Section 3 introduces the problem definition. Section 4 focuses on the research methods of this paper, presenting methodology encompassing data collection, generation of abnormal datasets, data preprocessing, and the architecture of two sub-models. Section 5 introduces the experimental results of this method and carries out comparisons and discussions. Section 6 concludes this work and highlights the scope for future work.

2. Background

2.1. Controller Area Network Bus with Flexible Data Rate

Retaining the fundamental structure of the classic CAN bus, the CAN-FD bus introduces pivotal enhancements. During the data transmission phase, it can support speeds up to 5 Mbps or more, significantly surpassing the 1 Mbps limit of the traditional CAN bus. Furthermore, the data frame size has been expanded to accommodate up to 64 bytes of payload, a substantial increase from the 8-byte payload of the classic CAN, thus markedly boosting data transmission efficiency. These improvements in data transmission speed and capacity, devised for modern automotive electronics and high-speed industrial automation systems, cater to more complex and data-intensive applications.
The CAN-FD data frame is illustrated in Figure 2, which includes several critical components [37,38,39]:
  • SOF: The Start of Frame (SOF) bit is utilized for synchronization and to alert all nodes about the commencement of a CAN-FD message transmission. It is consistently set to 0.
  • Identifier: Each ECU has a unique identifier code, known as the Identifier (ID), which determines the recipient of a message. The ID spans 11 bits in the standard frame format, and the message priority is dictated by this field; typically, a lower numerical value signifies a higher priority. Consequently, when two ECUs transmit simultaneously, their identifiers are compared bit by bit, starting from the most significant bit. If one identifier exhibits a recessive logical “1” at a position where the other shows a dominant logical “0”, the identifier with the “1” loses arbitration.
  • IDE: The Identifier Extension (IDE) bit distinguishes between standard and extended frames. A logical “0” indicates an 11-bit ID, whereas a logical “1” indicates a 29-bit ID.
  • DLC: Data Length Code (DLC) specifies the number of valid bytes within the data field, with the maximum representable byte count being 64.
  • Data Field: The data field contains the payload data, which is interpreted by the designated receiving ECU.
Our IDS primarily focuses on the frame ID in the arbitration field, the DLC in the control field, the data field, and the timestamp when the frame is recorded. The frame ID, consisting of either 11 or 29 bits, serves to identify messages and facilitate arbitration. Within the CAN-FD protocol, the data field plays a crucial role in the in-vehicle communication network, carrying the actual payload data for communication between various ECUs. This data may encompass information collected from diverse sensors, such as temperature, speed, and acceleration sensors, as well as control commands for managing different vehicle components and various status information.
In bus architectures with multiple masters as defined by the CAN-FD protocol, ECUs broadcast messages onto the CAN-FD bus [40]. This architecture employs an arbitration mechanism for message transmission, where the priority of a message is determined by its frame ID; a lower ID value indicates a higher priority frame. This mechanism enables each ECU to attempt bus access and utilization at any time. However, this also introduces vulnerabilities in bus security. The fact that ECU nodes broadcast messages to the CAN-FD bus allows attackers to easily intercept and capture all messages transmitted within the network, potentially leading to Spoofing, Fuzzing, Replay, Scaling, or Ramp attacks; the priority arbitration mechanism may facilitate DoS attacks. Furthermore, automotive companies utilize vehicle communication matrices to define the flow of messages between ECUs, specifying the sending and receiving ECUs, as well as standardizing communication formats, protocols, and parameters. This plays a critical role in ensuring effective arbitration. Without a communication matrix, frame messages often become opaque data, and the configuration methods of communication matrices vary among different companies. Despite this, attackers can still reverse-engineer and modify any frame message within the network without recognizing the physical significance of the data fields, particularly by influencing frame arbitration priorities to execute DoS attacks easily.
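The arbitration rule above — the lower ID wins the bus because dominant “0” bits override recessive “1” bits — can be sketched in a few lines. This is a simplified illustration with hypothetical frame IDs, not part of the paper’s method:

```python
def arbitration_winner(ids):
    """Return the frame ID that wins CAN/CAN-FD arbitration.

    Arbitration compares IDs bit by bit from the most significant bit;
    a dominant bit (0) overrides a recessive bit (1), so for IDs of
    equal width this reduces to the lowest numerical ID winning.
    """
    return min(ids)

# A DoS attacker exploits this rule by flooding the bus with ID 0x000,
# which dominates every legitimate frame (IDs below are hypothetical).
legitimate = [0x1A0, 0x2B4, 0x0F3]
assert arbitration_winner(legitimate) == 0x0F3
assert arbitration_winner(legitimate + [0x000]) == 0x000
```

This is precisely why the priority arbitration mechanism facilitates DoS attacks: an attacker need not understand any data field to monopolize the bus.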

2.2. Type of Attack

In view of the characteristics described above, any ECU responsible for exchanging information with the external environment could potentially serve as an entry point for attackers to infiltrate the CAN-FD bus system. However, without additional security measures, the frame ID alone cannot verify the identity of the message sender. Such access could lead to various forms of attacks, posing risks to the security of the CAN-FD bus. In this paper, some anomalous attacks examined are shown in Figure 3, including DoS attacks, Fuzzing attacks, Replay attacks, and Spoofing attacks.
DoS: Attackers inject high-frequency messages with high-priority frame ID into the network bus, aiming to hinder authorized entities from accessing resources or inducing delays in time-sensitive system operations, thereby disrupting normal communications within the vehicle’s internal network. During an attack, attackers typically send high-priority data or requests to the network, resulting in network congestion that can impact or completely halt data exchanges between ECUs, thereby affecting the normal operation of the vehicle. Particularly in CAN/CAN-FD bus systems, if an attacker sends highest-priority data via an ECU, other ECUs can be prevented from using the bus to transmit messages because of the arbitration mechanism.
Replay: Attackers record data from a node over a certain period and replay it later [12]. By duplicating and resending previously transmitted messages in the in-vehicle CAN-FD network, the attacker causes the ECUs to mistakenly respond to outdated signals as if they are current, leading to inappropriate system responses at specific moments. This can lead to repeated or improper actions in the vehicle system, thus disrupting its normal functioning.
Fuzzing: A Fuzzing attack involves injecting randomized or semi-randomized data frames into a target’s CAN-FD system, particularly in high-traffic environments. The goal is to test the system’s response to abnormal or invalid inputs [19]. This type of attack, characterized by sending many aberrant or random CAN-FD messages, can result in erratic vehicle behavior like steering wheel jitters, unusual signal light activations, or unexpected gear shifts [15].
Spoofing: Attackers transmit entirely forged data frames to a target node, aiming to mimic legitimate data or commands from a real source in the network bus [41]. By simulating authentic ECU communication with these forged CAN-FD messages and altering their control and data fields, attackers deceive other ECUs or the vehicle control system. This can lead to data deviations in ECUs, resulting in loss of vehicle control, execution of unintended operations, or system chaos, ultimately leading to in-vehicle malfunction [42].
Scaling attack: A Scaling attack is a cybernetic assault that manipulates the measurement signals to detrimentally affect the performance of a system. In this form of attack, perpetrators alter the genuine measurement signals based on a predetermined scaling factor, which can precipitate outcomes that deviate from the system’s expected responses. For instance, should the speed measurements of an Automated Guided Vehicle (AGV) be artificially inflated, the system might misconstrue the need for increased velocity, consequently initiating inappropriate responses such as excessive acceleration. This could potentially compromise the safety and efficiency of the system. The essence of a Scaling attack lies in its simplicity and the potential chaos it can introduce, necessitating the implementation of robust detection and recovery mechanisms to uphold the stability and security of the system [13,14].
Ramp attack: A ramp attack represents a methodical cyber strategy where the attacker incrementally modifies or “ramps” the measurement signals over a period, thereby causing the system’s behavior to stray from its predefined operational conditions. Characterized by a progressive and either ascending or descending alteration of the signals, this attack can surreptitiously steer the system towards suboptimal or even unstable conditions. For example, if an AGV’s navigation system gradually receives increased positional errors, it could lead to the AGV deviating from its planned trajectory, heightening the risk of collisions or inadvertently entering hazardous zones. The covert nature of ramp attacks demands that systems possess highly sensitive anomaly detection capabilities to swiftly identify and counteract these gradual signal changes [13,14].
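To make the last two attack types concrete, the sketch below applies a fixed scaling factor and an incremental ramp to a measurement series. The integer speed readings are hypothetical, not taken from the paper’s dataset:

```python
def scaling_attack(signal, factor):
    """Multiply every sample by a predetermined scaling factor."""
    return [s * factor for s in signal]

def ramp_attack(signal, step):
    """Add a linearly growing offset so the drift stays gradual."""
    return [s + step * i for i, s in enumerate(signal)]

speed = [50, 52, 51, 53]                      # hypothetical AGV speed readings
assert scaling_attack(speed, 2) == [100, 104, 102, 106]
assert ramp_attack(speed, 1) == [50, 53, 53, 56]
```

Note how the ramped series stays close to the genuine one at first: this gradual onset is what makes Ramp attacks hard to catch with simple range checks.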

3. Problem Definition and Overview of Method

3.1. Problem Definition

This work conceptualizes the anomaly detection of CAN-FD data as a classification task for the target frames. The model considers historical time series data spanning a window of length l , to determine whether the current received data frame is normal or anomalous. Furthermore, it is possible to determine the type of attack causing the anomaly based on the characteristics of the anomalous frames. The concepts of CAN-FD bus anomaly detection are given as follows:
Definition 1.
Anomaly Data Detection. The goal is to differentiate between anomalous and normal messages, effectively executing a binary classification for anomaly detection. To address this challenge, a time-window method is adopted. The message window input for ADDM at the current time  T  is defined as follows:
$$X_{in,T} = \begin{bmatrix} X_T \\ X_{T+1} \\ X_{T+2} \\ \vdots \\ X_{T+l-1} \end{bmatrix} = \begin{bmatrix} x_{T,1} & x_{T,2} & \cdots & x_{T,d} \\ x_{T+1,1} & x_{T+1,2} & \cdots & x_{T+1,d} \\ x_{T+2,1} & x_{T+2,2} & \cdots & x_{T+2,d} \\ \vdots & \vdots & \ddots & \vdots \\ x_{T+l-1,1} & x_{T+l-1,2} & \cdots & x_{T+l-1,d} \end{bmatrix}, \tag{1}$$

where $X_t$ ($t \in \{T, T+1, T+2, \ldots, T+l-1\}$) is the extracted feature vector of the whole frame at time $t$, and $x_{t,j}$ ($t \in \{T, T+1, T+2, \ldots, T+l-1\}$, $j \in \{1, 2, 3, \ldots, d\}$) is the $j$-th feature in the data frame at time $t$. The number of features in each frame is $d$.
The output of ADDM is a 1-D vector $\hat{Y}_1$, which represents the anomaly score for the window at time $T$. A threshold $\theta$ is set: if $\hat{Y}_1 < \theta$, the CAN-FD frame is considered normal; otherwise, an anomaly is detected.
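The thresholding rule can be written as a one-line sketch. The value θ = 0.5 is purely illustrative; the paper does not fix a specific threshold here:

```python
def is_anomalous(y1_hat, theta=0.5):
    """ADDM decision rule: scores below theta are treated as normal,
    scores at or above theta are flagged as anomalies."""
    return y1_hat >= theta

# Hypothetical ADDM scores for two windows.
assert is_anomalous(0.91) is True
assert is_anomalous(0.12) is False
```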
Definition 2.
Anomaly Classification Detection. The additional goal is to classify all messages into multiple categories, including Benign, Replay, DoS, Fuzzing, Spoofing, Scaling attack, and Ramp attack. This multi-classification task involves not only detecting anomalous patterns but also discerning the specific types of irregularities or malfunctions they may indicate. The complexity of this task lies in differentiating standard operational messages from a spectrum of anomalies, each with its own distinct characteristics and implications for the vehicle’s system integrity.
For the current time T , let the message window input for the ACDM be denoted as X i n , T , corresponding to Equation (1). The output of ACDM is then a multi-dimensional vector:
$$\hat{Y}_2 = \left[ p_{Benign}, p_{Replay}, p_{DoS}, p_{Fuzzing}, p_{Spoofing}, p_{Scaling}, p_{Ramp} \right], \tag{2}$$

where $p_{Benign}, p_{Replay}, p_{DoS}, p_{Fuzzing}, p_{Spoofing}, p_{Scaling}, p_{Ramp}$ represent the classification scores of Benign, Replay, DoS, Fuzzing, Spoofing, Scaling attack, and Ramp attack, respectively. The number of dimensions is the total number of categories. In our method, this equates to seven, indicating that the model categorizes the input data into one of seven distinct classes.
The model’s output is actually a set of probabilities indicating the likelihood that the input data belongs to each of these classes. Finally, the argmax operation is applied to the output. This operation identifies the index of the maximum value in the output vector, which corresponds to the most likely category that the input data belongs to. The label associated with this index is then taken as the final result of the multi-classification process.
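The argmax step can be illustrated with hypothetical classification scores; the label ordering below follows the category order given above:

```python
LABELS = ["Benign", "Replay", "DoS", "Fuzzing", "Spoofing", "Scaling", "Ramp"]

def classify(scores):
    """Map the 7-dimensional ACDM output vector to its most likely label
    by taking the index of the maximum score (the argmax operation)."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return LABELS[best]

# Hypothetical probability output in which the DoS score dominates.
y2_hat = [0.02, 0.03, 0.85, 0.04, 0.03, 0.02, 0.01]
assert classify(y2_hat) == "DoS"
```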

3.2. Overview of Methods

As depicted in Figure 4, this work begins with the collection of data from the real vehicle. Subsequently, anomalies are injected into the collected dataset. This is followed by a preprocessing phase. The final step involves using ADDM for binary classification to distinguish between normal and abnormal frames and to identify anomalies. Alternatively, ACDM is used for multiclass classification to facilitate prediction calculations and to ascertain the category of the frames.

4. Methodology

4.1. Data Collection

We collected a dataset of messages from the CAN-FD bus of a real vehicle. This dataset encompasses CAN-FD messages that are transmitted across the in-vehicle network within a designated timeframe, including the timestamps, CAN-FD ID, data fields, and DLC. Subsequent processing of this dataset is carried out to remove erroneous data, perform numerical conversions, and conduct feature extraction. Following the steps, a refined dataset with normal messages can be obtained, denoted as X n o r :
$$X_{nor} = \begin{bmatrix} X_1 \\ X_2 \\ \vdots \\ X_n \end{bmatrix} = \begin{bmatrix} x_{1,1} & x_{1,2} & \cdots & x_{1,d} \\ x_{2,1} & x_{2,2} & \cdots & x_{2,d} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n,1} & x_{n,2} & \cdots & x_{n,d} \end{bmatrix}, \tag{3}$$

where $X_1, X_2, \ldots, X_n$ denote the feature vectors of each frame, and $x_{t,j}$ ($t \in \{1, 2, \ldots, n\}$, $j \in \{1, 2, \ldots, d\}$) indicates the $j$-th feature at time $t$.
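As a minimal sketch, one logged record can be converted into a numeric feature row of the kind described above. The whitespace-separated log format here is a hypothetical example; real capture tools use various formats:

```python
def parse_record(line):
    """Parse a 'timestamp id dlc byte0 byte1 ...' log line into the
    numeric features used by the model: timestamp, CAN-FD ID, DLC,
    and the payload bytes of the data field."""
    fields = line.split()
    timestamp = float(fields[0])
    frame_id = int(fields[1], 16)            # 11- or 29-bit identifier
    dlc = int(fields[2])                     # up to 64 valid data bytes
    payload = [int(b, 16) for b in fields[3:3 + dlc]]
    return [timestamp, frame_id, dlc] + payload

row = parse_record("0.0125 1A0 4 DE AD BE EF")
assert row == [0.0125, 0x1A0, 4, 0xDE, 0xAD, 0xBE, 0xEF]
```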

4.2. Generation of Anomalous Data

According to the characteristics of CAN-FD communication and attack patterns, we create an attack dataset containing anomalous data, including Replay, DoS, Fuzzing, Spoofing, Scaling, and Ramp attacks, as shown in Algorithm 1.
Algorithm 1: Generation of the Attack Dataset
Input: dataset without abnormal frames $X_{nor}$.
Output: dataset with abnormal frames $X_{abn}$.
1:  Initialize $X_{abn}$ as a copy of $X_{nor}$
2:  for $i = 1$ to $N_{DoS}$:
3:      $n_{ins}^{DoS}$ ← a random index number not used before
4:      for $i' = 1$ to $n_{DoS}$:
5:          $x_{DoS}$ ← DoS frame: set the frame ID and the entire data field to 0
6:          $X_{abn} = X_{abn} \cup x_{DoS}$ ← insert $x_{DoS}$ into $X_{abn}$ behind row index $n_{ins}^{DoS}$
7:  for $j = 1$ to $N_{Fuz}$:
8:      $n_{ins}^{Fuz}$ ← a random index number not used before
9:      for $j' = 1$ to $n_{Fuz}$:
10:         $x_{Fuz}$ ← Fuzzing frame: choose an unused integer in $list_{Fuz}$ as the frame ID and fill the data field with random values
11:         $X_{abn} = X_{abn} \cup x_{Fuz}$ ← insert $x_{Fuz}$ into $X_{abn}$ behind row index $n_{ins}^{Fuz}$
12: for $k = 1$ to $N_{Spo}$:
13:     $n_{ins}^{Spo}$ ← a random index number not used before
14:     for $k' = 1$ to $n_{Spo}$:
15:         $x_{Spo}$ ← Spoofing frame: choose a previously occurring frame ID and fill the data field with substantially altered values
16:         $X_{abn} = X_{abn} \cup x_{Spo}$ ← insert $x_{Spo}$ into $X_{abn}$ behind row index $n_{ins}^{Spo}$
17: for $l = 1$ to $N_{Rep}$:
18:     $n_{ins}^{Rep}$ ← a random index number not used before
19:     for $l' = 1$ to $n_{Rep}$:
20:         $x_{Rep}$ ← Replay frame: repeat the data of the preceding frames
21:         $X_{abn} = X_{abn} \cup x_{Rep}$ ← insert $x_{Rep}$ into $X_{abn}$ behind row index $n_{ins}^{Rep}$
22: for $h = 1$ to $N_{Sca}$:
23:     $n_{ins}^{Sca}$ ← a random index number not used before
24:     for $h' = 1$ to $n_{Sca}$:
25:         $x_{Sca}$ ← Scaling frame: generate a data frame with scaled values
26:         $X_{abn} = X_{abn} \cup x_{Sca}$ ← insert $x_{Sca}$ into $X_{abn}$ behind row index $n_{ins}^{Sca}$
27: for $g = 1$ to $N_{Ram}$:
28:     $n_{ins}^{Ram}$ ← a random index number not used before
29:     for $g' = 1$ to $n_{Ram}$:
30:         $x_{Ram}$ ← Ramp frame: generate a data frame with ramping values
31:         $X_{abn} = X_{abn} \cup x_{Ram}$ ← insert $x_{Ram}$ into $X_{abn}$ behind row index $n_{ins}^{Ram}$
32: return $X_{abn}$
The process of generation involves injecting abnormal data into the normal dataset to simulate attacks, representing the types of anomalous messages an attacker might inject into the CAN-FD bus. In the dataset with abnormal data frames, the numbers of attacks are set as N D o S , N F u z , N S p o , N R e p , N S c a , N R a m , and the number of anomalous frames within each attack are n D o S , n F u z , n S p o , n R e p , n S c a , n R a m , respectively.
The method for generating the attack dataset in this study is outlined in Algorithm 1. $N$ is the length of the initial dataset without any abnormal data frames. The initial step involves determining the number of occurrences for each type of attack and quantifying the anomalous frames involved in each instance. The process begins with generating $X_{abn}$ as a copy of $X_{nor}$ in line 1. Lines 2 to 6 detail the method for creating DoS anomalous frames, setting data fields and frame IDs to zero. Lines 7 to 11 outline the generation of Fuzzing anomalous frames, which involves gathering all frame IDs from the original dataset to guarantee their uniqueness, then fabricating abnormal IDs and infusing arbitrary values into the data fields. The construction of Spoofing anomalous frames is elaborated in lines 12 to 16, where previously occurring IDs are used for the frame IDs, while the data fields are populated with highly variable values. Lines 17 to 21 detail the development of Replay anomalies, achieved by replicating successive frames and integrating them into the existing dataset. Lines 22 to 26 describe the creation of the Scaling attack, which is accomplished by applying a scaling factor to the data field values. Lines 27 to 31 detail the generation of the Ramp attack, executed by incrementally adjusting the value in the data field to create a ramp effect. Finally, as shown in line 32, the dataset is compiled and outputted, encompassing all the injected attacks, thereby completing the attack dataset generation process.
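The insertion pattern of Algorithm 1 — choose an unused random index, then splice a burst of crafted frames behind it — can be sketched for the DoS case as follows. This is a simplified illustration; the row layout and payload length are assumptions, not the paper’s exact implementation:

```python
import random

def inject_dos(frames, n_attacks, n_frames_per_attack, payload_len=8):
    """Insert bursts of highest-priority DoS frames into a frame list.

    Each frame row is [timestamp, frame_id, dlc, data...]; DoS frames use
    ID 0 and an all-zero data field, mirroring Algorithm 1, lines 2-6.
    """
    abn = list(frames)                      # X_abn starts as a copy of X_nor
    used = set()
    for _ in range(n_attacks):
        idx = random.randrange(len(abn))
        while idx in used:                  # a random index not used before
            idx = random.randrange(len(abn))
        used.add(idx)
        burst = [[abn[idx][0], 0, payload_len] + [0] * payload_len
                 for _ in range(n_frames_per_attack)]
        abn[idx + 1:idx + 1] = burst        # splice behind the chosen row
    return abn

normal = [[0.01 * t, 0x1A0, 2, 10, 20] for t in range(100)]
attacked = inject_dos(normal, n_attacks=3, n_frames_per_attack=5)
assert len(attacked) == 100 + 3 * 5
```

The other five attack types differ only in how the injected frames are crafted (random IDs and payloads for Fuzzing, copied frames for Replay, and so on).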

4.3. Data Preprocessing

The min-max normalization is applied to the dataset X a b n , offering several benefits, such as standardizing the range of values and enhancing the algorithm’s performance. For the classification labels assigned to each frame, a binary encoding scheme is used, where “0” indicates the absence and “1” denotes the presence of a particular category. For the binary classification in ADDM, the label can be written as follows:
$$Y_1 = \left[ Y_{Benign} \right], \tag{4}$$

where $Y_{Benign}$ denotes the actual label of the CAN-FD frame. For ACDM, the label can be written as follows:

$$Y_2 = \left[ Y_{Benign}, Y_{Replay}, Y_{DoS}, Y_{Fuzzing}, Y_{Spoofing}, Y_{Scaling}, Y_{Ramp} \right], \tag{5}$$

where the values $Y_{Benign}, Y_{Replay}, Y_{DoS}, Y_{Fuzzing}, Y_{Spoofing}, Y_{Scaling}, Y_{Ramp}$, respectively, correspond to each category. This method facilitates clear differentiation between categories, simplifying the model’s task of classifying each frame. Moreover, it allows for a more precise and efficient analysis of the temporal relationships between consecutive frames, which is crucial for accurate anomaly detection in CAN-FD systems. A set of consecutive CAN-FD bus messages is defined as a single window. The sliding window approach is adopted, selecting a contiguous block of frames as the window. Time windows and strides are used to process the dataset. Then the preprocessed dataset $X_{in,T}$ can be obtained as the input for the model.
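A compact sketch of this preprocessing step, assuming NumPy; the window length and stride below are illustrative, not the values used in the experiments:

```python
import numpy as np

def preprocess(x_abn, window, stride):
    """Min-max normalize each feature column, then cut sliding windows."""
    x = np.asarray(x_abn, dtype=float)
    lo, hi = x.min(axis=0), x.max(axis=0)
    # Guard constant columns to avoid division by zero.
    x = (x - lo) / np.where(hi > lo, hi - lo, 1.0)
    starts = range(0, len(x) - window + 1, stride)
    return np.stack([x[s:s + window] for s in starts])

frames = [[t, t % 3, t * 2] for t in range(10)]      # toy feature rows
windows = preprocess(frames, window=4, stride=2)
assert windows.shape == (4, 4, 3)                    # (num_windows, l, d)
assert windows.min() >= 0.0 and windows.max() <= 1.0
```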

4.4. ADDM

In the context of anomaly detection in CAN-FD messages, there exist temporal dependencies and numerical relationships that change over time among different moments of CAN-FD messages. LSTM networks can leverage their internal gating mechanisms, including forget gates, input gates, and output gates, to learn the historical patterns of CAN-FD message data, encompassing variations in CAN-FD ID, data fields, and timestamps. Through the influence and retention of state data from historical moments on the current moment's state data, LSTM networks adapt to changes in data and their dependency patterns across different times. By employing a substantial number of internal parameters for inference, the LSTM layers can detect deviations when facing anomalies, precisely identifying both long-term and short-term ones.
ADDM leverages the strengths of LSTM in processing and memorizing long-term sequential information. Each type of attack, such as DoS and Replay attacks, forms specific sequential patterns within the data stream, which LSTM learns effectively through its gating mechanisms. As network behaviors evolve, so do the characteristics of these attacks. LSTM is capable of continually updating its internal states based on new data, adapting to these changes, and enhancing its ability to detect novel attacks.
As shown in Figure 5, the ADDM neural network features a multi-layered architecture designed for sequential data processing, suitable for time series analysis and anomaly detection in data streams. After preprocessing the frame data $X_{abn}$ to be tested, the input $X_{in,T}$ ($X_{in,T} \subseteq X_{abn}$) at time $T$ is fed into the model.
The LSTM unit's main components include the forget gate, input gate, output gate, and a cell state. The LSTM layers process the input sequence as follows:

$$f_t = \sigma\left(W_f \left[ h_{t-1}, X_t \right] + b_f\right),$$

where $f_t$ represents the output of the forget gate, $\sigma$ is the sigmoid function, constraining the output between 0 and 1, $W_f$ is the weight of the forget gate, $b_f$ is the bias, $h_{t-1}$ is the previous time step's hidden state, and $X_t$ is one of the time points of $X_{in,T}$.

$$i_t = \sigma\left(W_i \left[ h_{t-1}, X_t \right] + b_i\right), \tag{7}$$

$$\bar{C}_t = \tanh\left(W_c \left[ h_{t-1}, X_t \right] + b_c\right), \tag{8}$$

where Equation (7) is the input gate that decides how much new information to add to the cell state. $i_t$ is the output of the input gate, and $\bar{C}_t$ is the new candidate value. $W_i$ and $W_c$ are the weights for the input gate and candidate value, respectively, and $b_i$ and $b_c$ are the biases.

$$C_t = f_t \odot C_{t-1} + i_t \odot \bar{C}_t,$$

where the new cell state $C_t$ is a combination of two parts: the old state scaled by the forget gate, and the new candidate value scaled by the input gate.

$$o_t = \sigma\left(W_o \left[ h_{t-1}, X_t \right] + b_o\right),$$

$$h_t = o_t \odot \tanh(C_t),$$

where the output gate controls the value of the next hidden state. $o_t$ is the output of the output gate, $W_o$ is the weight for the output gate, and $b_o$ is the bias. The new hidden state $h_t$ is the product of the output gate's output and the cell state passed through a tanh function. These formulas collectively determine how the LSTM updates its internal state while processing sequential data, allowing it to effectively capture long-term dependencies. The input $X_{in,T}$ at time $T$ is processed sequentially by these formulas, and $H_t$ denotes the output of a single LSTM layer.
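The gate equations above amount to the following single-step update, shown here as a minimal NumPy sketch. The dictionary-of-gates interface and the weight layout over the concatenated $[h_{t-1}, X_t]$ are our own illustrative choices, not the paper's code:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM time step following the gate equations: W and b hold the
    weights/biases for the forget (f), input (i), candidate (c), and
    output (o) gates, applied to the concatenation [h_{t-1}, x_t]."""
    z = np.concatenate([h_prev, x_t])
    f = sigmoid(W["f"] @ z + b["f"])       # forget gate f_t
    i = sigmoid(W["i"] @ z + b["i"])       # input gate i_t
    c_bar = np.tanh(W["c"] @ z + b["c"])   # candidate cell state
    c = f * c_prev + i * c_bar             # new cell state C_t
    o = sigmoid(W["o"] @ z + b["o"])       # output gate o_t
    h = o * np.tanh(c)                     # new hidden state h_t
    return h, c
```

Running this step over every $X_t$ in a window reproduces the recurrence a Keras LSTM layer computes internally.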
The ADDM network starts with an input layer tailored to ingest the initial data format. The input layer is followed by the LSTM1 through LSTM4 layers, which can capture long-term dependencies and complex patterns in the data.
After the LSTM layers, the last hidden state $H_{t,4}$, a one-dimensional vector, is passed to a repeat vector layer, which bridges the gap between the different LSTM layers. The repeat vector layer processes its input as follows:

$$V_{rep,1} = \mathrm{RepeatVector}(H_{t,4}),$$

where $V_{rep,1}$ is the matrix obtained by repeating $H_{t,4}$.

Through the LSTM5 and LSTM6 layers, the output can be written as $H_{t,6}$. The subsequent fully connected neural network layer integrates the features extracted by the LSTM layers and applies a linear transformation, followed by an activation function. In this model, the sigmoid function is chosen to provide a probabilistic output, which is particularly useful for binary classification tasks. With $H_{t,6}$ as its input, the fully connected layer is as follows:

$$\hat{Y}_1 = \sigma_{n1}\left(W_{n1} H_{t,6} + b_{n1}\right),$$

where $\hat{Y}_1$ is the output of the model, $\sigma_{n1}$ denotes the sigmoid function, $W_{n1}$ is the weight of the layer, and $b_{n1}$ is the bias.

The network concludes with an output layer that renders the final classification. Binary Cross-Entropy (BCE) is chosen as the loss function, quantifying the deviation between the predicted probabilities $\hat{Y}_1$ and the actual labels $Y_1$:

$$BCE = -\frac{1}{N} \sum_{n_1=1}^{N} \left[ Y_1 \log \hat{Y}_1 + \left(1 - Y_1\right) \log\left(1 - \hat{Y}_1\right) \right],$$

where $N$ is the total number of samples. The optimization process is guided by the objective of minimizing the error between the predicted outcomes and the actual results.
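The BCE loss above can be computed directly; the clipping constant `eps` is an implementation detail we add to avoid `log(0)`, not part of the paper's formulation:

```python
import numpy as np

def bce(y_true, y_pred, eps=1e-12):
    """Binary cross-entropy averaged over N samples, matching the
    loss equation; predictions are clipped away from 0 and 1."""
    p = np.clip(y_pred, eps, 1.0 - eps)
    return float(-np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p)))
```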

4.5. ACDM

ACDM is designed to classify the frames in the CAN-FD dataset. In this work, the benign frame and several kinds of abnormal frames are all regarded as different types in the classification result.
In anomaly detection of CAN-FD buses, there is often a high dependency in the time series data, and interactions with high correlation are manifested across different moments and IDs in CAN-FD frames. CAN-FD messages at different time points reflect changes in the state of a vehicle at various moments, and the correlations between different data and times are dynamically changing. The attention mechanism is well-suited for calculating the dependencies and correlation relationships between CAN-FD data frames. By utilizing the differences in these correlation attributes, specifically the deviations in attention scores, anomalies deviating from normal patterns can be precisely detected, demonstrating exceptional inferential capabilities.
In ACDM, data at different time points are utilized as Query, Key, and Value in the attention mechanism to calculate the temporal relationships between data points at different times. If a specific frame ID or data pattern is highly correlated with anomalous behavior, the attention mechanism can aid the model by assigning higher weights to these particular time steps, facilitating the recognition of such behavior. ACDM can effectively classify anomalous behaviors: for replay attacks, LSTM can identify repetitive patterns in the time series, while the attention mechanism highlights the time points of these patterns; for DoS attacks, LSTM recognizes short-term substantial increases in priority requests within the data stream, and the attention mechanism helps the model focus on the critical phases at the start of the attack; fuzzing attacks involve attackers sending a large volume of randomly altered packets to the system to trigger errors, with the LSTM learning deviations from normal data and the attention mechanism identifying critical anomaly points; in spoofing attacks, messages mimic other devices or users. Both the LSTM and attention mechanism can recognize these changes through long-term data learning; scaling and ramp attacks gradually impact system performance by scaling or progressively increasing/decreasing certain parameter values. The LSTM can capture such scaling or gradual changes, while the attention mechanism identifies the most significant moments of change.
As depicted in Figure 6, ACDM features a sophisticated architecture that includes stacked LSTM layers augmented by the attention mechanism. The process begins with an input layer that introduces the input sequence $X_{in,T}$ to a series of LSTM layers, LSTM1 to LSTM3, which capture the temporal dependencies within the sequence. The outputs of these three LSTM layers are denoted as $H_{t,1}$, $H_{t,2}$, and $H_{t,3}$, respectively. Following the LSTM3 layer, a repeat vector layer uses $H_{t,3}$, a one-dimensional vector, to generate a matrix $V_{rep,2}$ that repeats the information from $H_{t,3}$. The sequence then progresses to the LSTM4 layer, which processes $V_{rep,2}$ to produce the output $H_{t,4}$.
The output $H_{t,1}$ of the LSTM1 layer is additionally passed through a fully connected (FC) layer, as follows:

$$H_{FC} = \sigma_{m1}\left(W_{m1} H_{t,1} + b_{m1}\right),$$

where $\sigma_{m1}$ denotes the sigmoid function, $W_{m1}$ is the weight of the layer, and $b_{m1}$ is the bias. $H_{FC}$ is the output of the FC layer.
Next, the network introduces an attention mechanism module. It employs the output $H_{FC}$ from the FC layer as the Query ($Q$) and the output $H_{t,4}$ from the LSTM4 layer as both the Key ($K$) and the Value ($V$). The attention mechanism is a crucial component that enables the model to focus on the parts of the input sequence that are relevant for the classification task. The attention output is generated through a sequence of operations, including a dot product, scaling, and the Softmax function, producing a weighted sum whose weights represent the relative importance of each segment of the input sequence. The process of the attention module is delineated in Equations (16)–(19):
$$S_{attn} = Q K^{T} = H_{FC} H_{t,4}^{T}, \tag{16}$$

$$S_{attn,scaled} = \frac{S_{attn}}{\sqrt{d_K}}, \tag{17}$$

$$\alpha = \mathrm{softmax}\left(W_s S_{attn,scaled} + b_s\right), \tag{18}$$

$$A = \alpha V = \alpha H_{t,4}, \tag{19}$$

where Equation (16) computes the attention scores through a dot product between $H_{FC}$ ($Q$) and $H_{t,4}^{T}$ ($K^{T}$), yielding the attention score matrix $S_{attn}$. In Equation (17), $S_{attn,scaled}$ is $S_{attn}$ scaled by $\sqrt{d_K}$, where $d_K$ is the dimension of the Key. $W_s$ is the weight and $b_s$ is the bias. Equation (18) normalizes these scores using the Softmax function to form the attention weights $\alpha$. In Equation (19), $A$ represents the output of the attention module.
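Equations (16)–(19) correspond to the following scaled dot-product computation, sketched in NumPy. Treating `W_s`/`b_s` as an optional learnable transform applied to the scaled scores is our reading of Equation (18):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))  # numerically stable
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, kv, W_s=None, b_s=0.0):
    """Scaled dot-product attention per Equations (16)-(19): q is the
    FC output (Query); kv serves as both Key and Value."""
    d_k = kv.shape[-1]
    scores = q @ kv.T / np.sqrt(d_k)      # (16) dot product, (17) scaling
    if W_s is not None:
        scores = scores @ W_s + b_s       # learnable transform in (18)
    alpha = softmax(scores, axis=-1)      # (18) attention weights
    return alpha @ kv                     # (19) weighted sum of Values
```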
After passing through the attention layer, the data reach the LSTM5 layer, resulting in an output denoted as $H_{t,5}$. Subsequently, the data are processed through an FC layer, which outputs a 7-dimensional vector, denoted as $Z = \left[ z_1, z_2, z_3, z_4, z_5, z_6, z_7 \right]$. For the classification task, the final output layer utilizes the Softmax function to calculate the probabilities for each class:

$$\mathrm{softmax}(z_i) = \frac{e^{z_i}}{\sum_{j=1}^{7} e^{z_j}},$$

$$\hat{Y}_2 = \mathrm{softmax}\left(\left[ z_1, z_2, z_3, z_4, z_5, z_6, z_7 \right]\right),$$

where $\hat{Y}_2$ is the output of the model.
One-hot encoding is used for the actual labels, organized into a one-dimensional vector. To measure the prediction error, the model employs the Categorical Cross-Entropy (CCE) loss for multi-class classification:

$$CCE = -\sum_{n=1}^{N} \sum_{m=1}^{M} y_{2,nm} \log\left(\hat{y}_{2,nm}\right),$$

where $N$ is the total number of samples, $M$ is the number of classes, $y_{2,nm}$ is the actual class label, and $\hat{y}_{2,nm}$ is the predicted probability.
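A direct sketch of the CCE loss above, with one-hot labels; the clipping constant is an added implementation detail to keep `log` finite:

```python
import numpy as np

def cce(Y_true, Y_pred, eps=1e-12):
    """Categorical cross-entropy summed over N samples and M classes,
    matching the loss equation; Y_true is one-hot encoded."""
    P = np.clip(Y_pred, eps, 1.0)
    return float(-np.sum(Y_true * np.log(P)))
```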

5. Experiment and Evaluation

5.1. Experiment Setup and Evaluation Metric

The two sub-models proposed in this work are validated on two datasets. The first includes original CAN-FD frame data from a real vehicle with L2 autonomous driving capability, using the device shown in Figure 7. The second dataset, known as the CAN-FD Intrusion Dataset, contains Flooding, Fuzzing, and Malfunction attacks and was developed by the Hacking and Countermeasure Research Lab (HCRL) in Korea. Additionally, we adopted the LSTM-based IDS with superior performance proposed by M. D. Hossain et al. [31] and the HyDL-IDS based on CNN-LSTM proposed by Lo et al. [43] for contrast. Our experimental setup is conducted using neural network models developed in the Python-based Keras framework, with computational support from the NVIDIA GEFORCE RTX 4090 GPU. The parameters chosen in the experiment are shown in Table 1.
In this study, the performance of the CAN-FD bus network attack detection method is assessed using several metrics, including accuracy, precision, recall rate, specificity, F1 score [44], and Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve. These models will be evaluated using these criteria in the subsequent analysis.
Accuracy measures the proportion of true results (both true positives and true negatives) among the total number of cases examined, calculated as $Accuracy = \frac{TP + TN}{TP + TN + FP + FN}$, where TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively. Precision, or positive predictive value, quantifies the accuracy of positive predictions: $Precision = \frac{TP}{TP + FP}$. Recall, also known as sensitivity, measures the ability of a model to identify all relevant instances: $Recall = \frac{TP}{TP + FN}$. Specificity, indicating the true negative rate, reflects the model's ability to identify negative results: $Specificity = \frac{TN}{TN + FP}$. Finally, the F1 score is the harmonic mean of precision and recall, $F1\ Score = \frac{2 \times Precision \times Recall}{Precision + Recall}$, offering a balance between the two and capturing the accuracy of the model on skewed datasets or where false negatives and false positives differ in cost.
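The five metrics reduce to simple arithmetic on the confusion-matrix counts; a small helper (our own, for illustration) makes the definitions concrete:

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute the evaluation metrics defined above from the four
    confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    specificity = tn / (tn + fp)
    f1 = 2 * precision * recall / (precision + recall)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "specificity": specificity, "f1": f1}
```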

5.2. Experiment with ADDM

The loss curves of the ADDM model on both datasets, the actual vehicle and HCRL, are depicted in Figure 8. The curves show that the losses on the two datasets progressively decrease and eventually converge. In Figure 9, the ROC curve for the ADDM model is presented. Notably, the AUC attains a value of 1 for both datasets, signifying ideal classification of all instances. This underscores the efficacy of the proposed ADDM model in accurately classifying attacks on the CAN-FD bus network and indicates its robustness in this application domain.
As shown in Table 2, the evaluation metrics of three distinct models, namely ADDM, the LSTM-based model [31], and HyDL-IDS [43], are juxtaposed against two datasets: the Real-vehicle dataset and the HCRL dataset. On the Real-vehicle dataset, the ADDM model demonstrates remarkable performance, achieving an accuracy of 0.9995, a precision of 0.9997, an F1 score of 0.9997, a TNR of 0.9992, and a recall rate of 0.9996. These results notably surpass those of the LSTM-based model, which records an accuracy of 0.9900, a precision of 0.9918, an F1 score of 0.9932, a TNR of 0.9771, and a recall of 0.9946. Furthermore, ADDM’s metrics also exceed those of the HyDL-IDS experiment, which posts an accuracy of 0.9935, a precision of 0.9958, an F1 score of 0.9956, a TNR of 0.9882, and a recall rate of 0.9955. On the HCRL dataset, all three models reach uniformly high scores of 1.0000 or 0.9999 in accuracy, with the ADDM model demonstrating a marginally lower FPR compared with both the LSTM-based model and HyDL-IDS. The results indicate that the accuracy of ADDM on the real-vehicle dataset improves by approximately 0.50% compared with the LSTM-based model and by about 0.41% compared with the HyDL-IDS model.
The confusion matrices reveal specific outcomes for binary classification using ADDM, the LSTM-based model, and HyDL-IDS. Evidently, in Figure 10, (a) registers an FP count of 12 and an FN count of 16, substantially lower than the 367 FPs and 241 FNs depicted in (c) and also lower than the 189 FPs and 204 FNs depicted in (e). When correlating these findings with the parameters described in Table 2, it is apparent that our ADDM model demonstrates superior performance on the Real-vehicle dataset compared with the LSTM-based model and HyDL-IDS. On the HCRL dataset, the analysis of classified sample outcomes in (b), (d), and (f) of Figure 10, combined with the data in Table 2, reveals that the ADDM model marginally outperforms the LSTM-based model and HyDL-IDS, recording only 11 FPs and no FNs, fewer than the 14 FPs and 2 FNs seen in the LSTM-based model and significantly fewer than the 109 FPs and 42 FNs observed in the result of HyDL-IDS.
The training durations for the ADDM and the two comparative models, including the LSTM-based model and HyDL-IDS, are similar. On the real-vehicle dataset, the inference times for ADDM, the LSTM-based model, and HyDL-IDS are 0.137 ms, 0.105 ms, and 0.119 ms, respectively. On the HCRL dataset, the inference times for ADDM, the LSTM-based model, and HyDL-IDS are 0.102 ms, 0.078 ms, and 0.081 ms, respectively.

5.3. Experiment with ACDM

The loss curves of the ACDM model on both datasets (actual vehicle and HCRL) are depicted in Figure 11. From the curve, it is observed that the losses in the two models progressively decrease and eventually tend to converge.
Table 3 delineates the performance metrics of the ACDM, LSTM-based model, and HyDL-IDS against various types of network attacks within the two datasets mentioned before. In the real-vehicle dataset, for Replay attacks, the ACDM model achieves detection results of 0.9994, 0.9939, 0.9953, and 0.9996 for accuracy, precision, recall, specificity, and an F1 score of 0.9946, surpassing the performance of an LSTM-based model, which recorded an accuracy of 0.9850, a precision of 0.9447, a recall of 0.7916, and an F1 score of 0.8614, as well as surpassing the HyDL-IDS model, which scored 0.9893 in accuracy, 0.8472 in recall, and 0.9027 in F1 score. For DoS attacks, all three models achieve scores of 1.0000 across all metrics. For Fuzzing attacks, ACDM demonstrated results of 0.9987, 0.9795, 0.9974, 0.9987, and 0.9884 for accuracy, precision, recall, specificity, and F1 score, respectively, exceeding the LSTM-based model’s precision of 0.9570 and accuracy of 0.9966, as well as the HyDL-IDS model’s accuracy of 0.9971 and precision of 0.9614. In the case of Spoofing attacks, ACDM’s performance includes scores of 0.9988, 0.9935, 0.9629, 0.9998, and 0.9780, outperforming the LSTM-based model, which achieved an accuracy of 0.9964 and a recall of 0.9041, and the HyDL-IDS model, which recorded an accuracy of 0.9973 and a recall of 0.9269. For Scaling attacks, ACDM attains metrics of 0.9996, 0.9939, 0.9917, 0.9998, and 0.9928, superior to the LSTM-based model’s accuracy of 0.9991 and recall of 0.9041, and better than the HyDL-IDS results of 0.9991 in accuracy and 0.9824 in precision. For Ramp attacks, ACDM reaches 0.9998, 0.9995, 0.9954, 0.9999, and 0.9975 in accuracy, precision, recall, specificity, and F1 score, surpassing the LSTM-Based model’s recall of 0.9888 and F1 score of 0.9893 and also outperforming the HyDL-IDS model’s recall of 0.9863.
As is shown in Table 3, on the HCRL dataset, for Flooding attacks, all models achieve scores of 1.0000 across all metrics. For Fuzzing attacks, ACDM achieves 0.9999, 0.9999, 1.0000, 0.9999, and 0.9999 in accuracy, precision, recall, specificity, and F1 score, exceeding the LSTM-Based model’s precision of 0.9996 and recall of 0.9997 and surpassing the HyDL-IDS model’s recall of 0.9989. For Malfunction attacks, ACDM achieves perfect scores of 1.0000 across all evaluated metrics, matching the performance of the LSTM-based model, which also attains a score of 1.0000 on every metric. And the performance of ACDM is superior to that of the HyDL-IDS, which scored 0.9999 on all metrics.
On the Real-vehicle dataset, ACDM demonstrates significant performance advantages in handling Replay, DoS, Fuzzing, Spoofing, Scaling, and Ramp attack types. Across crucial performance metrics including accuracy, precision, recall, specificity, and F1 score, the ACDM model surpasses those based on LSTM and HyDL-IDS. Upon evaluation with the HCRL dataset, all models perform well in Flooding and Malfunction attack detection, yet the ACDM maintained a slight edge over the LSTM-based model and HyDL-IDS in Fuzzing attack detection. This comparative analysis reveals the ACDM’s superior detection capabilities, particularly within the Real-vehicle dataset, suggesting its enhanced robustness in identifying nuanced network anomalies.
Figure 12 displays the results of multi-classification with the labels in subplots (a), (b), and (c), which correspond to different types of CAN-FD frame data. These subplots express the quantitative relationship between the predicted and actual class labels. It can be seen that the ACDM model outperforms the LSTM-based model and HyDL-IDS in correctly classifying positive samples.
Figure 13, Figure 14 and Figure 15 consist of subplots that demonstrate the comparative classification outcomes for the three models when applied to the real-vehicle dataset across various attack categories. Specifically, for the Replay attack, subplot (a) in Figure 14 illustrates 746 FNs and 166 FPs, and subplot (a) in Figure 15 shows 547 FNs and 107 FPs. These figures significantly exceed the 17 FNs and 22 FPs recorded by the ACDM model in its respective subplot, highlighting ACDM's superior classification capability for Replay attacks. In the context of DoS attacks, subplots (b) of Figure 13, Figure 14 and Figure 15 all exhibit 0 FN and FP instances, indicating effective DoS anomaly detection by all models. For Fuzzing attacks, subplots (c) of Figure 14 and Figure 15 reveal a higher occurrence of FNs and FPs in the LSTM-based model, with 52 FNs and 160 FPs, and in the HyDL-IDS model, with 36 FNs and 139 FPs, compared with the 9 FNs and 73 FPs reported in subplot (c) of Figure 13. This underscores the enhanced efficiency of the ACDM model in detecting Fuzzing anomalies. Spoofing anomaly detection, as depicted in subplot (d) of Figure 13, shows fewer FNs and FPs, specifically 65 FNs and 11 FPs, compared with 168 FNs and 51 FPs in subplot (d) of Figure 14 and 128 FNs and 39 FPs in subplot (d) of Figure 15, indicating ACDM's superior performance in discerning Spoofing anomalies. Regarding Scaling attacks, ACDM's results include 15 FNs and 11 FPs, fewer than the 17 FNs and 38 FPs observed in the LSTM-based model and also fewer than the 21 FNs and 32 FPs in the HyDL-IDS model. For Ramp attacks, ACDM's outcomes feature 9 FNs and 1 FP, fewer than the 20 FNs and 22 FPs in the LSTM-based model and the 27 FNs and 9 FPs in the HyDL-IDS model, further illustrating the robustness of ACDM in classifying various cyber threats.
A low recall rate in the LSTM-based model and HyDL-IDS for Replay is attributed to a high quantity of FNs, signifying those numerous actual positive cases remain undetected by the model. For anomaly detection, the comprehension of an event’s context within its sequence is critical. Replay anomalies are characterized by their repetitive nature and thus possess strong contextual correlations, which the LSTM-based model and HyDL-IDS, devoid of the attention mechanism inherent in ACDM, fail to sufficiently capture, resulting in an elevated count of missed positive samples. The distinct characteristics of DoS anomalies, marked by their high-priority features, are accurately identified by all the models without any false positives. When addressing Fuzzing and Spoofing anomalies, the ACDM model’s enhanced feature learning and contextual understanding surpass the LSTM-based model, leading to the superior performance of ACDM. Conclusively, the ACDM demonstrates better detection effectiveness on the real-vehicle dataset and has a broader applicability when compared with the LSTM-based model and HyDL-IDS.
In addition to the previous analysis, three methods are compared on the CAN-FD Intrusion Dataset from HCRL. As shown in the multi-class results of Figure 16, data are presented for four labels: Benign, Flooding, Fuzzing, and Malfunction.
Figure 17, Figure 18 and Figure 19 show the detection results for each type of attack message in multi-class anomaly detection. Comparing each subplot with the data in Table 3, for Flooding attacks, both ACDM and the LSTM-based model have no FNs or FPs, whereas HyDL-IDS has one FN. For Fuzzing attacks, ACDM records zero FNs and six FPs, fewer than the LSTM-based model, which records 25 FNs and 38 FPs, and significantly fewer than HyDL-IDS, which detects 107 FNs and 22 FPs. For Malfunction attacks, ACDM's results include two FNs and no FPs, matching the detection performance of the LSTM-based model and HyDL-IDS. This indicates that our model achieves better performance than the LSTM-based model and HyDL-IDS in detecting and classifying anomalies in large-scale datasets containing Flooding, Fuzzing, and Malfunction attacks.
The integration of LSTM with attention mechanisms, as compared with using LSTM alone and HyDL-IDS, offers significant advantages in the context of multi-classification anomaly detection problems. Firstly, the attention mechanism enhances the model’s ability to focus on specific parts of the input sequence that are more relevant for making a decision. This is particularly useful in anomaly detection, where certain patterns or anomalies might be subtle and not uniformly distributed throughout the data. Secondly, the combination allows the model to capture long-term dependencies more effectively. While LSTM units are adept at handling such dependencies, the attention layer provides an additional filter to emphasize the most critical information over longer sequences, which can be crucial in identifying complex or temporally distant anomalies. Thirdly, the LSTM with attention mechanism can improve interpretability. At last, this approach can lead to improvements in accuracy and efficiency. By directing the model’s focus to key areas, the attention mechanism can reduce the influence of irrelevant data, potentially leading to better performance in detecting anomalies.
The training times for the ACDM and the two comparative models, the LSTM-based model and HyDL-IDS, are closely aligned. On the real-vehicle dataset, the inference times for ACDM, the LSTM-based model, and HyDL-IDS are 0.141 ms, 0.099 ms, and 0.123 ms, respectively. On the HCRL dataset, the inference times for ACDM, the LSTM-based model, and HyDL-IDS are 0.126 ms, 0.070 ms, and 0.093 ms, respectively.

5.4. Discussion

In the detection of anomalies in CAN-FD message frames, models based on attention and LSTM offer unique advantages compared with those solely based on LSTM or the CNN-LSTM-based model HyDL-IDS. Firstly, the attention mechanism allows the model to more effectively focus on key features within anomalous behaviors because it dynamically adjusts the model’s focus on different parts of the input data, thereby enhancing the model’s precision in recognizing abnormal patterns. In contrast, traditional LSTM models, while capable of capturing long-term dependencies in time-series data, may lack sufficient flexibility when faced with complex or subtle anomalies. The CNN-LSTM-based model HyDL-IDS, although it strengthens feature extraction through convolutional layers, does not match the dynamic adjustment capabilities of attention-based models in terms of temporal correlations within the sequence. Thus, models combining attention and LSTM not only retain the advantages of LSTM in processing time-series data but also enhance the ability to identify critical features in abnormal signals through the attention mechanism, providing more accurate and flexible anomaly detection performance.
Compared with the baseline methods, the approach proposed in this paper conducts refined multi-class anomaly detection that includes six types of attacks rather than the inference for individual attack types as done by the comparative methods. By incorporating the attention mechanism, the detection accuracy is successfully improved for Replay attacks, significantly reducing FNs and FPs. The approach proposed in this paper achieves ideal results for DoS attacks, comparable to the comparative methods. For Fuzzing, Spoofing, Scaling, and Ramp attacks, our method improves detection performance by combining the attention mechanism with LSTM and stacking LSTM layers. Besides using a real-vehicle dataset that we collected and injected with attacks, the model proposed in this work is also validated on the HCRL-provided open-source CAN-FD Intrusion Dataset, where results show that our proposed method outperforms the comparative methods.

6. Conclusions

This study proposes two anomaly detection models based on LSTM and temporal attention mechanisms for the CAN-FD bus, focusing on the temporal characteristics of CAN-FD frames and aiming to identify anomalous data and the type of attack. The first sub-model, ADDM, utilizes stacked LSTM layers to perform binary classification of normal and anomalous data within the CAN-FD dataset, thereby detecting external attacks. The second sub-model, ACDM, combines LSTM with an attention mechanism and capitalizes on the characteristics of various anomalous messages to classify messages within the CAN-FD dataset, thus identifying the types of external attacks. Compared with the existing advanced LSTM-based model and HyDL-IDS, our method exhibits superior detection performance, achieving improvements on both the real-vehicle dataset and the CAN-FD Intrusion Dataset from HCRL. For binary normal/abnormal anomaly detection on the real-vehicle dataset, ADDM achieves a classification accuracy 0.50% higher than the LSTM-based model and 0.41% higher than HyDL-IDS; on the HCRL dataset, the classification accuracy of ADDM matches that of the LSTM-based model and HyDL-IDS, all at 99.99%. For the refined six-category multiclass anomaly detection on the real-vehicle dataset, ACDM achieves a maximum accuracy improvement of 1.44% compared with the LSTM-based model and 1.01% compared with HyDL-IDS. On the HCRL dataset, the average multiclass detection accuracy of ACDM is on par with the LSTM-based model, both at 100%, and 0.01% higher than that of HyDL-IDS. The lower FN and FP counts further demonstrate the broader applicability of ACDM. Based on statistics on automotive attack vectors over the past decade, attacks directly or indirectly through the CAN/CAN-FD bus account for approximately 13.61% of all external attacks, and this study can mitigate about 9% of them.
In the future, in order to delve deeper into anomaly detection and its countermeasures regarding the physical significance of anomalous signals, we aim to develop a more adaptive framework capable of dynamically adjusting to new types of network threats. This will allow us to maintain robust defense mechanisms as automotive technologies continue to evolve. To achieve these objectives, we will use the causal relationships of CAN/CAN-FD messages as inputs and learn the structural and positional relationships between graph nodes through graph structures. By utilizing transformers, we aim to carry out predictions, classifications, or scoring mechanisms to fulfill the purposes of anomaly detection and defense.

Author Contributions

Author Contributions: Conceptualization, F.G.; methodology, R.Z.; software, J.L.; validation, Y.L.; formal analysis, F.G.; investigation, R.Z.; resources, F.G.; data curation, J.L.; writing—original draft preparation, J.L.; writing—review and editing, R.Z.; visualization, F.G.; supervision, Z.G.; project administration, Z.G.; funding acquisition, F.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science Foundation of China under Grants 52202494 and 52202495.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

Yingqi Liu is employed by Mobje Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1. de Andrade, R.; Santos, M.M.D.; Justo, J.F.; Yoshioka, L.R.; Hof, H.-J.; Kleinschmidt, J.H. Security Architecture for Automotive Communication Networks with CAN FD. Comput. Secur. 2023, 129, 103203.
2. Wang, C.; Xu, X.; Xiao, K.; He, Y.; Yang, G. Traffic Anomaly Detection Algorithm for CAN Bus Using Similarity Analysis. High-Confid. Comput. 2024, 1, 100207.
3. Tanksale, V. Intrusion detection system for controller area network. Cybersecurity 2024, 7, 4.
4. Rajapaksha, S.; Kalutarage, H.; Al-Kadri, M.O.; Petrovski, A.; Madzudzo, G.; Cheah, M. AI-based intrusion detection systems for in-vehicle networks: A survey. ACM Comput. Surv. 2023, 55, 1–40.
5. Seo, E.; Song, H.M.; Kim, H.K. GIDS: GAN Based Intrusion Detection System for In-Vehicle Network. In Proceedings of the 2018 16th Annual Conference on Privacy, Security and Trust (PST), Belfast, UK, 28–30 August 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–6.
6. Han, M.L.; Kwak, B.I.; Kim, H.K. Anomaly Intrusion Detection Method for Vehicular Networks Based on Survival Analysis. Veh. Commun. 2018, 14, 52–63.
7. Jia, J.; Xu, Y.; Wu, Y.; Xu, Y.; Fan, Y.; Zhang, D. Intelligent Connected Vehicle CAN-FD Bus Network Security Protocol. In Proceedings of the 2023 International Conference on Mobile Internet, Cloud Computing and Information Security (MICCIS), Nanjing, China, 7–9 April 2023; pp. 57–62.
8. Lin, C.W.; Sangiovanni-Vincentelli, A. Cyber-security for the controller area network (CAN) communication protocol. In Proceedings of the 2012 International Conference on Cyber Security, Alexandria, VA, USA, 14–16 December 2012; pp. 1–7.
9. Lee, H.; Jeong, S.H.; Kim, H.K. OTIDS: A novel intrusion detection system for in-vehicle network by using remote frame. In Proceedings of the 2017 15th Annual Conference on Privacy, Security and Trust (PST), Calgary, AB, Canada, 28–30 August 2017; pp. 57–5709.
10. Islam, R.; Refat, R.U.D. Improving CAN bus security by assigning dynamic arbitration IDs. J. Transp. Secur. 2020, 13, 19–31.
  11. Koscher, K.; Czeskis, A.; Roesner, F.; Patel, S.; Kohno, T.; Checkoway, S.; McCoy, D.; Kantor, B.; Anderson, D.; Shacham, H.; et al. Experimental security analysis of a modern automobile. In Proceedings of the 2010 IEEE Symposium on Security and Privacy, Oakland, CA, USA, 16–19 May 2010; pp. 447–462. [Google Scholar] [CrossRef]
  12. Iehira, K.; Inoue, H.; Ishida, K. Spoofing Attack Using Bus-Off Attacks Against a Specific ECU of the CAN Bus. In Proceedings of the 2018 15th IEEE Annual Consumer Communications & Networking Conference (CCNC), Las Vegas, NV, USA, 12–15 January 2018; pp. 1–4. [Google Scholar] [CrossRef]
  13. Prasad, S. Counteractive Control against Cyber-Attack Uncertainties on Frequency Regulation in the Power System. IET Cyber-Phys. Syst. Theory Appl. 2020, 5, 394–408. [Google Scholar] [CrossRef]
  14. Elsisi, M.; Tran, M.-Q. Development of an IoT Architecture Based on a Deep Neural Network against Cyber Attacks for Automated Guided Vehicles. Sensors 2021, 21, 8467. [Google Scholar] [CrossRef] [PubMed]
  15. Kleberger, P.; Olovsson, T.; Jonsson, E. Security Aspects of the In-Vehicle Network in the Connected Car. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 528–533. [Google Scholar] [CrossRef]
  16. Upstream Security. 2020 Global Automotive Cybersecurity Report. Netw. Secur. 2020, 2020, 4. [Google Scholar] [CrossRef]
  17. Heidari, A.; Jabraeil Jamali, M.A. Internet of Things intrusion detection systems: A comprehensive review and future directions. Clust. Comput. 2023, 26, 3753–3780. [Google Scholar] [CrossRef]
  18. Cui, J.; Zong, L.; Xie, J.; Tang, M. A novel multi-module integrated intrusion detection system for high-dimensional imbalanced data. Appl. Intell. 2023, 53, 272–288. [Google Scholar] [CrossRef] [PubMed]
  19. Sun, H.; Chen, M.; Weng, J.; Liu, Z.; Geng, G. Anomaly Detection for In-Vehicle Network Using CNN-LSTM with Attention Mechanism. IEEE Trans. Veh. Technol. 2021, 70, 10880–10893. [Google Scholar] [CrossRef]
  20. UN Regulation No. 155. Cyber Security and Cyber Security Management System; United Nations Economic Commission for Europe: Geneva, Switzerland, 2021; Available online: https://unece.org/sites/default/files/2021-03/R155e.pdf (accessed on 3 April 2021).
  21. ISO/SAE 21434:2021; Road Vehicles—Cybersecurity Engineering. International Organization for Standardization: Geneva, Switzerland, 2021. Available online: https://www.iso.org/standard/70918.html (accessed on 30 April 2024).
  22. U.S. Department of Transportation, National Highway Traffic Safety Administration. Cybersecurity Best Practices for the Safety of Modern Vehicles. Available online: https://www.nhtsa.gov/sites/nhtsa.gov/files/2022-09/cybersecurity-best-practices-safety-modern-vehicles-2022-tag.pdf (accessed on 30 April 2024).
  23. Halder, S.; Conti, M.; Das, S.K. Coids: A Clock Offset Based Intrusion Detection System for Controller Area Networks. In Proceedings of the 21st International Conference on Distributed Computing and Networking, Kolkata, India, 4–7 January 2020; pp. 1–10. [Google Scholar] [CrossRef]
  24. Olufowobi, H.; Young, C.; Zambreno, J.; Bloom, G. SAIDuCANT: Specification-Based Automotive Intrusion Detection Using Controller Area Network (CAN) Timing. IEEE Trans. Veh. Technol. 2020, 69, 1484–1494. [Google Scholar] [CrossRef]
  25. Ji, H.; Wang, Y.; Qin, H.; Wu, X.; Yu, G. Investigating the Effects of Attack Detection for In-Vehicle Networks Based on Clock Drift of ECUs. IEEE Access 2018, 6, 49375–49384. [Google Scholar] [CrossRef]
  26. Narayanan, S.N.; Mittal, S.; Joshi, A. Using Data Analytics to Detect Anomalous States in Vehicles. arXiv 2015, arXiv:1512.08048. [Google Scholar] [CrossRef]
  27. Guo, F.; Wang, Z.; Du, S.; Li, H.; Zhu, H.; Pei, Q.; Cao, Z.; Zhao, J.; Guo, F. Detecting Vehicle Anomaly in the Edge via Sensor Consistency and Frequency Characteristic. IEEE Trans. Veh. Technol. 2019, 68, 5618–5628. [Google Scholar] [CrossRef]
  28. Wasicek, A.; Burakova, Y. Context-aware Intrusion Detection in Automotive Control Systems. In Proceedings of the 5th ESCAR USA Conference, Detroit, MI, USA, 21–22 June 2017. [Google Scholar]
  29. Wang, C.; Zhao, Z.; Gong, L.; Zhu, L.; Liu, Z.; Cheng, X. A Distributed Anomaly Detection System for In-Vehicle Network Using HTM. IEEE Access 2018, 6, 9091–9098. [Google Scholar] [CrossRef]
  30. Taylor, A.; Leblanc, S.; Japkowicz, N. Anomaly Detection in Automobile Control Network Data with Long Short-Term Memory Networks. In Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), Montreal, QC, Canada, 17–19 October 2016; pp. 130–139. [Google Scholar] [CrossRef]
  31. Hossain, M.D.; Inoue, H.; Ochiai, H.; Fall, D.; Kadobayashi, Y. LSTM-Based Intrusion Detection System for In-Vehicle CAN Bus Communications. IEEE Access 2020, 8, 185489–185502. [Google Scholar] [CrossRef]
  32. Chen, S.; Jin, G.; Ma, X. Detection and Analysis of Real-Time Anomalies in Large-Scale Complex System. Measurement 2021, 184, 109929. [Google Scholar] [CrossRef]
  33. Qin, H.; Yan, M.; Ji, H. Application of Controller Area Network (CAN) Bus Anomaly Detection Based on Time Series Prediction. Veh. Commun. 2021, 27, 100291. [Google Scholar] [CrossRef]
  34. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  35. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is All You Need. Adv. Neural Inf. Process. Syst. 2017, 30, 1–11. [Google Scholar]
  36. CAN-FD Intrusion Dataset. Hacking and Countermeasure Research Lab (HCRL). Available online: https://ocslab.hksecurity.net/Datasets/can-fd-intrusion-dataset (accessed on 21 July 2022).
  37. Kishore, C.R.; Rao, D.C.; Nayak, J.; Behera, H.S. Intelligent Intrusion Detection Framework for Anomaly-Based CAN Bus Network Using Bidirectional Long Short-Term Memory. J. Inst. Eng. (India) Ser. B 2024. [Google Scholar] [CrossRef]
  38. Yan, W.; Wei, D.; Fu, B.; Zhang, H.; Liu, Q. A Mixed-Criticality Traffic Scheduler with Mitigating Congestion for CAN-to-TSN Gateway. ACM Trans. Des. Autom. Electron. Syst. 2024. [Google Scholar] [CrossRef]
  39. Park, J.; Lee, M.; Park, S.; Jeong, Y.; Kim, H.; Choi, S.; Song, S. Controller Area Network with Flexible Data Rate (CAN FD) Eye Diagram Prediction. IEEE Trans. Electromagn. Compat. 2024. [Google Scholar] [CrossRef]
  40. Han, M.L.; Kwak, B.I.; Kim, H.K. Event-Triggered Interval-Based Anomaly Detection and Attack Identification Methods for an In-Vehicle Network. IEEE Trans. Inf. Forensics Secur. 2021, 16, 2941–2956. [Google Scholar] [CrossRef]
  41. Hoppe, T.; Kiltz, S.; Dittmann, J. Security threats to automotive CAN networks—Practical examples and selected short-term countermeasures. Reliab. Eng. Syst. Saf. 2011, 96, 11–25. [Google Scholar] [CrossRef]
  42. Kwak, B.I.; Han, M.L.; Kim, H.K. Cosine Similarity Based Anomaly Detection Methodology for the CAN Bus. Expert Syst. Appl. 2021, 166, 114066. [Google Scholar] [CrossRef]
  43. Lo, W.; Alqahtani, H.; Thakur, K.; Almadhor, A.; Chander, S.; Kumar, G. A Hybrid Deep Learning Based Intrusion Detection System Using Spatial-Temporal Representation of In-Vehicle Network Traffic. Veh. Commun. 2022, 35, 100471. [Google Scholar] [CrossRef]
  44. Scikit-Learn Project. Receiver Operating Characteristic (ROC). Available online: https://scikit-learn.org/stable/auto_examples/model_selection/plot_roc.html (accessed on 3 March 2020).
Figure 1. In-vehicle CAN/CAN-FD architecture and attack surface.
Figure 2. The structure of the CAN-FD frame.
Figure 3. The four types of attacks in this work are (a) Replay; (b) DoS; (c) Fuzzing; and (d) Spoofing.
Figure 4. The overview diagram of dual models. The framework is composed of data collection, anomaly injection, data preprocessing, and detection models, including ADDM and ACDM.
Figure 5. The overview of ADDM architecture.
Figure 6. The overview of ACDM architecture.
Figure 7. Collection of authentic CAN-FD datasets from a real vehicle.
Figure 8. Training loss of the ADDM model on the datasets from a real vehicle and HCRL: (a) dataset from the actual vehicle; (b) dataset from HCRL.
Figure 9. ROC curve of the dataset from the actual vehicle and HCRL.
Figure 10. (a) Confusion matrix for ADDM trained on the dataset from the actual vehicle; (b) confusion matrix for ADDM trained on the dataset from HCRL; (c) confusion matrix for LSTM-based model trained on the dataset from the actual vehicle; (d) confusion matrix for LSTM-based model trained on the dataset from HCRL; (e) confusion matrix for HyDL-IDS trained on the dataset from the actual vehicle; (f) confusion matrix for HyDL-IDS trained on the dataset from HCRL.
Figure 11. Training loss of the ACDM model on the datasets from a real vehicle and HCRL: (a) dataset from the actual vehicle; (b) dataset from HCRL.
Figure 12. Confusion matrix for the model trained on the dataset from an actual vehicle: (a) ACDM; (b) LSTM-based model; and (c) HyDL-IDS.
Figure 13. The six figures are the classification results of ACDM trained on the dataset from the actual vehicle: (a) confusion matrix about Replay; (b) confusion matrix about DoS; (c) confusion matrix about Fuzzing; (d) confusion matrix about Spoofing; (e) confusion matrix about Scaling attack; and (f) confusion matrix about Ramp attack.
Figure 14. The six figures are the classification results of the LSTM-based model trained on the dataset from the actual vehicle: (a) confusion matrix about Replay; (b) confusion matrix about DoS; (c) confusion matrix about Fuzzing; (d) confusion matrix about Spoofing; (e) confusion matrix about Scaling attack; and (f) confusion matrix about Ramp attack.
Figure 15. The six figures are the classification results of HyDL-IDS trained on the dataset from the actual vehicle: (a) confusion matrix about Replay; (b) confusion matrix about DoS; (c) confusion matrix about Fuzzing; (d) confusion matrix about Spoofing; (e) confusion matrix about Scaling attack; and (f) confusion matrix about Ramp attack.
Figure 16. Confusion matrix for the model trained on the dataset from HCRL: (a) ACDM; (b) LSTM-based model; and (c) HyDL-IDS.
Figure 17. The classification results of ACDM trained on the dataset from HCRL: (a) confusion matrix about Flooding; (b) confusion matrix about Fuzzing; and (c) confusion matrix about Malfunction.
Figure 18. The classification results of the LSTM-based model trained on the dataset from HCRL: (a) confusion matrix about Flooding; (b) confusion matrix about Fuzzing; and (c) confusion matrix about Malfunction.
Figure 19. The classification results of HyDL-IDS trained on the dataset from HCRL: (a) confusion matrix about Flooding; (b) confusion matrix about Fuzzing; and (c) confusion matrix about Malfunction.
Table 1. The configuration of model parameters.

Parameters        | ADDM                 | ACDM
Learning Rate     | 10^-3                | 10^-4
Epochs            | 100                  | 100
Batch Size        | 128                  | 128
LSTM Unit Number  | 256, 256, 64, 64, 32 | 512, 512, 128, 128, 64
Dropout           | 0.2                  | 0.2
Window Size       | 4                    | 4
Stride            | 1                    | 1
Optimizer         | Adam                 | Adam
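Table 1's window size of 4 and stride of 1 imply a sliding-window preprocessing step over the chronological frame sequence. A minimal sketch of that step (the function name and toy feature vectors are our assumptions, not code from the paper):

```python
def sliding_windows(frames, window_size=4, stride=1):
    """Split a chronological sequence of CAN-FD frame feature vectors
    into fixed-length overlapping windows for the LSTM input layer,
    using the window size and stride from Table 1."""
    return [frames[i:i + window_size]
            for i in range(0, len(frames) - window_size + 1, stride)]

frames = [[float(i)] for i in range(6)]  # six toy frames, one feature each
windows = sliding_windows(frames)
# With window size 4 and stride 1, six frames yield three windows:
# frames 0-3, frames 1-4, and frames 2-5.
```

A stride of 1 means consecutive windows overlap in all but one frame, so every frame is seen in up to four temporal contexts during training.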
Table 2. Results and evaluation metrics of ADDM, the LSTM-based model, and HyDL-IDS on the different datasets.

Dataset              | Model            | TPR (Recall) | FPR       | TNR (Specificity) | FNR       | Accuracy | Precision | F1 Score
Real-vehicle dataset | ADDM             | 0.9996       | 0.0008    | 0.9992            | 0.0004    | 0.9995   | 0.9997    | 0.9997
                     | LSTM-based model | 0.9946       | 0.0229    | 0.9771            | 0.0054    | 0.9900   | 0.9918    | 0.9932
                     | HyDL-IDS         | 0.9955       | 0.0118    | 0.9882            | 0.0045    | 0.9935   | 0.9958    | 0.9956
HCRL dataset         | ADDM             | 1.0000       | 3 × 10^-5 | 0.9999            | 0         | 0.9999   | 0.9999    | 0.9999
                     | LSTM-based model | 0.9999       | 4 × 10^-5 | 0.9999            | 2 × 10^-6 | 0.9999   | 0.9999    | 0.9999
                     | HyDL-IDS         | 0.9999       | 0.0003    | 0.9997            | 4 × 10^-5 | 0.9999   | 0.9999    | 0.9999
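The metrics reported in Table 2 follow directly from binary confusion-matrix counts. A minimal sketch of their derivation (our own illustrative helper, not code from the paper):

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute the Table 2 evaluation metrics from the confusion-matrix
    counts of a binary (normal vs. anomalous) detector."""
    tpr = tp / (tp + fn)                        # recall / sensitivity
    fpr = fp / (fp + tn)
    tnr = tn / (tn + fp)                        # specificity
    fnr = fn / (fn + tp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)
    f1 = 2 * precision * tpr / (precision + tpr)
    return {"TPR": tpr, "FPR": fpr, "TNR": tnr, "FNR": fnr,
            "Accuracy": accuracy, "Precision": precision, "F1": f1}

# Toy counts for illustration only (not taken from the paper's datasets).
m = binary_metrics(tp=90, fp=10, tn=90, fn=10)
```

Note that TPR + FNR = 1 and TNR + FPR = 1, which is why Table 2 reports near-zero FPR/FNR alongside near-unity TPR/TNR.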
Table 3. Evaluation metrics of ACDM, the LSTM-based model, and HyDL-IDS on the different datasets.

Dataset              | Type of Attack | Model            | Accuracy | Precision | Recall | Specificity | F1 Score
Real-vehicle dataset | Replay         | ACDM             | 0.9994   | 0.9939    | 0.9953 | 0.9996      | 0.9946
                     |                | LSTM-based model | 0.9850   | 0.9447    | 0.7916 | 0.9971      | 0.8614
                     |                | HyDL-IDS         | 0.9893   | 0.9659    | 0.8472 | 0.9981      | 0.9027
                     | DoS            | ACDM             | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | LSTM-based model | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | HyDL-IDS         | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     | Fuzzing        | ACDM             | 0.9987   | 0.9795    | 0.9974 | 0.9987      | 0.9884
                     |                | LSTM-based model | 0.9966   | 0.9570    | 0.9851 | 0.9973      | 0.9709
                     |                | HyDL-IDS         | 0.9971   | 0.9614    | 0.9897 | 0.9976      | 0.9754
                     | Spoofing       | ACDM             | 0.9988   | 0.9935    | 0.9629 | 0.9998      | 0.9780
                     |                | LSTM-based model | 0.9964   | 0.9688    | 0.9041 | 0.9991      | 0.9353
                     |                | HyDL-IDS         | 0.9973   | 0.9765    | 0.9269 | 0.9993      | 0.9511
                     | Scaling        | ACDM             | 0.9996   | 0.9939    | 0.9917 | 0.9998      | 0.9928
                     |                | LSTM-based model | 0.9991   | 0.9793    | 0.9906 | 0.9994      | 0.9849
                     |                | HyDL-IDS         | 0.9991   | 0.9824    | 0.9884 | 0.9995      | 0.9854
                     | Ramp           | ACDM             | 0.9998   | 0.9995    | 0.9954 | 0.9999      | 0.9975
                     |                | LSTM-based model | 0.9993   | 0.9888    | 0.9898 | 0.9996      | 0.9893
                     |                | HyDL-IDS         | 0.9994   | 0.9954    | 0.9863 | 0.9998      | 0.9908
HCRL dataset         | Flooding       | ACDM             | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | LSTM-based model | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | HyDL-IDS         | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     | Fuzzing        | ACDM             | 0.9999   | 0.9999    | 1.0000 | 0.9999      | 0.9999
                     |                | LSTM-based model | 0.9999   | 0.9996    | 0.9997 | 0.9999      | 0.9997
                     |                | HyDL-IDS         | 0.9999   | 0.9998    | 0.9989 | 0.9999      | 0.9993
                     | Malfunction    | ACDM             | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | LSTM-based model | 1.0000   | 1.0000    | 1.0000 | 1.0000      | 1.0000
                     |                | HyDL-IDS         | 0.9999   | 0.9999    | 0.9999 | 0.9999      | 0.9999
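The per-attack metrics in Table 3 are one-vs-rest statistics derived from a multiclass confusion matrix: for each attack type, that class is treated as positive and all other classes as negative. A minimal sketch of this derivation (our own illustrative helper and toy matrix, not taken from the paper):

```python
def one_vs_rest_metrics(cm, cls):
    """Per-class metrics for a multiclass confusion matrix `cm`
    (rows = true class, columns = predicted class), treating class
    `cls` as positive and every other class as negative."""
    n = len(cm)
    tp = cm[cls][cls]
    fn = sum(cm[cls][j] for j in range(n) if j != cls)
    fp = sum(cm[i][cls] for i in range(n) if i != cls)
    tn = sum(cm[i][j] for i in range(n) for j in range(n)
             if i != cls and j != cls)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, specificity, f1

# Toy 3-class confusion matrix for illustration only.
cm = [[8, 1, 1],
      [0, 9, 1],
      [1, 0, 9]]
acc, prec, rec, spec, f1 = one_vs_rest_metrics(cm, 0)
```

Because every class is scored against the pooled remainder, a model can show near-perfect specificity for a rare attack type while its recall for that class, such as Replay in Table 3, remains noticeably lower.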
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

Gao, F.; Liu, J.; Liu, Y.; Gao, Z.; Zhao, R. Multi-Attack Intrusion Detection for In-Vehicle CAN-FD Messages. Sensors 2024, 24, 3461. https://doi.org/10.3390/s24113461

