Empowering Advanced Driver-Assistance Systems from Topological Data Analysis

Frahi, Tarek; Chinesta, Francisco; Falcó, Antonio; Badias, Alberto; Cueto, Elias; Choi, Hyung Yun; Han, Manyong; Duval, Jean-Louis

doi:10.3390/math9060634

Open AccessArticle

Empowering Advanced Driver-Assistance Systems from Topological Data Analysis

by

Tarek Frahi

^1,*,

Francisco Chinesta

¹,

Antonio Falcó

²

,

Alberto Badias

³,

Elias Cueto

³

,

Hyung Yun Choi

⁴,

Manyong Han

⁵ and

Jean-Louis Duval

⁶

¹

PIMM Lab, Arts et Metiers Institute of Technology, 151 boulevard de l’Hopital, 75013 Paris, France

²

Departamento de Matemáticas, Física y Ciencias Tecnológicas, Universidad Cardenal Herrera-CEU, CEU Universities, San Bartolome 55, 46115 Alfara del Patriarca, Valencia, Spain

³

I3A, Aragon Institute of Engineering Research, Universidad de Zaragoza, 50018 Zaragoza, Aragon, Spain

⁴

Department of Mechanical and System Design Engineering, Hongik University, 94 Wausanro, Mapogu, Seoul 04066, Korea

⁵

Digital Human Lab, Hongik University, 94 Wausanro, Mapogu, Seoul 04066, Korea

⁶

ESI Group, 3bis rue Saarinen, CEDEX 1, 94528 Rungis, France

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(6), 634; https://doi.org/10.3390/math9060634

Submission received: 31 January 2021 / Revised: 6 March 2021 / Accepted: 11 March 2021 / Published: 16 March 2021

(This article belongs to the Special Issue Numerical Simulation in Biomechanics and Biomedical Engineering)

Download

Browse Figures

Versions Notes

Abstract

We are interested in evaluating the state of drivers to determine whether they are attentive to the road or not by using motion sensor data collected from car driving experiments. That is, our goal is to design a predictive model that can estimate the state of drivers given the data collected from motion sensors. For that purpose, we leverage recent developments in topological data analysis (TDA) to analyze and transform the data coming from sensor time series and build a machine learning model based on the topological features extracted with the TDA. We provide some experiments showing that our model proves to be accurate in the identification of the state of the user, predicting whether they are relaxed or tense.

Keywords:

Morse theory; topological data analysis; machine learning; time series; smart driving

1. Introduction

While there have recently been considerable advances in self-driving car technology, driving still relies mainly on human factors. Even in self-driving mode, human drivers must often make decision in a fraction of a second to avoid accidents. Therefore, it is still of utmost importance to develop systems capable of discerning if the human driver is attentive or not to the road conditions. In general, the so-called advanced driver assistance systems (ADAS) [1,2] are systems that are able to improve the driver’s performance, among which, adaptive speed limiters, pedestrian detectors [3], and cruise controllers are some of the most popular systems. Fatigue alerting systems are among the most useful among ADAS systems, and the aim of this work is to contribute to the development of such a system based on a systematic analysis of drivers in actual driving conditions.

The estimation of the driver’s condition (degree of attention to the road, fatigue, etc.) is a very important factor to ensure safety in driving [4,5]. A recent review on the topic can be found in [6]. The goal of this work is to extract behavior patterns from car user data to be able to accurately estimate their state. We used data obtained by the laboratory of prof. Hyung Yun Choi at Hongik University in Seoul. His experiment involved the application of mechanical stimulation to people seated in an automobile.

Our main goal is to extract patterns of behavior from experimental data so as to allow us to learn the most relevant factors affecting driver’s attention to the situation of the road.

In the present work, we combine some tools from Morse theory [7] and topological data analysis (TDA) with all of the associated concepts and methods (e.g., Betti numbers, homology persistence, barcodes, persistence images, etc.) [8], most of them introduced and employed later in order to analyze and classify the experimental data. This allows us to introduce concepts as barcodes, that is, persistent and life-time diagrams in a similar way to how they are used in persistent homology. Our main goal is to predict car user behavior following a supervised approach [9]. Instead of considering an original sensor signal as the quantity of interest, we focus on its topological features. In this sense, the framework proposed in this paper allows us to unveil the true dimensionality of data or, in other words, the actual number of factors affecting driver’s performance. Thus, we model a sensor signal as a dynamical system, and, therefore, our approach seems to be better at describing its properties, or rather its variations, such as extrema, patterns, and self-similarity, than other approaches. We note that our approach is, in some senses, similar to that followed by Milnor and Thurston [10] in the study of the combinatorial properties of dynamical systems by combining tools from automata theory.

The structure of the paper is as follows: In Section 2, we describe the material and methods employed in this work. Particular attention is paid to the process of data acquisition and the description of time series and data curation. In Section 3, we present the main results of this work, and we discuss the main consequences in Section 4. As a complement, in Appendix A, we thoroughly illustrate the process of computing persistence images for the data of interest.

2. Material and Methods

In this section, we describe the collection and preprocessing of the experimental data. In Section 2.1, we describe the data acquisition, and in Section 2.2, we provide a description of the time series. Section 2.3 is devoted to data preprocessing. The mathematical tools used to describe the times series at a topological level are explained in Section 2.4. Finally, the image classification methodology is given in Section 2.5.

2.1. Data Acquisition

Our proposed predictor directly uses the data collected from the experiments. The data acquisition process involves measuring the response of human behavior when an excitation is applied to the seat. Figure 1 shows the location of the sensors in the experiments.

The excitation signal is an angular acceleration imposed on the seat of the user. This acceleration is an oscillating chirp function with a frequency range of 1 to 7.5 Hz on the X axis in rotation. The linear acceleration

a = (a_{x}, a_{y}, a_{z})

and angular velocity

ω = (ω_{x}, ω_{y}, ω_{z})

were measured in both the head and the seat by two IMU (Shimmer inertia measurement unit (IMU) sensors) at 256 Hz. By observing the floor excitation signals, we noted that the excitation is purely rotational around the X-axis—see Figure 2.

Several experiences were conducted by nine people by taking into account a set of six fixed states: driver, passenger, tense person, relaxed person, rigid seat, and SAV (sport activity vehicle seat). In particular, for each individual, eight experiments for the six available states were performed:

\{\begin{matrix} Class & Label \\ 1 & SAVRelaxedPassager \\ 2 & SAVTensePassager \\ 3 & SAVRelaxedDriver \\ 4 & SAVTenseDriver \\ 5 & RigidRelaxedPassager \\ 6 & RigidTensePassager \\ 7 & RigidRelaxedDriver \\ 8 & RigidTenseDriver \end{matrix}

As a consequence, we worked with a sample of 72 experiences, each of them encoded in a time series (as we explain later). Our goal is to classify the behavior of a generic driver, assigning one of the two states (tense or relaxed) by using the sensor data.

2.2. Time Series Description

The data acquired from sensors (see Figure 3 and Figure 4) were stored into six-dimensional time series, for both linear acceleration and angular velocity of the head movement. The sampling frequency of the data was 256 Hz, and the duration of the experiment was 34 s; hence, the resulting data dimensionality is

256 \times 34 = 8704

. For each times series, where

1 \leq t \leq 8704

, we constructed three new times series called the sliding window, embedding a length of 5800. The first one is given by the times values from

t = 1

to

t = 5800

, the second is given by the times values from

t = 1450

to

t = 7250

, and, to conclude, the third time window is defined as from

t = 2904

to

t = 8704

. Each element in the sample

(1 \leq i \leq 72)

was encoded by means of three six-dimensional time series representing each of the three sliding windows that we represent in matrix form as follows:

T S_{3 (i - 1) + 1} = [\begin{matrix} a_{x}^{ℓ} (1) & a_{x}^{ℓ} (2) & \dots & a_{x}^{ℓ} (5800) \\ a_{y}^{ℓ} (1) & a_{y}^{ℓ} (2) & \dots & a_{y}^{ℓ} (5800) \\ a_{z}^{ℓ} (1) & a_{z}^{ℓ} (2) & \dots & a_{z}^{ℓ} (5800) \\ ω_{x}^{ℓ} (1) & ω_{x}^{ℓ} (2) & \dots & ω_{x}^{ℓ} (5800) \\ ω_{y}^{ℓ} (1) & ω_{y}^{ℓ} (2) & \dots & ω_{y}^{ℓ} (5800) \\ ω_{z}^{ℓ} (1) & ω_{z}^{ℓ} (2) & \dots & ω_{z}^{ℓ} (5800) \end{matrix}], T S_{3 (i - 1) + 2} = [\begin{matrix} a_{x}^{ℓ} (1450) & a_{x}^{ℓ} (1451) & \dots & a_{x}^{ℓ} (7251) \\ a_{y}^{ℓ} (1450) & a_{y}^{ℓ} (1451) & \dots & a_{y}^{ℓ} (7251) \\ a_{z}^{ℓ} (1450) & a_{z}^{ℓ} (1451) & \dots & a_{z}^{ℓ} (7251) \\ ω_{x}^{ℓ} (1450) & ω_{x}^{ℓ} (1451) & \dots & ω_{x}^{ℓ} (7251) \\ ω_{y}^{ℓ} (1450) & ω_{y}^{ℓ} (1451) & \dots & ω_{y}^{ℓ} (7251) \\ ω_{z}^{ℓ} (1450) & ω_{z}^{ℓ} (1451) & \dots & ω_{z}^{ℓ} (7251) \end{matrix}]

and

T S_{3 i} = [\begin{matrix} a_{x}^{ℓ} (2903) & a_{x}^{ℓ} (2905) & \dots & a_{x}^{ℓ} (8704) \\ a_{y}^{ℓ} (2903) & a_{y}^{ℓ} (2905) & \dots & a_{y}^{ℓ} (8704) \\ a_{z}^{ℓ} (2903) & a_{z}^{ℓ} (2905) & \dots & a_{z}^{ℓ} (8704) \\ ω_{x}^{ℓ} (2903) & ω_{x}^{ℓ} (2905) & \dots & ω_{x}^{ℓ} (8704) \\ ω_{y}^{ℓ} (2903) & ω_{y}^{ℓ} (2905) & \dots & ω_{y}^{ℓ} (8704) \\ ω_{z}^{ℓ} (2903) & ω_{z}^{ℓ} (2905) & \dots & ω_{z}^{ℓ} (8704) \end{matrix}] .

Here, the matrices have a size of

6 \times 5800

and

1 \leq i \leq 72

. This allows us to represent the information by using a third-order tensor, namely,

Z \in R^{216 \times 6 \times 5800}

defined by

Z_{i, j, k} : = {(T S_{i})}_{j, k}

for

1 \leq i \leq 216

,

1 \leq j \leq 6

and

1 \leq k \leq 5800

. We can identify

Z_{i} = T S_{i}

for

1 \leq i \leq 216

.

2.3. Data Preprocessing

In order to obtain a single series for each observation, we concatenated all of the 6 time series (linear accelerations and angular velocities) for each observation horizontally and then created a data frame by stacking the 216 in sample observations.

The concatenation operation on the multidimensional time series collapsed the last two dimensions into one dimensional arrays with a length of

5800 \times 6 = 34,800

. The result is the two-dimensional table of concatenated time series

D = [\begin{matrix} v e c (Z_{1, :, :}) \\ \dots \\ v e c (Z_{216, :, :}) \end{matrix}] \in R^{216 \times 34800} .

We chose not to filter the signals because the topological sub-level set method should filter the high-frequency features naturally. We also chose to keep working on acceleration signals in order to avoid signal deviations after two integrations in time so as to obtain positions, the sensors not always keeping a zero mean height. Thus, the approach is completely (topologically) data-based.

The six time series

Z_{i}

of each observation were collapsed into a single concatenated time series with a size of 34,800—see Figure 5. The concatenated time series for the 216 observations were then stacked to create the dataset D with a size of

216 \times

34,800. We also used binary labels in the chained time series

Z_{i}

on the two target classes that we were interested in. In particular, we wrote

Z_{i}^{(α)}

where

α

is "0" for a relaxed driver and “1” for a tense one.

2.4. Extracting Topological Features from a Time Series

The idea to extract the topological information regarding the times series is to consider each sample observation as a piecewise linear continuous map from a closed interval to the real line. Therefore, we used a construction closely related to the Reeb graph [11] used in Morse theory to describe the times series at the topological level.

To this end, we consider the time series

x_{t}

for

0 \leq t \leq N - 1

(N \geq 3)

given by a vector

X = (x_{0}, x_{1}, \dots, x_{N - 1}) \in R^{N} .

we can view

X

as a function also denoted by

X : {0, 1, \dots, N - 1} ⟶ R

defined by

X (i) = x_{i}

for

0 \leq i \leq N - 1

. Here, to study the topological features of

X

we use the sub-level set of a piecewise-linear function

f_{X} : R ⟶ R

associated with

X

satisfying that

f_{X} (i) = X (i) = x_{i}

for

0 \leq i \leq N - 1

.

To construct this function, we consider the basis functions

{φ_{0}, \dots, φ_{N - 1}}

of continuous functions

φ_{i} : R ⟶ R

defined by

φ_{i} (s) : = \{\begin{matrix} s - i + 1 & if & i - 1 \leq s \leq i \\ i + 1 - s & if & i \leq s \leq i + 1 \\ 0 & if & s \notin] i - 1, i + 1 [ \end{matrix}

where

i = 1, \dots, N - 2

and

φ_{0} (s) : = \{\begin{matrix} 1 - s & if & 0 \leq s \leq 1 \\ 0 & if & s \in [0, 1] \end{matrix}

φ_{N - 1} (s) : = \{\begin{matrix} s - N + 2 & if & N - 2 \leq s \leq N - 1 \\ 0 & if & s \notin [N - 2, N - 1 [ \end{matrix}

This allows us to construct a piecewise continuous map

f_{X} : R ⟶ R

by

f_{X} (s) = \sum_{j = 0}^{N - 1} x_{j} φ_{j} (s),

and also to endow

R^{N}

with a norm given by

∥ X ∥ : = ∥ f_{X} ∥_{L^{2} (R)} = {(\int_{- \infty}^{\infty} {| f_{X} (s) |}^{2} d s)}^{1 / 2} .

In particular, we prove the following result, which helps us to identify the time series given by the vector

X

in

R^{N}

with the function

f_{X}

in

L^{2} (R)

.

Proposition 1.

The linear map

Φ : (R^{N}, ∥ \cdot ∥) ⟶ (L^{2} {(R), ∥ \cdot ∥}_{L^{2} (R)})

given by

Φ (X) = f_{X}

is an injective isometry between Hilbert spaces. Furthermore,

Φ (R^{N})

is a closed subspace in

L^{2} (R^{N})

.

Proof.

The map is clearly isometric and injective because

{φ_{0}, \dots, φ_{N - 1}}

is a set of linear independent functions in

L^{2} (R)

. □

Here, we describe the maps

f_{X} \in Φ (R^{N})

at the combinatorial level using the connected components (intervals) associated with its

λ

sub-level sets

{LS}_{λ} (f_{X}) : = {x \in [0, N - 1] : f_{X} (x) \leq λ}

for

λ \in R

. For this purpose, we introduce the following distinguished objects related to the

supp (f_{X}) = [0, N - 1] \subset R

of

f_{X}

:

The nodes or vertices denoted by

$V : = {[0], [1], \dots, [N - 1]}$

that represent the components of the vector $X,$ ;
The faces denoted by

$F : = {[0, 1] [1, 2], \dots, [N - 2, N - 1]}$

that represent the intervals used to construct the connected components of the sub-level sets of the map $f_{X}$ . Recall that we consider

$[i, i + 1] : = {z \in R : z = μ x_{i + 1} + (1 - μ) x_{i}, 0 \leq μ \leq 1} \subset R .$

Let

λ_{max} = max_{s \in [0, N - 1]} f_{X} (s) = max_{0 \leq i \leq N - 1} X (i),

and

λ_{min} = min_{s \in [0, N - 1]} f_{X} (s) = min_{0 \leq i \leq N - 1} X (i) .

For each

λ_{min} \leq λ \leq λ_{max}

, we introduce the following symbolic

λ

sub-level set for the map

f_{X}

:

L S_{λ} (f_{X}) : = \{σ \in F : f (σ) \leq λ\}

For

λ_{min} \leq λ \leq λ^{'} \leq λ_{max}

, it holds

L S_{λ} (f_{X}) \subset L S_{λ^{'}} (f_{X}) .

Our next goal was to quantify the evolution of the above symbolic

λ

sub-level with. To this end, we introduce the notion of feature associated with the

λ

sub-level set

L S_{λ} (f_{X})

.

We define the set of features for functions in

Φ (R^{N})

as

F (Φ (R^{N})) : = \{[i, j] \subset R : 0 \leq i < j \leq N - 1\} .

We note that

L S_{λ} (f_{X}) \subset F \subset F (Φ (R^{N}))

. Then next definition introduces the notion of features for a symbolic

λ

sub-level set as the interval of

F (Φ (R^{N}))

constructed by a maximal union of faces of

L S_{λ} (f_{X})

.

Definition 1.

We suggest that

I \in F (Φ (R^{N}))

is a feature for the symbolic λ sub-level set

L S_{λ} (f_{X})

if there exists

I_{1}, \dots, I_{k} \in L S_{λ} (f_{X})

such that

I = ⋃_{j = 1}^{k} I_{k}

and for every

J \in L S_{λ} (f_{X})

such that

J \neq I_{i}

for

1 \leq i \leq k

it holds that

I \cap J = \emptyset

. We denote by

F (L S_{λ} (f_{X}))

the set of features for the λ sub-level set

L S_{λ} (f_{X})

.

A feature for a

λ

sub-level set

L S_{λ} (f_{X})

is the maximal interval of

F (Φ (R^{N}))

that we can construct by unions of intervals in

L S_{λ} (f_{X})

. To illustrate this definition, we give the following example:

Example 1.

Let us consider the time series

X = (11, 14, 9, 7, 9, 7, 8, 10, 9) .

This allows us to construct the map

f_{X}

as shown in Figure 6. Then,

λ_{m i n} = 7

and

λ_{max} = 14

, and we have the following symbolic λ sub-level sets.

\begin{matrix} L S_{λ = 7} (f_{X}) & = \emptyset \\ L S_{λ = 8} (f_{X}) & = L S_{λ = 7} (f_{X}) \cup {[5, 6]} \\ L S_{λ = 9} (f_{X}) & = L S_{λ = 8} (f_{X}) \cup {[2, 3], [3, 4], [4, 5]} \\ L S_{λ = 10} (f_{X}) & = L S_{λ = 9} (f_{X}) \cup {[6, 7], [7, 8]} \\ L S_{λ = 11} (f_{X}) & = L S_{λ = 10} (f_{X}) \\ L S_{λ = 12} (f_{X}) & = L S_{λ = 11} (f_{X}) \\ L S_{λ = 13} (f_{X}) & = L S_{λ = 11} (f_{X}) \\ L S_{λ = 14} (f_{X}) & = L S_{λ = 11} (f_{X}) \cup {[0, 1]} . \end{matrix}

This allows us to compute the available features for each λ-value:

	$λ = 7$	$λ = 8$	$λ = 9$	$λ = 10$	$λ = 11$	$λ = 12$	$λ = 13$	$λ = 14$
$F (L S_{λ} (f_{X}))$	∅	$[5, 6]$	$[2, 6]$	$[2, 8]$	$[2, 8]$	$[2, 8]$	$[2, 8]$	$[0, 8]$

Let

F (f_{X})

be the whole set of features for

f_{X}

, that is,

F (f_{X}) = \{I : I \in F (L S_{λ} (f_{X})) for some λ_{min} \leq λ \leq λ_{max}\} .

Example 2.

From Example 1, we obtain

F (f_{X}) = {[5, 6], [2, 6], [2, 8], [0, 8]} .

We can represent the map

λ \mapsto L S_{λ} (f_{X})

from

[λ_{min}, λ_{max}]

to

F (f_{X})

as shown in Figure 7.

Let

I \in F (f_{X})

; in order to quantify the persistence of this particular feature for the map

f_{X}

, we use the map

λ \mapsto L S_{λ} (f_{X})

from

[λ_{min}, λ_{max}]

to

F (f_{X})

. To this end, we introduce the following definition: the birth point of the feature

I

is defined by

a (I) = inf \{λ : I \in F (L S_{λ} (f_{X}))\}

and the corresponding death point by

b (I) = sup \{λ : I \in F (L S_{λ} (f_{X}))\} .

In particular, we note that

a ([0, N - 1]) = λ_{max}

(see Figure 7). Since

a (I) \leq b (I) < \infty

holds for all

I \in F (f_{X}),

I \neq [0, N - 1],

we call the finite interval

[a (I), b (I)]

the barcode of the feature

I \in F (f_{X}) \ {[0, N - 1]}

.

Example 3.

From Example 1 we consider the features

[5, 6] \in L S_{λ = 8} (f_{X})

,

[2, 6] \in L S_{λ = 9} (f_{X})

, and

[2, 8] \in L S_{λ = 10} (f_{X})

. Then, the feature

[5, 6]

has its birth point at

a ([5, 6]) = 8

and its death point at

b ([5, 6]) = 9

; the feature

[2, 6]

has its birth point at

a ([2, 6]) = 9

and its death point at

b ([2, 6]) = 10

. Finally, the feature

[2, 8]

has its birth point at

a ([2, 8]) = 10

and its death point at

b ([2, 8]) = 14

. As a consequence, the set

B (f_{X}) : = {([5, 6]; 8, 9), ([2, 6]; 9, 10), ([2, 8]; 10, 14)}

of features and its corresponding barcodes contain the relevant information of the shape of

f_{X}

(see Figure 7).

Thus, we define the set of barcodes for

f_{X}

by

B (f_{X}) = \{(I; a (I), b (I)) : I \in F (f_{X}) \ {[0, N - 1]}\}

and its persistence diagram as

PD (f_{X}) : = \{(a (I), b (I)) \in R^{2} : I \in F (f_{X}) \ {[0, N - 1]}\}

(see Figure 8). An equivalent representation of the persistence diagram is the life-time diagram for

f_{X}

, which is constructed by means of a bijective transformation

T (a, b) = (a, b - a)

, acting over

PD (f_{X})

, that is,

LT (f_{X}) : = \{(a (I), b (I) - a (I)) \in R^{2} : I \in F (f_{X})) \ {[0, N - 1]}\};

see Figure 9.

In order to determine the grade of similarity between two barcodes from two different time series, we need to set a similarity metric. To this end, we construct the persistent image for

f_{X}

as follows: we observe that

LT (f_{X})

is a finite set of points, namely,

LT (f_{X}) = {(a_{1}, b_{1} - a_{1}), \dots, (a_{k}, b_{k} - a_{k})}

for some natural numbers

k \geq 1

and such that

b_{1} - a_{1} \leq b_{2} - a_{2} \dots \leq b_{k} - a_{k}

. Then, we consider a non-negative weighting function

w : LT (f_{X}) ⟶ [0, 1]

given by

w (a_{i}, b_{i} - a_{i}) = \frac{b_{i} - a_{i}}{b_{k} - a_{k}} for 1 \leq i \leq k

.

Finally, we fix M, a natural number, and take a bivariate normal distribution

g_{u} (x, y)

centered at each point

u \in LT (f_{X})

and with its variance

σ i d = \frac{1}{M} {max}_{1 \leq i \leq k} (b_{i} - a_{i}) i d,

where

i d

is the

2 \times 2

identity matrix. A persistence kernel is then defined by means of a function

ρ_{X} : R^{2} \to R

, where

ρ_{X} (x, y) = \sum_{u \in LT (f_{X})} w (u) g_{u} (x, y) .

(1)

We associate with each

X \in R

a matrix in

R^{M \times M}

as follows: let

ε > 0

be a non-negative real number that is sufficiently small, and then consider a square region

Ω_{X, ε} = [α, β] \times [α^{*}, β^{*}] \subset R^{2}

, covering the support of

ρ_{X} (x, y)

(up to a certain precision), such that

{\int \int}_{Ω_{X, ε}} ρ_{X} (x, y) d x d y \geq 1 - ε

holds. Next, we consider two equispaced partitions of the intervals

α = p_{0} \leq p_{1} \dots \leq p_{M} = β and α^{*} = p_{0}^{*} \leq p_{1}^{*} \dots \leq p_{M}^{*} = β^{*} .

Now, we put

Ω_{X, ε} = ⋃_{i = 0}^{M - 1} ⋃_{j = 0}^{M - 1} [p_{i}, p_{i + 1}] \times [p_{j}^{*}, p_{j + 1}^{*}] = ⋃_{i = 0}^{M - 1} ⋃_{j = 0}^{M - 1} P_{i, j}

The persistence image of

X

associated with the partition

P = {P_{i, j}}

is then described by the matrix given by the following equation:

P I (X, M, P, ε) = {({\int \int}_{P_{i, j}} ρ_{X} (x, y) d x d y)}_{i = 0, j = 0}^{i = M - 1, j = M - 1} \in R^{M \times M} .

(2)

2.5. Classification

Image classification is a procedure that is used to automatically categorize images into classes by assigning to each image a label representative of its class. A supervised classification algorithm requires a training sample for each class, that is, a collection of data points whose class of interest is known. Labels are assigned to each class of interest. The classification problem applied to a new observation is thus based on how close a new point is to each training sample. The Euclidean distance is the most common distance metric used in low-dimensional datasets. The training samples are representative of the known classes of interest to the analyst. In order to classify the persistence diagrams, we can use any state-of-the-art technique. In our case, we considered the random forest classification.

Recall that we conducted 9 different experiments, with 24 samples associated with each one of them corresponding to 3 samples for each of the different experimental conditions: relaxed rigid driver, relaxed rigid passenger, relaxed SAV driver, relaxed SAV passenger, tense rigid driver, tense rigid passenger, tense SAV driver, and tense SAV passenger. Their respective labels are

{0, 0, 0, 0, 1, 1, 1, 1}

. Therefore, we designed the following training validation process: The model is trained over 144 samples and evaluated over the remaining unseen 72 experiments (two-to-one training-to-testing ratio). The split between training and sampling is achieved using random shuffling and stratification to ensure balance between the classes. In order to improve the evaluation of the model generalizability, we also performed a cross-validation procedure following a leave-one-out strategy, consisting of iteratively training over the full dataset except one sample that was left out and used to test and score the model. We used the accuracy metric to evaluate the classification model. We can represent the performance of the model using the so-called confusion matrix: a 2D entries table where elements account for the number of samples in each category, with the first axis representing the true labels and the second axis the predicted labels. We also computed the different classification metrics to obtain a more detailed reporting of the model performances.

3. Results

The trained random forest classifier model for the persistence images has a notably high accuracy score on the training dataset (144) for both approaches and high accuracy for the testing dataset (72 samples). This suggests strong differentiation of the images with the respect to their generating signals, see Figure 10. The scores on the training and testing are 93 and 83%, respectively. The leave-one-out cross-validation achieved a score of 81%, indicating a good variance–bias trade-off and good generalization potential of the model.

4. Discussion

The combination of Morse theory and topological data analysis allows us to extract information from real data without the need for smoothness or regularity assumption on the time series. In our case, input data for each experiment were reduced from six-sensor time series of measurements to one single image containing the persistent pattern for attention to the road. Using the obtained persistence images as the new inputs, supervised learning proved to successfully predict the attention state of the driver or passenger.

The procedure used and described in this paper does not involve any additional pre-processing of the sensor data; is robust to noise and degraded signals; and supports large quantities of data, which makes it efficient and scalable.

It is important to highlight the fact that while the proposed methodology based on the TDA (successfully applied in large datasets [9]) seems general and powerful and it was able to extract the main data features, the validity of the driver behaviors observed in the analyzed dataset should be carefully checked due to the overly reduced dataset employed (limited to nine individuals) that does not allow for the full validation of prediction robustness.

Author Contributions

Conceptualization, F.C., H.Y.C. and M.H.; data curation, T.F., A.B. and M.H.; formal analysis, F.C., A.F., A.B., E.C., H.Y.C. and J.-L.D.; funding acquisition, F.C.; investigation, T.F., A.F. and M.H.; methodology, F.C., A.F., A.B. and H.Y.C.; software, A.B.; supervision, F.C., E.C. and H.Y.C.; writing—original draft, T.F.; writing—review editing, A.B. and E.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ESI Group, Contract 2019-0060 with the University of Zaragoza.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to the research presents no risk of harm to subjects and only collects non-personalized anonymized data.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data are available under request.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

We can illustrate the process of computing the persistence diagrams, the lifetime diagrams, and the persistence images for the driver time series for each experimental setup:

Relaxed driver with SAV seat;
Relaxed driver with rigid seat;
Relaxed passenger with SAV seat;
Relaxed passenger with rigid seat;
Tense driver with SAV seat;
Tense driver with rigid seat;
Tense passenger with SAV seat;
Tense passenger with rigid seat.

Figure A1. Relaxed driver with SAV seat.

Figure A2. Relaxed driver with rigid seat.

Figure A3. Relaxed passenger with SAV seat.

Figure A4. Relaxed passenger with rigid seat.

Figure A5. Tense driver with SAV seat.

Figure A6. Tense driver with rigid seat.

Figure A7. Tense passenger with SAV seat.

Figure A8. Tense passenger with rigid seat.

Appendix B

To better evaluate a classification model, we are interested in quantities that express how often a sample is correctly or wrongly labelled into a particular class over all the samples and all the classes:

A $T r u e p o s i t i v e$ (TP): the correct prediction of a sample into a class;
A $T r u e n e g a t i v e$ (TN): the correct prediction of a sample out of a class;
A $F a l s e p o s i t i v e$ (FP): the incorrect prediction of a sample into a class;
A $F a l s e n e g a t i v e$ (FN): the incorrect prediction of a sample out of class.

Therefore, we can examine in more detail the classification model performance using the following metrics:

The $p r e c i s i o n$ P is the number of correct positive results divided by the number of all positive results.

$P = \frac{T P}{T P + F P}$

(A1)
The $r e c a l l$ R is the number of correct positive results divided by the number of all relevant samples.

$R = \frac{T P}{T P + F N}$

(A2)
The F-1 score is the harmonic mean of precision and recall.

$F 1 = 2 \times \frac{P \times R}{P + R}$

(A3)
The $a c c u r a c y$ A is the number of correct predictions over the number of all samples.

$A = \frac{T P + T N}{T P + T N + F P + F N}$

(A4)

We can summarize the presented metrics for our model in the following two reports:

Figure A9. Classification report.

References

Paul, A.; Chauhan, R.; Srivastava, R.; Baruah, M. Advanced Driver Assistance Systems (No. 2016-28-0223); SAE Technical Paper; SAE International: Warrendale, PA, USA, 2016. [Google Scholar] [CrossRef]
Shaout, A.; Colella, D.; Awad, S.S. Advanced Driver Assistance Systems-Past, Present and Future. In Proceedings of the 2011 Seventh International Computer Engineering Conference (ICENCO’2011), Cairo, Egypt, 27–28 December 2011; pp. 72–82. [Google Scholar]
Geronimo, D.; Lopez, A.M.; Sappa, A.D.; Graf, T. Survey of pedestrian detection for advanced driver assistance systems. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 32, 1239–1258. [Google Scholar] [CrossRef]
Lindgren, A.; Chen, F. State of the art analysis: An overview of advanced driver assistance systems (adas) and possible human factors issues. Hum. Factors Econ. Asp. Saf. 2006, 38, 50. [Google Scholar]
Izquierdo-Reyes, J.; Ramirez-Mendoza, R.A.; Bustamante-Bello, M.R. A study of the effects of advanced driver assistance systems alerts on driver performance. Int. J. Interact. Des. Manuf. 2018, 12, 263–272. [Google Scholar] [CrossRef]
Sikander, G.; Anwar, S. Driver fatigue detection systems: A review. IEEE Trans. Intell. Transp. Syst. 2018, 20, 2339–2352. [Google Scholar] [CrossRef]
Milnor, J. Morse Theory; Princeton University Press: Princeton, NJ, USA, 1963. [Google Scholar]
Epstein, C.; Carlsson, G.; Edelsbrunner, H. Topological data analysis. Inverse Probl. 2011, 27, 120201. [Google Scholar]
Frahi, T.; Argerich, C.; Youn, M.; Chinesta, F.; Falco, A. Tape surfaces characterization with persistence images. AIMS Mater. Sci. 2020, 7, 364–380. [Google Scholar] [CrossRef]
Milnor, J.W.; Thurston, W. On iterated maps of the interval, Dynamical systems (College Park, MD, 1986–87). In Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 1988; pp. 465–563. [Google Scholar]
Reeb, G. Sur les points singuliers d’une forme de Pfaff complètement intégrable ou d’une fonction numérique. C. R. Acad. Sci. Paris 1946, 222, 847–849. [Google Scholar]

Figure 1. Scheme of the data acquisition process showing the location of the sensors.

Figure 2. Floor excitation: X-axis angular velocity time series.

Figure 3. Sensor data: linear acceleration time series.

Figure 4. Sensor data: angular velocity time series.

Figure 5. Tensor reduction of a sensor time series.

Figure 6. The map

f_{X}

for

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 6. The map

f_{X}

for

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 7. The map

λ \mapsto L S_{λ} (f_{X})

for

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 7. The map

λ \mapsto L S_{λ} (f_{X})

for

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 8. Persistence diagram for the map

f_{X}

when

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 8. Persistence diagram for the map

f_{X}

when

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 9. Life-time diagram for the map

f_{X}

when

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 9. Life-time diagram for the map

f_{X}

when

X = (11, 14, 9, 7, 9, 7, 8, 10, 9)

.

Figure 10. Model performance for prediciting the attention state.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Frahi, T.; Chinesta, F.; Falcó, A.; Badias, A.; Cueto, E.; Choi, H.Y.; Han, M.; Duval, J.-L. Empowering Advanced Driver-Assistance Systems from Topological Data Analysis. Mathematics 2021, 9, 634. https://doi.org/10.3390/math9060634

AMA Style

Frahi T, Chinesta F, Falcó A, Badias A, Cueto E, Choi HY, Han M, Duval J-L. Empowering Advanced Driver-Assistance Systems from Topological Data Analysis. Mathematics. 2021; 9(6):634. https://doi.org/10.3390/math9060634

Chicago/Turabian Style

Frahi, Tarek, Francisco Chinesta, Antonio Falcó, Alberto Badias, Elias Cueto, Hyung Yun Choi, Manyong Han, and Jean-Louis Duval. 2021. "Empowering Advanced Driver-Assistance Systems from Topological Data Analysis" Mathematics 9, no. 6: 634. https://doi.org/10.3390/math9060634

APA Style

Frahi, T., Chinesta, F., Falcó, A., Badias, A., Cueto, E., Choi, H. Y., Han, M., & Duval, J.-L. (2021). Empowering Advanced Driver-Assistance Systems from Topological Data Analysis. Mathematics, 9(6), 634. https://doi.org/10.3390/math9060634

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Empowering Advanced Driver-Assistance Systems from Topological Data Analysis

Abstract

1. Introduction

2. Material and Methods

2.1. Data Acquisition

2.2. Time Series Description

2.3. Data Preprocessing

2.4. Extracting Topological Features from a Time Series

2.5. Classification

3. Results

4. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI