An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization

Jauch, Jens; Bleimund, Felix; Frey, Michael; Gauterin, Frank

doi:10.3390/math7040355

Open AccessArticle

An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization

by

Jens Jauch

,

Felix Bleimund

,

Michael Frey

^*

and

Frank Gauterin

Institute of Vehicle System Technology, Karlsruhe Institute of Technology, 76131 Karlsruhe, Germany

^*

Author to whom correspondence should be addressed.

Mathematics 2019, 7(4), 355; https://doi.org/10.3390/math7040355

Submission received: 3 February 2019 / Revised: 1 April 2019 / Accepted: 10 April 2019 / Published: 16 April 2019

(This article belongs to the Special Issue Recent Trends in Multiobjective Optimization and Optimal Control)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The B-spline function representation is commonly used for data approximation and trajectory definition, but filter-based methods for nonlinear weighted least squares (NWLS) approximation are restricted to a bounded definition range. We present an algorithm termed nonlinear recursive B-spline approximation (NRBA) for an iterative NWLS approximation of an unbounded set of data points by a B-spline function. NRBA is based on a marginalized particle filter (MPF), in which a Kalman filter (KF) solves the linear subproblem optimally while a particle filter (PF) deals with nonlinear approximation goals. NRBA can adjust the bounded definition range of the approximating B-spline function during run-time such that, regardless of the initially chosen definition range, all data points can be processed. In numerical experiments, NRBA achieves approximation results close to those of the Levenberg–Marquardt algorithm. An NWLS approximation problem is a nonlinear optimization problem. The direct trajectory optimization approach also leads to a nonlinear problem. The computational effort of most solution methods grows exponentially with the trajectory length. We demonstrate how NRBA can be applied for a multiobjective trajectory optimization for a battery electric vehicle in order to determine an energy-efficient velocity trajectory. With NRBA, the effort increases only linearly with the processed data points and the trajectory length.

Keywords:

nonlinear; recursive; iterative; B-spline; approximation; marginalized particle filter; Rao-Blackwellized particle filter; multiobjective; trajectory; optimization

1. Introduction

B-spline functions, curves, and surfaces are widely used for approximation [1,2,3] and for defining the trajectories of vehicles [4,5], robots [6,7] and industrial machines [8]. Furthermore, they are common in computer graphics [9,10] and signal processing for filter design and signal representation [11,12,13,14,15].

We address the approximation of a set of data points by a B-spline function in the nonlinear weighted least squares (NWLS) sense as well as the nonlinear optimization of a B-spline trajectory. In both cases, a Bayesian filter determines the coefficients of the B-spline function.

1.1. Nonlinear Weighted Least Squares Data Approximation

In NWLS approximation problems, the solution depends on the function coefficients in a nonlinear fashion. Based on the results of numerical experiments, Reference [16] reported that a B-spline function was beneficial in solving NWLS problems because of its piecewise polynomial character and smoothness.

In offline applications, a bounded number of data points needs to be processed and all data points are known at the same time. Therefore, the problem can be solved using a batch method. Batch methods for NWLS problems include the Gauss–Newton algorithm and the Levenberg-Marquardt (LM) algorithm. None of these algorithms is an exact method [16]. The LM algorithm solves in each iteration a linearized NWLS problem [17]. A method for separable NWLS problems, in which some parameters affect the solution linearly, is derived in References [18,19].

In contrast, in online applications such as signal processing, data points become available consecutively and their number is often unbounded. Sliding window algorithms keep the required memory constant by processing only a subset consisting of the latest data points [20]. A sliding window implementation of the LM algorithm for online applications is proposed in [21].

Recursive methods only store an already existing solution and update it with each additional data point. Therefore, they are suitable for online applications and usually require less memory and computational effort than batch algorithms that have been adapted for online applications.

NWLS approximation problems are nonlinear optimization problems. Therefore, recursive algorithms for NWLS problems can be based on nonlinear Bayesian filters.

1.2. Trajectory Optimization

Many driver assistance systems calculate a desired vehicle movement, also denoted trajectory, by solving a multiobjective optimization problem with respect to target criteria such as comfort, safety, energy consumption, and travel time. The trajectory optimization methods can be divided into Dynamic Programming (DP), direct methods (DM), and indirect methods (IM).

DP is based on Bellmann’s principle of optimality and determines globally optimal solutions. Its computational effort grows linearly with the temporal length of the trajectory and exponentially with the dimensions of the optimization problem. An adaptive cruise control based on DP is proposed in Reference [22]. DP-based algorithms for energy-efficient automated vehicle longitudinal control exist for vehicles with an internal combustion engine [23], hybrid electric vehicles [24], and plug-in hybrid electric vehicles [25]. In vehicles with a conventional powertrain, one dimension of the optimization problem refers to the selected gear. In case of a vehicle with a hybrid powertrain, there is at least one additional dimension for the operating mode, i.e., how power flows between the combustion engine, electric motor, and wheels. These degrees of freedom come along with various constraints, and frequently, the optimization problem needs to be simplified such that it can be solved in real-time.

DM lead to an optimization problem, in which the optimization variables are the parameters of a functional trajectory representation. The problem is usually nonlinear and solved using sequential quadratic programming methods or interior point methods. An example for a DM is the model predictive control, which solves the trajectory optimization problem on a receding horizon. DM are locally optimal, and their computational effort grows polynomially with the dimensions but mostly exponentially with the temporal trajectory length. Therefore, the optimization horizon is usually restricted to a few seconds.

IM are based on variational calculus and require solving a nonlinear equation system. They offer a polynomial complexity increase with the number of dimensions and the time horizon.

In practice, mainly the two first approaches are used and combined for solving difficult, farsighted trajectory optimization problems because of their complementary properties. Then DP provides a rough long-term reference trajectory for a DM that computes feasible trajectories within a short horizon [26,27].

1.3. Bayesian Filters

The Bayesian approach to a state estimation for dynamic systems calculates the probability density function (pdf) of the unknown system state. The required information stems partly from a system model and partly from previous measurements. The state estimation is performed by a recursive filter that alternates between a time update that predicts the state via the system model and a measurement update that corrects the estimate with the current measurement.

The Kalman filter (KF) computes an optimal state estimate for systems with linear system and measurement equations and Gaussian system and measurement noises [28]. Use cases include path planning applications [29]. However, in many scenarios, the linear Gaussian assumptions do not apply and suboptimal approximate nonlinear Bayesian filters such as the extended Kalman filter (EKF), unscented Kalman filter (UKF), or particle filter (PF) are required [30].

The EKF applies a local first order Taylor approximation to the nonlinear system and measurement functions via Jacobians in order to keep the linear state and measurement equations. The system and measurement noises are both approximated with Gaussian pdfs [28]. Although the EKF is not suitable for systems with a strong nonlinearity or non-Gaussian noise, it is still often successfully used for a nonlinear state estimation [31]. For example, an NWLS approximation via a modified EKF is presented in Reference [32].

An alternative to the approximation of the nonlinear state and measurement functions is the approximation of the pdfs. This can be done by propagating a few state samples called sigma points through the nonlinear functions. A filter that follows this approach is referred to as a sigma point Kalman filter. One of the most well-known representatives is the UKF. It uses

2 \cdot J + 1

deterministically chosen sigma points, whereby J denotes the dimensions of the system state. The pdfs are approximated as Gaussians of which the means and variances are determined from the propagated sigma points [28].

Compared to the EKF, the UKF offers at least a second-order accuracy [33] and is a derivative-free filter [28], meaning that it does not require the evaluation of Jacobians, which is often computationally expensive in the EKF [31]. Several publications report nonlinear problems in which the UKF performs better than the EKF, e.g., for a trajectory estimation [33,34]. However, if the pdf cannot be well-approximated by a Gaussian because the pdf is multimodal or has a strong skew, the UKF will also not perform well. Under such conditions, sequential Monte Carlo methods like the PF outperform Gaussian filters like EKF and UKF [30].

The PF approximates the pdf by a large set of randomly chosen state samples called particles. The state estimate is a weighted average of the particles. With increasing number of particles, the pdf approximation by the particles becomes equivalent to the functional pdf representation and the estimate converges against the optimal estimate [30]. For nonlinear and non-Gaussian systems, the PF allows the determination of various statistical moments, whereas EKF and UKF are limited to the approximation of the first two moments [31]. However, the number of particles that is needed for a sufficient approximation of the pdf increases exponentially with the state dimension [35]. The PF has been applied to the optimization [36] and prediction [37] of trajectories successfully as well.

Many use cases involve a mixed linear/nonlinear system. Typically, there are few nonlinear state dimensions and comparatively many linear Gaussian state dimensions. The marginalized particle filter (MPF) is beneficial for such problems as it combines KF and PF. The PF is only applied to the nonlinear states because the linear part of the state vector is marginalized out and optimally filtered with the KF. This approach is known as Rao–Blackwellization and can be described as an optimal Gaussian mixture approximation. Therefore, the MPF is also called a Rao–Blackwellized particle filter or mixture Kalman filter. Marginalizing out linear states from the PF strongly reduces the computational effort because less particles suffice and often enables real-time applications. Simultaneously, the estimation accuracy usually increases [31,38].

In the recent past, several publications have proposed approaches for localization [39,40] and trajectory tracking [38,41] that are based on the MPF because of its advantages for mixed linear/nonlinear systems. Automotive use cases include a road target tracking application, of which the multimodality requires using a PF or MPF [42]. The MPF is chosen as it allows a reduction in the number of particles for less computational effort. Similarly, Reference [35] presents a MPF application for lane tracking, in which the achieved particle reduction compared to a pure PF enables the execution of the algorithm in real-time in an embedded system.

1.4. Contribution

By definition, a B-spline function with a bounded number of coefficients has a bounded definition range. Usually, approximation algorithms require a bounded number of coefficients which restricts the approximation of data points with a B-spline function to a bounded interval that needs to be determined in advance.

In online applications, the required B-spline function definition range might not be precisely known or vary. This can result in the issuse of unprocessable data points outside the selected definition range.

In Reference [43], we presented the recursive B-spline approximation (RBA) algorithm, which iteratively approximates an unbounded set of data points in the linear weighted least squares (WLS) sense with a B-spline function using a KF. A novel shift operation enables an adaptation of the definition range during run-time such that the latest data point can always be approximated.

However, recursive NWLS B-spline approximation methods are still restricted to a constant approximation interval. We contribute to closing this research gap by proposing and investigating an algorithm termed nonlinear recursive B-spline approximation (NRBA) for the case of NWLS approximation problems.

NRBA comprises an MPF that addresses nonlinear target criteria with its PF while it determines the optimal solution for linear target criteria with a KF [44]. The target criteria that refer to the value of the B-spline function or its derivatives directly are linear criteria. Hereby, the benefit of using MPF is that it can deal with strong nonlinearities, that its computational effort can be adapted by changing the number of particles in order to meet computation time constraints, and that it accepts the known measurement matrix for linear target critera as an input, whereas other nonlinear filters estimate the relationship between measurements and function coefficients.

In automotive applications, the exponential growth of the computational effort with an increasing time horizon limits the application of DM to short time horizons. Hence, the research gap regarding trajectory optimization consists of available DM with a lower complexity. Compared to conventional and hybrid vehicles, the powertrain of a battery electric vehicle (BEV) often only has a constant gear ratio which enables savings in computational effort.

Since the NWLS approximation problem that NRBA solves is an unconstrained nonlinear optimization problem, NRBA can be applied for multiobjective trajectory optimization. Our contribution regarding trajectory optimization is an iterative local direct optimization method for B-spline trajectories of which the computational effort only grows linearly with the time horizon instead of exponentially. Due to the iterative nature of NRBA, the optimization can be paused, and if computation time is available, the temporal length of the trajectory can be extended by calculating additional coefficients.

1.5. Structure of the Data Set

Analogous to Reference [43], we consider the data point sequence

{(s_{t}, y_{t})}_{t = 1, 2, \dots, n}

. The index t indicates the time step at which the data point

(s_{t}, y_{t})

becomes available.

s_{t}

denotes the value of the independent variable s at t. The vector

y_{t} = {(y_{t, 1}, y_{t, 2}, \dots, y_{t, v}, \dots, y_{t, V_{t}})}^{⊤}

summarizes

V_{t}

scalar measurements y. The superscript

^{⊤}

indicates transposed quantities.

V_{t} \in N

can vary with t, but we suppose that

V_{t} ≪ n \forall t

. The vector

y

comprises all measurements and is given by

y = {(\underset{= : y_{1}^{⊤}}{\underset{︸}{y_{1, 1}, \dots, y_{1, V_{1}}}}, \dots, y_{t}^{⊤}, \dots, \underset{= : y_{n}^{⊤}}{\underset{︸}{y_{n, 1}, \dots, y_{n, V_{n}}}})}^{⊤}

(1)

1.6. Outline

Section 2.1 states the used B-spline function definition. In Section 2.2, we specify the MPF and the chosen state-space model. Section 2.3 proposes the NRBA algorithm for an NWLS approximation. The numerical experiments in Section 3 investigate the capabilities of NRBA compared to the LM algorithm as well as the influences of the NRBA parameters on the result and convergence before we demonstrate how NRBA can be applied for a multiobjective trajectory optimization in Section 4. In Section 5, we recapitulate the features of NRBA and conclude.

2. Methods

2.1. B-Spline Function Representation

The value of a B-spline function results from the weighted sum of J polynomial basis functions called B-splines. All B-splines possess the same degree d. The B-splines are defined by d, and the knot vector

κ = (κ_{1}, κ_{2}, \dots, κ_{J + d + 1})

. We suppose that the values of the knots

κ

grow strictly monotonously (

κ_{k} < κ_{k + 1}, k = 1, 2, \dots, J + d

).

μ

with

d + 1 \leq μ \leq J

is the spline interval index and

[κ_{μ}, κ_{μ + 1})

is the corresponding spline interval of the B-spline function.

In the jth B-spline

b_{j} (s), j = 1, 2, \dots, J

is positive for

s \in (κ_{j}, κ_{j + d + 1})

and diminishes everywhere else. This feature is referred to as local support and causes the B-spline function to be piecewise defined for each spline interval. For

s \in [κ_{μ}, κ_{μ + 1})

, only the B-splines

b_{j} (s), j = μ - d, \dots, μ

can be positive.

Their values for a specific s are comprised in the B-spline vector

b_{μ, d} (s) = (b_{μ - d} (s), b_{μ - d + 1} (s), \dots, b_{μ} (s)) \in R^{1 \times (d + 1)}

which is calculated according to Equation (2):

\begin{matrix} b_{μ, d} (s) = \underset{\in R^{1 \times 2}}{\underset{︸}{B_{μ, 1} (s)}} \underset{\in R^{2 \times 3}}{\underset{︸}{B_{μ, 2} (s)}} \dots \underset{\in R^{δ \times (δ + 1)}}{\underset{︸}{B_{μ, δ} (s)}} \dots \underset{\in R^{d \times (d + 1)}}{\underset{︸}{B_{μ, d} (s)}} \end{matrix}

(2)

The B-spline matrix

B_{μ, δ} (s) \in R^{δ \times (δ + 1)}

with

δ \in N

and

δ \leq d

reads

B_{μ, δ} (s) = [\begin{matrix} \frac{κ_{μ + 1} - s}{κ_{μ + 1} - κ_{μ + 1 - δ}} & \frac{s - κ_{μ + 1 - δ}}{κ_{μ + 1} - κ_{μ + 1 - δ}} & 0 & \dots & 0 \\ 0 & \frac{κ_{μ + 2} - s}{κ_{μ + 2} - κ_{μ + 2 - δ}} & \frac{s - κ_{μ + 2 - δ}}{κ_{μ + 2} - κ_{μ + 2 - δ}} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋱ & ⋮ \\ 0 & 0 & \dots & \frac{κ_{μ + δ} - s}{κ_{μ + δ} - κ_{μ}} & \frac{s - κ_{μ}}{κ_{μ + δ} - κ_{μ}} \end{matrix}] .

(3)

D = [κ_{d + 1}, κ_{J + 1})

is the definition range of the B-spline function

f : D \to R, s \mapsto f (s)

. For

s \in [κ_{μ}, κ_{μ + 1})

, f is defined by

f (s) = b_{μ, d} (s) x_{μ, d}

(4)

with coefficient vector

x_{μ, d} = {(x_{μ - d}, x_{μ - d + 1}, \dots, x_{μ})}^{⊤} .

(5)

f has

d - 1

continuous derivatives. For

r \in N_{0}

, the rth derivative

\frac{\partial^{r}}{\partial s^{r}} f (s)

of f reads

\frac{\partial^{r}}{\partial s^{r}} f (s) = \frac{\partial^{r}}{\partial s^{r}} b_{μ, d} (s) x_{μ, d}

(6)

with B-spline vector

\frac{\partial^{r}}{\partial s^{r}} b_{μ, d} (s) = \{\begin{matrix} \frac{d!}{(d - r)!} B_{μ, 1} (s) \dots B_{μ, d - r} (s) B_{μ, d - r + 1}^{'} \dots B_{μ, d}^{'}, & if r \leq d \\ 0_{1 \times (d + 1)}, & otherwise . \end{matrix}

(7)

0_{1 \times (d + 1)}

is a

1 \times (d + 1)

zero matrix. The matrix

B_{μ, δ}^{'} \in R^{δ \times (δ + 1)}

results from computing the derivative with respect to s for each element of

B_{μ, δ} (s)

[43,45]:

B_{μ, δ}^{'} = [\begin{matrix} \frac{- 1}{κ_{μ + 1} - κ_{μ + 1 - δ}} & \frac{1}{κ_{μ + 1} - κ_{μ + 1 - δ}} & \dots & 0 \\ ⋮ & ⋱ & ⋱ & ⋮ \\ 0 & \dots & \frac{- 1}{κ_{μ + δ} - κ_{μ}} & \frac{1}{κ_{μ + δ} - κ_{μ}} \end{matrix}]

(8)

2.2. Marginalized Particle Filter

The marginalized particle filter (MPF) is an iterative algorithm for estimating the unknown state vector

x_{t}

of a system at time step

t \in N

.

In the MPF,

x_{t}

is subdivided into

x_{t} = {({(x_{t}^{L})}^{⊤}, {(x_{t}^{N})}^{⊤})}^{⊤},

whereby the KF optimally estimates the linear state vector

x_{t}^{L}

and a PF estimates the nonlinear state vector

x_{t}^{N}

. Exploiting linear substructures allows for better estimates and a reduction of the computational effort. Therefore, the MPF is beneficial for mixed linear/nonlinear state-space models [46]. Due to Equations (4) and (6), linear substructures will occur in approximation problems as long as there are target criteria that refer to the value of the B-spline function or its derivatives directly.

MPF algorithms for several state-space models can be found in Reference [46] along with a Matlab example that can be downloaded from [47]. An equivalent but new formulation of the MPF that allows for reused, efficient, and well-studied implementations of standard filtering components is stated in Reference [44].

For an NWLS approximation, we apply the following state-space model derived from Reference [44]:

\begin{matrix} x_{t + 1}^{N} & = A_{t}^{N} x_{t}^{N} + ω_{t}^{N} + u_{t}^{N} & (nonlinear state equation) \end{matrix}

(9)

\begin{matrix} x_{t + 1}^{L} & = A_{t}^{L} x_{t}^{L} + ω_{t}^{L} + u_{t}^{L} & (linear state equation) \end{matrix}

(10)

\begin{matrix} y_{t} & = C x_{t}^{L} + c (x_{t}^{N}) + υ_{t} & (measurement equation) \end{matrix}

(11)

The superscripts

^{L}

and

^{N}

indicate that the corresponding quantity refers to linear or nonlinear state variables, respectively.

A_{t}

denotes the state transition matrix,

u_{t}

is the known input vector,

y_{t}

is the vector of measurements,

C_{t}

is the measurement matrix, and

c

is the nonlinear measurement function that depends on

x_{t}^{N}

.

ω_{t}^{L}

denotes the process noise of the linear state vector with a covariance matrix

Q_{t}^{L}

,

ω_{t}^{N}

is the process noise of the nonlinear state vector with a covariance matrix

Q_{t}^{N}

, and

υ_{t}

is the measurement noise with a covariance matrix

R_{t}

.

The model of the conditionally linear subsystem in the KF has the state vector

{(ξ^{⊤}, {(x^{L})}^{⊤})}^{⊤}

, whereby

ξ

describes the linear dynamics of

x^{N}

:

\begin{matrix} (\begin{matrix} ξ_{t + 1} \\ x_{t + 1}^{L} \end{matrix}) & = (\begin{matrix} 0 & A_{t}^{N} \\ 0 & A_{t}^{L} \end{matrix}) (\begin{matrix} ξ_{t} \\ x_{t}^{L} \end{matrix}) + (\begin{matrix} u_{t}^{N} \\ u_{t}^{L} \end{matrix}) + (\begin{matrix} ω_{t}^{N} \\ ω_{t}^{L} \end{matrix}) \\ y_{t} & = (\begin{matrix} 0 & C_{t} \end{matrix}) (\begin{matrix} ξ_{t} \\ x_{t}^{L} \end{matrix}) + c (x_{t}^{L}) + υ_{t} \end{matrix}

(12)

The covariance matrix of process noise is

(\begin{matrix} Q_{t}^{N} & 0 \\ 0 & Q_{t}^{L} \end{matrix})

, and

0

denotes a zero matrix with a suitable size.

A PF with the model

\begin{matrix} x_{t + 1}^{N} & = {\bar{ω}}_{t}^{N} \\ y_{t} & = {\bar{υ}}_{t} \end{matrix}

(13)

deals with the remaining nonlinear effects. The noise depends on the estimates indicated by ^ from the conditionally linear model:

\begin{matrix} {\bar{ω}}_{t}^{N} & \sim N (\hat{ξ}_{t}, P_{t}^{ξ, -}) \\ {\bar{υ}}_{t} & \sim N (c (x_{t}^{N}) + C_{t} (x_{t}^{N}) {\hat{x}}^{L, -}, S_{t}) \end{matrix}

(14)

with

S_{t} = C_{t} P_{t}^{L, -} C_{t}^{⊤} + R_{t}

(15)

where the superscript

^{-}

refers to a priori quantities that are computed in the time update which is based on the state of Equations (9) and (10). In contrast,

^{+}

denotes a posteriori quantities that are calculated in the following measurement update based on the measurement of Equation (11).

P_{t}^{L, -}

and

P_{t}^{ξ, -}

are the covariance matrices of the estimation errors that belong to

{\hat{x}}_{t}^{L}

and

\hat{ξ}_{t}

, respectively.

The PF uses multiple state estimates called particles simultaneously. The superscript

^{p}

with

p = 1, \dots, P

is the particle index and

P

is the particle count. In general, a KF is used for each particle. In the chosen state-space model, however,

A_{t}^{L}

,

A_{t}^{N}

,

Q_{t}^{L}

, and

Q_{t}^{N}

are independent of

x_{t}^{L}

and

x_{t}^{N}

. This implies that

P_{t}^{L, -}

and

P_{t}^{ξ, -}

are identical for all KFs which reduces the computational effort substantially [44,46].

Algorithm 1 states the equations for one MPF iteration and was derived from References [44,46]. For an implementation in Matlab, we adapted the example from Reference [47]. Note that, in Algorithm 1, the measurement update of the previous time step

t - 1

occurs before the time update for the current time step

t

, similar to the algorithm in Reference [48].

In line 4 of Algorithm 1, linear particles are resampled according to their corresponding normalized importance weights. After resampling, particles with a low measurement error occur more frequently in the set of particles. Subsequently, all particles

{\hat{x}}_{t - 1}^{L, +, p}

are aggregated in line 5 to a single estimate

{\hat{x}}_{t - 1}^{+}

by calculating their mean.

After both KF and PF have been time updated, the KF is adjusted based on the PF estimates in a mixing step with the cross-covariances of the estimation errors,

P_{t}^{ξ L, -}

and

P_{t}^{L ξ, -}

.

In the new formulation from Reference [44], resampling occurs after the measurement update of both PF and KF. Therefore, the quantities computed for the measurement update of the PF can be reused for the KF measurement update. In particular, each particle is only evaluated once in line 1 of each MPF iteration instead of twice as with the previous formulation in Reference [46].

Algorithm 1: The marginalized particle filter derived from References [44,46]

2.3. Nonlinear Recursive B-Spline Approximation

The Nonlinear recursive B-spline approximation (NRBA) iteratively adapts a B-spline function

f (s)

with degree d to the data set from Section 1.5. Algorithm 2 states the instructions for one iteration of NRBA, which is based on the MPF.

In each iteration t, NRBA modifies f in

I \in N

consecutive spline intervals. Each linear particle

\hat{x}_{t}^{L, p} = {(\hat{x}_{t_{1}}^{L}, \hat{x}_{t_{2}}^{L}, \dots, \hat{x}_{t_{J}}^{L})}^{⊤}

and each nonlinear particle

\hat{x}_{t}^{N, p} = {(\hat{x}_{t_{1}}^{N}, \hat{x}_{t_{2}}^{N}, \dots, \hat{x}_{t_{J}}^{N})}^{⊤}

contains estimates for

J = d + I

function coefficients of f.

κ_{t} = (κ_{t_{1}}, κ_{t_{2}}, \dots, κ_{t_{K}})

denotes the knot vector comprising

K = J + d + 1

knots. The resulting definition range

D_{t}

of f is given by

D_{t} = [κ_{t_{d + 1}}, κ_{t_{J + 1}})

. NRBA checks if

s_{t}

is in the definition range of the previous time step,

D_{t - 1}

. If not,

D_{t - 1}

needs to be shifted such that

s_{t} \in D_{t}

. A shift can be conducted in the MPF time update. The result of the time update is the a priori estimate

{\hat{x}}_{t}^{-}

. In the following measurement update, we need

s_{t}

again to compute the measurement matrix

C_{t}

, and then, to take into account

y_{t}

. The result of the measurement update is the a posteriori estimate

{\hat{x}}_{t}^{+}

.

Figure 1 depicts the allocation of available data points and computed estimates

\hat{x}

to KF iterations in RBA versus MPF iterations in NRBA. The arrows indicate the needed information for computing the estimates. The KF is initialized with

{\hat{x}}_{0}^{+}

and conducts in each iteration a time update first and then a measurement update. Therefore, we need

n

iterations for

n

data points. In contrast, the MPF performs the measurement update first and is initialized with

{\hat{x}}_{0}^{-}

. Therefore, we have to save

y_{t}

and provide

s_{t}

,

s_{t + 1}

, and

y_{t}

for iteration

t + 1

. Hence, we need one iteration more than with the KF in order to take into account all data points. By definition, we use

(s_{1}, y_{1})

for computing

{\hat{x}}_{0}^{+}

and

s_{n}

for

{\hat{x}}_{n + 1}^{-}

as indicated by the dashed arrows.

2.3.1. Initialization

Each linear particle

{\hat{x}}_{0}^{L, -, p}

is initialized with

{\hat{x}}_{0}^{L, -, p} = {\bar{x}}^{Init} 1_{J \times 1}

, and each nonlinear particle

{\hat{x}}_{0}^{N, -, p}

is initialized with

{\hat{x}}_{0}^{N, -, p} = {\bar{x}}^{Init} 1_{J \times 1} + chol (\bar{p} I_{J \times J}) \cdot rnd_{J \times 1}

. Hereby,

1_{J \times 1}

is a

J \times 1

matrix of ones and

{\bar{x}}^{Init}

indicates an initial value equal to the scalar measurement

y_{1, v}

referring to f.

chol (\cdot)

computes the Cholesky factorization, and

rnd_{J \times 1}

is a

J \times 1

vector of random values drawn from the standard normal distribution. The covariance matrix of a priori estimation error of linear states,

P^{L, -}

, is initialized with

P_{0}^{L, -} = \bar{p} I_{J \times J}

.

I_{J \times J}

denotes a

J \times J

identity matrix.

The large scalar value

\bar{p}

causes

\hat{x}_{t}

to quickly change such that f adapts to the data. Provided that the values in

Q_{t}^{L}

are small, the values in

P_{t}^{L, -}

decrease as t grows because of line 8 of Algorithm 1. Small elements in

P_{t}^{L, -}

correspond to certain estimates. Therefore, the particles

{\hat{x}}_{t}^{L, -, p}

and

{\hat{x}}_{t}^{N, -, p}

are slower to be updated using measurements such that f converges. Analogous statements hold for

P_{t}^{ξ, -}

because of line 9 of Algorithm 1.

Hence, the process noises are defined as

Q_{t}^{L} = {\bar{q}}^{L} I_{J \times J}

and

Q_{t}^{N} = {\bar{q}}^{N} I_{J \times J}

with small positive values

{\bar{q}}^{L}

and

{\bar{q}}^{N}

, respectively.

2.3.2. Measurement Update

The measurement update from line 1 to line 4 of Algorithm 1 adapts

f (s)

based on

(s_{t - 1}, y_{t - 1})

.

The vth dimension of

y_{t - 1}

refers to either f itself or a derivative of f. Therefore, the vth row of the

V_{t - 1} \times J

measurement matrix

C_{t - 1}

reads

C_{{t - 1}_{v; 1, \dots, J}} = (0_{1 \times (μ - (d + 1))}, \frac{\partial^{r}}{\partial s^{r}} b_{μ, d} (s_{t - 1}), 0_{1 \times (J - μ)}),

(16)

whereby

s_{t - 1} \in [κ_{μ}, κ_{μ + 1})

and

r \in N_{0}

. Algorithm 2 computes

C_{t - 1}

in line 7 using Equation (16).

The value of the nonlinear measurement function

c

depends on the nonlinear particles

{\hat{x}}_{t - 1}^{N, -, p}

. Furthermore,

c

can depend on additional quantities that vary with the application and are not stated in Algorithm 1.

The diagonal

V_{t} \times V_{t}

covariance matrix of measurement noise

R_{t - 1}

enables a relative weighting of the dimensions of

y_{t - 1}

because the influence of the vth dimension of the measurement error

e_{t}^{p} = (y_{t - 1} - {\hat{y}}^{p})

on

{\hat{x}}_{t - 1}^{L, -, p}

and

{\hat{x}}_{t - 1}^{N, -, p}

decreases with a growing positive value

R_{{t - 1}_{v; v}}

.

Algorithm 2: Nonlinear recursive B-spline approximation

2.3.3. Time Update with Shift Operation

Based on a comparison between

κ_{t - 1}

and

s_{t}

, NRBA decides if a shift operation of the B-spline function definition range is required to achieve that

s_{t} \in D_{t}

.

The variable

σ

calculated from line 8 to line 21 of Algorithm 2 states the shift direction of

D_{t - 1}

and by how many positions components in

κ_{t - 1}

,

{\hat{x}}_{t - 1}^{L, -, p}

and

{\hat{x}}_{t - 1}^{N, -, p}

need to be moved for that purpose.

σ > 0

indicates a right shift of

D_{t - 1}

,

σ < 0

indicates a left shift, and

σ = 0

means that no shift is conducted because

s_{t} \in D_{t - 1}

.

Algorithm 2 expects that, for

σ > 0

, the

| σ |

additionally needed knots are the

σ

last entries of the knot vector

{\bar{κ}}_{t} = ({\bar{κ}}_{t_{1}}, {\bar{κ}}_{t_{2}}, \dots, {\bar{κ}}_{t_{K}})

and that they are the

- σ

first entries of

{\bar{κ}}_{t}

if

σ < 0

.

Case 1: Right shift of definition range (

σ \geq 0

)

The updated knot vector reads

\begin{matrix} κ_{t} \leftarrow (κ_{{t - 1}_{σ + 1}}, κ_{{t - 1}_{σ + 2}}, \dots, κ_{{t - 1}_{K}}, \\ {\bar{κ}}_{t_{K - σ + 1}}, {\bar{κ}}_{t_{K - σ + 2}}, \dots, {\bar{κ}}_{t_{K}}) \end{matrix}

(17)

and line 6 of Algorithm 1 updates

{\hat{x}}_{t - 1}^{L, +, p}

to

{\hat{x}}_{t}^{L, -, p}

using the state transition matrix

A_{t}^{L} = A_{t}

(18)

with

A_{t} \in R^{J \times J} with A_{t_{g; h}} = \{\begin{matrix} 1, & if h = g + σ \\ 0, & otherwise . \end{matrix}

(19)

and the input signal vector

u_{t}^{L} = u_{t}

(20)

with

u_{t} = {(0_{1 \times (J - σ)}, \bar{x} 1_{1 \times σ})}^{⊤} .

(21)

Thereby all entries of

{\hat{x}}_{t - 1}^{L, +, p}

are moved to the left and the last

σ

entries of

{\hat{x}}_{t}^{L, -, p}

have an arbitrary initial value

\bar{x}

:

{\hat{x}}_{t}^{L, -, p} = {(\hat{x}_{{t - 1}_{σ + 1}}^{L}, \hat{x}_{{t - 1}_{σ + 2}}^{L}, \dots, \hat{x}_{{t - 1}_{J - σ}}^{L}, \bar{x} 1_{1 \times σ})}^{⊤}

(22)

During a right shift of the definition range, we set

\bar{x}

to the last element of

{\hat{x}}_{t - 2}^{+}

, which is determined during the preceding call of Algorithm 1 in line 5. This is based on the assumption that

{\hat{x}}_{t - 2}^{+}

is a good initial value in the magnitude of the data.

Additionally, line 8 of Algorithm 1 updates

P_{t - 1}^{L, +}

to

P_{t}^{L, -}

using Equation (18) and

Q_{t}^{L} \in R^{J \times J} with Q_{t_{g; h}}^{L} = \{\begin{matrix} \bar{p}, & if h = g \land Q \\ {\bar{q}}^{L}, & if h = g \land \neg Q \\ 0, & otherwise . \end{matrix}

(23)

with

Q = \{\begin{matrix} h \geq J - σ + 1, & if σ \geq 0 \\ h \leq - σ, & if σ < 0 \end{matrix}

(24)

The update operation moves the elements in

P_{t - 1}^{L, +}

to the top left and replaces the zeros on the last σ main diagonal elements of

Q_{t}^{L}

with

\bar{p}

in order to get large values on the last σ main diagonal elements of

P_{t}^{L, -}

and a fast adaption of the initial estimates

\bar{x}

to the data points.

In line 7 and line 9, Algorithm 1 computes the the quantities

{\hat{ξ}}_{t}^{p}

and

P_{t}^{ξ, -}

that are needed for the PF time update. The calculations of the state transition matrix

A^{N}

with

A_{t}^{N} = A_{t}

(25)

and the input signal vector

u^{N}

with

u_{t}^{N} = u_{t}

(26)

are analogous to those for the linear quantities.

Q^{N}

uses

{\bar{q}}^{N}

instead of

{\bar{q}}^{L}

:

Q_{t}^{N} \in R^{J \times J} with Q_{t_{g; h}}^{N} = \{\begin{matrix} \bar{p}, & if h = g \land Q \\ {\bar{q}}^{N}, & if h = g \land \neg Q \\ 0, & otherwise . \end{matrix}

(27)

Case 2: Left shift of definition range (

σ < 0

)

The updated knot vector is

κ_{t} \leftarrow ({\bar{κ}}_{t_{1}}, {\bar{κ}}_{t_{2}}, \dots, {\bar{κ}}_{t_{- σ}}, κ_{{t - 1}_{1}}, κ_{{t - 1}_{2}}, \dots, κ_{{t - 1}_{K + σ}}),

(28)

the input signal vector for linear states

u^{L}

reads

u_{t}^{L} = u_{t}

(29)

and the input signal vector for nonlinear states

u^{N}

is given by

u_{t}^{N} = u_{t}

(30)

with

u_{t} \leftarrow {(\bar{x} 1_{1 \times (- σ)}, 0_{1 \times (J + σ)})}^{⊤} .

(31)

Additionally, we set

\bar{x}

to the first component of

{\hat{x}}_{t - 2}^{+}

.

Note that since

A_{t}^{L}

and

A_{t}^{N}

are identical in the chosen state-space model, we can save computational effort when calculating the covariances and cross-covariances from line 8 to line 11 in Algorithm 1.

2.3.4. Effect of the Shift Operation

The shift operation decouples the dimension of the state vector from the total number of estimated coefficients. As a result, NRBA can determine an unknown and unbounded number of coefficients while the effort per iteration only depends on the number of spline intervals in which the approximating function can be adapted simultaneously.

However, the shift operation causes NRBA to partially forget the approximation result in order to keep the dimensions of matrices and vectors constant.

κ_{t}

and

\hat{x}_{t}

only allow an evaluation of

f (s)

for

s \in [κ_{t_{d + 1}}, κ_{t_{d + I + 1}})

. The forgetting mechanism can be circumvened by copying old NRBA elements before they are overwritten.

3. Numerical Experiments

We apply Algorithm 2 in numerical experiments. Thereby, we also investigate the effects of the number of simultaneously adaptable spline intervals and the particle count on the NRBA solution. An implementation in Matlab is provided in [49]. The LM algorithm [50] with Matlab standard settings serves as a benchmark.

3.1. General Experimental Setup

The data set

{(s_{t}, y_{t})}_{t = 1, 2, \dots, n}

is defined according to Section 1.5, whereby

\begin{matrix} s_{t} & = 0.25 + 0.5 \cdot (t - 1), \end{matrix}

(32)

\begin{matrix} y_{t, 1} & = \{\begin{matrix} 40, if 80 \leq s_{t} < 120 \\ 30, otherwise \end{matrix} \end{matrix}

(33)

\begin{matrix} y_{t, 2} & = y_{t, 3} = y_{t, 4} = 0 \forall t \end{matrix}

(34)

\begin{matrix} n & = 400 . \end{matrix}

(35)

A B-spline function

f (s)

of degree

d = 3

and with knot vector

κ = (- 30, - 20, \dots, 230)

approximates the data. Thereby, we suppose that

y_{t, 1}

refers to f,

y_{t, 2}

to the first derivative of f,

y_{t, 3}

to the second derivative of f, and

y_{t, 4}

to the value of the nonlinear measurement function

c

.

The nonlinear measurement function

c

is defined as a quadratic B-spline function with

κ = (- 5, 0, \dots, 70)

and

x = {(0, 0, 0, 0.25, 1.5, 5, 5, 0, 0, 6, 8, 8, 8)}^{⊤}

.

c

depends on the value of the approximating function

f (s)

and is displayed in Figure 2. The input variable

f (s)

of

c

is restricted to the definition range

[5, 60]

of

c

.

The diagonal measurement covariance matrix

R_{t} \in R^{4 \times 4}

with

R_{t_{1; 1}} = 1

,

R_{t_{2; 2}} = 5 \cdot 10^{- 2}

,

R_{t_{3; 3}} = 5 \cdot 10^{- 3}

and

R_{t_{4; 4}} = 0.8

or

10^{6}

, respectively, comprises the reciprocal weights of

y_{t, 1}

,

y_{t, 2}

,

y_{t, 3}

and

y_{t, 4}

. The reciprocal weight values for the first three dimensions of

y_{t}

avoid that f oscillates and cause that f smooths the jumps in the first dimension of the measurements. With

R_{t_{4; 4}} = 0.8

, we weight the nonlinear target criterion

c (f (s)) = 0

heavily, whereas with

R_{t_{4; 4}} = 10^{6}

, it is almost completely neglected.

Depending on the applied algorithm, solutions for the former weighting case are denoted with

{NRBA}^{N}

or

{LM}^{N}

, indicating the nonlinear approximation. Solutions for the latter case are denoted with

{NRBA}^{L}

or

{LM}^{L}

, indicating that we apply the corresponding algorithm to a quasi-linear approximation problem.

We analyze solutions for two different numbers of spline intervals

I

. For

I = 1

, we initialize

κ

with

κ_{0} = (- 30, 20, \dots, 40)

, which leads to an initial definition range

[0, 10)

of f. For

I = 3

, we initialize

κ

with

κ_{0} = (- 30, 20, \dots, 60)

, and the resulting definition range is

[0, 30)

. In both cases, NRBA approximates the data by repeatedly shifting the function definition range to the right. Each time, an additional knot value

{\bar{κ}}_{t_{K}}

needs to be provided in the vector

{\bar{κ}}_{t}

. For

I = 1

, these values are

{\bar{κ}}_{t_{K}} = 50, 60, \dots, 230

, and for

I = 3

, they read

{\bar{κ}}_{t_{K}} = 70, 80, \dots, 230

.

In order to display the NRBA results for the whole data set, we store all values that are moved out of NRBA matrices and vectors elsewhere. The remaining NRBA parameters are

{\bar{q}}^{L} = 0.005

,

{\bar{q}}^{N} = 0.25

, and

\bar{p} = 30

. The LM algorithm uses

{\bar{x}}^{Init} = 30

as the initial value for each coefficient.

Due to the included PF, NRBA is a sampling-based, nondeterministic method and its results vary between different approximation runs. Therefore, we apply a Monte Carlo analysis and perform 50 runs for each approximation setting. For each run, we calculate the normalized root mean square error (NRMSE) between the B-spline function determined by NRBA,

f_{NRBA}

, and the B-spline function according to LM,

f_{LM}

, as follows:

NRMSE = \frac{1}{{max}_{t = 1, \dots, n} {f_{LM} (s_{t})} - {min}_{t = 1, \dots, n} {f_{LM} (s_{t})}} \cdot \sqrt{\frac{\sum_{t = 1}^{n} {(f_{NRBA} (s_{t}) - f_{LM} (s_{t}))}^{2}}{n}}

(36)

With the notation

{NRMSE}^{min}

,

{NRMSE}^{med}

, and

{NRMSE}^{max}

, we refer to the NRBA solution with the minimum, median, or maximum NRMSE, respectively, in each set of 50 runs.

3.2. Effect of Weighting and Nonlinear Measurement Function

Figure 3 shows the approximating functions of each algorithm for both

R_{t_{4; 4}} = 0.8

and

R_{t_{4; 4}} = 10^{6}

. It displays for each weighting the NRBA solutions that achieve the median and the maximum NRMSE compared to the LM solution with a same weighting.

I

is set to one for all NRBA approximations; hence, the MPF state vector comprises four linear and four nonlinear components. Furthermore, we choose

P = 6561 = 9^{4}

; therefore, the PF creates nine samples per nonlinear state dimension.

The black dots depict the first component y_t,1 of the data points (s_t, y_t). For a better visualization of the approximating functions, only two representative data dots per spline interval are displayed. For f (s) = 30, the deviation between the value of c and its target value y_t,4 = 0 has a local maximum (c.f. Figure 2). In NRBA^N and LM^N, this deviation is penalized strongly; hence, these solutions avoid f (s) = 30. In contrast, NRBA^L and LM^L approximate data with y_t,1 = 30 closely because the nonlinear criterion is weighted only to a negligible extent.

Dashed vertical lines indicate knots, whereby the first and last knots are not shown. Data and knot vector are symmetrical to the straight line defined by s = 100. Since the LM algorithm processes all data simultaneously in each iteration, the solutions LM^L and LM^N in Figure 3 reflect this symmetry.

In contrast, NRBA processes the data from left to right and can only adapt some coefficients at a time. For I = 1, these are the four coefficients that influence the B-spline function in the spline interval in which the current data point lies. The double-headed arrow in Figure 3 visualizes the range in which NRBA can adapt f simultaneously while (s_n, y_n) is taken into account.

The solutions NRBA^L and NRBA^N are both asymmetrical and mostly delayed with respect to LM^L and LM^N. However, with NRBA^N, the asymmetry is less distinct. The reason for this is that, in the nonlinear problem, the PF removes states with a high delay more quickly from the particle set because they create a larger error. Additionally, the range of values in NRBA^N is smaller than in NRBA^L so that a present lag is less obvious.

Furthermore, we see that, for the same weighting, NRMSE^med and NRMSE^max differ only slightly. This suggests that, for the investigated settings, P = 6561 suffices for a convergence of NRBA solutions.

3.3. Effect of Interval Count

The number of spline intervals

I

determines the number of intervals in which NRBA can adapt the approximating B-spline function simultaneously.

When we proposed the algorithm RBA for a linear-weighted least squares approximation in Reference [43], we conducted numerical experiments similiar to the ones in this publication but without any nonlinear approximation criterion. For

I = 1

, we observed a strong asymmetry and delay with RBA, analogous to

{NRBA}^{L}

in Figure 3. The filter delay diminished when

I

was increased to seven. This is because the filter is then able to update more coefficient estimates with hinsight based on

P^{L, +}

.

In this subsection, we investigate the effect of increasing

I

from one to three with NRBA. With

I = 3

, NRBA can simultaneously adapt not only the coefficients that are relevant for the spline interval in which the current data point lies but also the two coefficients that affect the two spline intervals to the left. However,

I

also determines the dimension of the state space. With

I = 3

, there are six linear and six nonlinear components. The PF samples the state space less densely unless the particle count is increased exponentially with

I

.

First, we keep the sampling density per nonlinear state space dimension constant by choosing

P = 625 = 5^{4}

for

I = 1

and

P

= 15,625 =

5^{6}

for

I = 3

.

Figure 4 displays the results for the quasi-linear approximation problem. With

I = 3

, the NRBA solution is more symmetrical than with

I = 1

for

70 \leq s < 120

as it follows the increase of

y_{t, 1}

more closely. However, a comparison of

{NRMSE}^{med}

for

I = 1

and

I = 3

indicates that the increase of

I

does not translate to a reduction of the delay for

s \geq 120

. The ability to adapt more coefficient estimates with hinsight can also lead not necessarily to beneficial effects. The examples are the too low course of NRBA for

I = 3

between

s = 40

and

s = 60

and the overcompensation of the delay between

s = 60

and

s = 75

.

For I = 1, NRMSE^max differs more from NRMSE^med and shows larger oscillation amplitudes between s = 130 and s = 170 than for I = 3. This suggests that P = 625 is not sufficient for a convergence of NRBA for I = 1. Although we use only 625 particles for I = 1, the required increase to P = 15,625 for I = 3 is quite strong. This illustrates that keeping the sampling density constant quickly becomes infeasible, especially if computation time constraints are present [44]. Figure 5 shows the results for the nonlinear approximation problem and supports the previously drawn conclusions. Additionally, we see for s ≤ 20, that the conflicting target criteria in the nonlinear approximation problem cause a larger period for stabilization.

Second, we investigate the effect of an exclusive I increase from I = 1 to I = 3 while maintaining the particle count of Section 3.2. Figure 3 then depicts the case for I = 1, and Figure 6 depicts the results for I = 3. When we compare in both figures the NRMSE^max solution to the corresponding NRMSE^med solution, we notice that they differ much more for I = 3. This indicates that more particles are needed for convergence for I = 3. Especially, we notice that, with I = 3, these differences are much larger for NRBA^N than for NRBA^L.

With the chosen setup, an increasing I yields no clear approximation improvement when we compare corresponding NRMSE^med solutions in both figures. Figure 6 also shows that NRBA^N temporarily decreases below f (s) = 30, the position of the maximum of c (c.f. Figure 2). This illustrates how the sequential data processing of filter-based methods can lead to solutions that differ from those of a batch method.

3.4. Effect of Particle Count on Convergence

The computational effort of MPF increases linearly with the particle count

P

. For an example with seven linear and two nonlinear state vector components, Reference [46] chooses

P = 5000

and reports that, up to this particle count, increasing

P

reduces the convergence time significantly and leads to better estimates. Other examples in References [44,51] with four linear and two nonlinear state vector components uses

P = 2000

. The Matlab example in Reference [47] with three linear components and one nonlinear component uses only

P = 200

.

Figure 7 depicts the effect of

P

on the convergence of NRBA. For each combination of quasi-linear approximation problem L and nonlinear approximation problem N with

I = 1

and

I = 3

, the figure shows the courses of

{NRMSE}^{min}

,

{NRMSE}^{med}

, and

{NRMSE}^{max}

versus

P

. The investigated particle counts are

256 = 4^{4}, 625 = 5^{4}, 729 = 3^{6}, 1296 = 6^{4}, 2401 = 7^{4}, 4096 = 4^{6} = 8^{4}, 6561 = 9^{4}

, and 15,625

= 5^{6}

.

Between

{NRBA}^{L}

and

{NRBA}^{N}

, the NRMSE values are on different levels because the LM reference in the NRMSE from Equation (36) differs between

{LM}^{L}

and

{LM}^{N}

and the normalization factor in Equation (36) does not fully compensate for this. For

{NRBA}^{L}

with

I = 1

and

I = 3

and

{NRBA}^{N}

with

I = 1

,

{NRMSE}^{min}

and

{NRMSE}^{med}

decrease quickly and remain almost constant when

P

is further increased from

P = 4096

on. For

{NRBA}^{N}

with

I = 3

, the courses of

{NRMSE}^{min}

and

{NRMSE}^{med}

suggest using

P = 6561

.

{NRMSE}^{max}

are the observed worst case results. According to the

{NRMSE}^{max}

courses,

P = 6561

should be used for

{NRBA}^{L}

,

P =

15,625 for

{NRBA}^{N}

with

I = 1

and at least

P =

15,625 for

{NRBA}^{N}

with

I = 3

.

For

{NRBA}^{N}

with

I = 3

,

{NRMSE}^{max}

remains comparatively large because, in some runs, the approximating functions are below

f (s) = 30

as shown in Figure 6. Only for

P =

15,625, such results are not observed anymore (c.f. Figure 5) and the

{NRMSE}^{max}

value is similar to that for

{NRBA}^{L}

. As stated, the heavy penalization of the nonlinear criterion causes the MPF to remove bad particles quickly from the particle set, which reduces the filter lag. However, the MPF then relies more on the state-space sampling on the suboptimal PF than on the optimal KF. In combination with too few particles, this affects the results very negatively in the experiments.

3.5. Mean and Standard Deviation of NRBA Error

For insights into the statistical features of NRBA, we determine the mean and standard deviation of the error vector

e

between the NRBA and LM solutions over all 50 Monte Carlo runs for each approximation setting. Hereby, we consider the error vector of function values between NRBA and LM,

e_{f}

, as well as the error vector of coefficient values between NRBA and LM,

e_{x}

. The mean

\bar{e}

with

\bar{e} = \frac{1}{E} \sum_{i = 1}^{E} e_{i}

is an indicator for a bias of NRBA estimates, whereas the sample standard deviation

σ_{e}

with

σ_{e} = \sqrt{\frac{1}{E - 1} \sum_{i = 1}^{E} {(e_{i} - \bar{e})}^{2}}

is a measure for bias stability.

e

is a single error vector component, and

E

denotes the number of components in the error vector.

Table 1 displays the mean and standard deviation of

e_{f}

, and Table 2 shows these statistic measures for

e_{x}

. Both tables enable the following statements:

The mean of the error vector of the quasi-linear approximation problem is not clearly influenced by the particle count. Furthermore, the varying signs of the means close to zero speak against a bias for the quasi-linear approximation problem. In contrast, for the nonlinear approximation problem, the means are always negative and, therefore, biased. The negative sign is a problem-specific result and means that the solution

{NRBA}^{N}

is, in general, between

{LM}^{L}

and

{LM}^{N}

. The bias itself, however, seems to be a systematic effect of the interaction of KF and PF of which the system models are weighted relative to each other according to the covariance matrix of process noise for linear states

Q^{L}

and the covariance matrix of process noise for nonlinear states

Q^{N}

. Decreasing

Q^{N}

might help to reduce the magnitude of this bias. Moreover, we note that the absolute values of the means become smaller as the particle count is increased. As the NRBA results for nonlinear approximation problem rely heavily on the PF, this relation is comprehensible.

The standard deviation, in general, decreases for all investigated settings as the particle count is increased. With 15,625 particles, the standard deviations of the quasi-linear approximation problems are two to three times larger than those of the nonlinear approximation problems. This also is a problem-specific effect. For the nonlinear approximation problem, the range of the function values is considerably smaller than that of the quasi-linear problem, which favors lower standard deviations. For the nonlinear problem with the number of spline intervals equal to three, the relatively large standard deviations reflect the often disadvantageous courses of the approximating functions again as depicted in Figure 6.

4. Trajectory Optimization

This section demonstrates how NRBA can be applied for a multiobjective trajectory optimization. The trajectory represents the planned vehicle velocity with respect to time

τ

measured from present into the future and is a B-spline function as defined in Equation (4) with degree

d = 3

, knot vector

κ

, and coefficient vector

x

. Due to its interpretation as a temporal velocity trajectory, we refer to the B-spline function as

v_{TJY} (τ)

instead of

f (s)

.

κ

has equidistant and strictly monotonously increasing entries

κ = (κ_{1}, κ_{2}, \dots, κ_{K}) = (- Δ τ_{K} \cdot d, Δ τ_{K} \cdot (d + 1), \dots, Δ τ_{K} \cdot d + K - 1)

(37)

where

Δ τ_{K}

denotes the constant temporal distance of neighboring knots. Due to the choice of

κ

,

v_{TJY} (τ)

can be evaluated for

τ \geq 0

.

τ

is discretized using a positive constant temporal distance of neighboring data points

Δ τ_{It}

:

τ_{t} = (t - 1) \cdot Δ τ_{It}, t = 1, 2, \dots, n

(38)

Each component of the vector of measurements

y_{t}

of the data set in Section 1.5 is interpreted as a target value of an optimization goal.

y_{t, 1}

is assumed to be a suggested time-discrete course of velocity with a velocity set point

v_{Set}

which comes from a preceding planning method:

y_{t, 1} = v_{Set, t}

(39)

The remaining components

y_{t, 2}

,

y_{t, 3}

, and

y_{t, 4}

of

y_{t}

are assumed zero as before. NRBA solves the optimization problem

\hat{x} = \underset{x}{arg min} \sum_{t = 1}^{n} (R_{v}^{- 1} \cdot {(v_{Set, t} - v_{TJY} (τ_{t}))}^{2} + R_{a}^{- 1} \cdot a_{TJY} {(τ_{t})}^{2} + R_{j}^{- 1} \cdot j_{TJY} {(τ_{t})}^{2} + R_{P}^{- 1} \cdot {\hat{P}}_{elec} {(τ_{t})}^{2})

(40)

Each summand of the optimization function refers to an optimization goal. Under the assumption that

v_{Set}

takes into account driving dynamics, the first summand can be interpreted as driving safety and the optimized trajectory should remain close to the course of

v_{Set}

.

a_{TJY}

denotes the trajectory acceleration and

j_{TJY}

the trajectory jerk. These quantities are the first and second derivatives of

v_{TJY}

and can be derived according to Equation (6). The second and third summands demand a smooth drive with a low acceleration and acceleration changes and, thus, refer to driving comfort. The last summand penalizes the absolute values of the estimated electric traction power

{\hat{P}}_{elec}

, which is used as a measure for driving efficiency.

Each optimization goal has a corresponding weight.

R_{v}^{- 1}

denotes the weight of velocity error square,

R_{a}^{- 1}

denotes the weight of acceleration error square,

R_{j}^{- 1}

denotes the weight of jerk error square, and

R_{P}^{- 1}

denotes the weight of power error square. The reciprocals of the weights follow the interpretation of the filter algorithms and refer to the variances of the artificial measurements.

R_{v}

is the variance of velocity measurement,

R_{a}

is the variance of acceleration measurement,

R_{j}

is the variance of jerk measurement, and

R_{P}

is the variance of power measurement.

Without the fourth goal, RBA would suffice for solving the problem because the first three goals are all linear in the coefficients. However, the energy consumption minimization goal requires a nonlinear method. In the following, we consider a BEV based on the Porsche Boxster (type 981), which is described in References [52,53,54]. Like most BEVs, its powertrain has a fixed gear ratio, which simplifies the optimization problem and allows us to apply NRBA.

In a BEV, the powertrain converts electric traction power

P_{elec}

provided by the battery into mechanic traction power

P_{mech}

for vehicle propulsion. During recuperative braking, the power flow is vice versa. We will neglect the additional power for auxillaries such as air conditioning because it depends on environmental conditions and comfort requirements strongly.

P_{mech}

equals the product of the traction force

F_{trac}

and the vehicle velocity

v_{vhcl}

, whereby

F_{trac}

equals the sum of driving resistances. The dominant driving resistances are air resistance. which increases quadratically with

v_{vhcl}

, the inertial force which is a linear function of the vehicle acceleration

a_{vhcl}

and the climbing force which depends on the road slope angle

α

.

During this power conversion, losses occur in various components of the powertrain. In order to provide sufficient

F_{trac}

for a high acceleration or high velocity, the electric motor must generate a high torque which requires a large electric current

I

. The internal ohmic resistance

R

of electric components such as the battery causes an ohmic traction power loss

P_{loss, ohmic}

which is given by

P_{loss, ohmic} = R \cdot I^{2}

. Furthermore, friction losses in the gearbox increase with rotation speed and transmitted torque [55].

P_{elec}

can be computed in the vehicle from voltage and current sensor data. However, due to a lack of sensors,

F_{trac}

and

P_{mech}

cannot be calculated, and therefore, power losses cannot be determined in the vehicle during its operation. As power losses increase with the absolute value of

P_{elec}

, we use

P_{elec}

as a measure for power losses and create a mathematical model of

P_{elec}

that outputs the estimated electric traction power

{\hat{P}}_{elec}

based on the inputs

v_{vhcl}

,

a_{vhcl}

, and

α

. The mathematical model can adapt its parameters during vehicle operation because both model outputs and model inputs are known quantities during vehicle operation. The adaption is neccessary for accurate estimates because vehicle parameters such as mass or air drag coefficient can change and the driving resistances also depend on these parameters. The mathematical model serves as nonlinear measurement function for NRBA, whereby we assume that

v_{TJY} = v_{vhcl}

and

a_{TJY} = a_{vhcl}

. By penalizing the absolute value of

{\hat{P}}_{elec}

in Equation (40), we encourage NRBA to determine energy-efficient velocity trajectories.

The first diagram of Figure 8 displays the velocity v versus the time

τ

according to the velocity set points

v_{Set}

of the reference as well as three trajectories optimized by NRBA. The NRBA trajectories are denoted

{NRBA}^{1}

,

{NRBA}^{2}

, and

{NRBA}^{3}

and differ in the choice of

R_{P}

. We use

R_{P} = 10^{4}

for

{NRBA}^{1}

,

R_{P} = 500

for

{NRBA}^{2}

, and

R_{P} = 100

for

{NRBA}^{3}

. The remaining parameter values are

R_{v} = 5

,

R_{a} = 10

,

R_{j} = 1

,

I = 1

,

{\bar{q}}^{L} = 0.005

,

{\bar{q}}^{N} = 0.5

,

\bar{p} = 15

,

P = 1000

,

Δ τ_{K} = 2

, and

Δ τ_{It} = 0.25

.

The second diagram shows the estimated electric traction power

{\hat{P}}_{elec}

according to the mathematical model. The traction power loss

P_{loss}

and traction energy

E

depicted in the third and fourth diagram originate from a detailed vehicle model. The detailed vehicle model includes parameters for all relevant power losses. These parameters were derived from component tests on test benches. The detailed model requires

F_{trac}

as an input and, therefore, assumptions concerning the driving resistance parameters. For simplicity, we assume a slope-free road in this example. An implementation of this example in Matlab is also provided in Reference [49].

The trajectory

{NRBA}^{1}

follows the reference closely apart from some short and large changes of

v_{Set}

between

τ = 250

and

τ = 300

. Staying close to the reference requires several positive and negative peaks in

{\hat{P}}_{elec}

, which are almost not penalized because of the high variance of power measurement

R_{P}

. As

R_{P}

is decreased, the trajectories exhibit lower velocities and absolute values of acceleration in order to avoid large absolute values of

{\hat{P}}_{elec}

. Between

τ = 250

and

τ = 300

decreasing,

R_{P}

has almost no effect because

{\hat{P}}_{elec}

is close to zero because the velocity is low.

The last three diagrams show that

{\hat{P}}_{elec}

is a suitable measure for the goal of a lower energy consumption. A comparison of the peaks in

{\hat{P}}_{elec}

and

P_{loss}

at

τ = 310

illustrates that

P_{loss}

increases with

| {\hat{P}}_{elec} |

more than linearly.

Note that there are some situations in which the trajectories exceed

v_{Set}

. Depending on the exact application, interpreting

v_{Set}

as an upper velocity limit might be more suitable. By penalizing positive deviations

(v_{Set, t} - v_{TJY} (τ_{t}))

more strongly than negative ones in each NRBA iteration using a sign-dependent

R_{v}

value, exceeding

v_{Set}

can be avoided.

5. Conclusions

We presented a filter-based algorithm denoted nonlinear recursive B-spline approximation (NRBA) that determines a B-spline function such that it approximates an unbounded number of data points in the nonlinear weighted least squares sense. NRBA uses a marginalized particle filter (MPF), also denoted a Rao–Blackwellized particle filter, for solving the approximation problem iteratively. In the MPF, a particle filter (PF) takes into account the approximation criteria that relate to the function coefficients in a nonlinear fashion whereas a Kalman filter (KF) solves any linear subproblem optimally. Thus, the particle count in the PF can be reduced.

As the value of the B-spline function and its derivatives depend linearly on the coefficient values, linear approximation criteria will occur in most approximation applications. The MPF accepts the exactly known values of the B-spline function basis functions as an input and does not need to estimate them like many other nonlinear filters do. Therefore, the MPF enables a reduction in the computational effort and an achievement of better results compared to purely nonlinear filters [46].

NRBA can shift estimated coefficients in the MPF state vector which allows an adaptation of the bounded B-spline function definition range during run-time such that, regardless of the initially selected definition range, all data points can be processed. Additionally the shift operation enables a decrease in the dimension of the state vector for less computational effort.

In numerical experiments, we compared NRBA to the Levenberg-Marquardt (LM) algorithm and investigated the effects of NRBA parameters on the approximation result using a Monte Carlo simulation. Provided that the NRBA parameters are chosen appropriately, the NRBA solution is close to the LM solution apart from some filter-typical delay. For a strong weighting of the nonlinear approximation criteria, the result relies more on the state-space sampling of the PF than on the KF. In combination with too few particles, the approximating function tends to oscillate.

NRBA use cases are nonlinear weighted least squares (NWLS) problems in which a linearization of nonlinear criteria is not desired or promising, for example, because of strong nonlinearities. For linear weighted least squares problems, the recursive B-spline approximation (RBA) algorithm proposed in Reference [43] should be used instead of NRBA. RBA is based on the KF, which computes an optimal solution [38]. In contrast, the PF in NRBA causes NRBA to, at best, reach the same approximation quality provided that the particle count is large enough, which requires more computational effort.

Furthermore, with NRBA, the approximation depends more heavily on the parameterization of the underlying filter algorithm than with RBA. An increase of the number of coefficients that NRBA can adapt simultaneously is not as unambiguously beneficial as with RBA and usually needs to be combined with an exponential increase of the particle count in the PF for an improvement of the approximation.

As demonstrated, NRBA is suitable for an unconstrained multiobjective trajectory optimization. Thereby, a major advantage of NRBA is a linear increase of the computational effort with the number of processed data points as opposed to an exponential increase with most other direct trajectory optimization methods. NRBA can also be applied during the processing of discrete signals in a time domain. Then NRBA can provide a sparse, continuous, and smoothed representation of the signals themselves or of their derivatives.

The chosen MPF formulation allows an easy replacement of the standard KF and PF. For example, Reference [56] presents a PF, in which the particles are determined with a particle swarm optimization, and reports that less particles are needed compared with the standard PF. An improvement of the MPF is proposed by Reference [57]. Investigating these algorithms in combination with NRBA can be the subject of further research.

Author Contributions

Conceptualization, J.J. and F.B.; methodology, J.J. and F.B.; software, J.J.; investigation, J.J.; writing—original draft preparation, J.J. and F.B.; writing—review and editing, M.F., F.G; supervision, F.G. and M.F.; project administration, F.G.; funding acquisition, M.F.

Funding

This research was funded by the German Federal Ministry of Education and Research under grant number 16EMO0071. The APC was funded by the KIT-Publication Fund of the Karlsruhe Institute of Technology.

Acknowledgments

We would like to thank the Ing. h.c. F. Porsche AG, who was among the research project partners, for the provision of vehicle data. Furthermore, we appreciate the valuable comments and suggestions of the anonymous reviewers for the improvement of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Zhao, X.; Kargoll, B.; Omidalizarandi, M.; Xu, X.; Alkhatib, H. Model Selection for Parametric Surfaces Approximating 3D Point Clouds for Deformation Analysis. Remote Sens. 2018, 10, 634. [Google Scholar] [CrossRef]
Jiang, Z. A New Approximation Method with High Order Accuracy. Math. Comput. Appl. 2017, 22, 11. [Google Scholar] [CrossRef]
Majid Amirfakhrian, S.D. Approximation of Parametric Functions by Bicubic B-spline Functions. J. Am. Sci. 2013, 9. [Google Scholar]
Du, M.; Mei, T.; Liang, H.; Chen, J.; Huang, R.; Zhao, P. Drivers’ Visual Behavior-Guided RRT Motion Planner for Autonomous On-Road Driving. Sensors 2016, 16, 102. [Google Scholar] [CrossRef]
Elbanhawi, M.; Simic, M.; Jazar, R.N. Continuous Path Smoothing for Car-Like Robots Using B-Spline Curves. J. Intell. Robot. Syst. 2015, 80, 23–56. [Google Scholar] [CrossRef]
Shih, C.L.; Lin, L.C. Trajectory Planning and Tracking Control of a Differential-Drive Mobile Robot in a Picture Drawing Application. Robotics 2017, 6, 17. [Google Scholar] [CrossRef]
Liu, H.; Lai, X.; Wu, W. Time-optimal and jerk-continuous trajectory planning for robot manipulators with kinematic constraints. Robot. Comput.-Integr. Manuf. 2013, 29, 309–317. [Google Scholar] [CrossRef]
Zhao, D.; Guo, H. A Trajectory Planning Method for Polishing Optical Elements Based on a Non-Uniform Rational B-Spline Curve. Appl. Sci. 2018, 8, 1355. [Google Scholar] [CrossRef]
Kineri, Y.; Wang, M.; Lin, H.; Maekawa, T. B-spline surface fitting by iterative geometric interpolation/approximation algorithms. Comput.-Aided Des. 2012, 44, 697–708. [Google Scholar] [CrossRef]
Yunbao Huang, X.Q. Dynamic B-spline surface reconstruction: Closing the Sensing-and-modeling loop in 3D digitization. Comput.-Aided Des. 2007, 39, 987–1002. [Google Scholar] [CrossRef]
Monir, A.; Mraoui, H. Spline approximations of the Lambert W function and application to simulate generalized Gaussian noise with exponent α = 1/2. Digit. Signal Process. 2014, 33, 34–41. [Google Scholar] [CrossRef]
Rebollo-Neira, L.; Xu, Z. Sparse signal representation by adaptive non-uniform B-spline dictionaries on a compact interval. Signal Process. 2010, 90, 2308–2313. [Google Scholar] [CrossRef]
Reis, M.J.; Ferreira, P.J.; Soares, S.F. Linear combinations of B-splines as generating functions for signal approximation. Digit. Signal Process. 2005, 15, 226–236. [Google Scholar] [CrossRef]
Panda, R.; Chatterji, B. Least squares generalized B-spline signal and image processing. Signal Process. 2001, 81, 2005–2017. [Google Scholar] [CrossRef]
Roark, R.M.; Escabi, M.A. B-spline design of maximally flat and prolate spheroidal-type FIR filters. IEEE Trans. Signal Process. 1999, 47, 701–716. [Google Scholar] [CrossRef]
Izadian, J.; Farahbakhsh, N. Solving Nonlinear Least Squares Problems with B-Spline Functions. Appl. Math. Sci. 2012, 6, 1667–1676. [Google Scholar]
Dahmen, W.; Reusken, A. (Eds.) Numerik für Ingenieure und Naturwissenschaftler, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Ruhe, A.; Wedin, P.A. Algorithms for Separable Nonlinear Least Squares Problems. SIAM Rev. 1980, 22, 318–337. [Google Scholar] [CrossRef]
Haupt, G.T.; Kasdin, N.J.; Keiser, G.M.; Parkinson, B.W. Optimal recursive iterative algorithm for discrete nonlinear least-squares estimation. J. Guid. Control Dyn. 1996, 19, 643–649. [Google Scholar] [CrossRef]
Zhao, K.; Ling, F.; Lev-Ari, H.; Proakis, J.G. Sliding window order-recursive least-squares algorithms. IEEE Trans. Signal Process. 1994, 42, 1961–1972. [Google Scholar] [CrossRef]
Dias, F.M.; Antunes, A.; Vieira, J.; Mota, A. A sliding window solution for the on-line implementation of the Levenberg-Marquardt algorithm. Eng. Appl. Arti. Intell. 2006, 19, 1–7. [Google Scholar] [CrossRef]
Gao, Z.; Yan, W.; Hu, H.; Li, H. Human-centered headway control for adaptive cruise-controlled vehicles. Adv. Mech. Eng. 2015, 7. [Google Scholar] [CrossRef]
Radke, T. Energieoptimale Längsführung von Kraftfahrzeugen durch Einsatz vorausschauender Fahrstrategien. Ph.D. Thesis, KIT Scientific Publishing, Karlsruhe, Germany, 2013. [Google Scholar] [CrossRef]
Wahl, H.G. Optimale Regelung eines prädiktiven Energiemanagements von Hybridfahrzeugen. Ph.D. Thesis, KIT Scientific Publishing, Karlsruhe, Germany, 2015. [Google Scholar] [CrossRef]
Zhang, S.; Xiong, R. Adaptive energy management of a plug-in hybrid electric vehicle based on driving pattern recognition and dynamic programming. Appl. Energy 2015, 155, 68–78. [Google Scholar] [CrossRef]
Winner, H.; Hakuli, S.; Lotz, F.; Singer, C. Handbook of Driver Assistance Systems—Basic Information, Components and Systems for Active Safety and Comfort; Springer: Cham, Switzerland, 2016. [Google Scholar] [CrossRef]
Passenberg, B. Theory and Algorithms for Indirect Methods in Optimal Control of Hybrid Systems. Ph.D. Thesis, Technische Universität München, Munich, Germany, 2012. [Google Scholar]
Giron-Sierra, J.M. Kalman Filter, Particle Filter and Other Bayesian Filters. In Digital Signal Processing with Matlab Examples; Springer: Singapore, 2017; Volume 3, pp. 3–148. [Google Scholar] [CrossRef]
Wu, Z.; Li, J.; Zuo, J.; Li, S. Path Planning of UAVs Based on Collision Probability and Kalman Filter. IEEE Access 2018, 6, 34237–34245. [Google Scholar] [CrossRef]
Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef]
Cappe, O.; Godsill, S.J.; Moulines, E. An Overview of Existing Methods and Recent Advances in Sequential Monte Carlo. Proc. IEEE 2007, 95, 899–924. [Google Scholar] [CrossRef]
Alessandri, A.; Cuneo, M.; Pagnan, S.; Sanguineti, M. A recursive algorithm for nonlinear least-squares problems. Comput. Optim. Appl. 2007, 38, 195–216. [Google Scholar] [CrossRef]
Deng, F.; Chen, J.; Chen, C. Adaptive unscented Kalman filter for parameter and state estimation of nonlinear high-speed objects. J. Syst. Eng. Electron. 2013, 24, 655–665. [Google Scholar] [CrossRef]
Ha, X.V.; Ha, C.; Lee, J. Trajectory Estimation of a Tracked Mobile Robot Using the Sigma-Point Kalman Filter with an IMU and Optical Encoder. In Intelligent Computing Technology; Huang, D.S., Jiang, C., Bevilacqua, V., Figueroa, J.C., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; pp. 415–422. [Google Scholar]
Nieto, M.; Cortés, A.; Otaegui, O.; Arróspide, J.; Salgado, L. Real-time lane tracking using Rao-Blackwellized particle filter. J. Real-Time Image Process. 2016, 11, 179–191. [Google Scholar] [CrossRef]
Lee, Y. Optimization of Moving Objects Trajectory Using Particle Filter. In Intelligent Computing Theory; Huang, D.S., Bevilacqua, V., Premaratne, P., Eds.; Springer International Publishing: Cham, Switzerlan, 2014; pp. 55–60. [Google Scholar]
Xin, L.; Hailong, P.; Jianqiang, L. Trajectory prediction based on particle filter application in mobile robot system. In Proceedings of the 2008 27th Chinese Control Conference, Kunming, China, 16–18 July 2008; pp. 389–393. [Google Scholar] [CrossRef]
Zhou, F.; He, W.J.; Fan, X.Y. Marginalized Particle Filter for Maneuvering Target Tracking Application. In Advances in Grid and Pervasive Computing; Bellavista, P., Chang, R.S., Chao, H.C., Lin, S.F., Sloot, P.M.A., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 542–551. [Google Scholar]
Qian, K.; Ma, X.; Dai, X.; Fang, F. Improved Rao-Blackwellized particle filter for simultaneous robot localization and person-tracking with single mobile sensor. J. Control Theory Appl. 2011, 9, 472–478. [Google Scholar] [CrossRef]
Yatim, N.M.; Buniyamin, N. Development of Rao-Blackwellized Particle Filter (RBPF) SLAM Algorithm Using Low Proximity Infrared Sensors. In Proceedings of the 9th International Conference on Robotic, Vision, Signal Processing and Power Applications, Penang, Malaysia, 2–3 February 2017; Ibrahim, H., Iqbal, S., Teoh, S.S., Mustaffa, M.T., Eds.; Springer: Singapore, 2017; pp. 395–405. [Google Scholar]
Liu, J.; Wang, Z.; Xu, M. A Kalman Estimation Based Rao-Blackwellized Particle Filtering for Radar Tracking. IEEE Access 2017, 5, 8162–8174. [Google Scholar] [CrossRef]
Skoglar, P.; Orguner, U.; Tornqvist, D.; Gustafsson, F. Road target tracking with an approximative Rao-Blackwellized Particle Filter. In Proceedings of the 2009 12th International Conference on Information Fusion, Seattle, WA, USA, 6–9 July 2009; pp. 17–24. [Google Scholar]
Jauch, J.; Bleimund, F.; Rhode, S.; Gauterin, F. Recursive B-spline approximation using the Kalman filter. Eng. Sci. Technol. Int. J. 2017, 20, 28–34. [Google Scholar] [CrossRef]
Hendeby, G.; Karlsson, R.; Gustafsson, F. A New Formulation of the Rao-Blackwellized Particle Filter. In Proceedings of the 2007 IEEE/SP 14th Workshop on Statistical Signal Processing, Madison, WI, USA, 26–29 August 2007; pp. 84–88. [Google Scholar] [CrossRef]
Lyche, T.; Mørken, K. Spline Methods Draft; Department of Informatics, Centre of Mathematics for Applications, University of Oslo: Oslo, Norway, 2008. [Google Scholar]
Schön, T.; Gustafsson, F.; Nordlund, P.J. Marginalized particle filters for mixed linear/nonlinear state-space models. IEEE Trans. Signal Process. 2005, 53, 2279–2289. [Google Scholar] [CrossRef]
Schön, T. Rao-Blackwellized Particle Filter—MATLAB Code. 2011. Available online: http://user.it.uu.se/~thosc112/research/rao-blackwellized-particle.html (accessed on 16 January 2018).
Arasaratnam, I.; Haykin, S. Cubature Kalman Filters. IEEE Trans. Autom. Control 2009, 54, 1254–1269. [Google Scholar] [CrossRef]
Jauch, J. NRBA Matlab Files. 2019. Available online: http://github.com/JensJauch/nrba (accessed on 23 March 2019).
Marquardt, D.W. An Algorithm for Least-Squares Estimation of Nonlinear Parameters. J. Soc. Ind. Appl. Math. 1963, 11, 431–441. [Google Scholar] [CrossRef]
Hendeby, G.; Karlsson, R.; Gustafsson (EURASIPMember), F. The Rao-Blackwellized Particle Filter: A Filter Bank Implementation. EURASIP J. Adv. Signal Process. 2010, 2010, 724087. [Google Scholar] [CrossRef]
Bargende, M.; Reuss, H.; Wiedemann, J. 14. Internationales Stuttgarter Symposium: Automobil- und Motorentechnik; Springer: Wiesbaden, Germany, 2014. [Google Scholar]
Bender, S.; Chodura, H.; Groß, M.; Kühn, T.; Watteroth, V. e-generation—Ein Forschungsprojekt mit positiver Bilanz. Porsche Eng. Mag. 2015, 2, 22–27. [Google Scholar]
Zimmer, M. Durchgängiger Simulationsprozess zur Effizienzsteigerung und Reifegraderhöhung von Konzeptbewertungen in der Frühen Phase der Produktentstehung; Wissenschaftliche Reihe Fahrzeugtechnik Universität Stuttgart, Springer: Wiesbaden, Germany, 2015. [Google Scholar]
Vaillant, M. Design Space Exploration zur multikriteriellen Optimierung elektrischer Sportwagenantriebsstränge. Ph.D. Thesis, KIT Scientific Publishing, Karlsruhe, Germany, 2016. [Google Scholar] [CrossRef]
Zhao, Z.S.; Feng, X.; Lin, Y.y.; Wei, F.; Wang, S.K.; Xiao, T.L.; Cao, M.Y.; Hou, Z.G.; Tan, M. Improved Rao-Blackwellized Particle Filter by Particle Swarm Optimization. J. Appl. Math. 2013, 2013, 302170. [Google Scholar] [CrossRef]
Yin, J.; Zhang, J.; Mike, K. The Marginal Rao-Blackwellized Particle Filter for Mixed Linear/Nonlinear State Space Models. Chin. J. Aeron. 2007, 20, 346–352. [Google Scholar] [CrossRef]

Figure 1. The allocation of the available data points and computed estimates

\hat{x}

to KF iterations in RBA versus MPF iterations in NRBA: The arrows indicate the needed information for computing the estimates. By definition, we use

(s_{1}, y_{1})

for computing

{\hat{x}}_{0}^{+}

and

s_{n}

for

{\hat{x}}_{n + 1}^{-}

as indicated by the dashed arrows.

Figure 1. The allocation of the available data points and computed estimates

\hat{x}

to KF iterations in RBA versus MPF iterations in NRBA: The arrows indicate the needed information for computing the estimates. By definition, we use

(s_{1}, y_{1})

for computing

{\hat{x}}_{0}^{+}

and

s_{n}

for

{\hat{x}}_{n + 1}^{-}

as indicated by the dashed arrows.

Figure 2. The nonlinear measurement function

c (f (s))

that depends on the value of the B-spline function

f (s)

that approximates the data. c is itself a B-spline function.

Figure 2. The nonlinear measurement function

c (f (s))

that depends on the value of the B-spline function

f (s)

that approximates the data. c is itself a B-spline function.

Figure 3. Approximating the B-spline function f determined by NRBA with a number of spline intervals

I = 1

and particle count

P = 6561 = 9^{4}

in comparison to the LM solution:

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the algorithms for the quasi-linear problem whereas

{NRBA}^{N}

and

{LM}^{N}

refer to solutions for the nonlinear problem.

{NRMSE}^{med}

and

{NRMSE}^{max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrow indicates the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 3. Approximating the B-spline function f determined by NRBA with a number of spline intervals

I = 1

and particle count

P = 6561 = 9^{4}

in comparison to the LM solution:

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the algorithms for the quasi-linear problem whereas

{NRBA}^{N}

and

{LM}^{N}

refer to solutions for the nonlinear problem.

{NRMSE}^{med}

and

{NRMSE}^{max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrow indicates the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 4. Approximating B-spline function, f is determined by NRBA for various numbers of spline intervals

I

and various particle counts

P

in comparison to the LM solution.

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the corresponding algorithm for the quasi-linear approximation problem.

{NRMSE}^{med}

and

{NRMSE}^{\max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrows indicate the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 4. Approximating B-spline function, f is determined by NRBA for various numbers of spline intervals

I

and various particle counts

P

in comparison to the LM solution.

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the corresponding algorithm for the quasi-linear approximation problem.

{NRMSE}^{med}

and

{NRMSE}^{\max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrows indicate the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 5. Approximating B-spline function, f is determined by NRBA for various numbers of spline intervals

I

and various particle counts

P

in comparison to the LM solution.

{NRBA}^{N}

and

{LM}^{N}

denote solutions of the corresponding algorithm for the nonlinear approximation problem.

{NRMSE}^{med}

and

{NRMSE}^{\max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrows indicate the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 5. Approximating B-spline function, f is determined by NRBA for various numbers of spline intervals

I

and various particle counts

P

in comparison to the LM solution.

{NRBA}^{N}

and

{LM}^{N}

denote solutions of the corresponding algorithm for the nonlinear approximation problem.

{NRMSE}^{med}

and

{NRMSE}^{\max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrows indicate the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 6. Approximating B-spline function, f is determined by NRBA with the number of spline intervals

I = 3

and particle count

P = 9^{4} = 6561

in comparison to the LM solution.

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the algorithms for the quasi-linear problem whereas

{NRBA}^{N}

and

{LM}^{N}

refer to solutions for the nonlinear problem.

{NRMSE}^{med}

and

{NRMSE}^{max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrow indicates the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 6. Approximating B-spline function, f is determined by NRBA with the number of spline intervals

I = 3

and particle count

P = 9^{4} = 6561

in comparison to the LM solution.

{NRBA}^{L}

and

{LM}^{L}

denote solutions of the algorithms for the quasi-linear problem whereas

{NRBA}^{N}

and

{LM}^{N}

refer to solutions for the nonlinear problem.

{NRMSE}^{med}

and

{NRMSE}^{max}

denote the NRBA solution with the median or maximum normalized root mean square error (NRMSE) compared to the LM solution with the same weighting. Forty of the 400 data points

(s_{t}, y_{t, 1})

and the knots

κ = 0, 5, \dots, 200

are shown. The arrow indicates the range in which NRBA can adapt

f (s)

, while the data in the interval

[190, 200)

is processed.

Figure 7. The convergence of NRBA: Normalized root mean square error (NRMSE) of NRBA versus the particle count

P

.

{NRMSE}^{\min}

,

{NRMSE}^{med}

, and

{NRMSE}^{\max}

denote the nonlinear recursive B-spline approximation (NRBA) solution with the minimum, median, or maximum NRMSE compared to the LM solution in the Monte Carlo analysis. L and N denote the quasi-linear and nonlinear weighting and I, the number of spline intervals.

Figure 7. The convergence of NRBA: Normalized root mean square error (NRMSE) of NRBA versus the particle count

P

.

{NRMSE}^{\min}

,

{NRMSE}^{med}

, and

{NRMSE}^{\max}

denote the nonlinear recursive B-spline approximation (NRBA) solution with the minimum, median, or maximum NRMSE compared to the LM solution in the Monte Carlo analysis. L and N denote the quasi-linear and nonlinear weighting and I, the number of spline intervals.

Figure 8. First diagram: Velocity v versus time

τ

according to the velocity set points

v_{Set}

of the reference and three trajectories

{NRBA}^{1}

,

{NRBA}^{2}

, and

{NRBA}^{3}

optimized by NRBA that differ in the variance of power measurement. Second diagram: Estimated electric traction power

{\hat{P}}_{elec}

according to mathematical model. Third diagram: Traction power loss

P_{loss}

. Fourth diagram: Traction energy E.

Figure 8. First diagram: Velocity v versus time

τ

according to the velocity set points

v_{Set}

of the reference and three trajectories

{NRBA}^{1}

,

{NRBA}^{2}

, and

{NRBA}^{3}

optimized by NRBA that differ in the variance of power measurement. Second diagram: Estimated electric traction power

{\hat{P}}_{elec}

according to mathematical model. Third diagram: Traction power loss

P_{loss}

. Fourth diagram: Traction energy E.

Table 1. The mean and standard deviation of error vector of function values between NRBA and LM over all 50 Monte Carlo runs with a quasi-linear approximation problem L, nonlinear approximation problem N, and number of spline intervals

I

.

Table 1. The mean and standard deviation of error vector of function values between NRBA and LM over all 50 Monte Carlo runs with a quasi-linear approximation problem L, nonlinear approximation problem N, and number of spline intervals

I

.

	Mean of Error Vector				Standard Deviation of Error Vector
Particle Count	L, I = 1	L, I = 3	N, I = 1	N, I = 3	L, I = 1	L, I = 3	N, I = 1	N, I = 3
256	0.0041	−0.0088	−0.2268	−0.5820	0.8738	0.9143	0.6225	1.2525
625	0.0150	0.0072	−0.0979	−0.4386	0.7692	0.8224	0.4030	1.0902
729	−0.0064	−0.0176	−0.0930	−0.4350	0.8231	0.7988	0.3904	1.0975
1296	−0.0028	−0.0156	−0.0611	−0.2248	0.7294	0.7365	0.3361	0.6988
2401	0.0005	−0.0009	−0.0445	−0.1851	0.6480	0.6851	0.2965	0.6454
4096	0.0014	0.0050	−0.0296	−0.2189	0.6436	0.6538	0.2583	0.7106
6561	0.0011	−0.0069	−0.0334	−0.1340	0.5930	0.6084	0.2498	0.5673
15,625	0.0056	−0.0015	−0.0204	−0.0512	0.5502	0.5715	0.2216	0.3124

Table 2. The mean and standard deviation of error vector of coefficient values between NRBA and LM over all 50 Monte Carlo runs with a quasi-linear approximation problem L, a nonlinear approximation problem N, and number of spline intervals

I

.

Table 2. The mean and standard deviation of error vector of coefficient values between NRBA and LM over all 50 Monte Carlo runs with a quasi-linear approximation problem L, a nonlinear approximation problem N, and number of spline intervals

I

.

	Mean of Error Vector				Standard Deviation of Error Vector
Particle Count	L, I = 1	L, I = 3	N, I = 1	N, I = 3	L, I = 1	L, I = 3	N, I = 1	N, I = 3
256	0.0041	−0.0098	−0.3036	−0.5902	1.2019	1.3057	1.2664	1.7251
625	0.0133	0.0044	−0.1640	−0.4562	1.0725	1.1953	0.8450	1.4869
729	−0.0040	−0.0055	−0.1507	−0.4742	1.1971	1.1239	0.8092	1.5128
1296	−0.0024	−0.0171	−0.1212	−0.2695	1.0045	1.0828	0.7122	1.1541
2401	0.0005	0.0001	−0.1099	−0.2271	0.9309	0.9916	0.6654	1.0487
4096	0.0019	0.0072	−0.0725	−0.2511	0.9334	0.9402	0.5621	1.0439
6561	0.0013	−0.0053	−0.0738	−0.1889	0.8614	0.9000	0.5201	0.9672
15,625	0.0054	−0.0008	−0.0506	−0.1064	0.8211	0.8329	0.4605	0.6524

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jauch, J.; Bleimund, F.; Frey, M.; Gauterin, F. An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization. Mathematics 2019, 7, 355. https://doi.org/10.3390/math7040355

AMA Style

Jauch J, Bleimund F, Frey M, Gauterin F. An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization. Mathematics. 2019; 7(4):355. https://doi.org/10.3390/math7040355

Chicago/Turabian Style

Jauch, Jens, Felix Bleimund, Michael Frey, and Frank Gauterin. 2019. "An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization" Mathematics 7, no. 4: 355. https://doi.org/10.3390/math7040355

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Iterative Method Based on the Marginalized Particle Filter for Nonlinear B-Spline Data Approximation and Trajectory Optimization

Abstract

1. Introduction

1.1. Nonlinear Weighted Least Squares Data Approximation

1.2. Trajectory Optimization

1.3. Bayesian Filters

1.4. Contribution

1.5. Structure of the Data Set

1.6. Outline

2. Methods

2.1. B-Spline Function Representation

2.2. Marginalized Particle Filter

2.3. Nonlinear Recursive B-Spline Approximation

2.3.1. Initialization

2.3.2. Measurement Update

2.3.3. Time Update with Shift Operation

2.3.4. Effect of the Shift Operation

3. Numerical Experiments

3.1. General Experimental Setup

3.2. Effect of Weighting and Nonlinear Measurement Function

3.3. Effect of Interval Count

3.4. Effect of Particle Count on Convergence

3.5. Mean and Standard Deviation of NRBA Error

4. Trajectory Optimization

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI