Handling Measurement Delay in Iterative Real-Time Optimization Methods

Gottu Mukkula, Anwesh Reddy; Engell, Sebastian

doi:10.3390/pr9101800


by Anwesh Reddy Gottu Mukkula * and Sebastian Engell

Process Dynamics and Operations Group, Technische Universität Dortmund, 44221 Dortmund, Germany

* Author to whom correspondence should be addressed.

Processes 2021, 9(10), 1800; https://doi.org/10.3390/pr9101800

Submission received: 9 August 2021 / Revised: 28 September 2021 / Accepted: 29 September 2021 / Published: 11 October 2021

(This article belongs to the Section Process Control and Monitoring)


Abstract

This paper is concerned with the real-time optimization (RTO) of chemical plants, i.e., the optimization of the steady-state operating points during operation, based on inaccurate models. Specifically, modifier adaptation is employed to cope with the plant-model mismatch, which corrects the plant model and the constraint functions by bias and gradient correction terms that are computed from measured variables at the steady-states of the plant. This implies that the sampling time of the iterative RTO scheme is lower-bounded by the time to reach a new steady-state after the previously computed inputs were applied. If analytical process measurements (PAT technology) are used to obtain the steady-state responses, time delays occur due to the measurement delay of the PAT device and due to the transportation delay if the samples are transported to the instrument via pipes. This situation is quite common because the PAT devices can often only be installed at a certain distance from the measurement location. The presence of these time delays slows down the iterative real-time optimization, as the time from the application of a new set of inputs to receiving the steady-state information increases further. In this paper, a proactive perturbation scheme is proposed to efficiently utilize the idle time by intelligently scheduling the process inputs taking into account the time delays to obtain the steady-state process measurements. The performance of the proposed proactive perturbation scheme is demonstrated for two examples, the Williams–Otto reactor benchmark and a lithiation process. The simulation results show that the proposed proactive perturbation scheme can speed up the convergence to the true plant optimum significantly.

Keywords:

iterative optimization; real-time optimization; modifier adaptation; plant-model mismatch; time delay; active perturbation strategy; quadratic approximation; gradient estimation; guaranteed model adequacy

1. Introduction

There is a strong interest in the process industries to identify the best steady-state operating conditions of their processes such that they are operated at their economic optimum while meeting constraints on product qualities, emissions, and equipment capabilities. Optimal operating conditions of a process can be identified by formulating and solving an optimization problem where a cost (or profit) function is minimized (or maximized) respecting all process limitations on the basis of a mathematical model of the plant. As the parameters of the optimization problem (e.g., prices for raw materials, energy, and products) and the behavior of the plant vary over time, such optimization is performed repeatedly during plant operation, which is called RTO [1,2] (real-time optimization). At each call of the optimizer, the problem at hand is a constrained nonlinear program (NLP) which usually is solved by gradient-based algorithms, e.g., SQP solvers [3,4] or interior point algorithms [5,6]. These solvers compute local optima, which is sufficient under the tacit assumption that only minor adaptations are made and the problem is convex in the region of interest. As far as the model-based optimization is concerned, in principle also other optimization methods can be applied, as e.g., derivative free methods [7,8], population-based methods [9], or nature-inspired algorithms [10]. While for the solution of the optimization problems efficient algorithms are available, a major practical problem is the mismatch between the model and the true behavior of the plant. No matter how the optimum is computed, if the model and the behavior of the plant do not match, the computed optimum won’t be the true optimum for the plant and the solution may even violate the constraints of the plant. 
Of course, the model could be discarded completely and a brute-force optimization could be performed using any derivative-free or meta-heuristic algorithm, but this implies performing a large number of exploratory moves with the real plant, which is highly undesirable. In particular, large constraint violations of internal variables must be avoided, which is impossible if the optimization is completely model-free. Therefore, a combination of model-based and data-based optimization is advantageous. The classical approach to this problem is to add a parameter estimation element, which is called the two-step approach [11].

In the two-step approach, some model parameters are updated repeatedly using least-squares estimation, and the optimization problem with the updated model parameters is solved to identify the process optimum. The performance of the two-step approach is discussed in detail in Chen and Joseph [12]. The two-step approach is designed to handle parametric plant-model mismatch, which means that the model can, with the correct parameter values, represent the true plant exactly. However, there is always also structural plant-model mismatch, i.e., even with optimized parameters, the predictions of the model are inaccurate. This issue was addressed in the “Integrated system optimization and parameter estimation” (ISOPE) approach proposed in Roberts [13]. In the ISOPE scheme, in addition to updating the model parameters in each iteration, the objective function of the model-based optimization problem is modified iteratively by adding bias and gradient correction terms so that the solution converges iteratively to the optimum of the real plant. These correction terms (also called modifiers) are estimated from the available measurements.

Later, Tatjewski [14] showed that the model parameters do not have to be updated in each iteration to converge to a process optimum. Based on this insight, Tatjewski [14] simplified the ISOPE approach and proposed the redesigned-ISOPE scheme, which omits the parameter-update step of the ISOPE approach. The redesigned-ISOPE scheme was extended in Gao and Engell [15] to handle process-dependent constraints by adding bias and gradient correction terms also to the constraint functions, and was termed iterative gradient modification optimization (IGMO). The IGMO scheme in Gao and Engell [15] was analyzed in detail in Marchetti et al. [16], where it was given the name modifier adaptation (MA).

The efficiency of MA-based iterative RTO methods depends on the accuracy of the gradient modifiers, which contain the gradients of the response of the cost function and of the constraints to the inputs. As the plant gradients usually cannot be measured directly, they have to be computed from the available measurements. The traditional way of doing this is to use finite differences, preferably employing the past input moves as in Gao and Engell [15], where this was enhanced with probing moves if the gradient estimation problem becomes ill-conditioned. However, in finite-difference-based schemes, a trade-off between the approximation error and the effect of measurement noise has to be found. Therefore, Gao et al. [17] proposed to use quadratic approximation (QA) for the approximation of the plant gradients, which is more robust to measurement noise than finite differences (FD) and Broyden’s formula [18]. The resulting modifier adaptation with quadratic approximation (MAWQA) scheme combines elements of MA, QA, and derivative-free optimization (DFO) [7].

For all adaptation-based iterative RTO methods, it is a prerequisite that the adapted process model is adequate at the—unknown—optimum of the real plant [1,16,19,20,21]. A model is considered adequate if the Lagrangian of the model-based optimization problem satisfies the necessary second-order conditions of optimality at the optimum of the real process. In Faulwasser and Bonvin [22], it was proposed to use second-order correction terms (modifiers) in the objective and constraint functions to match the Hessian of the nominal model to that of the plant. However, it is difficult to estimate the plant Hessian [23] from real data. Model adequacy was addressed in François and Bonvin [24] by using convex model approximation of the objective and constraint functions. This may however slow down the rate of convergence to the process optimum. Ahmad et al. [25] addressed model adequacy by updating some model parameters only if the model adequacy conditions are satisfied at the current operating input. In Gottu Mukkula and Engell [26], a guaranteed model adequacy (GMA) scheme for MAWQA was proposed where model adequacy is ensured by strictly convex quadratic approximations of the objective and constraint functions if the goal of the optimization is to minimize an objective function. The MAWQA with GMA scheme may converge to the process optimum in fewer iterations than MAWQA [26].

No matter how the optimization model is corrected, by parameter adaptation, by modifier adaptation, or by a combination of both [23], the resulting model-based optimization problem can be solved by any algorithm. In our work, we use gradient-based algorithms, as the cost functions and the constraints are smooth, no discrete variables are involved, and guaranteed (local) optimality is important.

Modifier adaptation-based iterative RTO methods require steady-state process measurements to compute a new input, which imposes a lower limit on the time between iterates. This can be alleviated by the use of transient process measurements [27,28,29,30,31], but this is outside the scope of this paper. In addition to the time required to reach a steady-state, the convergence to a process optimum may also be slowed down due to delays in obtaining measurement information. Considerable measurement delays may occur due to the time required for the measurement device (e.g., gas chromatography) to analyze the sample and due to the remote positioning of a measurement device. Figure 1 illustrates a situation where the time delay is caused by the remote positioning of the measurement device for safety reasons, e.g., due to harsh conditions at the plant. Measurement devices like NMR have to be placed in a certified (e.g., ATEX, IECEx) enclosure located at a distance from the process equipment. The thin, long tubing that carries the sample from the plant to the remotely positioned measurement device can cause a significant delay, and this is quite common in the process industries.

Gottu Mukkula et al. [32,33] proposed two schemes where additional plant perturbations are performed during the waiting period. The purpose was to gain additional information about the process instead of remaining idle until the effect of a new operating input had propagated through the measurement device setup. Here, we propose a strategy in the context of MA that proactively schedules the input moves and thereby significantly reduces the effect of the time delay caused by the remote positioning of a measurement device. The input perturbations in the proposed method are computed by solving an optimization problem using the latest measurement information.

The rest of the paper is organized as follows. Firstly, the MA scheme used in this paper is presented in detail. Then, the previously proposed approaches to handle the time delay due to the positioning of a measurement device are presented. Thereafter, the new proactive perturbation scheme to effectively handle the time delay is described. Finally, the performance of the proposed scheme is analyzed using the Williams–Otto reactor benchmark and a lithiation reaction case study.

2. Preliminaries

2.1. Model

Consider the steady-state mathematical models

y = F_{p} (u)

and

\hat{y} = F_{m} (u)

as the exact representation of a process and as the nominal model that was built to represent the underlying process. The mapping functions

F_{p} : R^{n_{u}} \to R^{n_{y}}

and

F_{m} : R^{n_{u}} \to R^{n_{y}}

map the

n_{u}

-dimensional vector of manipulated variables

u

to the

n_{y}

-dimensional vector of measured variables

y

and

\hat{y}

. Let

J : R^{n_{u}} \times R^{n_{y}} \to R

represent the objective function, which is continuous and twice differentiable with respect to

u

, that should be minimized, and let

G : R^{n_{u}} \times R^{n_{y}} \to R^{n_{c}}

be the

n_{c}

-dimensional vector of constraint functions, continuous and twice differentiable with respect to

u

, which may include process, safety, and quality restrictions.

An optimal input of the process lying within

u^{L}

and

u^{U}

can be obtained theoretically by solving

u_{p}^{*} = \arg \min_{u \in [u^{L}, u^{U}]} J (y, u)

(1a)

\text{s.t.} \quad y = F_{p} (u),

(1b)

G (y, u) \leq 0,

(1c)

but the mapping function (1b) is not available. The optimum

u_{m}^{*}

results from solving the optimization problem using the nominal model:

u_{m}^{*} = \arg \min_{u \in [u^{L}, u^{U}]} J (\hat{y}, u)

(2a)

\text{s.t.} \quad \hat{y} = F_{m} (u),

(2b)

G (\hat{y}, u) \leq 0,

(2c)

and may differ from

u_{p}^{*}

considerably if

F_{m} \neq F_{p}

. To simplify the notation, we from here on replace

J (y, u), G (y, u)

by

J_{p} (u), G_{p} (u)

and

J (\hat{y}, u), G (\hat{y}, u)

by

J_{m} (u), G_{m} (u)

.

2.2. Modifier Adaptation

Modifier adaptation handles plant-model mismatch by adding and iteratively updating a gradient correction term to the objective function, and by adding and iteratively updating bias and gradient correction terms to the constraint functions of the model-based optimization problem (2), as described in Gao and Engell [15] and Marchetti et al. [16]. In the kth iteration, the modified objective and constraint functions of the model-based optimization problem, i.e., the MA problem, are

J_{m}^{ad, k} (u) := J_{m} (u) + {(\nabla J_{p}^{k} - \nabla J_{m}^{k})}^{T} (u - u^{k}),

(3a)

G_{m}^{ad, k} (u) := G_{m} (u) + (G_{p}^{k} - G_{m}^{k}) + {(\nabla G_{p}^{k} - \nabla G_{m}^{k})}^{T} (u - u^{k}),

(3b)

where

\nabla J_{p}^{k}, \nabla J_{m}^{k}

are the gradients of the plant objective function

J_{p} (u)

and of the nominal model objective function

J_{m} (u)

with respect to the process input for the kth iteration. Similarly,

\nabla G_{p}^{k}

,

\nabla G_{m}^{k}

represent the gradients of the plant constraint functions

G_{p} (u)

and the constraint functions of the nominal model

G_{m} (u)

with respect to the process input for the kth iteration. The modified optimization problem that is solved in the kth iteration is:

{\hat{u}}^{k + 1} = \arg \min_{u \in [u^{L}, u^{U}]} J_{m}^{ad, k} (u)

(4a)

\text{s.t.} \quad G_{m}^{ad, k} (u) \leq 0,

(4b)

where the determined optimum

({\hat{u}}^{k + 1})

is the

k + 1

th input which is applied to the plant.
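The construction of the modified functions in (3a) and (3b) can be sketched in a few lines of Python. This is only an illustrative sketch: the helper names `modified_objective` and `modified_constraint`, the scalar toy functions, and the numerical values are assumptions made for the example and are not taken from the paper. The constraint helper handles a single scalar constraint.

```python
from typing import Callable, Sequence

Vector = Sequence[float]


def modified_objective(J_m: Callable[[Vector], float],
                       grad_Jp_k: Vector, grad_Jm_k: Vector,
                       u_k: Vector) -> Callable[[Vector], float]:
    """Build J_m^{ad,k}(u) = J_m(u) + (grad J_p^k - grad J_m^k)^T (u - u^k), Eq. (3a)."""
    lam = [gp - gm for gp, gm in zip(grad_Jp_k, grad_Jm_k)]  # gradient modifier

    def J_ad(u: Vector) -> float:
        return J_m(u) + sum(l * (ui - uk) for l, ui, uk in zip(lam, u, u_k))

    return J_ad


def modified_constraint(G_m: Callable[[Vector], float],
                        G_p_k: float, G_m_k: float,
                        grad_Gp_k: Vector, grad_Gm_k: Vector,
                        u_k: Vector) -> Callable[[Vector], float]:
    """Build Eq. (3b) for one scalar constraint: bias plus gradient correction."""
    bias = G_p_k - G_m_k
    lam = [gp - gm for gp, gm in zip(grad_Gp_k, grad_Gm_k)]

    def G_ad(u: Vector) -> float:
        return G_m(u) + bias + sum(l * (ui - uk) for l, ui, uk in zip(lam, u, u_k))

    return G_ad


# Toy scalar example (hypothetical functions, not one of the case studies).
J_m = lambda u: (u[0] - 1.0) ** 2      # nominal model objective
G_m = lambda u: u[0] - 1.0             # nominal model constraint
u_k = [0.0]
# Assumed plant information at u_k: grad J_p = -4, G_p = -1.5, grad G_p = 1.
J_ad = modified_objective(J_m, grad_Jp_k=[-4.0], grad_Jm_k=[-2.0], u_k=u_k)
G_ad = modified_constraint(G_m, G_p_k=-1.5, G_m_k=-1.0,
                           grad_Gp_k=[1.0], grad_Gm_k=[1.0], u_k=u_k)
```

At the current input, the modified objective has the assumed plant gradient and the modified constraint takes the assumed plant value, which is the first-order matching property that the correction terms are designed to provide. The resulting functions would then be passed to any NLP solver to solve (4).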

The gradients of the plant objective function (

\nabla J_{p}^{k}

) and of the plant constraint functions (

\nabla G_{p}^{k}

) can be obtained, for example, using finite differences, for which additional plant perturbations are required. Methods to approximate the plant gradients using the past ℓ inputs and steady-state measurements, without the need for additional plant perturbations, are proposed in Roberts [18], Brdyś and Tatjewski [34], and Gao et al. [35].

In MAWQA, the gradients of the plant objective function (

\nabla J_{p}^{k}

) and of the plant constraint functions (

\nabla G_{p}^{k}

) are obtained using surrogate QA models as long as the cardinality condition, i.e., the minimum number of data points required to identify all the parameters of the QA function, is fulfilled. A minimum of

ℓ : = \frac{(n_{u} + 1) (n_{u} + 2)}{2}

data points, i.e., process inputs and their corresponding steady-state process measurements, are required to fit a surrogate QA function (Q) for the plant objective function and for each of the plant constraint functions. Upon satisfying the cardinality condition,

\nabla J_{p}^{k}

and

\nabla G_{p}^{k}

are then obtained by evaluating the analytical derivatives of the surrogate QA models at

u^{k}

. A surrogate QA model can be defined as

Q (p, u) = \sum_{i = 1}^{n_{u}} \sum_{j = 1}^{i} a_{i, j} u_{i} u_{j} + \sum_{i = 1}^{n_{u}} b_{i} u_{i} + c,

(5)

where,

p : = {a_{1, 1}, \dots, a_{n_{u}, n_{u}}, b_{1}, \dots, b_{n_{u}}, c}

are the parameters of the surrogate QA model.
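As a small illustration of the cardinality condition and of the surrogate model (5), the following Python sketch computes ℓ and evaluates Q together with its analytical gradient. The function names and the lower-triangular storage of the coefficients a_{i,j} are choices made for this example only. Fitting the parameters p to data (e.g., by least squares) is not shown.

```python
from typing import List, Sequence


def min_data_points(n_u: int) -> int:
    """Cardinality condition: l = (n_u + 1)(n_u + 2) / 2 data points."""
    return (n_u + 1) * (n_u + 2) // 2


def qa_value(a: Sequence[Sequence[float]], b: Sequence[float],
             c: float, u: Sequence[float]) -> float:
    """Evaluate Q(p, u) of Eq. (5); only a[i][j] with j <= i is used."""
    n = len(u)
    quad = sum(a[i][j] * u[i] * u[j] for i in range(n) for j in range(i + 1))
    lin = sum(b[i] * u[i] for i in range(n))
    return quad + lin + c


def qa_gradient(a: Sequence[Sequence[float]], b: Sequence[float],
                u: Sequence[float]) -> List[float]:
    """Analytical gradient of Q with respect to u."""
    n = len(u)
    grad = []
    for m in range(n):
        g = b[m]
        for i in range(n):
            for j in range(i + 1):
                if i == m and j == m:
                    g += 2.0 * a[i][j] * u[m]   # derivative of a_mm * u_m^2
                elif i == m:
                    g += a[i][j] * u[j]         # derivative of a_mj * u_m * u_j
                elif j == m:
                    g += a[i][j] * u[i]         # derivative of a_im * u_i * u_m
        grad.append(g)
    return grad
```

For two inputs, `min_data_points(2)` gives 6, i.e., six steady-state data points are needed before the quadratic surrogate can be identified.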

To obtain a surrogate QA model for each of the objective and constraint functions, a regression set of at least ℓ data points (

U^{k}

) is selected from the set of all data points available until the kth iteration. Wenzel et al. [36] reported a comparative study of existing methods to screen the regression set

U^{k}

from all available data points. In general, the methods for screening

U^{k}

attempt to include distant data points which are well distributed around

u^{k}

(

U_{d i s t}^{k}

) and can act as anchor points for the QA. They also include all neighboring data points (

U_{n b}^{k}

), which lie within the inner-circle around

u^{k}

. The inner-circle is defined as an n-dimensional sphere around

u^{k}

with the tuning parameter Δu as its radius. As proposed in Gao et al. [17], the quality of the distribution of data points in

U_{d i s t}^{k}

can be evaluated using a so-called conditionality-check criterion: the inverse of the condition number of

s^{k}

(

κ^{- 1} (s^{k})

) has to be greater than a desired value (

δ

).

s^{k}

in the conditionality-check criterion is defined as

s^{k} = {[u^{k}]}^{n_{u} \times 1} \otimes 1^{cardinality (U_{d i s t}^{k}) \times 1} - [U_{d i s t}^{k}],

(6)

where

[U_{d i s t}^{k}]

and

[u^{k}]

are matrix representations of the set

U_{d i s t}^{k}

and of

u^{k}

. If the conditionality-check criterion fails, additional plant perturbations are performed and added to

U^{k}

to improve the distribution of the anchor points used for fitting the QA-function.

As the data points in

U^{k}

consist of input variables positioned locally around

u^{k}

, the QA functions fitted using

U^{k}

are also local approximations of

J_{p}

and

G_{p}

. In MAWQA, this is taken into account by restricting the process input for the next iteration (

{\hat{u}}^{k + 1}

) obtained by solving the optimization (4) to lie inside an ellipsoidal trust region by adding the following constraint to the modifier adaptation problem in (4)

{(u - u^{k})}^{T} cov (U^{k}) (u - u^{k}) \leq γ^{2},

(7)

where

γ

is a tuning parameter that scales the trust region [17]. In addition, there are also elements of DFO [7] that include the criticality-check, the quality-check and a possible switch to an optimization based on the quadratic approximation model. We refer the readers to Gao et al. [17,37] for a detailed discussion on MAWQA.
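A candidate input can be tested against the trust-region constraint (7) with a simple quadratic-form evaluation. The sketch below assumes that the matrix appearing in (7) has already been computed from the regression set and is passed in as `M`; the function name and the example numbers are illustrative only.

```python
from typing import Sequence


def inside_trust_region(u: Sequence[float], u_k: Sequence[float],
                        M: Sequence[Sequence[float]], gamma: float) -> bool:
    """Check (u - u^k)^T M (u - u^k) <= gamma^2, with M the matrix used in Eq. (7)."""
    d = [a - b for a, b in zip(u, u_k)]
    n = len(d)
    quad = sum(d[i] * M[i][j] * d[j] for i in range(n) for j in range(n))
    return quad <= gamma ** 2


# Hypothetical numbers: the quadratic form evaluates to 2.0 here.
M = [[1.0, 0.0], [0.0, 4.0]]
ok_large = inside_trust_region([1.0, 0.5], [0.0, 0.0], M, gamma=1.5)
ok_small = inside_trust_region([1.0, 0.5], [0.0, 0.0], M, gamma=1.0)
```

In an actual implementation this inequality would be handed to the NLP solver as an additional constraint of problem (4) rather than checked after the fact.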

Convergence to the plant optimum is assumed when the evaluation function

C

defined as

C_{i} \overset{def}{=} \sum_{j = 1}^{n_{u}} (| u_{j}^{i} - u_{j}^{i - 1} | | \nabla J_{p, j}^{i} |)

is less than

| ε J_{p, b e s t}^{k} | \forall i = {k + 1 - N_{c}, \dots, k}

where

\nabla J_{p, j}^{i}

is the gradient of the plant objective function with respect to the jth input variable at the ith input

(u^{i})

[38]. The tuning parameter

N_{c}

is the number of iterations for which the condition

C \leq | ε J_{p, b e s t}^{k} |

has to be satisfied continuously.

J_{p, b e s t}^{k}

is the best value of the objective function realized by the plant up to the kth-iteration and

ε

is a tuning parameter that must be chosen taking into account the measurement noise and the required tolerance of the value of the optimum.
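The convergence test can be written compactly as follows. This is a sketch under the assumption that the inputs and the estimated plant gradients of all iterations 0, ..., k are stored in lists; the function names are hypothetical, and the non-strict inequality follows the condition stated for N_c.

```python
from typing import Sequence


def evaluation_function(u_i: Sequence[float], u_prev: Sequence[float],
                        grad_Jp_i: Sequence[float]) -> float:
    """C_i = sum_j |u_j^i - u_j^(i-1)| * |grad J_{p,j}^i|."""
    return sum(abs(a - b) * abs(g) for a, b, g in zip(u_i, u_prev, grad_Jp_i))


def has_converged(inputs: Sequence[Sequence[float]],
                  gradients: Sequence[Sequence[float]],
                  J_best: float, eps: float, N_c: int) -> bool:
    """True if C_i <= |eps * J_best| for the last N_c iterations i = k+1-N_c, ..., k."""
    k = len(inputs) - 1
    if k < N_c:
        return False  # not enough iterations to evaluate N_c consecutive differences
    threshold = abs(eps * J_best)
    return all(
        evaluation_function(inputs[i], inputs[i - 1], gradients[i]) <= threshold
        for i in range(k + 1 - N_c, k + 1)
    )
```

With noisy measurements, a larger `eps` or a larger `N_c` makes the test less sensitive to a single small step that happens by chance.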

3. Active Perturbation Strategies

Figure 1 illustrates a situation that occurs typically in the process industries while implementing MA based RTO methods to optimize their processes. Consider a process that reaches a steady-state for constant inputs within

τ_{p}

time units and the measurement device that requires a maximum of

τ_{m}

time units to analyze a sample. An additional

τ_{d}

time units is required for the sample to be transported to the measurement device making the total time to obtain steady-state measurements after a change of the inputs

τ_{t} : = τ_{p} + τ_{d} + τ_{m}

. If the measurement device is located at a remote position, this leads to a waiting period.

Figure 2 illustrates the waiting period in an iteration of a MA algorithm. If an input

u^{k + 1}

is given to the plant at time t, its corresponding steady-state plant measurement becomes available at time

t + τ_{t}

. The time between iterations in MA is thus at least

τ_{t}

time units if the plant gradients are computed using past inputs and measurements, and even larger if additional perturbations are performed. During the waiting period the plant remains at a steady-state and no additional process information is gained during this period, hence the convergence to the optimum steady-state is slowed down. Gottu Mukkula et al. [32,33] proposed active perturbation strategies to perform input perturbations during the waiting period to gain additional process information that is used later to drive the process to its optimum faster. Figure 2 illustrates the active perturbation scheme where additional process perturbing inputs are given to the process, for example, between

t + τ_{t} + τ_{p}

and

t + 2 τ_{t}

, to gain additional process information that can be used in the subsequent iterations, once the measurements of the effects of the input perturbations arrive. In general, the maximum possible number of input perturbations that can be given to a process within the waiting period can be computed as [32,33]

P_{m a x} : = ⌊ min \{\frac{τ_{d} + τ_{m}}{τ_{p}}, \frac{τ_{d} + τ_{p}}{τ_{m}}\} ⌋ .

(8)

In the following subsections, we present the active perturbation strategies proposed in Gottu Mukkula et al. [32,33]. Both active perturbation strategies are illustrated using an example case with 2 input variables and by assuming the maximum possible perturbations per iteration (

P_{m a x}

) as 4.
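Equation (8) and the total delay can be evaluated directly. In the sketch below, the function names and the time constants are hypothetical; the numbers were chosen only so that the result matches the value of 4 used in the illustrations.

```python
import math


def total_delay(tau_p: float, tau_d: float, tau_m: float) -> float:
    """tau_t = tau_p + tau_d + tau_m: time until steady-state measurements arrive."""
    return tau_p + tau_d + tau_m


def max_perturbations(tau_p: float, tau_d: float, tau_m: float) -> int:
    """P_max of Eq. (8): floor of the smaller of the two ratios."""
    return math.floor(min((tau_d + tau_m) / tau_p, (tau_d + tau_p) / tau_m))


# Hypothetical time constants (arbitrary but consistent time units).
tau_t_example = total_delay(tau_p=10.0, tau_d=30.0, tau_m=10.0)
p_max_example = max_perturbations(tau_p=10.0, tau_d=30.0, tau_m=10.0)
```

If the analysis time of the instrument is doubled to 20 time units in this example, the second ratio in (8) becomes the limiting one and only two perturbations fit into the waiting period.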

3.1. Active Perturbation around the Current Input (APCI)

According to the method proposed in Gottu Mukkula et al. [32], perturbing inputs around the input in the current iteration, i.e.,

u^{k}

, are given to the process during the waiting period. Figure 3 illustrates this active perturbation method. In the figure, a dedicated marker

represents the inputs from MA. Let

u^{k}

be the input given to the plant in the kth iteration at time t. The plant reaches its steady-state at

t + τ_{p}

, and the steady-state measurements for

u^{k}

are available at

t + τ_{t}

. During the waiting period, i.e., from

t + τ_{p}

to

t + τ_{t}

,

P_{m a x} = 4

additional perturbations represented by circles (◯) are performed around

u^{k}

. Redundant probing is avoided by suppressing perturbation inputs that are closer than a threshold

ξ

from one of the earlier inputs. For example, the perturbing input represented by

⊘

is suppressed as it is in close vicinity to

u^{k - 1}

, and therefore, would be redundant. During the remaining waiting period, which is available due to suppressing of some perturbations, the plant is given the best known input

(u_{p, b e s t}^{k})

to avoid a loss of performance. As the perturbing inputs are not computed optimally, not all of them provide useful information. For example, in Figure 3, only the perturbation inputs represented by

\oplus

are used by the QA scheme for computing the gradient at

u^{k + 1}

.
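The suppression of redundant perturbations can be sketched as a simple distance filter. The Euclidean distance, the axis-aligned candidate pattern, and all names and numbers below are assumptions made for illustration; the text above does not prescribe how the candidate perturbations are generated.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def axis_candidates(u_k: Sequence[float], step: float) -> List[Point]:
    """Hypothetical candidate pattern: one +/- step along each input direction."""
    cands = []
    for j in range(len(u_k)):
        for sign in (1.0, -1.0):
            c = list(u_k)
            c[j] += sign * step
            cands.append(tuple(c))
    return cands


def suppress_redundant(candidates: Sequence[Point],
                       past_inputs: Sequence[Point], xi: float) -> List[Point]:
    """Keep only candidates that are at least xi away from every earlier input."""
    return [
        cand for cand in candidates
        if all(math.dist(cand, past) >= xi for past in past_inputs)
    ]


# Example with two inputs: one candidate lies close to an earlier input.
cands = axis_candidates([0.0, 0.0], step=1.0)
kept = suppress_redundant(cands, past_inputs=[(-1.1, 0.0)], xi=0.5)
```

The time freed by a suppressed candidate would then be spent at the best known input, as described above.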

3.2. Active Perturbation around an Estimate of the Next Input (APENI)

Here, the process is perturbed around an estimate of the next input, i.e.,

{\hat{u}}_{e}^{k + 1}

, instead of perturbing around the current input

u^{k}

. Due to the unavailability of the measurements for the input

u^{k}

during the waiting period, the plant gradients at

u^{k}

cannot be computed. So, the next input

u^{k + 1}

cannot be calculated precisely without the plant gradients at

u^{k}

, and therefore is estimated using the following assumption: If

u^{k}

is close to

u^{k - 1}

, the difference between the plant and the model gradients at

u^{k}

is approximately equal to the gradient difference at

u^{k - 1}

, i.e.,

(\nabla J_{p}^{k} - \nabla J_{m}^{k}) \approx (\nabla J_{p}^{k - 1} - \nabla J_{m}^{k - 1})

and

(\nabla G_{p}^{k} - \nabla G_{m}^{k}) \approx (\nabla G_{p}^{k - 1} - \nabla G_{m}^{k - 1})

. The same is assumed for the bias correction term

(G_{p}^{k} - G_{m}^{k})

. In other words, the optimization problem is unchanged. The input in the next iteration can then be estimated by solving the following modified MA problem:

{\hat{u}}_{e}^{k + 1} = \arg \min_{u \in [u^{L}, u^{U}]} J_{m, e}^{ad, k} (u)

(9a)

\text{s.t.} \quad G_{m, e}^{ad, k} (u) \leq 0,

(9b)

{(u - u^{k})}^{T} cov (U^{k}) (u - u^{k}) \leq γ^{2},

(9c)

where

J_{m, e}^{ad, k} (u) = J_{m} (u) + {(\nabla J_{p}^{k - 1} - \nabla J_{m}^{k - 1})}^{T} (u - u^{k}),

(10a)

G_{m, e}^{ad, k} (u) = G_{m} (u) + (G_{p}^{k - 1} - G_{m}^{k - 1}) + {(\nabla G_{p}^{k - 1} - \nabla G_{m}^{k - 1})}^{T} (u - u^{k - 1}) .

(10b)

The problem in (9) for computing

{\hat{u}}_{e}^{k + 1}

is the same as the problem in (4) for computing

u^{k + 1}

, except that the trust-region in (7) for computing

{\hat{u}}_{e}^{k + 1}

is now centered around

u^{k}

.

In Figure 4a, an ideal scenario for an active perturbation around the next input is illustrated. In the ideal case, the assumption on the modifiers is valid, i.e.,

(\nabla J_{p}^{k} - \nabla J_{m}^{k}) \approx (\nabla J_{p}^{k - 1} - \nabla J_{m}^{k - 1})

and

(\nabla G_{p}^{k} - \nabla G_{m}^{k}) \approx (\nabla G_{p}^{k - 1} - \nabla G_{m}^{k - 1})

. Then,

{\hat{u}}_{e}^{k + 1} \approx u^{k + 1}

, and therefore, the measurement information from the perturbed inputs is used to compute the inputs in upcoming iterations. Figure 4b illustrates a nonideal scenario where the assumption about the modifiers does not hold, so

{\hat{u}}_{e}^{k + 1} \neq u^{k + 1}

. Therefore, as shown in Figure 4b, not all measurement information from the perturbing inputs may be useful in the upcoming iterations. Redundant perturbation inputs are suppressed similar to the active perturbation around the current input (APCI) method.

4. Proactive Perturbation Scheme

Although the perturbation schemes proposed in Gottu Mukkula et al. [32,33] gain additional measurement information by perturbing the process during the waiting period, they are suboptimal as they are based on heuristics. Also, the measurement information gained from the additional input perturbations is not used immediately. Therefore, in this paper, we propose a proactive perturbation scheme where the additional perturbation inputs are computed by solving an optimization problem using all available process information. The key idea of the proactive perturbation scheme is illustrated in Figure 5 using an example case with

n_{u} = 1

,

P_{m a x} = 5

, and where the plant gradients are estimated using QA. At

t : = 0

, an input

u^{0}

is given to the process; the corresponding measurements become available after time

τ_{t}

. In the initial (0th) iteration, FD is used for gradient approximation as there are no past data points for gradient approximation. Therefore, after applying

u^{0}

for

max {τ_{p}, τ_{m}}

units of time, the process is operated with

u_{f d_{1}}^{0}

for gradient approximation. The corresponding measurements are available at time

ζ : = τ_{t} + (n_{u} \times max {τ_{p}, τ_{m}})

and a new iteration input

u^{6}

is computed by solving the MA problem in (4). During the waiting period in the 0th iteration, i.e., from

(n_{u} + 1) \times max {τ_{p}, τ_{m}}

to

ζ

,

P_{m a x}

additional perturbation moves (from

u^{1}

to

u^{5}

) around

u^{0}

are performed to gain additional knowledge about the process. For

n_{u} : = 1

, steady-state measurements for at least

ℓ : = 3

different inputs are required for the process gradients to be approximated using QA (cardinality condition). We note that ℓ varies if other methods are used for plant gradient estimation using past data. In this example, after time

ζ + max {τ_{p}, τ_{m}}

, there are already measurements for ℓ inputs. From here on, an MA problem is solved each time a new measurement is available as the plant gradients are computed using past inputs and measurements. From this point onward, each input to the process is optimal with respect to the available process knowledge, thereby making the proposed proactive perturbation scheme more efficient than the active perturbation methods proposed in Gottu Mukkula et al. [32,33].
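The characteristic times of the initial iteration described above can be computed as in the following sketch. The function name, the returned dictionary keys, and the time constants are hypothetical; only the expressions for ζ and for the start of the waiting period are taken from the text.

```python
from typing import Dict


def initial_iteration_times(tau_p: float, tau_d: float, tau_m: float,
                            n_u: int) -> Dict[str, float]:
    """Timing of the 0th iteration when FD is used for the first gradient estimate."""
    slot = max(tau_p, tau_m)              # time each input is held
    tau_t = tau_p + tau_d + tau_m         # total delay until a measurement arrives
    zeta = tau_t + n_u * slot             # measurements of all FD inputs available
    wait_start = (n_u + 1) * slot         # u^0 and the n_u FD inputs have been applied
    return {"slot": slot, "zeta": zeta, "wait_start": wait_start}


# Hypothetical time constants for a single-input process.
times = initial_iteration_times(tau_p=10.0, tau_d=30.0, tau_m=10.0, n_u=1)
```

The interval between `wait_start` and `zeta` is the waiting period of the 0th iteration, during which the additional perturbation moves around the initial input are applied.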

A flowsheet for the implementation of the proposed proactive perturbation scheme is presented in Figure 6. In the flowsheet,

ϕ

is the total number of inputs given to the plant, also including the inputs whose steady-state measurements are not yet available and

ψ

is the total number of inputs given to the plant for which the steady-state plant measurements are already available. The major steps in the flowsheet are:

➀: perturbation input for gradient estimation using FD
➁: perturbation input for gradient estimation using FD, also considered as an input ( $u^{k}$ ) for MA problem formulation
➂: perturbation input to gain additional information during the waiting period
➃: perturbation input to gain additional information during the waiting period, also considered as an input ( $u^{k}$ ) for MA problem formulation
➄: input ( $u^{k + 1}$ ) computed by solving a MA problem (4) with plant gradients approximated using FD
➅: perturbation input considered as an input ( $u^{k}$ ) for MA problem formulation, to ensure a continuous flow of steady-state measurements every $max {τ_{p}, τ_{m}}$ units of time
➆: input ( $u^{k + 1}$ ) computed by solving MA problem with gradients approximated from past (at least ℓ) measurements

In all the active perturbation strategies discussed in this section, to avoid frequent probing of the plant with unsuccessful inputs, and thus, deteriorating the plant performance, a list of unsuccessful moves is stored. An input from MA

u^{k}

is applied to the plant only if the number of past inputs on the list that are not farther than

φ

from

u^{k}

is less than a predefined number

χ_{c o u n t}

. The parameters

φ, χ_{c o u n t}

are defined depending on the expected level of measurement noise.
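The check against the list of unsuccessful moves can be sketched as follows. The Euclidean distance and the function name are assumptions; the parameter names mirror φ and χ_count from the text, and the example values are arbitrary.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, ...]


def should_apply(u_k: Sequence[float], unsuccessful: Sequence[Point],
                 phi: float, chi_count: int) -> bool:
    """Apply u_k only if fewer than chi_count stored unsuccessful inputs lie within phi."""
    nearby = sum(1 for u in unsuccessful if math.dist(u, u_k) <= phi)
    return nearby < chi_count


# Two stored unsuccessful moves lie close to the candidate, one lies far away.
failed = [(1.0, 1.0), (1.05, 0.95), (3.0, 3.0)]
apply_strict = should_apply((1.0, 1.0), failed, phi=0.1, chi_count=2)
apply_loose = should_apply((1.0, 1.0), failed, phi=0.1, chi_count=3)
```

A larger `phi` or a smaller `chi_count` rejects more candidate inputs, which may be appropriate when the measurement noise is high.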

5. Williams–Otto Reactor Case Study

5.1. Process Description

The Williams–Otto reactor case study [39] is a benchmark case study widely used to analyze the performance of RTO methods. Reactants A and B react in a CSTR to produce products E and P, and a side product G. While the reaction in the plant follows a three-step reaction mechanism, the nominal model considers a two-step reaction mechanism, ignoring the formation of an intermediate component C, leading to a structural plant-model mismatch. The reaction mechanisms for the plant and in the nominal model are:

\begin{matrix} \text{Plant:} & \text{Nominal model:} \\ A + B \overset{k_{1}}{\longrightarrow} C, & A + 2 B \overset{{\tilde{k}}_{1}}{\longrightarrow} P + E, \\ C + B \overset{k_{2}}{\longrightarrow} P + E, & A + B + P \overset{{\tilde{k}}_{2}}{\longrightarrow} G + E . \\ P + C \overset{k_{3}}{\longrightarrow} G . & \end{matrix}

The goal is to maximize the profit function $J (y, u) := (1143.38 x_{P, ss} + 25.92 x_{E, ss}) F_{R} - 76.23 F_{A} - 114.34 F_{B}$, where $x_{P, ss}$ and $x_{E, ss}$ are the measured steady-state concentrations of products P and E, and $F_{R}$ is the sum of the feed flowrates of reactants A and B, i.e., $F_{R} := F_{A} + F_{B}$. The feed flowrate $F_{B}$ and the reaction temperature $T_{R}$ are the manipulated variables ($u = [F_{B}, T_{R}]$) with ranges of $[3, 6]$ kg s$^{-1}$ and $[343.15, 373.15]$ K. The steady-state plant model ($F_{p} (u)$) and the nominal model ($F_{m} (u)$) are:

F_{p} (u) : = (\begin{matrix} F_{A} - F_{R} x_{A} - m_{R} r_{1} \\ F_{B} - F_{R} x_{B} - m_{R} (r_{1} + r_{2}) \\ - F_{R} x_{C} + m_{R} (2 r_{1} - 2 r_{2} - r_{3}) \\ - F_{R} x_{E} + 2 m_{R} r_{2} \\ - F_{R} x_{G} + 1.5 m_{R} r_{3} \\ - F_{R} x_{P} + m_{R} (r_{2} - 0.5 r_{3}) \\ F_{R} - F_{A} - F_{B} \end{matrix}) = 0,

(11)

and

F_{m} (u) : = (\begin{matrix} F_{A} - F_{R} x_{A} - m_{R} ({\tilde{r}}_{1} + {\tilde{r}}_{2}) \\ F_{B} - F_{R} x_{B} - m_{R} (2 {\tilde{r}}_{1} + {\tilde{r}}_{2}) \\ - F_{R} x_{E} + 2 m_{R} {\tilde{r}}_{1} \\ - F_{R} x_{G} + 3 m_{R} {\tilde{r}}_{2} \\ - F_{R} x_{P} + m_{R} ({\tilde{r}}_{1} - {\tilde{r}}_{2}) \\ F_{R} - F_{A} - F_{B} \end{matrix}) = 0,

(12)

where $x_{i}$ is the concentration of component i and $m_{R} := 2105$ kg is the mass of the reaction mixture in the reactor. $r := \{r_{1}, r_{2}, r_{3}\}$ and $\tilde{r} := \{{\tilde{r}}_{1}, {\tilde{r}}_{2}\}$ represent the reaction rates in the plant and in the nominal model, respectively. The reaction rates and their kinetic parameters are defined as:

\begin{matrix} r_{1} = k_{1} x_{A} x_{B}, & k_{1} = 1.660 \times 10^{6} e^{- 6666.7 / (T_{R} + 273.15)}, \\ r_{2} = k_{2} x_{B} x_{C}, & k_{2} = 7.212 \times 10^{8} e^{- 8333.3 / (T_{R} + 273.15)}, \\ r_{3} = k_{3} x_{C} x_{P}, & k_{3} = 2.675 \times 10^{6} e^{- 11111 / (T_{R} + 273.15)}, \\ {\tilde{r}}_{1} = {\tilde{k}}_{1} x_{A} x_{B}^{2}, & {\tilde{k}}_{1} = 2.19 \times 10^{8} e^{- 8077.6 / (T_{R} + 273.15)}, \\ {\tilde{r}}_{2} = {\tilde{k}}_{2} x_{A} x_{B} x_{P}, & {\tilde{k}}_{2} = 4.31 \times 10^{13} e^{- 12438.5 / (T_{R} + 273.15)} . \end{matrix}
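As a rough illustration, the plant kinetics and the profit function can be written as two small functions. The coefficients are transcribed from the expressions above. The function names, the feed flowrate of A used in the example, and the concentrations are hypothetical. The temperature argument is the value $T_{R}$ as it appears in the rate expressions, to which 273.15 is added.

```python
import math

def plant_rate_constants(t_r):
    """Rate constants k1, k2, k3 of the plant for the T_R used in the rate expressions."""
    t_abs = t_r + 273.15  # offset as written in the kinetic expressions
    k1 = 1.660e6 * math.exp(-6666.7 / t_abs)
    k2 = 7.212e8 * math.exp(-8333.3 / t_abs)
    k3 = 2.675e6 * math.exp(-11111.0 / t_abs)
    return k1, k2, k3

def profit(x_p_ss, x_e_ss, f_a, f_b):
    """Profit J(y, u) from steady-state concentrations of P and E and the feed flowrates."""
    f_r = f_a + f_b  # F_R := F_A + F_B
    return (1143.38 * x_p_ss + 25.92 * x_e_ss) * f_r - 76.23 * f_a - 114.34 * f_b

# Hypothetical operating point, for illustration only
k1, k2, k3 = plant_rate_constants(90.0)
print(k1, k2, k3)
print(profit(x_p_ss=0.1, x_e_ss=0.2, f_a=2.0, f_b=4.0))
```

Solving the full steady-state model (11) additionally requires a nonlinear equation solver, which is omitted here.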

For the purpose of illustrating the performance of the proposed proactive perturbation scheme, we consider that $τ_{p}$, $τ_{d}$, and $τ_{m}$ are 3, 12, and 3 min, respectively.

5.2. Simulation Results

The modifier adaptation scheme MAWQA with GMA [26] is used in the simulation study to compare three methods: active perturbation around the current input (APCI), active perturbation around an estimate of the next input (APENI), and the proposed proactive perturbation scheme, using the tuning parameters listed in Table 1. In MAWQA with GMA, the plant gradients are estimated from past inputs and steady-state measurements using QA. All schemes are initialized at $u^{0} := [3, 100]$. For the simulation study, the measured plant profit is corrupted by Gaussian noise with zero mean and a standard deviation of 1.0.

Figure 7 illustrates the evolution of the inputs (scaled) and the plant profit function from MAWQA with GMA, MAWQA with GMA with active perturbation schemes APCI and APENI, and MAWQA with GMA with the proposed proactive perturbation scheme. Descriptions of all markers in the figure are given in Table 2. The axis for the value of the profit function can be found on the right side of the plot and the axis for the inputs (scaled) to the plant is on the left. Each vertical grid line in the plot represents the completion of an iteration, which includes all steps from applying an input to the process to computing a new input by solving a modifier adaptation problem. For MAWQA with GMA and MAWQA with GMA with active perturbation schemes APCI and APENI, after applying $u^{0}$ at $t = 0$, two input steps (perturbations) with a scaled step length $Δ h_{u}$ are performed to compute the plant gradients using finite differences. Each input is applied to the plant until it reaches a steady state, i.e., for $τ_{p} := 3$ min. The steady-state plant measurement for each applied input is obtained $τ_{t} := 18$ min after applying the input. Upon computing the plant gradients using the steady-state plant measurements, the modifier adaptation problem in (4) is solved to compute the input $u^{1}$. As the condition for cardinality is not met, i.e., $|U^{1}| < ℓ$, $n_{u} := 2$ additional perturbations are performed after $u^{1}$ to compute the plant gradients. Similar to the 0th iteration, the MA problem in (4) is solved in the 1st iteration to compute $u^{2}$. From here on, the cardinality condition is always satisfied. Therefore, QA is used for gradient approximation for all further iterations. In MAWQA with GMA, additional input perturbations are made only if the criticality-check criterion is not satisfied, for example in the 2nd iteration after applying $u^{2}$ to the plant.
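The finite-difference step of the first iterations can be sketched as follows. This is a minimal illustration that assumes forward differences with one perturbation per input coordinate. The quadratic stand-in for the plant profit, the function names, and the numbers are hypothetical.

```python
def perturbation_inputs(u0, step):
    """One perturbed input per coordinate: u0 + step * e_i."""
    return [[u + (step if i == j else 0.0) for j, u in enumerate(u0)]
            for i in range(len(u0))]

def fd_gradient(j0, j_perturbed, step):
    """Forward-difference gradient from the base value and the perturbed values."""
    return [(jp - j0) / step for jp in j_perturbed]

def toy_profit(u):
    """Hypothetical stand-in for the measured steady-state plant profit."""
    return -(u[0] - 0.6) ** 2 - (u[1] - 0.4) ** 2

u0 = [0.5, 0.5]   # scaled input
dh_u = 0.01       # scaled step length
perturbed = perturbation_inputs(u0, dh_u)
grad = fd_gradient(toy_profit(u0), [toy_profit(u) for u in perturbed], dh_u)
print(perturbed)
print(grad)       # close to the analytic gradient [0.2, -0.2]
```

With $n_{u} = 2$ inputs, this requires three steady-state measurements per gradient estimate, which is why each of the first two iterations applies the iteration input followed by two perturbations.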

In MAWQA with GMA with active perturbation schemes APCI and APENI, additional perturbations are made once the cardinality condition is satisfied, i.e., from the 2nd iteration onwards, and if there are no past inputs lying closer than $ξ$ to the scaled planned perturbation inputs. In MAWQA with GMA with APCI, the additional perturbations are stopped from the 11th iteration, as from $k := 11$ onwards, all the planned perturbation inputs are not farther than $ξ$ from past inputs. In MAWQA with GMA with APENI, the perturbation inputs in the kth iteration are chosen around an estimate of the iteration input for the $(k + 1)$th iteration. From iteration $k := 10$ onwards, all the planned perturbation inputs are not farther than $ξ$ from earlier inputs, so all active perturbations are stopped.
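The stopping rule for active perturbations can be illustrated with a short filter. This is a sketch under assumptions: distances are Euclidean on scaled inputs, and the function name and data are hypothetical.

```python
import math

def filter_perturbations(planned, past_inputs, xi):
    """Keep only the planned perturbations that have no past input closer than xi."""
    return [p for p in planned
            if all(math.dist(p, u) >= xi for u in past_inputs)]

# Hypothetical scaled inputs
past_inputs = [(0.40, 0.40), (0.45, 0.42)]
planned = [(0.46, 0.42), (0.60, 0.30)]

print(filter_perturbations(planned, past_inputs, xi=0.05))  # [(0.6, 0.3)]
```

Once the filter returns an empty list for every iteration, no further active perturbations are applied, which corresponds to the behavior described above near convergence.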

In the new scheme, MAWQA with GMA with proactive perturbation, after applying the input $u^{0}$ to the plant, as $ϕ := 1$ and $ϕ < ℓ$, two input perturbations $u_{fd_{1}}^{0}, u_{fd_{2}}^{0}$ are performed to compute the plant gradients using finite differences. All steady-state measurements necessary for gradient correction, i.e., those for $u^{0}$ and the two additional perturbations $u_{fd_{1}}^{0}, u_{fd_{2}}^{0}$, are available after $ζ := 24$ min. During the waiting period, i.e., from $(n_{u} + 1) \times \max \{τ_{p}, τ_{m}\} := 9$ min to $ζ := 24$ min, $P_{max} := 5$ additional input perturbations $u_{ap_{1}}^{0}, \dots, u_{ap_{5}}^{0}$ are performed to gain additional plant information. As the input perturbations when $ϕ \geq ℓ$ are considered as inputs from MA (according to the flowsheet in Figure 6), among the 5 additional perturbations, $u_{ap_{3}}^{0}, \dots, u_{ap_{5}}^{0}$ are renamed as $u^{1}, u^{2}, u^{3}$. At 24 min, steady-state measurements for $u^{0}$ and $u_{fd_{1}}^{0}, u_{fd_{2}}^{0}$ are available. Therefore, an MA problem is formulated and solved to compute the next input $u^{4}$. For the next iteration, as $ϕ := 9$ and $ϕ ≮ ℓ$, FD is no longer used for gradient approximation. However, there are not enough measurements for QA, i.e., $ψ := 4$, after applying $u^{4}$. As $(ℓ - ψ) := 2$, input perturbations $u^{5}$ and $u^{6}$ are added to smoothly switch from using FD for gradient approximation to QA. After $u^{6}$, $ψ = ℓ$ and a new steady-state measurement is always available every $\max \{τ_{p}, τ_{m}\}$, thereby overcoming $τ_{d}$ and $τ_{m}$. From here on, a new iteration input is computed every $\max \{τ_{p}, τ_{m}\} := 3$ min, taking into account the latest measurement information available.
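The timing quantities in this walkthrough can be reproduced with simple arithmetic. The sketch below assumes that $τ_{t} = τ_{p} + τ_{d} + τ_{m}$ and that $ζ$ is the time at which the measurement of the last FD perturbation arrives. Both assumptions are consistent with the numbers quoted above, but the function itself is only an illustration, not the authors' implementation.

```python
def proactive_timing(tau_p, tau_d, tau_m, n_u):
    """Return (tau_t, wait_start, zeta, p_max) in the time unit of the inputs."""
    step = max(tau_p, tau_m)        # interval between two consecutive inputs
    tau_t = tau_p + tau_d + tau_m   # assumed: settling + transport + analysis
    wait_start = (n_u + 1) * step   # u^0 and the n_u FD perturbations are applied
    zeta = n_u * step + tau_t       # measurement of the last FD perturbation arrives
    p_max = (zeta - wait_start) // step  # perturbations that fit into the waiting period
    return tau_t, wait_start, zeta, p_max

print(proactive_timing(tau_p=3, tau_d=12, tau_m=3, n_u=2))  # (18, 9, 24, 5)
```

For the values used in this case study, the waiting period spans 15 min, which leaves room for five additional inputs of 3 min each.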

Although all the schemes converged to an input in the neighborhood of the plant optimum, the MAWQA with GMA scheme took the most time to converge, 375 min and 20 iterations. The time of convergence is indicated with a red-dashed vertical line in Figure 7. MAWQA with GMA with APCI and with APENI gained additional plant information from suboptimal input perturbations, and therefore converged in 303 and 249 min and in 16 and 13 iterations, respectively. Finally, MAWQA with GMA with the proactive perturbation scheme converged in only 123 min with 31 iterations. It gained more information per unit of time from additional perturbations than the earlier proposed active perturbation schemes, and it also used the measurement information immediately after it became available.
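The reported convergence figures can be put side by side with a short calculation. The numbers are those quoted above for the single run in Figure 7. The derived quantities (minutes per iteration and speed-up relative to MAWQA with GMA) are computed here for illustration and are not reported in the paper.

```python
# (minutes to converge, iterations to converge) as reported in the text
reported = {
    "MAWQA with GMA": (375, 20),
    "APCI": (303, 16),
    "APENI": (249, 13),
    "Proactive": (123, 31),
}

def minutes_per_iteration(minutes, iterations):
    return minutes / iterations

def speed_up(baseline_minutes, minutes):
    return baseline_minutes / minutes

baseline = reported["MAWQA with GMA"][0]
for name, (minutes, iterations) in reported.items():
    print(f"{name}: {minutes_per_iteration(minutes, iterations):.2f} min/iteration, "
          f"speed-up {speed_up(baseline, minutes):.2f}x")
```

The proactive scheme needs more iterations, but each one takes roughly 4 min instead of almost 19 min, which gives a speed-up of about 3 in this run.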

The mean profit function for 100 simulations for all schemes is shown in Figure 8. The mean and standard deviation of the profit function did not vary significantly upon convergence. The mean and standard deviation of the times of convergence for all modifier adaptation schemes are also shown. In the proactive perturbation scheme, the profit function converges earlier to the plant optimum, followed by the active perturbation schemes APCI and APENI, and the standard modifier adaptation scheme. The mean time of convergence for MAWQA with GMA if $τ_{d} = 0$, shown in green, is longer than the mean time of convergence of the proposed proactive perturbation scheme, as the proposed scheme also overcomes $τ_{m}$ in addition to $τ_{d}$. This illustrates that the proposed scheme overcame the effect of the time delay ($τ_{d} \gg τ_{p}, τ_{m}$) in this case.

The Friedman ranking test [40] was performed for a ranking comparison of the proposed proactive perturbation scheme with standard MAWQA, and the APCI and APENI schemes based on the time of convergence for 100 realizations of the measurement noise. The Friedman test is a nonparametric statistical test to recognize performance differences among different methods across multiple test samples. For each realization of the measurement noise, the methods are ranked based on their time of convergence. The lowest mean rank for a method implies its consistently superior performance, and vice versa for the highest mean rank. The mean rank of the standard scheme with no additional perturbations is 3.87, for APCI it is 2.6, for APENI 2.51, and the proposed proactive perturbation scheme has a mean rank of 1.02. Thus, the proposed scheme clearly showed a superior performance when compared to that of the other methods.
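The mean ranks used in this comparison can be computed as sketched below. This is a plain-Python illustration of the ranking step only (without the Friedman test statistic). The convergence times in the example are hypothetical, and ties are assumed to share the average rank.

```python
def mean_ranks(times):
    """times[r][m] is the convergence time of method m in noise realization r.

    Returns the mean rank of each method (1 = fastest); ties share the average rank.
    """
    n_methods = len(times[0])
    totals = [0.0] * n_methods
    for row in times:
        order = sorted(range(n_methods), key=lambda m: row[m])
        i = 0
        while i < n_methods:
            j = i
            # extend j over all methods tied with the one at position i
            while j + 1 < n_methods and row[order[j + 1]] == row[order[i]]:
                j += 1
            avg_rank = (i + j) / 2 + 1
            for k in range(i, j + 1):
                totals[order[k]] += avg_rank
            i = j + 1
    return [t / len(times) for t in totals]

# Hypothetical times in min; columns: standard, APCI, APENI, proactive
times = [
    [375, 303, 249, 123],
    [360, 250, 270, 120],
    [380, 300, 300, 130],
]
print(mean_ranks(times))  # [4.0, 2.5, 2.5, 1.0]
```

With four methods, the mean ranks always sum to 10, which is a quick consistency check for reported values.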

6. Lithiation Process Case Study

6.1. Process Description

The lithiation case study is a real case where $τ_{d} \gg \{τ_{p}, τ_{m}\}$ [41]. The iterative real-time optimization method MAWQA was successfully applied to the real process by Gottu Mukkula et al. [41], but there is still scope for improvement in the time of convergence to the real process optimum due to the large waiting period in the process. In this simulation study, we aim to show that the plant optimum can be identified much earlier with the proposed proactive perturbation scheme.

A lithiation reaction [41] is performed to produce Lithium 2-Nitrodiphenylamine (component H) in a containerized reactor module developed within the $F^{3}$-factory project [42] at the INVITE GmbH facility. All chemical components involved in the lithiation reaction are listed in Table 3 and the reaction mechanism is shown in Figure 9. The main modules of the containerized reactor, i.e., the coiled tubular reactor, a NIR flow cell, an online-NMR sensor [43] and a product filter, are shown in Figure 10. The byproduct Lithium fluoride (component G) precipitates along the length of the tubular reactor, and the solid G is filtered out before the reaction mixture from the outlet of the coiled reactor is collected in a product tank. A filtered sample of the reaction mixture is continuously fed to an online-NMR measurement system and to a NIR flow cell to measure the concentration of the product in the reaction mixture at the reactor outlet [43].

The following steady-state plant model was developed assuming a negligible value of the dispersion coefficients for concentrations and temperature:

\begin{matrix} \frac{\partial C_{i}}{\partial z} & = \frac{r_{i}}{v_{z}} \forall i \in {A, B, C, D, E, F, G, H}, \end{matrix}

(13a)

\begin{array}{ll} \frac{\partial T_{R}}{\partial z} = & \frac{1}{v_{z} ρ c_{p}} (K A (T_{R} - T_{Envi}) - Δ_{H_{1}} k_{1} C_{A} C_{B} - Δ_{H_{2}} k_{2} C_{C} C_{E} \\ & - Δ_{H_{3}} k_{3} C_{C} C_{F} - Δ_{H_{4}} k_{4} C_{B} C_{F}), \end{array}

(13b)

where $r_{i}$ is the reaction rate of component i, described by:

\begin{matrix} r_{A} & = - k_{1} C_{A} C_{B} + k_{3} C_{C} C_{F}, & r_{E} = - k_{2} C_{C} C_{E}, \end{matrix}

(14a)

\begin{matrix} r_{B} & = - k_{1} C_{A} C_{B} - k_{4} C_{B} C_{F}, & r_{F} = k_{2} C_{C} C_{E} - k_{3} C_{C} C_{F} - k_{4} C_{B} C_{F}, \end{matrix}

(14b)

\begin{matrix} r_{C} & = k_{1} C_{A} C_{B} - k_{2} C_{C} C_{E} - k_{3} C_{C} C_{F}, & r_{G} = k_{2} C_{C} C_{E}, \end{matrix}

(14c)

\begin{matrix} r_{D} & = k_{1} C_{A} C_{B} + k_{4} C_{F} C_{B}, & r_{H} = k_{3} C_{C} C_{F} + k_{4} C_{F} C_{B} . \end{matrix}

(14d)

The model parameters are provided in Table 4, and the reaction rate constants $k_{i} \; \forall i = 1, \dots, 4$ depend on the temperature via an Arrhenius-type equation:

k_{i} = k_{i 0} exp (\frac{- E}{R T_{R}}) .

(15)
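Equation (15) translates directly into a small helper. The pre-exponential factor and activation energy in the example are hypothetical placeholders, since the values of Table 4 are not reproduced in this section. The gas constant is the standard value in J mol$^{-1}$ K$^{-1}$.

```python
import math

R_GAS = 8.314  # universal gas constant in J mol^-1 K^-1

def arrhenius(k0, e_a, t_r):
    """Rate constant k = k0 * exp(-E / (R * T_R)) with T_R in K."""
    return k0 * math.exp(-e_a / (R_GAS * t_r))

# Hypothetical parameters, for illustration only
for t_r in (280.0, 300.0, 320.0):
    print(t_r, arrhenius(k0=1.0e5, e_a=4.0e4, t_r=t_r))
```

For a positive activation energy, the rate constant increases monotonically with the reactor temperature $T_{R}$.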

Although a four-step reaction takes place in the reactor, the nominal model instead considers a single reaction:

A + 2 B + E ⟶ 2 D + G + H,

(16)

leading to a structural plant-model mismatch. The steady-state nominal model is defined as follows:

\begin{matrix} \frac{\partial C_{i}}{\partial z} & = \frac{{\bar{r}}_{i}}{v_{z}} \forall i \in {A, B, D, E, G, H}, \end{matrix}

(17a)

\begin{matrix} \frac{\partial T_{R}}{\partial z} & = \frac{1}{v_{z} ρ c_{p}} (K A (T_{R} - T_{E n v i}) - Δ_{H} k C_{A} C_{B} C_{E}) . \end{matrix}

(17b)

The reaction rates ${\bar{r}}_{i}$ in the nominal model follow the elementary rate law with the rate constant k defined by the Arrhenius equation with $k = 17.8 \exp (\frac{E}{R T_{R}})$.

The goal is to maximize the plant profit that is computed by the function

J (y, u) : = \frac{{\bar{w}}_{H} C_{H, s s} M_{H}}{ρ} (8 + \sum_{i = 1}^{2} u_{i}) - {\bar{w}}_{A} u_{A} - {\bar{w}}_{E} u_{E},

(18)

where $u_{A}$ and $u_{E}$ are the feed flowrates of Aniline and FNB in kg h$^{-1}$. The feed flow rate of LiHMDS is set to the maximum of 8 kg h$^{-1}$. The manipulated variables $u_{A}$ and $u_{E}$ can take values between 3 and 8 kg h$^{-1}$. $M_{H} = 0.1752$ kg mol$^{-1}$ is the molecular weight of component H, $\{{\bar{w}}_{H}, {\bar{w}}_{A}, {\bar{w}}_{E}\}$ reflect the component prices, with values of {450,000, 10,000, 12,000} \$ kg$^{-1}$, and $C_{H, ss}$ is the measured steady-state concentration of component H in mol m$^{-3}$. $\{τ_{p}, τ_{d}, τ_{m}\}$ are $\{3, 12, 3\}$ min.
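The profit function (18) can be sketched as a plain function. The prices and $M_{H}$ are taken from the text. The density value in the example is a hypothetical placeholder (the actual value belongs to Table 4), and the sum over $u_{i}$ is assumed to run over the two manipulated feed flowrates $u_{A}$ and $u_{E}$.

```python
W_H, W_A, W_E = 450_000.0, 10_000.0, 12_000.0  # component prices in $ per kg
M_H = 0.1752                                    # molecular weight of H in kg per mol
U_LIHMDS = 8.0                                  # fixed LiHMDS feed in kg per h

def lithiation_profit(c_h_ss, u_a, u_e, rho):
    """Profit J(y, u) of Equation (18).

    c_h_ss: measured steady-state concentration of H in mol per m^3
    u_a, u_e: feed flowrates of Aniline and FNB in kg per h
    rho: density of the reaction mixture in kg per m^3 (assumed value)
    """
    mass_fraction_h = c_h_ss * M_H / rho
    total_flow = U_LIHMDS + u_a + u_e
    return W_H * mass_fraction_h * total_flow - W_A * u_a - W_E * u_e

# Hypothetical measurement and density, for illustration only
print(lithiation_profit(c_h_ss=100.0, u_a=4.0, u_e=4.0, rho=1000.0))
```

The first term is the value of the produced component H per hour, and the remaining terms are the raw material costs of the two manipulated feeds.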

6.2. Simulation Results

All tuning parameters used in the simulation study are listed in Table 1. The RTO schemes are initialized at $u^{0} := [3.58, 3.58]$ and the measured variable $C_{H}$ is corrupted by Gaussian noise with zero mean and a standard deviation of 1.0. Similar to the Williams–Otto reactor case study, model adequacy is enforced using the guaranteed model adequacy (GMA) approach proposed in Gottu Mukkula and Engell [26]. Convergence to the process optimum, in this case study, is assumed when the profit from $N_{c}$ consecutive iterations is greater than $1.175 \times 10^{5}$ \$. Figure 11 illustrates the mean and the 10th to 90th percentile range of the profit function, and the mean time of convergence for 100 simulations of MAWQA with GMA, MAWQA with GMA with active perturbation schemes APCI and APENI, and MAWQA with GMA with the proposed proactive perturbation scheme. The deviations in the profit function are due to the measurement noise, which affects the computation of the next inputs via the calculation of the quadratic approximation. The figure shows that the proposed proactive perturbation scheme converges faster than MAWQA with GMA and the two active perturbation variants proposed earlier. The reason for this is that the idle time is used better by intelligently scheduling the process inputs, taking into account the time delays to obtain the steady-state process measurements. Figure 12 shows the mean value of the profit function from MAWQA with GMA, MAWQA with GMA with APCI and APENI, and MAWQA with GMA with the proposed proactive perturbation scheme. The mean time of convergence and the standard deviation of the time of convergence for 100 simulations are shown as error bars. In addition to the faster convergence of the proposed proactive perturbation scheme, the standard deviation is also smaller than that of the other schemes. Similar to the Williams–Otto reactor case study, the mean time of convergence of the proposed proactive perturbation scheme is even shorter than that of the MAWQA with GMA scheme with $τ_{d} = 0$. The mean Friedman rank [40] of the standard scheme with no additional perturbations is 3.64, for APCI it is 2.82, for APENI 2.5, and for the proposed proactive perturbation scheme it is 1.06.
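The convergence criterion used in this case study can be checked with a few lines of code. The threshold is the one stated above. The value of $N_{c}$ and the profit sequence in the example are hypothetical, as Table 1 is not reproduced here.

```python
def convergence_index(profits, threshold, n_c):
    """Index of the iteration that completes the first run of n_c consecutive
    profits above the threshold, or None if no such run exists."""
    run = 0
    for k, profit in enumerate(profits):
        run = run + 1 if profit > threshold else 0
        if run >= n_c:
            return k
    return None

# Hypothetical measured profits per iteration in $
profits = [1.10e5, 1.18e5, 1.16e5, 1.18e5, 1.19e5, 1.18e5]
print(convergence_index(profits, threshold=1.175e5, n_c=3))  # 5
```

A single noisy measurement below the threshold resets the counter, so a larger $N_{c}$ makes the criterion more robust to measurement noise at the price of a later detection.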

The time to compute a new input on an Intel(R) Core(TM) i7-4790 CPU @ 3.60 GHz × 4 running a 64-bit Windows 10 operating system ranged between 0.01 and 0.02 s for both case studies. Such short computation times would enable the use of the proposed proactive perturbation scheme also for processes with very short settling times.

7. Conclusions

In this paper, we address iterative RTO with plant-model mismatch in the presence of a time delay caused by the positioning of an analytical measurement device at some distance from the place where the process stream is sampled. The time delay leads to a longer time of convergence to the real process optimum for modifier adaptation methods, during which the plant is not operated optimally. Perturbation schemes that use the period spent waiting for the transfer of the sample and the analytical measurement to further probe the plant were proposed earlier, but they rely on heuristics for the choice of the plant perturbations. A new proactive perturbation scheme is proposed in this paper to handle the time delay. In contrast to the earlier proposed active perturbation schemes, the additional perturbation inputs are computed by solving an optimization problem using all available process information.

In the proposed proactive perturbation approach, the measurement delay affects the time of convergence only once at the beginning, until the number of plant measurements required for gradient approximation using quadratic approximation is available. Later, the proactive perturbation scheme computes a new input by solving a modifier adaptation problem as soon as new measurement information is available, with a sampling time equal to the maximum of the settling time of the plant and the time taken by the measurement device to analyze a sample. The performance of the proposed scheme was evaluated in two case studies and compared with that of other active perturbation schemes. The proposed proactive perturbation scheme drives the real plant to its optimum significantly faster than the other approaches, and it significantly reduces the effect of the measurement time delay for both case studies because the scheme collects more useful information from the perturbations. An important issue in the application of MA schemes to real plants is to freeze and restart the adaptation when the input moves are mainly caused by measurement noise. Initial proposals for this can be found in [38], but this requires more research in the future.

Author Contributions

Conceptualization, formal analysis, methodology, software, writing—original draft preparation, validation, visualization, A.R.G.M.; conceptualization, funding acquisition, supervision, writing—review and detailed editing, S.E. Both authors have read and agreed to the published version of the manuscript.

Funding

The research leading to these results was funded by the European Union’s Horizon 2020 research and innovation program under grant agreement number 636942 “CONSENS-Integrated Control and Sensing”. Supported by the Deutsche Forschungsgemeinschaft (DFG) in the context of the Interdisciplinary Research Centre TRR 63 “Integrated Chemical Processes in Liquid Multiphase Systems” (Integrierte chemische Prozesse in flüssigen Mehrphasensystemen), Subproject D4, 56091768.

Conflicts of Interest

The authors declare no conflict of interest.

Symbols

The following symbols are used in this manuscript:

$u$	input variables
$n_{u}$	number of input variables
$F_{p} (u)$	true process model
$F_{m} (u)$	nominal process model
$y$	measured variables
$\hat{y}$	values of the measured variables estimated using a nominal process model
$n_{y}$	number of measured variables
$n_{c}$	number of constraint functions in the optimization problem
$u_{p}^{*}$	true process optimum
$u_{m}^{*}$	optimum computed using a nominal process model
$u^{L}$	lower bound of the input variables
$u^{U}$	upper bound of the input variables
$J (y, u)$	cost function in the optimization problem
$J_{p} (u)$	cost function computed using true process measurements
$J_{m} (u)$	cost function computed using estimated values of the measured variables
$J_{m}^{a d, k} (u)$	cost function of the modifier adaptation problem in the kth iteration
$J_{m, e}^{a d, k} (u)$	cost function of the optimization problem to estimate ${\hat{u}}^{k + 1}$
$G (y, u)$	constraint functions of the optimization problem
$G_{p} (u)$	constraint functions evaluated using the true process measurements
$G_{m} (u)$	constraint functions evaluated using estimated values of the measured variables
$G_{m}^{a d, k} (u)$	constraint functions of the modifier adaptation problem in the kth iteration
$G_{m, e}^{a d, k} (u)$	constraint functions of the optimization problem to estimate ${\hat{u}}^{k + 1}$
∇	gradient operator
$Q (p, u)$	quadratic function
$p$	parameters of the quadratic function
ℓ	number of parameters in the quadratic function
$U^{k}$	set of all data points available until the kth iteration of MA
$U^{k}$	selected data points for quadratic approximation in the kth iteration of MA
$U_{d i s t}^{k}$	anchor points in $U^{k}$
$U_{n b}^{k}$	neighboring points in $U^{k}$
$Δu$	radius of the inner circle (tuning parameter)
$Δ h_{u}$	step length for finite differences (tuning parameter)
$δ$	minimum value for the inverse of the condition number (tuning parameter)
$s^{k}$	inverse of the condition number in the kth iteration of MA
${\hat{u}}^{k + 1}$	input variables for the $(k + 1)$th iteration computed by solving the MA problem in the kth iteration
${\hat{u}}_{e}^{k + 1}$	estimated input variables for the $(k + 1)$th iteration
$γ$	parameter to scale the trust region (tuning parameter)
cov	covariance operator
$C$	evaluation function to identify the convergence of MA to true process optimum
$ε$	accepted variance of $J_{p} (u)$ for convergence (tuning parameter)
$J_{p, b e s t}^{k}$	best value of cost function computed using the plant measurements until the kth iteration of MA
$N_{c}$	number of iterations the convergence criterion has to be satisfied (tuning parameter)
$τ_{p}$	time taken by the process to reach steady-state after a change of the inputs
$τ_{d}$	time required for the sample to reach (or to be transported to) the measurement device
$τ_{m}$	time required for the measurement device to analyze the sample
$τ_{t}$	total time to obtain steady-state measurements after a change in the input
$P_{m a x}$	maximum possible number of input perturbations in the waiting period
$ϕ$	number of inputs given to the process, including the inputs whose measurements are not available
$ψ$	number of inputs given to the process for which the measurements are already available
$ξ$	threshold to stop active perturbations (tuning parameter)
$φ$	threshold for unsuccessful iteration (tuning parameter)
$χ_{c o u n t}$	number of times an unsuccessful input can be probed (tuning parameter)

Abbreviations

The following abbreviations are used in this manuscript:

RTO	Real-time optimization
SQP	Sequential quadratic programming
ISOPE	Integrated system optimization and parameter estimation
IGMO	Iterative gradient modification optimization
MA	Modifier adaptation
FD	Finite differences
QA	Quadratic approximation
DFO	Derivative free optimization
MAWQA	Modifier adaptation with quadratic approximation
APCI	Active perturbation around the current input
APENI	Active perturbation around an estimate of the next input
GMA	Guaranteed model adequacy
NMR	Nuclear magnetic resonance
ATEX	Atmosphere explosibles
IECEx	International electrotechnical commission explosive

References

Marchetti, A.G.; François, G.; Faulwasser, T.; Bonvin, D. Modifier Adaptation for Real-Time Optimization–Methods and Applications. Processes 2016, 4, 55. [Google Scholar] [CrossRef] [Green Version]
Marlin, T.E.; Hrymak, A.N. Real-time operations optimization of continuous processes. In Chemical Process Control-V; AIChE Symposium Series; American Institute of Chemical Engineers: New York, NY, USA, 1997; Volume 93, pp. 156–164. [Google Scholar]
Philip, E.; Murray, W.; Saunders, M.A.; Wright, M.H. User’s Guide for NPSOL 5.0: A Fortran Package for Nonlinear Programming; Technical Report SOL 86–6; Stanford University: Stanford, CA, USA, 2001. [Google Scholar]
Gill, P.E.; Murray, W.; Saunders, M.A. SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization. SIAM Rev. 2005, 47, 99–131. [Google Scholar] [CrossRef]
Wächter, A.; Biegler, L.T. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math. Program. 2006, 106, 25–57. [Google Scholar] [CrossRef]
Kaustuv. IPSOL: An Interior Point Solver for Nonconvex Optimization Problems. Ph.D. Thesis, Stanford University, Stanford, CA, USA, 2009. [Google Scholar]
Conn, A.R.; Scheinberg, K.; Vicente, L.N. Introduction to Derivative-Free Optimization; SIAM: Philadelphia, PA, USA, 2009. [Google Scholar]
Rios, L.M.; Sahinidis, N.V. Derivative-free optimization: A review of algorithms and comparison of software implementations. J. Glob. Optim. 2013, 56, 1247–1293. [Google Scholar] [CrossRef] [Green Version]
Jedrzejowicz, P. Current Trends in the Population-Based Optimization. In Proceedings of the 11th International Conference on Computational Collective Intelligence, Hendaye, France, 4–6 September 2019; Springer International Publishing: Cham, Switzerland, 2019; pp. 523–534. [Google Scholar]
Wang, Z.; Qin, C.; Wan, B.; Song, W.W. A Comparative Study of Common Nature-Inspired Algorithms for Continuous Function Optimization. Entropy 2021, 23, 874. [Google Scholar] [CrossRef]
Jang, S.S.; Joseph, B.; Mukai, H. On-line optimization of constrained multivariable chemical processes. AIChE J. 1987, 33, 26–35. [Google Scholar] [CrossRef]
Chen, C.Y.; Joseph, B. On-line optimization using a two-phase approach: An application study. Ind. Eng. Chem. Res. 1987, 26, 1924–1930. [Google Scholar] [CrossRef]
Roberts, P. An algorithm for steady-state system optimization and parameter estimation. Int. J. Syst. Sci. 1979, 10, 719–734. [Google Scholar] [CrossRef]
Tatjewski, P. Iterative optimizing set-point control—The basic principle redesigned. IFAC Proc. Vol. 2002, 35, 49–54. [Google Scholar] [CrossRef] [Green Version]
Gao, W.; Engell, S. Iterative set-point optimization of batch chromatography. Comput. Chem. Eng. 2005, 29, 1401–1409. [Google Scholar] [CrossRef]
Marchetti, A.; Chachuat, B.; Bonvin, D. Modifier-adaptation methodology for real-time optimization. Ind. Eng. Chem. Res. 2009, 48, 6022–6033. [Google Scholar] [CrossRef] [Green Version]
Gao, W.; Wenzel, S.; Engell, S. A reliable modifier-adaptation strategy for real-time optimization. Comput. Chem. Eng. 2016, 91, 318–328. [Google Scholar] [CrossRef] [Green Version]
Roberts, P. Broyden derivative approximation in ISOPE optimising and optimal control algorithms. IFAC Proc. Vol. 2000, 33, 293–298. [Google Scholar] [CrossRef]
Forbes, J.; Marlin, T.; MacGregor, J. Model adequacy requirements for optimizing plant operations. Comput. Chem. Eng. 1994, 18, 497–510. [Google Scholar] [CrossRef]
Forbes, J.; Marlin, T. Design cost: A systematic approach to technology selection for model-based real-time optimization systems. Comput. Chem. Eng. 1996, 20, 717–734. [Google Scholar] [CrossRef]
Bonvin, D.; Srinivasan, B. On the role of the necessary conditions of optimality in structuring dynamic real-time optimization schemes. Comput. Chem. Eng. 2013, 51, 172–180. [Google Scholar] [CrossRef] [Green Version]
Faulwasser, T.; Bonvin, D. On the Use of Second-Order Modifiers for Real-Time Optimization. IFAC Proc. Vol. 2014, 47, 7622–7628. [Google Scholar] [CrossRef] [Green Version]
Bunin, G.A.; François, G.; Bonvin, D. From Discrete Measurements to Bounded Gradient Estimates: A Look at Some Regularizing Structures. Ind. Eng. Chem. Res. 2013, 52, 12500–12513. [Google Scholar] [CrossRef] [Green Version]
François, G.; Bonvin, D. Use of Convex Model Approximations for Real-Time Optimization via Modifier Adaptation. Ind. Eng. Chem. Res. 2013, 52, 11614–11625. [Google Scholar] [CrossRef] [Green Version]
Ahmad, A.; Gao, W.; Engell, S. A study of model adaptation in iterative real-time optimization of processes with uncertainties. Comput. Chem. Eng. 2019, 122, 218–227. [Google Scholar] [CrossRef]
Gottu Mukkula, A.R.; Engell, S. Guaranteed Model Adequacy for Modifier Adaptation with Quadratic Approximation. In Proceedings of the 2020 European Control Conference (ECC), St. Petersburg, Russia, 12–15 May 2020; pp. 1037–1042. [Google Scholar]
Gao, W.; Hernández, R.; Engell, S. Real-time optimization of a novel hydroformylation process by using transient measurements in modifier adaptation. IFAC-PapersOnLine 2017, 50, 5731–5736. [Google Scholar] [CrossRef]
Ferreira, T.D.A.; François, G.; Marchetti, A.G.; Bonvin, D. Use of Transient Measurements for Static Real-Time Optimization. IFAC-PapersOnLine 2017, 50, 5737–5742. [Google Scholar] [CrossRef]
Rodríguez-Blanco, T.; Sarabia, D.; Pitarch, J.; de Prada, C. Modifier Adaptation methodology based on transient and static measurements for RTO to cope with structural uncertainty. Comput. Chem. Eng. 2017, 106, 480–500. [Google Scholar] [CrossRef]
Cadavid, J.; Hernández, R.; Engell, S. Speed-up of Iterative Real-Time Optimization by Estimating the Steady States in the Transient Phase using Nonlinear System Identification. IFAC-PapersOnLine 2017, 50, 11269–11274.
Krishnamoorthy, D.; Jahanshahi, E.; Skogestad, S. Feedback Real-Time Optimization Strategy Using a Novel Steady-state Gradient Estimate and Transient Measurements. Ind. Eng. Chem. Res. 2018, 58, 207–216.
Gottu Mukkula, A.R.; Wenzel, S.; Engell, S. Active Perturbation in Modifier Adaptation for Real Time Optimization to Cope with Measurement Delays. IFAC-PapersOnLine 2018, 51, 124–129.
Gottu Mukkula, A.R.; Wenzel, S.; Engell, S. Active Perturbations Around Estimated Future Inputs in Modifier Adaptation to Cope with Measurement Delays. IFAC-PapersOnLine 2018, 51, 839–844.
Brdyś, M.; Tatjewski, P. An Algorithm for Steady-State Optimizing Dual Control of Uncertain Plants. IFAC Proc. Vol. 1994, 27, 215–220.
Gao, W.; Wenzel, S.; Engell, S. Modifier adaptation with quadratic approximation in iterative optimizing control. In Proceedings of the 2015 IEEE European Control Conference (ECC), Linz, Austria, 15–17 July 2015; pp. 2527–2532.
Wenzel, S.; Yfantis, V.; Gao, W. Comparison of regression data selection strategies for quadratic approximation in RTO. Comput. Aided Chem. Eng. 2017, 40, 1711–1716.
Gao, W.; Wenzel, S.; Engell, S. Integration of gradient adaptation and quadratic approximation in real-time optimization. In Proceedings of the 2015 34th Chinese Control Conference (CCC), Hangzhou, China, 28–30 July 2015; pp. 2780–2785.
Gottu Mukkula, A.R.; Ahmad, A.; Engell, S. Start-up and Shut-down Conditions for Iterative Real-Time Optimization Methods. In Proceedings of the 2019 Sixth Indian Control Conference (ICC), Hyderabad, India, 18–20 December 2019; pp. 158–163.
Williams, T.J.; Otto, R.E. A generalized chemical processing model for the investigation of computer control. Trans. Am. Inst. Electr. Eng. Part I Commun. Electron. 1960, 79, 458–473.
Friedman, M. The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. J. Am. Stat. Assoc. 1937, 32, 675–701.
Gottu Mukkula, A.; Kern, S.; Salge, M.; Holtkamp, M.; Guhl, S.; Fleischer, C.; Meyer, K.; Remelhe, M.; Maiwald, M.; Engell, S. An Application of Modifier Adaptation with Quadratic Approximation on a Pilot Scale Plant in Industrial Environment. IFAC-PapersOnLine 2020, 53, 11773–11779.
Bieringer, T.; Buchholz, S.; Kockmann, N. Future production concepts in the chemical industry: Modular–small-scale–continuous. Chem. Eng. Technol. 2013, 36, 900–910.
Kern, S.; Wander, L.; Meyer, K.; Guhl, S.; Gottu Mukkula, A.R.; Holtkamp, M.; Salge, M.; Fleischer, C.; Weber, N.; King, R.; et al. Flexible automation with compact NMR spectroscopy for continuous production of pharmaceuticals. Anal. Bioanal. Chem. 2019, 411, 3037–3046.

Figure 1. Illustration of a general process with a measurement device at a remote location.

Figure 2. Illustration of standard modifier adaptation method and of active perturbation method.

Figure 3. Illustration of active perturbation strategy with input perturbations around current input.

Figure 4. Illustration of active perturbation strategy with input perturbations around estimated next input $(\hat{u}_{e}^{k+1})$ represented by ∗.

Figure 5. Illustration of proactive perturbation scheme for MA.

Figure 6. Flowsheet for proactive perturbation scheme for MA.

Figure 7. Williams–Otto reactor case study: evolution of the plant profit function and the inputs from MAWQA with GMA with different active perturbation schemes. The vertical red-dashed line represents the time of convergence. Note that the time axis is scaled differently in the different subplots.

Figure 8. Williams–Otto reactor case study: evolution of mean profit function (solid line), mean time of convergence (vertical dashed line), and standard deviation of the time of convergence. Parameters of criterion for convergence are listed in Table 1.

Figure 9. Detailed reaction mechanism of lithiation reaction. Names of the chemical components are given in Table 3.

Figure 10. (A) picture of modular pilot plant for continuous production of Li-NDPA. (B) close-up of tubular reactor (D, inner diameter = 12.4 mm) and filter section (E). (C) picture of integrated compact NMR spectrometer (43 MHz) with ATEX-certified pressurized housing for online concentration measurements. (F) indicates location of the NIR flow cell.

Figure 11. Lithiation reaction case study: mean and standard deviation of plant profit function and mean time of convergence for 100 simulations. Convergence is assumed when profit for inputs from five consecutive iterations is greater than $1.175 \times 10^{5}$ \$. (a) MAWQA with GMA. (b) MAWQA with GMA with APCI. (c) MAWQA with GMA with APENI. (d) MAWQA with GMA with proactive perturbation.

Figure 12. Lithiation reaction case study: evolution of mean profit function (solid lines), mean time of convergence (vertical dashed lines), and standard deviations of time of convergence.

Table 1. Tuning parameters for simulation study.

No.	Parameter	Williams–Otto	Lithiation Reaction
1.	trust-region scaling $(γ)$	3	3
2.	radius of inner circle $(Δ u)$	0.1	0.125
3.	step length for finite differences $(Δ h_{u})$	0.1	0.2
4.	conditionality limit $(δ)$	0.2	0.25
5.	threshold to stop active perturbations $(ξ)$	0.05	0.1
6.	threshold for unsuccessful iteration $(φ)$	0.005	0.005
7.	number of times an unsuccessful input is probed $(χ_{count})$	3	3
8.	number of inputs looked at for convergence $(N_{c})$	5	5
9.	accepted variance of $J$ for convergence $(ε)$	0.01	−
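
The convergence parameters in rows 8 and 9, together with the threshold stated in the caption of Figure 11, can be read as two simple stopping tests. The following Python lines are only a minimal sketch of that reading, not the authors' implementation: the function names are hypothetical, and the use of the population variance of the unscaled profit values is an assumption, since the exact variance formula is not given in this part of the article.

```python
from statistics import pvariance


def converged_by_variance(profits, n_c=5, eps=0.01):
    """Assumed test: variance of the last n_c profit values is at most eps.

    n_c and eps mirror N_c and epsilon from Table 1 (Williams-Otto column).
    Population variance on unscaled profits is an assumption of this sketch.
    """
    if len(profits) < n_c:
        return False  # not enough iterations to decide yet
    return pvariance(profits[-n_c:]) <= eps


def converged_by_threshold(profits, n_c=5, threshold=1.175e5):
    """Test from the Figure 11 caption: the last n_c profits exceed a threshold."""
    if len(profits) < n_c:
        return False
    return all(p > threshold for p in profits[-n_c:])


if __name__ == "__main__":
    # Made-up profit history, for illustration only.
    history = [150.0, 180.0, 190.90, 190.95, 191.00, 190.98, 190.97]
    print(converged_by_variance(history))
    print(converged_by_threshold([1.0e5] + [1.18e5] * 5))
```

Both functions only look at the most recent `n_c` entries, so they can be called after every RTO iteration with the full profit history.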

Table 2. Description of markers in Figure 7. Input markers in red and black color refer to the input variable $u_{1}$; blue and magenta input markers refer to the input variable $u_{2}$.

Marker	Description
[ , ]	successful-iteration input ( $[u_{1}, u_{2}]$ )
[ , ]	unsuccessful-iteration input ( $[u_{1}, u_{2}]$ )
[ , ]	perturbation input ( $[u_{1}, u_{2}]$ )
	value of the plant profit function

Table 3. List of chemical components involved in lithiation reaction.

Label	Chemical Name (Abbreviation)	Role
A	Aniline (An)	reactant
B	Lithium bis(trimethylsilyl)amide (Li-HMDS)	reactant
C	Lithium phenylazanide (Li-An)	intermediate
D	Hexamethyldisilazane (HMDS)	by-product
E	1-Fluoro-2-nitrobenzene (FNB)	reactant
F	2-Nitrodiphenylamine (NDPA)	intermediate
G	Lithium fluoride (LiF)	by-product
H	Lithium 2-Nitrodiphenylamine (Li-NDPA)	product
	Tetrahydrofuran (THF)	solvent

Table 4. Parameter values used in plant model.

Parameter	Value	Parameter	Value
$Δ H_{1}, \dots, Δ H_{4}$	$-6$ J/mol	$T_{in}$	$283.15$ K
$k_{10}, \dots, k_{40}$	$17.8$ s$^{-1}$	$E$	$25 \times 10^{3}$ J/mol
$KA$	$0.001$ W/K	$Δ H$	$-19$ J/mol
$T_{Envi}$	$293.15$ K	$c_{p}$	$0.123$ J/(mol K)
$ρ$	$900$ kg/m$^{3}$	$R$	$8.314$ J/(mol K)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
