1. Introduction
Nuclear safeguards involve measures designed to deter and detect the diversion of nuclear material from the fuel cycle for illicit purposes [1]. The monitoring for possible data falsification by operators that could mask diversion is based on the statistical analysis of paired (operator, inspector) verification measurements. Significant deviations between the operator declarations and the corresponding inspector measurements may be the consequence of problems with their measurement systems. However, they could also be the result of the operator falsifying the data. In this context, the fundamental elements for a trustworthy analysis are
(i) a proper measurement error model and
(ii) an efficient and unbiased uncertainty quantification (UQ), i.e., the estimation of the associated uncertainty components.
Concerning this second point, two approaches are available: the bottom-up and the empirical top-down approach [2]. The former consists in quantifying and then propagating the uncertainty in all key steps of the assay, in order to derive the resulting uncertainty of the measured values. The latter, instead, relies fundamentally on modeling the measurement data as a one-way random effect model with two variance components, one accounting for the variation within and the other for that between groups. A group is typically defined as an inspection period, within which the measurements are assumed to have the same random error. Between inspection periods, instead, the systematic error captures the changes that may occur in the metrological conditions, such as instrument calibration, change of inspectors, change of background radiation, and so on [3]. Depending on the scope of the analysis, the two variance components are estimated from past inspection data through the classical analysis of variance (ANOVA) [4] or the Grubbs estimator [5].
In empirical applications, the outcomes of the bottom-up and the empirical top-down approach can diverge significantly. Specifically, bottom-up estimates are often larger than their top-down counterparts [6]. The need to increase the consistency between these two results has been stressed more than once (see, for example, [7,8]) and has encouraged the study of potential improvements for both methods [9,10]. However, the proposed developments of top-down methods have never taken into consideration the possible presence of anomalous data in the estimation sample, even though, as previously mentioned, significant deviations between operator and inspector measurements may occur for multiple reasons.
The presence of outliers in the data may in fact seriously affect the reliability of the estimates. In random effect models, even a single anomalous observation may greatly influence the properties of the statistics [11]. The bias introduced by a single anomalous value will also affect the results of subsequent analyses based on the between and within group standard deviations, such as the setting of alarm thresholds for measurements (see, for example, [12]) or the material balance evaluation [1]. Consider, for example, the simulated case study in [12] (see Figure 1a). The 30 observations, equally split into 3 groups, are simulated using a random effect model with between and within group standard deviations both equal to 0.010. They mimic 30 possible relative differences between the measurements provided by the operator and the inspector in three different inspection periods. Classical analysis-of-variance expressions provide respective estimates of 0.009 and 0.007. Suppose now that one single value is changed and converted into an outlier, as done with the observation highlighted in red in Figure 1b. This small change yields a remarkable decrease of the sample average of the 10 observations of group 1, and the classical ANOVA estimates of the between and within group standard deviations become 0.003 and 0.012, respectively. It is important to stress that the 30 simulated observations cannot be considered a time series, even if their plots may suggest it. This is because they mimic a use case in the nuclear safeguards context, where the time lag between two single inspections, and even between two groups of inspections, is neither fixed nor constant. Therefore, methods from the time series literature cannot be employed in this context.
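To make the mechanics of this example concrete, the following minimal Python sketch simulates a balanced sample with the same design (3 groups of 10 observations, both standard deviations equal to 0.010), computes the classical balanced ANOVA estimates, and then recomputes them after converting a single observation into an outlier. The seed, the outlier size and all names are arbitrary choices of ours, so the resulting numbers will not coincide with those reported above; the sketch only illustrates how one anomalous value propagates into both variance component estimates.

```python
import numpy as np

def anova_balanced(y):
    """Classical ANOVA estimates (sigma_S^2, sigma_R^2) for a balanced
    one-way random effect model; y has shape (g, n)."""
    g, n = y.shape
    group_means = y.mean(axis=1)
    msw = ((y - group_means[:, None]) ** 2).sum() / (g * (n - 1))  # within-group mean square
    msb = n * ((group_means - y.mean()) ** 2).sum() / (g - 1)      # between-group mean square
    return max((msb - msw) / n, 0.0), msw                          # (sigma_S^2, sigma_R^2)

rng = np.random.default_rng(1)                     # arbitrary seed, illustrative only
g, n, sigma_S, sigma_R = 3, 10, 0.010, 0.010
y = sigma_S * rng.standard_normal((g, 1)) + sigma_R * rng.standard_normal((g, n))

print("clean sample     :", anova_balanced(y))
y_out = y.copy()
y_out[0, 0] += 0.05                                # turn one observation into an outlier (assumed size)
print("with one outlier :", anova_balanced(y_out))
```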
The aim of this article is to propose an alternative approach to the UQ problem through the introduction of robust estimators of the between and within group variance components. The robust analysis is expected to limit the influence of outliers present in past data and to improve the quality and reliability of the resulting statistical outcomes. For this purpose, we will compare in a simulation exercise the accuracy of the classical estimator with that of one of the many methods available for robustly estimating the variance components of a random effect model. In particular, we focus on the robust estimators proposed by [13], which are based on the Q-estimator of one-sample scale introduced by [14], and whose main strengths are:
- Suitable breakdown point: they can handle a substantial proportion of outliers in the data before returning an incorrect result;
- High efficiency: they provide estimates with desirable properties even when there are no outliers in the data;
- Closed expression: they do not depend on optimization algorithms and are rather easy to calculate.
Thanks to these properties, the proposed estimators are extremely flexible and may represent a step forward in the still ongoing development of reliable UQ methods for error variance estimation [9].
This paper proceeds as follows. In the next section, the measurement error model and the classical estimation of its variance components are introduced. Then, in Section 3, the alternative estimation method based on robust statistics is described. In Section 4, the results of a simulation experiment that analyzes the behavior of the two estimators under different contamination scenarios are presented. An empirical application of UQ to nuclear safeguards measurements through classical and robust estimates is described in Section 5. Finally, Section 6 concludes.
2. Measurement Error Model and the Classical Estimation of Its Variance Components
Given a set of data from past inspections and assuming the same number of measurements in each inspection (i.e., a balanced design), a typical model for representing the inspector ($I$) and operator ($O$) measurements of item $k$ during inspection $j$ is the following:

$$I_{jk} = \mu_{jk} + S_{Ij} + R_{Ijk}, \qquad O_{jk} = \mu_{jk} + S_{Oj} + R_{Ojk},$$

where $\mu_{jk}$ is the (unknown) true value of the measurand, $S_{Ij}$ and $S_{Oj}$ are the short-term systematic errors and $R_{Ijk}$ and $R_{Ojk}$ are the random errors. Therefore, the difference between operator and inspector measurements is given by:

$$D_{jk} = O_{jk} - I_{jk} = (S_{Oj} - S_{Ij}) + (R_{Ojk} - R_{Ijk}) = S_j + R_{jk}, \tag{1}$$

which is a random effect model with variance components equal to $\sigma^2_S = \mathrm{Var}(S_j)$ and $\sigma^2_R = \mathrm{Var}(R_{jk})$.
If the error is expected to scale with the true value of the measurand, the starting models become:

$$I_{jk} = \mu_{jk}(1 + S_{Ij} + R_{Ijk}), \qquad O_{jk} = \mu_{jk}(1 + S_{Oj} + R_{Ojk}),$$

where the errors $S_{Ij}$, $S_{Oj}$, $R_{Ijk}$ and $R_{Ojk}$ are now expressed in relative terms. In this case, the next step is defining the relative difference, given by:

$$D_{jk} = \frac{O_{jk} - I_{jk}}{\mu_{jk}} = (S_{Oj} - S_{Ij}) + (R_{Ojk} - R_{Ijk}) = S_j + R_{jk}. \tag{2}$$

This expression is again a random effect model, with variance components equal to $\sigma^2_S$ and $\sigma^2_R$. Since the measurand $\mu_{jk}$ is unknown, a reasonable solution is to substitute it with $O_{jk}$, because the operator measurement is typically more accurate and precise than the inspector's measurements [7]. Ref. [12] shows that (i) assuming a truncated normal distribution that guarantees finite moments for the ratio $(O_{jk} - I_{jk})/O_{jk}$, and (ii) considering that typically the relative error standard deviations are small, we have that:
- - the distribution of the ratio is extremely close to a normal distribution;
- - it provides an accurate approximation of the target variable $(O_{jk} - I_{jk})/\mu_{jk}$ (a quick numerical check of this approximation is sketched below).
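The following is a quick Monte Carlo check of the two points above, under assumed (and deliberately small) relative error standard deviations; it is not the truncated-normal analysis of [12], only an illustration of why replacing the unknown measurand with the operator value is harmless when the relative errors are small. For simplicity, the group structure is ignored and all error terms are drawn independently; all names and values are ours.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
mu = 1.0                                 # true value; the comparison is scale-free
sig_S, sig_R = 0.005, 0.010              # assumed small relative error SDs (illustrative)

e_I = sig_S * rng.standard_normal(n) + sig_R * rng.standard_normal(n)   # inspector relative error
e_O = sig_S * rng.standard_normal(n) + sig_R * rng.standard_normal(n)   # operator relative error
I = mu * (1.0 + e_I)
O = mu * (1.0 + e_O)

target = (O - I) / mu                    # relative difference w.r.t. the (unknown) true value
proxy = (O - I) / O                      # computable version using the operator measurement

print("SD of target          :", target.std())
print("SD of proxy           :", proxy.std())
print("max |proxy - target|  :", np.abs(proxy - target).max())
z = (proxy - proxy.mean()) / proxy.std()
print("skewness of the proxy :", (z ** 3).mean())   # close to 0 for an (almost) normal ratio
```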
Expressions (1) and (2) both represent balanced one-way random effect models. The former assumes no relation between the magnitude of the measurand $\mu_{jk}$ and its corresponding error, whereas in the latter the error is proportional to the measurand. Since nuclear measurements often have larger uncertainty at larger true values, a multiplicative rather than an additive model is employed [9]. It is also possible to transform a multiplicative model into an additive one through the well-known logarithmic approximation $\log(1 + x) \approx x$. However, this solution is not recommended due to the inaccuracies it could generate in many situations [2].
The parameters that need to be estimated in both (1) and (2) are the two variance components: the short-term systematic one and the random one. In order to extend our focus also to unbalanced cases, in what follows we will refer to the general formulation of the unbalanced one-way random effect model:

$$D_{jk} = \mu + S_j + R_{jk}, \qquad j = 1, \dots, g, \quad k = 1, \dots, n_j, \tag{3}$$

where $S_j \sim N(0, \sigma^2_S)$ and $R_{jk} \sim N(0, \sigma^2_R)$ are mutually independent and $\mu$ is the overall mean. Therefore, we have that:

$$\mathrm{Var}(D_{jk}) = \sigma^2_S + \sigma^2_R, \qquad \mathrm{Cov}(D_{jk}, D_{jk'}) = \sigma^2_S \;\; (k \neq k'). \tag{4}$$

Classical estimators for $\sigma^2_S$ and $\sigma^2_R$ are given by [15]:

$$\hat\sigma^2_R = \frac{1}{N - g} \sum_{j=1}^{g} \sum_{k=1}^{n_j} \left(D_{jk} - \bar{D}_{j\cdot}\right)^2, \qquad \hat\sigma^2_S = \frac{1}{n_0} \left[\frac{1}{g - 1} \sum_{j=1}^{g} n_j \left(\bar{D}_{j\cdot} - \bar{D}_{\cdot\cdot}\right)^2 - \hat\sigma^2_R\right], \tag{5}$$

where:

$$\bar{D}_{j\cdot} = \frac{1}{n_j} \sum_{k=1}^{n_j} D_{jk}, \qquad \bar{D}_{\cdot\cdot} = \frac{1}{N} \sum_{j=1}^{g} \sum_{k=1}^{n_j} D_{jk}, \qquad n_0 = \frac{1}{g - 1}\left(N - \frac{\sum_{j=1}^{g} n_j^2}{N}\right), \qquad N = \sum_{j=1}^{g} n_j.$$

When $n_j = n$ for all $j = 1, \dots, g$, expressions (5) simplify to the well-known ANOVA estimators for the balanced design.

Finally, it could happen that $\hat\sigma^2_S < 0$. In this case, the usual practice is to set $\hat\sigma^2_S = 0$, which implies that the estimated total variance reduces to $\hat\sigma^2_R$. In nuclear safeguards contexts, this case represents the situation where there are no significant differences between groups of measurements, and all the variability is due to the within-group component.
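As an illustration, a minimal Python sketch of the classical estimators in (5), as written above, is reported below; the function and variable names are ours, and the final truncation at zero implements the convention just described.

```python
import numpy as np

def anova_unbalanced(groups):
    """Classical ANOVA-type estimates (sigma_S^2, sigma_R^2) for an unbalanced
    one-way random effect model; `groups` is a list of 1-D arrays, one per inspection period."""
    n_j = np.array([len(gr) for gr in groups], dtype=float)
    g, N = len(groups), n_j.sum()
    means = np.array([np.mean(gr) for gr in groups])
    grand_mean = np.concatenate(groups).mean()

    sigma2_R = sum(((gr - m) ** 2).sum() for gr, m in zip(groups, means)) / (N - g)
    msb = (n_j * (means - grand_mean) ** 2).sum() / (g - 1)         # between-group mean square
    n0 = (N - (n_j ** 2).sum() / N) / (g - 1)                       # effective group size
    sigma2_S = max((msb - sigma2_R) / n0, 0.0)                      # set to 0 when negative
    return sigma2_S, sigma2_R

# toy usage: three inspection periods of unequal size (values are arbitrary)
rng = np.random.default_rng(2)
data = [0.01 * rng.standard_normal() + 0.01 * rng.standard_normal(k) for k in (4, 10, 16)]
print(anova_unbalanced(data))
```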
3. Robust Estimation of Unbalanced One-Way Random Effect Models
Classical estimation of model (3) through expressions (5) yields results with desirable statistical properties whenever no anomalous observations are present in the data. When this assumption is not reliable, the use of robust statistical methods makes it possible to avoid the bias introduced by the contaminated observations. In particular, two different kinds of outlier may affect one-way random effect models:
- (a)
Anomalous values of the group error, also called outlying blocks;
- (b)
Anomalous values of the individual measurement error, also called outlying measurements within blocks.
This double nature of contamination is also reflected in a double definition of the breakdown point, defined in general as the smallest proportion of outlying observations that can lead to estimated values which tend to 0 or ∞. Following the definitions provided by [16], we will refer to the block breakdown point for contamination (a) and to the measurement breakdown point for contamination (b). In addition, [17] proposed a third outlier category, represented by an extremely small or large variation within one group, which will not be considered in detail here.
In statistical terms, the price to pay when using robust estimators is a loss of efficiency, given that they are in general less efficient than their classical counterparts. However, a careful choice of the robust method can considerably mitigate this drawback, by selecting the robust estimator that offers the most convenient trade-off between breakdown point and efficiency. To this end, several options are available for robustly estimating $\sigma^2_S$ and $\sigma^2_R$: from the M-estimators described in [18,19,20,21] to the S-estimator proposed by [22] and the GM-estimator developed by [23]. Among all these options, we decided to consider the estimators proposed by [13], which are based on the Q-estimator of one-sample scale introduced by [14]. Their definition requires the introduction of two sets of differences: the set $\mathcal{A}$ of the absolute intragroup differences $|D_{jk} - D_{jk'}|$, computed within each group, and the set $\mathcal{B}$ of the absolute second-order differences, obtained as the union of the intergroup difference sets of each group pair. The calculation of the estimators is based on the first or second quartile of these two numerical sets. The robust estimator for $\sigma_R$ is given by:

$$\hat\sigma_{R,Q} = 2.2191 \; Q_{0.25}(\mathcal{A}), \tag{6}$$

whereas the two alternative proposals (7) for estimating $\sigma_S$, denoted here by $\hat\sigma^2_{S,Q1}$ and $\hat\sigma^2_{S,Q2}$, are based on the quartiles of the two difference sets and on the constants 1.0484 and 1.5692 (their explicit expressions can be found in [13]). Here, $Q_p(\cdot)$ represents the $p$-percentile of a set. The constant factors 2.2191, 1.0484 and 1.5692 guarantee Fisher consistency of the estimators when the data are normally distributed. Therefore, in applications with real data, a normality test for assessing the Gaussian assumption is recommended. Finally, as for the classical estimates, there is the possibility of obtaining incoherent estimates, that is,

$$\hat\sigma^2_{S,Q1} < 0 \quad \text{or} \quad \hat\sigma^2_{S,Q2} < 0, \tag{8}$$

and the solution in this case is again to set the corresponding estimate equal to 0.
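A minimal sketch of the within-group part is given below: it computes estimator (6) as written above, i.e., the first quartile of the absolute intragroup differences rescaled by 2.2191. The percentile convention (here NumPy's default interpolation) and all names are implementation choices of ours; the two proposals for $\sigma_S$ are not reproduced, since their explicit expressions follow [13].

```python
import numpy as np

def robust_sigma_R(groups):
    """Q-type robust estimate of sigma_R in the spirit of (6): 2.2191 times the
    first quartile of the absolute intragroup (pairwise) differences."""
    diffs = []
    for gr in groups:
        gr = np.asarray(gr, dtype=float)
        i, j = np.triu_indices(len(gr), k=1)      # all pairs within the group
        diffs.append(np.abs(gr[i] - gr[j]))
    A = np.concatenate(diffs)                     # the set of absolute intragroup differences
    return 2.2191 * np.percentile(A, 25)

# toy usage: three groups of ten observations, with one gross outlier in the first group
rng = np.random.default_rng(3)
groups = [0.01 * rng.standard_normal(10) for _ in range(3)]
groups[0][0] += 0.1
classical = np.sqrt(sum(((gr - gr.mean()) ** 2).sum() for gr in groups) / (30 - 3))
print("robust sigma_R   :", robust_sigma_R(groups))
print("classical sigma_R:", classical)
```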
The reasons that motivated the choice of these particular estimators are the following:
- Suitable breakdown point: [16] proved that, in the case of a balanced model (i.e., $n_j = n$ for all $j$ in expression (3)), the three estimators share the same block breakdown point, whereas their measurement breakdown points are asymptotically larger than or equal to 0.25.
- High asymptotic efficiency: defining the asymptotic efficiency as the ratio between the asymptotic variances of the classical estimators and those of their robust counterparts, [16] proved that the asymptotic efficiency of two of the three estimators in a balanced experiment is at least 82%, and can even exceed 90% depending on the values of $n$ and $g$. The asymptotic efficiency of the third, instead, starts at 37% for the smallest group sizes and increases up to 86% as $n$ increases.
- Closed expression: $\hat\sigma_{R,Q}$, $\hat\sigma^2_{S,Q1}$ and $\hat\sigma^2_{S,Q2}$ are the direct outcome of an explicit formula applied to a set of data. This means that they are not the result of iterative procedures and/or optimization algorithms that may be difficult to implement and, more importantly, rather time consuming. The only feature of the proposed estimators that might lead to a sensible increase in the execution time of the quartile calculation is the growth of the cardinality of $\mathcal{A}$ and $\mathcal{B}$ when $N$ and $g$ increase. However, for moderate values of $N$ and $g$ such as the ones considered in this article, this does not seem to be an issue. Moreover, $\hat\sigma_{R,Q}$, $\hat\sigma^2_{S,Q1}$ and $\hat\sigma^2_{S,Q2}$ do not depend on the estimation of any location parameter.
4. Small Sample Comparison of Classical and Robust Estimates
In this section, we study and compare, through a simulation exercise, the behavior of the classical and robust variance estimators defined in (5)–(7). To this end, we first simulate random effect models according to (3) without any kind of contamination. The objective of this first experiment is twofold. On the one hand, we want to compare the behavior of classical and robust estimates in an ideal empirical environment. On the other hand, it represents a benchmark for a fair assessment of the effects of outliers on both estimates in the second set of simulations, which involves contaminated samples.
In order to have an exhaustive view of the main patterns that characterize the estimators in a finite-sample environment, the contaminated and uncontaminated simulations consider different combinations of settings regarding:
- (i) the total number of observations $N$ (equal to 30, 60, 100 and 200; see Table 1);
- (ii) the number of groups $g$, which in the nuclear safeguards context corresponds to the number of calibration or inspection periods;
- (iii) balanced and unbalanced samples. The values of $n_j$ that describe the unbalanced cases are provided in Table 1;
- (iv) the ratio between the systematic variance component and the total variance, $\rho = \sigma^2_S/(\sigma^2_S + \sigma^2_R)$. Actually, looking at expression (4), the value of $\rho$ represents the correlation between observations in the same group. Fixing the value of the total variance to 1 (that is, $\sigma^2_S + \sigma^2_R = 1$), we will consider three different cases:
- - a value of $\rho$ smaller than 0.5, which implies $\sigma^2_S < \sigma^2_R$;
- - $\rho = 0.5$, which implies $\sigma^2_S = \sigma^2_R = 0.5$;
- - a value of $\rho$ larger than 0.5, which implies $\sigma^2_S > \sigma^2_R$.
Similar to [23], in each simulated sample we considered three different contamination schemes (a schematic implementation is sketched after this list):
- (A)
10% of the random errors $R_{jk}$, located in different groups, are replaced by outlying values;
- (B)
10% of the short-term systematic errors $S_j$ are replaced by outlying values. For this scheme, we skip the case $g = 3$ and, when $g = 6$, we contaminate one systematic error in each simulation, i.e., slightly more than 10%;
- (C)
10% of the random errors $R_{jk}$, located in different groups, are replaced by outlying values and, in addition, at least 10% of the short-term systematic errors $S_j$ whose random errors are not contaminated are replaced by outlying values. Whenever possible, the two contaminations do not involve the same group. As before, we skip the case $g = 3$ and, when $g = 6$, we contaminate one systematic error in each simulation, i.e., slightly more than 10%.
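The sketch below shows one possible way to implement the mechanics of schemes (A) and (B) for an unbalanced sample generated according to (3) with total variance 1. The contaminating values are drawn here from a normal distribution with an inflated standard deviation (the factor `scale = 5` is purely our assumption; the actual contaminating distribution follows [23] and is not reproduced here), and scheme (C) would simply combine the two functions.

```python
import numpy as np

def simulate_sample(n_j, rho, rng):
    """Unbalanced one-way random effect sample with total variance 1
    (sigma_S^2 = rho, sigma_R^2 = 1 - rho). Returns the group effects and the data."""
    S = np.sqrt(rho) * rng.standard_normal(len(n_j))
    groups = [S[j] + np.sqrt(1.0 - rho) * rng.standard_normal(k) for j, k in enumerate(n_j)]
    return S, groups

def contaminate_A(S, groups, rng, frac=0.10, scale=5.0):
    """Scheme (A): replace about 10% of the random errors, spread over different groups,
    by draws with an inflated standard deviation (the inflation factor is an assumption)."""
    out = [gr.copy() for gr in groups]
    N = sum(len(gr) for gr in out)
    for m in range(max(1, round(frac * N))):
        j = m % len(out)                                   # spread the outliers over different groups
        out[j][rng.integers(len(out[j]))] = S[j] + scale * rng.standard_normal()
    return out

def contaminate_B(S, groups, rng, frac=0.10, scale=5.0):
    """Scheme (B): replace the systematic error of about 10% of the groups (at least one)
    by a draw with an inflated standard deviation (again, an assumed contamination)."""
    out = [gr.copy() for gr in groups]
    for j in rng.choice(len(out), size=max(1, round(frac * len(out))), replace=False):
        out[j] = out[j] - S[j] + scale * rng.standard_normal()   # swap the group effect
    return out

rng = np.random.default_rng(4)
S, clean = simulate_sample((4, 10, 16), rho=0.3, rng=rng)
dirty_A, dirty_B = contaminate_A(S, clean, rng), contaminate_B(S, clean, rng)
```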
The programming environment used for the simulation exercise is MATLAB (release R2021a). The codes for running and replicating our experiment are available upon request; they are described in Appendix A, where we also provide all the implementation details.
Before presenting the results, it is important to notice that condition (8) may lead to two sets of estimates of the variance components, one connected to $\hat\sigma^2_{S,Q1}$ and a second connected to $\hat\sigma^2_{S,Q2}$. However, the simulation outcomes did not highlight relevant differences in the average behavior of the two sets. Therefore, only the results obtained with the first will be presented.
Table 1. List of the $n_j$ values in the unbalanced samples.

N | g | $n_j$
---|---|---
30 | 3 | 4,10,16
30 | 6 | 3,3,5,5,7,7
60 | 6 | 4,4,8,12,16,16
60 | 10 | 4,4,5,5,6,6,7,7,8,8
100 | 10 | 4,4,7,7,10,10,13,13,16,16
100 | 20 | 3,3,3,3,4,4,4,4,5,5,5,5,6,6,6,6,7,7,7,7
200 | 20 | 4,4,4,4,7,7,7,7,10,10,10,10,13,13,13,13,16,16,16,16
200 | 40 | 3,3,3,3,3,3,3,3,4,4,4,4,4,4,4,4,5,5,5,5,5,5,5,5,6,6,6,6,6,6,6,6,7,7,7,7,7,7,7,7
4.1. Simulation Results—No Contamination
Figure 2 presents the values of the Mean Absolute Bias (MAB), multiplied by 100, of the classical and robust variance component estimates obtained in the 100,000 simulations. As expected, in all cases the absolute biases decrease considerably on average when $N$ and $g$ increase. Similar patterns characterize both the classical and the robust estimates, with the only difference that the classical estimates always perform better than the robust ones. There are only marginal differences between the MAB in balanced and unbalanced samples, with the former slightly outperforming the latter. The value of $\rho$ has, instead, a remarkable effect on the outcomes: increasing $\rho$ always yields worse estimates of $\sigma^2_S$ and a more accurate estimation of $\sigma^2_R$. The pattern of the MAB is also affected by the values of $N$ and $g$: for one of the two variance components, increasing $g$ (for a fixed $N$) worsens the accuracy of both classical and robust estimators, whereas the opposite happens if we increase $N$ (for a fixed $g$); the estimates of the other component, instead, seem to be only marginally influenced by the combination of $N$ and $g$. Finally, it is worth pointing out that one of the two robust proposals for $\sigma^2_S$ always provides more accurate estimates on average than the other.
Figure 3 shows the estimated efficiencies of the three robust estimators $\hat\sigma_{R,Q}$, $\hat\sigma^2_{S,Q1}$ and $\hat\sigma^2_{S,Q2}$. Following [24], in order to obtain a natural measure of the accuracy of an estimator, the calculation of the efficiency is based on the standardized variance which, for a generic estimator $T$, can be calculated as:

$$\mathrm{SV}(T) = \frac{N \, \mathrm{Var}(T)}{\mathrm{Mean}(T)^{2}},$$

where $\mathrm{Mean}(T)$ and $\mathrm{Var}(T)$ refer to the values obtained from the simulations. The ratio between the standardized variances of the classical and the robust estimator then provides an estimate of the efficiency. Coherently with what is proved in [16], the panels of Figure 3 confirm that one of the robust estimators is less efficient than another and that, for both of them, the efficiency does not depend in general on the ratio $\rho$ or on the values of $N$ and $g$: even in small samples, the estimated efficiency of the more efficient of the two is about 95% in balanced samples and 90% in unbalanced samples, whereas that of the other oscillates around 65% in both cases. The estimated efficiency of the third estimator, instead, varies from 30% (when $N$ and $g$ are small) to 85% (for larger values of $N$ and $g$). The value of $\rho$ has a marginal effect, whereas remarkable differences between balanced and unbalanced samples characterize the estimated efficiency of this last estimator.
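For completeness, the sketch below shows how the standardized variance and the resulting efficiency can be computed from the simulated estimates, using the formula written above (itself a reconstruction consistent with the definition attributed to [24]); the function names and the toy numbers are ours.

```python
import numpy as np

def standardized_variance(estimates, N):
    """SV(T) = N * Var(T) / Mean(T)^2, computed over the simulated estimates."""
    est = np.asarray(estimates, dtype=float)
    return N * est.var() / est.mean() ** 2

def efficiency(classical_estimates, robust_estimates, N):
    """Efficiency of the robust estimator: classical-to-robust ratio of standardized variances."""
    return standardized_variance(classical_estimates, N) / standardized_variance(robust_estimates, N)

# toy usage with artificial sampling distributions (illustrative numbers only)
rng = np.random.default_rng(5)
cls = 1.0 + 0.10 * rng.standard_normal(10_000)
rob = 1.0 + 0.11 * rng.standard_normal(10_000)
print(efficiency(cls, rob, N=60))          # close to (0.10 / 0.11)^2 ~ 0.83
```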
In conclusion, the simulation results on non-contaminated data provide a comprehensive description of the finite-sample properties of the classical and robust estimators of the variance components. Even though the classical estimates are more accurate on average, the gap with respect to their robust counterparts is limited. Also in terms of efficiency, the choice of the robust alternative does not imply a remarkable loss, especially for the most efficient of the three robust estimators.
4.2. Simulation Results—Contaminated Samples
Figure 4 shows the effects of contamination (A) on the classical and robust estimates in terms of MAB, and allows these results to be compared with the benchmark case of no contamination. As expected, both panels highlight a remarkable increase of the estimated bias of all estimators. However, the robust estimators are able to limit the distortion produced by the outlying observations, showing greater accuracy than the classical estimators. This is particularly evident for the estimates of $\sigma^2_R$, independently of the simulation settings. As expected, the greater efficiency of some of the robust estimators highlighted in Figure 3 is counterbalanced by a smaller accuracy in contaminated contexts. Concerning the estimates of $\sigma^2_S$, the robust estimators clearly show more accurate outcomes for the smaller values of $\rho$, whereas for the largest value of $\rho$ they are slightly outperformed by the classical estimator only in small samples (i.e., for the smallest values of $N$). However, in empirical applications on nuclear measurement data, $\rho$ is usually small, given that $\sigma_S$ tends to be smaller than $\sigma_R$ [10].
Results concerning contamination (B) are displayed in Figure 5. Since the contamination involves only the systematic component of model (2), both the classical and the robust estimates of $\sigma^2_R$ are not affected by the anomalous data and confirm the results previously observed in Figure 2b. Their MAB values are almost the same, and this explains why in Figure 5b the grey lines (representing the results on contaminated samples) are indistinguishable from the black lines (representing the results on samples without contamination). The classical and robust estimates of $\sigma^2_S$, instead, worsen their performance due to the contamination. However, once again, the robust version of the estimator limits the negative effects of the outliers and provides more accurate estimates. This is particularly true when the samples are balanced and, in unbalanced designs, for a subset of the considered settings.
Finally, Figure 6 presents the simulation results obtained for contamination (C). The better results in terms of MAB offered by the robust estimators are quite evident. The two robust estimators of $\sigma^2_S$ always provide remarkably smaller values of MAB than the classical one, with, again, one of the two proposals outperforming the other. The same can be said for $\sigma^2_R$, with the only exception of small unbalanced samples, where the two estimators yield approximately the same outcome.
Figure 7 shows in detail the simulated observations for a balanced sample divided into $g = 6$ groups under contamination (A). These data may mimic the situation where an inspector has to assess the measurements obtained in the sixth inspection period. The assessment is usually based on the outcomes obtained in the previous five inspections and on a threshold rule, based on the estimated standard deviations, for the declaration of the outliers. This common practice implicitly assumes that the previous measurements are free from anomalies. The plot clearly shows that, if this assumption does not hold, the outlier in the period of interest could remain undetected. Robust estimators, instead, are calculated using also the values of the inspection period that we aim to assess and, being only partially affected by past anomalous values, allow a correct identification of the outlier in the last inspection.
5. Empirical Application
Figure 8 presents the relative differences $D_{jk}$ effectively registered in 6 inspection periods where impure plutonium items were measured in attended mode. The 52 values form an unbalanced sample, since the numbers of measurements in the 6 inspection periods are 12, 8, 10, 10, 7 and 5. There are 4 values (highlighted in red) that are clearly far from the general pattern. A posterior cross-check between the operator and inspector measurements confirmed that those 4 values were inconsistent with the declarations and that the corresponding measurements were actually outliers.
Classical estimation gives a total standard deviation of 0.1315, with the two variance components estimated (in standard deviation terms) as 0.1133 and 0.0667. The robust method, instead, leads to smaller values and, for both proposals $\hat\sigma^2_{S,Q1}$ and $\hat\sigma^2_{S,Q2}$, the robust estimates satisfy condition (8); we then need to set the between-group component to 0, which in turn implies that the estimated variability is entirely attributed to the within-group component. Therefore, the presence of the 4 anomalous data had a twofold effect on the outcomes:
- the classical estimates of the variance components are always larger than the robust ones;
- differently from the classical estimation, the robust estimation does not detect any relevant short-term systematic error component.
Coherently with what we observed in the simulations, the presence of outliers in the sample yields an overestimation of the variance components that the robust method is capable of limiting. The thresholds represented in the figure provide a further confirmation. The classical ones, based on the sample mean and on the classical estimates of the variance components, include almost all the outliers. This is due to the effect that the 4 anomalies have on the calculation of both the sample mean and the variance components. The robust ones, based on the median and on the robust estimates, are tighter and more centered on the good measurements, given that the median is a robust estimator of central tendency whose value is not affected by the anomalous data. As a result, the four outliers all fall outside the robust thresholds.
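As an illustration of the threshold comparison just described, the sketch below computes classical thresholds around the sample mean and robust thresholds around the median, and flags the measurements falling outside them. The multiplier k = 3 and the way the two components are combined into a total standard deviation are our assumptions, since the exact rule used for Figure 8 is not spelled out in the text; the measurement data themselves are not reproduced here.

```python
import numpy as np

def thresholds(center, sigma_S, sigma_R, k=3.0):
    """Lower and upper limits: center +/- k times the total standard deviation."""
    total_sd = np.sqrt(sigma_S ** 2 + sigma_R ** 2)
    return center - k * total_sd, center + k * total_sd

def flag_outliers(d, lo, hi):
    """Indices of the measurements falling outside the interval [lo, hi]."""
    d = np.asarray(d, dtype=float)
    return np.where((d < lo) | (d > hi))[0]

# d = array with the 52 relative differences of Figure 8 (not listed in the text)
# classical thresholds, using the estimates reported above (total SD ~ 0.1315):
#   lo_c, hi_c = thresholds(np.mean(d), 0.1133, 0.0667)
# robust thresholds, with the robust between-group component set to 0 by condition (8):
#   lo_r, hi_r = thresholds(np.median(d), 0.0, sigma_R_robust)
#   print(flag_outliers(d, lo_r, hi_r))
```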
6. Conclusions
The availability of an efficient and unbiased estimation of the uncertainty components in nuclear safeguards measurements is a fundamental element for a reliable detection of possible data falsification by operators that could mask diversion. Top-down methods for UQ rely fundamentally on modeling measurement data as a one-way random effect model with within and between group variance components, which are usually estimated through the classical ANOVA expressions. However, the reliability of these estimates could be seriously affected by the presence of outliers in the data.
The aim of this article was to introduce the use of robust approaches in the UQ of nuclear safeguards measurements, and to analyze the general benefits in terms of estimation accuracy and bias reduction. For this purpose, we proposed robust estimators of the variance components that combine several desirable statistical properties, i.e., a suitable breakdown point, high efficiency and a closed expression. The simulation results and the empirical application showed that the presence of even one single outlier may significantly affect the reliability of the classical estimates. The robust estimators, instead, have the capacity to handle anomalous data and to remarkably limit their negative effects on the final estimates.
Robust statistics offers many other methods for estimating the within and between group variances. The promising outcomes obtained through the simulation exercise and the empirical application may be considered a starting point for a broader investigation aimed at comparing different robust estimators for one-way random effect models in simulated and empirical contexts, and at identifying the most suitable one. Such an analysis will undoubtedly represent a solid contribution to the still ongoing development of reliable UQ methods for error variance estimation.