1. Introduction
In case of a nuclear accident, radioactive particles and gasses may be released to the atmosphere. Consequently, an important part of emergency preparedness is to run simulations with atmospheric dispersion models, thereby predicting the atmospheric distribution as well as deposition of radioactive particles and gasses on the surface of the Earth. However, such models are subject to a number of uncertainties, the most important being the uncertainties of the meteorological predictions, inaccurate physics parameterizations in the dispersion model, and uncertainties of the estimated source term. Immediately after an accident in a nuclear power plant, only limited information about the release may be available. Thus, at the early stages of the accident, the dominating source of uncertainty is most likely the source term. If this is the case, inverse modelling can be used to obtain a source term estimate, which in turn can be used for running the atmospheric dispersion model. The aim of this study is to develop an inverse method for source term estimation, which is suited for operational use for emergency preparedness at the early stages of an accident, i.e., providing a source term estimate based on the limited data available shortly after the accident.
In the early phase of a nuclear power plant accident, a limited number of air concentration observations will be available, and these will typically have a low spatial and temporal resolution, e.g., the filters in such measurement stations may be changed every 24 h or even less frequently. In addition, there may exist gamma dose rate observations at much higher resolution, both spatially and temporally. However, since such measurements are the sum of contributions from all the different radionuclides, it is not clear a priori if they are useful for source term estimation.
Previous studies have used inverse methods for source term estimation. Lately, the still unaccounted for release of Ru-106 in the fall of 2017, was subject to several studies, e.g., [
1,
2,
3,
4]. However, since the release location has still not been confirmed, the main focus of these studies is localization of the source. The Fukushima Daiichi nuclear disaster in 2011, on the other hand, demonstrated that in-plant monitoring systems may not be working during a severe accident. Thus, different inverse methods have been applied in order to assess the source term. Some studies have estimated the release of certain radionuclides based solely on air concentration measurements [
5,
6], other include surface deposition measurements [
7,
8], while other again also include gamma dose rates [
9]. Saunier et al. [
9] demonstrate that information about ratios between the amounts of certain radionuclides can be used to further constrain the release rates. They use a variational approach to assess the source term, thereby providing a deterministic estimate. However, by using different Bayesian approaches, Liu et al. [
6] show that significant uncertainties are associated with the estimated source term, indicating that probabilistic methods are better suited for this type of problem.
Most previous studies in this field aim at estimating the source term associated with accidents a long time after they occurred. However, for emergency preparedness, it is also important to be able to estimate source terms during the early stages, where especially air concentration measurement data are limited. This was addressed by Saunier et al. [
9], who further developed their method to be applicable in real-time in case of an accident [
10]. Our method is inspired by Saunier et al. [
9,
10], but instead we use a Bayesian inference method to be able to realistically account for uncertainties of the estimated source term, similar to Liu et al. [
6].
The method is applied to an idealized artificial release case from the Finnish Loviisa nuclear power plant. A set of simulated air concentration measurements and gamma dose rate measurements have been created as described in
Section 2.1. The same meteorological data and dispersion model have been used for data creation and for the source term estimation. Thus, the study demonstrates the uncertainties of the estimated source term arising only from the information loss due to the limited measurement capabilities. Due to the idealized nature of our study, our results apply to a situation, where model errors are negligible. In reality, meteorological uncertainties and model errors will further increase the uncertainty of the estimated source term.
3. Results and Discussions
As described in
Section 2.5, the results are obtained by using the NUTS algorithm [
25], which is implemented in the PyMC3 python library [
26]. The algorithm is constructed in such a way that almost no parameter tuning is necessary. To ensure convergence, the target acceptance rate was increased from the default 0.8 to 0.99. Aside from this, everything was kept at PyMC3’s default values; two simultaneously running chains, each with 1000 tuning steps and 1000 draws from the target distribution. This provides a total of 2000 realizations of the posterior probability distribution. For further details on the NUTS parameters, see [
25,
26].
In our analysis, we include 10 of the 11 radionuclides described in
Section 2.1.1, excluding Xe-135m based on the rationale that its short half-life of approximately 15 min makes it unimportant on longer temporal, and thus also spatial scales. This means that there is not enough information in the measurement data to sufficiently constrain the release rate of Xe-135m. The other three noble gasses are included, although there are no filter measurements to help constrain their release rates. However, as long as their half-lives are sufficiently different, we expect the gamma dose rate patterns to differ enough to be able to distinguish between their contributions. The prior probability distributions for the release rates of Kr-88, Xe-133, Xe-135, Cs-137, I-131 and Te-132 were defined as log-normal distributions, Equation (
4) with mean and standard deviations given by Equation (
5). The release rate for Cs-134 was defined as a deterministic variable, equal to the release rate for Cs-137 multiplied by the fixed ratio
. Finally, the prior distributions for the release rates of I-132, I-133 and I-135 were defined as bound log-normal distributions Equation (
7), where the bounds are given by the flexible constraints, Equation (
6).
We assume that the time of the emergency shutdown of the nuclear reactor (SCRAM), 22 September, 05:00 UTC, is known. We therefore consider this as the first possible time of release. We then consider the release during the following 24 h by assuming twelve 2-h constant releases, i.e.,
s. The source term estimation is based on the simulated measurements described in
Section 2.1.1, but only measurements until 23 September, 08:00 UTC are used for the source term estimation, leaving the remaining measurements for validation of model predictions based on the estimated source term. Thus, for all particles, only two 24-h filter measurements from each of the five filter stations are available, i.e., ten filter measurements per particle. However, first, all measurements without any information are discarded; if a given measurement is not influenced by any of the time-binned unit releases, it is removed from the data set. After this automatic removal of data, only one filter measurement per particle from each of the two filter stations in southern Finland are left. Thus, even when using the additional constraints described in
Section 2.6, the amount of filter measurement data is very limited.
The gamma dose rates, on the other hand, are measured every hour at 214 different locations, see
Figure 1. Thus, from 22 September, 05:00 UTC to 23 September, 08:00 UTC, a total of 5778 measurements. After the automatic removal of data without information, 1918 measurements are left.
Given the high dimensionality of the parameter space, it is not possible to visualize all elements of the actual posterior distribution. Instead the individual release rates are shown in
Figure 2. The plots show the median release rates as well as the 10th and 90th percentiles based on marginal distributions for each 2-h release period. Further,
Figure 3 shows histograms of the marginal distributions of time integrated releases for all radionuclides. The only release rate, which is well determined for most time bins is that of Xe-133. This makes sense, since it is the only relatively long-lived noble gas; the half-life is approximately five days, while Xe-135 and Kr-88 have half-lives of roughly nine and three hours, respectively. Further, since the noble gasses do not deposit, the gamma dose rate pattern of Xe-133 will also be easy to distinguish from those of the long-lived particles. For the particles, the estimated release rates clearly indicate the effects of the constraints in Equation (
6); the release rates of the four iodine isotopes, which are all “tied together”, are better estimated than those of both the caesium isotopes and of Te-132. Since the release rates of the two caesium isotopes are forced to differ only by a factor, we also expect these to be better estimated than the release rate of Te-132. While it is not easy to see that this is the case, it is clear from
Figure 3 that the released amounts of the two caesium isotopes are better estimated than Te-132.
The histograms in
Figure 3 show that for some radionuclides, the amounts are quite well constrained, e.g., the release of I-131, which varies from roughly 70 PBq to 180 PBq, and Xe-133, which varies from roughly 3.6 EBq to 4.4 EBq. The latter, however, only barely include the true released amount in the probability distribution. For the remaining radionuclides, the released amounts are not very accurately estimated, especially not for Kr-88 and Xe-135. Given the limited amount of measurement data, this result is not surprising. Further, it is important to note that the log-normal prior distribution ensures release rates of positive values. Hence, the estimated release will necessarily have the same duration as the considered release period, 24 h in this case. However, we see from
Figure 2 that most release rates drop significantly in magnitude after 12 h from SCRAM.
From
Figure 2 and
Figure 3, it may seem that the source term is not sufficiently constrained by the data. Clearly, release rates for some nuclides are poorly estimated, e.g., Kr-88 and Xe-135, and it may therefore be tempting to exclude these from the source term. However, we found that when excluding these, the estimated release rates of the remaining nuclides are less accurate. Thus, it seems that the release of some of the other nuclides compensate for their lacking contribution. On the other hand, it is important to note that including Kr-88 and Xe-135 in the source term does not seem to compromise the release rates of the remaining nuclides. Thus, when it is not known a priori which nuclides constitute the best possible source term, the safer choice seems to be to include more nuclides than necessary. Further, the marginal distributions are obtained by integrating over the remaining parameters of the multi-nuclide source term, and therefore all correlations between parameters are ignored. As demonstrated below, though the marginal distributions of individual releases might be uncertain, the gamma dose rate patterns of different realizations of the multi-nuclide source term vary significantly less.
Figure 4 shows predicted air concentrations and gamma dose rates as function of observations. The upper plots show filter measurements, and the lower plots show gamma dose rates. The left plots show measurements before 23 September, 08:00 UTC, i.e., the measurements that are used for the source term estimation. The right plots show measurements after 23 September, 08:00 UTC and therefore show a prediction of future values based on the estimated source term. The percentiles are estimated by first calculating the concentrations and gamma dose rates from all source terms in the posterior distribution and then finding the percentiles in the calculated values. The plots with the gamma dose rates show a randomly selected subset of 300 observations, since more data in the plot makes it impossible to distinguish the different data points. The figure shows that the average activity concentrations at the filter stations are generally estimated to match the observations within the uncertainties, although some allow for a wide variation. On the other hand, the predicted gamma dose rates fit very well with the observed even for the predicted values. Considering the fact that a total of 1918 gamma measurements and only 2 filter measurements for each nuclide are used for the inversion, it is not surprising that the gamma dose rates are more accurately estimated.
Figure 5 shows the predicted gamma dose rates at the locations of six selected gamma stations, viz. the six stations that measured the highest values. The plots show that there is good agreement between modelled observed gamma dose rates and that even the time evolution is captured very well.
Finally,
Figure 6 shows the probability distributions of the two uncertainty parameters
and
; both parameter distributions indicate relatively narrow log-normal distributions, which is expected given that model errors are negligible.
3.1. Including All Data
For comparison, we show the estimated source term when including all measurements.
Figure 7 shows the release rates and probability densities of released amounts for three selected nuclides, Cs-134, I-131 and Xe-133. Interestingly, the release rates are all better defined than the previous result, i.e., the distributions are narrower. However, the release rate estimates are not necessarily more accurate. On the other hand, comparison with
Figure 2 shows that the use of later measurements allows for a better estimate of the duration, as all release rates are very low after 16 h from the SCRAM.
As discussed previously, there are not many filter measurements available, and therefore the gamma dose rates are dominant; thus, the estimated source term is more likely to match the gamma dose rates than the filter measurements. This is apparent from
Figure 8, which shows the modelled air concentrations and gamma dose rates as function of observations, similar to
Figure 4. There is a very good agreement for gamma dose rates, while for air concentrations, the discrepancy is somewhat larger.
3.2. Efficiency
Regarding efficiency, we only have rough estimates of the computation time. However, we see that the time depend strongly on the amount of data included. The computation time for the first result, using data from only the first 24 h, was approximately half an hour. When including all data, the computation time was approximately 3.5 h. These estimates are the wall times of the runs of the NUTS algorithm, when running the algorithm in parallel on two CPUs on a standard modern laptop. In addition, some time is of course required for running the dispersion model and restructuring the data.
When operationalized, the code should be adapted to run on an HPC facility to further decrease computation time. In addition, the total set of gamma dose rate observations constitute 8953 measurements from a relatively dense network sampling at every hour. We suspect that there is a lot of redundant information in this data set, so instead using a subsample of this data set might be sufficient and would reduce computation time significantly.
4. Summary and Conclusions
We have developed a Bayesian inverse method for probabilistic source term estimation to be used for accidental nuclear releases to the atmosphere. The source term probability distribution is sampled using the Hamiltonian Monte Carlo algorithm NUTS, which is robust and needs only limited parameter tuning. In theory, this makes it directly applicable to other cases without making significant changes to the method.
The method is applied to a synthetic data set derived by running an atmospheric dispersion model for a realistic accident at a nuclear power plant. The data set consists of air concentration measurements at existing filter stations as well as gamma dose rates at gamma stations. We have shown that even with a limited set of air concentration measurements, realistic source term estimation is possible based on early observations of gamma dose rates. Further, the results indicate that additional constraints on the release rates based on information on the nuclear reactor core inventory can be used to improve the accuracy of the predictions. The estimated released amounts of most individual radionuclides are described by relatively wide probability distributions. However, the gamma dose rates predicted using the probabilistic source term correspond well with observations.
Of course, when applied to a real-world case, we expect that model errors will reduce the accuracy of the predictions to some extent. However, if the models used are unbiased, we anticipate that the predicted gamma dose rates will still be more accurately estimated than the release rates of the individual radionuclides. Further, to make the method as generally applicable as possible, we treat the uncertainty parameters as nuisance parameters. Hence, no assumptions about the magnitude of the uncertainties are made; the only assumption is that the residuals are log-normal distributed.
In conclusion, we have developed a method that performs well applied to the simulated release case, and the results indicate that even with limited measurement data available, it is possible to construct a probabilistic source term that provides accurate predictions of gamma dose rates and reasonable estimates of the released amounts of most of the radionuclides considered. Due to the few assumptions made and the robust theoretical foundation, we expect the method to generalize well. However, in order to fully examine the performance of the method, future application to real-world cases is necessary.