1. Introduction
Modern numerical weather prediction (NWP) systems require accurate determination of the initial atmospheric state to produce a reliable weather forecast. This is achieved through data assimilation (DA). The DA attempts to blend two sources of information, the observations and the background fields, which are weighted by their respective error statistics. Although many types of observations can be assimilated in NWP systems, we focused on satellite observations, which have been found to have the largest impact of all the different observation types [
1]. The observation error covariance matrix (
R matrix) describes the deviation of observed radiances from true (or expected) radiances, and the background error covariance matrix (
B matrix) describes the difference between the background state and the true atmospheric state. Ideally,
R and
B matrices should be independent of each other. Unfortunately, the design of most current DA systems forces the two to interact with each other to some extent (i.e., through iterative optimization) [
2]. As a result, analysis based on the DA process highly depends on
R and
B matrices. However, the
R and
B matrices are not perfectly known, contributing to the uncertainty of subsequent weather forecasts.
While forecast is affected by analysis, it is also affected by the forecast model. This paper focuses on DA, and has little to do with forecast. Beside
and
matrices, the analysis also depends on the DA methodology. For global NWPs, DA methods started as simple horizontal interpolation methods [
3]. Later, more advanced approaches were developed, such as three-dimensional (3D) or four-dimensional (4D) with a multivariate relationship [
4]. For example, 3D and 4D variational methods were created to minimize the cost function to determine the optimal analyzed state based on a priori state, observations, and the prescribed Gaussian uncertainty statistics for the background and observations [
5]; various forms of ensemble Kalman filters were introduced to consider the flow-dependent forecast (i.e., background) error covariance in the DA process [
6], since background errors are expected to be flow-dependent on weather conditions. Later, hybrid approaches combining variational and ensemble DA methods were proposed [
7,
8], where an ensemble was used to build the initial background error covariances followed by evolving or updating the background error covariances by the NWP forecast model at the end of the DA process. Hybrid DA methods have become more popular and been employed at several NWP centers and groups [
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22]. More information on the formulation of various DA methods can be found in Gustafsson et al. [
23] and Bannister [
24].
The DA methods are processed under some assumptions when minimizing the cost function. In practical applications, however, these assumptions are difficult to justify [
25]. One assumption that plays a critical role in DA process is about the
B matrix (see review by Bannister [
24] and references therein). If variational DA methods are used in operations, the
B matrix is assumed to be static for a set period of time [
26]. If ensemble-variational DA methods are employed in operations, the
B matrix is assumed to be flow-dependent and to evolve with NWP forecast during the time window. In general, the
B matrix is estimated from forecast ensemble. However, it is not clear how much the variation within the forecast ensemble can represent the background deviation from the true atmospheric state. The NWP model engaged for evolution of
B matrix in the ensemble-variational DA process contains uncertainty or model error. As such, the derived
B matrix is likely not representative.
Another assumption that plays an important role in DA process is about the
R matrix. Until recently, the diagonal
R matrix has been engaged in most operational DA systems at NWP centers [
27]. Therein, instrument noise (plus forward model uncertainty and representativeness error) is placed in
’s diagonals by leaving zeros in
’s off-diagonals. However, from a physical viewpoint and as shown by many studies, inclusion of inter-channel error correlations in the
R matrix is physically sound and may improve analysis accuracy [
28,
29,
30,
31,
32,
33,
34,
35,
36,
37,
38]. Other studies have focused on diagnosing inter-channel error correlations in assimilation analysis for certain observation [
39,
40,
41,
42,
43,
44,
45,
46]. Currently the Met Office has already accounted for inter-channel error correlations in operational global and regional systems for Atmospheric Infrared Sounder (AIRS), Infrared Atmospheric Sounding Interferometer (IASI), and Cross-track Infrared Sounder (CrIS) instruments [
23]. Note that the term “inter-channel error correlations” in this paper refers to correlations among channels or bands of a detected radiance spectrum in which each radiance channel or band corresponds to its specified physical quantity, such as temperature, water vapor, gas compositions, surface temperature, and emissivity, in the atmosphere.
Since 1963, many methods have been developed for quantifying the inter-channel error correlations in the
R matrix [
47,
48,
49,
50,
51,
52]. Among these methods, the one proposed by Desroziers et al. [
52] has been widely used to estimate the observation error variances and inter-channel error correlations in Météo-France [
52] and ECMWF (European Center for Medium-Range Weather Forecasts) [
37] 4D-Var systems, and has also been applied to IASI data to estimate the structure of
R matrix in the Met Office 4D-Var system [
39]. Desroziers et al.’s scheme [
52] was combined with the maximum-likelihood approach to estimate the spatial correlation length-scale of observation errors [
53]. Some other studies also applied this technique [
35,
39,
52], for example.
As the
R matrix based on Desroziers et al. [
52] relies on the observation operator (
H), analyzed state (
),
B matrix or background state (
), and selected forecast ensemble, all of which contain uncertainties, the
R matrix used in the DA process may cause some problems. For example, the convergence speed of the DA process may decrease [
27,
39], which might indicate a possible limitation of those inter-channel error correlations and problems in the methods used to estimate the
R matrix in DA process [
2]. The most common problem is that this
R matrix is not symmetrical (note that symmetry is a necessity for
R matrix), leading to the eigenvalues of this
R matrix being not positive-definite (note that positive-definite eigenvalues are required in DA process). Thus, this
R matrix must be artificially made to be symmetrical and the eigenvalues must be altered to be positive-definite prior to their use in assimilation systems [
2,
35,
39]. Tabeart et al. [
2] suggested that these problems are caused by the interaction between the
R and
B matrices. The interaction is a numerical interaction rather than a physical interaction, which is a well-known issue during the iterations of the DA process in an effort to produce optimal outcomes for succeeding weather forecasts.
Given the above limitations, a better method to characterize
R matrix is needed, as stated in previous studies [
2,
23,
39]. Since the assumptions used in estimating
R and
B matrices are not always valid, reduction or entire elimination of the dependency on the assumptions and complete separation of the
R matrix from the
B matrix should enable us to produce a more physically sound
R matrix. This study introduces a potential method to derive the
R matrix, which requires no additional assumptions. Following the path of radiation through the atmosphere up to the radiation received by sensors and transformed to radiance, the correlations among the channels or bands in a detected atmospheric radiance spectrum allows us to propose a data-derived
R matrix. This
R matrix is built fully based on a large number (
N) of detected atmospheric radiance spectra constructed from
N real-time measurements (
Appendix A), where
N real-time measurements can be acquired by observing some location of interest during a short amount of time. The only two requirements in the data derivation of the
R matrix are the fabrication of instruments and the data calibration procedures, which are fundamental for any data analysis.
Our first motivation for creating a data-derived R matrix using a large number of real-time measurements is as follows. In traditional approaches, the derived R matrix based on statistical forecast ensemble cannot fully represent the observation errors because it is unclear how much the variation within the ensemble sample can represent the observation errors (i.e., variance and covariance) between observed and true radiance. This derived R matrix may not fully characterize the atmospheric state of an interested moment. Conversely, since real-time measured data contain representative information of the atmospheric state at the time when the data were recorded, the derived R matrix based on real-time data can appropriately characterize the observation errors for the atmospheric state at that time when data were recorded. One critical difference between our proposed method and traditional approaches is that our proposed data-derived R matrix is built without assumptions (i.e., no dependence on , , , or matrix), whereas the traditional R matrices are constructed upon some assumptions that may not be always valid. The other motivation for this study is the desire to inspire the community to consider completely using real-time data to construct R matrix for satellite radiance observations from a different perspective.
Three notes need to be explained. Firstly, the method proposed in this paper for construction of a data-derived matrix is only applicable to satellite measurements. Secondly, the meaning of “a large number of measurements” used here does not refer to the multi-measurements that are used in current numerical weather prediction within a certain time window. For multi-measurements, one single measurement is taken from each observation location at a moment of interest and thus multi-measurements correspond to multiple different atmospheric states. We do not use multi-measurements to construct the matrix because the matrix constructed from multiple different atmospheric states may be good enough within the time period when the multi-measurements were recorded, but not necessary ideal for assimilating the measurements. Thirdly, this paper presents the theoretical basis of a data-derived matrix based on N real-time measured radiances and suggests a conceptual design of a trigger configuration for satellites to record N real-time measurements. To record N real-time measurements, practical experimentation is required.
This paper is organized as follows.
Section 2 briefly describes the methods that have been used to estimate the
R matrix in literature. Remarks are provided about these methods and the long-used diagonal-only
R matrix.
Section 3 explains (i) radiation transmission through atmosphere and the radiation received at the sensors of an instrument and transformed to radiance, and (ii) the accompanying correlations among bands or channels of a detected atmospheric radiance spectrum. These are used to form the theoretical basis of the data-derived
R matrix using
N real-time measured radiances.
Section 4 lists the advantages of using a data-derived
matrix. Since no real data currently exist for constructing a data-derived
matrix,
Section 5 presents a conceptual design of a trigger configuration for satellites to record such data, and
Section 6 provides simple simulation data with simple toy examples to demonstrate our proposed method. Our summary and recommendations for future work are discussed in
Section 7 and
Section 8, respectively.
4. Advantages of Using Data-Derived R Matrix
There are many advantages of using data-derived matrix, such as independence on forward model, , or for , no forward model uncertainty, no representative error, no entanglement with matrix, and symmetric matrix etc. Several other advantages are explained below.
1. View from Macroscopic and Microscopic Aspects
An atmospheric state in equilibrium can be viewed from a macroscopic viewpoint that is composed of many microscopic aspects, where each microscopic aspect represents one fluctuation state around the macroscopic state in equilibrium. That is, by taking repeated measurements at an observation location during a short amount of time, each measurement represents one fluctuation state, whereas the average of those repeated measurements is regarded as the expectation of the truth of the macroscopic state in equilibrium. The deviation between each measurement and the average is the observation error and is used for constructing a data-derived matrix.
2. Accurate Expectation of True Radiance
The
matrix in Equation (1) is constructed based on Desroziers et al. [
52], where
and
are taken as the true radiance
. However,
and
are the radiance calculated from a radiative transfer model or algorithm (i.e., forward model) upon assumptions such that
relies on
,
, and
(or
matrix).
For a data-derived
matrix, in probability and statistics theory [
63,
64], the expected value (known as expectation) of a variable is the average value of repeated measurements of the same experiments. Taking the average of repeated measurements as the expectation of the true value of a variable is generally used in experimental measurements [
57]. We here apply such a concept to the data-derived
matrix by taking the average of the observed radiance (i.e.,
, see Equation (6)) as the expectation of the true radiance
, which is consistent with what is stated in note 1).
3. Correlations Accounted Among All Physical Quantities
The data-derived matrix does not only address one particular atmospheric physical quantity, but also contains information of variances and correlations for all atmospheric physical quantities, where each radiance band or channel corresponds to its own representative atmospheric physical quantity. simply characterizes the variance of atmospheric physical quantity in the ith radiance band or channel, and the correlation between any two atmospheric physical quantities in the ith and jth bands or channels can be readily obtained by Equation (13).
4. Less Flow Dependency
The observation error in an observation error covariance matrix is defined as the difference between the observed radiance
and the true radiance
, and attained
and
depends on the weather conditions at the time when data are recorded. The messages in the data-derived
matrix thus only depend on the weather conditions at a given observation location at a given time. Therefore, given an observation location at a time of interest, the data-derived
built in Equation (12) is only computed once. Hence, there is no need to update the data-derived
matrix within a DA process period. This considerably reduces computation time as compared with the traditional approach. The current DA system and the proposed DA system are compared in
Figure 2.
In the current DA system (
Figure 2a), if the output
from the DA process is not converged, adjustments to both
(i.e., either
or
) and
matrices are needed before processing the next DA cycle. This leads to complexity in the DA process and a less accurate
used for next-step weather forecast because the greater the number of factors (i.e.,
and
matrices here) influencing
, the greater the uncertainty associated with
. Conversely, in the proposed DA system (
Figure 2b), at each given observation location and time, the content in the data-derived
matrix is only calculated once and fixed during iterations of the DA process. Thus, only the
matrix needs adjustment for next cycle of the DA process if
has not converged. This approach simplifies the DA process as compared with the traditional approach and is expected to produce a more accurate
because only one factor (i.e., the
matrix) needs adjustment in the DA process. Note that
must be computed in each iteration of both DA systems.
5. Temporal Variations of Inter-Channel Error Correlations
In the current DA system, the or matrix is usually assumed to be static for the day, a few days, or even for months. The messages in an observation error covariance matrix depend on weather conditions. Therefore, the data-derived matrix allows us to quantify the change in inter-channel error correlations over time, where the change in the magnitude of ’s messages relies on how quickly the weather conditions change over time. Currently, there are no such real-time data; thus, we are unable to present the magnitude of change in this paper.
6. Gaussian Distribution of Variation
Here provides a supplementary explanation to the cost functions. The expression
in both Equations (15) and (16) is basically derived from the least square method [
63,
65] upon the assumption of Gaussian or Gaussian-like distribution of variation in each radiance band or channel [
63], which is widely accepted in real world practices as mentioned previously. This means that the variation in each radiance band or channel follows a Gaussian or Gaussian-like distribution. A large enough sample can be used to verify if the distribution is a Gaussian or Gaussian-like distribution. However, recording only a few measurements does not prevent the variation in each radiance band or channel from behaving as a Gaussian or Gaussian-like distribution if the variation does follow a Gaussian or Gaussian-like distribution.
6. Simple Simulation Studies
Since no such real data exist, we instead used simulation data with simple toy examples to demonstrate how the data-derived
matrix works and how the retrieved outputs perform compared with the methods employed in current DA systems. This was inspired by Desroziers et al. [
52], where simulation data with toy examples were used to demonstrate their methods.
Figure 4 illustrates the simple simulation example, where three bands or channels were considered for simplification. The atmospheric state parameters are represented as
and the measured radiance at the instrument sensor is described as
. The relationship between
and
is through matrix
, which is
. The first term in the cost function,
, was unable to be simulated in this study and thus is ignored here, which would not affect the purpose of our demonstration. Four different observation operators were adopted in this study as follows:
and
In the following, notations corr, cr, and wr denote correlation, correct, and wrong, respectively; () means and/or are/is used rather than indicating that is a function of . Hence, () is taken as a correct (), both and (both and ) are taken as two incorrect ’s (two incorrect ’s), and both and (both and ) are used for illustration of correlations between any two channels or bands. is assigned as the true values of ASPs in the equilibrium state; radiances and are regarded as representing two fluctuation states around an atmospheric state in equilibrium and are obtained according to when single measurement is considered, where is a Gaussian distribution centered at zero with a deviation of 0.05 if no other deviation values are applied elsewhere; and is the averaged radiance from N measured radiances. Similar to what is usually performed in DA analysis, when looking for the best that is closest to , is restricted within during minimization of the cost function. Note that equation used for radiance generation and for construction of matrix is abbreviated as hereafter. For a large number of measurements, N = 200 is used in our study.
Below, we present four cases (Cases A–D). The purpose of Case A (
Section 6.1) was to demonstrate the importance of verification for correctness of
H and to investigate the impact of fluctuation in measured radiances on the retrieved
. The questions addressed in Case B (
Section 6.2) were: Is diagonal-only
matrix appropriate? Is single measurement enough? Does
need verification? Similar to Case B, the purpose of Case C (
Section 6.3) was to answer: Is
matrix appropriate? Is single measurement enough? Does
need verification? Finally, the purpose of Case D (
Section 6.4) was to determine if correlations could be automatically manifested from the
matrix. For each case, we described its analysis and provided results and discussions.
6.1. Case A
Firstly, N sets of measured radiances were obtained from N real-time measurements, which were generated according to . The data-derived matrix was then constructed using the N sets of radiances based on Equations (6) and (12). Subsequently, the second term of the cost function in Equation (16) was minimized by looping over to look for the best using various (). Lastly, the uncertainty of each retrieved in could be obtained from the covariance U matrix (Equation (14)).
The fluctuation in measured radiance could be magnified as a result of taking
N measurements over a short time span. To visualize the effect of taking
N measurements on the retrieved
, the fluctuation was simulated by the deviation in the Gaussian function. Deviation values of 0.05, 0.10, 0.15, 0.20, 0.30, and 0.40 were investigated, and the corresponding generated radiances were denoted by
or simply
hereafter, with subscript
d representing the deviation values. The procedure of this analysis is depicted in
Figure 5.
The results are listed in
Table 1. In the cost function (
Figure 5),
is the average of all
N sets of measured radiances, considered as the expectation of true radiance; and the
matrix was fully built from real-time data, providing good representativeness of the true
matrix. As seen in
Table 1, only a correct
(
) could produce a correct
that was consistent with the true
, where both the cost function value
and the uncertainty for each retrieved
were the smallest. Note that the best
would fall outside
with a bigger
value if any component of the retrieved
reaches the boundary of
. This suggests that verification of the correctness of
(or the forward model) is necessary for ASP retrieval in a DA system.
Under the same conditions except for deviation values, the results show that the extracted values of
using various deviations (i.e., various extents of fluctuation) were almost the same. Hence, only the results using deviations of 0.05, 0.15, and 0.40 are listed in
Table 1. These results indicate that fluctuation in measured radiances did not have any significant impact on the retrieved
even when the deviation was up to 40%.
We noted three features in
Table 1. The first one was that the bigger the deviation, the smaller the value of
. This occurred because the
matrix was in the denominator of the cost function
and a larger fluctuation produced bigger errors and bigger error correlations in the
matrix, resulting in smaller
value. The second feature was that the
value for the case using
was smaller than when using
because
was chosen to be closer to
than
. The third feature was that the larger the deviation (i.e., fluctuation) in the measured radiances, the greater the uncertainties in the retrieved
, which was reasonable.
6.2. Case B
Single measurement for radiance
(or
) and diagonal-only
matrix were considered in this case. Firstly,
(or
) was adopted as the measured radiance with the existence of deviation from the average of radiance
. Secondly, the variances in the
matrix were taken from the diagonal variances of the ensemble-derived
matrix, where the ensemble-derived
matrix in Case B was built using
N sets of ensemble radiances that were generated according to
with
,
, or
. Unlike the current DA process in which the
matrix varies with
when it needs re-computation during minimization iterations,
was used to compute the ensemble-derived
matrix for all minimization iterations in Case B (and Case C as well). The purpose for doing so was to imitate the consideration of forward model uncertainty and representativeness error in the ensemble-derived
matrix and
’s diagonals so that both the ensemble-derived
matrix and
’s diagonals were as close as the true
matrix. Thirdly, the second term of the cost function in Equation (15) was minimized by looping over
to look for the best
using various
(
), where
and
(or
) were used here. Note that the
(
) matrix used in building the ensemble-derived
matrix (i.e., the second step) and minimization of the cost function (i.e., the third step) were the same. Finally, the uncertainty of each retrieved
in
could be obtained from the covariance
U matrix. The procedure of this analysis is displayed in
Figure 6.
The results are presented in
Table 2. No
(
) was able to produce an
consistent with
because single measured radiance
(or
) only represents one fluctuation state around an atmospheric state in equilibrium, introducing deviation from the equilibrium state, and because
does not contain full contents. Notwithstanding, the use of a correct
(
) in minimizing the cost function could produce an
closest to
with the smallest
values and the least uncertainty for each retrieved
, which again reveals the importance of verifying the correctness of
. With respect to Case A, this study inspired us to think that a single measurement for the radiance spectrum may not be sufficient and
may not be appropriate either.
6.3. Case C
The analysis procedure in Case C was the same as that in Case B except
taken from the ensemble-derived
matrix was adopted. See
Figure 7 for the analysis procedure.
The results are shown in
Table 3. Another study was conducted using
for all
constructions, the results of which are provided in
Table 4. No
(
) was able to produce an
consistent with
. Similar to Case B, this occurred because
(or
) was used and
might not possess good representativeness of the true
matrix. Once again, using a correct
(
) to minimize the cost function could produce an
closest to
with the smallest
values and least uncertainty for each retrieved
. As such, verification of the correctness of
is important.
In summary, the findings from Cases A, B, and C provided some important learnings. Firstly, with the adoption of
N real-time measured radiances, constructing the data-derived
matrix from
N measured radiances, and the correct observation operator
could produce an
consistent with
with the smallest
value and almost the least uncertainty for each retrieved
(see the first result in each case of using different deviations in
Table 1). Secondly, conversely, if
or
together with single measured
(or
) was employed, discrepancy appeared in the retrieved
compared to
. Thirdly, as long as a correct
was used in the minimization of the cost function, the retrieved
was closest to
with the smallest
value and the least uncertainty for each retrieved
. This reveals the importance of verifying the correctness of
(or the forward model) used in the DA process. One important result was that fluctuation in the measured radiances did not have any significant impact on the retrieved
, even when the deviation was up to 40%. This suggests that fluctuation possibly caused by taking
N measurements (if any) is not an issue.
Although linear was employed in our toy simulation studies, the conclusions presented here should be the same even if the real was non-linear. That is because the nonlinear forward problem could be linearized around the background.
6.4. Case D
The generation of the data-derived
matrix in Case D was the same as that in Case A, where
(
) and
(
) were employed for illustration of correlations between any two channels or bands. Once the data-derived
matrix was built, the correlation matrix was acquired based on Equation (13).
Figure 8 outlines the analysis procedure.
The results of the correlations are listed in
Table 5. When using
(
), the off-diagonals in the data-derived
matrix were much smaller than those in the diagonals, resulting in much smaller correlations in the off-diagonals than those in the diagonals. Conversely, if
(
) was used, the magnitudes in the diagonals and off-diagonals in the data-derived
matrix were comparable, and the correlations in both the diagonals and off-diagonals were also comparable. This study demonstrates that the correlations between any two channels or bands could be manifested from the
matrix itself. Therefore, despite significant or insignificant correlations between any two channels or bands, the data-derived
matrix proposed in this paper provides a method to acquire correlations between any two channels or bands (i.e., correlations between any two atmospheric physical quantities).
7. Summary
Atmospheric state parameter profiles (atmospheric profiles) are key inputs for numerical weather prediction. The observation error covariance (
R) matrix factor plays a pivotal role in the retrievals of atmospheric profiles. Since a single measurement is used in atmospheric research analyses and operational weather forecast (ARWF), correct variance and covariance cannot be obtained for the
R matrix. As a consequence, the diagonal
matrix arranged with instrument noise (plus forward model uncertainty and representativeness error) in the diagonal elements to serve as variances by leaving zeros in off-diagonal has been used in ARWF for years. Recently, a new
matrix structure based on the method of Desroziers et al. [
52] was introduced and has been adopted in, for example Météo-France and ECMWF 4D-Var systems, and has also been applied to AIRS, IASI, and CrIS data in Met Office 4D-Var systems.
However, the instrument noise acting as variance in ’s diagonal led to the dependence of the retrieved ASPs on the instrument being employed. This conflicts with reality in that ASPs should not depend on any instrument. The matrix relies on H, , and the B matrix or , yet the correctness of H requires validation and no true value exists for or . This leads to the matrix being not uniformly symmetrical, resulting in eigenvalues not fully positive-definite, causing less representativeness because ’s structure depends on users’ assignment to and , users’ choices of H, and the selected forecast ensemble that generally cannot describe the atmospheric state at the time when data were captured. Even after diagnosis of the matrix through iterations in the DA process, the diagnosed matrix still has problems as reported in the literature. The problems are caused by the use of single measurement, resulting in an R matrix entangled with B matrix in the DA process.
To unentangle the R and B matrices, the complete isolation of the R matrix from the B matrix is required. To acquire correct variances and covariances for the R matrix, a potential solution is to use a large number (N) of real-time measurements within a short detection time to replace single measurement. An atmospheric state in equilibrium (viewed from macroscopic aspects) can be determined from averaging over many atmospheric variation (or fluctuation) states (viewed from a microscopic aspect). Thus, the R matrix used in the DA system for retrieval of parameters of an atmospheric state in equilibrium can be achieved by taking many fluctuation states. From a physical viewpoint based on microscopic aspects, each fluctuation state involves interactions between radiation and the particles in the atmosphere, and the interactions comply with the phenomenon of the energy absorption and emission occurring between incident radiation and atoms (or molecules). The wavenumber (i.e., energy) of the outgoing radiation may change after passing through all the layers in the atmosphere, subsequently being received at sensors installed in instruments and converted to radiance to form a radiance spectrum. This suggests correlations exist among bands or channels of a detected atmospheric radiance spectrum. Following probability theory and statistics, a correct R matrix can thus be derived based on N real-time measurements, independent of any assumed quantities, and hence having no ambiguity.
Apart from constructing the R matrix, using N real-time measurements has several other advantages. Unlike single measurement, which is merely a variation around an equilibrium state, the mean of the N measurements can adequately represent the equilibrium state during the detection time . Another advantage is that an N real-time measured sample is capable of describing the atmospheric state at the time when the data were captured, whereas forecast ensemble is generally incapable of doing so. A further advantage is that the uncertainty of the N measured radiance spectra conveys some messages. Firstly, finer structures of H would not signify if they were smaller than the uncertainty, equivalently meaning that a more precise H does not mean more accurate retrieval. Secondly, this uncertainty provides us with some information about the uncertainty level that can be tolerated on H when H is linearized, whereas linearization of H may potentially be used to verify the correctness of H using the N measured radiance spectra.
Technically, taking N measurements during a short amount of detection duration is achievable by modifying the trigger configuration of data recording from ground. The traditional trigger configuration captures single measurements successively, but many data are not used in practice. Therefore, to fully use detected data and to reduce the load of the captured data, the proposed trigger configuration involves recording N measurements within with some time between two consecutive N-measurement events. Our study revealed that fluctuation in the measured radiances that could be inferred by taking N measurements does not have any significant impact on the retrieved ASPs.
Several advantages of taking N real-time measurements of radiances for constructing a data-derived matrix were discussed in this paper. Studies using simulation data with simple toy examples were also presented in this paper. These studies demonstrate that the adoption of N real-time measured radiances, a data-derived matrix constructed from the N measured radiances, and a correct can produce accurate ASPs with smallest cost function value and the least uncertainty for each retrieved ASP. These studies also revealed that verification for correctness of (or forward model) is necessary.
8. Recommendations for Future Work
As mentioned in
Section 1, the retrieval results of ASPs from the DA system for successive weather forecast depend on
H, the design of the DA methods, and the information in
R and
B matrices. In traditional ARWF, entanglement occurs in the DA system among (1)
H, which needs verification; (2) DA methods; (3) the
R matrix, which relies on instrument noise or
H,
, and
; and (4) the
B matrix whose evolution depends on the NWP (or forecast) model. Furthermore,
R and/or
B matrices are usually constructed from forecast ensemble rather than real-time data. Thus, even if the
R matrix is acquired correctly, the DA system still cannot guarantee producing accurate retrieved ASPs without proper
H, correct
B, and an appropriate DA method. Therefore, to ensure the whole DA system functions properly for correct production of retrieved ASPs and for next-step weather forecasting, it is a step-by-step course. The first and easiest step would be to fully isolate the
R matrix from
H,
, and
(or
B matrix) and to independently and directly acquire the
R matrix from
N real-time measurements as explained in this paper. After that, the following tasks would be verification of
H and building the
B matrix with the measured
N real-time data, with the aim of being free from reliance on the NWP (or forecast) model or historical forecast ensemble. Once
R,
H, and
B are attained without entanglement with one another, the goal would be the determination of the DA method that performs the best.
Lastly, we hope to inspire people to think from a new perspective about which one, single measurement or a large number of measurements, should be used for the matrix as well as ARWF, and what can be accomplished with a large number of measurements to improve the accuracy of the retrieval of atmospheric state parameters and of numerical weather prediction.