2.1. LandSat 8/9 OLI and TIRS
The operational land imager OLI, onboard NASA/USGS LandSat 8 and LandSat 9, provides high-resolution images of our planet in eleven different bands of the electromagnetic spectrum, ranging from visible to thermal radiation, the latter thanks to the TIR spectrometer (TIRS). The resolution, or more precisely the spatial sampling interval (SSI), of image products ranges between 15 and 100 m. OLI/TIRS data are used to create detailed maps of surface temperature, emissivity, spectral reflectance, and elevation. Analogous to LandSat “enhanced thematic mapper plus” (ETM+), OLI features a panchromatic (Pan) channel, whose spectral coverage has been specifically devised to expedite the sharpening of the thermal bands, with two instead of one.
Figure 1 presents a synoptic comparison of the two instruments: in addition to the narrow-band Pan of OLI, differences are the coastal and cirrus bands, suitable for atmosphere studies, and the split of the unique thermal band of ETM+ to allow for thermal diversity. The bands of OLI, including TIRS, are described in
Table 1. There are three groups of bands in three different spectral ranges, VNIR, short wave infrared (SWIR), and TIR, with different spatial resolutions.
A LandSat 9 scene was acquired near Perugia, in Central Italy, on 22 March 2022. The geographical position of the test image is shown in
Figure 2.
Figure 3 displays the whole scene: 15 m Pan (B8) and true color composition of B4, B3, and B2, interpolated at 15 m. In addition to the lake, the scene includes a built-up area and a surrounding vegetated region. As it appears, the optical acquisition is perfectly cloud-free.
Figure 4 shows the two thermal bands, B10 and B11, displayed as absolute temperature maps with the associated color palette. The TIRS data are delivered and displayed in LST format, with measure unit K. For calculating correlations with the optical bands, and for subsequent fusion, they have been converted to surface spectral radiance units [W m
−2sr
−1μ
−1] by using the file metadata to apply Planck’s law. The conversions of surface radiance to LST and of LST back to radiance would assume an average emissivity coefficient of the land surface equal to 0.97 (see
https://earthexplorer.usgs.gov, (accessed on 19 October 2024)). This explains the singular effect that two adjacent TIR bands measure slightly different LSTs: the conversion coefficients to and from LST assume a constant emissivity equal to 0.97, which is an approximation of its unknown true value.
The statistical similarity of the images to be merged is crucial for the quality of the fusion.
Table 2 details the correlation matrix of the test image, calculated between the original TIR bands (B10 and B11), VNIR bands (B1, B2, B3, B4 and B5), SWIR bands (B6 and B7), panchromatic band (B8), and narrow cirrus band (B9), suitable for detecting ice crystals in atmosphere. The Pan band has been previously degraded at 30 m and 100 m resolutions retaining the original 15 m scale, i.e., pixel size, the former for calculating correlations with the other VNIR bands and the latter for correlations involving the two TIR bands. All 30 m optical bands have been resampled at 15 m scale and degraded at 100 m resolution. The correlation between two bands is trivially measured at the same spatial scale but also at the lower resolution, i.e., that of the less resolved band. Therefore, the band with greater spatial resolution must be spatially degraded by means of proper low-pass filters. As it appears, the optical bands are somewhat correlated, one with another, with the exception of the NIR band, which contains the radiation reflected by vegetation. The interpretation is made difficult by the presence of water pixels in the scene, whose response in the NIR and SWIR wavelengths is close to zero. This fact artificially increases the correlations of B5, B6, B7, and B9 towards TIR (B10 and B11) and decreases those of the visible bands, including B8 (Pan).
Given the relatively high correlation values of some bands, the strategy that appears to be a viable alternative to plain pansharpening would be using the 30 m VNIR/SWIR bands and the 15 m Pan (B8) for the synthesis of two sharpening bands, each tailored to the thermal features of a 100 m TIR band. In fact, if the correlation between the sharpening and the sharpened band is sufficiently high, the results of thermal fusion at 15 m are expected to be likely. This can be achieved by resorting to concepts of data assimilation to generate the synthetic bands and to the hypersharpening paradigm, which foresees as many sharpening images as there are bands that shall be sharpened.
2.2. Hypersharpening of TIRS Data
While pansharpening increases the geometric resolution of a multiband image by means of a Pan observation of the same scene with greater resolution, whenever the higher-resolution image is not unique, hypersharpening concerns the synthesis of a unique image from which the spatial details shall be extracted in order to optimize the products of fusion [
23]. This synthetic Pan is generally different for each band that is enhanced. The idea is similar to predict the sharpening image through a hard or soft combination of a series of bands at higher resolution [
31]. The combination can be driven by data assimilation concepts [
25]. The wide choice of optical bands of OLI, including 15 m Pan, and the characteristics of the correlation matrix in
Table 2 suggest exploiting the pixel-by-pixel combination of the optical bands according to the least squares (LS) coefficients of a multivariate regression of the optical bands towards each of the TIRS bands.
Figure 5 outlines the flow diagram of the OLI/TIRS sharpening procedure: the synthetic high-resolution sharpening bands of each of the two TIRS bands are assimilated from the optical bands; the two synthetic Pans at 15 m are used for the hypersharpening of the TIR bands. Although the Pan image (B8) is unique, there are two different sharpening images, one for B10, one for B11.
Let denote the more spatially resolved bands, e.g., the eight 30 m VNIR bands of OLI and the 15 m Pan image, the former resampled at the scale of the latter (15 m) for homogeneity of notation. Let also indicate the bands having lower resolution, the TIRS bands, both at 100 m spatial resolution, and R the ratio of spatial scales of and (, , and , for OLI/TIRS). The sharpening band , of the ith band with lower resolution, , is synthesized as follows.
First, the
bands are low-pass filtered with a frequency cutoff equal to
, to yield the bands
spatially degraded to the resolution of the bands that are being enhanced. The fractional scale ratio can be easily tackled by resorting to the generalized Laplacian pyramid [
32], widely used for pansharpening [
33]. Then, the relationships between the lower-resolution bands interpolated by
R,
, and
are modeled as a multivariate linear regression:
in which
is the space-varying residue. The set of optimal constant weights,
, is calculated as the least squares (LS) solution of Equation (
1). The spectral coefficients,
, are used to synthesize the set of sharpening bands,
:
The block labeled “assimilation” in the flowchart of
Figure 5 performs the operations described in Equations (
1) and (
2).
Finally, the hypersharpened bands,
, at a resolution of 15 m, can be computed by means of a projection-based fusion algorithm [
34] between
and
:
in which
and
denote covariance and variance of random variables,
is low-pass-filtered
with 15/100 spatial frequency cutoff, or, equivalently,
Equations (
3) and (
4) define the operations carried out by the “hypersharpening” block in
Figure 5.
Different models of detail injection, a contrast-based model [
10] or a genetic model [
35], can be adopted. The projection-based injection gain, defined as the ratio of covariance of source–destination to variance of source, measured at the spatial scale of the fusion product, 15 m, but at the resolution of the destination prior to fusion, that is, 100 m, can be calculated on a local sliding window in a space-varying fashion [
36], instead of being global over the whole scene. The projection coefficient balances the injection of spatial details to the extent of correlation between the source of spatial details and the destination of such details. Note that Equation (
3) denotes conventional pansharpening if
is replaced with the original 15 m OLI Pan,
, (B8) and
with its low-pass version,
.
Very and extremely high-resolution MS scanners, e.g., WorldView-3, in which four instruments with three different resolutions are present [
37], may exhibit residual shifts due to uncorrected parallax views. In the case of LandSat 8/9, the negligible residual spatial shifts between the datasets allow for the use of fusion methods based on separable [
38] or non-separable [
39] multiresolution analysis of the sharpening image. Such methods are also recommended for merging data coming from different platforms [
40,
41].
The coefficient of determination (CD) of multivariate regression in Equation (
1) measures the success of matching between each of the lower-resolution bands and the sharpening image synthesized from the higher-resolution VNIR/SWIR/Pan bands [
42]. Note that the use of a multivariate regression to synthesize the sharpening band makes the method independent of the data format [
43]. However, while the math derivation of the sharpening bands does not depend on the physical format of the data, e.g., top-of-atmosphere (TOA) spectral radiance or surface reflectance, provided that a linear affine relationship (straight line with slope and intercept) can be stated between the variables that appear in either the regression and in the fusion rule [
43], the multiplicative injection gain derived from the radiative transfer model would assume that all band data are in a surface reflectance or a surface spectral radiance format without offset/intercept between variables due to atmospheric scattering effects. The surface reflectance is a level-two (L2) product and is usually distributed for global-coverage optical systems, such as OLI and Sentinel-2. The surface spectral radiance can be derived from the surface temperature format of TIRS data through the available metadata.