Article

An Improved Version of the Generalized Laplacian Pyramid Algorithm for Pansharpening

1 DIEM, University of Salerno, 84084 Fisciano, Italy
2 Institute of Methodologies for Environmental Analysis, CNR-IMAA, 85050 Tito Scalo, Italy
* Author to whom correspondence should be addressed.
Remote Sens. 2021, 13(17), 3386; https://doi.org/10.3390/rs13173386
Submission received: 6 July 2021 / Revised: 11 August 2021 / Accepted: 20 August 2021 / Published: 26 August 2021
(This article belongs to the Section Remote Sensing Image Processing)

Abstract

The spatial resolution of multispectral data can be synthetically improved by exploiting the spatial content of a companion panchromatic image. This process, named pansharpening, is widely employed by data providers to augment the quality of images made available for many applications. The huge demand calls for efficient fusion algorithms that do not require specific training phases, but rather exploit physical considerations to combine the available data. For this reason, classical model-based approaches are still widely used in practice. We created and assessed a method for improving a widespread approach, based on the generalized Laplacian pyramid decomposition, by combining two different cost-effective upgrades: the estimation of the detail-extraction filter from data and the utilization of an improved injection scheme based on multilinear regression. The proposed method was compared with several existing efficient pansharpening algorithms, employing the most credited performance evaluation protocols. The capability of achieving optimal results in very different scenarios was demonstrated by employing data acquired by the IKONOS and WorldView-3 satellites.

1. Introduction

Pansharpening [1,2,3,4,5] has generated growing interest in recent years due to the numerous requests for accurate reproductions of the Earth's surface, which have pushed researchers to enhance the performance of algorithms based on remotely sensed data. Indeed, pansharpening represents a crucial step in the production of images aimed at visual interpretation in widely exploited software such as Google Earth and Bing Maps. Likewise, many other applications take advantage of this kind of fused data, for instance, agriculture (e.g., for crop type [6] and tree species [7] classification and for precision farming [8]), land cover change detection (e.g., for snow [9], forest [10] and urban [11] monitoring), archaeology [12] and even space mission data analysis [13].
Although the term is often used for a large set of combined data, the pansharpening process more exactly indicates the enhancement of a multispectral (MS) image through fusion with a higher resolution panchromatic (PAN) image representing the same scene, as depicted in Figure 1. This setting allows one to obtain a very high quality final product, since the acquisitions can be collected almost contemporaneously from the same platform, thanks to the availability of the two required sensors on many operating satellites.
Several algorithms have been developed for completing the pansharpening process. They can be categorized in different ways according to their main features. In particular, a useful taxonomy that can guide the choice of the user distinguishes two main classes composed of classical and emerging approaches [5]. Essentially, the first group includes the methods which have been developed over the years, starting from the analysis of the physical processes underlying the acquisition of the signals involved. It includes both component substitution (CS) approaches—for example those based on intensity-hue-saturation (IHS) [14,15], Gram–Schmidt [16,17] or principal component analysis [18,19] decompositions—and multiresolution analysis (MRA) methods, which use signal decompositions based on box filters, Laplacian pyramids [20,21,22,23], wavelets [24,25,26] and morphological filters [27,28,29]. The methods in the second group, instead, focus on the optimization of the fusion result and aim at obtaining the best image quality through the application of more general estimation approaches. Techniques based on variational optimization (VO) [30,31,32] and on machine learning (ML) [33,34,35,36,37] belong to this class.
The literature shows several papers in which the results achievable through these approaches are compared [1,5,38,39,40,41]. As in many other fields, the recent development of more efficient computational approaches has constituted a major breakthrough in data fusion, making feasible the utilization of variational and ML-based methods. In more detail, in the last decade, pioneering works in the ML category were compressive sensing and dictionary-based solutions, such as [31,42,43,44,45,46]. Subsequently, deep learning (DL) approaches became more and more popular in the remote sensing field [47], including pansharpening [33,34,35,36,48,49,50,51,52,53,54,55] and intimately related tasks such as super-resolution [56,57,58,59] and hyper-/multi-spectral data fusion [60,61]. The main issue of the above-mentioned ML approaches is the assumption of a training paradigm relying on a resolution downgrade process (e.g., Wald's protocol). More recently, different training paradigms, mainly based on multi-objective strategies, such as in [62,63], have been proposed to address this problem.
On the other hand, a careful analysis of the literature testifies that the performance enhancements obtained through recent implementations of classical methods, such as those proposed in [64,65,66], or efficient implementations of VO, such as the one proposed in [30], lead to high quality pansharpened images that do not require extensive training phases.
For this reason, in this paper we focus on possible improvements of these efficient approaches; in particular, we exploit the classical method scheme that is composed of two successive phases [1]: (i) the extraction of the details from a high resolution PAN image and (ii) the injection of those details into a low resolution MS image. We investigated both phases, following the lines traced by the recent literature.
In more detail, we focus on an MRA approach based on the generalized Laplacian pyramid (GLP) [20] with a modulation transfer function (MTF)-matched filter. Namely, the details are extracted from the PAN image through a filter whose amplitude response is matched to the MTF of the MS sensor [21]. This technique aims at extracting the data most relevant to the enhancement of the MS image spatial resolution, since it isolates the PAN information that was cut off by the MS acquisition process. The cited reviews contain a vast list of pansharpening approaches using MTF-shaped extraction filters. However, the exact form of the MS sensor's MTF is frequently unavailable in practice, due to the lack of accurate and updated on-board measurements of the actual response, which changes over time as the acquisition device ages [67]. Accordingly, good practice consists of estimating the actual shape of the detail-extraction filter directly from data [30]. If the response is significantly different across bands, it is advisable to estimate a different filter for each band [68].
Typically, the subsequent phase of detail injection is completed by adding the image extracted from the PAN image weighted by an injection coefficient matrix, which can in general contain a different entry for each pixel and each band. The values of the specific weights can be derived through physical considerations (as happens for the HPM method [67,69]) or through mathematical optimization approaches starting from suitable criteria. The projective (or regression-based) injection model [21,70] belongs to the latter type, since it implements the minimum mean square error (MMSE) estimation approach. However, the described linear injection rule does not always represent the optimal choice, as can also be argued from the studies that propose non-linear approaches, implemented through local methods [15,66,71] or non-linear networks [33,48]. In this work we elaborate on this thesis, while aiming to preserve the computational efficiency of classical methods. To that end, we adopted a slight generalization of the linear approach that consists of estimating the best polynomial approximation of the optimal relationship between the details extracted from the PAN image and those missing in the MS image. This approach allows one to estimate the optimal injection coefficients, according to the MMSE criterion, through a simple closed formula implementing a multilinear regression (MLR) scheme [29].
The original contribution of this study relies on the definition of a novel fusion architecture. The combination of the filter estimation and the MLR injection approach has been assessed and compared to several existing approaches. We show that with the conjunction of the two techniques, it is possible to obtain remarkable robustness against the diversity of the illuminated scenes, thereby enhancing a key feature of algorithms in practical applications. We tested the proposed method using two different real datasets, acquired from the IKONOS and the WorldView-3 sensors, which allowed us to evaluate the effective performance in different working scenarios. The quality of the final products has been assessed by exploiting both the reduced scale and the full scale assessment protocols [30]. The former allowed us to evaluate the performances of the algorithms in a controlled scenario, where the original MS images were used as references for the fusion of images that were purposely degraded to lower resolution. The latter constituted a realistic scenario in which the MS and PAN images were combined at the original resolution, in the absence of reference images.
In the next section, we describe the problem at hand, define the quantities used in the paper and provide an overview of the proposed approach. The following two sections are devoted to detailed descriptions of the chosen implementations of the two phases that compose the fusion process. In Section 5, we describe the simulation setting and report the outcomes of the experimental tests. The discussion of the results is in Section 6. Finally, conclusions are drawn in Section 7.

2. Problem Statement

Firstly, we introduce the mathematical notation that will be used in this paper. Bold uppercase letters, e.g., $\mathbf{X}$, indicate images. Accordingly, $\mathbf{P}$ denotes the panchromatic (PAN) image, which is an $L_r \times L_c$ two-dimensional array, where $L_r$ and $L_c$ are the numbers of rows and columns of the PAN image, respectively. Instead, the multispectral (MS) image is a three-dimensional array with dimensions $L_r/R \times L_c/R \times B$, where $B$ is the number of bands and $R$ is the resolution ratio between the original MS and the PAN data; it is denoted by $\mathbf{M} = \{M_b\}_{b=1,\ldots,B}$, where $M_b$ indicates the $b$-th spectral band.
The purpose of pansharpening is to identify a near-optimal procedure to obtain $L_r \times L_c \times B$ MS data, $\hat{\mathbf{M}} = \{\hat{M}_b\}_{b=1,\ldots,B}$, characterized by the same spectral resolution as the original MS image $\mathbf{M}$ and the same spatial resolution as the PAN image $\mathbf{P}$. We also define $\tilde{\mathbf{M}} = \{\tilde{M}_b\}_{b=1,\ldots,B}$ as the $L_r \times L_c \times B$ MS image obtained by solely upsampling $\mathbf{M}$ to the PAN scale.
The general formulation of a classical fusion process is given by the following expression [1]:
$$\hat{M}_b = \tilde{M}_b + F_b\left[\Delta P_b\right], \quad b = 1, \ldots, B, \tag{1}$$
where:
  • $\Delta P_b$ are the PAN detail images, computed as $\Delta P_b = P_b - P_b^L$, i.e., as the differences between the band-by-band equalized PAN image $P_b$ (the equalization is performed as suggested in [27]) and its low-pass filtered version $P_b^L$;
  • $F_b[\cdot]$ are the functions (different for each band $b$) that inject the PAN details into each MS band.
According to (1), in classical methods each band is treated independently and an additive form is assumed for the injection procedure. The different techniques adopted to compute $P_b^L$ (and hence the details $\Delta P_b$) and $F_b[\cdot]$ identify the specific approaches. More specifically, if $P_b^L$ is obtained by combining the channels of the interpolated MS image $\tilde{\mathbf{M}}$, the method is said to belong to the CS class, whereas if $P_b^L$ is obtained by extracting the low-pass part of the PAN image $\mathbf{P}$, the method is said to belong to the MRA class. This distinction is not purely formal, since the two classes can be shown to be characterized by very different visual and quantitative features [72].
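To make the two-phase structure of (1) concrete, the following minimal Python sketch implements the generic classical scheme; the `lowpass` and `inject` callables and the band-equalized PAN cube `pan_eq` are placeholders standing in for the choices that define a specific CS or MRA method.

```python
import numpy as np

def classical_fusion(ms_up, pan_eq, lowpass, inject):
    """Generic classical pansharpening of Eq. (1), fused band by band.

    ms_up   : upsampled MS cube, shape (Lr, Lc, B)
    pan_eq  : PAN equalized to each band, shape (Lr, Lc, B)
    lowpass : callable b -> P_b^L (its definition selects CS vs. MRA)
    inject  : callable (b, details) -> F_b[details]
    """
    fused = np.empty_like(ms_up)
    for b in range(ms_up.shape[2]):
        details = pan_eq[:, :, b] - lowpass(b)          # Delta P_b
        fused[:, :, b] = ms_up[:, :, b] + inject(b, details)
    return fused
```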
Starting from this framework, in this work we propose an architecture, depicted in Figure 2, in which the main steps are the following ones.
(1)
Filter estimation. The low-pass filtered PAN image $P_b^L$ is obtained via an MRA scheme, where the low-pass filter impulse response $h_b$ (which can be different for each band $b \in \{1, \ldots, B\}$) is estimated from the data by using semiblind deconvolution [30,68], as will be detailed in Section 3.
(2)
Multilinear regression-based injection. The functions $F_b[\cdot]$ are supposed to have a non-linear form that can be approximated by a polynomial expansion of order $M$, whose coefficients are computed by using a multilinear regression (MLR) approach [29]. The complete description of this procedure will be provided in Section 4.
Indeed, our purpose was to evaluate the joint effect of both filter estimation and MLR-based injection on the final fused product. Therefore, in the following sections, we present in detail our proposal for implementing these two steps.

3. MRA via Filter Estimation

In this section we present two MRA approaches based on the estimation of the low-pass filter impulse response, namely, the filter estimation (FE) and the multi-band filter estimation (MBFE) algorithms, introduced in [30] and [68], respectively.
Both algorithms cast the estimation problem in the following form. Let $\mathbf{x} \in \mathbb{R}^N$ be a single band lexicographically ordered image, where $N = L_r L_c$, and let $\mathbf{y} \in \mathbb{R}^N$ be its spatially degraded version, obtained via a low-pass filter (LPF) with finite impulse response $\mathbf{h} \in \mathbb{R}^N$ and the addition of noise $\mathbf{n} \in \mathbb{R}^N$. In formulas, we can write the following equation:
$$\mathbf{y} = C(\mathbf{x})\,\mathbf{h} + \mathbf{n}, \tag{2}$$
where $C(\cdot)$ is an operator that generates a block circulant with circulant block (BCCB) matrix by suitably rearranging the entries of its argument, such that the matrix product between $C(\mathbf{x}) \in \mathbb{R}^{N \times N}$ and $\mathbf{h}$ yields the convolution between the image $\mathbf{x}$ and the filter $\mathbf{h}$ [68,73].
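As a concrete illustration of this structure (under the periodic-boundary assumption discussed below), multiplying by $C(\mathbf{x})$ amounts to a circular convolution, which the 2-D FFT computes efficiently; a minimal sketch:

```python
import numpy as np

def circular_conv(x, h):
    """Circular (periodic) 2-D convolution of image x with kernel h,
    i.e., the product C(x) h of Eq. (2), via FFT diagonalization."""
    return np.real(np.fft.ifft2(np.fft.fft2(x) * np.fft.fft2(h, s=x.shape)))
```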
In this setup, the filter estimation problem is addressed by solving the constrained minimization problem:
$$\min_{\mathbf{h}} \; \left\| \mathbf{y} - C(\mathbf{x})\,\mathbf{h} \right\|^2 + \lambda \left\| \mathbf{h} \right\|^2 + \mu \left( \left\| C(\mathbf{d}_v)\,\mathbf{h} \right\|^2 + \left\| C(\mathbf{d}_h)\,\mathbf{h} \right\|^2 \right) \quad \text{subject to} \quad \mathbf{h}^T \mathbf{1} = 1, \; \mathbf{h} \in \mathcal{H}, \tag{3}$$
where $(\cdot)^T$ is the transpose operator; $C(\mathbf{d}_v) \in \mathbb{R}^{N \times N}$ and $C(\mathbf{d}_h) \in \mathbb{R}^{N \times N}$ are BCCB matrices that are computed from the filters $\mathbf{d}_v$ and $\mathbf{d}_h$ and that perform the first-order finite difference in the vertical and horizontal directions, respectively; and $\mathbf{1}$ is a vector of all ones. This formulation derives from the combination of different terms. The first one is the so-called data-fitting term, which is the main quantity to be optimized. Then, there are two regularization terms, introduced due to the ill-posedness of this estimation problem [74], that put some constraints on the resulting solution. The first regularization term (namely, $\|\mathbf{h}\|^2$) helps to obtain a limited-energy solution. On the other hand, the second regularization term (namely, $\|C(\mathbf{d}_v)\mathbf{h}\|^2 + \|C(\mathbf{d}_h)\mathbf{h}\|^2$) is useful to enforce a smooth solution. The weights $\lambda$ and $\mu$ tune the importance of the regularization terms when finding the solution $\mathbf{h}$, which is subject to two constraints: it has to be normalized (i.e., $\mathbf{h}^T \mathbf{1} = 1$) and limited to a finite, non-empty and convex support $\mathcal{H} \subset \mathbb{R}^N$. As stated in [30,68], the choice of the squared $\ell_2$ norm $\|\cdot\|^2$ allows one to solve the problem in (3) in closed form. Indeed, the quadratic cost function within the minimization problem in (3) attains its (global) minimum when
$$\left[ C(\mathbf{x})^T C(\mathbf{x}) + \lambda \mathbf{I} + \mu\, C(\mathbf{d}_v)^T C(\mathbf{d}_v) + \mu\, C(\mathbf{d}_h)^T C(\mathbf{d}_h) \right] \mathbf{h} = C(\mathbf{x})^H \mathbf{y}, \tag{4}$$
where $\mathbf{I}$ is the identity matrix and $(\cdot)^H$ indicates the Hermitian transpose operator.
A computationally efficient solution of (4) exists and can be explicitly written in the frequency domain as
$$\mathbf{h} = \mathcal{F}^{-1}\left\{ \frac{\mathcal{F}\{\mathbf{x}\}^* \circ \mathcal{F}\{\mathbf{y}\}}{\mathcal{F}\{\mathbf{x}\}^* \circ \mathcal{F}\{\mathbf{x}\} + \lambda + \mu \left( \mathcal{F}\{\mathbf{d}_h\}^* \circ \mathcal{F}\{\mathbf{d}_h\} + \mathcal{F}\{\mathbf{d}_v\}^* \circ \mathcal{F}\{\mathbf{d}_v\} \right)} \right\}, \tag{5}$$
where the $\circ$ symbol indicates pointwise (entry-by-entry) multiplication (and, by extension, the fraction indicates pointwise division); $\mathcal{F}\{\cdot\}$ and $\mathcal{F}^{-1}\{\cdot\}$ are the 2-D discrete Fourier transform operator and its inverse, respectively; and $(\cdot)^*$ denotes the complex conjugate. Indeed, due to their BCCB structure, the matrices $C(\mathbf{x})$, $C(\mathbf{d}_h)$ and $C(\mathbf{d}_v)$ can be diagonalized by the 2-D discrete Fourier transform matrix, leading to a computational cost dominated by the number of operations required to perform the fast Fourier transform (FFT), i.e., $O(N \log N)$. Finally, in order to fully profit from the FFT, which assumes a periodic boundary structure of the image, a preprocessing step aimed at smoothing out the unavoidable discontinuities present in real-world images is needed. The adopted solution relies on blurring the image borders, as suggested in the image processing literature [75].
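A direct NumPy transcription of (5) is sketched below; the regularization weights are illustrative placeholders, and the finite-difference kernels $\mathbf{d}_h$ and $\mathbf{d}_v$ are zero-padded to the image size so that all the spectra share the same support. The normalization and support constraints are enforced afterwards, as described in Section 3.1.

```python
import numpy as np

def estimate_filter(x, y, lam=1e-5, mu=1e-3):
    """Closed-form filter estimate of Eq. (5) in the frequency domain.

    x : sharp image (generates C(x));  y : its blurred/degraded version.
    lam, mu : regularization weights (illustrative values)."""
    d_h = np.zeros(x.shape); d_h[0, 0], d_h[0, 1] = -1.0, 1.0   # horizontal difference
    d_v = np.zeros(x.shape); d_v[0, 0], d_v[1, 0] = -1.0, 1.0   # vertical difference
    X, Y = np.fft.fft2(x), np.fft.fft2(y)
    Dh, Dv = np.fft.fft2(d_h), np.fft.fft2(d_v)
    H = (np.conj(X) * Y) / (np.conj(X) * X + lam
                            + mu * (np.conj(Dh) * Dh + np.conj(Dv) * Dv))
    return np.real(np.fft.ifft2(H))
```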
In the following, we briefly describe the two most promising and effective approaches to the filter estimation task for pansharpening:
  • The filter estimation (FE) method [30], which estimates a single filter for all the MS channels;
  • The multi-band filter estimation (MBFE) algorithm [68], which overcomes the main limitation of the FE by estimating a filter for each band.

3.1. FE Algorithm

This approach consists of estimating a single filter; i.e., for all spectral bands $b \in \{1, \ldots, B\}$, $\mathbf{h}_b$ is equal to the same estimated filter, say $\mathbf{h}_P$. The subscript $P$ refers to the implementation of the FE algorithm that exploits the relationship between the PAN image $\mathbf{p} \in \mathbb{R}^N$ and an equivalent PAN image $\mathbf{p}_e \in \mathbb{R}^N$ generated by projecting the MS image into the PAN domain via the formula
$$\mathbf{p}_e = \mathbf{w}_{aug}\, \tilde{\mathbf{M}}_{aug}. \tag{6}$$
In (6), $\tilde{\mathbf{M}}_{aug} = [\tilde{\mathbf{m}}_1^T, \ldots, \tilde{\mathbf{m}}_b^T, \ldots, \tilde{\mathbf{m}}_B^T, \mathbf{1}^T]^T$ is obtained by stacking the lexicographically ordered single band images $\{\tilde{\mathbf{m}}_b\}_{b=1,\ldots,B}$ and the all-ones vector $\mathbf{1}$, and $\mathbf{w}_{aug} = [\mathbf{w}, w_0]$ is composed of the vector $\mathbf{w} = [w_1, \ldots, w_b, \ldots, w_B]$, whose elements are the weights measuring the overlap between the PAN image and each spectral band, and the bias coefficient $w_0$.
The algorithm is based on the following two interdependent alternating steps, starting from an initial estimate of the filter $\mathbf{h}_P$.
  • Estimation of the weights $\mathbf{w}_{aug}$. This step consists of imposing the equality between the image $\mathbf{p}_e$ defined in (6) and a low-pass filtered version of the PAN image computed via the current estimate of $\mathbf{h}_P$. Therefore, the estimate of the weights $\mathbf{w}_{aug}$ is easily found via a classic multivariate regression framework.
  • Estimation of the filter $\mathbf{h}_P$. This estimate is found by using (5), in which $\mathbf{p}_e$ plays the role of the blurred and degraded image $\mathbf{y}$ and $\mathbf{p}$ plays the role of the image $\mathbf{x}$. The resulting filter is finally normalized (in order to have a unitary gain) and the values outside a given window are set to zero.
To promote fast convergence of the iterative algorithm (usually reached in a couple of iterations), the Starck and Murtagh low-pass filter [76] (used in pansharpening in the popular "à trous" algorithm) is chosen as the initial guess for $\mathbf{h}_P$. Moreover, the maximum number of iterations is fixed at 10, in order to ensure that the algorithm stops.
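Putting the two steps together, the FE iteration can be sketched as follows, reusing the `circular_conv` and `estimate_filter` helpers sketched above; `constrain` is a hypothetical helper summarizing the post-processing of the second step (unit-gain normalization plus zeroing outside the support window).

```python
import numpy as np

def fe_algorithm(pan, ms_up, h0, constrain, n_iter=10):
    """Alternating FE estimation of a single filter h_P (sketch).

    pan : PAN image (Lr, Lc);  ms_up : upsampled MS cube (Lr, Lc, B)
    h0  : initial guess (e.g., the Starck-Murtagh kernel, zero-padded)."""
    h = h0
    B = ms_up.shape[2]
    # Regressors: the B upsampled MS bands plus a constant bias column
    A = np.column_stack([ms_up[:, :, b].ravel() for b in range(B)]
                        + [np.ones(pan.size)])
    for _ in range(n_iter):
        # Step 1: regress the current low-pass PAN on the MS bands (+ bias)
        p_low = circular_conv(pan, h)
        w_aug, *_ = np.linalg.lstsq(A, p_low.ravel(), rcond=None)
        # Step 2: update the filter via Eq. (5), with p_e as "degraded" image
        p_e = (A @ w_aug).reshape(pan.shape)
        h = constrain(estimate_filter(pan, p_e))
    return h
```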

3.2. MBFE Algorithm

An effective algorithm aimed at estimating a specific degradation filter $\mathbf{h}_b$ for each band $b \in \{1, \ldots, B\}$ is the MBFE method [68]. The low coherence between some bands of the MS image and the PAN image prevents satisfactory performance when $\mathbf{h}_b$ is estimated directly from the PAN image [30]. This problem is solved by generating an initial estimate of $\hat{\mathbf{M}}$ (denoted by $\hat{\mathbf{M}}^0 = \{\hat{M}_b^0\}_{b=1,\ldots,B}$) that can be used as an approximation of the ground truth (GT) for the filter estimation task, ensuring higher coherence between the high and the low resolution MS images.
Natural candidates for estimating $\hat{\mathbf{M}}^0$ are the CS-based methods. Indeed, they are able to generate a fused product that completely retains the PAN spatial details [30], which are key for an accurate estimation of the filters. Therefore, for each band $b \in \{1, \ldots, B\}$, the estimation of $\mathbf{h}_b$ is performed by using (5), in which $\tilde{\mathbf{m}}_b$ plays the role of the blurred and degraded image $\mathbf{y}$ and $\hat{\mathbf{m}}_b^0$ (i.e., the lexicographically ordered version of $\hat{M}_b^0$) plays the role of the image $\mathbf{x}$. Among the CS algorithms, the band-dependent spatial-detail (BDSD) technique [77] and the Gram–Schmidt adaptive (GSA) method [17] have proven to generate initial guess images $\hat{\mathbf{M}}^0$ that are well suited for use in MBFE [68]. Therefore, in the following, we will consider two MBFE variants: the MBFE BDSD and the MBFE GSA.
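Under the same assumptions, and reusing the `estimate_filter` helper sketched in Section 3, the MBFE loop reduces to one deconvolution per band, with the CS fused product (e.g., from BDSD or GSA) acting as the proxy ground truth:

```python
def mbfe(ms_up, ms_hat0):
    """Per-band filter estimation (sketch): for each band b, the upsampled
    MS band plays the role of y and the CS initial estimate plays the role
    of x in Eq. (5)."""
    B = ms_up.shape[2]
    return [estimate_filter(ms_hat0[:, :, b], ms_up[:, :, b]) for b in range(B)]
```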

4. MLR-Based Injection

In this section we briefly present the MLR-based injection scheme, which is the second pillar of the proposed pansharpening architecture. This scheme is a natural extension of the following classical approaches, in which $F_b[\cdot]$ is linear.
  • CBD Injection Scheme. In the context-based decision (CBD) injection model, for each channel $b$, the details of the PAN image are multiplied by a scalar coefficient, namely,
    $$F_b[\Delta P_b] = g_b\, \Delta P_b. \tag{7}$$
    The injection coefficients $g_b$, $b \in \{1, \ldots, B\}$, are computed by the regression of the $b$-th MS channel on the PAN image. It is worth noting that this scheme is also used in other pansharpening algorithms, such as the aforementioned GSA.
  • HPM Injection Scheme. The high-pass modulation (HPM) injection scheme relies on the pointwise multiplication of the PAN details by a coefficient matrix $\mathbf{G}_b$, according to
    $$F_b[\Delta P_b] = \mathbf{G}_b \circ \Delta P_b. \tag{8}$$
    In this case too, other pansharpening algorithms use this scheme, such as the Brovey transform (BT), a classic multiplicative scheme belonging to the CS family [78]. Common forms of both gain computations are sketched in the snippet after this list.
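The following minimal sketch shows common forms of these two gain computations, assuming the MS band has been upsampled to the PAN scale and `pan_low_b` is the low-pass PAN $P_b^L$ used for detail extraction; the exact estimators (e.g., global versus local windows) vary across implementations.

```python
import numpy as np

def cbd_gain(ms_up_b, pan_low_b):
    """CBD-style scalar gain: regression coefficient of the MS band
    on the low-pass PAN (covariance over variance)."""
    c = np.cov(ms_up_b.ravel(), pan_low_b.ravel())
    return c[0, 1] / c[1, 1]

def hpm_gain(ms_up_b, pan_low_b, eps=1e-9):
    """HPM-style pointwise modulation coefficients (ratio image);
    eps guards against division by zero."""
    return ms_up_b / (pan_low_b + eps)
```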
On the contrary, the proposed scheme employs a non-linear injection function $F_b[\cdot]$ in (1), which is approximated via a polynomial expansion of order $M$ around zero, i.e.,
$$F_b[\Delta P_b] = \sum_{m=0}^{M} g_{b,m} \left( \Delta P_b \right)^m. \tag{9}$$
This formulation is linear with respect to the coefficients $\{g_{b,m}\}_{m=0,\ldots,M}$; therefore, it is possible to use the MLR framework [79], which is the multidimensional extension of the classic ordinary least squares method. In more detail, we should estimate the coefficients $\{g_{b,m}\}_{m=0,\ldots,M}$ by solving the problem
$$\Delta \hat{M}_b = \sum_{m=0}^{M} g_{b,m} \left( \Delta P_b \right)^m + R_b, \tag{10}$$
where $\Delta \hat{M}_b = \hat{M}_b - \tilde{M}_b$ are the details of the target MS image and the optimal (in the least-squares sense) coefficients $\{g_{b,m}\}_{m=0,\ldots,M}$ minimize the Frobenius norm of the residuals $R_b$. Unfortunately, it is impossible to compute the details of the MS image, because this would require the knowledge of $\hat{M}_b$—that is, the image to estimate. Therefore, the solution is to solve the reduced resolution companion problem
$$\delta M_b = \sum_{m=0}^{M} g_{b,m}^{RR} \left( \delta P_b^{RR} \right)^m + R_b^{RR}, \tag{11}$$
defined in terms of the corresponding reduced resolution versions, indicated by the superscript $RR$. More specifically,
$$\delta M_b = M_b - \mathbf{h}_b \ast M_b, \qquad \delta P_b^{RR} = P_b^{RR} - \mathbf{h}_b \ast P_b^{RR}, \tag{12}$$
where $\ast$ denotes convolution and $P_b^{RR}$ is the downsampled version of $P_b^L$ defined in Section 2 (see also Figure 2). Finally, according to the findings of [29], we use $M = 2$, which offers a good trade-off between the complexity of the model and its performance.
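A least-squares solution of the reduced resolution problem (11) admits a compact sketch; here `delta_m` and `delta_p` are the detail images of Eq. (12) for one band, and the fitted coefficients are then applied to the full-resolution PAN details as in Eq. (9).

```python
import numpy as np

def mlr_coefficients(delta_m, delta_p, M=2):
    """Fit the polynomial injection coefficients g_{b,m} of Eq. (11)
    by ordinary least squares (one call per band b)."""
    A = np.column_stack([delta_p.ravel() ** m for m in range(M + 1)])
    g, *_ = np.linalg.lstsq(A, delta_m.ravel(), rcond=None)
    return g  # g[0]: bias, g[1]: linear term, g[2]: quadratic term

def mlr_inject(delta_pan, g):
    """Apply Eq. (9): polynomial injection of the PAN details."""
    return sum(gm * delta_pan ** m for m, gm in enumerate(g))
```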

5. Experimental Results

The crucial phase of this study was constituted by the experimental tests that allowed us to verify the suitability of the proposed technique. Due to the above-mentioned lack of a reference image, the assessment of fusion algorithms had to be performed in two distinct steps, involving the evaluation of the performance at both reduced scale and full scale [1]. The former consisted of reproducing the fusion problem at a lower resolution through the appropriate degradation of the available images. In particular, the choice of a scaling factor equal to the resolution ratio $R$ allows one to downsize the PAN image to the resolution of the original MS image, which can thus be used as the fusion target. In this case, some indexes can be used for the evaluation of the image quality. On the other hand, changing the working resolution does not represent an ideal solution for two main reasons. The information concerning the illuminated area is somewhat different at the two scales, due to a substantial reduction of perceivable details. Moreover, the image processing algorithms employed for coarsening the available images can only approximate the sensor acquisition, leading to significant deviations from the actual operating scenarios. For this reason, the analysis of the algorithms' behavior at the original resolution is mandatory, even though the reliability of the applicable quality measures is highly questionable.
We utilized two different datasets for the assessment of the proposed technique, comparing it to the MS image interpolation using a polynomial kernel with 23 coefficients (EXP) and many classical approaches, i.e., non-linear IHS (NLIHS) [15], Gram–Schmidt (GS) [16], Gram–Schmidt adaptive (GSA) [17], band-dependent spatial-detail (BDSD) [77], smoothing filter-based intensity modulation (SFIM) [69], additive à trous wavelet transform (ATWT) [26] and the pyramidal decomposition scheme using morphological filters based on half gradient (MF-HG) [80]. Obviously, we included in the comparison the baseline methods that constitute the starting point of the proposed approach, namely, the GLP using MTF-matched filter [21] with both multiplicative (HPM) [26] and regression-based (CBD) [70] injection models. Moreover, we considered the more recent versions of the MTF-GLP approaches that include the filter estimation procedure, based on the estimation of either a single filter (FE) [30], or a different filter for each band (MBFE) [68]. Analogously, we report the results related to the sole introduction of the MLR injection scheme [29] within the baseline methods.

5.1. Datasets

The datasets were selected to cover the most typical settings encountered in practice. For this reason, two different cases were considered: the fusion of a PAN image with a 4-band MS image and the fusion of a PAN image with an 8-band MS image. Moreover, the two datasets refer to different scenarios, one constituted by a mountainous, partly vegetated area and one representing an urban zone.
The China dataset was also used in the assessment of the classical methods presented in [1] and is composed of images collected by the IKONOS sensor over the Sichuan region in China (see the images on the left in Figure 3). The MS image had four channels (blue, green, red and near-infrared (NIR)) and the spatial sampling interval (SSI) was 1.2 m. The IKONOS resolution ratio between the MS and the PAN image was $R = 4$ and the radiometric resolution was 11 bits. A cut of size $300 \times 300$ pixels of the original MS image was employed in this work as the ground truth (GT) for the reduced resolution assessment.
The Tripoli dataset was acquired by the WorldView-3 satellite and was used to test the capability of the proposed method for the enhancement of an MS image with a larger number of bands (coastal, blue, green, yellow, red, red edge, NIR1 and NIR2). The employed imagery included a PAN image of size $1024 \times 2048$ pixels and an MS image of size $256 \times 256$ pixels; also in this case, $R = 4$. The WorldView (WV)-3 MS sensor is characterized by a 1.2 m SSI and a radiometric resolution of 11 bits. The Tripoli dataset was used both at the original scale, for performing the full resolution assessment, and at a lower scale, obtained by degrading the original images by a factor $R = 4$.

5.2. Reduced Resolution Validation

The reduced resolution (RR) assessment represents a very crucial phase in the evaluation of algorithm performance, since the availability of the reference image allows one to accurately evaluate the quality of the final product. The method consists of using the original MS image as the target product that has to be obtained by combining the degraded versions of the same MS image and of the original PAN. The procedure for completing such experiments has been formalized through Wald's protocol [81], which is based on both the consistency and the synthesis properties. While the latter specifies the characteristics of the fused image, the former requires that the fused high resolution MS image yields the low resolution MS image once properly degraded. This means that the degradation systems should reproduce the overall acquisition process that yields the real images. Accordingly, the amplitude frequency responses of the MS degradation filters are matched to the MTFs of the MS sensor, while an ideal filter is employed to decimate the PAN image [21].
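As a minimal sketch of this degradation step, assuming (as is common practice) a Gaussian-shaped approximation of the sensor MTF, the filter standard deviation can be set so that the frequency response equals a sensor-dependent gain `g_nyq` at the Nyquist frequency of the decimated image; the default value below is illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def mtf_degrade(band, g_nyq=0.29, R=4):
    """Low-pass one MS band with a Gaussian MTF-matched filter, then decimate.
    sigma is chosen so that exp(-2 pi^2 sigma^2 f^2) = g_nyq at f = 1/(2R)."""
    sigma = (R / np.pi) * np.sqrt(-2.0 * np.log(g_nyq))
    return gaussian_filter(band, sigma)[::R, ::R]
```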
The quality of the fused products can thus be assessed through the several indexes that have been developed. We adopted four widespread measures: the well-known peak signal-to-noise ratio (PSNR); the relative dimensionless global error in synthesis (ERGAS) [82], a normalized version of the root mean square error (RMSE); the spectral angle mapper (SAM) [83], which quantifies the spectral distortion as the mean angle between the fused and reference pixel vectors; and the $Q2^n$ index [84,85], which extends the universal image quality index (UIQI) [86] to multi-channel images.
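Two of these measures admit compact implementations; the following sketches compute SAM (mean spectral angle between reference and fused pixel vectors, in degrees) and ERGAS (band-averaged relative RMSE scaled by the resolution ratio) for reference and fused cubes of shape (rows, cols, B).

```python
import numpy as np

def sam(ref, fus, eps=1e-12):
    """Spectral Angle Mapper: mean angle between pixel spectra, in degrees."""
    num = np.sum(ref * fus, axis=2)
    den = np.linalg.norm(ref, axis=2) * np.linalg.norm(fus, axis=2) + eps
    return np.degrees(np.mean(np.arccos(np.clip(num / den, -1.0, 1.0))))

def ergas(ref, fus, R=4):
    """ERGAS: 100/R times the RMS of the band-wise RMSE-to-mean ratios."""
    B = ref.shape[2]
    terms = [np.mean((ref[:, :, b] - fus[:, :, b]) ** 2)
             / np.mean(ref[:, :, b]) ** 2 for b in range(B)]
    return (100.0 / R) * np.sqrt(np.mean(terms))
```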
The main results of the RR assessment are summarized in Table 1, where the comparison of the proposed technique with both the baseline methods and the other cited pansharpening approaches is shown.
To give some additional insight into data fusion performance, we also present some closeups for the two RR datasets in Figure 4 and Figure 5. Moreover, focusing only on the GLP detail-extraction scheme, we show several $Q2^n$ maps for the Tripoli dataset in Figure 6.

5.3. Full Resolution Validation

The full resolution (FR) assessment allowed us to analyze the behavior of the algorithms at their effective working scale. In particular, the Tripoli dataset contains images with high resolution details, namely, with physical dimensions very similar to the SSI of the sensors. Accordingly, it can be properly used for this second investigation phase, in which the visual analysis assumes a central role. In fact, all the available quality indexes cannot be considered totally reliable because they assess the final product without a reference image.
For this reason, the quantitative evaluation is typically performed by measuring the coherence of the pansharpened product with the original available images. In particular, one assesses the spectral similarity of the fused image and the low resolution MS image and the correspondence between the PAN and MS spatial details at the original and enhanced resolutions [87,88].
The quality with no reference (QNR) index [88] is the best known measure adopting this rationale. It is composed of a spectral index, which measures the relationships among the MS channels, and a spatial index, which quantifies the amount and appropriateness of the spatial details present in each band. Several other quality indexes have been proposed in the literature for the full resolution assessment [23,89,90,91,92]. We adopted here the hybrid QNR (HQNR) [93], which combines the QNR spatial index $D_S$ with the spectral index $D_\lambda$ proposed in [89], thereby providing appreciable soundness and computational efficiency [5].
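Both QNR and HQNR combine a spectral and a spatial distortion into a single score; assuming the usual multiplicative combination with unit exponents (the computation of $D_\lambda$ and $D_S$ themselves follows [88,89,93]), the final index can be sketched as:

```python
def hqnr(d_lambda, d_s, alpha=1.0, beta=1.0):
    """Combine the spectral (D_lambda) and spatial (D_S) distortions into a
    single no-reference score; the ideal value is 1 (no measured distortion)."""
    return (1.0 - d_lambda) ** alpha * (1.0 - d_s) ** beta
```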
Table 2 reports the values of the HQNR indexes computed by applying the considered pansharpening algorithms to the Tripoli dataset.

6. Discussion

The reduced resolution and the full resolution assessment protocols allowed us to provide a clear illustration of the results achievable through the proposed pansharpening scheme. The key considerations that can be derived from these two complementary phases are detailed in the following sections.

6.1. Reduced Resolution

The most evident property is the high performance achieved by the proposed approach in both tests. This behavior stood out on our datasets, which present complementary features of the illuminated scene. Indeed, as can be noticed by examining most of the compared algorithms, it is difficult to find methods characterized by optimal performance in both scenarios. Even the baseline methods, namely, the implementations of the GLP approaches exploiting the HPM and the CBD injection schemes, were not immune to this performance tradeoff, due to the large presence of high resolution details in the Tripoli dataset, whose counterparts are the large homogeneous zones in the China dataset.
In both cases, the best results in terms of the most comprehensive index, namely, the $Q2^n$, were obtained by the methods that utilize the multi-band filter estimation. In particular, the MBFE-BDSD-MLR approach achieved the highest value for the China dataset, and the MBFE-GSA-MLR produced the best image for the Tripoli dataset. An important remark regards the single filter estimation approach (FE-MLR), which showed remarkable robustness, since it achieved results very similar to those of the best methods; this was partly due to the shape of the MTFs of the various bands, which are characterized by almost equal gains at the Nyquist frequency. Moreover, one can note that the improvement of the final product quality implied by the filter estimation procedure always comes at the expense of the spectral quality of the image, as demonstrated by the higher values of the SAM index with respect to the GLP-MLR. In fact, the MLR coefficient estimation aims at optimizing the detail injection scheme for each specific channel, without taking into account the spectral coherence of the final product. Naturally, this issue is made worse by the multiband approach, which estimates a different filter for each band, causing a more significant spectral unbalance in the pansharpened image. Nevertheless, this issue is largely compensated by the greater ability to inject the most useful spatial information contained in the PAN image.
The suitability of the proposed approaches is also testified by the closeups shown in Figure 4 and Figure 5, which highlight the capability of producing images with accurate spatial reproduction of the details, without excessively sacrificing the chromatic fidelity of the objects present in the scene. The performance analysis can be eased by evaluating the algorithms in pairs, as we show in Figure 6, where the MBFE-GSA-MLR is compared, in terms of $Q2^n$ map, to six other methods based on the GLP detail-extraction scheme. The green pixels represent the zones of the images in which the MBFE-GSA-MLR achieved higher quality index scores with respect to the competitors, and the red pixels highlight the opposite. The proposed method achieved almost uniform performance improvements with respect to the approaches compared in panels (a)–(d). More specifically, panels (b) and (d) show that the MBFE-GSA-MLR is significantly superior to the algorithms that implement only the MLR injection scheme (GLP-MLR) or only the MBFE technique (MBFE-GSA-CBD), thereby motivating the joint use of the two methods. The uniformity of the green pixels demonstrates that the performance increase was not due to the improvement of a specific zone of the image, but rather to a more precise evaluation of the best extraction filter and of a specific formula for the data combination. Panels (e) and (f) illustrate a more diversified result, which is a consequence of an alternative injection scheme. In fact, the HPM modulates the PAN details point-wise, thereby yielding a very different final product. However, the higher overall quality of the MBFE-GSA-MLR approach can be easily inferred from the larger extent of the green zones, which are also characterized by high color saturation, indicating a significant improvement of the $Q2^n$ value.

6.2. Full Resolution

The results corroborate the analysis carried out at reduced resolution, showing that the approaches using an estimated filter for the extraction and a multilinear regression for the injection obtained the best overall results. In particular, the use of a specific index, $D_S$, assessing the spatial quality of the images confirms the deduction that the main improvements were obtained in terms of a more faithful reproduction of the geometric information.
Further information can be derived from the visual analysis of the pansharpened products. We present in Figure 7 the injected details, namely, the differences between the final products $\hat{\mathbf{M}}$ and the upsampled image $\tilde{\mathbf{M}}$. Panels (m) and (n) immediately stand out for the richness and intelligibility of the representation, which testify to the accurate reproduction of the object borders detectable at the highest scale. Moreover, although Table 2 confirms that the proposed methods resulted in slightly worse spectral quality of the final products, the homogeneity of the detail images demonstrates that the particulars were not excessively boosted in any specific zone or band, as happened for the HPM-based schemes.

6.3. Computational Analysis

The analysis of the computational complexity of the proposed approach is finally reported in Table 3, which contains the times required by an Intel® Core™ i7 3.2 GHz processor to complete the fusion process. Almost all the approaches exploiting a multiresolution decomposition of the images required perceptibly more computational effort, since the considered images are quite large. A further increase occurred for the filter estimation procedure, whose effort is proportional to the number of impulse responses to be estimated. Accordingly, the multiband (MBFE) approach took twice as much computational time as the baseline GLP approach in the case of eight bands, though the additional effort required by the single filter (FE) method is almost negligible. In any case, the main point is that the proposed approach strictly preserves the feasibility of the classical methods, thereby representing a viable technique for processing large amounts of data.

7. Conclusions

In this work, a step forward has been made with respect to existing techniques for the efficient combination of multispectral and panchromatic images acquired by the same satellite. The usefulness of pansharpened data for many applications demands that providers perform the fusion process efficiently, and thus most algorithms still resort to physical models to ease their adaptation to specific datasets.
In this class of approaches, the generalized Laplacian pyramid has emerged as the most widespread method, since it combines the accurate reproduction of the acquisition process characteristics with high computational efficiency. However, some improvements that do not result in an excessive computational burden are conceivable. More specifically, in this work we validated the joint use of a filter estimation procedure, which allows one to easily adapt the shape of the detail-extraction filters to the specific imagery, and of a polynomial combination function, which allows one to more properly inject the PAN information.
The effectiveness of the proposed scheme has been tested on two different datasets, which are characterized by dissimilar features of the illuminated scene and were acquired by different sensors. The most important quality of the designed approach is its capacity to achieve the best performance among the tested methods in both scenarios, unlike all the existing techniques. Among the possible implementations of the proposed approach, this study has also highlighted that estimating a single filter for all the multispectral image channels yields an even more efficient algorithm, without significantly sacrificing the overall quality of the final product.
Finally, future studies and developments will be devoted to extending the proposed architecture to hyperspectral sharpening, given the great interest in the related applications [94,95], and to the fusion of thermal data [96].

Author Contributions

Conceptualization, P.A., R.R. and G.V.; methodology, P.A., R.R. and G.V.; software, P.A., R.R. and G.V.; validation, P.A., R.R. and G.V.; formal analysis, P.A., R.R. and G.V.; investigation, P.A.; data curation, G.V.; writing—original draft preparation, R.R. and P.A.; writing—review and editing, G.V.; visualization, P.A. and R.R.; supervision, G.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available since they have been removed from the original URL.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Vivone, G.; Alparone, L.; Chanussot, J.; Dalla Mura, M.; Garzelli, A.; Licciardi, G.; Restaino, R.; Wald, L. A Critical Comparison Among Pansharpening Algorithms. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2565–2586.
  2. Ghassemian, H. A review of remote sensing image fusion methods. Inf. Fusion 2016, 32, 75–89.
  3. Li, S.; Kang, X.; Fang, L.; Hu, J.; Yin, H. Pixel-level image fusion: A survey of the state of the art. Inf. Fusion 2017, 33, 100–112.
  4. Meng, X.; Shen, H.; Li, H.; Zhang, L.; Fu, R. Review of the pansharpening methods for remote sensing images based on the idea of meta-analysis: Practical discussion and challenges. Inf. Fusion 2019, 46, 102–113.
  5. Vivone, G.; Dalla Mura, M.; Garzelli, A.; Scarpa, G.; Ulfarsson, M.; Restaino, R.; Alparone, L.; Chanussot, J. A New Benchmark Based on Recent Advances in Multispectral Pansharpening: Revisiting pansharpening with classical and emerging pansharpening method. IEEE Geosci. Remote Sens. Mag. 2021, 9, 184–199.
  6. Gilbertson, J.K.; Kemp, J.; Van Niekerk, A. Effect of pan-sharpening multi-temporal Landsat 8 imagery for crop type differentiation using different classification techniques. Comput. Electron. Agric. 2017, 134, 151–159.
  7. Deur, M.; Gašparović, M.; Balenović, I. An Evaluation of Pixel- and Object-Based Tree Species Classification in Mixed Deciduous Forests Using Pansharpened Very High Spatial Resolution Satellite Imagery. Remote Sens. 2021, 13, 1868.
  8. Jones, E.G.; Wong, S.; Milton, A.; Sclauzero, J.; Whittenbury, H.; McDonnell, M.D. The Impact of Pan-Sharpening and Spectral Resolution on Vineyard Segmentation through Machine Learning. Remote Sens. 2020, 12, 934.
  9. Maurya, A.K.; Varade, D.; Dikshit, O. Effect of Pansharpening in Fusion Based Change Detection of Snow Cover Using Convolutional Neural Networks. IETE Tech. Rev. 2020, 37, 465–475.
  10. Lombard, F.; Andrieu, J. Mapping Mangrove Zonation Changes in Senegal with Landsat Imagery Using an OBIA Approach Combined with Linear Spectral Unmixing. Remote Sens. 2021, 13, 1961.
  11. Dong, Y.; Ren, Z.; Fu, Y.; Miao, Z.; Yang, R.; Sun, Y.; He, X. Recording Urban Land Dynamic and Its Effects during 2000–2019 at 15-m Resolution by Cloud Computing with Landsat Series. Remote Sens. 2020, 12, 2451.
  12. Noviello, M.; Ciminale, M.; De Pasquale, V. Combined application of pansharpening and enhancement methods to improve archaeological cropmark visibility and identification in QuickBird imagery: Two case studies from Apulia, Southern Italy. J. Archaeol. Sci. 2013, 40, 3604–3613.
  13. Kwan, C.; Budavari, B.; Dao, M.; Ayhan, B.; Bell, J.F. Pansharpening of Mastcam images. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017; pp. 5117–5120.
  14. Carper, W.; Lillesand, T.; Kiefer, R. The use of intensity-hue-saturation transformations for merging SPOT panchromatic and multispectral image data. Photogramm. Eng. Remote Sens. 1990, 56, 459–467.
  15. Ghahremani, M.; Ghassemian, H. Nonlinear IHS: A promising method for pan-sharpening. IEEE Geosci. Remote Sens. Lett. 2016, 13, 1606–1610.
  16. Laben, C.A.; Brower, B.V. Process for Enhancing the Spatial Resolution of Multispectral Imagery Using Pan-Sharpening. U.S. Patent 6,011,875, 4 January 2000.
  17. Aiazzi, B.; Baronti, S.; Selva, M. Improving component substitution pansharpening through multivariate regression of MS+Pan data. IEEE Trans. Geosci. Remote Sens. 2007, 45, 3230–3239.
  18. Chavez, P.S., Jr.; Kwarteng, A.W. Extracting spectral contrast in Landsat Thematic Mapper image data using selective principal component analysis. Photogramm. Eng. Remote Sens. 1989, 55, 339–348.
  19. Licciardi, G.A.; Khan, M.M.; Chanussot, J.; Montanvert, A.; Condat, L.; Jutten, C. Fusion of hyperspectral and panchromatic images using multiresolution analysis and nonlinear PCA band reduction. EURASIP J. Adv. Signal Process. 2012, 2012, 1–17.
  20. Aiazzi, B.; Alparone, L.; Baronti, S.; Garzelli, A. Context-driven fusion of high spatial and spectral resolution images based on oversampled multiresolution analysis. IEEE Trans. Geosci. Remote Sens. 2002, 40, 2300–2312.
  21. Aiazzi, B.; Alparone, L.; Baronti, S.; Garzelli, A.; Selva, M. MTF-tailored multiscale fusion of high-resolution MS and Pan imagery. Photogramm. Eng. Remote Sens. 2006, 72, 591–596.
  22. Vivone, G.; Restaino, R.; Chanussot, J. A Regression-Based High-Pass Modulation Pansharpening Approach. IEEE Trans. Geosci. Remote Sens. 2018, 56, 984–996.
  23. Vivone, G.; Restaino, R.; Chanussot, J. A Bayesian procedure for full-resolution quality assessment of pansharpened products. IEEE Trans. Geosci. Remote Sens. 2018, 56, 4820–4834.
  24. Núñez, J.; Otazu, X.; Fors, O.; Prades, A.; Palà, V.; Arbiol, R. Multiresolution-based image fusion with additive wavelet decomposition. IEEE Trans. Geosci. Remote Sens. 1999, 37, 1204–1211.
  25. Otazu, X.; González-Audícana, M.; Fors, O.; Núñez, J. Introduction of sensor spectral response into image fusion methods. Application to wavelet-based methods. IEEE Trans. Geosci. Remote Sens. 2005, 43, 2376–2385.
  26. Vivone, G.; Restaino, R.; Dalla Mura, M.; Licciardi, G.; Chanussot, J. Contrast and Error-based Fusion Schemes for Multispectral Image Pansharpening. IEEE Geosci. Remote Sens. Lett. 2014, 11, 930–934.
  27. Alparone, L.; Garzelli, A.; Vivone, G. Inter-Sensor Statistical Matching for Pansharpening: Theoretical Issues and Practical Solutions. IEEE Trans. Geosci. Remote Sens. 2017, 55, 4682–4695.
  28. Vivone, G.; Alparone, L.; Garzelli, A.; Lolli, S. Fast Reproducible Pansharpening Based on Instrument and Acquisition Modeling: AWLP Revisited. Remote Sens. 2019, 11, 2315.
  29. Restaino, R.; Vivone, G.; Addesso, P.; Chanussot, J. A Pansharpening Approach Based on Multiple Linear Regression Estimation of Injection Coefficients. IEEE Geosci. Remote Sens. Lett. 2020, 17, 102–106.
  30. Vivone, G.; Simões, M.; Dalla Mura, M.; Restaino, R.; Bioucas-Dias, J.; Licciardi, G.; Chanussot, J. Pansharpening Based on Semiblind Deconvolution. IEEE Trans. Geosci. Remote Sens. 2015, 53, 1997–2010.
  31. Vicinanza, M.R.; Restaino, R.; Vivone, G.; Dalla Mura, M.; Licciardi, G.; Chanussot, J. A Pansharpening Method Based on the Sparse Representation of Injected Details. IEEE Geosci. Remote Sens. Lett. 2015, 12, 180–184.
  32. Palsson, F.; Sveinsson, J.R.; Ulfarsson, M.O. A New Pansharpening Algorithm Based on Total Variation. IEEE Geosci. Remote Sens. Lett. 2014, 11, 318–322.
  33. Masi, G.; Cozzolino, D.; Verdoliva, L.; Scarpa, G. Pansharpening by Convolutional Neural Networks. Remote Sens. 2016, 8, 594.
  34. Wei, Y.; Yuan, Q.; Shen, H.; Zhang, L. Boosting the accuracy of multispectral image pansharpening by learning a deep residual network. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1795–1799.
  35. Yang, J.; Fu, X.; Hu, Y.; Huang, Y.; Ding, X.; Paisley, J. PanNet: A Deep Network Architecture for Pan-Sharpening. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 1753–1761.
  36. Scarpa, G.; Vitale, S.; Cozzolino, D. Target-adaptive CNN-based pansharpening. IEEE Trans. Geosci. Remote Sens. 2018, 56, 5443–5457.
  37. Deng, L.; Vivone, G.; Jin, C.; Chanussot, J. Detail Injection-Based Deep Convolutional Neural Networks for Pansharpening. IEEE Trans. Geosci. Remote Sens. 2021, 59, 6995–7010.
  38. Amro, I.; Mateos, J.; Vega, M.; Molina, R.; Katsaggelos, A.K. A survey of classical methods and new trends in pansharpening of multispectral images. EURASIP J. Adv. Signal Process. 2011, 2011, 79:1–79:22.
  39. Aiazzi, B.; Alparone, L.; Baronti, S.; Garzelli, A.; Selva, M. Twenty-Five Years of Pansharpening: A Critical Review and New Developments. In Signal and Image Processing for Remote Sensing, 2nd ed.; Chen, C.H., Ed.; CRC Press: Boca Raton, FL, USA, 2012; pp. 533–548.
  40. Dadrass Javan, F.; Samadzadegan, F.; Mehravar, S.; Toosi, A.; Khatami, R.; Stein, A. A review of image fusion techniques for pan-sharpening of high-resolution satellite imagery. ISPRS J. Photogramm. Remote Sens. 2021, 171, 101–117.
  41. Garzelli, A. A Review of Image Fusion Algorithms Based on the Super-Resolution Paradigm. Remote Sens. 2016, 8, 797.
  42. Li, S.; Yang, B. A New Pan-Sharpening Method Using a Compressed Sensing Technique. IEEE Trans. Geosci. Remote Sens. 2011, 49, 738–746.
  43. Li, S.; Yin, H.; Fang, L. Remote Sensing Image Fusion via Sparse Representations Over Learned Dictionaries. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4779–4789.
  44. Zhu, X.X.; Bamler, R. A Sparse Image Fusion Algorithm With Application to Pan-Sharpening. IEEE Trans. Geosci. Remote Sens. 2013, 51, 2827–2836.
  45. Cheng, M.; Wang, C.; Li, J. Sparse representation based pansharpening using trained dictionary. IEEE Geosci. Remote Sens. Lett. 2014, 11, 293–297.
  46. Zhu, X.X.; Grohnfeld, C.; Bamler, R. Exploiting Joint Sparsity for Pansharpening: The J-SparseFI Algorithm. IEEE Trans. Geosci. Remote Sens. 2016, 54, 2664–2681.
  47. Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36.
  48. Huang, W.; Xiao, L.; Wei, Z.; Liu, H.; Tang, S. A New Pan-Sharpening Method With Deep Neural Networks. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1037–1041.
  49. Zhong, J.; Yang, B.; Huang, G.; Zhong, F.; Chen, Z. Remote Sensing Image Fusion with Convolutional Neural Network. Sens. Imaging 2016, 17, 10.
  50. Rao, Y.; He, L.; Zhu, J. A residual convolutional neural network for pan-sharpening. In Proceedings of the 2017 International Workshop on Remote Sensing with Intelligent Processing (RSIP), Shanghai, China, 18–21 May 2017; pp. 1–4.
  51. Yuan, Q.; Wei, Y.; Meng, X.; Shen, H.; Zhang, L. A Multiscale and Multidepth Convolutional Neural Network for Remote Sensing Imagery Pan-Sharpening. IEEE J. Sel. Top. Appl. Earth Observ. 2018, 11, 978–989.
  52. Shao, Z.; Cai, J. Remote Sensing Image Fusion With Deep Convolutional Neural Network. IEEE J. Sel. Top. Appl. Earth Observ. 2018, 11, 1656–1669.
  53. Yao, W.; Zeng, Z.; Lian, C.; Tang, H. Pixel-wise regression using U-Net and its application on pansharpening. Neurocomputing 2018, 312, 364–371.
  54. Liu, X.; Wang, Y.; Liu, Q. PSGAN: A Generative Adversarial Network for Remote Sensing Image Pan-Sharpening. In Proceedings of the 2018 IEEE International Conference on Image Processing, Athens, Greece, 7–10 October 2018; pp. 873–877.
  55. Zhang, Y.; Liu, C.; Sun, M.; Ou, Y. Pan-Sharpening Using an Efficient Bidirectional Pyramid Network. IEEE Trans. Geosci. Remote Sens. 2019, 57, 5549–5563.
  56. Mei, S.; Yuan, X.; Ji, J.; Zhang, Y.; Wan, S.; Du, Q. Hyperspectral Image Spatial Super-Resolution via 3D Full Convolutional Neural Network. Remote Sens. 2017, 9, 1139.
  57. Lanaras, C.; Bioucas-Dias, J.; Galliani, S.; Baltsavias, E.; Schindler, K. Super-resolution of Sentinel-2 images: Learning a globally applicable deep neural network. ISPRS J. Photogramm. Remote Sens. 2018, 146, 305–319.
  58. Gargiulo, M.; Mazza, A.; Gaetano, R.; Ruello, G.; Scarpa, G. Fast Super-Resolution of 20 m Sentinel-2 Bands Using Convolutional Neural Networks. Remote Sens. 2019, 11, 2635.
  59. Hu, J.F.; Huang, T.Z.; Deng, L.J.; Jiang, T.X.; Vivone, G.; Chanussot, J. Hyperspectral Image Super-Resolution via Deep Spatiospectral Attention Convolutional Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2021, 1–15.
  60. Qu, Y.; Qi, H.; Kwan, C. Unsupervised Sparse Dirichlet-Net for Hyperspectral Image Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 2511–2520.
  61. Qu, Y.; Qi, H.; Kwan, C. Unsupervised and Unregistered Hyperspectral Image Super-Resolution with Mutual Dirichlet-Net. arXiv 2019, arXiv:1904.12175.
  62. Vitale, S.; Scarpa, G. A Detail-Preserving Cross-Scale Learning Strategy for CNN-Based Pansharpening. Remote Sens. 2020, 12, 348.
  63. Ma, J.; Yu, W.; Chen, C.; Liang, P.; Guo, X.; Jiang, J. Pan-GAN: An unsupervised pan-sharpening method for remote sensing image fusion. Inf. Fusion 2020, 62, 110–120.
  64. Lolli, S.; Alparone, L.; Garzelli, A.; Vivone, G. Haze Correction for Contrast-Based Multispectral Pansharpening. IEEE Geosci. Remote Sens. Lett. 2017, 14, 2255–2259.
  65. Vivone, G. Robust Band-Dependent Spatial-Detail Approaches for Panchromatic Sharpening. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6421–6433.
  66. Restaino, R.; Dalla Mura, M.; Vivone, G.; Chanussot, J. Context-adaptive Pansharpening Based on Image Segmentation. IEEE Trans. Geosci. Remote Sens. 2017, 55, 753–766.
  67. Schowengerdt, R.A. Remote Sensing: Models and Methods for Image Processing, 3rd ed.; Elsevier: Orlando, FL, USA, 2007.
  68. Vivone, G.; Addesso, P.; Restaino, R.; Dalla Mura, M.; Chanussot, J. Pansharpening based on deconvolution for multiband filter estimation. IEEE Trans. Geosci. Remote Sens. 2019, 57, 540–553.
  69. Liu, J.G. Smoothing filter based intensity modulation: A spectral preserve image fusion technique for improving spatial details. Int. J. Remote Sens. 2000, 21, 3461–3472.
  70. Alparone, L.; Wald, L.; Chanussot, J.; Thomas, C.; Gamba, P.; Bruce, L.M. Comparison of pansharpening algorithms: Outcome of the 2006 GRS-S data fusion contest. IEEE Trans. Geosci. Remote Sens. 2007, 45, 3012–3021.
  71. Garzelli, A. Pansharpening of Multispectral Images Based on Nonlocal Parameter Optimization. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2096–2107.
  72. Baronti, S.; Aiazzi, B.; Selva, M.; Garzelli, A.; Alparone, L. A theoretical analysis of the effects of aliasing and misregistration on pansharpened imagery. IEEE J. Sel. Top. Signal Process. 2011, 5, 446–453.
  73. Jain, A.K. Fundamentals of Digital Image Processing; Prentice Hall: Englewood Cliffs, NJ, USA, 1989.
  74. Tikhonov, A.; Arsenin, V. Solutions of Ill-Posed Problems; Winston: New York, NY, USA, 1977.
  75. Reeves, S. Fast image restoration without boundary artifacts. IEEE Trans. Image Process. 2005, 14, 1448–1453.
  76. Strang, G.; Nguyen, T. Wavelets and Filter Banks, 2nd ed.; Wellesley Cambridge Press: Wellesley, MA, USA, 1996.
  77. Garzelli, A.; Nencini, F.; Capobianco, L. Optimal MMSE Pan sharpening of very high resolution multispectral images. IEEE Trans. Geosci. Remote Sens. 2008, 46, 228–236.
  78. Gillespie, A.R.; Kahle, A.B.; Walker, R.E. Color enhancement of highly correlated images-II. Channel ratio and "Chromaticity" Transform techniques. Remote Sens. Environ. 1987, 22, 343–365.
  79. Draper, N.; Smith, H. Applied Regression Analysis; John Wiley & Sons: New York, NY, USA, 1998.
  80. Restaino, R.; Vivone, G.; Dalla Mura, M.; Chanussot, J. Fusion of Multispectral and Panchromatic Images Based on Morphological Operators. IEEE Trans. Image Process. 2016, 25, 2882–2895.
  81. Wald, L.; Ranchin, T.; Mangolini, M. Fusion of satellite images of different spatial resolutions: Assessing the quality of resulting images. Photogramm. Eng. Remote Sens. 1997, 63, 691–699.
  82. Wald, L. Data Fusion: Definitions and Architectures—Fusion of Images of Different Spatial Resolutions; Les Presses de l'École des Mines: Paris, France, 2002.
  83. Yuhas, R.H.; Goetz, A.F.H.; Boardman, J.W. Discrimination among semi-arid landscape endmembers using the Spectral Angle Mapper (SAM) algorithm. In Proceedings of the Summaries 3rd Annu. JPL Airborne Geoscience Workshop, Pasadena, CA, USA, 1–5 June 1992; pp. 147–149.
  84. Alparone, L.; Baronti, S.; Garzelli, A.; Nencini, F. A global quality measurement of pan-sharpened multispectral imagery. IEEE Geosci. Remote Sens. Lett. 2004, 1, 313–317.
  85. Garzelli, A.; Nencini, F. Hypercomplex quality assessment of multi/hyperspectral images. IEEE Trans. Geosci. Remote Sens. 2009, 6, 662–665.
  86. Wang, Z.; Bovik, A.C. A universal image quality index. IEEE Signal Process. Lett. 2002, 9, 81–84.
  87. Zhou, J.; Civco, D.L.; Silander, J.A. A wavelet transform method to merge Landsat TM and SPOT panchromatic data. Int. J. Remote Sens. 1998, 19, 743–757.
  88. Alparone, L.; Aiazzi, B.; Baronti, S.; Garzelli, A.; Nencini, F.; Selva, M. Multispectral and panchromatic data fusion assessment without reference. Photogramm. Eng. Remote Sens. 2008, 74, 193–200.
  89. Khan, M.M.; Alparone, L.; Chanussot, J. Pansharpening quality assessment using the modulation transfer functions of instruments. IEEE Trans. Geosci. Remote Sens. 2009, 11, 3880–3891.
  90. Carla, R.; Santurri, L.; Aiazzi, B.; Baronti, S. Full-Scale Assessment of Pansharpening Through Polynomial Fitting of Multiscale Measurements. IEEE Trans. Geosci. Remote Sens. 2015, 53, 6344–6355. [Google Scholar] [CrossRef]
  91. Kwan, C.; Budavari, B.; Bovik, A.C.; Marchisio, G. Blind Quality Assessment of Fused WorldView-3 Images by Using the Combinations of Pansharpening and Hypersharpening Paradigms. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1835–1839. [Google Scholar] [CrossRef]
  92. Vivone, G.; Addesso, P.; Chanussot, J. A Combiner-Based Full Resolution Quality Assessment Index for Pansharpening. IEEE Geosci. Remote Sens. Lett. 2019, 16, 437–441. [Google Scholar] [CrossRef]
  93. Aiazzi, B.; Alparone, L.; Baronti, S.; Carlà, R.; Garzelli, A.; Santurri, L. Full scale assessment of pansharpening methods and data products. In Image and Signal Processing for Remote Sensing XX; SPIE: Bellingham, WA, USA, 2014; p. 924402. [Google Scholar]
  94. Loncan, L.; Fabre, S.; Almeida, L.B.; Bioucas-Dias, J.M.; Wenzhi, L.; Briottet, X.; Licciardi, G.A.; Chanussot, J.; Simoes, M.; Dobigeon, N.; et al. Hyperspectral Pansharpening: A Review. IEEE Geosci. Remote Sens. Mag. 2015, 3, 27–46. [Google Scholar] [CrossRef] [Green Version]
  95. Vivone, G.; Restaino, R.; Licciardi, G.; Dalla Mura, M.; Chanussot, J. MultiResolution Analysis and Component Substitution Techniques for Hyperspectral Pansharpening. In Proceedings of the 2014 IEEE Geoscience and Remote Sensing Symposium, Quebec City, QC, Canada, 13–18 July 2014; pp. 2649–2652. [Google Scholar]
  96. Picaro, G.; Addesso, P.; Restaino, R.; Vivone, G.; Picone, D.; Dalla Mura, M. Thermal sharpening of VIIRS data. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 7260–7263. [Google Scholar] [CrossRef]
Figure 1. An example of pansharpening: (a) MS (interpolated) image; (b) PAN image; (c) fused image (obtained via the MBFE-BDSD-MLR method presented in Section 5).
Figure 2. Block scheme of the pansharpening framework.
Figure 3. Reduced resolution datasets: China dataset (on the left), Tripoli dataset (on the right). The first row reports the original MS image (used as the ground truth) and the second row reports the degraded MS image, upsampled to the ground-truth size.
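For context, the pair shown in Figure 3 follows the usual Wald-protocol construction at reduced resolution: the original MS image serves as the ground truth, while a low-pass-filtered, decimated and re-interpolated copy is what the algorithms actually fuse. The following is a minimal sketch of that construction, not the paper's exact pipeline; the Gaussian blur is only a stand-in for the MTF-matched sensor filter, and the value of sigma is an assumption.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, zoom

def reduced_resolution_ms(ms, ratio=4, sigma=2.0):
    """Build the Figure 3 pair: original MS as ground truth and a blurred,
    decimated and re-interpolated copy as the fusion input.

    ms: float array of shape (H, W, B); sigma is an assumed stand-in for the
    MTF-matched low-pass filter of the MS sensor.
    """
    blurred = gaussian_filter(ms.astype(float), sigma=(sigma, sigma, 0))
    decimated = blurred[::ratio, ::ratio, :]                 # downsample by `ratio`
    upsampled = zoom(decimated, (ratio, ratio, 1), order=3)  # back to the GT size
    return ms, upsampled
```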
Figure 4. Close-ups of the fused results using the reduced resolution China dataset: (a) GT; (b) GSA; (c) MF-HG; (d) MBFE-BDSD-HPM; (e) MBFE-GSA-HPM; (f) FE-HPM; (g) GLP-HPM; (h) MBFE-BDSD-CBD; (i) MBFE-GSA-CBD; (j) FE-CBD; (k) GLP-CBD; (l) MBFE-BDSD-MLR; (m) MBFE-GSA-MLR; (n) FE-MLR; (o) GLP-MLR.
Figure 5. Close-ups of the fused results using the reduced resolution Tripoli dataset: (a) GT; (b) GSA; (c) MF-HG; (d) MBFE-BDSD-HPM; (e) MBFE-GSA-HPM; (f) FE-HPM; (g) GLP-HPM; (h) MBFE-BDSD-CBD; (i) MBFE-GSA-CBD; (j) FE-CBD; (k) GLP-CBD; (l) MBFE-BDSD-MLR; (m) MBFE-GSA-MLR; (n) FE-MLR; (o) GLP-MLR.
Figure 6. Tripoli dataset (RR): differences between the Q2^n maps computed for the best algorithm, i.e., MBFE-GSA-MLR, and the baseline methods, i.e., (a) GSA, (b) GLP-MLR, (c) GLP-CBD, (d) MBFE-GSA-CBD, (e) GLP-HPM and (f) MBFE-GSA-HPM. Green values: better results obtained by MBFE-GSA-MLR; red values: better results obtained by the other algorithm.
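Maps of the Figure 6 kind can be reproduced, in simplified form, by computing a blockwise quality index for each fused product against the ground truth and subtracting the two maps. The sketch below uses the single-band universal image quality index as a stand-in for the hypercomplex Q2^n actually employed in the paper; the function name and block size are illustrative.

```python
import numpy as np

def block_uqi(ref, test, block=32):
    """Blockwise universal image quality index (a single-band, non-overlapping
    stand-in for the hypercomplex Q2^n used in the paper)."""
    qmap = np.zeros((ref.shape[0] // block, ref.shape[1] // block))
    for i in range(qmap.shape[0]):
        for j in range(qmap.shape[1]):
            r = ref[i*block:(i+1)*block, j*block:(j+1)*block].ravel()
            t = test[i*block:(i+1)*block, j*block:(j+1)*block].ravel()
            cov = np.cov(r, t)  # 2x2 covariance matrix of the two blocks
            num = 4 * cov[0, 1] * r.mean() * t.mean()
            den = (cov[0, 0] + cov[1, 1]) * (r.mean()**2 + t.mean()**2)
            qmap[i, j] = num / den if den else 0.0
    return qmap

# Figure 6-style map: positive where method A is locally better than method B
# diff_map = block_uqi(gt_band, fused_a_band) - block_uqi(gt_band, fused_b_band)
```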
Figure 7. Close-ups of the details of the fused results using the full resolution Tripoli dataset: (a) PAN; (b) EXP; (c) details for GSA; (d) details for MF-HG; (e) details for MBFE-BDSD-HPM; (f) details for MBFE-GSA-HPM; (g) details for FE-HPM; (h) details for GLP-HPM; (i) details for MBFE-BDSD-CBD; (j) details for MBFE-GSA-CBD; (k) details for FE-CBD; (l) details for GLP-CBD; (m) details for MBFE-BDSD-MLR; (n) details for MBFE-GSA-MLR; (o) details for FE-MLR; (p) details for GLP-MLR.
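The "details" panels in Figure 7 visualize the high-frequency layer that each method injects into the MS bands. As a rough illustration of how such a layer is obtained in MRA-type schemes, the sketch below subtracts an MTF-matched low-pass version from the PAN; the Gaussian model and the Nyquist-gain value are assumptions for illustration, not the filters estimated from data by the proposed method.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def pan_details(pan, ratio=4, nyquist_gain=0.29):
    """High-frequency PAN layer: PAN minus an MTF-matched low-pass version.

    A Gaussian approximates the sensor MTF; `nyquist_gain` is an assumed
    amplitude of the MTF at the Nyquist frequency of the MS grid.
    """
    # std of a Gaussian whose frequency response equals `nyquist_gain` at the
    # Nyquist frequency of the MS sampling grid (ratio-times coarser than PAN)
    sigma = ratio * np.sqrt(-2.0 * np.log(nyquist_gain)) / np.pi
    return pan - gaussian_filter(pan.astype(float), sigma)
```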
Table 1. Reduced resolution assessment: the first row contains the reference value for each index. The best results among the different versions of the MTF-GLP approaches are in boldface; the second best are underlined.
                     |              China               |             Tripoli
Algorithm            | PSNR    ERGAS   SAM     Q4       | PSNR    ERGAS   SAM     Q8
---------------------+----------------------------------+---------------------------------
Reference            | ∞       0       0       1        | ∞       0       0       1
EXP                  | 36.012  3.8736  4.4268  0.7389   | 20.408  9.2964  8.6679  0.6197
NL-IHS               | 38.115  3.2280  4.0268  0.7968   | 22.685  7.2531  8.8094  0.7719
GS                   | 39.833  2.8310  3.5399  0.8475   | 23.554  6.5690  8.3871  0.8166
GSA                  | 40.711  2.5829  3.0053  0.8756   | 26.597  4.7677  7.5888  0.9220
BDSD                 | 41.003  2.4361  2.9272  0.8884   | 25.645  5.2490  8.1877  0.9133
SFIM                 | 39.904  2.6076  3.2165  0.8731   | 24.381  5.9649  8.1231  0.8585
ATWT                 | 40.270  2.5483  3.0916  0.8793   | 24.769  5.7169  7.8750  0.8750
MF-HG                | 40.085  2.6472  3.1053  0.8669   | 24.880  5.6423  7.9807  0.8874
HPM: MBFE-BDSD       | 41.010  2.4904  2.9926  0.8824   | 25.181  5.4896  8.0262  0.8899
HPM: MBFE-GSA        | 40.973  2.5100  3.0095  0.8816   | 25.234  5.4580  7.9807  0.8887
HPM: FE              | 41.011  2.4913  2.9871  0.8820   | 25.222  5.4672  7.9785  0.8882
HPM: GLP             | 40.993  2.4834  2.9985  0.8825   | 25.147  5.5124  7.9880  0.8839
CBD: MBFE-BDSD       | 40.914  2.5495  3.0055  0.8776   | 26.298  4.9096  7.9474  0.9224
CBD: MBFE-GSA        | 40.866  2.5754  3.0209  0.8762   | 26.641  4.7440  7.6183  0.9248
CBD: FE              | 40.914  2.5495  3.0036  0.8773   | 26.627  4.7530  7.6086  0.9247
CBD: GLP             | 40.936  2.5311  2.9709  0.8784   | 26.614  4.7546  7.5353  0.9211
MLR: MBFE-BDSD       | 41.179  2.4149  2.9249  0.8885   | 26.305  4.9091  7.9526  0.9235
MLR: MBFE-GSA        | 41.158  2.4304  2.9179  0.8877   | 26.687  4.7251  7.5565  0.9263
MLR: FE              | 41.177  2.4145  2.9255  0.8882   | 26.670  4.7358  7.5441  0.9262
MLR: GLP             | 41.220  2.4006  2.8589  0.8876   | 26.660  4.7341  7.4512  0.9222
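For reference, two of the reduced-resolution indices tabulated above can be computed as in the minimal sketch below; it assumes (H, W, B) NumPy arrays, a resolution ratio of 4 (as for IKONOS and WorldView-3), and illustrative function names rather than any released toolbox.

```python
import numpy as np

def sam(gt, fused, eps=1e-12):
    """Mean Spectral Angle Mapper, in degrees, for (H, W, B) images."""
    dot = np.sum(gt * fused, axis=-1)
    norms = np.linalg.norm(gt, axis=-1) * np.linalg.norm(fused, axis=-1)
    angle = np.arccos(np.clip(dot / (norms + eps), -1.0, 1.0))
    return float(np.degrees(angle).mean())

def ergas(gt, fused, ratio=4):
    """ERGAS (relative dimensionless global error in synthesis)."""
    rmse = np.sqrt(((gt - fused) ** 2).mean(axis=(0, 1)))  # per-band RMSE
    mu = gt.mean(axis=(0, 1))                              # per-band mean
    return float(100.0 / ratio * np.sqrt(np.mean((rmse / mu) ** 2)))
```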
Table 2. Full resolution assessment on the Tripoli dataset: the first row contains the reference value for each indicator. The best results among the three tested injection schemes (HPM, CBD and MLR) are in boldface; the second best are underlined.
Algorithm            |  D_S     D_λ     HQNR
---------------------+------------------------
Reference            |  0       0       1
EXP                  |  0.0717  0.0317  0.8989
NL-IHS               |  0.0602  0.0795  0.8651
GS                   |  0.0767  0.0536  0.8738
GSA                  |  0.0775  0.0377  0.8877
BDSD                 |  0.0755  0.1399  0.7952
SFIM                 |  0.0657  0.0216  0.9141
ATWT                 |  0.0654  0.0174  0.9183
MF-HG                |  0.0591  0.0183  0.9236
HPM: MBFE-BDSD       |  0.0650  0.0181  0.9181
HPM: MBFE-GSA        |  0.0667  0.0188  0.9157
HPM: FE              |  0.0679  0.0179  0.9155
HPM: GLP             |  0.0684  0.0183  0.9146
CBD: MBFE-BDSD       |  0.0637  0.0169  0.9205
CBD: MBFE-GSA        |  0.0661  0.0178  0.9173
CBD: FE              |  0.0674  0.0167  0.9171
CBD: GLP             |  0.0697  0.0171  0.9144
MLR: MBFE-BDSD       |  0.0590  0.0196  0.9226
MLR: MBFE-GSA        |  0.0626  0.0199  0.9188
MLR: FE              |  0.0634  0.0188  0.9190
MLR: GLP             |  0.0698  0.0158  0.9156
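The HQNR column is consistent with the customary multiplicative combination of the two distortions with unit exponents, as the small check below shows against the EXP row; the helper name is illustrative, a sketch rather than the evaluation code used in the paper.

```python
def hqnr(d_lambda, d_s, alpha=1.0, beta=1.0):
    """Combine spectral (D_lambda) and spatial (D_S) distortions; ideal = 1."""
    return (1.0 - d_lambda) ** alpha * (1.0 - d_s) ** beta

# Check against the EXP row of Table 2:
# (1 - 0.0317) * (1 - 0.0717) = 0.8989 (to four decimals)
print(round(hqnr(0.0317, 0.0717), 4))  # -> 0.8989
```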
Table 3. Computational times (in seconds) required by each algorithm on the datasets used.
Algorithm            | China (RR) | Tripoli (RR) | Tripoli (FR)
---------------------+------------+--------------+-------------
NL-IHS               |   1.104    |    4.049     |    74.64
GS                   |   0.0409   |    0.127     |     1.79
GSA                  |   0.0970   |    0.253     |     2.73
BDSD                 |   0.115    |    0.207     |     1.51
SFIM                 |   0.0223   |    0.102     |     1.35
ATWT                 |   0.117    |    0.570     |     8.76
MF-HG                |   0.135    |    0.303     |     3.62
HPM: MBFE-BDSD       |   0.298    |    0.929     |    14.72
HPM: MBFE-GSA        |   0.283    |    0.979     |    15.40
HPM: FE              |   0.147    |    0.427     |     6.70
HPM: GLP             |   0.123    |    0.412     |     5.56
CBD: MBFE-BDSD       |   0.314    |    0.949     |    14.60
CBD: MBFE-GSA        |   0.307    |    1.013     |    15.47
CBD: FE              |   0.170    |    0.429     |     6.65
CBD: GLP             |   0.123    |    0.411     |     5.45
MLR: MBFE-BDSD       |   0.298    |    2.058     |    15.10
MLR: MBFE-GSA        |   0.306    |    2.132     |    15.88
MLR: FE              |   0.164    |    1.127     |     7.08
MLR: GLP             |   0.154    |    1.103     |     7.07
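Timings of this kind depend heavily on hardware and implementation. A minimal sketch of how such per-algorithm wall-clock measurements can be collected follows; fuse_fn is a placeholder for any of the tested routines, not part of any released toolbox.

```python
import time

def time_fusion(fuse_fn, ms, pan, repeats=5):
    """Best-of-n wall-clock time (seconds) for one pansharpening routine."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        fuse_fn(ms, pan)  # placeholder for any of the tested algorithms
        timings.append(time.perf_counter() - start)
    return min(timings)
```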