Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data

Johnson, Robert W.

doi:10.3390/axioms2030286

Open AccessArticle

Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data

by

Robert W. Johnson

Alphawave Research, 29 Stanebrook Court, Jonesboro, GA 30238, USA

Axioms 2013, 2(3), 286-310; https://doi.org/10.3390/axioms2030286

Submission received: 24 April 2013 / Revised: 20 May 2013 / Accepted: 21 May 2013 / Published: 24 June 2013

(This article belongs to the Special Issue Wavelets and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

The properties of the Gabor and Morlet transforms are examined with respect to the Fourier analysis of discretely sampled data. Forward and inverse transform pairs based on a fixed window with uniform sampling of the frequency axis can satisfy numerically the energy and reconstruction theorems; however, transform pairs based on a variable window or nonuniform frequency sampling in general do not. Instead of selecting the shape of the window as some function of the central frequency, we propose constructing a single window with unit energy from an arbitrary set of windows that is applied over the entire frequency axis. By virtue of using a fixed window with uniform frequency sampling, such a transform satisfies the energy and reconstruction theorems. The shape of the window can be tailored to meet the requirements of the investigator in terms of time/frequency resolution. The algorithm extends naturally to the case of nonuniform signal sampling without modification beyond identification of the Nyquist interval.

Keywords:

Fourier transform; Gabor transform; Morlet transform; multiresolution analysis

1. Introduction

The primary criticism leveled at the use of the continuous wavelet transform for the spectral analysis of discretely sampled data is that it fails to give quantitatively meaningful results. The power spectral density produced from the convolution of a wavelet basis and a discrete signal gives a qualitative picture of the temporal variation of its frequency content; however, when one attempts the reconstruction of the signal, the residual is not on the order of the precision of one’s computational device. Likewise, the integrated margins of the power spectral density do not precisely equal the energy of the original signal. In other words, the discrete implementation of the continuous wavelet transform and its inverse do not satisfy the energy and reconstruction theorems of spectral analysis. The goal of this investigation is to devise a multiresolution analysis that does satisfy those theorems.

Our insistence upon the satisfaction of the energy and reconstruction theorems is because they are first principle requirements related to the conservation of physical energy. When suitably defined, energy is not to be created nor destroyed; neither should it get lost in the shuffle. There is a deep relation between energy content and information content, as quantum mechanics teaches us, thus a loose grip on one implies a loose grip on the other. On a more practical level, the satisfaction of those theorems is among the requirements for a maximum entropy spectral analysis of data that includes the effects of measurement uncertainty, whose consideration is beyond the scope of this article.

The review by Torrence and Compo [1] remains a popular resource for practitioners of wavelet analysis. It relies on the method by Farge [2] for the reconstruction of the data signal. Grossmann and Morlet [3] are credited with establishing the reconstruction theorem in the continuum. Meyer [4] and Mallat [5] developed the theory of multiresolution analysis, and Daubechies [6] constructed the first orthonormal basis with compact support, leading to the implementation of the dyadic wavelet transform in terms of finite impulse response digital filter banks [7]. Some interesting applications of the continuous wavelet transform for the spectral analysis of data can be found in references [8,9,10,11,12,13,14,15,16,17,18,19].

This paper is organized as follows. First, we will quickly review the theory of the continuous Fourier transform and its discrete implementation. Next, we will look at the Gabor transform and its relation to the wavelet transform using the Morlet basis. We will then propose an algorithm for a layered window transform whose spectral density is similar to that of the Morlet transform yet which satisfies the energy and reconstruction theorems. Following that, we will demonstrate that the algorithm works unaltered for data with irregular sampling once the corresponding Nyquist interval is identified. Finally, we will look at how the selection of the window affects the time/frequency resolution of the transform. We will conclude with a brief summary and suggestions for applications.

Some notations used throughout this paper are explained here. Scalars are written as s, while vectors and matrices are written as

v

and

M

respectively and may be defined in terms of their components, e.g.,

v \equiv v_{s} = v (s)

. The transpose of a vector or matrix is indicated by the superscript

v^{T}

, and the conjugate transpose by

M^{†}

, using the standard rules for matrix multiplication. Inner products may be written in bra-ket notation as

{〈 a ∣ b 〉}_{s}

, and matrix entries as the ket-bra

∣ a 〉 〈 b ∣

. Expectation values are notated as

{〈 v (s) 〉}_{s}

. Sets will be indicated by their boundaries

[a, b]

, and whether continuous or discrete must be derived from context. The operation of rounding up, i.e., taking the next greatest integer, is denoted by

⌈ a ⌉

, and when the notation

a \to b

is encountered, it is understood to mean “a is replaced by b”. The programs used for this analysis are available online [20].

2. Continuous and Discrete Fourier Transforms

Let us begin by considering the Fourier transform in the continuum over axes of time and frequency. Suppose we have some signal

y (t)

, possibly complex valued, of finite energy

E_{y} \equiv {〈 y ∣ y 〉}_{t} = \int_{- \infty}^{\infty} y^{*} (t) y (t) d t < \infty

. If the signal carries units of

u_{y}

, then the signal energy has units of

u_{E} \equiv u_{y}^{2} u_{t}

. The units of signal energy are proportional to those of physical energy in joules by a factor of the load impedance

E_{J} = E_{y} / Z_{L}

; for example, if the signal has units of volts

u_{y} = V

and time is measured in seconds

u_{t} = s

so that

u_{E} = V^{2} s

, then accounting for the load impedance in ohms

u_{Z} = Ω

gives physical units of

V^{2} s / Ω = {(m l^{2} / t^{2} q)}^{2} t / (m l^{2} / t q^{2})

in terms of mass, length, time, and charge, which is equivalent to joules. The units of frequency are reciprocal to those of time

u_{f} = u_{t}^{- 1}

, and the term “power” is understood to mean “energy density” and must be qualified by the domain for its distribution. The squared modulus

{| y (t) |}^{2} \equiv P (t)

can thus be identified as the temporal power of the signal.

Under suitable conditions not elaborated here, the Fourier integral and its inverse,

\hat{y} (f) \equiv \int_{- \infty}^{\infty} exp (- i 2 π f t) y (t) d t, \hat{\hat{y}} (t) \equiv \int_{- \infty}^{\infty} exp (i 2 π f t) \hat{y} (f) d f

(1)

define an integral transform pair satisfying the Plancherel energy theorem

\int_{- \infty}^{\infty} {| y (t) |}^{2} d t = \int_{- \infty}^{\infty} {| \hat{y} (f) |}^{2} d f

and the Fourier inversion theorem

\hat{\hat{y}} (t) = y (t)

. The representation over the frequency axis

\hat{y} (f)

carries units of

u_{\hat{y}} = u_{y} u_{t}

so that the squared modulus

{| \hat{y} (f) |}^{2} \equiv P (f)

gives the spectral power of the signal, and the spectral energy is

E_{\hat{y}} \equiv {〈 \hat{y} ∣ \hat{y} 〉}_{f} = \int_{- \infty}^{\infty} {\hat{y}}^{*} (f) \hat{y} (f) d f

. When the signal is of finite duration

t \in [0, T]

, the Fourier transform’s response to a component sinusoid

y (t) = exp (i 2 π f^{'} t)

is

\hat{y} (f) = \int_{0}^{T} exp [- i 2 π (f - f^{'}) t] d t = exp (- i π f_{Δ} T) \frac{sin (π f_{Δ} T)}{π f_{Δ}}

(2)

expressed in terms of the frequency offset

f_{Δ} \equiv f - f^{'}

, thus the spectral power of the uniform window function

Φ_{T} (t) = T^{- 1 / 2}

normalized to unit energy describes the leakage in the frequency spectrum induced by the finite duration of the signal,

L_{T} (f_{Δ}) d f_{Δ} \equiv {| {\hat{Φ}}_{T} (f_{Δ}) |}^{2} d f_{Δ} = T^{- 1} {[\frac{sin (π f_{Δ} T)}{π f_{Δ}}]}^{2} d f_{Δ}

(3)

with domain

f_{Δ} \in [- \infty, \infty]

. The continuum leakage function is not periodic in

f_{Δ}

, as its magnitude decays according to

f_{Δ}^{- 2}

.

With regard to practical data analysis, let us suppose now that the signal is given in terms of discrete samples in time with uniform duration and spacing. (Those requirements will be relaxed in Section 5.) The time axis can then be described in terms of integers

t \in [1, T]

with unit

u_{t} \equiv Δ_{t}

given by the sample rate, allowing the signal to be written as a vector

y \equiv y_{t} = y (t)

. The signal energy can be expressed in terms of matrix multiplication as

E_{y} = y^{†} D_{t} y

, where the temporal metric

D_{t} = I_{T} Δ_{t}

is proportional to the identity matrix of order T. The effect of frequency aliasing induced by the uniform sampling is often misunderstood. If we evaluate the Fourier transform of the discrete window with unit energy,

{\hat{Φ}}_{T} (f_{Δ}) = T^{- 1 / 2} \sum_{t = 1}^{T} exp (- i 2 π f_{Δ} t) Δ_{t} = T^{- 1 / 2} exp [- i π f_{Δ} (T + 1)] \frac{sin (π f_{Δ} T)}{sin (π f_{Δ})} Δ_{t}

(4)

we find that the leakage function becomes periodic in

f_{Δ}

,

L_{T} (f_{Δ}) d f_{Δ} \to T^{- 1} {[\frac{sin (π f_{Δ} T)}{sin (π f_{Δ})}]}^{2} Δ_{t}^{2} d f_{Δ}

(5)

with period 1 in units of

u_{f} = Δ_{t}^{- 1}

. The principle branch is usually chosen as

f \in [- 1 / 2, 1 / 2] Δ_{t}^{- 1}

, but one should always respect the periodic nature of the spectrum. The normalization of the spectral power likewise is defined over one period of the frequency axis,

\sum_{t = 1}^{T} {| y (t) |}^{2} Δ_{t} = \int_{- 1 / 2}^{1 / 2} {| \hat{y} (f) |}^{2} d f

. In Figure 1 we compare the continuum and discrete leakage functions for a window with duration

T = 5

, where we see the expected oscillatory lobes as well as the periodicity induced by the discrete sampling in time.

Figure 1. Comparison of the continuous time leakage function in (a) to its discrete time counterpart in (b) for a uniform window of duration

T = 5

over a continuous frequency axis.

Figure 1. Comparison of the continuous time leakage function in (a) to its discrete time counterpart in (b) for a uniform window of duration

T = 5

over a continuous frequency axis.

Now let us consider the discretization of the frequency axis into uniform bins over its principle branch

f \in [- f_{c}, f_{c}]

, where

f_{c} \equiv {(2 Δ_{t})}^{- 1}

is the Nyquist critical frequency. To fully specify the frequency metric

D_{f}

, one must state its order N, equal to the number of positive frequencies

\leq f_{c}

, as well as its parity

P \in {0, 1}

, indicating whether an even or odd number of bins

N^{'} = 2 N + P

is used to span the domain. When

P = 1

, the bins on the edges corresponding to frequencies

\pm f_{c}

have a bin width equal to one half that of the others, so that the limits of integration are respected; because of aliasing, the integrand will have the same value at those locations, so that only one full bin’s contribution is counted. This convention differs from that usually given for the discrete Fourier transform [21], where the factor of 1/2 is absorbed by the coefficients rather than the metric. The order N specifies the spacing of the frequencies as

Δ_{f} = {(2 N Δ_{t})}^{- 1}

, and the metric can be written as

D_{f} / Δ_{f} = I_{N^{'}} - (∣ 1 〉 〈 1 ∣ + ∣ N^{'} 〉 〈 N^{'} ∣) P / 2

so that

Tr D_{f} = Δ_{t}^{- 1}

. The central frequencies of the bins can be expressed as

f_{n} = [n - (N^{'} + 1) / 2] Δ_{f}

for integers

n \in [1, N^{'}]

. Figure 2 compares the even and odd discretizations of order

N = 4

. While the odd parity discretization is perhaps more familiar, for many of our purposes the even discretization will prove more convenient.

Figure 2. Comparison of the even (a) and odd (b) frequency discretizations of order

N = 4

with the central frequencies indicated by ×.

Figure 2. Comparison of the even (a) and odd (b) frequency discretizations of order

N = 4

with the central frequencies indicated by ×.

We are finally ready to define the two-sided discrete Fourier transform (of order N and parity P) of a discretely sampled signal (of duration T). If we collect our Fourier basis functions into the form of a matrix,

Θ \equiv Θ (t, n) = exp (i 2 π f_{n} t)

, then we can easily write the discrete transform pair in terms of matrix multiplication,

\hat{y} \equiv Θ^{†} D_{t} y

and

\hat{\hat{y}} \equiv Θ D_{f} \hat{y}

, as well as the spectral energy

E_{\hat{y}} = {\hat{y}}^{†} D_{f} \hat{y}

. In component notation, one has

\hat{y} (n) \equiv \sum_{t = 1}^{T} exp (- i 2 π f_{n} t) y (t) Δ_{t}, \hat{\hat{y}} (t) \equiv \sum_{n = 1}^{N^{'}} exp (i 2 π f_{n} t) \hat{y} (n) Δ_{f}

(6)

where

Δ_{f}

is understood to account for the edge bins if

P = 1

. If the signal

y (t)

is real, so that

\hat{y} (- f) = \hat{y} {(f)}^{*}

, then one may define the one-sided transform, retaining only the nonnegative portion of the frequency axis with

N^{'} \to N + P

bins and

f_{n} \to [n - (P + 1) / 2] Δ_{f}

, by renormalizing the basis functions to twice the energy via

Θ \to \sqrt{2} Θ

and letting

\hat{\hat{y}} (t) \to Re \hat{\hat{y}} (t)

. Hereafter, we will assume

y (t)

is real so that we can focus our attention on the one-sided spectrum.

In the fully discrete setting, the satisfaction of the energy and reconstruction theorems is dependent upon there being a sufficient number of degrees of freedom (DOF) in the basis functions to fully represent the information content of the signal. For real

y (t)

, the number of DOF is equal to the signal duration T. For the one-sided transform, whether of even or odd parity, the number of DOF is equal to

2 N

, as each frequency’s coefficient has both an amplitude and a phase,

{\hat{y}}_{n} = A_{n} exp (i ω_{n})

, except for the cases

f_{n} = 0

or

f_{c}

that have only an amplitude. The critical frequency

f_{c}

is identified as the lowest positive frequency whose basis function is entirely real over the discrete time axis, an observation that will prove useful when we consider the case of irregularly sampled data. The minimal order beyond which the energy and reconstruction theorems are satisfied can be evaluated as

N_{\min} = ⌈ T / 2 ⌉

for a real signal (and

N_{\min} = T

for a complex one). That condition is realized (for even T) when

Θ^{†} Θ Δ_{t} = I_{N^{'}} T

, indicating that the basis functions Θ form an orthogonal set for

N = T / 2

. However, orthogonality over the discrete time axis is not a requirement, as

y^{†} D_{t} y = {\hat{y}}^{†} D_{f} \hat{y}

and

y = \hat{\hat{y}}

exactly (meaning to the precision of one’s computational device) for any

N \geq N_{\min}

and either parity P.

Lastly, let us look at what happens when one tries to express the fully discrete, one-sided Fourier transform over an axis of period (or scale) s rather than frequency f. In the continuum, the relation between the axes

f = s^{- 1}

yields the Jacobian

| d f / d s | = s^{- 2}

, thus the spectral power over scale

P (s) \equiv {| \hat{y} (s) |}^{2}

is equal to the spectral power over frequency

P (f)

multiplied by the Jacobian,

P (s) = f^{2} P (f)

, otherwise expressed as

\hat{y} (s) = f \hat{y} (f)

. In the discrete setting, however, one must be explicit with the mapping of the bin boundaries so that

\sum_{n} {| \hat{y} (f_{n}) |}^{2} Δ_{f} = \sum_{n} {| \hat{y} (s_{n}) |}^{2} Δ_{s}

. Let

Δ_{f} = b - a

be the width of one frequency bin centered on

f_{n} = (a + b) / 2

, which gets mapped to the scale bin

Δ_{s} = a^{- 1} - b^{- 1}

such that

Δ_{f} / Δ_{s} = a b

and

P (s_{n}) = a b P (f_{n})

. One may very well ask, in what sense is

s_{n} = f_{n}^{- 1}

the center of the scale bin? The value

f_{n}

is both the mean and the median of the frequency bin with uniform measure

p^{f} = Δ_{f}^{- 1}

, but the mean period over that bin is

{〈 f^{- 1} 〉}_{f} = log (b / a) Δ_{f}^{- 1}

. The value

s_{n}

is recognized as the median of the scale bin with measure

p^{s} = s^{- 2} Δ_{f}^{- 1}

, such that

\int_{1 / a}^{2 / (a + b)} p^{s} d s = 1 / 2

. The edges for odd parity are handled similarly with respect to the half-width bins. For either parity, the bin whose lower boundary is

a = 0

is the one that causes trouble, as

Δ_{s} = \infty

in that case. One can contrive to neglect that bin for odd parity by subtracting the mean of the signal

y (t) \to y (t) - {〈 y (t) 〉}_{t}

, but that procedure does not repair the problem for even parity.

To illustrate the difficulty, in Figure 3 we display the mapping of the spectral power for a signal with duration

T = 20

, a sum of two sinusoids at frequencies 1/40 and 1/4 normalized to unit energy

E_{y} = 1

. In panels (a) and (b) we show the mapping for the one-sided transform of order

N = 20

and odd parity over domain

f \in [0, f_{c}]

. While the higher frequency’s contribution remains apparent, that of the lower frequency has been washed away by the measure factor. The DOF carried by the lowest frequency bin are unrecoverable from the mapping over scale on account of the infinite bin width. In panels (c) and (d) we repeat the procedure but for domain

f \in [f_{c}, 2 f_{c}]

. This time, all the DOF are properly mapped, thus the energy and reconstruction theorems are satisfied. In Table 1 we give some numerical results from the evaluation shown in the figure. In short, there is nothing to be gained by working on the scale axis rather than frequency other than a headache in dealing with the effect of aliasing, as one requires the same set of basis functions Θ and minimal order

N_{\min}

to satisfy the fundamental theorems of spectral analysis in either case.

Figure 3. Comparison of the mapping over odd parity axes of frequency f and scale

s = f^{- 1}

of the discrete spectral power of a signal with two sinusoidal components for

f \in [0, f_{c}]

in (a) and (b) and for

f \in [f_{c}, 2 f_{c}]

in (c) and (d) with the central frequencies indicated by ×.

Figure 3. Comparison of the mapping over odd parity axes of frequency f and scale

s = f^{- 1}

of the discrete spectral power of a signal with two sinusoidal components for

f \in [0, f_{c}]

in (a) and (b) and for

f \in [f_{c}, 2 f_{c}]

in (c) and (d) with the central frequencies indicated by ×.

Table 1. Numerical results from the evaluation of Figure 3.

**Table 1.** Numerical results from the evaluation of Figure 3.
P	$f \in [0, f_{c}]$				$f \in [f_{c}, 2 f_{c}]$
P	$Tr D_{f}$	$\sum_{n} P (f_{n}) Δ_{f}$	$Tr D_{s}$	$\sum_{n} P (s_{n}) Δ_{s}$	$Tr D_{f}$	$\sum_{n} P (f_{n}) Δ_{f}$	$Tr D_{s}$	$\sum_{n} P (s_{n}) Δ_{s}$
0	0.5	1.0	INF	NAN	0.5	1.0	1.0	1.0
1	0.5	1.0	INF	NAN	0.5	1.0	1.0	1.0

3. Gabor and Morlet Transforms

Let us turn now to the consideration of how to transform the signal

y \equiv y (t)

into a time-frequency representation

\hat{Y} \equiv \hat{Y} (f, t)

by modulating the exponential oscillations of the Fourier basis with a discrete window function,

Θ (f, t) \to Ψ (f, t) \equiv Φ (t) Θ (f, t)

. The Gabor transform is defined by the use of a Gaussian window with decay parameter σ, which we will notate as

Φ (t) \propto {exp}^{- π / 2} (t^{2} / σ^{2}) \equiv e^{- π t^{2} / 2 σ^{2}}

. Its normalization to unit energy

Φ \to Φ / {〈 Φ ∣ Φ 〉}_{t}^{1 / 2}

depends upon the duration of the window, parametrized by its half-width τ. In the continuum, one can write

\int_{- \infty}^{\infty} Φ^{2} (t) d t \propto σ

and

\int_{- τ}^{τ} Φ^{2} (t) d t \propto σ erf (π^{1 / 2} τ / σ)

; however, in the discrete setting with index

t \in [- τ, τ]

, the window energy

\sum_{t} Φ^{2} (t) Δ_{t}

is not expressible in closed form. For the one-sided spectrum,

Φ \to \sqrt{2} Φ

so that its energy equals 2. To fully specify which Gabor transform is being used, one must state the values of N, P, σ, and τ. We will return to the question of finding the minimal order of the Gabor transform for a signal of duration T in Section 5; for now, we will assume that N is greater than

N_{\min}

of the previous section so that the fundamental theorems are satisfied.

There are two alternatives for the definition of the phase convention in the transform, which differ in whether the phase is expressed relative to the origin of the signal’s time axis or that of the window:

\begin{matrix} \hat{Y} (n, \hat{t}) & \equiv & \sum_{t^{'} = - τ}^{τ} Φ (- t^{'}) {exp}^{i 2 π} [- f_{n} (\hat{t} + t^{'})] y (\hat{t} + t^{'}) Δ_{t} \end{matrix}

(7)

\begin{matrix} \hat{\hat{y}} (t) & \equiv & \sum_{n = 1}^{N^{'}} {exp}^{i 2 π} (f_{n} t) \sum_{t^{'} = - τ}^{τ} Φ (t^{'}) \hat{Y} (n, t + t^{'}) Δ_{t} Δ_{f} \end{matrix}

(8)

where the time axis for the transform coefficients carries an index

\hat{t} \in [1 - τ, T + τ]

, or else

\begin{matrix} \hat{Y} (n, \hat{t}) & \equiv & \sum_{t^{'} = - τ}^{τ} Φ (- t^{'}) {exp}^{i 2 π} (- f_{n} t^{'}) y (\hat{t} + t^{'}) Δ_{t} \end{matrix}

(9)

\begin{matrix} \hat{\hat{y}} (t) & \equiv & \sum_{n = 1}^{N^{'}} \sum_{t^{'} = - τ}^{τ} Φ (t^{'}) {exp}^{i 2 π} (- f_{n} t^{'}) \hat{Y} (n, t + t^{'}) Δ_{t} Δ_{f} \end{matrix}

(10)

The latter is more familiar, but either is equally valid in terms of satisfying the energy and reconstruction theorems. The sign of the argument to the window function is chosen deliberately so that these expressions remain valid for the case of a window that is not symmetric in

t^{'}

. The signal is zero padded for values of

\hat{t} + t^{'}

outside the original domain of

[1, T]

, as no other value could be assigned without changing the energy hence information content of the signal. For either phase convention, the temporal spectral power is defined as

P (n, \hat{t}) \equiv {| \hat{Y} (n, \hat{t}) |}^{2}

with units of energy per time per frequency. (The normalization of the window introduces a factor of

Δ_{t}^{- 1 / 2}

so that

\hat{Y}

carries units of

u_{y} u_{t}^{1 / 2}

.) For comparison with the signal energy

E_{y}

, let us evaluate the spectrum’s energy as

E_{\hat{Y}} \equiv \sum_{n} \sum_{\hat{t}} P (n, \hat{t}) Δ_{t} Δ_{f}

and the reconstruction’s energy as

E_{\hat{\hat{y}}} \equiv \sum_{t} {| \hat{\hat{y}} (t) |}^{2} Δ_{t}

.

For this section and the next, let us consider a particular real signal

y (t)

comprised of 4 sinusoids with unit amplitude and non-stationary frequency of duration

T = 200

. The variation in the instantaneous frequencies is assigned an amplitude of 5% and a period of T. Phases for the oscillations and variations are selected randomly. In Figure 4 we show the results of the one-sided Gabor transform with

N = 200

and

P = 0

using window parameters of

σ = 2 \sqrt{π}

and

τ = 12

. Panel (a) displays the contours of the energy density

P (n, \hat{t})

over axes of time and frequency. Also indicated are the instantaneous frequencies used to generate the data. Panels (b) and (c) display the marginal energy densities over axes of frequency and time, respectively. The marginal energy densities are evaluated by taking each sum over an axis independently, i.e.,

P (n) \equiv \sum_{\hat{t}} P (n, \hat{t}) Δ_{t}

and

P (\hat{t}) \equiv \sum_{n} P (n, \hat{t}) Δ_{f}

. Panel (d) gives the absolute value of the residual

r (t) \equiv \hat{\hat{y}} (t) - y (t)

, which is on the order of the machine precision

\sim 10^{- 16}

. The ratios

E_{\hat{Y}} / E_{y}

and

E_{\hat{\hat{y}}} / E_{\hat{Y}}

are equal to unity to the same precision, thus the fundamental theorems are satisfied. At these parameter values, the spectral resolution is poor while the temporal resolution is sharp.

Figure 4. One-sided Gabor transform of a real signal as described in the text. The power spectrum is shown in (a) with the signal’s instantaneous frequencies indicated by the dashed lines. The marginal spectral power is shown in (b) as ×, as is the marginal temporal power in (c), and each is compared with the convolution of the window and signal energies indicated by +. The absolute value of the reconstruction residual is displayed in (d).

The marginal densities of the Gabor power spectrum can be compared with the convolution of the window’s energy density with that of the signal in either the temporal or spectral representations. Looking first at the temporal representation, one finds the relation

P (\hat{t}) = \sum_{t^{'} = - τ}^{τ} Φ^{2} (- t^{'}) y^{2} (\hat{t} + t^{'}) Δ_{t}

(11)

where Φ in this case is normalized to unit energy. In the spectral representation, the convolution is taken in the modular (periodic) sense, most easily performed using the two-sided transform. Let

\hat{y} (f_{n})

be the two-sided Fourier transform of the signal with order N and parity P, and let

\hat{Φ} (f_{m}^{'})

be the transform of the window function, again normalized to unit energy, with the same order and odd parity such that

M = 2 N + 1

. Then

P (n)

in the two-sided sense

n \in [1, 2 N + P]

can be written

P (n) \approx \sum_{m = 1}^{M} {| \hat{Φ} (M + 1 - m) |}^{2} {| \hat{y} [\mod (n + m - N - 2, 2 N) + 1] |}^{2} Δ_{f}

(12)

where

\mod (a, b) \equiv a mod b

and with respect to the half-width of the edge bins, from which the values corresponding to the one-sided transform can be extracted and doubled. The relation above is written as an approximation because there is some subtlety to the evaluation of the RHS.

So far, we have imposed no condition on the window duration

T^{'} = 2 τ + 1

other than being odd for integer τ, and indeed, there is none. In the lower limit

τ = 0

, the basis functions are constant,

Ψ (0, n) = \sqrt{2}

(or 1 for the two-sided transform), and all frequency resolution is lost, yet the fundamental theorems are still satisfied for any order

N \geq 1

. In the opposite extreme, for example

τ = T

such that

T^{'} ≫ T

with

σ = \infty

, the basis functions become essentially those of the Fourier transform with minimal order

N_{\min}

. Simply put, the duration of the window in the short-time Fourier transform does not need to be short. When the temporal bandwidth of the window approaches or exceeds that of the signal, the evaluation of the RHS in Equation (12) becomes suspect if the order N is not sufficient to resolve both the signal and the window; the practical solution is to increase N. Furthermore, one can verify that the Gabor transform remains well behaved (if not particularly useful) for the case

σ \to i σ

so that the window grows exponentially rather than decaying. The only requirement is that the window be real valued and normalized explicitly, i.e., discretely, to unit energy, times 2 for the one-sided transform.

We can now introduce the Morlet transform by promoting the window parameters from constants to functions of the central frequency,

Φ (t) \to Φ_{n} (t)

. Let

σ_{n} \equiv σ / f_{n}

, and with respect to the discrete sampling in time, let

τ_{n} \equiv ⌈ τ / f_{n} ⌉

for constants σ and τ. The parameter τ itself does not need to be an integer, as

τ_{n}

is what defines the window duration for bin n. A difficulty with

τ_{n}

arises immediately for odd parity when

f_{n} = 0

; one can ameliorate the situation by using

τ_{n}

from the lowest positive bin in that case. Again, either the phase convention of Equations (7 and 8) or (9 and 10) can be used. In Figure 5 we show the results of the one-sided Morlet transform of order 200 and even parity with parameters

σ = \sqrt{π}

and

τ = 6

chosen so that the window of the previous Gabor transform corresponds to the Morlet window at the critical frequency

f_{c}

. The power spectrum in panel (a) has the marginal densities shown in panels (b) and (c); however, this time the ratios

E_{\hat{Y}} / E_{y}

and

E_{\hat{\hat{y}}} / E_{\hat{Y}}

differ from unity on the order of 1%, interestingly by nearly the same value. Likewise, the absolute value of the residual displayed in panel (d) is on the order of 1%, comparable to the Farge method [2] as reported by Torrence and Compo [1]. While the lowest frequency component has been well resolved, the spectral resolution of the upper frequencies remains poor. Furthermore, without a single window in operation, one cannot perform the comparison of the marginal densities with the convolution of the window and signal energy densities corresponding to Equations (11) and (12).

Let us insert here some comments on what is called the admissibility condition. The statement often is made by authors that the wavelet must have a mean of zero so that its Fourier coefficient at

f = 0

vanishes. While that remark might apply in the continuum, it is not appropriate for discretely sampled wavelets of finite duration. If one examines in detail the proof of the admissibility condition [22], one finds that it relies crucially on the property of continuity. In full, the admissibility condition states that if the wavelet is continuous in time with continuous Fourier transform, then its mean must be zero. In the context considered here, neither of those conditions is met: In the temporal representation the wavelet is a piece-wise constant function with jump discontinuities at the edges of the temporal bins, and likewise for the frequency representation that, while of arbitrary resolution beyond the minimal order, must nonetheless be evaluated discretely for any practical analysis of data. If one computes the leakage functions for the discrete Morlet basis [23], one finds that an artificial discontinuity is induced by the imposition of the zero mean condition. As a final argument against subtracting the mean of the discrete basis functions, note that the Gabor transform works perfectly well for any values of the window parameters without including such procedure.

Figure 5. One-sided Morlet transform of a real signal as described in the text. The power spectrum is shown in (a) with the signal’s instantaneous frequencies indicated by the dashed lines. The marginal spectral power is shown in (b) as ×, as is the marginal temporal power in (c). The absolute value of the reconstruction residual is displayed in (d).

There are many suggestions found in the literature for improving the performance of the discretely implemented continuous wavelet transform. Let us examine a few of them here, with their absolute residuals displayed in Figure 6 and a numerical summary in Table 2. For panel (a) the mean of the signal is subtracted before entering the spectral analysis. As one can see, this procedure results in few changes, but for consistency of comparison the remaining panels will also use the mean subtracted signal. Noticing that the ratios

E_{\hat{Y}} / E_{y}

and

E_{\hat{\hat{y}}} / E_{\hat{Y}}

are very nearly equal, one can renormalize the spectrum and reconstruction a posteriori by their geometric mean

C = {(E_{\hat{\hat{y}}} / E_{y})}^{1 / 2}

, yielding the residual shown in panel (b). This renormalization prescription works best when there is not much energy in the upper portion of the frequency axis [23]. Perhaps more familiar is the prescription to shift the central frequency

2 π \to ω_{1}

. At these window parameter values, a shift of

ω_{1} / 2 π = 1.0127

is found to work well [24], giving the residual shown in panel (c). If one then applies the renormalization prescription, one gets the residual displayed in panel (d). While there has been a noticeable improvement of three orders of magnitude, the root-mean-square residual remains far above the machine precision.

Figure 6. Absolute residuals for several methods of improving the transform response as described in the text. In panel (a), the mean of the signal is subtracted; in panel (b), the mean is subtracted with renormalization of the coefficients; in panel (c), the mean is subtracted with a shift of central frequency; and in panel (d), the mean is subtracted with frequency shift and renormalization.

Table 2. Numerical results from the evaluation of Figure 6.

**Table 2.** Numerical results from the evaluation of Figure 6.
Panel	(a)	(b)	(c)	(d)
${〈 r^{2} (t) 〉}_{t}^{1 / 2}$	1.8215e −02	2.6142e −03	1.8213e −04	4.1799e −05
$E_{\hat{Y}} / E_{y} - 1$	1.2885e −02	−1.7468e −06	1.2674e −04	−4.4658e −10
$E_{\hat{\hat{y}}} / E_{\hat{Y}} - 1$	1.2889e −02	1.7468e −06	1.2674e −04	4.4658e −10

4. Layered Window Transform

The primary feature of the Morlet transform is that it uses a different window function

Φ_{n}

for each bin along the frequency axis. That is also its main difficulty, as each bin corresponds to a particular instance of the Gabor transform, any of which would work fine independently, but when the separate bins are isolated and combined, there is no reason to suppose that their marginal sums will equal the signal energy. The correspondence between the Morlet and Gabor transforms becomes more apparent if one redefines the window decay

σ_{n}

relative to the window width

τ_{n}

rather than the central frequency

f_{n}

, in which case regions along the frequency axis of the Morlet spectrum are drawn from a particular Gabor spectrum. Simply pasting together pieces from different Gabor transforms from the outset appears doomed to failure, and it is surprising that the Morlet transform in the discrete setting works as well as it does. While we have not given up entirely on the Morlet transform, if one wants to make quantitative use of the results of a spectral analysis, the energy and reconstruction theorems must be satisfied precisely.

How, then, are we to devise a multiresolution analysis that satisfies the fundamental theorems of spectral analysis? The answer lies in focusing our attention on the distribution of energy in the window function. Suppose we have one unit of energy, an “atom” so to speak, distributed with temporal power

Φ_{A}^{2} (t)

in the continuum, which we wish to use as the basis for a multiresolution analysis. Let the window at scale τ be defined by

Φ_{τ} (t) \propto Φ_{A} (t / τ)

for positive integer τ, which scales the relative unit of time by integral amounts. One can supplement this definition for

τ = 0

by letting

Φ_{0} (0) = 1

and 0 otherwise. For an exponential decay with scale invariant parameter σ, one can write

Φ_{τ} (t) \propto {exp}^{- π / 2} (t^{2} / τ^{2} σ^{2})

, where the normalization to unit energy is done explicitly over the integers

t \in [- τ, τ]

. (Alternately, one could hold the window width fixed for all τ.) The layered window

Φ_{L} (t)

is then defined to be the sum in terms of energy, not amplitude, of the windows

Φ_{τ} (t)

over some set of scales

τ \in [τ_{1}, τ_{L}]

with L members, normalized to unit energy,

Φ_{L}^{2} (t) \equiv L^{- 1} \sum_{τ} Φ_{τ}^{2} (t)

. For a one-sided transform,

Φ_{L} \to \sqrt{2} Φ_{L}

of course. This single window, with duration

T^{'} = 2 τ_{L} + 1

, is to be used for all the frequency bins in the windowed Fourier transform.

Since the atomic window

Φ_{A}

is arbitrary, the distinction between summing the window energies

Φ_{τ}^{2}

versus summing the window amplitudes

Φ_{τ}

is really just a matter of interpretation. Whichever way one defines the sum over scales, the layered window eventually is normalized explicitly to one unit of energy, or two units for a one-sided transform. Using the construction above in terms of energy, the amplitude of the layered window is the root-mean-square of the amplitudes of the scaled windows, while the construction summing the window amplitudes is not so simply normalized. With respect to its physical motivation, energy is the quantity that ultimately gets quantized, not amplitude, which is a property reflected in our favored definition of the layered window. The selection by the investigator of the set of scales

{τ}

to use in the construction specifies the domain of energy distributions to which the transform will be sensitive.

Let us return now to consideration of our four frequency test signal, with its mean restored. For comparison with the previous Gabor and Morlet transforms, let the decay parameter here be

σ = \sqrt{π} / 6

, and let

τ \in [12, 96]

. The results of such analysis are displayed in Figure 7. Compared with the Morlet transform, the spectral resolution is much improved in the upper portion of the frequency axis, as seen in panel (a). Since only one window is in operation, the marginal densities of panels (b) and (c) can be compared with the convolution of the window and signal energy densities in either the temporal or spectral representations. The absolute residual shown in panel (d) is on the order of the machine precision, with maybe a slight accumulation of truncation errors. The ratios

E_{\hat{Y}} / E_{y}

and

E_{\hat{\hat{y}}} / E_{\hat{Y}}

equal unity to machine precision. The layered window transform, by virtue of using a single window for all frequencies, satisfies the energy and reconstruction theorems to the accuracy of one’s computational device.

As a final comparison, let us put the Fourier power spectrum of the signal alongside the marginal spectral energy densities of the Gabor, Morlet, and layered window transforms, displayed in Figure 8. The Fourier spectrum in panel (a) is the typical mess one gets when analyzing non-stationary signals, but its net energy does equal the signal energy. The Gabor spectrum in panel (b) has lost its frequency resolution on account of the tight temporal bandwidth of the selected window parameters, yet retains the correct normalization. The Morlet spectrum in panel (c) resolves the low frequency portion of the spectrum but not the high frequency region and does not have the correct normalization. The layered window spectrum in panel (d) is normalized to the signal energy and resolves both the low and high frequency parts of the spectrum while maintaining sufficient temporal resolution to be useful in identifying non-stationary features in the signal.

Figure 7. One-sided layered window transform of a real signal as described in the text. The power spectrum is shown in (a) with the signal’s instantaneous frequencies indicated by the dashed lines. The marginal spectral power is shown in (b) as ×, as is the marginal temporal power in (c), and each is compared with the convolution of the window and signal energies indicated by +. The absolute value of the reconstruction residual is displayed in (d).

Figure 8. Comparison of the Fourier power spectrum in (a) with the marginal spectral energy densities of the Gabor transform in (b), the Morlet transform in (c), and the layered window transform in (d).

5. Irregular Sampling and Minimal Order

Let us now consider the case of an irregularly sampled signal

y (t)

, with regard to the analyses by Lomb [25] and Scargle [26]. Irregular sampling has long vexed practitioners of wavelet analysis, leading to a variety of suggestions for how to overcome its difficulties. Because of the nature of stellar observations, the astrophysical community often presents data that contains gaps or is otherwise on an irregular time axis. The method by Foster [27] focuses on the normalization of the gapped wavelet basis functions, while the method by Frick et al. [28] focuses on the admissibility condition; taken together, those ideas lead to the edge adapted algorithm proposed by Johnson [29]. More familiar perhaps is the lifting scheme by Sweldens [30], which operates in the context of the dyadic wavelet transform implemented in terms of finite impulse response digital filter banks.

For simplicity, we will assume that the samples each have a uniform duration

Δ_{t}

, which is not greater than any of the inter-measurement periods, so that the time metric

D_{t}

remains proportional to the identity matrix; otherwise, a suitably generalized

D_{t}

must be used. Let the observation times be given by the vector

t \equiv t_{d}

indexed by

d \in [1, D]

, where D is the total number of samples, in some unit

u_{t}

not necessarily equal to

Δ_{t}

such that the values

t_{d}

are integers;

u_{t}

is the resolution of the time measuring apparatus, thus

Δ_{t}

must also be an integer. Similarly, the measurements can be written as the vector

y \equiv y_{d}

so that the signal energy remains

E_{y} = y^{†} D_{t} y

in units of

u_{y}^{2} u_{t}

. Missing values, i.e., measurements at integer times not in

t

, are effectively treated as zero. Let us look first at how the discretized Fourier transform is modified in this case.

To identify the Nyquist critical frequency, one must find the lowest positive Fourier basis function that is entirely real (technically up to a constant phase) over the given set of observation times [31]. That procedure is easily accomplished by shifting and scaling the time axis

t \to t^{'}

such that

f_{c} \equiv {(2 u_{t^{'}})}^{- 1}

in the new time units. If one defines

t_{g}

to be the greatest common divisor over the set of inter-measurement periods, the new time axis can be written

t_{d}^{'} \equiv (t_{d} - t_{1}) / t_{g} + 1

in units of

u_{t^{'}} \equiv t_{g} u_{t}

such that

t_{d}^{'} \in [1, T]

remains integer valued. At the critical frequency, the Fourier basis function is

{exp}^{i π} (2 f_{c} t_{d}^{'}) = \pm 1

, and the Fourier spectrum has period 1 in units

u_{f} = u_{t^{'}}^{- 1}

. Having found the units in which

f_{c} = 1 / 2

, let us drop the prime distinguishing the scaled time axis in the following, and for convenience let us suppose

Δ_{t} = 1

in the scaled units; otherwise one must be extra careful with the scaling of the energy units.

The next task is to determine the minimal order for satisfaction of the energy and reconstruction theorems. Let

N_{D} \equiv ⌈ D / 2 ⌉

, and let

N_{T} \equiv ⌈ T / 2 ⌉

. For

N < N_{D}

there are an insufficient number of DOF in the discrete Fourier transform to fully represent the information content of the (real) signal, thus the fundamental theorems are not satisfied in general. Likewise, for

N \geq N_{T}

there are more than enough DOF to represent the signal;

N_{T}

gives the minimal order of the analogous regularly sampled signal where the missing entries are assigned the value zero. For

N_{D} \leq N < N_{T}

, the Fourier transform might work, depending upon the particulars of the irregular sampling. To satisfy the reconstruction theorem

\hat{\hat{y}} = y

for

\hat{\hat{y}} \equiv Q y

in the one-sided case, one must have

Q \equiv Re Θ D_{f} Θ^{†} D_{t} = I_{D}

to the order of machine precision; the energy theorem also is satisfied when that condition is met. (For a complex signal, the entire matrix, not just its real part, must be utilized, and of course

D_{f}

must cover one full period of the frequency axis.) The basis functions are evaluated only at the observation times

Θ (d, n) = {exp}^{i 2 π} (f_{n} t_{d})

, because those are the only locations at which the reconstruction theorem must be satisfied. In Table 3 we compare the norm of the reconstruction residual

r \equiv \hat{\hat{y}} - y

for some signal with

D = 10

and

T = 28

with the distance between the quality matrix

Q

and the identity

I_{D}

according to the Frobenius metric at various orders N with

P = 0

. To produce a smooth picture of the spectral content, one should take

N ≫ N_{T}

.

Table 3. Comparison of the norm of the residual of the Fourier transform with the distance from the quality matrix

Q

to the identity

I_{D}

at various orders N and even parity for an irregularly sampled signal with

D = 10

and

T = 28

such that

N_{D} = 5

and

N_{T} = 14

.

**Table 3.** Comparison of the norm of the residual of the Fourier transform with the distance from the quality matrix $Q$ to the identity $I_{D}$ at various orders N and even parity for an irregularly sampled signal with $D = 10$ and $T = 28$ such that $N_{D} = 5$ and $N_{T} = 14$ .
N	5	6	7	8	9
${∥ r ∥}_{F}$	3.5943e +00	5.1760e +00	3.0368e +00	5.5110e −15	3.0148e +00
${∥ Q - I_{D} ∥}_{F}$	2.4495e +00	2.8284e +00	2.4495e +00	5.6505e −15	2.0000e +00
$N$	10	11	12	13	14
${∥ r ∥}_{F}$	6.6842e −01	5.4143e −15	2.6771e +00	6.2415e −15	7.4742e −15
${∥ Q - I_{D} ∥}_{F}$	1.4142e +00	4.3648e −15	2.0000e +00	4.2983e −15	4.9648e −15

Turning now to the consideration of the Gabor transform (and by extension all windowed Fourier transforms with a fixed Φ), the construction of the quality matrix is a bit more complicated, owing to the convolutions along the time axis. It is commonly remarked that the number of DOF along the frequency axis is multiplied by the number of times the window duration

T^{'} = 2 τ + 1

fits within the signal duration T, but the situation is more subtle than that, as one must also account for the bandwidth of the window given by the decay σ. Beginning with the case of regular sampling

D = T

, using the phase convention of Equations (9) and 10), one can write the composition of the inverse and forward transforms as

\begin{matrix} \hat{\hat{y}} (t) & = & Re \sum_{n = 1}^{N^{'}} \sum_{t^{'} = - τ}^{τ} Φ (t^{'}) {exp}^{i 2 π} (- f_{n} t^{'}) \sum_{t^{''} = - τ}^{τ} Φ (- t^{''}) {exp}^{i 2 π} (- f_{n} t^{''}) y (t + t^{'} + t^{''}) Δ_{t} Δ_{t} Δ_{f} \end{matrix}

(13a)

\begin{matrix} = & Re \sum_{t^{'''} = - 2 τ}^{2 τ} \sum_{t^{'} + t^{''} = t^{'''}} Φ (t^{'}) Φ (- t^{''}) \sum_{n = 1}^{N^{'}} {exp}^{i 2 π} (- f_{n} t^{'''}) y (t + t^{'''}) Δ_{f} Δ_{t} Δ_{t} \end{matrix}

(13b)

\begin{matrix} \equiv & \sum_{t^{'''} = - 2 τ}^{2 τ} q (t^{'''}) y (t + t^{'''}) Δ_{t} \end{matrix}

(13c)

which yields one row in the quality matrix

Q

indexed by

t^{'''} \in [- 2 τ, 2 τ]

indicating the diagonal where

q (t^{'''})

appears such that

Q

has the form of a Toeplitz matrix of rank T. For a symmetric window

Φ (- t^{'}) = Φ (t^{'})

written as a diagonal matrix Φ such that

Ψ \equiv Φ Θ

, the values

q (t^{'''})

can be extracted from

Ψ^{*} D_{f} Ψ^{†}

. For an irregularly sampled signal

T > D

, one retains only those rows and columns of

Q

corresponding to actual data entries,

Q \to Q (t, t)

. In Table 4 we compare the residual norm of the Gabor transform with

τ = 12

and two values of σ with the distance from

Q

to

I_{D}

at various N with

P = 0

for an irregularly sampled signal of duration

T = 100

with

D = 30

entries. As N approaches

N_{D}

from below, the quality of reconstruction improves until the residual is on the order of machine precision.

Table 4. Comparison of the norm of the residual of the Gabor transform with width

τ = 12

and the indicated decay σ with the distance from the quality matrix

Q

to the identity

I_{D}

at various orders N and even parity for an irregularly sampled signal with

D = 30

and

T = 100

such that

N_{D} = 15

and

N_{T} = 50

.

**Table 4.** Comparison of the norm of the residual of the Gabor transform with width $τ = 12$ and the indicated decay σ with the distance from the quality matrix $Q$ to the identity $I_{D}$ at various orders N and even parity for an irregularly sampled signal with $D = 30$ and $T = 100$ such that $N_{D} = 15$ and $N_{T} = 50$ .
σ	N	6	7	8	9	10
$2 \sqrt{π}$	${∥ r ∥}_{F}$	5.4222e −04	2.8233e −05	5.1908e −07	1.1113e −08	7.3918e −11
	${∥ Q - I_{D} ∥}_{F}$	5.2358e −04	2.0300e −05	4.4962e −07	7.0963e −09	5.1610e −11
	$N$	11	12	13	14	15
	${∥ r ∥}_{F}$	2.7305e −13	1.2477e −14	6.2789e −15	5.8445e −15	7.2210e −15
	${∥ Q - I_{D} ∥}_{F}$	1.6628e −13	4.3047e −16	4.1495e −16	2.8105e −16	2.2323e −16
$σ$	$N$	6	7	8	9	10
$4 \sqrt{π}$	${∥ r ∥}_{F}$	4.5343e −01	2.6201e −01	7.5299e −02	3.4864e −02	6.9207e −03
	${∥ Q - I_{D} ∥}_{F}$	4.3784e −01	1.8839e −01	6.5223e −02	2.2263e −02	4.8319e −03
	$N$	11	12	13	14	15
	${∥ r ∥}_{F}$	1.0987e −03	7.8620e −05	1.0067e −14	1.2076e −14	1.2964e −14
	${∥ Q - I_{D} ∥}_{F}$	6.6716e −04	6.9627e −05	6.0970e −16	7.7092e −16	1.0855e −15

For the Morlet basis functions, the construction of Ψ is more involved, as

Ψ (t^{'}, n) = Φ (t^{'}, n) Θ (t^{'}, n)

. Nonetheless, one can evaluate

Q

from

Ψ^{*} D_{f} Ψ^{†}

for any rank T. As a function of order N, one finds that the deviation

{∥ Q - I_{T} ∥}_{F}

decreases until

N > T / 2

, after which it bottoms out on the order of a few percent times T. In Table 5 we compare the norm of the residual for the Morlet basis using

σ = \sqrt{π}

and

τ = 6

for some regularly sampled signal with

D = T = 30

with the deviation of

Q

from

I_{D}

at various N with

P = 0

. If one wishes to investigate the feasibility of devising a Morlet basis with perfect reconstruction, the evaluation of

Q

is where to start.

Table 5. Comparison of the norm of the residual of the Morlet transform with width

τ = 6

and decay

σ = \sqrt{π}

with the distance from the quality matrix

Q

to the identity

I_{D}

at various orders N and even parity for a regularly sampled signal with

D = T = 30

such that

N_{D} = N_{T} = 15

.

**Table 5.** Comparison of the norm of the residual of the Morlet transform with width $τ = 6$ and decay $σ = \sqrt{π}$ with the distance from the quality matrix $Q$ to the identity $I_{D}$ at various orders N and even parity for a regularly sampled signal with $D = T = 30$ such that $N_{D} = N_{T} = 15$ .
N	6	7	8	9	10
${∥ r ∥}_{F}$	3.3471e +00	2.9397e +00	2.7428e +00	2.2582e +00	1.9475e +00
${∥ Q - I_{D} ∥}_{F}$	2.9527e +00	2.4686e +00	2.0993e +00	1.8244e +00	1.5788e +00
$N$	11	12	13	14	15
${∥ r ∥}_{F}$	1.7911e +00	1.5917e +00	1.3310e +00	9.3137e −01	5.3677e −01
${∥ Q - I_{D} ∥}_{F}$	1.3451e +00	1.1139e +00	8.7648e −01	6.3175e −01	4.0199e −01

Let us close this section by looking at the analysis of the signal from the previous section but with the number of measurements reduced by a factor of a third (

N_{D} = 134

) for the same duration

T = 200

. The evaluation of the layered window Fourier transform proceeds as before, with the understanding that values of

y (\hat{t} + t^{'})

at times not in

t

are treated as zero; the time axis for the spectrum

\hat{Y} (n, \hat{t})

includes every integer

\hat{t} \in [1 - τ_{L}, T + τ_{L}]

. Using the same parameters as before,

σ = \sqrt{π} / 6

and

τ \in [12, 96]

with

N = 200

and

P = 0

, in Figure 9 we display the power spectrum, its marginal densities, and the reconstruction residual for our irregularly sampled signal. While the loss of information has affected the appearance of the spectrum relative to the regularly sampled case, it nonetheless remains a faithful representation of the available information. By virtue of using a single fixed window

Φ_{L}

, the layered window Fourier transform satisfies the fundamental theorems of spectral analysis even when there are gaps in the observation record.

Figure 9. One-sided layered window transform of an irregularly sampled signal as described in the text. The power spectrum is shown in (a) with the signal’s instantaneous frequencies indicated by the dashed lines. The marginal spectral power is shown in (b) as ×, as is the marginal temporal power in (c), and each is compared with the convolution of the window and signal energies indicated by +. The absolute value of the reconstruction residual is displayed in (d).

6. Window Comparison

Let us conclude the analysis with a comparison of the temporal and spectral bandwidths for several types of window function, as well as their estimates of the power spectral density carried by a test signal with a little more complexity than we had before. The test signal is regularly sampled with

T = 200

and four component frequencies, but now only two components are present in the first half of the duration, while the other two are in the second half. An abrupt transition occurs at the midpoint of the duration. The amplitude of the frequency variation is now 10%, with a period of

T / 2

, so that each component covers a broad range of instantaneous frequency. The order for all the transforms in this section will be the minimal Fourier order of the signal

N = 100

with even parity

P = 0

.

The parameters for the four types of window considered are displayed in Table 6, comprising of two Gabor windows at either end of the range in scale for the layered window also considered, in addition to a random window with an arbitrary duration. The first moments of time and frequency are indicated by

t_{1}

and

f_{1}

respectively, and the bandwidths (square root of the second moment about the first moment) by

t_{2}

and

f_{2}

. The two Gabor windows are observed to minimize the Fourier uncertainty relation

t_{2} f_{2} \geq 1 / 4 π

, thus in that sense are optimal, while the layered window has a slightly larger bandwidth product. The random window has a bandwidth product that is considerably larger, yet all these windows produce a valid discrete transform pair that satisfies the fundamental theorems of spectral analysis as indicated by the residual of reconstruction

{∥ r ∥}_{F}

. The energy densities in the temporal and spectral representation used to evaluate the bandwidth products are shown in Figure 10, where one can see the trade-off in time/frequency resolution in action.

Table 6. Comparison of the temporal and spectral bandwidths for various windows, as well as their rms reconstruction residual for the test signal as described in the text.

**Table 6.** Comparison of the temporal and spectral bandwidths for various windows, as well as their rms reconstruction residual for the test signal as described in the text.
Window	σ	τ	$t_{1}$	$t_{2}$	$f_{1}$	$f_{2}$	$4 π t_{2} f_{2}$
Gabor	$2 \sqrt{π}$	12	−0.000	1.414	−0.000	0.056	1.000
Gabor	$16 \sqrt{π}$	96	−0.000	11.314	0.000	0.007	1.000
Layered	$\sqrt{π} / 6$	$[12, 96]$	0.000	6.990	0.000	0.014	1.244
Random	—	24	−2.836	13.718	0.000	0.130	22.442

The power spectral density estimates evaluated from the test signal of this section using the various windows are displayed in Figure 11, arranged from (a) to (d) according to Table 6. The power spectrum given by the layered window in (c) does indeed incorporate features of either Gabor transform shown in (a) and (b); in a sense, it has combined the Gabor transforms over its range of scale in a way that preserves the energy and reconstruction theorems. With some algebraic manipulation of the expressions defining the layered window Fourier transform one might be able to show that explicitly, but for now that statement is intuitive speculation. Interestingly, the spectrum given by the random window is not much different from the others, driving home the point that the shape of the window truly is arbitrary as long as it is normalized appropriately.

Figure 10. Comparison of the temporal and spectral energy densities of the various windows arranged according to Table 6.

Figure 11. Comparison of the power spectral density of the test signal described in the text using the various windows arranged according to Table 6. The instantaneous frequencies used to generate the signal are indicated by the dashed lines.

7. Discussion

The primary result of this investigation is that the windowed Fourier transform has far more flexibility and utility than it is usually credited with. To achieve satisfaction of the energy and reconstruction theorems in the discrete setting, the only requirements on the window are that it be real, nonnegative, and normalized explicitly to unit energy, or two units for a one-sided transform. While we have looked only at symmetric windows here, one can easily verify that the expressions for the transform pair, either Equations (7 and 8) or (9 and 10), remain valid for a window that is not symmetric in

t^{'}

; with attention to the details of relocating the temporal bins, the coefficients can be assigned to the time corresponding to the peak of the window rather than its midpoint. Similarly, a window with an even length duration

T^{'}

could be used if one is careful with the definition of the phase and the location of the bins. In fact, any nonnegative function can be used for Φ, or even a list of arbitrary numbers, as long as it is suitably normalized.

The flexibility in Φ allows one to define a multiresolution spectral analysis in terms of an atomic unit of energy

Φ_{A} (t / τ)

evaluated over a range of scales τ. These multiple scalings of

Φ_{A}

are combined into a single layered window

Φ_{L}

that is applied to the entire frequency axis; the phase component of the basis is independent of the window function. Because only a single window is used, the fundamental theorems of spectral analysis are satisfied. With a Gaussian

Φ_{A}

, the form of

Φ_{L}

very closely resembles a Lorentzian function, which expresses uniformity over scale when expressed as an angle. The shape of

Φ_{L}

can be tailored to the needs of the investigation through inspection of its leakage function. Similarly to the wavelet transform, the trade-off between resolution in time or frequency is under the control of the investigator. Any structure seen in the spectral coefficients of the signal is understood to be conditioned on the selection of the window function; the answer one gets depends upon how the question is asked.

The difficulties faced by the continuous wavelet transform exemplified by the Morlet basis, with respect to the fundamental theorems of spectral analysis, can be summarized in the evaluation of its quality matrix

Q

. Perfect reconstruction in the discrete setting requires

Q

to equal the identity matrix to the precision of one’s computational device. The various adjustments looked at above that improve the reconstruction residual must be bringing

Q

closer to that form; however, none of them achieve the required precision. For an integral transform pair to have quantitative significance, one must demonstrate that the distance from its

Q

to

I

is numerically zero. This investigation has proposed an alternate approach to multiresolution analysis, which does satisfy the energy and reconstruction theorems while improving upon the resolution properties of the Gabor transform.

The effects of aliasing and of irregular sampling are easily understood with regard to the periodicity induced in the Fourier spectrum. The Nyquist interval is given simply by the span between frequencies that are indistinguishable over the stated measurement times; assigning a value of unity to that range such that

f_{c} = 1 / 2

is just a matter of choosing the appropriate units for time. The implementation of the windowed Fourier transform is unaffected beyond zero padding the signal at locations of missing measurements, whose justification is that no other procedure leaves the energy, hence information content, unchanged. One should observe that the ends of a regularly sampled signal are treated the same way. On that note, we can remark that there is no cone of influence for the windowed Fourier transform, as every spectral coefficient is important to the satisfaction of the fundamental theorems, even those outside the temporal domain of the data.

The minimal order

N_{\min}

required for a faithful representation of the signal depends upon the temporal bandwidth of the window and the duration of the data. Pinning that number down for the windowed Fourier transform is a bit tricky, and if needed is best evaluated explicitly through inspection of

Q

as a function of order N. What one can say in general is that

1 \leq N_{\min} \leq N_{T}

, where the bounds are given by the minimal orders for the fully temporal and fully spectral representations of the regularly sampled signal, respectively. Irregular sampling complicates matters by introducing an order

N_{D} < N_{T}

such that

N_{D} \leq N_{\min} \leq N_{T}

for the full-length Fourier transform and

N_{\min} \leq N_{D}

for the windowed Fourier transform. For most practical cases of data analysis, one is interested in producing a smooth plot of the spectral content, thus one would use an order

N ≫ N_{\min}

. To verify the marginal density of the windowed power spectrum in comparison with the convolution of the window and signal energy densities in the spectral representation, one requires an order N sufficient to resolve both the signal and the window temporal durations.

Thus far we have heard nary a peep from the actual basis functions used in this analysis, so let us close the discussion by looking at some of the stars of the show. In Figure 12 we display the real and imaginary parts of the basis functions, normalized to unit energy, for each of the transforms considered above at order

N = 50

with even parity

P = 0

at the lowest and highest entries on the frequency axis, i.e.,

f_{1} = 1 / 200

and

f_{N} = 99 / 200

. The Fourier transform has no intrinsic window, so it is shown for a duration

T^{'} = 101

that spans

T < T^{'}

. The window parameters chosen are those from the previous sections:

σ = 2 \sqrt{π}

and

τ = 12

for the Gabor transform,

σ = \sqrt{π}

and

τ = 6

for the Morlet transform, and

σ = \sqrt{π} / 6

and

τ \in [12, 96]

for the layered window transform. One can observe that the transforms whose

Q

equals the identity share the property that these basis functions are phase analogues over the discrete sample times, whereas the Morlet basis functions are scale analogues. By phase analogue we mean if

Ψ (t^{'}, 1) = a_{1} + i b_{1}

, then

Ψ (t^{'}, N) = a_{1} {(- 1)}^{t^{'}} + i b_{1} {(- 1)}^{t^{'} + 1}

, a feature shared by all positive frequency pairs whose midpoint is

f = 1 / 4

; the Morlet basis differs from the others in this regard. Whether that property is required for satisfaction of the fundamental theorems is unclear, but the results of this analysis suggest that it is.

Figure 12. Comparison of the lowest

Ψ (t^{'}, 1)

and highest

Ψ (t^{'}, N)

frequency basis functions at order

N = 50

with

P = 0

using the parameters found in the text, with the real part indicated by × and the imaginary part by +. The Fourier basis is in (a) and (b), the Gabor basis is in (c) and (d), the Morlet basis is in (e) and (f), and the layered window basis is in (g) and (h).

Figure 12. Comparison of the lowest

Ψ (t^{'}, 1)

and highest

Ψ (t^{'}, N)

frequency basis functions at order

N = 50

with

P = 0

using the parameters found in the text, with the real part indicated by × and the imaginary part by +. The Fourier basis is in (a) and (b), the Gabor basis is in (c) and (d), the Morlet basis is in (e) and (f), and the layered window basis is in (g) and (h).

8. Conclusions

In this article we have compared the Fourier, Gabor, and Morlet transforms in the fully discrete setting applicable to the analysis of sampled data. While the Fourier and Gabor transforms satisfy the fundamental theorems of energy conservation and perfect reconstruction, the Morlet transform does not. The magnitude of the residual can be related to the distance from the quality matrix to the identity in all cases, such that the minimal order for reconstruction can be determined for the Fourier and Gabor transforms. Various methods of improving the response of the Morlet transform are considered; however, none of them achieve the desired precision on par with the truncation error of one’s computational device.

An alternate approach to multiresolution analysis is proposed, which constructs a single layered window from multiple scalings of some atomic unit of energy. This layered window transform satisfies the fundamental theorems of spectral analysis while providing sufficient temporal resolution to identify non-stationary features in the signal. The trade-off between time and frequency resolution is under the control of the investigator through the selection of the atomic window and the scales over which it is evaluated. The power spectrum of the layered window transform is similar to that of the Morlet transform but provides much better frequency resolution.

The premise behind the wavelet transform is that the low frequency elements of a signal should have a much longer duration than the high frequency elements. There are, however, many examples of real world signals for which the converse is true. Consider, for example, the digital recording of a kick drum and cymbal rhythm such that the low frequency bursts have a relatively short duration compared with the ringing at high frequency. For that type of signal, the wavelet transform is going to provide a poor choice of basis even if it satisfied the fundamental theorems. The flexibility in the choice of window function used in the windowed Fourier transform allows one to tailor its response to the needs of the analysis far better. The result one gets depends upon how one defines the resolution of the window, any of which provide a valid spectral representation of the signal.

To make quantitative use of the spectral analysis of some discretely sampled signal, one must demonstrate that the fundamental theorems are satisfied. The windowed Fourier transform can be shown to satisfy those requirements for any real valued window function that is suitably normalized. The potential for gaps in the data, or some other form of irregular sampling, is found to pose no problem once the correct Nyquist interval is identified. Consequently, the windowed Fourier transform can be applied to data from a wide variety of sources, such as astronomical observations, which are limited physically to an irregular set of observation times, as well as the common case of regular sampling. The implementation of a multiresolution spectral analysis in the discrete setting appears to be possible only when the multiple scalings of the energy distribution are applied evenly across the frequency axis by combining them into a single layered window.

References

Torrence, C.; Compo, G.P. A practical guide to wavelet analysis. Bull. Am. Meteor. Soc. 1998, 79, 61–78. [Google Scholar] [CrossRef]
Farge, M. Wavelet transforms and their applications to turbulence. Ann. Rev. Fluid Mechan. 1992, 24, 395–458. [Google Scholar] [CrossRef]
Grossmann, A.; Morlet, J. Decomposition of hardy functions into square integrable wavelets of constant shape. SIAM J. Math. Anal. 1984, 15, 723–736. [Google Scholar] [CrossRef]
Meyer, Y. Ondelettes et fonctions splines. Available online: https://eudml.org/doc/111928 (accessed on 19 June 2013).
Mallat, S. A theory for multiresolution signal secomposition: The wavelet representation. Pattern Anal. Mach. Intell. IEEE Trans. 1989, 11, 674–693. [Google Scholar] [CrossRef]
Daubechies, I. Orthonormal bases of compactly supported wavelets. Commun. Pure Appl. Math. 1988, 41, 909–996. [Google Scholar] [CrossRef]
Vetterli, M.; Herley, C. Wavelets and filter banks: Theory and design. Signal Process. IEEE Trans. 1992, 40, 2207–2232. [Google Scholar] [CrossRef]
Kronland-Martinet, R.; Morlet, J.; Grossmann, A. Analysis of sound patterns through wavelet transforms. Int. J. Pattern Recognit. Artif. Intell. 1987, 1, 273–302. [Google Scholar] [CrossRef]
Meyers, S.D.; Kelly, B.G.; O’Brien, J.J. An introduction to wavelet analysis in oceanography and meteorology: With application to the dispersion of yanai waves. Mon. Weather Rev. 1993, 121, 2858–2866. [Google Scholar] [CrossRef]
Baliunas, S.; Frick, P.; Sokoloff, D.; Soon, W. Time scales and trends in the central england temperature data (1659–1990): A wavelet analysis. Geophys. Res. Lett. 1997, 24, 1351–1354. [Google Scholar] [CrossRef]
Fligge, M.; Solanki, S.K.; Beer, J. Determination of solar cycle length variations using the continuous wavelet transform. Astron. Astrophys. 1999, 346, 313–321. [Google Scholar]
Christopoulou, E.B.; Skodras, A.N.; Georgakilas, A.A. Time Series Analysis of Sunspot Oscillations Using the Wavelet Transform. In Proceedings of the 14th International Conference on Digital Signal Processing, Santorini, Greece, 1–3 July 2002; Volume 2, pp. 893–896.
Le, G.M.; Wang, J.L. Wavelet analysis of several important periodic properties in the relative sunspot numbers. Chin. J. Astron. Astrophys. 2003, 3, 391–394. [Google Scholar] [CrossRef]
Grinsted, A.; Moore, J.C.; Jevrejeva, S. Application of the cross wavelet transform and wavelet coherence to geophysical time series. Nonlinear Process. Geophys. 2004, 11, 561–566. [Google Scholar] [CrossRef]
Piscaronoft, P.; Kalvová, J.; Brázdil, R. Cycles and trends in the Czech temperature series using wavelet transforms. Int. J. Climatol. 2004, 24, 1661–1670. [Google Scholar]
Liu, Y.; Liang, X.S.; Weisberg, R.H. Rectification of the bias in the wavelet power spectrum. J. Atmos. Ocean. Technol. 2007, 24, 2093–2102. [Google Scholar] [CrossRef]
Jevrejeva, S.; Moore, J.C.; Grinsted, A. Influence of the Arctic Oscillation and El Nino-Southern Oscillation (ENSO) on ice conditions in the Baltic Sea: The wavelet approach. J. Geophys. Res. Atmos. 2003, 108. [Google Scholar] [CrossRef]
Echer, M.P.S.; Echer, E.; Nordemann, D.J.R.; Rigozo, N.R. Multi-resolution analysis of global surface air temperature and solar activity relationship. J. Atmos. Solar-Terr. Phys. 2009, 71, 41–44. [Google Scholar] [CrossRef]
Greene, N. Inverse wavelet reconstruction for resolving the Gibbs phenomenon. Int. J. Circuits Syst. Signal Process. 2008, 2, 73–77. [Google Scholar]
Alphawave Toolbox for Spectral Analysis. Available online: http://www.alphawaveresearch.com (accessed on 20 May 2013).
Press, W.; Teukolsky, S.; Vetterling, W.; Flannery, B. Numerical Recipes in C, 2nd ed.; Cambridge University Press: Cambridge, UK, 1992. [Google Scholar]
Sadowsky, J. Investigation of signal characteristics using the continuous wavelet transform. Johns Hopkins APL Tech. Dig. 1996, 17, 258–269. [Google Scholar]
Johnson, R.W. Wavelets: Classification, Theory and Applications; Nova Science Publishers: Hauppauge, NY, USA, 2012; Chapter 6; pp. 125–155. [Google Scholar]
Johnson, R.W. Symmetrization and enhancement of the continuous Morlet transform for spectral density estimation. Int. J. Wavelets Multiresolut. Inf. Process. 2012, 10. [Google Scholar] [CrossRef]
Lomb, N.R. Least-squares frequency analysis of unequally spaced data. Astrophys. Space Sci. 1976, 39, 447–462. [Google Scholar] [CrossRef]
Scargle, J.D. Studies in astronomical time series analysis. II. Statistical aspects of spectral analysis of unevenly spaced data. Astrophys. J. 1982, 263, 835–853. [Google Scholar] [CrossRef]
Foster, G. Wavelets for period analysis of unevenly samples time series. Astron. J. 1996, 112, 1709–1729. [Google Scholar] [CrossRef]
Frick, P.; Baliunas, S.L.; Galyagin, D.; Sokoloff, D.; Soon, W. Wavelet analysis of stellar chromospheric activity variations. Astrophys. J. 1997, 483, 426–434. [Google Scholar] [CrossRef]
Johnson, R.W. Edge adapted wavelets, solar magnetic activity, and climate change. Astrophys. Space Sci. 2010, 326, 181–189. [Google Scholar] [CrossRef]
Sweldens, W. The lifting scheme: A construction of second generation wavelets. SIAM J. Math. Anal. 1998, 29, 511–546. [Google Scholar] [CrossRef]
Johnson, R.W. MaxEnt power spectrum estimation using the Fourier transform for irregularly sampled data applied to a record of stellar luminosity. Astrophys. Space Sci. 2012, 338, 35–48. [Google Scholar] [CrossRef]

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Johnson, R.W. Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data. Axioms 2013, 2, 286-310. https://doi.org/10.3390/axioms2030286

AMA Style

Johnson RW. Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data. Axioms. 2013; 2(3):286-310. https://doi.org/10.3390/axioms2030286

Chicago/Turabian Style

Johnson, Robert W. 2013. "Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data" Axioms 2, no. 3: 286-310. https://doi.org/10.3390/axioms2030286

APA Style

Johnson, R. W. (2013). Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data. Axioms, 2(3), 286-310. https://doi.org/10.3390/axioms2030286

Article Menu

Some Notes on the Use of the Windowed Fourier Transform for Spectral Analysis of Discretely Sampled Data

Abstract

1. Introduction

2. Continuous and Discrete Fourier Transforms

3. Gabor and Morlet Transforms

4. Layered Window Transform

5. Irregular Sampling and Minimal Order

6. Window Comparison

7. Discussion

8. Conclusions

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI