Power Response and Modal Decay Estimation of Room Reflections from Spherical Microphone Array Measurements Using Eigenbeam Spatial Correlation Model

Bastine, Amy; Abhayapala, Thushara D.; Zhang, Jihui (Aimee)

doi:10.3390/app11167688


by Amy Bastine *,†, Thushara D. Abhayapala † and Jihui (Aimee) Zhang †

Audio & Acoustic Signal Processing Group, The Australian National University, Canberra 2601, Australia

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Appl. Sci. 2021, 11(16), 7688; https://doi.org/10.3390/app11167688

Submission received: 30 June 2021 / Revised: 9 August 2021 / Accepted: 19 August 2021 / Published: 21 August 2021

(This article belongs to the Special Issue Advances in Architectural Acoustics)


Featured Application: Room Mode Analysis.

Abstract

Modal decays and modal power distribution in acoustic environments are key factors in deciding the perceptual quality and performance accuracy of audio applications. This paper presents the application of the eigenbeam spatial correlation method in estimating the time-frequency-dependent directional reflection powers and modal decay times. The experimental results evaluate the application of the proposed technique for two rooms with distinct environments using their room impulse response (RIR) measurements recorded by a spherical microphone array. The paper discusses the classical concepts behind room mode distribution and the reasons behind their complex behavior in real environments. The time-frequency spectrum of room reflections, the dominant reflection locations, and the directional decay rates emulate a realistic response with respect to the theoretical expectations. The experimental observations prove that our model is a promising tool in characterizing early and late reflections, which will be beneficial in controlling the perceptual factors of room acoustics.

Keywords: reflection power; room response; directional decay rates; room modes; eigenbeam processing; spatial correlation

1. Introduction

In any enclosed acoustic space, the sound received by a listener is the superposition of the direct sound from the source and the reflected sounds from the surrounding surfaces. The numerous reflections termed reverberation cause persistence of sound even after the source ceases, until these reflected waves decay due to absorption by the surrounding surfaces. The intricate sound field generated by these reflected waves provides the sense of acoustic space to the perceived sound. However, severe reverberation can cause spectral distortions and reduce speech intelligibility. The study of reverberation is complicated since it is a product of many factors like sound frequency, room shape, room size, room geometry, source and receiver locations, source and receiver directivity, etc. A comprehensive understanding of the reflection sound field distribution, resonant frequencies, and modal decay rates is necessary to control audible artifacts and achieve desired sound perception quality in room acoustic applications.

Initially, objective parameters such as reverberation time, percentage articulation (PA) [1], decay rates [2], and statistical measures of room impulse responses (RIRs) [3] were the only measures of reverberation. However, later studies [4,5] found that these measures vary with the sound frequency and wall surface properties. This necessitated the frequency-dependent spatio-temporal analysis of sound fields for accurate characterization of room acoustics. The existing 3D room acoustic parameter estimation methods either depend on predictions based on computational acoustics or derive the parameters directly from real sound field measurements. Room acoustic analysis using prominent computational models such as ray/geometrical [6,7], wave/element [8], statistical energy [9], or synthetic RIR [10,11] methods is computationally complex and applicable only to limited frequency ranges. The lack of proper consideration of the source and environment factors, frequency-dependent wave behavior, and precise reflection methods reduces the estimation accuracy of these computational approaches, especially in highly reverberant environments [12]. Furthermore, the analysis of intermediate frequencies using these computational models is complicated because of the dominant diffraction effects and the influence of both wave and ray acoustic behaviors.

The characterization of real acoustic environments requires 3D acoustic scene analysis using spatial sound field measurements. This led to the development of several microphone array designs [13,14,15] and processing methods such as sound intensity mapping [16], plane-wave decomposition (PWD) and steered beamforming [17,18,19], sound intensity vector analysis [20], and multi-channel correlation models [21]. Gover et al. used PWD beamforming in [18] to estimate the angular distribution and anisotropy index of the spatial sound field from the RIRs recorded by a spherical microphone array. The recent works in [22,23,24] allow similar analysis in terms of isotropy measures and directional energy decays using Schroeder integration [25] and PWD of directional RIRs. However, these methods require a large number of RIR measurements for an accurate analysis of the room acoustic field. This problem was overcome with the introduction of higher-order spherical harmonic (eigenbeam)-based processing of spherical microphone array measurements [12,26,27,28], which provided higher spatial resolution for analysis than the previous methods. Subsequently, more robust techniques [29,30,31] were developed to achieve efficient parameterization of the spatial sound field using modal decomposition. In [32], the eigenbeam rotational invariance technique (EB-ESPRIT) was used to identify room modes and damping parameters from RIRs. In [33,34], Samarasinghe et al. used the spatial correlation of higher-order eigenbeams to estimate the directional characteristics of the reverberant field, and this approach achieved an accurate estimation of the direct-to-reverberant energy ratio and the dominant reflection directions.

The majority of the existing methods of directional characterization of room reflections derive the parameters from the aggregate sound field formed by the direct and reflected waves. Even though the direct path can be removed from the RIRs, the spatial resolution for directional analysis will be limited by the number of microphones. Moreover, a fine-scale separation of the spatial components of the direct path and reflected path is difficult without the knowledge of the source directivity. Additionally, the lack of incorporation of frequency-dependent surface reflectivities with distinct decay times can cause severe errors in the reflected sound field power distribution estimated by the existing methods [18,24,32]. Hence, a competent room characterization tool should integrate the frequency, time, and spatial dependencies in the formulation of the reflected sound field.

In this paper, we utilize the spatial correlation of higher-order eigenbeams to estimate the directional power response of room reflections by processing the RIR measurements. The proposed technique further facilitates room mode analysis and directional decay rate estimation. In comparison to the previous version of this method in [33,34], we model the reflection power as a function of time, frequency, and direction for comprehending the influence of frequency-dependent wall absorption properties of the room surfaces. This method allows the estimation of the directional features of reflections with higher spatial resolution independent of the direct sound component. The room mode features, directional decay rates and dominant reflection locations generated from the proposed tool can serve many applications like room response equalization, acoustic treatment design, architectural design simulations, room geometry inference, auralization of historic buildings, archaeoacoustics, and other machine hearing technologies.

The remainder of this paper is organized as follows: Section 2 discusses the formulation and implementation procedure of the eigenbeam spatial correlation model for estimating the reflection power response. Section 3 presents the experimental results including the time-frequency spectrum of reflection power, directional decay rates, and dominant reflection directions. Section 4 concludes the paper with a summary of the key findings and mentions the future research plans.

2. Reflection Power Estimation Using Eigenbeam Spatial Correlation Model

In this section, we present the formulation and synthesis of reflection power as a function of time, frequency, and space in the spherical harmonics domain.

2.1. Problem Formulation

Consider a convex room with a single sound source and a spherical microphone array of radius $R$ with $Q$ omnidirectional microphones centered at a location $O$, as shown in Figure 1. Let the spherical coordinate $y_{o} = (r_{o}, \theta_{o}, \phi_{o})$ denote the sound source location with respect to $O$. Similarly, the $q$th microphone element is located at $x_{q} = (R, \theta_{q}, \phi_{q})$ for $q \in \{1, 2, \dots, Q\}$. In this paper, all elevation angles lie in $[0, \pi]$, measured downwards from the Z-axis, and all azimuth angles lie in $[0, 2\pi)$, measured counterclockwise from the X-axis.

We treat the room as a linear time-invariant (LTI) acoustic transmission system whose dynamic behavior is represented by the RIRs derived from the spherical microphone array measurements. Let $H(x_{q}, y_{o}, t, k)$ be the room transfer function (RTF) between the source at $y_{o}$ and the microphone element at $x_{q}$, obtained from the short-time Fourier transform (STFT) of the RIR. Here, $t$ is the STFT temporal frame index and $k = 2\pi f / c$ is the wavenumber, with $f$ and $c$ representing the frequency and the speed of sound, respectively. Since the incident sound field at the receiver contains the direct sound and the reflections, we can decompose the RTF $H(x_{q}, y_{o}, t, k)$ as

$$H(x_{q}, y_{o}, t, k) = H_{d}(x_{q}, y_{o}, t, k) + H_{r}(x_{q}, y_{o}, t, k) \tag{1}$$

where $H_{d}(x_{q}, y_{o}, t, k)$ and $H_{r}(x_{q}, y_{o}, t, k)$ are the direct path and reflected path components, respectively.

Assuming that the distance between $y_{o}$ and $x_{q}$ is significantly larger than the aperture size of the microphone array, we can represent $H_{d}(x_{q}, y_{o}, t, k)$ and $H_{r}(x_{q}, y_{o}, t, k)$ as a composition of plane waves in the spatial domain as

$$H_{d}(x_{q}, y_{o}, t, k) = G_{D}(t, k \mid y_{o}) \, e^{i k \hat{y}_{o} \cdot x_{q}} \tag{2}$$

$$H_{r}(x_{q}, y_{o}, t, k) = \int_{\hat{y}} G_{R}(t, k, \hat{y} \mid y_{o}) \, e^{i k \hat{y} \cdot x_{q}} \, d\hat{y} \tag{3}$$

where $G_{D}(t, k \mid y_{o})$ is the direct path gain with respect to $O$, $\hat{y}_{o}$ is the unit vector along the source direction, $i = \sqrt{-1}$, $G_{R}(t, k, \hat{y} \mid y_{o})$ is the gain of the reflected plane wave arriving from the direction $\hat{y} = (1, \theta, \phi)$, and $\int_{\hat{y}} d\hat{y} = \int_{0}^{2\pi} \int_{0}^{\pi} \sin\theta \, d\theta \, d\phi$. Here, we have modeled the reflection gain $G_{R}$ as a non-isotropic directional distribution function that varies with frequency and time, to account for a real room with inhomogeneous surfaces that have frequency-dependent wall impedance and damping coefficients.
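As a small illustration of the plane-wave model in Equation (2) and of the angle convention stated above (elevation from the Z-axis, azimuth from the X-axis), the following sketch evaluates the direct path term for one microphone. The speed of sound, the 500 Hz example frequency, the unit gain, and the microphone placed exactly along the source direction are assumptions chosen for illustration and do not come from the measurements.

```python
import cmath
import math


def unit_vector(theta, phi):
    """Cartesian unit vector for elevation theta (from +Z) and azimuth phi (from +X)."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))


def direct_path_rtf(gain, k, source_dir, mic_pos):
    """Equation (2): gain * exp(i k y_hat_o . x_q) for a single microphone."""
    dot = sum(s * x for s, x in zip(source_dir, mic_pos))
    return gain * cmath.exp(1j * k * dot)


C_SOUND = 343.0                      # assumed speed of sound in m/s
R = 0.042                            # array radius in m
k = 2 * math.pi * 500.0 / C_SOUND    # wavenumber at an example frequency of 500 Hz

# Source direction in the XY plane (theta = 90 degrees) at an azimuth of 40 degrees.
y_hat_o = unit_vector(math.radians(90), math.radians(40))

# Hypothetical microphone lying exactly along the source direction.
x_q = tuple(R * component for component in y_hat_o)

h_d = direct_path_rtf(1.0, k, y_hat_o, x_q)
```

For this placement the dot product equals $R$, so the term has unit magnitude and a phase of $kR$ radians; a microphone on the opposite side of the array would see the conjugate phase.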

By examining $E\{H_{d} H_{d}^{*}\}$ based on (2), where $E\{\cdot\}$ represents the statistical expectation operator, we can express the direct path power as

$$P_{D}(t, k \mid y_{o}) = E\{|G_{D}(t, k \mid y_{o})|^{2}\} \tag{4}$$

where $|\cdot|$ denotes the absolute value. Similarly, by examining $E\{H_{r} H_{r}^{*}\}$ based on (3), we can write the power of the reflected sound field component incoming from the direction $\hat{y}$ as

$$P_{R}(t, k, \hat{y} \mid y_{o}) = E\{|G_{R}(t, k, \hat{y} \mid y_{o})|^{2}\}. \tag{5}$$

We aim to estimate the reflection power $P_{R}(t, k, \hat{y} \mid y_{o})$ from the RTFs $H(x_{q}, y_{o}, t, k)$ for all $q$, obtained using a spherical microphone array. Since $P_{R}(t, k, \hat{y} \mid y_{o})$ is a spherical function, we can simplify its estimation using the spherical harmonic decomposition [35] given by

$$P_{R}(t, k, \hat{y} \mid y_{o}) = \sum_{v = 0}^{\infty} \sum_{u = -v}^{v} \gamma_{v u}(t, k \mid y_{o}) \, Y_{v u}(\hat{y}) \tag{6}$$

where $\gamma_{v u}(t, k \mid y_{o})$ are the reflection power coefficients and $Y_{v u}(\cdot)$ is the spherical harmonic function of $v$th order and $u$th mode. Thus, we can calculate the reflection power for any incoming direction and time-frequency bin once we estimate the $\gamma_{v u}(t, k \mid y_{o})$ coefficients.

2.2. Methodology

To determine the $\gamma_{v u}(t, k \mid y_{o})$ coefficients, we utilize the spatial correlation of higher-order spherical harmonic (eigenbeam) coefficients of the incident sound field. The estimation of the reflection power response involves two main steps:

Step 1: Estimating spherical harmonic coefficients of the incident sound field

In this work, since we are interested in characterizing the room response independent of the source power spectrum, we assume a sound source emitting an impulse signal and treat $H(x_{q}, y_{o}, t, k)$ as the incident sound field on the spherical microphone array. To deduce the higher-order spherical harmonic coefficients of the incident sound field, we represent $H(x_{q}, y_{o}, t, k)$ using the spherical harmonic decomposition of the Helmholtz wave equation solution to the interior sound field problem [12] as

$$H(x_{q}, y_{o}, t, k) = \sum_{n = 0}^{\infty} \sum_{m = -n}^{n} \alpha_{n m}(t, k \mid y_{o}) \, b_{n}(k R) \, Y_{n m}(\hat{x}_{q}) \tag{7}$$

where $\alpha_{n m}(t, k \mid y_{o})$ are the modal coefficients of the spatial sound field, $\hat{x}_{q}$ is the unit vector in the direction of the $q$th microphone, and

$$b_{n}(k R) = \begin{cases} j_{n}(k R) & \text{for an open array} \\ j_{n}(k R) - \dfrac{j_{n}'(k R)}{h_{n}'(k R)} \, h_{n}(k R) & \text{for a rigid array} \end{cases} \tag{8}$$

with $j_{n}(\cdot)$ and $h_{n}(\cdot)$ denoting the spherical Bessel and Hankel functions of order $n$, respectively, and $(\cdot)'$ representing the first derivative operation. From (7), we can estimate the $\alpha_{n m}(t, k \mid y_{o})$ coefficients using the orthogonal property of spherical harmonics [36] as

$$\alpha_{n m}(t, k \mid y_{o}) = \frac{\sum_{q = 1}^{Q} H(x_{q}, y_{o}, t, k) \, Y_{n m}^{*}(\hat{x}_{q})}{b_{n}(k R)} \tag{9}$$

where $(\cdot)^{*}$ denotes the complex conjugation operation. Practically, we truncate $\alpha_{n m}(t, k \mid y_{o})$ to an order $N$ such that $N = \lceil k R \rceil$ and $Q \geq (N + 1)^{2}$, where $\lceil \cdot \rceil$ denotes the ceiling operation, to avoid errors due to spatial aliasing and the high-pass nature of higher-order Bessel functions [36].

Step 2: Estimating reflection gains using the spatial correlation model

We can now estimate $\gamma_{v u}(t, k \mid y_{o})$ from the $\alpha_{n m}(t, k \mid y_{o})$ coefficients using the spatial correlation matrix expression [33] given by

$$\underbrace{\begin{bmatrix} \Lambda_{0000} \\ \Lambda_{001-1} \\ \vdots \\ \Lambda_{00NN} \\ \Lambda_{1-100} \\ \vdots \\ \Lambda_{NNNN} \end{bmatrix}}_{\Lambda(t, k \mid y_{o})} = \underbrace{\begin{bmatrix} \delta_{0000} & d_{000000} & \dots & d_{0000VV} \\ \delta_{001-1} & d_{001-100} & \dots & d_{001-1VV} \\ \vdots & \vdots & \vdots & \vdots \\ \delta_{00NN} & d_{00NN00} & \dots & d_{00NNVV} \\ \delta_{1-100} & d_{1-10000} & \dots & d_{1-100VV} \\ \vdots & \vdots & \vdots & \vdots \\ \delta_{NNNN} & d_{NNNN00} & \dots & d_{NNNNVV} \end{bmatrix}}_{B(k, y_{o})} \times \underbrace{\begin{bmatrix} P_{D} \\ \gamma_{00} \\ \gamma_{1-1} \\ \vdots \\ \gamma_{V-V} \\ \vdots \\ \gamma_{VV} \end{bmatrix}}_{\Omega(t, k \mid y_{o})} \tag{10}$$

where

$$\Lambda_{n m n' m'} = E\{\alpha_{n m}(t, k \mid y_{o}) \, \alpha_{n' m'}^{*}(t, k \mid y_{o})\} \tag{11}$$

$$\delta_{n m n' m'} = 16 \pi^{2} \, i^{(n - n')} \, Y_{n m}^{*}(\hat{y}_{o}) \, Y_{n' m'}(\hat{y}_{o}) \tag{12}$$

$$d_{n m n' m' v u} = 16 \pi^{2} \, i^{(n - n')} \, (-1)^{m} \sqrt{\frac{(2 v + 1)(2 n + 1)(2 n' + 1)}{4 \pi}} \, W_{1} W_{2} \tag{13}$$

with

$$W_{1} = \begin{pmatrix} v & n & n' \\ 0 & 0 & 0 \end{pmatrix} \quad \text{and} \quad W_{2} = \begin{pmatrix} v & n & n' \\ u & -m & m' \end{pmatrix}$$

representing the Wigner 3j symbols [37].

The elements in $\Lambda(t, k \mid y_{o})$ and $B(k, y_{o})$ can be generated from the $\alpha_{n m}(t, k \mid y_{o})$ coefficients and the source direction information, respectively. Now, we can solve (10) to estimate $\Omega(t, k \mid y_{o})$ by

$$\hat{\Omega}(t, k \mid y_{o}) = B^{\dagger}(k, y_{o}) \, \Lambda(t, k \mid y_{o}) \tag{14}$$

where $\hat{[\cdot]}$ and $[\cdot]^{\dagger}$ indicate an estimated value and the pseudo-inverse operator, respectively. While solving (14), the order of $\gamma_{v u}(t, k \mid y_{o})$ in $\hat{\Omega}(t, k \mid y_{o})$ is truncated to $V \leq \lfloor \sqrt{(N + 1)^{4} - 1} \rfloor$, where $\lfloor \cdot \rfloor$ denotes the floor operation, to avoid an underdetermined system [34]. Once the $\gamma_{v u}(t, k \mid y_{o})$ coefficients are extracted from $\hat{\Omega}(t, k \mid y_{o})$, we can generate the reflection power using Equation (6) for different incoming directions $\hat{y}$ and time-frequency bins. From $P_{R}(t, k, \hat{y} \mid y_{o})$, we can estimate the total reflected power in any time-frequency bin as

$$P_{T}(t, k \mid y_{o}) = \int_{\hat{y}} P_{R}(t, k, \hat{y} \mid y_{o}) \, d\hat{y}. \tag{15}$$

Substituting (6) in (15) and using the symmetrical property of spherical harmonics [35], we obtain $P_{T}(t, k \mid y_{o}) = \gamma_{00}(t, k \mid y_{o})$. We can now use $P_{R}(t, k, \hat{y} \mid y_{o})$ and $P_{T}(t, k \mid y_{o})$ to analyze the reflection power variations with time, frequency, and direction.
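The linear solve in Equation (14) can be sketched with NumPy as follows. This is only a shape-level illustration: the matrix `B` is filled with random complex numbers as a stand-in for the entries defined by Equations (12) and (13), the orders `N` and `V` are arbitrary example values, and the noiseless `Lam` vector is synthesized from a known `omega_true` so that the recovery can be verified.

```python
import numpy as np

rng = np.random.default_rng(0)

N, V = 2, 3                      # illustrative orders, not values from the paper
n_rows = (N + 1) ** 4            # one row per (n, m, n', m') combination
n_cols = 1 + (V + 1) ** 2        # P_D plus all gamma_vu up to order V

# Stand-in for B(k, y_o): random complex entries instead of Equations (12) and (13).
B = rng.standard_normal((n_rows, n_cols)) + 1j * rng.standard_normal((n_rows, n_cols))

# Known unknown-vector [P_D, gamma_00, ..., gamma_VV] used to synthesize Lambda.
omega_true = rng.standard_normal(n_cols) + 1j * rng.standard_normal(n_cols)
Lam = B @ omega_true             # noiseless Lambda(t, k | y_o)

# Equation (14): least-squares estimate through the Moore-Penrose pseudo-inverse.
omega_hat = np.linalg.pinv(B) @ Lam
p_d_hat = omega_hat[0]           # estimated direct path power
gamma_hat = omega_hat[1:]        # estimated reflection power coefficients
```

With measured data, `Lam` would instead be built from time-averaged products of the modal coefficients as in Equation (11), and the solve would be repeated for every time-frequency bin. `np.linalg.lstsq(B, Lam, rcond=None)` returns the same least-squares solution without forming the pseudo-inverse explicitly.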

3. Experimental Analysis

In this section, we present the analysis of the reflection power response of two rooms from their RIR datasets recorded using an em32 Eigenmike [38], which is a $Q = 32$ element rigid spherical microphone array of radius $R = 0.042$ m. Both RIR datasets were measured using a source signal generated from a directional loudspeaker. The first RIR dataset, available from the work in [39], is for a small audio laboratory room of size $3.54 \times 4.06 \times 2.70$ m, hereafter referred to as Room-1. The second RIR dataset, from [40], pertains to a larger classroom of size $6.5 \times 8.3 \times 2.9$ m, hereafter referred to as Room-2. According to these datasets, the reverberation times ($T_{60}$) of Room-1 and Room-2 are 0.329 s and 1.12 s, respectively. From the datasets, we have selected the RIRs for different source positions in the XY plane, i.e., $\theta_{o} = 90^{\circ}$ at different $\phi_{o}$ angles, at 1 m distance from the microphone array center. The direct path component from the source arrives at the receiver around 0.0026 s and 0.0028 s for Room-1 and Room-2, respectively.

From the selected 32-channel RIRs, we obtain $H(x_{q}, y_{o}, t, k)$ using the STFT operation with a 1024-sample Hanning window with 50% overlap, a 2048-point fast Fourier transform (FFT), and a 48 kHz sampling frequency. We then follow the process described in Section 2.2 to generate $P_{R}(t, k, \hat{y} \mid y_{o})$ for 500 uniformly distributed $\hat{y}$ directions derived from spiral-based sampling [41], for all $(t, k)$ bins in the frequency band of 20 to 1500 Hz. These 500 spiral-sampled directions provide sufficient spatial resolution to capture the sound reflectivity variations across the room surfaces at a reasonable computation cost. Finally, we estimate $P_{T}(t, k \mid y_{o})$ for analyzing the time-frequency spectrum of the reflection power of the two rooms. In the temporal responses of the following sections, 0 s on the time index indicates the moment of sound event occurrence. However, the reflection power response is calculated only from 0.01 s, which is the center of the first STFT frame. This frame size was selected after considering a reasonable time-frequency resolution for proper spectral and temporal analysis of reflections in both rooms.
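The time-frequency grid implied by these STFT settings can be worked out directly. The sketch below assumes that the first frame starts at sample 0 without padding, which is consistent with the stated 0.01 s center of the first frame; the variable names are illustrative.

```python
FS = 48_000        # sampling frequency in Hz
WIN = 1024         # window length in samples
NFFT = 2048        # FFT length in samples
HOP = WIN // 2     # hop size for 50% overlap

bin_spacing_hz = FS / NFFT              # spacing between FFT bins
first_frame_center_s = (WIN / 2) / FS   # center of the first frame (assumes no padding)
frame_step_s = HOP / FS                 # time between consecutive frames

# FFT bins that fall inside the analysed 20 Hz to 1500 Hz band.
band_bins = [b for b in range(NFFT // 2 + 1) if 20.0 <= b * bin_spacing_hz <= 1500.0]
band_freqs_hz = [b * bin_spacing_hz for b in band_bins]
```

This gives a bin spacing of 23.4375 Hz, a first frame center of about 0.0107 s, and 64 bins in the analysed band. Frequencies quoted in the later sections, such as 164 Hz and 211 Hz, are consistent with bins 7 and 9 of this grid (164.06 Hz and 210.94 Hz).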

3.1. Theoretical Background

Here we discuss important theoretical concepts of room acoustics and room response characteristics according to prevalent literature [5,42,43,44] to validate the experimental analysis.

3.1.1. Modal Decay

The reverberation field inside a room leads to the persistence of sound even after the source ceases. The duration of this sound persistence, called the reverberation time $R_{T}$ [5], is the most commonly used measure of room acoustic quality. In practical applications, acousticians calculate $R_{T}$ as the 60 dB decay time after source cessation, which is referred to as $T_{60}$ [43]. Typically, such estimations assume diffuse sound field conditions and average wall absorption, and calculate $R_{T}$ as a single value to characterize the room acoustics. However, in reality, the wall absorption factors change with frequency [5,44], and hence accurate $R_{T}$ estimates should be frequency-dependent. Furthermore, the room architecture, variations in surface materials, and source-receiver properties affect the reflection path length [44] and magnitude, which, in turn, influence the decay of different frequency components. Therefore, decay times should be a function of frequency and direction. Since an analytical solution for the decay rates is complex, we can derive them numerically through reflection sound field analysis.

3.1.2. Room Modes

Sound propagation in any acoustic enclosure involves different wave phenomena such as reflection, scattering, diffraction, and interference. Such a complex interaction of innumerable waves is characterized through the acoustic wave equation [5]. The frequencies corresponding to the eigenvalues of the acoustic wave equation can form standing waves inside the room, creating a resonant behavior that leads to a non-uniform distribution of reflection power and extended reverberation [5,43,44]. These frequencies are often referred to as room modes or eigenfrequencies.

According to [5,43], at low frequency ranges, the number of resonant frequencies will be small, and they can be excited individually. Hence, the room response will be quite irregular and anisotropic for these frequencies. When we move towards the higher frequencies, the eigenvalues are densely spaced, so they cannot be independently excited. Even though the higher frequencies contribute to the reflected sound pressure, the lack of independent resonance combined with increased scattering makes them relatively uniform and less prominent compared to the lower frequencies. Hence, in a typical room response, we expect high reflection powers with some resonant peaks for low (<300 Hz) to mid (300 to 600 Hz) audible frequencies and decaying magnitude towards the high (>600 Hz) frequencies. The cross-over frequency [5,43] that separates the resonant low-frequency response from the high-frequency diffused reflections is termed the Schroeder frequency ($\nu_{S}$). It can be calculated using the empirical formula

$$\nu_{S} \approx 2000 \sqrt{\frac{T_{60}}{\Delta}} \tag{16}$$

where $\Delta$ is the room volume. From the dimensions and $T_{60}$ of the test rooms, (16) gives $\nu_{S} \approx 184$ Hz and $\nu_{S} \approx 169$ Hz for Room-1 and Room-2, respectively.
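The two Schroeder frequencies can be reproduced from the room dimensions and reverberation times reported at the start of Section 3. The sketch below is a direct evaluation of Equation (16); the function name is an illustrative choice.

```python
import math


def schroeder_frequency(t60_s, dims_m):
    """Equation (16): 2000 * sqrt(T60 / volume), with the volume in cubic metres."""
    volume = dims_m[0] * dims_m[1] * dims_m[2]
    return 2000.0 * math.sqrt(t60_s / volume)


room1_hz = schroeder_frequency(0.329, (3.54, 4.06, 2.70))   # Room-1
room2_hz = schroeder_frequency(1.12, (6.5, 8.3, 2.9))       # Room-2
```

Rounded to the nearest hertz, this yields the 184 Hz and 169 Hz quoted above. The larger room has the lower cross-over frequency despite its longer reverberation time, because its volume is roughly four times that of Room-1.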

For a rectangular enclosure, we can calculate the eigenfrequencies of the wave equation [5,42,43,44] as

$$\nu_{n_{x} n_{y} n_{z}} = \frac{c}{2} \sqrt{\left(\frac{n_{x}}{l_{x}}\right)^{2} + \left(\frac{n_{y}}{l_{y}}\right)^{2} + \left(\frac{n_{z}}{l_{z}}\right)^{2}} \tag{17}$$

where $\{n_{x}, n_{y}, n_{z}\}$ are non-negative integers and $l_{x} \times l_{y} \times l_{z}$ are the room dimensions. When two of $\{n_{x}, n_{y}, n_{z}\}$ equal zero, the solution of (17) gives the axial modes, which are considered to be stronger and to have lower decay rates than the other modes [42]. The tangential modes are obtained with two non-zero integers in $\{n_{x}, n_{y}, n_{z}\}$, and the oblique modes with all three integers non-zero.
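Equation (17) and the axial, tangential, and oblique classification can be sketched as below. The speed of sound of 343 m/s, the index cap `max_index`, the 200 Hz cut-off, and the function names are assumptions for illustration; the paper does not state the values it used to produce its mode distribution.

```python
import math
from itertools import product


def mode_frequency(nx, ny, nz, dims_m, c=343.0):
    """Equation (17) for a rectangular room with dimensions (lx, ly, lz) in metres."""
    lx, ly, lz = dims_m
    return 0.5 * c * math.sqrt((nx / lx) ** 2 + (ny / ly) ** 2 + (nz / lz) ** 2)


def mode_type(nx, ny, nz):
    """Classify a mode by its number of non-zero indices."""
    nonzero = sum(1 for n in (nx, ny, nz) if n != 0)
    return {1: "axial", 2: "tangential", 3: "oblique"}[nonzero]


def modes_below(f_max_hz, dims_m, max_index=10, c=343.0):
    """All modes up to f_max_hz, sorted by frequency."""
    modes = []
    for nx, ny, nz in product(range(max_index + 1), repeat=3):
        if (nx, ny, nz) == (0, 0, 0):
            continue  # the all-zero index set is not a room mode
        f = mode_frequency(nx, ny, nz, dims_m, c)
        if f <= f_max_hz:
            modes.append((f, (nx, ny, nz), mode_type(nx, ny, nz)))
    return sorted(modes)


room1_modes = modes_below(200.0, (3.54, 4.06, 2.70))   # Room-1 dimensions
lowest_freq, lowest_indices, lowest_type = room1_modes[0]
```

Under these assumptions the lowest Room-1 mode is the axial mode along the 4.06 m dimension, at about 42.2 Hz. Counting how many index combinations land on (or near) the same frequency gives the kind of histogram that a mode distribution plot summarizes.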

Figure 2 shows the room mode distribution in Room-1 and Room-2. The axial and tangential modes are calculated from (17), and the line heights in Figure 2 represent the number of resonances occurring at a frequency, since different $\{n_{x}, n_{y}, n_{z}\}$ combinations can result in the same $\nu_{n_{x} n_{y} n_{z}}$ frequency. The axial modes were given a higher nominal weight [44] while calculating this distribution due to their inherent prominence. Theoretically, an empty rectangular room of the same dimensions should replicate this trend in its frequency response. However, in a real room environment, the interference of normal modes of different decay rates [44] and the influence of inhomogeneous surfaces and source directivity violate the assumptions behind (17). Therefore, the real room response may vary from the predicted distribution.

For practical validation of the real acoustic phenomenon, we will use the power response generated using the proposed technique to identify the variations in the room mode distribution and modal decays compared to the above theoretical expectations.

3.2. Reflection Power Spectrum

Figure 3 and Figure 4 show the spectrogram of $P_{T}(t, k \mid y_{o})$ for different source positions in Room-1 and Room-2, respectively. For both rooms, the lower frequencies show some irregular peaks, and the reflection power of late reverberation clearly decays towards the higher frequencies, as predicted in Section 3.1.2. Additionally, the reflection power is maximum in the initial time instants and then decays with time for all frequencies due to surface absorption. It should be noted that the power decay trend varies with frequency due to the frequency-dependent wall impedance property [5]. Apart from some magnitude variations, the time-frequency spectrum trend is maintained for all source positions in both rooms. In the following sections, we will analyze the reflection power variations with frequency and time in more detail.

3.2.1. Frequency Response of Reflection Power

Figure 5 and Figure 6 show the frequency response of the time-averaged $P_{T}(t, k \mid y_{o})$ for different source positions in Room-1 and Room-2, respectively. These figures provide a clear view of the low-frequency peaks and the decay of power towards the higher frequencies. In Room-1, we can observe high powers around 164 Hz, 211 Hz, and 281 Hz before the onset of the power decay. Compared to Figure 2a, 164 Hz and 211 Hz are closer to the theoretical room modes, whereas many other predicted modes do not appear in the observed response in Figure 5. Similarly, some of the observed peaks in Room-2 around 164 Hz, 304 Hz, 328 Hz, and 492 Hz vary from the theoretical room mode estimates shown in Figure 2b. Additionally, the identification of $\nu_{S}$ is difficult from these responses, but it is clearly greater than the predicted $\nu_{S}$ values mentioned in Section 3.1.2. This discrepancy arises from the approximation in (16), which uses a frequency-averaged $T_{60}$, and from the influence of source directivity.

It should also be noted that there are no substantial variations in the frequency response of Room-2 for different source positions. Additionally, in Room-1, the differences are not as drastic as would be expected in a smaller room with significant reverberation. This is the result of formulating the reflection gains with respect to a common listening position ($O$) and separating the direct path component from the reflections. A direct analysis of the frequency response of the RIR would show significant differences with the change in source positions. Therefore, the proposed technique can be used to predict the room response behavior independent of the source positions.

3.2.2. Temporal Response of Reflection Power

Figure 7 and Figure 8 show the temporal response of $P_{T}(t, k \mid y_{o})$ at different frequencies for different source positions in Room-1 and Room-2, respectively. As evident from these figures, the reflection power decays due to surface absorption, and the decay trend is similar for all source positions. Since the damping constants of room surfaces are frequency-dependent, each frequency in Figure 7 and Figure 8 decays at a different rate. The lower frequencies, such as 70 Hz, 141 Hz, and 211 Hz, have slower decay rates than the other frequencies. As we move from 281 Hz to 633 Hz in Figure 7 and Figure 8, the decay rate stabilizes towards the higher frequencies. Furthermore, the decay of higher frequencies is nearly linear, whereas the lower frequencies (70 Hz to 211 Hz) exhibit a non-linear decay, especially in Room-2. This can be attributed to the highly non-uniform power distribution of the lower-frequency resonant modes, which leads to the concentration of sound absorption on certain surfaces [42,43]. In comparison, the high frequencies have a more diffuse distribution of reflection power, and hence the decay behavior is averaged over broader surface areas.

3.3. Decay Time

From the time-frequency spectrum of reflection power, we can estimate the decay time of each frequency to predict the strong room modes in a real room environment. Figure 9 and Figure 10 show the 60 dB decay time of each frequency estimated from the $P_{T}(t, k \mid y_{o})$ values for different source positions in Room-1 and Room-2, respectively. Even though the temporal response at each frequency in Figure 7 and Figure 8 seems relatively independent of the source positions, the decay times of the frequencies are slightly different for each source position according to Figure 9 and Figure 10. The average decay time, maximum decay time, and the corresponding frequency for each source position in both rooms are summarized in Table 1. We can say that the strongest modes in Room-1 are ≈140 Hz, ≈164 Hz, and ≈258 Hz, which are close to the peak power frequencies observed in Figure 5. However, in Room-2, the frequencies with maximum decay time are different from the frequencies with maximum reflection power. Hence, we need a deeper insight into the directional variations of power and decay time, which we will analyze in the next section.
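One common way to obtain a 60 dB decay time from a sampled power curve is to fit a straight line to the level in decibels and extrapolate to a 60 dB drop. The sketch below follows that approach as an assumption; the text does not state which fitting procedure was used, and the strongly non-linear low-frequency decays discussed in Section 3.2.2 would need a more careful treatment. The synthetic exponential decay and the frame timing are illustrative inputs only.

```python
import math


def decay_time_60db(times_s, power):
    """Fit 10*log10(power) against time by least squares and return the time for a 60 dB drop."""
    level_db = [10.0 * math.log10(p) for p in power]
    n = len(times_s)
    t_mean = sum(times_s) / n
    l_mean = sum(level_db) / n
    num = sum((t - t_mean) * (l - l_mean) for t, l in zip(times_s, level_db))
    den = sum((t - t_mean) ** 2 for t in times_s)
    slope_db_per_s = num / den           # negative for a decaying curve
    return -60.0 / slope_db_per_s


# Synthetic check: an ideal exponential decay with a 0.329 s decay time, sampled at
# the frame step of a 1024-sample window with 50% overlap at 48 kHz.
frame_step_s = 512 / 48_000
times = [0.01 + i * frame_step_s for i in range(20)]
power = [10.0 ** (-6.0 * t / 0.329) for t in times]

estimate_s = decay_time_60db(times, power)
```

Applied per frequency bin to the total reflection power, this would give a frequency-dependent decay time; applied per direction to the directional reflection power, it would give the directional decay times analyzed in the next section.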

3.4. Directional Decays and Dominant Reflection Directions

As we discussed in Section 3.1.1, decay times are a function of frequency and direction. Additionally, from Section 3.3, we found that the modes with higher decay times can be different from the modes with high reflection powers. Therefore, a more comprehensive analysis of the spatial spectrum of these reflections is necessary to identify the room surfaces causing the observed behaviors for the frequencies of interest. Figure 11a,b shows the directional decay times of Room-1 for $y_{o} = (1, 90^{\circ}, 40^{\circ})$ and $y_{o} = (1, 90^{\circ}, 120^{\circ})$, respectively, obtained from the 60 dB decay time of $P_{R}(t, k, \hat{y} \mid y_{o})$ in each $\hat{y}$ direction. Figure 12a,b shows the directions with high reflection powers in Room-1 for $y_{o} = (1, 90^{\circ}, 40^{\circ})$ and $y_{o} = (1, 90^{\circ}, 120^{\circ})$, respectively. The letters indicated near the locations of highest reflection powers in Figure 12 are coarsely mapped onto the real Room-1 environment in Figure 13. As evident from this figure, the locations around ‘A’, ‘C’, ‘D’, and ‘E’ have glass surfaces with high reflectivity, and hence the observed dominant power directions are valid. Furthermore, there is no evident pattern between the distributions in Figure 11 and Figure 12 for the given modal frequencies, and hence the feature predictions based on computational room acoustic models can be imprecise. In such cases, we can employ the proposed technique to reproduce authentic spatio-temporal room responses.

According to Figure 11 and Figure 12, the directions of high decay times and dominant reflections are different from each other for every frequency and source position. Even though the dominant reflection locations and the directional decay distribution have many common factors of influence, the reflection power in a direction strongly depends on the source directivity and source-to-wall distance, whereas the directional decay is mainly a function of the wall impedance coefficients and reflection paths. Hence, as seen in Figure 11, the directional decay differs between frequencies due to wall impedance variations, and between source positions due to changes in the reflection paths. In contrast, Figure 12 and Figure 13 show that the dominant reflection locations ‘A’, ‘B’, and ‘C’ have similar azimuth values and source-to-wall distances when the source is at $y_{o} = (1, 90^{\circ}, 40^{\circ})$. Likewise, the elevation values of the dominant reflection locations ‘D’ and ‘E’ in Figure 12b are nearly the same when the source is at $y_{o} = (1, 90^{\circ}, 120^{\circ})$. Additionally, the location of ‘F’ is in the close vicinity of the source position. Thus, the dominant reflection locations are principally determined by the source position and source directivity. For locations with the same source-to-wall distance, the dominant reflections will depend on the reflectivity of the surface materials.
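One way to pick out dominant reflection directions such as ‘A’ to ‘F’ from a sampled directional power map is sketched below in Python. It is only an illustration under stated assumptions: the power is given on a discrete grid of directions, $\theta$ is the polar angle measured from the z-axis, $\phi$ is the azimuth, and the greedy selection with a minimum angular separation (`min_sep_deg`) is a hypothetical choice that keeps neighbouring grid points of one lobe from being counted as separate reflections.

```python
import numpy as np

def angular_separation(theta1, phi1, theta2, phi2):
    """Great-circle angle (rad) between two directions.

    theta is the polar angle from the z-axis and phi the azimuth, in radians.
    """
    cos_gamma = (np.cos(theta1) * np.cos(theta2)
                 + np.sin(theta1) * np.sin(theta2) * np.cos(phi1 - phi2))
    return np.arccos(np.clip(cos_gamma, -1.0, 1.0))

def dominant_directions(power, theta, phi, k=3, min_sep_deg=30.0):
    """Return indices of up to k strongest, mutually separated directions.

    power, theta, phi: 1-D arrays of equal length, one entry per grid direction.
    min_sep_deg: assumed minimum spacing between reported directions.
    """
    min_sep = np.radians(min_sep_deg)
    order = np.argsort(power)[::-1]  # strongest direction first
    picked = []
    for i in order:
        far_enough = all(
            angular_separation(theta[i], phi[i], theta[j], phi[j]) >= min_sep
            for j in picked
        )
        if far_enough:
            picked.append(int(i))
        if len(picked) == k:
            break
    return picked
```

The returned indices can be mapped back to `(theta, phi)` pairs and compared with the room layout, in the same spirit as the mapping in Figure 13.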

Based on the above observations, the analysis of both directional decay and directional power is essential in characterizing the room reflections. This is particularly important when managing the features of early reflections and late reverberation to achieve the desired perceptual quality. Since the early reflections undergo very few boundary reflections [45], they are mainly defined by the source directivity and source-to-wall distance. Hence, we can use the dominant reflection directions to characterize the behavior of early reflections. The late reverberation undergoes multiple boundary reflections, which are integrated both spatially and temporally before reaching the receiver [45]. Since the late reverberation characteristics are primarily determined by the surface absorption and room shape [45,46], we can analyze the directional decay rates to study their behavior. We can further visualize the power spectrum of $P_{R} (t, k, \hat{y} | y_{o})$ across time for an extensive analysis of how the anisotropic spatial properties vary between the early reflections and the late reverberation.
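A simple way to compare the spatial distribution of early and late energy is sketched below in Python. The sketch assumes the directional power at one frequency is available as a (time, direction) array. The function name and the split time of 50 ms are hypothetical choices for illustration, not values given in the paper.

```python
import numpy as np

def early_late_directional_power(power_td, dt, split_s=0.05):
    """Split a (time, direction) power array into early and late parts.

    Returns two 1-D arrays giving the fraction of early and of late power
    arriving from each direction. split_s is an assumed boundary between
    early reflections and late reverberation, in seconds.
    """
    power_td = np.asarray(power_td, dtype=float)
    n_split = int(round(split_s / dt))
    if not 0 < n_split < power_td.shape[0]:
        raise ValueError("split time must fall inside the response")
    early = power_td[:n_split].sum(axis=0)  # power summed over early frames
    late = power_td[n_split:].sum(axis=0)   # power summed over late frames
    return early / early.sum(), late / late.sum()
```

Plotting the two normalized distributions over the direction grid would show how strongly the early part concentrates in a few directions compared with the late part.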

The precise knowledge of frequencies and surfaces contributing to the salient features of these reflections will be useful for defining the perceptual targets for modal control methods [47], optimizing room mode redistribution to improve acoustic quality [48], and devising active [49] and passive [50,51] room acoustic treatment methods.

4. Conclusions

In this paper, we presented a reflection power response estimation technique utilizing the spatial correlation of higher-order eigenbeams derived from spherical microphone array measurements. Formulating the reflection gain as a function of time, frequency, and direction helps capture a faithful room response for a realistic non-diffuse sound field. The experimental results validate the frequency response and temporal response of the reflection power against the theoretical expectations.

The proposed technique can estimate the resonant frequencies and modal decays caused by directional speakers and complex room environments. Furthermore, the directional decay times and dominant reflection directions facilitate the distinction of early and late reflection features. The insights from this room acoustic evaluation technique will be beneficial in controlling the acoustic quality when designing performance spaces. In particular, the findings from this method are expected to be more reliable than computational room models when deciding on acoustic treatment schemes compatible with the source directivity. Additionally, the room mode features identified by this method can be incorporated in spectral equalization algorithms to improve speech intelligibility and remove audible artifacts. The dominant reflection locations and directional decay spectrum can aid in the inference of room geometry and the calibration of room acoustics in virtual reality-based rendering of heritage sites.

The method can also be adapted for blind estimation of the discussed characteristics by directly processing microphone recordings of an arbitrary source signal, since we can separate the reflected power from the direct path power. Moreover, apart from spherical microphone arrays, any array design that can generate accurate spatial sound field coefficients can be integrated with the proposed algorithm. Future work will extend the method to multiple sources in noisy environments to support more real-world applications.

Author Contributions

Conceptualization, A.B., T.D.A. and J.Z.; Methodology, A.B., T.D.A. and J.Z.; Software, A.B.; Formal analysis, A.B.; Investigation, A.B.; Validation, A.B., T.D.A. and J.Z.; Resources, A.B., T.D.A. and J.Z.; Writing – original draft preparation, A.B.; Writing – review and editing, T.D.A. and J.Z.; Visualization, A.B.; Supervision, T.D.A. and J.Z.; Project administration, T.D.A.; Funding acquisition, T.D.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Australian Research Council (ARC) Discovery Project Grant No. DP180102375.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

RIR	Room impulse response
PWD	Plane-wave decomposition
EB-ESPRIT	Eigenbeam rotational invariance technique
LTI	Linear time invariant
RTF	Room transfer function
STFT	Short time Fourier transform
FFT	Fast Fourier transform

References

1. Morse, P.M.; Bolt, R.H. Sound waves in rooms. Rev. Mod. Phys. 1944, 16, 69.
2. Karjalainen, M.; Antsalo, P.; Makivirta, A.; Valimaki, V.; Peltonen, T. Estimation of Modal Decay Parameters from Noisy Response Measurements. J. Audio Eng. Soc. 2002, 50, 5290.
3. Stewart, R.; Sandler, M. Statistical measures of early reflections of room impulse responses. In Proceedings of the Conference Digital Audio Effects (DAFx-07), Bordeaux, France, 10–15 September 2007; pp. 59–62.
4. Long, M. Architectural Acoustics; Elsevier: Amsterdam, The Netherlands, 2005.
5. Kuttruff, H. Room Acoustics, 6th ed.; CRC Press: Boca Raton, FL, USA, 2017.
6. Allen, J.B.; Berkley, D.A. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 1979, 65, 943–950.
7. Lehmann, E.A.; Johansson, A.M. Prediction of energy decay in room impulse responses simulated with an image-source model. J. Acoust. Soc. Am. 2008, 124, 269–277.
8. Hamilton, B. Finite Difference and Finite Volume Methods for Wave-Based Modelling of Room Acoustics. 2016. Available online: https://www.researchgate.net/profile/Brian-Hamilton-5/publication/310902744_Finite_Difference_and_Finite_Volume_Methods_for_Wave-based_Modelling_of_Room_Acoustics/links/583acf2a08ae3a74b4a01683/Finite-Difference-and-Finite-Volume-Methods-for-Wave-based-Modelling-of-Room-Acoustics.pdf (accessed on 27 May 2021).
9. Lyon, R.H. Theory and Application of Statistical Energy Analysis; Elsevier: Amsterdam, The Netherlands, 2014.
10. Kim, H.; Hernaggi, L.; Jackson, P.J.; Hilton, A. Immersive Spatial Audio Reproduction for VR/AR Using Room Acoustic Modelling from 360° Images. In Proceedings of the Virtual Reality 3D User Interfaces, Osaka, Japan, 23–27 March 2019; pp. 120–126.
11. Remaggi, L.; Neidhardt, A.; Hilton, A.; Philip, J.B.J. Perceived quality and spatial impression of room reverberation in VR reproduction from measured images and acoustics. In Proceedings of the 23rd International Congress Acoustics, Aachen, Germany, 9–13 September 2019.
12. Samarasinghe, P. Modal based Solutions for the Acquisition and Rendering of Large Spatial Soundfields. Ph.D. Thesis, College of Engineering and Computer Science, Australian National University, Canberra, Australia, 2014.
13. Schroeder, M.R. Measurement of sound diffusion in reverberation chambers. J. Acoust. Soc. Am. 1959, 31, 1407–1414.
14. Broadhurst, A. An acoustic telescope for architectural acoustic measurements. Acta Acust. United Acust. 1980, 46, 299–310.
15. Yamasaki, Y.; Itow, T. Measurement of spatial information in sound fields by closely located four point microphone method. J. Acoust. Soc. Jpn. (E) 1989, 10, 101–110.
16. Merimaa, J.; Lokki, T.; Peltonen, T.; Karjalainen, M. Measurement, Analysis, and Visualization of Directional Room Responses. In Proceedings of the Audio Engineering Society Convention, New York, NY, USA, 21–24 September 2001.
17. Ward, D.B.; Abhayapala, T.D. Reproduction of a plane-wave sound field using an array of loudspeakers. IEEE Trans. Speech Audio Process. 2001, 9, 697–707.
18. Gover, B.N.; Ryan, J.G.; Stinson, M.R. Measurements of directional properties of reverberant sound fields in rooms using a spherical microphone array. J. Acoust. Soc. Am. 2004, 116, 2138–2148.
19. Park, M.; Rafaely, B. Sound-field analysis by plane-wave decomposition using spherical microphone array. J. Acoust. Soc. Am. 2005, 118, 3094–3103.
20. Tervo, S.; Korhonen, T.; Lokki, T. Estimation of reflections from impulse responses. Build. Acoust. 2011, 18, 159–173.
21. Hioka, Y.; Niwa, K.; Sakauchi, S.; Furuya, K.; Haneda, Y. Estimating Direct-to-Reverberant Energy Ratio Using D/R Spatial Correlation Matrix Model. IEEE Trans. Audio Speech Lang. Process. 2011, 19, 2374–2384.
22. Alary, B.; Massé, P.; Välimäki, V.; Noisternig, M. Assessing the anisotropic features of spatial impulse responses. In Proceedings of the EAA Spatial Audio Signal Processing Symposium, Paris, France, 6–7 September 2019; pp. 43–48.
23. Nolan, M.; Berzborn, M.; Fernandez-Grande, E. Isotropy in decaying reverberant sound fields. J. Acoust. Soc. Am. 2020, 148, 1077–1088.
24. Berzborn, M.; Nolan, M.; Fernandez-Grande, E.; Vorländer, M. On the directional properties of energy decay curves. In Proceedings of the 23rd International Congress Acoustics, Aachen, Germany, 9–13 September 2019.
25. Schroeder, M.R. New method of measuring reverberation time. J. Acoust. Soc. Am. 1965, 37, 1187–1188.
26. Abhayapala, T.D.; Ward, D.B. Theory and design of high order sound field microphones using spherical microphone array. In Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 13–17 May 2002; Volume 2, pp. 1949–1952.
27. Poletti, M.A. Three-dimensional surround sound systems based on spherical harmonics. J. Audio Eng. Soc. 2005, 53, 1004–1025.
28. Lovedee-Turner, M.; Murphy, D. Three-dimensional reflector localisation and room geometry estimation using a spherical microphone array. J. Acoust. Soc. Am. 2019, 146, 3339–3352.
29. Rafaely, B.; Balmages, I.; Eger, L. High-resolution plane-wave decomposition in an auditorium using a dual-radius scanning spherical microphone array. J. Acoust. Soc. Am. 2007, 122, 2661–2668.
30. Rafaely, B.; Peled, Y.; Agmon, M.; Khaykin, D.; Fisher, E. Spherical Microphone Array Beamforming. In Speech Processing in Modern Communication: Challenges and Perspectives; Cohen, I., Benesty, J., Gannot, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 281–305.
31. Sun, H.; Mabande, E.; Kowalczyk, K.; Kellermann, W. Joint DOA and TDOA estimation for 3D localization of reflective surfaces using eigenbeam MVDR and spherical microphone arrays. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 113–116.
32. Kereliuk, C.; Herman, W.; Wedelich, R.; Gillespie, D.J. Modal analysis of room impulse responses using subband ESPRIT. In Proceedings of the International Conference Digital Audio Effects (DAFx-18), Aveiro, Portugal, 4–8 September 2018.
33. Samarasinghe, P.N.; Abhayapala, T.D.; Chen, H. Estimating the Direct-to-Reverberant Energy Ratio Using a Spherical Harmonics-Based Spatial Correlation Model. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 25, 310–319.
34. Samarasinghe, P.N.; Abhayapala, T.D. Blind estimation of directional properties of room reverberation using a spherical microphone array. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 351–355.
35. Williams, E.G. Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography; Academic Press: Cambridge, MA, USA, 1999.
36. Rafaely, B. Fundamentals of Spherical Array Processing; Springer: Berlin/Heidelberg, Germany, 2015; Volume 8.
37. Olver, F.W.; Lozier, D.W.; Boisvert, R.F.; Clark, C.W. NIST Handbook of Mathematical Functions; Cambridge University Press: New York, NY, USA, 2010.
38. em32 Eigenmike® Microphone Array Release Notes (v17.0). Available online: https://mhacoustics.com/sites/default/files/ReleaseNotes.pdf (accessed on 6 April 2021).
39. Birnie, L.I.; Abhayapala, T.D.; Samarasinghe, P.N. Reflection Assisted Sound Source Localization through a Harmonic Domain MUSIC Framework. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 28, 279–293.
40. Olgun, O.; Hacihabiboglu, H. METU SPARG Eigenmike em32 Acoustic Impulse Response Dataset v0.1.0. Available online: http://doi.org/10.5281/zenodo.2635758 (accessed on 28 September 2020).
41. Semechko, A. Suite of Functions to Perform Uniform Sampling of a Sphere. 2020. Available online: https://www.mathworks.com/matlabcentral/fileexchange/37004-suite-of-functions-to-perform-uniform-sampling-of-a-sphere (accessed on 20 July 2020).
42. Cox, T.J.; D’Antonio, P.; Avis, M.R. Room sizing and optimization at low frequencies. J. Audio Eng. Soc. 2004, 52, 640–651.
43. Crocker, M.J. Handbook of Noise and Vibration Control; John Wiley & Sons: Hoboken, NJ, USA, 2007.
44. Everest, F.A. Master Handbook of Acoustics. J. Acoust. Soc. Am. 2001, 110, 1714–1715.
45. Schimmel, S.M.; Muller, M.F.; Dillier, N. A fast and accurate “shoebox” room acoustics simulator. In Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19–24 April 2009; pp. 241–244.
46. Izumi, Y.; Otani, M. Relation between Direction-of-Arrival distribution of reflected sounds in late reverberation and room characteristics: Geometrical acoustics investigation. Appl. Acoust. 2021, 176, 107805.
47. Fazenda, B.M.; Stephenson, M.; Goldberg, A. Perceptual thresholds for the effects of room modes as a function of modal decay. J. Acoust. Soc. Am. 2015, 137, 1088–1098.
48. Papadopoulos, C.I. Redistribution of the low frequency acoustic modes of a room: A finite element-based optimisation method. Appl. Acoust. 2001, 62, 1267–1285.
49. Fazenda, B.; Wankling, M.; Hargreaves, J.; Elmer, L.; Hirst, J. Subjective preference of modal control methods in listening rooms. J. Audio Eng. Soc. 2012, 60, 338–349.
50. Fuchs, H.; Lamprecht, J. Covered broadband absorbers improving functional acoustics in communication rooms. Appl. Acoust. 2013, 74, 18–27.
51. Cox, T.; d’Antonio, P. Acoustic Absorbers and Diffusers: Theory, Design and Application; CRC Press: Boca Raton, FL, USA, 2016.

Figure 1. Geometric illustration of the spherical microphone array centered at the coordinate origin and the single sound source located at $y_{o} = (r_{o}, \theta_{o}, \phi_{o})$.

Figure 2. Room mode distribution in (a) Room-1 (b) Room-2.

Figure 3. Reflection power response of Room-1 for different source positions.

Figure 4. Reflection power response of Room-2 for different source positions.

Figure 5. Reflection power with frequency for different source positions in Room-1.

Figure 6. Reflection power with frequency for different source positions in Room-2.

Figure 7. Reflection power with time for different frequencies and source positions in Room-1.

Figure 8. Reflection power with time for different frequencies and source positions in Room-2.

Figure 9. Decay time with frequency for different source positions in Room-1.

Figure 10. Decay time with frequency for different source positions in Room-2.

Figure 11. Directional decay times inside Room-1 for the peak frequencies when the source is located at (a) $y_{o} = (1, 90^{\circ}, 40^{\circ})$ and (b) $y_{o} = (1, 90^{\circ}, 120^{\circ})$.

Figure 12. Dominant reflection directions inside Room-1 for the peak frequencies when the source is located at (a) $y_{o} = (1, 90^{\circ}, 40^{\circ})$ and (b) $y_{o} = (1, 90^{\circ}, 120^{\circ})$.

Figure 13. Mapping of dominant reflection directions in Room-1. The letters A to C and D to F represent the directions of highest reflection powers with respect to Figure 12a,b, respectively.

Table 1. Maximum and average decay times in Room-1 and Room-2.

Room	Source Position	Maximum Decay Time (s)	Frequency with Maximum Decay Time (Hz)	Average Decay Time (s)
Room-1	$y_{o} = (1, 90^{\circ}, 40^{\circ})$	0.4114	140	0.2822
	$y_{o} = (1, 90^{\circ}, 120^{\circ})$	0.4540	164	0.2899
	$y_{o} = (1, 90^{\circ}, 200^{\circ})$	0.4823	140	0.2995
	$y_{o} = (1, 90^{\circ}, 280^{\circ})$	0.4398	258	0.2936
Room-2	$y_{o} = (1, 90^{\circ}, 0^{\circ})$	1.1349	492	0.8133
	$y_{o} = (1, 90^{\circ}, 90^{\circ})$	1.1066	328	0.8288
	$y_{o} = (1, 90^{\circ}, 180^{\circ})$	1.0498	328	0.8341
	$y_{o} = (1, 90^{\circ}, 270^{\circ})$	1.0640	586	0.8182

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bastine, A.; Abhayapala, T.D.; Zhang, J. Power Response and Modal Decay Estimation of Room Reflections from Spherical Microphone Array Measurements Using Eigenbeam Spatial Correlation Model. Appl. Sci. 2021, 11, 7688. https://doi.org/10.3390/app11167688
