Next Article in Journal
Fault Diagnosis for Abnormal Wear of Rolling Element Bearing Fusing Oil Debris Monitoring
Next Article in Special Issue
Toward the Definition of a Soundscape Ranking Index (SRI) in an Urban Park Using Machine Learning Techniques
Previous Article in Journal
Effects of JPEG Compression on Vision Transformer Image Classification for Encryption-then-Compression Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Self-Consistent Soundscape Ranking Index: The Case of an Urban Park

1
Department of Earth and Environmental Sciences (DISAT), University of Milano-Bicocca, Piazza della Scienza 1, 20126 Milano, Italy
2
Department of Physics, University of Milano-Bicocca, Piazza della Scienza 3, 20126 Milano, Italy
*
Authors to whom correspondence should be addressed.
Sensors 2023, 23(7), 3401; https://doi.org/10.3390/s23073401
Submission received: 28 February 2023 / Revised: 15 March 2023 / Accepted: 20 March 2023 / Published: 23 March 2023
(This article belongs to the Special Issue Acoustic Sensing and Monitoring in Urban and Natural Environments)

Abstract

:
We have performed a detailed analysis of the soundscape inside an urban park (located in the city of Milan) based on simultaneous sound recordings at 16 locations within the park. The sound sensors were deployed over a regular grid covering an area of about 22 hectares, surrounded by a variety of anthropophonic sources. The recordings span 3.5 h each over a period of four consecutive days. We aimed at determining a soundscape ranking index (SRI) evaluated at each site in the grid by introducing 4 unknown parameters. To this end, a careful aural survey from a single day was performed in order to identify the presence of 19 predefined sound categories within a minute, every 3 minutes of recording. It is found that all SRI values fluctuate considerably within the 70 time intervals considered. The corresponding histograms were used to define a dissimilarity function for each pair of sites. Dissimilarity was found to increase significantly with the inter-site distance in space. Optimal values of the 4 parameters were obtained by minimizing the standard deviation of the data, consistent with a fifth parameter describing the variation of dissimilarity with distance. As a result, we classify the sites into three main categories: “poor”, “medium” and “good” environmental sound quality. This study can be useful to assess the quality of a soundscape in general situations.

1. Introduction

The acoustic quality of a habitat is generally recognized as a primordial requirement of wildlife conservation [1,2]. It has become customary to evaluate the environmental status of particularly exposed areas, such as large populated urban zones, in addition to natural habitats. As a result of increasing urbanization all over the world, the associated, and in many cases excessive human technophonies, usually called “noise”, together with a lack of planning to permit for a smooth transition between a built environment and a natural one, can yield several deleterious effects on biodiversity. An exhaustive picture on these issues has been documented in recents works [3,4,5,6].
The use of passive acoustic monitoring has become the main tool to study a particular habitat [7,8,9,10]. Its diffusion among researchers and technicians has evolved considerably together with its increasing memory capability. An important step towards its utilization on large scales came with the development of specific spectral and level characteristics of sound through the analysis of ecoacoustic indices (for a review, see [11]). This, in addition, allows the retrieval of important information about the way different sounds are assembled across both space and time. This new characteristic of the habitat sound is commonly referred to as the soundscape [12,13], and it has been recognized as a distinct feature or, more importantly, as an ecological “signature” of a landscape [14,15,16].
Urban parks are sources of natural sounds, but in many cases, they are mixed with anthropogenic noise, such as vehicle traffic noise, from surrounding built areas [17,18]. Many research works have shown that Brazilian and Italian parks exceed noise level thresholds due to the presence of traffic noise as the main source of disturbance [19,20,21]. These results also give evidence that the traffic noise perception in urban parks can undermine the potential beneficial effects of natural sounds [22].
The ecoacoustics indices merge the complex acoustic dynamics of an ecosystem, consisting of vocalizing species, anthropogenic noise, and natural phenomena [23], into sets of time series, allowing us to get a picture of the environmental changes at a given habitat [24]. Furthermore, they provide new insights on species diversity and human impacts across a wide range of terrestrial [25,26,27,28,29] and aquatic environments [30,31]. The validation of the calculation of ecoacoustic indices is usually sound-truthed by specialized operators who classify hours of recordings according to predefined sound categories.
The identification of sound sources by operators is highly time consuming and requires specific knowledge of animal vocalizations, thus limiting its applicability to small datasets [32,33]. However, a cumulative approach based on a qualitative description of the recorded sound (e.g., many/few vocalizing birds, many/few birds species, high/low traffic noise, etc.) was shown to improve the validation process, showing a good matching between the dentification of acoustic categories from the audio recording and the ecoacoustic indices [34].
In this work, we address the question of quantifying the quality of a local soundscape from a set of audio recordings at each site of a grid covering the area of interest. The intervention of an expert is required, who is expected to recognize the different sound categories determining the soundscape. The main goal is to associate a soundscape ranking index (SRI) with each site of the grid, in order to classify the quality of the local environmental sound into one of the following three categories: “poor”, “medium” and “good”. This is achieved by introducing a sound dissimilarity function between sites, found to be consistent with a power-law behavior with the inter-site distance. We conclude with a discussion of possible future work along these lines.

2. Materials and Methods

2.1. Area of the Study

The Parco Nord (Northern Park) of Milan (Italy) covers an area of approximately 790 hectares and is located within a highly urbanized area (see Figure 1). About 45% of its surface is dedicated to natural green spots and vegetation, while the remaining parts are devoted to agricultural activities and infrastructures. The area of study is a tree-covered parcel of approximately 22 hectares enclosed by agricultural fields, lawns, paths and roads. It has a semi-natural structure, which is characterized by herbaceous layers of nemoral flora, shrub and arboreal layers and dead woods.
The zone is crossed by numerous paths, and it is mainly used for recreational activities. It contains an artificial lake of about 300 m 2 surface area, located at approximately 250 m from the edge of the bush. The main traffic noise sources are the A4 highway and the Padre Turoldo street, both located north of the park at around 100 m from the wooded parcel. There is also the presence of a small airport (Bresso airport) on the west side at around 500 m from the tree-line edge.

2.2. Audio Recorders

Mapping the environmental sounds over large areas may require important financial commitments that could be alleviated by the availability of very low-cost recorders (VLCRs). Thus, we used SMT security digital audio recorders with a 48 kHz sampling rate and equipped with a two-week lifetime powerbank. The main disadvantage of using very low-cost recorders is the possibility that they present dissimilar microphone sensitivities. This may cause a different frequency and level response to a sound exposure. The response of each recorder was evaluated in terms of the following:
  • Computation of the acoustic complexity index (ACI, see [27]) for a white noise source.
  • Behavior of each VLCR at different frequencies.
The ACI computes the relative variation of recorded amplitudes of adjacent temporal steps in each frequency bin, as determined by the FFT analysis in the frequency range 0–24 kHz. Thus, its computation provides overall evidence of possible anomalies in the frequency response. Anomalies were identified when the recorder response strongly departed from the average ACI value. Based on these results, we selected audio recorders characterized by a response within 3% of the average ACI value. The full description of the procedure is reported in [35].

2.3. Measurement Scheme

The 22 recorders were initially positioned on two regular grids (see Figure 1), the first one (northen part) covering an area of approximately 500 × 270 m 2 , and the second grid (southern part) covering an area of about 300 × 270 m 2 . The recordings were scheduled for the period of greatest singing activity of the avifauna [10] and repeated over four days, namely on 25–28 May 2015, from 06:30 a.m. (UTC +2) to 10:00 a.m., corresponding to a 3.5 h long recording session for each day. Unfortunately, six recorders, numbered as 7, 9, 14, 15, 16 and 21 and indicated by the yellow spots in Figure 1, did not work properly, and thus, the audio files analyzed in this study reduced to only 16 sites.

2.4. Aural Survey

In order to quantify the biophonies [36], anthrophonies and geophonies [37], an aural survey was performed on one of the four days of recordings, of 3.5 h total duration for each of the 16 sites. Specifically, a single expert listened carefully to the 16 recordings according to the following scheme. For each site, the daily record duration of 210 min was grouped into 70 intervals of 3 min each, of which only the first 1-min interval was listened, while the following 2 min of recording were skipped. This operation spanned several weeks of careful work to accurately examine the whole set of recordings. The expert focused on quantifying the biophonic activity (mainly avian vocalizations) and technophonic sources (mainly traffic noise, including trains and airplanes, and other sources such as park maintenance activities), according to the scheme discussed in [38] and reported in detail in Table 1.
For each single minute of listening, bird vocalizations and non-biophonic sources were searched. The former was subdivided into three main categories according to the quantity of birds (many, few, none), number of species (>1, 1, none) and sound durations, (100, 75, 50, 25, 0)%. The third column displays the index n of the parameter P ( n ) ( n = 1 , 5 ), associated with each subcategory ( i = 1 , 19 ). The fourth column is aimed at providing a more specific characterization of the way the subcategory could be identified. Regarding the bird sound durations, the latter is expressed in terms of the percentage of bird singing activity identified within the considered minute. The effective time span in seconds is displayed in the fourth column.
The non-biophonic contributions are split into road traffic, trains, airplanes and other technophonic noise sources. The former is subdivided into two categories, level and type of traffic, while the last three are associated with the fifth parameter if they are present and with the third one if absent (see below). Finally, geophonies such as rain and wind were not considered due to their negligible contribution to the soundscape during the measurement campaign.

2.5. The Soundscape Ranking Index

We are interested in quantifying the quality of the local environment sound by means of a soundscape ranking index, SRI, as proposed recently [38]. In the following, we briefly discuss how to evaluate the SRI, aimed at describing the local soundscape, at a given site j in the network, in an average sense. The value of SRI ( j , t ) depends on the time interval t, in our case ( t = 1 , 70 ), listened during the aural survey at site j. For each time interval t and site j, we determine the event function, N ( i , j , t ) for the ith sound category ( i = 1 , 19 ), described in Table 1. The event function can be either N = 1 or N = 0 , depending on whether the event i is present or not. The SRI can then be obtained as the sum
SRI ( j , t ) = i = 1 19 N ( i , j , t ) w ( i ) ,
where the weights, w ( i ) = P ( n i ) c ( n i ) , with n i = ( 1 , 2 , 3 , 1 , 2 , 3 , 1 , 1 , 1 , 1 , 1 , 2 , 4 , 5 , 2 , 4 , 5 , 5 , 5 ) (cf. Table 1), while the additional factor, c ( n i ) = ( 1.0 , 0.75 , 0.50 , 0.25 , 0.0 ) for i = ( 7 , 8 , 9 , 10 , 11 ) , respectively, and c ( n i ) = 1 , otherwise. For instance, w ( 6 ) = P ( 3 ) c ( 6 ) = P ( 3 ) , while w ( 7 ) = P ( 1 ) c ( 7 ) = 0 , and w ( 8 ) = P ( 1 ) 0.25 , etc.
The choice of the values for the parameter P ( n ) is rather arbitrary of course, but we can stick to the following simple considerations in order to make up a representative and useful picture of the soundscape. We follow our previous work [38] and assume that P ( n ) > 0 if the sound is associated with a natural source, while we take P ( n ) < 0 if it is of anthropogenic origin. Given the fact that we consider five different situations, we chose the values reported in Table 2 to start with.
Now, we have all the ingredients to calculate the SRI ( j , t ) , using Equation (1), once the event function, N ( i , j , t ) , is known for all the subcategories i, at site j and time interval t. SRI is expected to fluctuate as a function of t, reflecting the ever changing environmental conditions of the varying soundscape. However, we may average the index over time in order to get a mean value, 〈SRI ( j ) 〉, at site j, which should provide us with a quantitative element to estimate the quality of the local environmental sound. By quality, we mean that 〈SRI ( j ) 〉 should be large for a natural soundscape and small for a poor one affected by strong anthropogenic perturbations. At this point, and in order to fix the ideas, we suggest a simple classification of the mean quality index, 〈SRI〉, as displayed in Table 3.

2.6. Optimization Procedure for P ( n )

The values of P ( n ) reported in Table 2 can be seen as our starting guess of the unknown parameters and are therefore arbitrary. As we discuss in Section 3, however, they appear to build a quite robust starting set from which one can search for new “optimized” values. In addition, we find that the optimized set is closer to the prescription in Table 3 than the original set. We show that this different behavior is a result of the self-consistent optimization method we have developed for obtaining the parameters P ( n ) .
The basic quantity in our approach is the probability distribution function, H j ( SRI ) , of the set of SRI values obtained at a given site j, at different time intervals t, using Equation (1). Based on the distribution functions obtained for all active sites, we can define a quantity representing how “dissimilar” two distributions are. The dissimilarity between say, sites i and j, denoted as D i , j , is here defined by the following relation:
D i , j = 1 1 A i A j d x H i ( x ) H j ( x ) , with A i 2 = d x H i 2 ( x ) ,
where the normalizing factor A i ensures that D i , i = 0 . The integral form used in Equation (2) can be formally seen as the internal product of two vectors, H i ( m ) and H j ( m ) , where m Z are the coordinates in a high-dimensional space. Specifically, if we define the histograms H i ( x ) on a set of discrete coordinates, x = m b , where b is the bin size used to build the histograms, we may regard H i ( m ) as the mth component of the vector H ¯ i . This internal product is reminiscent of a similar form, typically used in financial studies, to define a distance between time series in terms of their cross-correlations [39]. Intuitively, the larger D i , j is, the larger the soundscape dissimilarity between sites i and j. In other words, Equation (2) represents a measure that can be used to estimate a soundscape distance between sites. We can loosely say that two sites displaying very different SRI distribution functions are very distant in soundscape space.
Actually, sites i and j are located at well defined positions in space; in particular, they are at a fixed spatial distance, R i , j , within the network (cf. Figure 1). Therefore, the question arises of whether both distances, D i , j and R i , j , are somehow related to each other. Intuitively, one would expect that dissimilarity increases with distance. In other words, we are interested in finding how the local soundscape changes as a function of spatial distance R within the network. To do this, we assume, as our working hypothesis, a simple relation of the form
D i , j a ( R i , j [ m ] ) α ,
where the distances R i , j are expressed in meters, and the exponent α > 0 needs to be determined empirically. We expect that Equation (3) should be valid at least in an average sense. Note that it can be written as D ( R / R 0 ) α , where R 0 is an effective length scale associated with the network soundscape. This in keeping with the fact that D is actually dimensionless. Numerically, the constants a and R 0 are related to each other by, R 0 = a 1 / α . Note that R 0 is given in meters (see also Equation (4)). To simplify the notation, in what follows, we simply write R i , j α , meaning ( R i , j [ m ] ) α .
As a matter of fact, α becomes the sixth parameter in our approach. Notice, however, that P ( 3 ) = 0 in all cases, so we are effectively dealing with only five parameters: P ( 1 ) , P ( 2 ) , P ( 4 ) , P ( 5 ) and α . The five parameters are obtained by a least-square fit that minimizes the total deviation of the data from the fit, i.e.,
Σ 2 = 1 120 i = 1 , j > i 16 D i , j a R i , j α 2 , with a = D i , j R i , j α / R i , j 2 α R 0 α .
The constant a is obtained by requiring Σ / a = 0 , and as a result, it is a function of all the five unknown parameters. We note that in the expression for a, the symbols represent averages over the 120 distinct site pairs ( i , j ) in the network. The effective soundscape length R 0 now becomes
R 0 = [ R i , j 2 α / D i , j R i , j α ] 1 / α .

3. Results

The results presented in this section refer to the audio files recorded on 25 May 2015, from 06:30 a.m. to 10:00 a.m., and based on the aural survey discussed in Section 2.4.

3.1. Initial P ( N ) Values

We start the analysis of the soundscape using the initial values of the parameters reported in Table 2. As discussed in Section 2.5, we evaluate the SRI values using Equation (1) at the 70 time intervals for each of the 16 active sites. The corresponding histograms vs. SRI are reported in Figure 2.
Once the distribution functions, H(SRI), for each active site have been obtained (Figure 2), we can evaluate, using Equation (2), the dissimilarity distances between pairs of sites. We note that, so far, the exponent α has not been required. Indeed, it can be read off from the plot of D i , j values versus the corresponding inter-site distances R i , j , as shown in Figure 3. We notice in the figure a pronounced scattering of the empirical data from the behavior expected from Equation (3). Despite this fact, we can still recognize an increasing trend of D vs. R, as visualized by the obtained least-square fit representing the expected power-law featured in Equation (3). From the latter, we find α 1 / 3 , which is rather small, but significant enough to conclude that the assumed power-law (Equation (3)) may be acceptable.
The associated distribution function of dissimilarity is shown in Figure 4, together with a fitting function of a simple analytical form, which should be useful for making a quantitative comparison with the case of optimized parameters discussed below.
Finally, we plot in Figure 5 the mean SRI values at each site, obtained from Equation (1) and corresponding to the parameter set: P ( n ) = ( 2 , 1 , 0 , 1 , 2 ) and α = 0.33 . We note that all mean values turn out to be positive, 〈SRI〉 > 0 , and they are not quite consistent with the scenario expected from Table 3. This observation actually motivated us to search for a way to improve on this quality scenario results. The idea is then to search for a “better” set of parameters that can bring us beyond our initial guess, thus eliminating the arbitrariness endowed in our parameter values.

3.2. Optimized P ( N ) Values

In order to improve on the previous results, we let all the five parameters, P ( 1 ) , P ( 2 ) , P ( 4 ) , P ( 5 ) and α , vary independently and search for the “best” set that minimizes the function Σ in Equation (4). For each set of new parameters, we need to evaluate the SRI given by Equation (1), followed by the calculation of the dissimilarity distances, Equation (2), once the corresponding histograms of the SRI have been obtained. To speed up the search, we start varying each parameter value within a given range, subdivided into few equidistant smaller intervals. Once a minimum has been found, we choose new ranges centered around the putative minimum parameter values. In this way, we can refine the search more efficiently and converge fast towards a solution. Indeed, few such iterations are needed to get a stable result.
It is convenient to check the convergence of the method by first keeping α fixed and letting the P ( n ) values adjust themselves so as to minimize Σ . We tried both, α = 0.33 and α = 1 , and from each of the found sets, we minimized Σ with respect to α . We obtained similar values for α in both cases, yielding α 0.5 ± 0.2 . We then performed a full search with all five parameters, finding indeed α 0.5 ± 0.05 , suggesting that we may take α = 1 / 2 as our best value for this unknown exponent. Finally, we set α = 1 / 2 and optimized the search for our final set of parameters, yielding
P ( 1 ) = 2.290 , P ( 2 ) = 0.766 , P ( 4 ) = 1.528 , P ( 5 ) = 2.262 ,
with P ( 3 ) = 0 , and a final error, Σ = 0.139, for dissimilarity versus inter-site distance, as shown in Figure 6.
The distribution function of the newly obtained dissimilarity distances, D i , j , is displayed in Figure 7. Now, the mean dissimilarity, D 0.375 , becomes a bit larger than for the non-optimized parameters (cf. Figure 4), while the new distribution function becomes flatter and extends to larger values of D.
The distribution functions of the SRI for each site are displayed in Figure 8, and their mean values are shown in Figure 9.

4. Discussion and Conclusions

It is convenient to summarize the main results of our work as shown in Table 4. Let us start with the exponent α , expressing the way that dissimilarity decays with inter-site distance (Equation (3)). Quantitatively, the values indicate that for the initial set of parameters, dissimilarity decays more slowly (1/3) than for the optimized one (1/2), suggesting a more persistent behavior of the soundscape. The optimized values for the parameters P ( n ) display the same trend as the initial ones, differing by at most a 50 % as for P ( 4 ) . Despite this, crucial differences develop when we analyze the remaining quantities.
The SRI behaves quite differently. Indeed, the optimized set yields a much smaller mean SRI value for the whole network, about 23 %, than for the original non-optimized set of parameters. This again is consistent with the behavior of dissimilarity, indicating a lower soundscape quality than the one predicted by the original set. The dispersion of the SRI at each site, σ ( i ) , increases for the optimized parameter set, consistent with a more extended variability of dissimilarity (cf. Figure 4 and Figure 7), also reflected in their mean values, D . The effective length scale R 0 also manifests this difference as it is larger for the original set, suggesting that dissimilarity is smaller than for the case of optimized parameters. Finally, as expected, the optimized set yields a smaller dispersion of the data from the assumed power-law decay of dissimilarity as given by its smaller value of Σ .
We may conclude that in our case, the initial guess for the parameter values overestimate the sound quality in the urban habitat we studied. This conclusion is based on the smaller dispersion of the data obtained from the assumed empirical relation between dissimilarity and intersite distance, as shown in Equation (3). We may expect that in other habitats, the self-consistent parameters would be different from the initially chosen values in either way, i.e., the soundscape quality may be either smaller or larger than initially predicted. In both cases, the present method suggests which of the scenarios is more likely to be correct, as it provides us with a recipe to check for the self-consistency of the empirical data. From a more fundamental perspective, the problem of deriving the basic relation in Equation (3) should be considered for future studies.
In view of these results, we may suggest that sound dissimilarity between sites decays with their inter-site distance in space with an exponent α = 1 / 2 , yielding
D i , j [ R i , j / R 0 ] 1 / 2 ,
where the effective length scale in the network is given by,
R 0 = R i , j 2 / D i , j R i , j 2 .
These relations can be tested in other cases, and if confirmed, they may represent a useful technique to estimate the internal correlations of the soundscape in the area of interest. Indeed, the set of parameters P ( n ) , obtained in a self-consistent fashion with the assumed decay of dissimilarity, allows us to classify the sites according to our scheme discussed in Table 3.
According to the results in Figure 9, we find it quite remarkable that the local SRI mean values are consistent with our simple quality rules displayed in Table 3. We have therefore classified them according to these rules and plotted the corresponding set of sites with different colors, as shown in Figure 10. The good quality sites are in green color, the medium quality ones in yellow and the poor quality sites in red. There is a “borderline” site, number 4, which has a mean SRI close to 0. We have therefore indicated it as a full yellow circle with a red contour.
Environmental sound represents an important phenomenon that is fully integrated in the ecological system as a whole [40]. This feature is strongly interconnected with the ecological processes and patterns driven by biotic and abiotic relationships [40,41]. For this reason, environmental sounds may represent measurable indicators of ecological relationships and environmental degradation [42,43]. In addition, a simple but effective classification scheme to visualize the internal “structure” of the soundscape inside a habitat should be useful for many applications, including helping the monitoring of a given area for implementing better environmental policies.
One may also attempt to apply pattern recognition algorithms to analyze larger audio recorders in a systematic way, in order to extract the sound subcategories (and possibly other ones) considered in Table 1. This could be of great help, and one can rely on human intervention only to validate the fully automatized algorithm. Our approach represents a methodological effort to describe the environmental sound quality on the basis of a receiver perception. For this reason, although we used short recording periods (3.5 h), obtained from an area of approximately 22 hectares, and some of the geophonic categories were not represented in the audio recordings (wind, rain), we expect this study to provide a useful measure of a habitat soundscape quality in terms of a simple SRI index.
Finally, we expect these results to be useful also in connection with more elaborated approaches based on ecoacoustic indices and associated time series, analyzed using artificial intelligence (AI) methods, as they may serve as simpler qualitative indicators of the soundscape quality to complement the AI results. Work in this direction is in progress.

Author Contributions

Conceptualization, H.E.R., R.B. and G.Z.; methodology, H.E.R. and R.B.; formal analysis, H.E.R.; investigation, R.B., H.E.R. and A.A.; resources, G.Z.; data curation, A.P.; writing—original draft preparation, H.E.R.; writing—review and editing, H.E.R. and R.B.; visualization, R.B., H.E.R. and A.P.; funding acquisition, G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Schafer, R.M. The Soundscape: Our Sonic Environment and the Tuning of the World; Simon and Schuster: New York, NY, USA, 1993. [Google Scholar]
  2. Dumyahn, S.L.; Pijanowski, B.C. Soundscape conservation. Landsc. Ecol. 2011, 26, 1327–1344. [Google Scholar] [CrossRef]
  3. Barber, J.R.; Crooks, K.R.; Fristrup, K.M. The costs of chronic noise exposure for terrestrial organisms. Trends Ecol. Evol. 2010, 25, 180–189. [Google Scholar] [CrossRef]
  4. Lawson, G.M. Networks cities and ecological habitats. In Networks Cities; Qun, F., Brearley, J., Eds.; China Architecture and Building Press: Beijing, China, 2011; pp. 250–253. Available online: https://eprints.qut.edu.au/40229/ (accessed on 15 March 2023).
  5. Francis, C.D.; Newman, P.; Taff, B.D.; White, C.; Monz, C.A.; Levenhagen, M.; Petrelli, A.R.; Abbott, L.C.; Newton, J.; Burson, S.; et al. Acoustic environments matter: Synergistic benefits to humans and ecological communities. J. Environ. Manag. 2017, 203, 245–254. [Google Scholar] [CrossRef]
  6. Doser, J.W.; Hannam, K.M.; Finley, A.O. Characterizing functional relationships between technophony and biophony: A western New York soundscape case study. Landsc. Ecol. 2020, 35, 689–707. [Google Scholar] [CrossRef] [Green Version]
  7. Sayuri Moreira, S.L.; Freire, S.T.S.; Wagner, R.J., Jr.; Diego, L. Terrestrial Passive Acoustic Monitoring: Review and Perspectives. BioScience 2019, 69, 15–25. [Google Scholar] [CrossRef]
  8. Mellinger, D.K.; Stafford, K.M.; Moore, S.E.; Dziak, R.P.; Matsumoto, H. An overview of fixed passive acoustic observation methods for cetaceans. Oceanography 2007, 20, 36–45. Available online: https://www.jstor.org/stable/24860138 (accessed on 15 March 2023). [CrossRef] [Green Version]
  9. Ribeiro, J.W.; Sugai, L.S.M.; Campos-Cerqueira, M. Passive acoustic monitoring as a complementary strategy to assess biodiversity in the Brazilian Amazonia. Biodivers. Conserv. 2017, 26, 2999–3002. [Google Scholar] [CrossRef] [Green Version]
  10. Browning, E.; Gibb, R.; Glover-Kapfer, P.; Jones, K.E. Passive Acoustic Monitoring in Ecology and Conservation; WWF: Woking, UK, 2017; p. 76. Available online: https://repository.oceanbestpractices.org/handle/11329/1370 (accessed on 15 March 2023).
  11. Sueur, J.; Farina, A.; Gasc, A.; Pieretti, N.; Pavoine, S. Acoustic indices for biodiversity assessment and landscape investigation. Acta Acust. United Acust. 2014, 100, 772–781. [Google Scholar] [CrossRef] [Green Version]
  12. Krause, B. The Loss of Natural Soundscapes. Earth Isl. J. 2002, 17, 27–29. Available online: www.earthisland.org/journal/index.php/magazine/archive (accessed on 15 March 2023).
  13. Pijanowski, B.C.; Farina, A.; Gage, S.H.; Dumyahn, S.L.; Krause, B.L. What is soundscape ecology? An introduction and overview of an emerging new science. Landsc. Ecol. 2011, 26, 1213–1232. [Google Scholar] [CrossRef]
  14. Yang, W.; Kang, J. Soundscape and sound preferences in urban squares: A case study in Sheffield. J. Urban Des. 2005, 10, 61–80. [Google Scholar] [CrossRef]
  15. Pavan, G. Fundamentals of Soundscape Conservation. In Ecoacoustics: The Ecological Role of Sounds; Farina, A., Gage, S.H., Eds.; Wiley & Sons: Hoboken, NJ, USA, 2017; pp. 235–258. [Google Scholar] [CrossRef]
  16. Sethi, S.S.; Jones, N.S.; Fulcher, B.D.; Picinali, L.; Clink, D.J.; Klinck, H.; Orme, C.D.L.; Wrege, P.H.; Ewers, R.M. Characterizing soundscapes across diverse ecosystems using a universal acoustic feature set. Proc. Natl. Acad. Sci. USA 2020, 117, 17049–17055. [Google Scholar] [CrossRef] [PubMed]
  17. Liu, J.; Kang, J.; Luo, T.; Behm, H. Landscape effects on soundscape experience in city parks. Sci. Total. Environ. 2013, 454–455, 474–481. [Google Scholar] [CrossRef]
  18. Raimbault, M.; Lavier, C.; Bérengier, M. Ambient sound assessment of urban environments: Field studies in two French cities. Appl. Acoust. 2003, 64, 1241–1256. [Google Scholar] [CrossRef]
  19. Uebel, K.; Rhodes, J.R.; Wilson, K.; Dean, A.J. Urban park soundscapes: Spatial and social factors influencing bird and traffic sound experiences. People Nat. 2022, 4, 1616–1628. [Google Scholar] [CrossRef]
  20. Brambilla, G.; Gallo, V.; Asdrubali, F.; D’Alessro, F. The perceived quality of soundscape in three urban parks in Rome. J. Acoust. Soc. Am. 2013, 134, 832–839. [Google Scholar] [CrossRef]
  21. Szeremeta, B.; Zannin, P.H.T. Analysis and evaluation of soundscapes in public parks through interviews and measurement of noise. Sci. Total. Environ. 2009, 407, 6143–6149. [Google Scholar] [CrossRef]
  22. Uebel, K.; Marselle, M.; Dean, A.J.; Rhodes, J.R.; Bonn, A. Urban green space soundscapes and their perceived restorativeness. People Nat. 2021, 3, 756–769. [Google Scholar] [CrossRef]
  23. Lellouch, L.; Pavoine, S.; Jiguet, F.; Glotin, H.; Sueur, J. Monitoring temporal change of bird communities with dissimilarity acoustic indices. Methods Ecol. Evol. 2014, 5, 495–505. [Google Scholar] [CrossRef]
  24. Kasten, E.P.; Gage, S.H.; Fox, J.; Joo, W. The remote environmental assessment laboratory’s acoustic library: An archive for studying soundscape ecology. Ecol. Inform. 2012, 12, 50–67. [Google Scholar] [CrossRef]
  25. Boelman, N.T.; Asner, G.P.; Hart, P.J.; Martin, R.E. Multitrophic invasion resistance in hawaii: Bioacoustics, field surveys, and airborne remote sensing. Ecol. Appl. 2007, 17, 2137–2144. [Google Scholar] [CrossRef] [PubMed]
  26. Sueur, J.; Pavoine, S.; Hamerlynck, O.; Duvail, S. Rapid acoustic survey for biodiversity appraisal. PLoS ONE 2008, 3, e4065. [Google Scholar] [CrossRef] [Green Version]
  27. Pieretti, N.; Farina, A.; Morri, D. A new methodology to infer the singing activity of an avian community: The Acoustic Complexity Index (ACI). Ecol. Indic. 2011, 11, 868–873. [Google Scholar] [CrossRef]
  28. Eldridge, A.; Guyot, P.; Moscoso, P.; Johnston, A.; Eyre-Walker, Y.; Peck, M. Sounding out ecoacoustic metrics: Avian species richness is predicted by acoustic indices in temperate but not tropical habitats. Ecol. Indic. 2018, 95, 939–952. [Google Scholar] [CrossRef] [Green Version]
  29. Benocci, R.; Brambilla, G.; Bisceglie, A.; Zambon, G. Eco-Acoustic Indices to Evaluate Soundscape Degradation Due to Human Intrusion. Sustainability 2020, 12, 10455. [Google Scholar] [CrossRef]
  30. Bertucci, F.; Parmentier, E.; Berten, L.; Brooker, R.M.; Lecchini, D. Temporal and spatial comparisons of underwater sound signatures of different reef habitats in Moorea Island, French Polynesia. PLoS ONE 2015, 10, e0135733. [Google Scholar] [CrossRef]
  31. Harris, S.A.; Shears, N.T.; Radford, C.A. Ecoacoustic indices as proxies for biodiversity on temperate reefs. Methods Ecol. Evol. 2016, 7, 713–724. [Google Scholar] [CrossRef] [Green Version]
  32. Shonfield, J.; Bayne, E.M. Autonomous recording units in avian ecological research: Current use and future applications. Avian Conserv. Ecol. 2017, 12, 14. [Google Scholar] [CrossRef]
  33. Pérez-Granados, C.; Traba, J. Estimating bird density using passive acoustic monitoring: A review of methods and suggestions for further research. Ibis 2021, 163, 765–783. [Google Scholar] [CrossRef]
  34. Benocci, R.; Roman, H.E.; Bisceglie, A.; Angelini, F.; Brambilla, G.; Zambon, G. Eco-acoustic assessment of an urban park by statistical analysis. Sustainability 2021, 13, 7857. [Google Scholar] [CrossRef]
  35. Benocci, R.; Potenza, A.; Bisceglie, A.; Roman, H.E.; Zambon, G. Mapping of the Acoustic Environment at an Urban Park in the City Area of Milan, Italy, Using Very Low-Cost Sensors. Sensors 2022, 22, 3528. [Google Scholar] [CrossRef]
  36. Truax, B. Handbook for Acoustic Ecology; A.R.C. Publications: Vancouver, BC, Canada, 1978. [Google Scholar]
  37. Pijanowski, B.C.; Villanueva-Rivera, L.J.; Dumyahn, S.L.; Farina, A.; Krause, B.L.; Napoletano, B.M.; Gage, S.H.; Pieretti, N. Soundscape ecology: The science of sound in the landscape. Bioscience 2011, 61, 203–216. [Google Scholar] [CrossRef] [Green Version]
  38. Benocci, R.; Roman, H.E.; Bisceglie, A.; Angelini, F.; Brambilla, G.; Zambon, G. Auto-correlations and long time memory of environment sound: The case of an Urban Park in the city of Milan (Italy). Ecol. Indic. 2022, 134, 108492. [Google Scholar] [CrossRef]
  39. Roman, H.E.; Albergante, M.; Colombo, M.; Croccolo, F.; Marini, F.; Riccardi, C. Modeling cross correlations within a many-assets market. Phys. Rev. E 2006, 73, 036129. [Google Scholar] [CrossRef]
  40. Gage, S.H.; Farina, A. Ecoacoustics: The Ecological Role of Sounds; John Wiley & Sons: Hoboken, NJ, USA, 2017. [Google Scholar]
  41. Sueur, J.; Farina, A. Ecoacoustics: The ecological investigation and interpretation of environment sound. Biosemiotics 2015, 8, 493–502. [Google Scholar] [CrossRef]
  42. Farina, A. Soundscape Ecology; Springer: Dordrecht, The Netherlands, 2014. [Google Scholar]
  43. Krause, B.; Farina, A. Using ecoacoustic methods to survey the impacts of climate change on biodiversity. Biol. Conserv. 2016, 195, 245–254. [Google Scholar] [CrossRef]
Figure 1. The Parco Nord (Northern Park) area of study. The sensor network is indicated by the numbered dots (1–22), deployed inside the tree-covered zone of the park. The network consists of two regular grids, the northern and the southern ones, composed of 16 and 6 nodes, respectively. Those in red color are the actually used sensors (total number 16), while those in yellow are the six ones discarded due to malfunctioning. One can see the A4 highway, the Padre Turoldo street and the artificial lake to the northern part of the park, while to the west side of it, the small Bresso airport runaway is located.
Figure 1. The Parco Nord (Northern Park) area of study. The sensor network is indicated by the numbered dots (1–22), deployed inside the tree-covered zone of the park. The network consists of two regular grids, the northern and the southern ones, composed of 16 and 6 nodes, respectively. Those in red color are the actually used sensors (total number 16), while those in yellow are the six ones discarded due to malfunctioning. One can see the A4 highway, the Padre Turoldo street and the artificial lake to the northern part of the park, while to the west side of it, the small Bresso airport runaway is located.
Sensors 23 03401 g001
Figure 2. Variability of SRI at each active site. The distribution functions H(SRI) were obtained using Equation (1), from the initial values of parameters P ( n ) = ( 2 , 1 , 0 , 1 , 2 ) . The scale for SRI is fixed for all sites in the range (−5, 10). The two sets of colors (black on top and red on bottom panel) are for convenience but indicate that the former are centered to lower values of SRI while the latter are centered to more positive ones, suggesting an underlying organization of the sites into two main groups.
Figure 2. Variability of SRI at each active site. The distribution functions H(SRI) were obtained using Equation (1), from the initial values of parameters P ( n ) = ( 2 , 1 , 0 , 1 , 2 ) . The scale for SRI is fixed for all sites in the range (−5, 10). The two sets of colors (black on top and red on bottom panel) are for convenience but indicate that the former are centered to lower values of SRI while the latter are centered to more positive ones, suggesting an underlying organization of the sites into two main groups.
Sensors 23 03401 g002
Figure 3. Dissimilarity, D i , j , between two sites, ( i , j ) vs. inter-site distance R i , j , obtained from Equation (2) using the initial set of parameters P ( n ) = ( 2 , 1 , 0 , 1 , 2 ) . The full squares are the empirical data, and the straight line is a least-square fit with the power law D = a R α , yielding a = 0.049 , corresponding to R 0 = 8500 m, and α 1 / 3 . Specifically, we find D R α 2.354 and R 2 α 48.03 . The total error of the fit is Σ 0 = 0.147 . The mean inter-site distance in the network is R i , j = 353.64 m, the minimum and maximum ones: R min = 75 m and R max = 800 m, respectively.
Figure 3. Dissimilarity, D i , j , between two sites, ( i , j ) vs. inter-site distance R i , j , obtained from Equation (2) using the initial set of parameters P ( n ) = ( 2 , 1 , 0 , 1 , 2 ) . The full squares are the empirical data, and the straight line is a least-square fit with the power law D = a R α , yielding a = 0.049 , corresponding to R 0 = 8500 m, and α 1 / 3 . Specifically, we find D R α 2.354 and R 2 α 48.03 . The total error of the fit is Σ 0 = 0.147 . The mean inter-site distance in the network is R i , j = 353.64 m, the minimum and maximum ones: R min = 75 m and R max = 800 m, respectively.
Sensors 23 03401 g003
Figure 4. Probability density distribution of dissimilarity, corresponding to the values used in Figure 3. We find D 0.333 (vertical red line). The continuous line is a fit with the form y = A x a / [ 1 + ( x / b ) c ] , with the normalization constant A = 19 , and the fit parameters a = 1.19 , b = 0.31 and c = 4.17 .
Figure 4. Probability density distribution of dissimilarity, corresponding to the values used in Figure 3. We find D 0.333 (vertical red line). The continuous line is a fit with the form y = A x a / [ 1 + ( x / b ) c ] , with the normalization constant A = 19 , and the fit parameters a = 1.19 , b = 0.31 and c = 4.17 .
Sensors 23 03401 g004
Figure 5. Mean 〈SRI 0 〉 obtained by averaging the index over the 70 temporal values for each active site (yellow bars). The red and blue bars display the one σ ( i ) variations of the data at each site i, above and below the local mean values, respectively. The dashed lines represent averages over all sites of the three set of data. We find 〈SRI 0 〉 = 2.14, 〈SRI 0 〉+ σ = 3.68, 〈SRI 0 〉− σ = 0.60, with σ = 1.54 . The actual site identification numbers are reported just below the zero line: (1–6, 8, 10–13, 17–20, 22).
Figure 5. Mean 〈SRI 0 〉 obtained by averaging the index over the 70 temporal values for each active site (yellow bars). The red and blue bars display the one σ ( i ) variations of the data at each site i, above and below the local mean values, respectively. The dashed lines represent averages over all sites of the three set of data. We find 〈SRI 0 〉 = 2.14, 〈SRI 0 〉+ σ = 3.68, 〈SRI 0 〉− σ = 0.60, with σ = 1.54 . The actual site identification numbers are reported just below the zero line: (1–6, 8, 10–13, 17–20, 22).
Sensors 23 03401 g005
Figure 6. Dissimilarity, D i , j , between two sites, ( i , j ) vs. inter-site distance R i , j , obtained from the optimized parameters P ( n ) = ( 2.29 , 0.766 , 0 , 1.528 , 2.262 ) , corresponding to the best value α = 1 / 2 . The full circles are the empirical data, and the straight line is a least-square fit with the power law D = a R α , yielding a = 0.021 , corresponding to R 0 = 2270 m, with D R α 7.39 and R 2 α 353.6 . The total error of the fit is Σ = 0.139 , which is smaller than the one obtained for the initial parameter values, Σ 0 = 0.147 (cf. Figure 3).
Figure 6. Dissimilarity, D i , j , between two sites, ( i , j ) vs. inter-site distance R i , j , obtained from the optimized parameters P ( n ) = ( 2.29 , 0.766 , 0 , 1.528 , 2.262 ) , corresponding to the best value α = 1 / 2 . The full circles are the empirical data, and the straight line is a least-square fit with the power law D = a R α , yielding a = 0.021 , corresponding to R 0 = 2270 m, with D R α 7.39 and R 2 α 353.6 . The total error of the fit is Σ = 0.139 , which is smaller than the one obtained for the initial parameter values, Σ 0 = 0.147 (cf. Figure 3).
Sensors 23 03401 g006
Figure 7. Probability density distribution of dissimilarity, corresponding to the values used in Figure 6. We find D 0.375 (vertical red line). The continuous line is a fit with the form y = A x a / [ 1 + ( x / b ) c ] , with the normalization constant A = 11 and the fit parameters a = 0.96 , b = 0.33 and c = 3.03 .
Figure 7. Probability density distribution of dissimilarity, corresponding to the values used in Figure 6. We find D 0.375 (vertical red line). The continuous line is a fit with the form y = A x a / [ 1 + ( x / b ) c ] , with the normalization constant A = 11 and the fit parameters a = 0.96 , b = 0.33 and c = 3.03 .
Sensors 23 03401 g007
Figure 8. The distribution functions H(SRI) for the optimized parameters displayed in Equation (6). The scale for SRI is the same used in Figure 2. Here again, we have used two sets of colors for the histograms.
Figure 8. The distribution functions H(SRI) for the optimized parameters displayed in Equation (6). The scale for SRI is the same used in Figure 2. Here again, we have used two sets of colors for the histograms.
Sensors 23 03401 g008
Figure 9. Same as in Figure 5 for the self-consistent set of parameters (Equation (6)). Here, we find 〈SRI〉 = 1.65 (2.14), 〈SRI〉+ σ = 3.53 (3.68), 〈SRI〉- σ = −0.23 (0.60), with σ = 1.88 (1.54). The corresponding values from Figure 5 are reported in parenthesis.
Figure 9. Same as in Figure 5 for the self-consistent set of parameters (Equation (6)). Here, we find 〈SRI〉 = 1.65 (2.14), 〈SRI〉+ σ = 3.53 (3.68), 〈SRI〉- σ = −0.23 (0.60), with σ = 1.88 (1.54). The corresponding values from Figure 5 are reported in parenthesis.
Sensors 23 03401 g009
Figure 10. Final classification of the active sensors: (red circles) 〈SRI〉 < 0 , (yellow circles) 0 < SRI < 2 (sensor 4 has been indicated with a red border line since its SRI 0 ), and (green circles) 〈SRI〉 > 2 . The latter represents the best quality locations in the network. The value of the optimized parameters are displayed in Table 4. The inactive sensors (7, 9, 14, 15, 16, 21) are depicted by the small circles.
Figure 10. Final classification of the active sensors: (red circles) 〈SRI〉 < 0 , (yellow circles) 0 < SRI < 2 (sensor 4 has been indicated with a red border line since its SRI 0 ), and (green circles) 〈SRI〉 > 2 . The latter represents the best quality locations in the network. The value of the optimized parameters are displayed in Table 4. The inactive sensors (7, 9, 14, 15, 16, 21) are depicted by the small circles.
Sensors 23 03401 g010
Table 1. Sound categories corresponding to the 19 identified sources. (First column) Sound sources. (Second column) Quantity/Duration/Level. (Third column) Parameter index n i ( n i { 1 5 } and i = 1 , 19 ) associated with the ith category. (Fourth column) Identification feature.
Table 1. Sound categories corresponding to the 19 identified sources. (First column) Sound sources. (Second column) Quantity/Duration/Level. (Third column) Parameter index n i ( n i { 1 5 } and i = 1 , 19 ) associated with the ith category. (Fourth column) Identification feature.
Sound SourceQuantityParametersIdentification
Many1Many birds
Birds numberFew2Few birds, no traffic
None3No birds, no other sources
>11Many species
Birds species12One species, no traffic
None3No birds, no other sources
100%160 s
Birds   75%145 s
sound   50%130 s
duration   25%115 s
    0%1  0 s
None2Few birds, no traffic
Traffic levelLow4Continuous/low traffic
High5Intermittent/high traffic
None2No traffic
Traffic typeContinuous4Continuous/low traffic
Intermittent5Intermittent/high traffic
TrainsPresent5Trains
AirplanesPresent5Airplanes, other sources
Table 2. The starting parameters P ( n ) ( n = 1 , 5 ) associated with each sound category to be used in Equation (1) (see also [38]).
Table 2. The starting parameters P ( n ) ( n = 1 , 5 ) associated with each sound category to be used in Equation (1) (see also [38]).
nP(n)Identification
1  2Many birds/species
2  1Few birds/no traffic
3  0No birds/no other sources
4−1Continuous/low traffic
5−2Intermittent/high traffic/other sources
Table 3. Quality intervals for the mean soundscape ranking index, 〈SRI〉.
Table 3. Quality intervals for the mean soundscape ranking index, 〈SRI〉.
Poor QualityMedium QualityGood Quality
SRI < 00 ≤ SRI ≤ 2SRI > 2
Table 4. Summary of the main results for the exponent α in Equation (3), the parameters P ( n ) ( n = 1 , 5 ), the mean value of the SRI in the network, the dispersion of the SRI values σ (cf. Figure 5 and Figure 9), the mean value of dissimilarity D among all sites, the effective scale distance R 0 (cf. Equation (3)) and the total error of the fit Σ (Equation (4)).
Table 4. Summary of the main results for the exponent α in Equation (3), the parameters P ( n ) ( n = 1 , 5 ), the mean value of the SRI in the network, the dispersion of the SRI values σ (cf. Figure 5 and Figure 9), the mean value of dissimilarity D among all sites, the effective scale distance R 0 (cf. Equation (3)) and the total error of the fit Σ (Equation (4)).
Initial ParametersOptimized Parameters
α 1/31/2
P(1) 2.0 2.290
P(2) 1.0 0.766
P(3) 0.0 0.000
P(4)−1.0−1.528
P(5)−2.0−2.262
〈SRI〉 2.14 1.65
σ  1.54 1.88
D  0.333 0.375
R 0 8500 m2270 m
Σ 0.1470.139
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Benocci, R.; Afify, A.; Potenza, A.; Roman, H.E.; Zambon, G. Self-Consistent Soundscape Ranking Index: The Case of an Urban Park. Sensors 2023, 23, 3401. https://doi.org/10.3390/s23073401

AMA Style

Benocci R, Afify A, Potenza A, Roman HE, Zambon G. Self-Consistent Soundscape Ranking Index: The Case of an Urban Park. Sensors. 2023; 23(7):3401. https://doi.org/10.3390/s23073401

Chicago/Turabian Style

Benocci, Roberto, Andrea Afify, Andrea Potenza, H. Eduardo Roman, and Giovanni Zambon. 2023. "Self-Consistent Soundscape Ranking Index: The Case of an Urban Park" Sensors 23, no. 7: 3401. https://doi.org/10.3390/s23073401

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop