1. Introduction
Real-time mapping systems for noise pollution monitoring is a challenging issue that has already been addressed in several research works [
1,
2]. In this context, the study of noise pollution in urban environments aimed at developing networked monitoring systems combined with spatial models for predicting sound pressure levels is not straightforward.
Noise pollution is a common problem in urban environments. It affects human behavior, health and even children’s cognition. Recognizing this as a major issue, the European Commission adopted a directive [
3], requiring main cities (with more than 100,000 inhabitants) to gather real world data on noise exposure in order to produce local action plans and to provide accurate mappings of noise pollution levels. The traditional way of conducting noise measurements is by collecting noise samples manually, but this technique has many drawbacks. On the one hand, only local and sparse measurements are taken. On the other hand, it is expensive due to the measuring equipment and personnel costs.
However, the European Commission recommends for higher granularity of noise data both in space and time. In this scenario, Wireless Sensor Networks (WSNs) represent an alternative that can overcome the drawbacks of the current noise data collection procedure. These WSNs are composed of small autonomous nodes with sensing capabilities. Each node has its own power supply, processing unit and memory. The nodes communicate using multi-hop routing protocols and at least one node (named sink) acts as a gateway for external connection. In the last decade, several studies have been conducted using WSN for noise pollution monitoring. All of these studies are based on the equivalent sound pressure level over time T.
Several works have considered the use of WASNs for noise monitoring. In [
4,
5], authors have evaluated a WASN using Tmote-Sky motes [
6] and Tmote Invent (TmI), to monitor traffic noise using the equivalent level,
and to count the number and type of vehicles. In this deployment, they used a sampling frequency of 8 kHz. In their study, they found that Tmote-Sky had excessive self-noise and TmI (with an integrated microphone) had apparently good audio features. In the references, the authors do not provide a specific calibration.
In [
7,
8], a WASN deployment in Ostrobothnia (Finland) is discussed. In these references, the authors report different tests to evaluate the noise impact. They measured the
with
ms using a sampling frequency of 33 kHz, with 14 calibrated motes (MicaZ from Crossbow -now this company is MEMSIC- with an ad hoc acquisition circuitry to allow a dynamic range of 60 dB), globally synchronized during 96 h with good results.
Other references such as [
1,
9,
10] have used mobile phones for noise pollution monitoring. Although the results are interesting, in our opinion, there is a lack of information about the recording conditions which avoids getting accurate noise measurements. When evaluating noise parameters, the location of the measuring devices should follow specific rules [
3].
In all of the previous references [
1,
4,
5,
6,
7,
8,
9,
10], the measurements are based on the
or its A-weighted version (ITU-R 468),
. These measurements apply a frequency-selective filter that picks up the frequency range around 3–6 kHz, where the human ear is most sensitive. These parameters are measured in dB and dBA, respectively.
The data collection procedure from the sensor nodes must also be considered. In the literature [
11], there are several approaches that allow a good data collection. These protocols are associated not only to the routing protocols of the information to the sink, but also to the application to represent the information which can be presented as a web-service.
Another issue to consider in the data information system is the statistical representation of the noise data information oriented to detect problematic areas in real time. A promising tool for this kind of noise mapping is given by geo-statistics, which provides a set of statistical techniques specifically designed for spatial problems. These methods are aimed at predicting the values of a certain phenomenon over an area where several points have been sampled [
12]. The predictions are based on statistical models that can incorporate additional information as explicative variables. Moreover, the prediction error can be estimated using uncertainty propagation [
13,
14].
The use of an equally distributed sampling grid allows for a better determination of the mean value for the underlying statistical distribution of the sampled data obtained at the measuring points. For this reason, the use of geostatistical interpolation, as performed by the Ordinary Kriging (OK) method [
12], can be applied, since the mean value of the measured distribution is already known.
The goal of this paper is to analyze the suitability of a hardware system for noise measuring, based on Raspberry Pi nodes with a condenser microphone as an acoustic sensor and a data collection system, combined with geo-statistical techniques which are applied to the problem of noise mapping in a small city in the Valencian Community in Spain. The advantages provided by geographic information systems (GIS) are also used to locate precisely the measurement locations within a heterogeneous environment. A predictive model is developed by analyzing the noise pollution measurements obtained from a small set of points over a specific city. This model allows for obtaining punctual level estimates and calculating the spatial-temporal variation of the noise pollution level. To this end, the OK interpolation method will be considered. The main objective of this paper is then to evaluate the performance of the Wireless Acoustic Sensor Network (WASN) in combination with this spatial-temporal interpolating method at estimating the values corresponding to unsampled spatial locations, provided that the sampling grid has been correctly designed. This applied study is oriented to find the noise climate in different points for predicting global behaviors. As a result, the method allows for establishing environmental local policies for noise control and zonification at a municipal level, evaluating the local impact of the sound sources.
2. Material and Method
Before describing the developed monitoring system and the spatial statistical predictive model and its application, some notes about the place, the network and the measurements are next given.
2.1. The City
The measurements were carried out in Algemesí (Valencia, Spain). This is a small city with 28,000 inhabitants, located at the South-East of Valencia at coordinates N 9.12 W 0.12. The city is in an agricultural area, although an incipient industrial sector is also growing. The Magre River, which is an affluent of the Xuquer River, flows in the NW-SE of the town settling a natural border.
2.2. The Network
The used nodes in the network are based on Raspberry Pi B (RPi) platform. This platform is based on Broadcom BCM2835 System-on-Chip (SoC) [
15], including an ARM1176JZF-S 700 MHz processor [
16], a Graphic Processing Unit (GPU) and 512 MB of RAM, with a SD slot card memory. A Logilink UA0053 USB sound card [
17] was installed in the RPis in order to record the audio, with an electret omnidirectional microphone [
18]. The frequency response of the electret microphone is nearly flat from 60-16000KHz. Also a WiFi adapter TP-Link TL-WN725N [
19] with IEEE 802.11 b/g/n standard has been introduced in the setup for communication purposes. This configuration allows each node to acquire several seconds of audio with 16 bits per sample (allowing a dynamic range of
dB) at a sampling frequency of 22.05 kHz. The RPi have been placed in a protective housing, each with 3 Kodak 1.5 V KDLR20 batteries (19,500 mAh) [
20]. A mean current consumption of 550 mAh (5 V) was measured, by disabling all unnecessary services in the operating system. It allows a life-time of 28 hours, considering an 80% efficiency for the batteries.
A Raspbian Operating System [
21] was installed in the RPi, which is a GNU/Linux version optimized for RPi hardware, based on a Debian distribution. To implement a Wireless Mesh Network, the Babel [
22] routing protocol was used. This is a loop free, proactive protocol and it is based on distance vector algorithm. Babel is implemented in the Quagga [
22] software suite. Also, a Cloud client called OwnCloud [
23] was installed in each node, in order to gather the information from the different sensors into one system, connected to the Internet. The Owncloud server application was installed in a PC. This system allows storing and automatic synchronization of the information from the different nodes spatially distributed.
A batch script has been programmed in order to setup an automatic measurement for 10 min [
24,
25] every 3 h in each node. This allows a synchronized measurement within the established network.
Figure 1 shows an RPi node with the electret microphone outside the box.
Finally, a web-service based on OpenCPU [
26] has been installed on a web server. This web-service allows programming in R. Its purpose is to harvest data from the cloud and process it using spatial statistics as explained later.
Figure 2 shows the result of the node 4 (see location in
Figure 3).
In order to validate the sonometric records obtained with the RPi, a calibrated Type I standard sound level meter (CESVA SC-310 [
27]) has been used to calibrate and check the different measurements at the same time of the calibrating experiments. Using previously digital recorded audio files of road traffic in one of the locations the mean and standard deviation of the
during a 10 min period have analyzed. Before setting up the network test, a standard calibration using piston phones has been performed, to avoid the misalignment in the measured parameters due to the mismatches in microphones’ sensitivities and frequency responses. The measurements were compared with the standard sound level meter and showed less than
dB error, both in short-term and long-term measures. This difference has been considered to adjust the values of the on-site measurements.
2.3. The Measurements
The measurements were carried out following a grid over an urban plan of the city, in an extension of 1.80 km
. The measurements were conducted at 78 positions as shown in the map of
Figure 3. The measurement set was divided into periods of 3 hours, covering the diurnal period from 07:00 h to 22:00 h. Each location has one measurement in every regular temporal interval. A total of 390 measurements were obtained for performing the statistical analysis. The measurements were taken simultaneously by using a network with 39 RPi nodes measuring during 10 minutes in each time period. These measurements were taken in three weeks. Some of the nodes (27) were located in fixed places. The rest (12 nodes) were used as moving nodes, but immovable while measuring, in order to cover all the measuring network.
Figure 3a shows the locations of all the measurement points in the urban area. The fixed nodes are shown in
Figure 3b as hollowed squares and the moving nodes as filled squares. These moving nodes have been changed during the three weeks that lasted the experiment in order to record sound levels in all the day-time periods for every measuring location.
In the measurement places, traffic noise is recognized as one of the major contributors to the environmental noise [
28]. Therefore, traffic conditions during the measurement period should be also conveniently reported when conducting the measurements. Information on the number and vehicle types (light/heavy vehicles or motorcycles) was gathered in the measuring nodes, as well as their average speed and the kind of pavement on the road (asphalt, concrete, paving stones or soil). This information has been gathered by configuring RPi camera [
29] and storing the video (coded using QCIF resolution and H.263) at the same folder in the Cloud using the same time as the noise measurement in each node for each time period. It is important to note that the physical characteristics of the environment and the presence of animals or people speaking can also influence the noise level. This information is reflected in a set of qualitative or quantitative variables, as summarized in
Table 1.
2.4. Descriptive analysis of the data
The measurement process, especially its temporal distribution, leads us to 5 equally distributed time slots covering the diurnal period from 07:00 h to 22:00 h. For each of these time slots, there are 78 samples available, one per measurement location.
Table 1 describes the information gathered by each node.
Table 2 and
Figure 4 show the results of the descriptive analysis of the noise level in these five temporal periods. The values in
Table 2 and the box diagrams suggest the invariance of the mean value in the measured data, whose results are shown in
Table 3.
The 78 locations are divided into two sets in order to check the validity of the proposed models: a modelling set, composed of 68 locations and a validation set, which has 10 locations. Both sets are used for all of the time periods and are shown in
Figure 5. In order to select the nodes of the validation set, a distribution that bounds the semivariogram error has been chosen, so the nodes are spatially distributed throughout the whole area.
2.5. Spatial Analysis
The obtained noise pollution measurements constitute a data set linked to several locations with its geographical coordinates, longitude and latitude. By denoting the acoustic noise level at a location x as , we can define this data set as , where are all the locations of the modeling set (68 positions).
In this context, the objective of our model is the prediction of
in any location
, particularly those in the validation set. The noise reports contain information (as shown in
Table 1) of the set of covariables included. Consequently,
is modeled as a tendency function of the influencing covariables in the process that explains its variability in a large extent plus some random error explaining the short term variability, i.e.,
where
and
is a Gaussian process intrinsically stationary with zero mean, whose spatial dependence characterization is given by the variogram
[
30]:
where
denotes the variance and
h is an offset. In this study, the variogram has been calculated with the R statistical package for variogram fitting [
31].
3. Results and Discussion
Several studies [
28,
32,
33] conclude that
is the covariable that better explains the noise level, where
Q is the total number of vehicles. This coincides with other studies [
34,
35] on noise pollution in close urban areas. In this case, the variable
seems to be the determinant element for the noise level in the city, and it is a significant covariable in all of the time periods. These covariables have been analyzed separately in each one of the five time periods. However, other variables could have a significant impact in any of the time periods, although less important than the road traffic. This is the reason why the following simple linear model has been adopted for all the time periods:
where
and
determine the corresponding offset and slope of the model, respectively.
Table 4 gathers the tendency estimations of both parameters and their errors for the data in the five time periods. Once the tendency is adjusted, a first analysis of the spatial structure of the residuals has been conducted. The residuals have been represented graphically in the geographical coordinates (
Figure 6) and a contrast test has been performed to check the spatial independence of the observations [
30]. The nugget effect (see variogram part of
Table 4) can be attributed to measurement errors or spatial sources of variation at smaller distances than the sampling interval (or both). Measurement errors occur due to the inherent error in measuring devices. Natural phenomena can vary spatially over a range of scales (i.e., micro or macro scales). Variation at micro scales smaller than the sampling distances will appear as part of the nugget effect. When looking at the model of a semivariogram, it is noticeable that, at a certain distance from the measurement points of the model, the error levels are higher but bounded (see
Figure 7b). The distance where the model first flattens out is known as the range (in this case, the range is related to the arc seconds of the coordinates). The sill is the value where the semivariogram model attains the range (the value on the
y-axis).
From the
P-values in the independence test shown in
Table 3 and the graphics shown in
Figure 6, we can say that the residuals are spatially correlated in all the time periods, except for the time period 5 (
Figure 6e). The time periods 1 and 4 have a unique dependence in latitude, while periods 2 and 3 are correlated in both coordinates. This spatial correlation observed in
Figure 6 is probably related to the orientation of most of the streets in the town. If the orientation of the streets is considered in the plan of the town, they are mostly oriented in the predominant direction North–South.
The adjustment of exponential isotropic variograms for the five data sets is done by using the Restricted Maximum Likelihood method (REML), which allows combining recursively the coefficients of the tendency
,
, and the components of the variogram. These values are shown in
Table 4.
The goodness of the fitted model to each time period has been evaluated from the predictions of the noise level in the 10 locations of the validation set. The mean squared error (MSE) of these predictions has been obtained.
Table 4 shows two values of the MSE. The first, MSE.lm, is calculated with the predictions obtained using the model fitted to the trend without making use of the residuals spatial modeling. The second, MSE.glm, is obtained from the predictions provided by the OK method.
An alternative way to measure the goodness of the fitting is based on cross-validation (CRV) over the modeling set itself, as suggested in [
36]. The authors introduce three quantities to measure the goodness of model fitting: CRV1 contrasts with the unbiasedness of the prediction and its value must be equal to 0; CRV2 measures the accuracy of the standard deviation of the prediction error and its value is around 1; and finally CRV3 which measures the prediction error. The last row of
Table 4 shows the values of these three parameters.
From the observation of
Table 4, it appears that, from the point of view of the spatial prediction, modeling does not seem to have any advantage over the simple use of the trend. However, the goodness of fit is enhanced by spatial techniques, the values of the CRV
x.glm clearly improve the CRV
x.glm (these values are not shown in the table for this reason).
In order to cross-check the validity of the number of selected points, a Leave-One-Out- Cross-Validation algorithm in R (with the “Regularized Random Forest” method) has been used for the whole set of time averaged measurements.
Table 5 shows the results of this test to select the nodes and generates a model to predict the % established in the first column (%test/%training). In this table, mtry stands for the number of randomly selected predictors, coefReg is the regularization value and coefImp is the importance coefficient. The RMSE is the minimum of the list of results obtained with this algorithm. According to this table, the selection of the validation set is enough for our network of measuring nodes. The test set has been checked against the model and the difference is in the range of the recommendation of the ISO 1996-2.
Figure 8 shows a representation of the time evolution of the noise levels in Algemesí. This representation shows that the entrance to the city and its main roads (corresponding to Generalitat Av. and Ronda del Calvari in the left and upper part of the Figure) are the most noise polluted. This is due to the amount of vehicles passing through these main streets. In addition, in this area, the mean speed was reported to be higher than in other locations.
From the models obtained by using this analysis, we have been able to make predictions and their variances with OK. Using geoR library [
37],
Figure 9 shows a representation of the average noise levels in Algemesí on a 50 by 50 grid for the average value of the
taking into account all the time periods. A representation of the noise levels on a plan of Algemesí (Valencia) can be observed together with the kriging of the time average values for each measuring point, so this kriging should be representative enough for all the time periods.
4. Conclusions
In this paper, a distributed noise measurement system based on Internet of Things (IoT) technology developed with Raspberry Pi nodes and a data collection system based in OpenCPU framework, which is related to the R statistical programming language, has been tested to measure the hourly sound pressure level evolution.
With this data, the application of a geostatistical method for interpolating spatial noise levels in a small-sized city in Valencia has been presented. The methodology makes the determination of the simultaneous spatial variation of noise levels in different time periods possible. From the results obtained, some notable conclusions can be extracted. One of them is that the kriging method has been proven as an efficient method to show noise level information in real-time. In addition, the residuals are spatially correlated in most time periods, except for time period 5. Perhaps, this fact is due to the spatial variation of the noise levels in the whole measuring grid in this period. For this reason, the last time period has been excluded in the model, due to the spatial non-correlation of the residuals in the period between 19 h and 22 h.
The goodness of the model in each time period has been checked from the predictions of the noise level in the 10 locations of the validation set (i.e., 13% of the nodes). The results have proven the validity of the spatial model in four time periods. This can be also checked within the cross validation results.
The methodology allows for representing these noise models, enabling the localization of critical areas in different time periods and the prediction of noise levels in different parts of the city. It also allows for showing that the noise levels are highly dependent on the spatial statistic model defined [
38].
This method differs greatly from the issue of the subjective assessment of the noise annoyance, which poses quite a problem for those who are interested in developing and standardising units of measurement that convey the extent of the intensity of noise. It is also well-known that the decibel and the A-weighted system were developed precisely for this purpose, but they are far from being perfect [
39]. At this point, it is worth mentioning that great efforts are being made for the soundscape description that is being standardised in ISO 12913 [
40].
Further work on these issues is currently being done by increasing the number of measuring nodes in the WASN to measure urban noise, also previewed to collect full day measurements to compute parameters , and . This technology in combination with the explained methodology is a promising powerful tool for real-time distributed noise measurement and assessment.