Detection of Water on Road Surface with Acoustic Vector Sensor

Kotus, Józef; Szwoch, Grzegorz

doi:10.3390/s23218878


by Józef Kotus * and Grzegorz Szwoch

Department of Multimedia Systems, Faculty of Electronics, Telecommunication and Informatics, Gdansk University of Technology, Narutowicza 11/12, 80-233 Gdańsk, Poland

* Author to whom correspondence should be addressed.

Sensors 2023, 23(21), 8878; https://doi.org/10.3390/s23218878

Submission received: 6 October 2023 / Revised: 23 October 2023 / Accepted: 29 October 2023 / Published: 1 November 2023

(This article belongs to the Special Issue Advanced Sensing Technology for Environment Monitoring)


Abstract:

This paper presents a new approach to detecting the presence of water on a road surface, employing an acoustic vector sensor. The proposed method is based on sound intensity analysis in the frequency domain. Acoustic events, representing road vehicles, are detected in the sound intensity signals. The direction of the incoming sound is calculated for the individual spectral components of the intensity signal, and the components not originating from the observed road section are discarded. Next, an estimate of the road surface state is calculated from the sound intensity spectrum, and the wet surface detection is performed by comparing the estimate with a threshold. The proposed method was evaluated using sound recordings made in a real-world scenario, and the algorithm results were compared with data from a reference device. The proposed algorithm achieved 89% precision, recall and F₁ score, and it outperforms the traditional approach based on sound pressure analysis. The test results confirm that the proposed method may be used for the detection of water on the road surface with acoustic sensors as an element of a smart city monitoring system.

Keywords:

water on the road; road surface state; acoustics monitoring; sound intensity; smart city sensors

1. Introduction

Smart city systems are the current trend in environmental monitoring. A network of sensors installed in an urban area provides continuous streams of data that are analyzed in data centers. Monitoring of the urban traffic system is one of the most important applications of smart city systems. The goal is to manage city traffic efficiently and to increase the safety of drivers and pedestrians. The presence of a water layer on the road surface is an important factor that increases the risk of traffic accidents [1,2]. Detection of water on the road is, therefore, an essential function of a smart city system, which may alert drivers and ask them to drive carefully. An efficient smart city system requires a large network of preferably low-cost sensors [3]. State-of-the-art devices for measuring the thickness of the water layer on the road surface are mostly based on optical (laser) sensors. Such devices are large and expensive; therefore, they are not suitable for large smart city systems.

The authors propose to use a small, low-cost acoustic sensor for the assessment of water presence on the road. Acoustic sensors are typically used in smart city systems for noise level measurements [4], but they are capable of providing other useful data. In previous work, the authors evaluated the usefulness of an acoustic vector sensor (AVS) in traffic analysis. In contrast to standard single-microphone sensors that only measure sound pressure, the AVS measures sound intensity, which is a vector quantity and allows for the determination of the incoming sound direction [5,6]. In previous publications, the authors have successfully applied sound intensity analysis to the detection of road vehicles and their direction of movement [7]. This paper presents a new application of sound intensity analysis: the determination of water presence on the road surface. Preliminary results were presented in a conference talk [8]. Here, a fully developed method is presented, together with test results compared with data obtained from a reference device.

Published works on the detection of water on a road using acoustic methods are based on pressure signal analysis. Most of the published methods utilize a machine learning approach, which, while effective, requires a large set of training examples. Abdic et al. [9] used a recurrent neural network (NN) trained on 785,826 audio examples and achieved 93.2% recall. Kongrattanaprasert et al. [10] utilized multiple NNs with learning vector quantization networks and obtained 80% accuracy. Shariff et al. [11] employed a convolutional NN with transfer learning to avoid the need for supervised learning with a large dataset, achieving above 80% accuracy. In another publication [12], they trained an NN with scalograms calculated from audio data and reached almost 90% accuracy. Bahrami et al. [13] used a two-stream convolutional NN for the analysis of acoustic features extracted from the noise recorded inside the vehicle. Alonso et al. [14] proposed a method suitable for on-board analysis of tire noise combined with data from the vehicle microcomputer using Support Vector Machines. Kalliris et al. [15] compared various machine learning algorithms and found that Support Vector Machines using quadratic and cubic kernels achieved the best results. Wang et al. [16] used an NN trained with a combined set of features extracted from both images and the tire noise, resulting in 92.7% accuracy. Akama et al. [17] used a different approach based on a Bayesian estimator, reporting 99% accuracy on their test set.

The published works mentioned above used the traditional approach based on the analysis of pressure signals recorded with single or multiple microphones. Analysis of sound intensity provides additional information on the sound source direction without the need to use complex microphone setups and apply beamforming algorithms. Sound intensity probes have been known for many years, but due to their high cost compared with standard microphones, they were rarely considered for applications such as the one described in this paper. Acoustic vector sensors built from pairs of matched microphones are also known, but they were rarely used in research, mostly because of the large sensor size when standard microphones are used. Advances in the technology resulted in the appearance of miniature (less than 5 mm) MEMS (Micro-Electro-Mechanical System) microphones. With these microphones, it became possible to construct a small AVS able to determine sound intensity and sound source direction. The aim of this paper is to propose the following modifications to the methods of detection of water on the road surface published in the literature so far. The analysis is performed on the sound intensity instead of the sound pressure, which makes it possible to determine the direction of sound sources. Using this information, the proposed method attenuates sound intensity components not originating from the vehicle tires making contact with the surface of an observed road section. The proposed modifications allow for more accurate sound analysis than in the case of pressure sensors.

The rest of the paper is organized as follows. Section 2 describes the stages of the method: sound intensity calculation, determination of the incoming sound direction, elimination of the unwanted signal components, detection of acoustic events and, finally, calculation of the measure that represents the presence of the water layer on the road. Section 3 describes the test setup and the results of the performed experiments, including the comparison with the reference data, and Section 4 discusses them. The paper ends with conclusions and indications of future work.

2. Materials and Methods

The proposed method of detecting water presence on the road surface using an AVS works in several stages. A general block diagram is shown in Figure 1, and the details of each stage are presented in the following subsections of the paper.

2.1. Sound Intensity

Sound intensity is a measure that describes the energy flow in sound waves, defined as the power carried by sound waves per unit area in a direction perpendicular to that area. Sound intensity, methods of measurement and practical applications became known to the acousticians thanks to the research published by Fahy [18], later extended by Jacobsen [19], with a focus on the measurement methods. Sound intensity is a vector quantity defined as:

I = \frac{1}{T} \int_{0}^{T} p (t) u (t) d t,

(1)

where p(t) is sound pressure (scalar), and u(t) is acoustic velocity (vector).

The acoustic velocity u may be approximated with a pressure gradient calculated from the measurements obtained from two closely spaced microphones, p₁(t) and p₂(t). This is called the ‘p-p’ method, and it requires that the two microphones are matched in terms of their parameters, which may be ensured using a calibration procedure [20]. Instantaneous sound intensity may then be calculated as [18]:

I (t) = \frac{p_{1} (t) + p_{2} (t)}{2 ρ r} \int_{- \infty}^{t} (p_{1} (τ) - p_{2} (τ)) d τ,

(2)

where ρ is air density, and r is the spacing between the pressure sensors.

Sound intensity may be calculated in the time domain by averaging the instantaneous intensity or in the frequency domain. The proposed method uses the latter approach. Sound intensity is calculated using the formula [21]:

I (ω) = \frac{1}{2 ρ r ω} \operatorname{Im} (P_{1} \cdot P_{2}^{*}),

(3)

where P_i is the Fourier transform of the pressure p_i, Im is the imaginary part of the complex spectrum, asterisk denotes the complex conjugation, and ω is the angular frequency. The advantage of the spectral approach is that sound intensity may be calculated for each spectral component independently.
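As an illustration of Equation (3), the following minimal sketch computes the intensity spectrum along one sensor axis from two pressure frames. It is not the authors' implementation: the air density value, the microphone spacing, the function names and the naive DFT (used here instead of an FFT library so that the example is self-contained) are all assumptions, and the result is left unscaled.

```python
import cmath
import math

RHO = 1.2    # air density in kg/m^3 (assumed value)
R = 0.01     # microphone spacing in m (assumed for illustration)


def dft(x):
    """Naive discrete Fourier transform; adequate for a short illustrative frame."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def intensity_spectrum(p1, p2, fs, rho=RHO, r=R):
    """Per-bin intensity Im(P1 * conj(P2)) / (2 * rho * r * omega), Equation (3)."""
    n = len(p1)
    spec1, spec2 = dft(p1), dft(p2)
    spectrum = [0.0]  # omega is zero at DC, so that bin is skipped
    for k in range(1, n // 2):
        omega = 2.0 * math.pi * k * fs / n
        cross = spec1[k] * spec2[k].conjugate()
        spectrum.append(cross.imag / (2.0 * rho * r * omega))
    return spectrum


# A 3 kHz plane wave that reaches microphone 1 first (microphone 2 lags).
n, fs, k0 = 64, 48000.0, 4
phase = 2.0 * math.pi * (k0 * fs / n) * (R / 343.0)  # 343 m/s: assumed speed of sound
p1 = [math.cos(2.0 * math.pi * k0 * t / n) for t in range(n)]
p2 = [math.cos(2.0 * math.pi * k0 * t / n - phase) for t in range(n)]
spectrum = intensity_spectrum(p1, p2, fs)
```

With this sign convention, the only significant component appears in bin k0 and it is positive; swapping the two microphone signals reverses its sign, which is what makes the per-bin direction estimate possible.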

2.2. Selection of Intensity Components Based on Their Direction

Sound intensity calculated from pressure measured by two microphones represents the flow of acoustic energy along the axis determined by the two sensors. If two identical pairs of omnidirectional microphones are placed on the orthogonal axes so that the middle points of both pairs are at the same location, a two-dimensional acoustic vector sensor (2D AVS) is obtained. Such a sensor is able to determine the azimuth φ of a sound source using sound intensities I_X, I_Y measured along the X- and Y-axes, respectively (Figure 2):

φ = \arctan (\frac{I_{X}}{I_{Y}}) .

(4)

If sound intensity is calculated in the frequency domain, the sound source azimuth can be calculated for the individual spectral components:

φ_{k} = \arctan (\frac{I_{X, k}}{I_{Y, k}}),

(5)

where k is the spectral bin index with central frequency f_k given by:

f_{k} = k \frac{f_{s}}{K},

(6)

where f_s is the sampling frequency, and K is the Fourier transform length (in this paper, sound intensity analysis is performed in the digital domain).

With the above equations, an “azimuth spectrum” may be computed, providing information on the sound source direction on a time-frequency plane. Total sound intensity may be calculated by averaging the frequency components within a frequency range defined by the bin indices 〈k_min, k_max〉:

I = \frac{1}{k_{\max} - k_{\min} + 1} \sum_{k = k_{\min}}^{k_{\max}} I_{k},

(7)

where I_k is the k-th bin of the sound intensity spectrum. This way, sound intensity limited to a specified frequency range can be calculated. The whole frequency range can also be divided into bands, and sound intensity in each band can be easily computed.

Another advantage of the approach presented here is that spectral components of the sound intensity signal may be selected according to their azimuth. Each spectral component is represented with a pair (I_k, φ_k). Therefore, the summation in Equation (7) may be supplemented with a condition:

I = \frac{1}{k_{\max} - k_{\min} + 1} \sum_{k \in B} I_{k},

(8)

where:

B = \{k \in \{k_{\min}, \dots, k_{\max}\} \mid φ_{\min} \leq φ_{k} \leq φ_{\max}\},

(9)

and the range of azimuth values that are of interest is defined by (φ_min, φ_max).

In practical situations, if the sensor is positioned so that the zero azimuth corresponds to the axis perpendicular to the road (Figure 2), sounds originating from the vehicle moving on the road will be limited to a specific azimuth range, for example, −20° to 20°. By limiting the sound intensity analysis to components originating from directions within the range of interest, it is possible to reduce the level of the unwanted sounds from the environment, thus increasing the signal-to-noise ratio for the sound intensity analysis.
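The selection of spectral components by azimuth (Equations (5), (8) and (9)) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function names are hypothetical, and `atan2` is used here instead of a plain arctangent so that components arriving from behind the sensor are not folded into the front half-plane.

```python
import math


def azimuth_deg(i_x, i_y):
    """Azimuth of one spectral component, measured from the Y-axis (Equation (5))."""
    # atan2 is an assumption made here to keep the full -180..180 degree range.
    return math.degrees(math.atan2(i_x, i_y))


def selected_intensity(i_x, i_y, k_min, k_max, phi_min, phi_max):
    """Per-axis band intensity using only bins with azimuth in [phi_min, phi_max]."""
    n_bins = k_max - k_min + 1
    sum_x = sum_y = 0.0
    for k in range(k_min, k_max + 1):
        if phi_min <= azimuth_deg(i_x[k], i_y[k]) <= phi_max:
            sum_x += i_x[k]
            sum_y += i_y[k]
    # Equation (8) keeps the full band width in the denominator.
    return sum_x / n_bins, sum_y / n_bins


# Five illustrative bins: three near the road axis, one from the side, one from behind.
i_x = [0.0, 0.1, -0.2, 2.0, 0.0]
i_y = [1.0, 1.0, 1.0, 0.1, -1.0]
band_x, band_y = selected_intensity(i_x, i_y, 0, 4, -20.0, 20.0)
```

In this toy input, the strong component arriving from about 87° and the one arriving from 180° are discarded, so only the three bins within ±20° contribute to the band intensity.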

2.3. Preliminary Results

Figure 3 shows the “azimuth spectrograms” calculated for two cases: the dry and the wet road surface. Each plot was calculated for two vehicles moving in opposite directions. The X-axis represents time, and the Y-axis represents frequency (limited to 9 kHz in the plots, as there are no important signal components above that frequency). Hue represents the sound source azimuth, and pixel brightness represents the sound intensity level. It can be clearly observed that in the case of a wet surface, there is a significant increase in the sound intensity for frequencies above 2.5 kHz compared to the dry surface state. However, the plots are difficult to interpret because of the presence of components with azimuth beyond the range of interest.

Figure 4 shows the same two cases, but the azimuth components within the range of 40 to 320 degrees were removed, leaving only the components with the azimuth values corresponding to the observed road segment. The two vehicles may now be clearly seen in the plot. Removal of the unwanted signal components allows the analysis to focus on signals produced by the vehicles near the sensor and to reduce the influence of other sound sources on the analysis results.

2.4. Acoustic Events Detection

Road surface state should be analyzed only if the sensor receives sounds originating from vehicles moving through the observed road section. The event detection procedure analyzes the intensity signals obtained from the sensor and extracts signal parts that likely contain vehicle sounds for further analysis. An acoustic event is defined by an increase in the sound intensity relative to the background noise level. A single acoustic event represents one or more sound sources, in this case, moving vehicles. The proposed algorithm does not require dividing an event into individual moving sound sources.

It is assumed that the sensor is oriented relative to the observed road as follows: the X-axis of the sensor is parallel to the road, and the Y-axis is perpendicular to the road (Figure 2). Sound intensity is calculated according to Equation (8) by limiting the azimuth range to values corresponding to the road section, where sound intensity from moving vehicles is sufficiently higher than the noise level, e.g., −40 to 40 degrees. These values should be selected based on the distance between the sensor and the road.

Total intensity I_XY in the horizontal plane (parallel to the ground) is a scalar value calculated from the intensities I_X, I_Y obtained from Equation (8):

I_{X Y} = \sqrt{I_{X}^{2} + I_{Y}^{2}} .

(10)

Sound intensity calculation is limited to the spectral bin range (k_min, k_max) corresponding to the frequency range of approximately 400 Hz to 4 kHz, which contains the sound intensity originating from vehicle tires making contact with the road surface [22]. Lower frequencies are discarded because they contain unwanted sound intensity components from engine noise, wind, environmental noise, etc.

The calculated total intensity values I_XY are smoothed with a filter to reduce the amount of noise in the analyzed signal. The presented method uses a moving average filter with an averaging time of ca. 300 ms.

The intensity I_n of the acoustic background (noise) is calculated using an exponential averaging filter with a long averaging time:

I_{n} [k] = α \cdot I_{n} [k - 1] + (1 - α) \cdot I_{a} [k - δ],

(11)

where α is the averaging factor, k is the sample index (not the spectral bin index used in the previous equations), δ is the update delay in samples, and I_a denotes the averaged (smoothed) total intensity. The value of α is related to the averaging time, which should be sufficiently large to smooth changes in the acoustic background level. Usually, α is close to one (0.98 to 0.998).

Detection of an acoustic event is based on the condition:

\bar{I_{X Y}} > I_{n} + m,

(12)

where m is a constant value of a detection margin. The background noise estimate I_n is updated only if the condition in Equation (12) is not fulfilled.
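The event detection logic of Equations (11) and (12) can be sketched as below. This is only an illustration under several assumptions: the input is a sequence of total intensity values in the horizontal plane (one per block), I_a is taken to be the moving-average output, the background estimate is initialised from the first block, and the default parameter values and function names are hypothetical.

```python
import math
from collections import deque


def total_intensity(i_x, i_y):
    """Magnitude of the horizontal intensity vector for one block."""
    return math.hypot(i_x, i_y)


def detect_events(i_xy, alpha=0.99, margin=2.0, delay=0, avg_len=31):
    """Return one flag per block: True while an acoustic event is detected."""
    window = deque(maxlen=avg_len)   # moving-average smoothing of I_XY
    smoothed = []
    noise = None                     # background estimate I_n
    flags = []
    for value in i_xy:
        window.append(value)
        smoothed.append(sum(window) / len(window))
        if noise is None:
            noise = smoothed[-1]     # initialisation from the first block (assumption)
        is_event = smoothed[-1] > noise + margin   # Equation (12)
        if not is_event:
            # Equation (11): the background is updated only outside events,
            # using the smoothed value delayed by `delay` blocks.
            past = smoothed[max(0, len(smoothed) - 1 - delay)]
            noise = alpha * noise + (1.0 - alpha) * past
        flags.append(is_event)
    return flags


# Quiet background, one pass-by, quiet background again (arbitrary units).
signal = [1.0] * 50 + [10.0] * 40 + [1.0] * 50
flags = detect_events(signal, alpha=0.99, margin=2.0, delay=0, avg_len=5)
```

Freezing the background update during an event keeps a long pass-by from raising the noise estimate, so the margin m stays meaningful for the next vehicle.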

Figure 5 presents an example of the acoustic event detection. The upper plot shows the averaged sound intensity in two axes defining the horizontal plane. Signal parts detected as acoustic events are marked with the dotted line boxes. The bottom plot shows the sound source azimuth. It can be observed that the azimuth changes smoothly during the acoustic events, while between the events, the azimuth changes are uneven and random. The detected events in this example represent single vehicles, except for the last one (at 82,000), which contains two vehicles (two peaks in the intensity and two separate monotonic segments in the azimuth plot).

2.5. Estimation of the Road Surface State

It is shown in Figure 3 and Figure 4 that the presence of water on the road surface causes an increase in the sound intensity for frequencies above 2.5 kHz. In order to evaluate this observation more accurately, intensity spectra for all acoustic events detected in the test recordings were calculated and averaged within two classes: the dry and the wet surface, according to the data from the reference sensor. The results are shown in Figure 6. It can be observed that in the frequency range up to about 1 kHz, there is no difference in spectra for both surface types. However, as the frequency increases, the spectrum for the wet surface exhibits a noticeably higher level than the dry surface spectrum. Therefore, sound intensity in the high-frequency range acts as a discriminating factor in the surface state analysis, while the low-frequency range intensity may be used as a normalizing factor.

Based on this observation, in the proposed method, the sound intensity is calculated in three separate frequency bands: I_1k, I_3k, I_4k. For each intensity signal frame, spectral components are selected according to their azimuth, as described in the previous sections. Individual spectral components are then time-averaged using a moving average filter (in the presented case, a filter with an averaging window spanning ca. 330 ms) to reduce the measurement noise present in the calculated sound intensity spectra. Next, the total intensity in each frequency band is calculated using Equation (8). The frequency ranges for each band and their corresponding spectral bin indices, calculated for the Fourier transform length K = 512, are given in Table 1.

Based on the observation regarding the difference of the average spectra for the dry and wet surfaces, the values I_3k and I_4k are used to discriminate between these two surface types, while I_1k serves as a normalizing factor. The proposed instantaneous surface state measure s_i, estimated from a single acoustic event, is calculated as:

s_{i} = \sqrt{(\frac{\bar{I}_{3k}}{\bar{I}_{1k}})^{2} + (\frac{\bar{I}_{4k}}{\bar{I}_{1k}})^{2}},

(13)

where the intensity values I_1k, I_3k, I_4k are averaged over the whole acoustic event.

This method of calculating s_i was chosen based on the analysis of the collected data. Other approaches, such as the one based on the spectral slope, were also tested, but they were less accurate than the one presented here. In the example presented in Figure 6, s_i = 0.03 for the dry surface and 0.071 for the wet surface.
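A minimal sketch of Equation (13) is given below. The function name and the input values are hypothetical; the inputs are assumed to be per-frame band intensities of a single acoustic event, already limited to the azimuth range of interest.

```python
import math


def surface_state(i_1k, i_3k, i_4k):
    """Instantaneous surface state measure s_i for one acoustic event (Equation (13))."""
    def mean(values):
        return sum(values) / len(values)

    reference = mean(i_1k)   # the low-frequency band acts as the normalising factor
    return math.hypot(mean(i_3k) / reference, mean(i_4k) / reference)


# Two frames of one event, arbitrary linear intensity units.
s_i = surface_state([1.0, 1.0], [0.03, 0.05], [0.03, 0.03])
```

Here the band averages are 0.04 and 0.03 relative to the reference band, so s_i evaluates to 0.05.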

Calculation of s_i is valid provided that the intensity spectrum has a shape similar to the ones presented in Figure 6, i.e., with a smooth, decreasing spectral envelope above 2.5 kHz. However, it is possible that the spectrum becomes distorted, e.g., by acoustic interference in the analyzed frequency bands. Such results should be discarded from the analysis. A spectral flatness measure is used for determining the validity of s_i:

sf = \frac{\exp (\frac{1}{k_{2} - k_{1} + 1} \sum_{k = k_{1}}^{k_{2}} \ln (L_{k}))}{\frac{1}{k_{2} - k_{1} + 1} \sum_{k = k_{1}}^{k_{2}} L_{k}},

(14)

where:

L_{k} = 20 \log_{10} (I_{k}) + 100,

(15)

k₁ and k₂ are frequency bins equal to k_min of I_3k and k_max of I_4k, respectively. If the spectral flatness is below the threshold (0.75 is used in the algorithm), the result is discarded.
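The validity check based on Equations (14) and (15) may be sketched as follows. The function name is hypothetical, and the sketch assumes that every I_k in the analyzed range is positive and large enough for the level L_k to be positive, since the logarithm in Equation (14) is otherwise undefined.

```python
import math


def spectral_flatness(intensity, k1, k2):
    """Ratio of the geometric to the arithmetic mean of the levels L_k (Equation (14))."""
    levels = [20.0 * math.log10(intensity[k]) + 100.0 for k in range(k1, k2 + 1)]
    if min(levels) <= 0.0:
        raise ValueError("levels must be positive for the geometric mean")
    geometric = math.exp(sum(math.log(level) for level in levels) / len(levels))
    arithmetic = sum(levels) / len(levels)
    return geometric / arithmetic


flat = spectral_flatness([1e-3] * 10, 0, 9)   # identical levels in every bin
uneven = spectral_flatness([1e-1, 1e-4], 0, 1)  # levels of 80 and 20
is_valid = uneven >= 0.75                      # threshold quoted in the text
```

A spectrum with identical levels yields a flatness of 1, while the two-bin example with levels of 80 and 20 yields 0.8; values below the 0.75 threshold would cause the corresponding s_i to be discarded.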

The s_i values are calculated for single acoustic events, and as such, they may be inaccurate. For example, the presence of a high-intensity acoustic distortion from the direction of the road may lead to a false positive result. However, in a typical scenario, multiple vehicles are present within an observation period. Therefore, the s_i values may be time-averaged to provide an improved surface state estimate. In the proposed method, a two-stage processing is performed. The first stage is realized with a median filter with a short window (e.g., five events), the purpose of which is to remove the outliers. The second stage is a standard moving average filter with a longer window (e.g., 11 events), which reduces noise in the computed values. A longer filter window provides better noise reduction at the cost of delaying the surface state change detection. After the filtering, the averaged surface state estimate s_a is obtained.
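The two-stage smoothing described above can be sketched as follows. The windows are assumed to be causal (only past events are used, as an online detector would require), and shorter windows are used at the start of the sequence; both choices, as well as the function names, are assumptions rather than details given in the text.

```python
from statistics import median


def causal_windows(values, length):
    """Yield, for each position, the last `length` values seen so far."""
    for n in range(len(values)):
        yield values[max(0, n - length + 1):n + 1]


def smooth_estimates(s_i, median_len=5, average_len=11):
    """Median filter (outlier removal) followed by a moving average (noise reduction)."""
    despiked = [median(window) for window in causal_windows(s_i, median_len)]
    return [sum(window) / len(window)
            for window in causal_windows(despiked, average_len)]


# A dry-surface sequence with one outlier, e.g. a noisy truck.
with_outlier = [0.03] * 20
with_outlier[10] = 0.5
s_a = smooth_estimates(with_outlier)

# A dry-to-wet transition: the averaged estimate follows with a delay.
step = smooth_estimates([0.03] * 15 + [0.08] * 30)
```

The single outlier is removed entirely by the median stage, while the step from 0.03 to 0.08 is reproduced only after several events, which illustrates the trade-off between noise reduction and detection delay.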

The final decision on the road surface state (dry/wet) is performed by comparing the s_a values with the threshold value. In the experiments presented here, the threshold was constant, and the selection of its value is discussed further in the paper. Adaptive threshold selection was considered (a dynamic threshold set by continuously estimating the background noise level and varying the threshold value according to the current noise estimate), but due to the complexity of its implementation, it was left for future research.

3. Experiments

3.1. Test Setup

Validation of the proposed method was performed in a real-world scenario using a custom-built AVS. The sensor was constructed from six omnidirectional, digital MEMS microphones (InvenSense INMP441 [23]), with sensitivity −26 dBFS (decibels relative to full scale), providing pressure signals sampled at 48 kHz, with 24-bit resolution, using the I²S protocol. Microphones were mounted on the sides of a cube with an edge length of 10 mm. The microphone signals were received by a microcomputer (Raspberry Pi 4) through an I²S-USB interface. The signals from the sensor microphones were recorded into 15-min-long files (six channels of uncompressed pressure data) and stored on a flash drive.

The test setup was installed in a rural area by the side of a straight section of a busy regional road with an even asphalt surface. The sensor was placed in a protective enclosure connected to a box containing the microcomputer and the power source (Figure 7). The sensor was mounted ca. 7.2 m away from the road edge at a height of 4.3 m.

A remote road surface state sensor Vaisala DSC111 [24], mounted above the AVS, was used as a reference device. This is a professional, certified device that measures the thickness of the water layer on the road using a spectroscopic sensor. Data from the reference sensor were recorded at 120 s intervals, and they constitute the ground truth data for the evaluation of the proposed method.

The data were collected during July 2023. The AVS recordings from nine non-consecutive days were selected for the analysis based on the presence of water on the road surface reported by the reference sensor. For each of the nine days, continuous 24-h recordings were analyzed. A total of 216 h of recordings were analyzed. The recorded signals were processed offline on a computer using Matlab and Python scripts. The detection algorithm was also implemented using Python scripts so that it can be used in the online mode, processing live signals and outputting the results.

The AVS was calibrated in an anechoic chamber by measuring impulse responses from each microphone. A correction function was calculated in the frequency domain for each microphone to equalize differences between microphones on each axis [20]. These correction functions were applied to the pressure signal spectra before the intensity signals were calculated.

For the detection of acoustic events, the analysis was limited to the frequency range 375 Hz to 4031.25 Hz, and the spectral components corresponding to the azimuth range −30 to 50 degrees were selected (the sensor was rotated by ca. 10 degrees relative to the road). During the surface state estimation, the azimuth range was further limited to −10 to 30 degrees (the range was selected based on the distance of the sensor from the road). The signals were analyzed in blocks of 512 samples at 48 kHz sampling frequency (block length 10.67 ms). The averaging filter length for the event detection and the intensity spectrum smoothing was 31 blocks (330.67 ms). The instantaneous surface state estimates were processed by a median filter with a length of 5 events, then by a moving average filter with a length of 11 events.

Short time frames of 512 samples were used to achieve good temporal resolution (10.67 ms) so that the acoustic events are analyzed as soon as possible. The frame length determines the frequency resolution of the spectral analysis of the intensity signals, which is equal to 93.75 Hz. The experiments showed that such frequency resolution is sufficient for the proposed algorithm. Frequency resolution may be improved by increasing the time frame at the cost of reduced temporal resolution.
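The timing and frequency figures quoted above follow directly from the sampling frequency and the block length, as the short check below shows. The bin indices 4 and 43 are not stated in the text; they are derived here from the quoted band edges and Equation (6), and the names are hypothetical.

```python
FS = 48_000        # sampling frequency in Hz
K = 512            # block (Fourier transform) length in samples
AVG_BLOCKS = 31    # averaging filter length in blocks


def bin_frequency(k, fs=FS, n_fft=K):
    """Centre frequency of spectral bin k (Equation (6))."""
    return k * fs / n_fft


block_ms = 1000.0 * K / FS
print(round(block_ms, 2))               # 10.67 (ms per block)
print(round(AVG_BLOCKS * block_ms, 2))  # 330.67 (ms averaging time)
print(FS / K)                           # 93.75 (Hz frequency resolution)
print(bin_frequency(4))                 # 375.0 (lower band edge, bin 4)
print(bin_frequency(43))                # 4031.25 (upper band edge, bin 43)
```

Doubling K to 1024 would halve the bin spacing to 46.875 Hz and double the block length to 21.33 ms, which is the trade-off between frequency and temporal resolution mentioned above.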

3.2. Results

A total of 24,170 acoustic events were detected in the test set, which means that, on average, about 112 events were detected during one hour of recordings (one event may contain multiple vehicles). The detected events were recorded as timestamped values of the averaged surface state estimates s_a. The ground truth data consisted of the water layer thickness measured by the reference sensor at 120 s intervals. In order to match these two datasets, the algorithm results s_a were resampled to the time intervals defined by the reference data using linear interpolation. The binary decision wet/dry was performed by comparing these values with a threshold. For the reference data, a threshold of the water layer thickness equal to 0.2 mm was used (according to the data recorded by the reference sensor, this threshold separates the “dry” and “wet” surface state classes). In the reference sensor data, 7.14% of readouts indicated a wet surface.
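The matching of the event-based estimates to the reference time grid can be sketched as follows. This is an illustrative implementation of linear interpolation, not the authors' script: the function name and the sample values are hypothetical, event timestamps are assumed to be strictly increasing, and holding the nearest value outside the covered time range is an assumption.

```python
from bisect import bisect_left


def resample_linear(event_times, s_a, ref_times):
    """Linearly interpolate event-based estimates at the reference timestamps."""
    resampled = []
    for t in ref_times:
        j = bisect_left(event_times, t)
        if j == 0:
            resampled.append(s_a[0])      # before the first event: hold (assumption)
        elif j == len(event_times):
            resampled.append(s_a[-1])     # after the last event: hold (assumption)
        else:
            t0, t1 = event_times[j - 1], event_times[j]
            weight = (t - t0) / (t1 - t0)
            resampled.append(s_a[j - 1] + weight * (s_a[j] - s_a[j - 1]))
    return resampled


event_times = [0.0, 100.0, 300.0]          # seconds, one entry per detected event
s_a = [0.02, 0.04, 0.10]                   # averaged surface state estimates
ref_times = [0.0, 120.0, 240.0, 360.0]     # reference readouts every 120 s
on_grid = resample_linear(event_times, s_a, ref_times)
wet = [value > 0.065 for value in on_grid]  # illustrative decision threshold
```

After resampling, each reference readout has exactly one algorithm value, so the two binary sequences can be compared sample by sample.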

The detection threshold for the evaluated algorithm was chosen by calculating the accuracy metrics (precision, recall and F₁ score) for different threshold values using the whole dataset. The resulting ROC (receiver operating characteristic) curves are shown in Figure 8. A threshold of 0.065 provided equal precision and recall values, and it was chosen as the decision threshold for the experiments. A different threshold value may be used if a higher precision (fewer false positive results) or a higher recall (fewer false negative results) is preferred.
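The threshold sweep can be sketched as below. The data are a small made-up example, not values from the experiments, and the function name is hypothetical; a wet surface is treated as the positive class.

```python
def detection_metrics(s_a, reference_wet, threshold):
    """Precision, recall and F1 score of the wet-surface decision for one threshold."""
    predicted = [value > threshold for value in s_a]
    pairs = list(zip(predicted, reference_wet))
    tp = sum(1 for p, r in pairs if p and r)
    fp = sum(1 for p, r in pairs if p and not r)
    fn = sum(1 for p, r in pairs if not p and r)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2.0 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data: averaged estimates and the matching reference decisions.
s_a = [0.03, 0.05, 0.07, 0.09, 0.06, 0.08]
reference_wet = [False, False, True, True, True, False]
sweep = {thr: detection_metrics(s_a, reference_wet, thr)
         for thr in (0.055, 0.065, 0.085)}
```

In this toy example, the middle threshold gives equal precision and recall (2/3 each), the lower threshold gives a recall of 1.0 with a precision of 0.75, and the higher threshold gives a precision of 1.0 with a recall of 1/3.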

Table 2 shows the results obtained from the analysis of the whole dataset using the proposed method. The main version of the evaluated algorithm (Alg. 1) is the proposed method based on sound intensity analysis, including the selection of spectral components with azimuth covering the observed road section. Two other approaches were evaluated for comparison. The purpose of these two algorithms is to evaluate the advantage of using the sound intensity analysis and the selection of spectral components based on the azimuth, as implemented in Alg. 1. The first method (Alg. 2) is also based on the sound intensity, but all spectral components are considered, regardless of their azimuth. The other method (Alg. 3) is the standard approach based on the sound pressure (signals from a single microphone) instead of the sound intensity. With this approach, determining the source azimuth is not possible. For each case, the threshold value was found using the procedure described earlier and rounded to the third decimal place. The results are discussed in the next section.

An example of the detection results and comparison with the reference data is presented in Figure 9. The upper plot shows the surface state estimates calculated by the algorithm: dots present the instantaneous values s_i for the individual acoustic events; the line shows the averaged s_a values. The bottom plot line shows the data from the reference sensor. Signal sections indicating wet surface (values above the threshold) are marked with colored boxes: purple for the algorithm detection and light brown for the reference data detection. Areas marked with dark brown color indicate parts where the wet surface was detected by both the algorithm and the reference sensor.

4. Discussion

The detection threshold in the algorithm was selected so that the precision and the recall are approximately equal, which also means that the F₁ score is the same. Hence, the term “accuracy” will be used to describe all three metrics. The proposed algorithm achieved ca. 89% accuracy in the detection of the presence of water on the road surface when compared with the data from the reference sensor. Given the complexity of the problem of estimating the road surface state using only audio signal analysis, that level of accuracy may be considered satisfactory. It should be noted that the surface state was evaluated at 120 s intervals. A false negative result does not mean that, e.g., rainfall was completely missed. Many of the false negative and false positive detection results were caused by the detection delay, i.e., the detector changing its state too late. Similarly, there were some false positive results that lasted only for a short time. It should also be noted that the reference sensor measures the water layer at one specific point on the road, while the proposed method analyzes a larger section of the road. This aspect may influence the comparison of the results after a rainfall, especially if the road surface is uneven or not uniformly exposed to sunlight.

The choice of the detection threshold for the evaluated algorithm was based on the equal precision and recall condition. In a real-life application, the threshold should be tunable so that a desired balance between the precision and the recall may be obtained. As expected, increasing the threshold improves the precision, reducing the risk of false positive results, but at the same time, it deteriorates the recall, increasing the risk of false negative results. Decreasing the threshold has the opposite effect. In a practical installation, reducing the risk of false negative results (a wet surface that is not reported) may be preferred, in which case the threshold value may be decreased. It should also be noted that the proposed method achieves an F₁ score above 76% for the whole range of the tested threshold values, 0.05 to 0.10 (Figure 8).

A plot of example results in Figure 9 shows that although the majority of the instantaneous surface state estimates s_i follow the surface state changes (their values increase as the surface becomes wet), there are some results that deviate from the trend. This is most likely caused by acoustic sources in vehicles that emit sounds not related to the tires in the analyzed frequency range, increasing the spectral intensity level and causing higher s_i values on a dry surface. Such cases occur mostly for larger vehicles (e.g., trucks) but only for some of them. The proposed procedure relies on the smoothing algorithm that filters out such cases from the results. Therefore, the algorithm works on the assumption that there is a sufficiently large number of the analyzed events available so that any result that deviates from the trend is discarded. This condition was fulfilled in the test setup. However, in case of very low traffic (e.g., one vehicle every five minutes), such result averaging is not possible, and the algorithm accuracy is expected to deteriorate. Figure 9 shows that water detection is more problematic during the night hours when the number of vehicles is significantly lower than during the day. Hence, in a practical application, a minimum number of events per observation period should be imposed, and the results obtained for a low number of events should not be reported. Additionally, the proposed algorithm provides a form of a “reliability” measure for the results by computing the standard deviation of the averaged instantaneous values.

From the analysis of the results plot (Figure 9), it should also be observed that the dispersion of the instantaneous values is significantly higher for the wet surface than for the dry surface. This means that although there is an increase in the intensity level for higher frequencies on the wet surface, as shown in Figure 6, the degree of the increase may depend on the vehicle size and weight, tire size and condition, etc. Therefore, the requirement of having a sufficiently large number of events for the analysis is even more important for a wet surface.

The filtering (smoothing) procedure is necessary to obtain the surface state values s_a suitable for the detection. Every online filtering procedure introduces a delay to the results. If the smoothing filter length is increased, a higher level of noise suppression is obtained, making the detection easier, but at the same time, it increases the detection delay. Such a delay is unwanted in the wet surface detection; a wet surface is expected to be reported as soon as possible. Therefore, relatively short filters were used in the experiments (a median filter of length 5 and a moving average filter of length 11) as a compromise. In practical applications, the filter length may be made tunable as a “detection latency” parameter.
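The effect of the filter lengths on the detection delay can be examined with a minimal sketch. The filter lengths (5 and 11) and the threshold (0.065, Alg. 1 in Table 2) come from the text; the causal windowing, the order of the two filters and the synthetic input are assumptions, not a description of the authors' code.

```python
import numpy as np

def causal_median(x, length=5):
    """Median over the current and the previous (length - 1) samples."""
    x = np.asarray(x, dtype=float)
    return np.array([np.median(x[max(0, i - length + 1): i + 1])
                     for i in range(len(x))])

def causal_moving_average(x, length=11):
    """Mean over the current and the previous (length - 1) samples."""
    x = np.asarray(x, dtype=float)
    return np.array([np.mean(x[max(0, i - length + 1): i + 1])
                     for i in range(len(x))])

def smooth(s_i, median_length=5, average_length=11):
    """Assumed order: median filter first, then moving average."""
    return causal_moving_average(causal_median(s_i, median_length),
                                 average_length)

# Synthetic estimates: dry (0.03) for 20 samples, then wet (0.09),
# with one outlier on the dry surface (e.g., a noisy truck)
s_i = np.array([0.03] * 20 + [0.09] * 20)
s_i[10] = 0.20
s_a = smooth(s_i)

threshold = 0.065                       # Alg. 1 threshold from Table 2
first_wet = int(np.argmax(s_a > threshold))
delay = first_wet - 20                  # samples between the change and its detection
print(f"detection delay: {delay} estimates")
```

The outlier is removed by the median filter, while the step change is reported only after several estimates; passing longer windows to `smooth` increases this delay, which is the "detection latency" trade-off discussed above.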

An important feature of the method based on sound intensity measured by an AVS is the ability to determine the azimuth (source direction) for every spectral component and to select only the components with the azimuth of the observed road section. If that function is omitted and the whole sound intensity spectrum is analyzed, the detection accuracy decreases by about 0.07 (Alg. 2 in Table 2). This result proves that the proposed method that limits the sound intensity analysis to the azimuth range of interest provides a significant increase in the wet surface detection accuracy.
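The component selection may be sketched as follows. The azimuth range of −40° to 40° is taken from Figure 4; the function name, the use of two orthogonal intensity components per spectral bin and the convention that 0° points along the positive x axis are assumptions for illustration.

```python
import numpy as np

def select_by_azimuth(i_x, i_y, az_min_deg=-40.0, az_max_deg=40.0):
    """Keep only spectral components arriving from the azimuth range of interest.

    i_x, i_y: intensity components per spectral bin along two orthogonal
    axes (assumed convention: azimuth 0 deg along the positive x axis).
    Returns the intensity magnitude with rejected bins set to zero,
    and the azimuth of every bin in degrees.
    """
    i_x = np.asarray(i_x, dtype=float)
    i_y = np.asarray(i_y, dtype=float)
    azimuth = np.degrees(np.arctan2(i_y, i_x))
    magnitude = np.hypot(i_x, i_y)
    keep = (azimuth >= az_min_deg) & (azimuth <= az_max_deg)
    return np.where(keep, magnitude, 0.0), azimuth

# Four example bins arriving from 0, 90, 180 and 45 degrees
selected, azimuth = select_by_azimuth([1.0, 0.0, -1.0, 1.0],
                                      [0.0, 1.0, 0.0, 1.0])
print(azimuth)
print(selected)
```

Only the first bin falls inside the range of interest; the remaining bins are zeroed before the surface state estimate is computed, which corresponds to the difference between Alg. 1 and Alg. 2 in Table 2.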

A comparison of the proposed method with a similar algorithm operating on the sound pressure signals, recorded with a single microphone in the AVS (Alg. 3 in Table 2), indicates that the method based on the sound intensity with component selection has significantly higher accuracy, by ca. 0.13, than the pressure-based approach. Even if the component selection is omitted, the method based on sound intensity has an accuracy higher by ca. 0.07 than the method based on pressure. Therefore, the road surface state estimation based on the sound intensity analysis provides significantly higher accuracy than the traditional state-of-the-art approach in which pressure signals from a single microphone are analyzed.

In the presented experiment, the sensor was analyzing traffic on a road with a single lane in each direction. The number of lanes and their direction are not important for the proposed method. The sound intensity decreases with the distance from the sensor. If a lane is too far from the sensor, the sound intensity becomes comparable with the noise level, and detection of the acoustic events is impossible. Therefore, positioning the sensor close to the road is preferred.

5. Conclusions

The proposed method of the detection of water on the road surface is built upon an observation that the presence of a water layer on the road changes the soundscape of the tire noise by increasing the sound intensity level in the frequency range above 1 kHz. The results of the experiments performed using the real-world recordings indicate that the proposed method has sufficient accuracy to be considered for practical applications, such as smart city systems, in which high-accuracy, certified sensors are not required. Compared with the reference sensor used in the experiments, an AVS may be realized as a low-cost, small and power-efficient device suitable for installation in multiple locations within a distributed monitoring system. The algorithm can be run in quasi-real time on a microcomputer with moderate processing power. To perform an accurate detection of the water layer on the road, a sufficient level of traffic intensity is required. The sensor used by the proposed method may also provide other important data related to traffic monitoring. From the event detection results calculated with the method described here, it is possible to obtain data on traffic intensity (coverage of the observation period with the detected events). Analysis of the sound intensity and the source azimuth may also be used for vehicle detection and counting.
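The traffic intensity measure mentioned above (coverage of the observation period with the detected events) could be computed as in the following sketch. The representation of events as (start, end) time pairs in seconds and the function name are assumptions; the paper does not define the exact formula.

```python
def event_coverage(events, period_start, period_end):
    """Fraction of [period_start, period_end] covered by detected events.

    events: iterable of (start, end) times; overlapping events are merged
    so that the same time span is not counted twice.
    """
    # Clip events to the observation period and sort them by start time
    clipped = sorted(
        (max(start, period_start), min(end, period_end))
        for start, end in events
        if end > period_start and start < period_end
    )
    covered = 0.0
    cur_start, cur_end = None, None
    for start, end in clipped:
        if cur_end is None or start > cur_end:
            # A gap: close the current merged interval and open a new one
            if cur_end is not None:
                covered += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            cur_end = max(cur_end, end)
    if cur_end is not None:
        covered += cur_end - cur_start
    return covered / (period_end - period_start)

# Three events in a 100 s period, two of them overlapping
print(event_coverage([(0, 10), (5, 15), (50, 60)], 0, 100))
```

A coverage value close to zero would also indicate the low-traffic periods in which the water detection results are less reliable.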

The experiments described in this paper were conducted to validate the proposed method. Only one specific test installation was available for the experiments. The proposed method should be tested further in other locations with different types of road surfaces, different traffic intensities, different seasons, etc. Such experiments are planned for the next stage of the research. It is expected that different conditions will require retuning of the algorithm parameters, mostly the detection threshold. One possible enhancement of the proposed algorithm is the addition of an automatic detection threshold selection based on the analysis of the noise level in the surface state estimates. This is a complex problem which requires separate research. Nevertheless, the results obtained from the test installation prove the validity of the proposed approach to the estimation of water presence on the road surface using only acoustic signals.

In this paper, an algorithmic approach to the problem was proposed. Certainly, the machine learning approach may also be considered. The sound intensity signals may be calculated as proposed here, and the average sound intensity spectrum, after the component selection based on their azimuth, may form an input vector for the machine learning algorithm, which replaces the surface state estimate calculation and the threshold decision. This approach requires collecting a much larger dataset than the one used in this paper. The authors plan to explore this method in future research. However, the algorithmic approach presented in this paper is simple, does not require high computing power, and provides good detection accuracy.

Author Contributions

Conceptualization, J.K.; methodology, J.K. and G.S.; software, G.S.; investigation, J.K. and G.S.; data curation, G.S.; writing—original draft preparation, G.S.; writing—review and editing, G.S. and J.K.; visualization, G.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Polish National Centre for Research and Development (NCBR) through the European Regional Development Fund entitled INFOLIGHT Cloud-Based Lighting System for Smart Cities under Grant POIR.04.01.04-00-075/19.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Biesse, F. Analysis of wet road usage with a driving safety concern. In Proceedings of the 21st VDA Technischer Kongress, Berlin, Germany, 14–15 March 2019; pp. 74–91.
Spitzhüttl, F.; Goizet, F.; Unger, T.; Biesse, F. The real impact of full hydroplaning on driving safety. Accid. Anal. Prev. 2020, 138, 105458.
Ramírez-Moreno, M.A.; Keshtkar, S.; Padilla-Reyes, D.A.; Ramos-López, E.; García-Martínez, M.; Hernández-Luna, M.C.; Mogro, A.E.; Mahlknecht, J.; Huertas, J.I.; Peimbert-García, R.E.; et al. Sensors for sustainable smart cities: A review. Appl. Sci. 2021, 11, 8198.
Alsina-Pagès, R.M.; Bellucci, P.; Zambon, G. Smart Wireless Acoustic Sensor Network Design for Noise Monitoring in Smart Cities. Sensors 2020, 20, 4765.
Mickiewicz, W.; Jabłoński, M.J.; Pyła, M. Calculation of Spatial Sound Intensity Distribution Based on Synchronised Measurement of Acoustic Pressure. In Proceedings of the 18th International Conference on Methods and Models in Automation and Robotics (MMAR), Międzyzdroje, Poland, 26–29 August 2013; pp. 696–700.
Mickiewicz, W.; Raczyński, M.; Parus, A. Performance analysis of cost-effective miniature microphone sound intensity 2D probe. Sensors 2020, 20, 271.
Szwoch, G.; Kotus, J. Acoustic detector of road vehicles based on sound intensity. Sensors 2021, 21, 7781.
Kotus, J.; Szwoch, G.; Czyżewski, A.; Kostek, B. Assessment of road surface state with acoustic vector sensor. J. Acoust. Soc. Am. 2022, 152, A193.
Abdić, I.; Fridman, L.; Brown, D.E.; Angell, W.; Reimer, B.; Marchi, E.; Schuller, B. Detecting Road Surface Wetness from Audio: A Deep Learning Approach. In Proceedings of the 23rd International Conference on Pattern Recognition (ICPR), Cancun, Mexico, 4–8 December 2016.
Kongrattanaprasert, W.; Nomura, H.; Kamakura, T.; Ueda, K. Application of Neural Network Analysis to Automatic Detection of Road Surface Conditions Utilizing Tire Noise from Vehicles. In Proceedings of the ICCAS-SICE, Fukuoka, Japan, 18–21 August 2009.
Shariff, K.K.M.; Ali, A.; Ab Rahim, S.A.E.; Ismail, Z.K. Wet Road Detection Using CNN with Transfer Learning. In Proceedings of the 12th Symposium on Computer Applications & Industrial Electronics (ISCAIE), Penang, Malaysia, 21–22 May 2022.
Shariff, K.K.M.; Zainuddin, S.; Ali, M.S.A.M. Detection of Wet Road Surfaces from Acoustic Signals Using Scalogram and Optimized AlexNet. In Proceedings of the 12th Symposium on Computer Applications & Industrial Electronics (ISCAIE), Penang, Malaysia, 21–22 May 2022.
Bahrami, S.; Doraisamy, S.; Azman, A.; Nasharuddin, N.A.; Yue, S. Acoustic Feature Analysis for Wet and Dry Road Surface Classification Using Two-Stream CNN. In Proceedings of the CSAI '20: 4th International Conference on Computer Science and Artificial Intelligence, Zhuhai, China, 11–13 December 2020; pp. 194–200.
Alonso, J.; López, J.M.; Pavón, I.; Asensio, C.; Areas, G. Platform for on-Board Real-Time Detection of Wet, Icy and Snowy Roads, Using Tyre/Road Noise Analysis. In Proceedings of the International Symposium on Consumer Electronics (ISCE), Madrid, Spain, 24–26 June 2015.
Kalliris, M.; Kanarachos, S.; Kotsakis, R.; Haas, O.; Blundell, M. Machine Learning Algorithms for Wet Road Surface Detection Using Acoustic Measurements. In Proceedings of the IEEE International Conference on Mechatronics (ICM), Ilmenau, Germany, 18–20 March 2019.
Wang, Z.; Zhan, J.; Duan, C.; Guan, X.; Zhong, Z.; Cao, Z. Road Surface Recognition Based on Vision and Tire Noise. In Proceedings of the 5th CAA International Conference on Vehicular Control and Intelligence (CVCI), Tianjin, China, 29–31 October 2021.
Akama, S.; Tabaru, T.; Shin, S. Bayes Estimation of Road Surface Using Road Noise. In Proceedings of the 30th Annual Conference of IEEE Industrial Electronics Society (IECON), Busan, Republic of Korea, 2–6 November 2004.
Fahy, F. Sound Intensity, 2nd ed.; E & F.N. Spon: London, UK, 1995.
Jacobsen, F. Sound intensity and its measurement and applications. Curr. Top. Acoust. Res. 2003, 3, 87–91.
Kotus, J.; Szwoch, G. Calibration of acoustic vector sensor based on MEMS microphones for DOA estimation. Appl. Acoust. 2018, 141, 307–321.
Chung, J.Y. Cross-spectral method of measuring acoustic intensity without error caused by instrument phase mismatch. J. Acoust. Soc. Am. 1978, 64, 1613.
Ballesteros, J.A.; Sarradj, E.; Fernández, M.D.; Geyer, T.; Ballesteros, M.J. Noise source identification with beamforming in the pass-by of a car. Appl. Acoust. 2015, 93, 106–119.
InvenSense: INMP441, Omnidirectional Microphone with Bottom Port and I2S Digital Output. Available online: https://invensense.tdk.com/wp-content/uploads/2015/02/INMP441.pdf (accessed on 5 July 2023).
Vaisala Remote Road Surface State Sensor DSC111 Datasheet. Available online: https://docs.vaisala.com/v/u/B210470EN-E/en-US (accessed on 5 July 2023).

Figure 1. Block diagram of the algorithm.

Figure 2. Sensor coordinate system relative to the road.

Figure 3. Sound source azimuth plots for: (a) dry road surface, (b) wet road surface. Pixel brightness represents the sound intensity level.

Figure 4. Sound source azimuth plots for: (a) dry road surface, (b) wet road surface, with components limited to the azimuth range of interest (−40° to 40°). Pixel brightness represents the sound intensity level.

Figure 5. Example of the acoustic event detection: sound intensity and the detected events (upper plot) and the sound source azimuth (lower plot). Signal parts detected as acoustic events are marked with the dotted line boxes.

Figure 6. Averaged sound intensity spectra for the dry and the wet road surface. Gray regions indicate the frequency bands used for the surface state estimation.

Figure 7. The AVS (a) and the test setup (b) used for the evaluation of the proposed method.

Figure 8. ROC curves for different detection thresholds in the evaluated algorithm. The vertical line shows the value of equal precision and recall.

Figure 9. Example of the analysis results, with two periods of rainfall. (Upper plot): surface state estimates s_i and s_a. (Bottom plot): reference data and the “wet surface” detection results. Dashed horizontal lines indicate the thresholds.

Table 1. Frequency ranges used for the surface state analysis and spectral bin ranges k calculated for K = 512 (sampling rate 48 kHz).

Band	f_min [Hz]	f_max [Hz]	k_min	k_max
I_1k	1125	1500	12	16
I_3k	2625	3000	28	32
I_4k	4125	4593	44	49
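The bin indices in Table 1 can be reproduced from the band edges with a short check. The sketch assumes that K = 512 denotes the transform length, so that the bin spacing is 48,000/512 = 93.75 Hz, and that indices are rounded to the nearest bin; both are interpretations of the table caption, not statements from the paper.

```python
FS = 48_000   # sampling rate [Hz], from the Table 1 caption
K = 512       # assumed to be the transform length (bin spacing FS / K)

def freq_to_bin(f_hz, fs=FS, k=K):
    """Index of the spectral bin nearest to the frequency f_hz."""
    return round(f_hz * k / fs)

# Band edges [Hz] copied from Table 1
bands = {
    "I_1k": (1125, 1500),
    "I_3k": (2625, 3000),
    "I_4k": (4125, 4593),
}

for name, (f_min, f_max) in bands.items():
    print(name, freq_to_bin(f_min), freq_to_bin(f_max))
```

Under these assumptions, the computed indices agree with the k_min and k_max columns of Table 1.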

Table 2. Results of the surface state estimation using three versions of the algorithm.

Parameter	Alg. 1	Alg. 2	Alg. 3
Analyzed signals	intensity	intensity	pressure
Component selection by azimuth	yes	no	no
Detection threshold	0.065	0.053	0.207
Number of the analyzed time points	6480	6480	6480
Number of “wet surface” time points	463	463	463
True positive results (TP)	414	382	351
False negative results (FN)	49	81	112
False positive results (FP)	53	83	113
Precision: P = TP/(TP + FP)	88.6%	82.1%	75.6%
Recall: R = TP/(TP + FN)	89.4%	82.5%	75.8%
F₁ score: F₁ = (2 · P · R)/(P + R)	89.0%	82.3%	75.7%
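The metrics in Table 2 follow directly from the TP, FN and FP counts, as the sketch below shows. The function and variable names are chosen for illustration; the counts are copied from the table and the formulas are those given in its row labels.

```python
def detection_metrics(tp, fn, fp):
    """Precision, recall and F1 score from the detection counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# (TP, FN, FP) for the three algorithm versions in Table 2
counts = {
    "Alg. 1": (414, 49, 53),
    "Alg. 2": (382, 81, 83),
    "Alg. 3": (351, 112, 113),
}

for name, (tp, fn, fp) in counts.items():
    p, r, f1 = detection_metrics(tp, fn, fp)
    print(f"{name}: P = {100 * p:.2f}%, R = {100 * r:.2f}%, F1 = {100 * f1:.2f}%")
```

The computed values agree with the table to within 0.1 percentage point; the F₁ differences between the versions (about 0.07 between Alg. 1 and Alg. 2, and about 0.13 between Alg. 1 and Alg. 3) are the accuracy differences quoted in the Discussion.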

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
