1. Introduction
Nowadays, odour pollution is one of the main causes of public complaints to local authorities [
1]. Several human activities, especially those related to waste treatment and disposal, can lead to the formation of unpleasant odours. Even though odour emissions are, in general, not harmful to human health, they often generate concerns among the people living in the proximity of these sources [
2,
3]. Therefore, odour is now considered a pollutant, and it has also been included in the Best Available Techniques (BAT) reference document for waste treatment plants (WTPs) among the pollutants to be monitored and controlled from this type of installation [
4]. Odour monitoring, which is applied whenever an odour nuisance at a sensitive receptor is expected or has been substantiated, can be carried out using analytical methods or sensorial approaches [
5]. Among analytical tools, Instrumental Odour Monitoring Systems (IOMSs), including electronic noses (e-noses), have been recently increasingly studied for air quality monitoring applications [
6,
7]. Several studies prove that, if properly trained, IOMSs are able to detect the presence or the absence of odours and discriminate between different odour emission sources [
8,
9]. The first studies describing the application of e-noses for the monitoring of odours at receptors for a limited duration date back to the early 2000s–2010s [
10,
11]. Recently, some scientific studies have also discussed the possibility of using e-noses for estimating odour concentration [
12,
13]. For more details on the applicability of e-noses for environmental monitoring, we refer to some very recent review papers on the topic [
6,
7]. Still, their application as real-time air quality monitoring tools for permanent installations is still far from being state-of-the-art. Nonetheless, in Italy, in the last 2–3 years, there have been some cases in which the Competent Authority has requested a WTP to realize a permanent installation of IOMS with the purpose of continuously analysing the ambient air and providing real-time identification of anomalous odour events. If properly working, this type of installation could act not just as an emission monitoring instrument but could represent an effective tool for odour mitigation, allowing the recognition of the causes of the odour events, thereby enabling the plant operators a prompt and targeted intervention and thus reducing the impact on the neighbourhoods.
As previously mentioned, despite its potentialities, the implementation of IOMS for continuous odour monitoring in real-life applications is still challenging [
14], and is mainly hindered by some critical aspects related to prolonged outdoor use in critical conditions. Such critical aspects include sensors’ drift over time and cross-sensitivity to atmospheric variables and non-odorous gases [
15,
16,
17]. Cross-sensitivity towards temperature and humidity is particularly critical for environmental odour monitoring, because such parameters may vary within a rather wide range of values, thereby negatively affecting the sensors’ outputs. As an example, typical relative humidity variations that may occur outdoors range from 20% to 100%, whereby some experimental studies have proven gas sensors typically used in e-nose systems (i.e., MOX sensors) to be sensitive to humidity variations of 1%.
For this reason, different humidity and temperature compensation strategies have been proposed in the literature [
18] such as interference compensation models [
19,
20,
21], methods based on separation models for environmental factors [
22] and methods based on hardware optimization [
23,
24]. To the best of our knowledge, such solutions have been studied and implemented in laboratory conditions, while the application of compensation strategies in real scenarios is still very limited. Indeed, laboratory experiments are carried out in a controlled environment, whereas when operating outdoors, variable background conditions, as well as the presence of unknown interferents, may affect sensors’ responses in an unpredictable way compared to what is observed in the lab, thus reducing the effectiveness of the compensation strategies developed in lab conditions.
In this context, this paper describes the implementation of a continuous odour emission monitoring system consisting of two e-noses at the fenceline of a WTP. Besides describing the experimental procedure involved in training and testing the IOMS in situ, the paper focuses on the development of a specific pre-treatment procedure to be applied to the monitoring data with the purpose of compensating oscillations of the e-nose signals due to temperature (T) and relative humidity (RH) variations. Two approaches, which differ in the definition of the reference baseline, have been investigated, and the efficacy of the two methods was evaluated in terms of classification and quantification performance achieved within specific field verification tests and compared with the ones obtained without applying any correction.
Furthermore, the paper reports the results of the IOMS monitoring carried out at the WTP fenceline for about one year. The frequency of odour events and the trend of the odour concentration at the monitoring sites were evaluated and correlated with other useful information regarding the plant operation, such as extraordinary maintenance activities, temporary shutdowns of the biogas upgrading section, etc., which could potentially be correlated with increased odour emissions, with the purpose of proving the possibility of using the IOMS network as an effective tool for odour management.
The novelty of this work is not related to the development of new algorithms, but rather to the whole procedure adopted for the realization of an effective odour monitoring system capable of providing accurate and useful information in a complex, real-life scenario.
2. Materials and Methods
2.1. Plant Description
The plant under investigation is a WTP that converts the organic fraction of municipal solid waste (OFMSW) into biomethane and high-quality compost (
Figure 1). After pre-treatments (plastic and sand separation), the OFMSW is mixed with water and transferred to anaerobic digestion tanks. Anaerobic digestion leads to the formation of biogas and digestate. The biogas is then refined to biomethane in an upgrading section, while the digestate is sent to an aerobic treatment section to produce high-quality compost.
In order to prevent odour emissions, OFMSW storage and treatments are carried out in closed sheds operated under a slight vacuum and equipped with ventilators collecting exhaust air to a biofilter unit, which is the only conveyed emission of the plant. However, fugitive emissions from sheds cannot be completely neglected.
Thus, the following odour sources were considered in the study: the fugitive emissions from the sheds used for the storage and pre-treatment of the OFMSW and from the buildings used for the maturation and storage of compost, fugitive biogas emissions from the digestor and the upgrading section, and the conveyed emissions from the biofilters treating the air sucked from the buildings containing the OFMSW and the compost.
2.2. Monitoring System
For this study, as hardware, we used two identical e-noses WT1 (WT1 1179 and WT1 1180) commercialized by Ellona (Tolouse, France) and one weather station Vantage Pro2, commercialized by Davis Instruments Corporation (Hayward, Charlotte, NC, USA). Each WT1 e-nose is equipped with an array of 7 sensors, comprising 4 semiconducting metal oxides (MOX) sensors, 2 electrochemical sensors specific for the detection of hydrogen sulphide (H2S) and ammonia (NH3) and 1 photoionization detector (PID) calibrated with isobutylene. The instruments are also equipped with 2 sensors for monitoring temperature and relative humidity. The weather station Vantage Pro2 is equipped with a group of sensors including temperature and humidity, rainfall meter and anemometer, which offers the possibility to acquire meteorological data with a frequency of 15 min. All the data acquired by the e-noses and the weather station can be visualized and downloaded through the Ellona web interface for further elaboration.
Data processing was implemented externally from the Ellona software v. 2.4.2., in order to develop the compensation models required by the specific application, as described in the following sections.
The installation points for the e-noses have been defined based on the results of a parametric dispersion modelling study, which was carried out with the purpose of evaluating a preliminary correlation between odour concentration at the plant fenceline and the odour concentration expected at the nearest receptors. From this analysis, the most impacted receptors were identified and the selection of the instruments’ installation points was defined to be representative of the emission plumes reaching the most impacted receptors most frequently.
Based on this preliminary study and other logistic considerations, it was decided to install the e-nose WT1 1180 at the Eastern boundary of the plant in the proximity of the biofilters and the entrance of the shed where the OFMSW is discharged and pre-treated before the anaerobic digestion process. The installation site of the WT1 1180 is also very close to the biogas upgrading section. The e-nose WT1 1179 was installed on the opposite side of the plant (Western boundary), i.e., on the rear of the waste receiving and treatment sheds, but close to the two digesters and the torch.
2.3. Electronic Nose Training
2.3.1. Data Acquisition
The main purpose of the e-nose training is to create a reference dataset (i.e., training set, TS) to be used for the implementation of models for classifying and quantifying the air analysed by the instruments during the monitoring phase. To carry this out, dedicated olfactometric campaigns are needed to characterize the plant odour emissions sources. Measurements should be carried out in different periods in order to account for both the intrinsic variability of the sources and the variability associated with the seasonality and different weather conditions. In this case, a total of 70 odour samples were collected directly at the odour sources of the plant as listed in
Section 2.1 and analysed by dynamic olfactometry (EN 13725:2022) [
25] to assess their odour concentration. Samples were collected over 5 olfactometric campaigns, conducted in different seasons between February 2022 and October 2022, and with different meteorological conditions.
In more detail, odour samples were collected in WTP sheds devoted to OFMSW treatment by means of a mechanical depression pump. Samples of exhaust gas from biofilters were collected by means of a static hood, which allows the isolation of a part of the biofilter surface and conveys the exhaust gas flux through the exit duct where the sample is taken by means of a mechanical depression pump. Finally, samples of fugitive biogas emissions were taken from the gas transfer pipeline from the digestor to the upgrading section, exploiting the overpressure of the gas flow for filling the gas sampling bag. In order to create a TS representative of conditions expected at the monitoring sites, characterized by lower concentrations than those measured at the emission sources, odour samples were diluted with odourless ambient air taken at the plant fenceline when odours were not perceivable by the operators.
After dilution, from the initial 70 samples collected directly at the odour sources, we obtained a total of 240 samples (
Table 1), with odour concentrations ranging from about 20 ou
E/m
3 to about 1000 ou
E/m
3.
In more detail, 196 samples collected during the first four campaigns were used to build the TS, whereas the 44 samples of the fifth campaign were used as external independent test sets for field performance verification, as described in
Section 2.4 (
Table 2).
Samples were presented to the e-nose directly in the field to account for background conditions at the monitoring sites and prevent bias in the TS. Odourless ambient air samples were also analysed to define the baseline of IOMS signals when no odour could be perceived. Sensor responses accounting for various analyses in the field were registered for the next steps of data processing.
2.3.2. Data Processing
The general scheme of the data processing procedure developed for this study is illustrated in
Figure 2. The details of each step will be discussed in the following sub-sections. After an initial pre-treatment aimed at normalizing the data and compensating potential interferences, different features from the sensors’ signal curves were extracted and selected by means of the Boruta algorithm [
26]. Selected features were used as input for principal component analysis (PCA) to conduct an exploratory analysis [
27,
28]. Then, a two-step machine learning model was implemented. In the first phase, a Support Vector Machine (SVM) [
29,
30] classification model was built to predict the class of odours detected at the fenceline. In the second step, a class-specific regression model based on Support Vector Regression (SVR) [
31] was developed to estimate the odour concentration. These algorithms were chosen because they proved to have a better performance on the specific dataset considered in this study compared to other classification algorithms (e.g., Random Forest and KNN) and other regression algorithms (e.g., PLS) [
32]. Different approaches were compared to find out the optimal data processing procedure for the specific case, as will be described in the following sub-sections.
Pre-Treatment and Normalization of the Sensors’ Resistances
As already mentioned, one of the main issues associated with the application of e-noses for environmental odour monitoring is their cross-sensitivity towards humidity and temperature variations. Such interference cannot be neglected in the case of outdoor installations, where considerable changes in temperature and humidity levels of the ambient air occur because of the day–night cycles, the season, and the variable meteorological conditions. All these variations cause alterations in e-nose sensor resistances comparable to those observed in the case of odour events. Thus, if not properly treated, they are likely to lead to misinterpretation of the monitoring results and overestimation of the odour detections.
For this reason, we decided to explore the possibility of applying a specific pre-treatment aimed at compensating interferences on e-nose sensor responses caused by temperature and humidity variations occurring during the monitoring, which could have been misclassified as odour events.
The proposed pre-treatment involves the normalization of raw sensor signals recorded during the monitoring versus a reference baseline, as reported in (1),
where
ynorm is the normalized value of resistance,
y is the raw resistance value of the sensor and
yref is the value of reference baseline. Two approaches for the definition of the reference baseline have been investigated: the first one uses as parameter
yref as the mean sensor resistance value of the different sensors exposed to odourless ambient air, while the second approach is characterized by the implementation of a variable ‘dynamic’
yref obtained through a regression model, which correlates the value of resistance in non-odorous ambient air with the values of absolute humidity (
Figure 3).
Figure 3 shows, as an example, how the electrical resistance of one MOX sensor of the e-nose, exposed to odourless air, changes with the moisture content of the analysed air. In more detail,
Figure 3 compares the MOX sensor baseline with the absolute humidity, expressed in g/m
3 and defined by (2),
where
AH is the absolute humidity,
RH is the relative humidity,
T is the temperature of the air expressed in K,
Rw is the specific constant of gases for water vapour (equal to 461.5 J/kg/K), and
Ps is the water vapour pressure evaluated using the empiric Equation (3) available on Perry’s Chemical Engineers’ Handbook [
33],
where
C1,
C2,
C3,
C4, and
C5 are empiric constants specific for the evaluation of the vapour pressure of water and
P is the pressure expressed in Pa.
The visual inspection of the plot in
Figure 3 points out the existence of a decreasing trend of the sensor’s baseline with increasing absolute humidity, thereby suggesting the possibility of finding a correlation between these two variables through a regression model.
The regression model we chose consists of a polynomial regression based on a formula such as the one reported in (4),
where the
a coefficients are the empiric constants obtained from the regression model and
AH is the absolute humidity value evaluated using the temperature and relative humidity registered by the instruments during the analysis. Regression models with different polynomial curves, up to the fifth grade, were tested for each sensor. The choice of the polynomial model was made mainly by a visual check by trying to select the polynomial curve with a lower degree that better approximates the sensor trend (e.g., first-grade polynomial in the case of a linear decreasing trend or third-degree polynomial in the case of an exponential-like trend).
It should be pointed out that this polynomial regression does not claim to have a physical meaning of the sensor’s change in resistance in consideration of the chemical reactions occurring on the sensor’s surface. However, since the interaction mechanism of water with the sensor surface is still not fully understood, especially in the case of complex mixtures with other chemical compounds, thus making mathematical modelling of the physical phenomena hardly applicable, the only possible way of “correcting” the effect of humidity on the sensors is to use an empirical regression based on the interpolation of experimental data obtained from sensor exposure to different humidity levels. It should be further noted that similar approaches (using polynomials) have been already proposed in the scientific literature [
34] to correct for humidity and temperature effects on MOX sensors.
Feature Extraction and Selection
After normalization of the sensors’ signals, feature extraction [
35], calculated on a 5 min basis, was applied on the TS to extract the numerical parameters useful for the discrimination of the different odour emission sources considered. This step requires the implementation of mathematical operations on the sensor curves to obtain a multidimensional vector with informative values to be used as input for classification and quantification. In this case, 10 features were been extracted for each MOX sensor and 1 feature was extracted for each electrochemical sensor and for the PID, for a total of 43 features extracted (
Table 3).
To select the best features that entail useful information allowing the discrimination of a certain type of odour, the data processing procedure includes a step of feature selection. Different types of feature selection algorithms can be applied [
36,
37,
38]. Here, we used Boruta [
26], which is an algorithm based on the Random Forest classifier allowing us to rank the features based on their contribution to the performance of classification/regression. Compared to other algorithms, which are based on finding the smallest feature set giving the best classification result, Boruta allows us to identify all the features relevant to the classification purposes.
As a result, eleven features were selected for WT1 1180, and thirteen for WT1 1179, respectively. These features comprise the minimum of the resistance (A), the area under the curve and the minimum of the EMA with alpha 0.1 (D,E) [
39], the difference between the maximum and minimum of the resistance (F), the minimum of the derivative of the resistance (G) and the maximum reading of the H
2S electrochemical sensor (L).
Exploratory Analysis
On the selected features, a principal component analysis (PCA) was applied to reduce data dimensionality and obtain a graphical visualization of the instrument’s capability to discriminate the different odour classes (
Appendix A). Also, the PCA allows us to identify possible odour samples acting as outliers, which are removed from the TS before the implementation of the classification and regression models. In this type of application, outliers are mainly attributable to sampling issues (e.g., contamination of the sampling lines, unpredictable variations in the sampled emission, etc.), which are not always clearly recognizable a priori, and thus are required to be identified through a graphic interpretation of the sample distribution in the so-called influence plot [
28].
Classification and Regression Models
Not all the samples analysed (
Section 2.3.1) were used for the implementation of the classification and regression models. As previously mentioned, the training models were developed using only 196 out of 240 samples, because the remaining 44 were collected later and left out as an external test set to assess the e-nose performance during operation on the field (as described in
Section 2.4). Moreover, ca. 10% of the training samples were identified as outliers and removed to optimize the TS. Furthermore, the PCA allowed us to observe similarities between some of the odour classes considered, which made them hardly discriminable. Therefore, the final training model was built based on three classes: ‘Air’, which is representative of the condition of absence of odour; ‘Biogas’, which refers to potential fugitive emissions from the safety valves on the biodigesters; and ‘Organic odour’, which refers to the fugitive emissions from the sheds used for the receiving and pre-treatment of the organic wastes and the building dedicated to the maturation and storage of the compost and the biofilters.
The odour pattern registered by the e-nose is different depending on the odour class. As a consequence, samples belonging to different odour classes with the same odour concentration will provoke a different sensor response pattern, because of the different chemical composition of the odour classes. For this reason, the construction of an effective quantification model should be based on the identification of the odour class, first. Also, the identification of the odour class is of particular importance for the plant managers, because it enables the identification of the causes of the odour events detected at the fenceline.
The models implemented for classification and regression are based on the non-linear SVM (Support Vector Machine) with a Gaussian radial basis function (RBF) kernel [
30]. As mentioned in the previous paragraphs, the score of the data processed by PCA is taken and used as input for the implementation of the classification models, whose outputs define the odour class of the gas analysed. In particular, for both e-noses, the first three principal components account for about 90% of the explained variance and so they were selected as input of the models.
In more detail, three classification models deriving from different data analysis procedures were compared: (1) Model CA—analysis of the raw sensor resistance without any pre-treatment; (2) Model CB—analysis of the sensor resistance after normalization of the signal as reported in Equation (1), with the parameter
yref fixed at the general sensor resistance value of the different sensors exposed to odourless ambient air; (3) Model CC—analysis of the sensors resistance after normalization with absolute humidity compensation of the signal as reported in Equation (1), with a variable
yref evaluated with the regression model expressed in Equation (4) at the absolute humidity measured during the analysis time window. The models obtained were optimized by means of internal 10 k-fold cross validation and to determine their performance an independent dataset was used for testing [
40].
Regarding odour quantification, two different strategies were compared: the first approach (Model QA) is based on the application of a global regression model trained using a training set comprising all samples without differentiation of the odour class; instead, the second approach relies on a classification of the odour analysed prior to regression (Model QB). In this case, different regression models are built based on the number of odour classes defined in the TS (
Figure 2). The aim of the comparison is to verify the assumption that the classification of the odour prior to its quantification can improve the accuracy of the odour concentration prediction.
The evaluation of the regression models’ performance was conducted by comparing the odour concentration predicted on the independent test set with the concentration obtained by the reference method, i.e., dynamic olfactometry, and then by applying a statistical analysis based on the Bland–Altman (B&A) model [
41,
42].
2.4. Performance Verification in the Field
During the monitoring phase, it is important to carry out specific field tests to evaluate the reliability of the instruments’ detections. Field performance testing is also one of the requirements of the recently published Italian technical norm UNI 11761:2023 [
43] on instrumental odour measurement. In this case, sampling for field performance testing was carried out as for the training phase, but in a separate campaign after the definition of the training models: gas samples were withdrawn from the different odour emission sources considered, analysed by dynamic olfactometry (in accordance with EN 13725:2022), diluted, and then analysed by the e-noses to verify the instrument accuracy in recognizing the odour class and quantifying the odour concentration.
The odour classification capability was assessed in terms of the Matthews Correlation Coefficient (MCC), which is a better indicator compared to the global accuracy when the classes of interest are not balanced [
44].
The Bland–Altman method was then applied to evaluate the odour quantification capability and compare the odour estimates by the e-noses with the reference method. This approach was chosen because dynamic olfactometry has a high measurement uncertainty (generally considered at about 50% [
13]), which makes a comparison using traditional calibration methods based on the correlation coefficient (r) unsuitable [
45]. For this reason, it is necessary to adopt “model comparison” methods, which enable us to compare outputs obtained from methods having comparable levels of uncertainty. The Bland–Altman method has been recently proposed as a suitable approach to provide quantitative information about the capability of IOMS to estimate odour concentrations [
13].
To properly apply the Bland–Altman method, it is first necessary to check whether the differences in the values calculated with the two methods are normally distributed. This can be evaluated, for example, by applying a Shapiro–Wilk test [
46] and verifying the resulting
p-value. If the
p-value is higher than 0.05, then the condition of normality cannot be rejected. It should be considered that, when dealing with odour concentration values, the agreement should be evaluated by comparing the differences in their logarithms.
The application of the Bland–Altman method provides the bias, which is the mean of the differences, and the Limits of Agreement (Lower LoA and Upper LoA), which define a range within which 95% of the differences between one measurement and the other are included.
Performance testing was mainly conducted with the purpose of comparing the different compensation strategies and the different regression models developed, finding the most suitable approach to develop a robust and accurate quantification model.
2.5. Monitoring Data Evaluation
After installation, the e-noses analysed the ambient air continuously, and the sensor resistance values were registered every 10 s. The monitoring phase lasted for about 1 year, resulting in a monitoring dataset of 73,685 observations.
These data were processed according to the data processing procedure described in
Section 2.3.2. In more detail, data acquired continuously were divided into time intervals of 5 min to extract features from the sensor curves and process them with the classification and regression models developed. This means that the monitoring system produces an output every 5 min in terms of odour class detected and estimated odour concentration.
Monitoring data were analysed with the purpose of obtaining information about the detection of odours at the fenceline and their origin. Data were analysed first with the purpose of evaluating the odour impact in terms of the ratio between all the ‘odour’ detections (i.e., measurements classified differently than ‘air’) and the total number of measurements. Then, the relative detections of the different odour classes and the odour concentration trends over time were evaluated as well, with the purpose of identifying the most critical odour classes causing the majority and/or more intense odour events.
4. Conclusions
The present study discusses the possibility of using e-noses to continuously monitor the odour emissions at the fenceline of a WTP by detecting the presence of odours, recognizing their provenance (odour source) and estimating the odour concentration. In more detail, this study focuses on the instrument training methods, thereby comparing the application of different training models.
First, the training involved extensive field data acquisition: 196 samples of known odour quality and concentration collected at the plant’s main odour sources over four different olfactometric campaigns conducted in different seasons were analysed by the instruments, opportunely diluted to reproduce odour concentration levels that are likely to be detected at the plant fenceline. Furthermore, other 44 independent samples collected on a fifth campaign were used for testing.
Then, the work involved the implementation and the comparison of different models for the compensation of ambient air humidity and temperature, which are known as main interferents in the application of e-noses for environmental monitoring. Results show that the implementation of a suitable corrective model of sensor responses to compensate for changes in absolute humidity of the sample significantly improves the odour classification performances, passing from accuracies of ca. 70% to 100%. These observations further highlight the importance of including of a suitable compensation strategy whenever the e-nose is used outdoors, with variable atmospheric conditions.
Finally, in this study, we also studied and compared different models for the estimation of the odour concentration. In more detail, we wanted to confirm the hypothesis that a ‘double-step’ model, including first the identification of the odour class and then applying a regression model specific for the recognized odour class, would be more effective than a more simplified regression model without prior identification of the odour class. Indeed, the ‘double-step’ model proved to provide more accurate estimations, resulting in reduced factors for the Limits of Agreement evaluated by the Bland–Altman method. Moreover, quantification performances achieved by the trained electronic noses turned out to be comparable with the quality criteria fixed by EN 13725:2022 for the reference method, i.e., dynamic olfactometry.
As a conclusion, the study proves that, if properly trained, by including a suitable signal pre-treatment method for compensating atmospheric variables and by implementing specific quantification models based on a preliminary identification of the odour class for regression, e-noses can be very effective for environmental odour monitoring, achieving classification accuracies above 97% and quantification performances comparable to those considered acceptable by the reference standard for dynamic olfactometry.
Regarding the final application, the most important result consists of the realization of an effective odour emission monitoring system, enabling the real-time detection of odour events and the identification of their sources. This is a key feature for plant managers for identifying the possible causes of anomalous odour emissions and enabling prompt intervention, thus reducing the odour impact on the neighbourhoods.
Future studies should focus on the optimization of the classification and quantification models developed on a longer monitoring period, thereby accounting for the well-known problem of sensor drift over time and developing specific strategies to counteract it.