Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning

Stefanis, Christos; Manisalidis, Ioannis; Stavropoulou, Elisavet; Stavropoulos, Agathangelos; Tsigalou, Christina; Voidarou, Chrysoula (Chrysa); Constantinidis, Theodoros C.; Bezirtzoglou, Eugenia

doi:10.3390/toxics13030217

Open AccessArticle

Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning

by

Christos Stefanis

^1,*

,

Ioannis Manisalidis

^1,2,

Elisavet Stavropoulou

¹,

Agathangelos Stavropoulos

³,

Christina Tsigalou

¹

,

Chrysoula (Chrysa) Voidarou

⁴

,

Theodoros C. Constantinidis

¹

and

Eugenia Bezirtzoglou

¹

Laboratory of Hygiene and Environmental Protection, Faculty of Medicine, Democritus University of Thrace, 68100 Alexandroupolis, Greece

²

Delphis S.A., 14564 Kifisia, Greece

³

School of Social and Political Sciences, University of Glasgow, Glasgow G12 8QQ, UK

⁴

Department of Agriculture, School of Agriculture, University of Ioannina, 47100 Arta, Greece

^*

Author to whom correspondence should be addressed.

Toxics 2025, 13(3), 217; https://doi.org/10.3390/toxics13030217

Submission received: 18 February 2025 / Revised: 4 March 2025 / Accepted: 14 March 2025 / Published: 16 March 2025

(This article belongs to the Section Air Pollution and Health)

Download

Browse Figures

Versions Notes

Abstract

Aviation emissions significantly impact air quality, contributing to environmental degradation and public health risks. This study aims to assess the impact of aviation-related emissions on air quality at Alexandroupolis Regional Airport, Greece, and evaluate the role of meteorological factors in pollution dispersion. Using machine learning models, we analyzed emissions data, including CO₂, NOx, CO, HC, SOx, PM_2.5, fuel consumption, and meteorological parameters from 2019–2020. Results indicate that NOx and CO₂ emissions showed the highest correlation with air traffic volume and fuel consumption (R = 0.63 and 0.67, respectively). Bayesian Linear Regression and Linear Regression emerged as the most accurate models, achieving an R² value of 0.96 and 0.97, respectively, for predicting PM_2.5 concentrations. Meteorological factors had a moderate influence, with precipitation negatively correlated with PM_2.5 (−0.03), while temperature and wind speed showed limited effects on emissions. A significant decline in aviation emissions was observed in 2020, with CO₂ emissions decreasing by 28.1%, NOx by 26.5%, and PM_2.5 by 35.4% compared to 2019, reflecting the impact of COVID-19 travel restrictions. Carbon dioxide had the most extensive percentage distribution, accounting for 75.5% of total emissions, followed by fuels, which accounted for 24%, and the remaining pollutants, such as NOx, CO, HC, SOx, and PM_2.5, had more minor impacts. These findings highlight the need for optimized air quality management at regional airports, integrating machine learning for predictive monitoring and supporting policy interventions to mitigate aviation-related pollution.

Keywords:

air pollution; environment; health; airport; transport; public health; gas emission; machine learning models

1. Introduction

The surge in global economic growth has undeniably bolstered various sectors; however, this prosperity has been accompanied by a grave environmental repercussion—escalating concentrations of pollutants in the atmosphere. Human activities associated with urbanization, industrialization, and economic development have significantly contributed to this burgeoning ecological crisis. While these advancements have propelled society forward, they have concurrently triggered the production of harmful pollutants, adversely impacting the environment and human health. The resulting air pollution has emerged as a multifaceted global concern, encompassing social, economic, political, and legislative dimensions [1,2].

In the United States, the acknowledgement of the pressing nature of air quality concerns is evident in the mandate for the US Environmental Protection Agency (EPA) to reassess the National Ambient Air Quality Standards (NAAQS) every five years. This regulatory obligation underscores recognizing the dynamic nature of air pollution challenges and the necessity for adaptive strategies to mitigate its adverse effects. Similarly, in Europe, air pollution stands as a significant threat to environmental health, precipitating respiratory ailments, cardiovascular diseases, and premature deaths among populations. The staggering statistics depicting the correlation between morbidity and mortality rates and air pollution are alarming. Approximately nine million deaths annually are linked [3,4,5,6,7].

WHO figures show that most of the world’s population (99%) inhales air that exceeds setting guideline limits containing high levels of pollutants [8]. Global air pollution has increased by 8% from 2008 to 2013, with low- and middle-income countries showing the highest urban air pollution levels [8]. Nevertheless, pollution levels in some European cities exceed the limit values for pollutant concentrations [6]. Emissions through transport, industrial facilities, fires, and storms due to climate issues contribute to environmental degradation and impact public and individual health [1,7].

The activity of the transport sector emits air pollutants and increases levels of air pollution [1]. There is mounting evidence that emissions from aviation grow faster than any other mode of transport [2]. Aircraft are releasing carbon dioxide (CO₂), carbon monoxide (CO), hydrocarbons (HC), nitrogen oxides (NO_x), suspended particulate matter (PM), and sulfur oxides (SO_x) [1,2]. Aviation emissions contribute as much to the global climate change [1]. However, air pollution is associated with numerous adverse health effects and many diseases [1].

COVID-19 restrictive measures were imposed internationally in the transport field to limit the spread of the virus. During this pandemic, restrictions and limited anthropogenic activities impacted the gloomy picture of air quality. Many studies were produced on this matter, reporting the reduction in air pollution during the pandemic due to the abovementioned measures [9,10]. In this vein, nitrogen oxide (NO_x) concentrations and particle concentrations (PM_2.5, PM₁₀) were significantly reduced, while ground-level ozone (O₃) levels rose [11,12]. Carbon monoxide (CO), as well as sulfur dioxide (SO₂), showed abatement during the restriction period, but it has not been steady [13,14].

However, population-limited mobility and social distancing policies led to a downward trend in COVID-19 cases due to the measures taken [10]. Evaluating the aftermath of the COVID-19 pandemic on society and the economy is essential to providing responses and adapting governmental measures and policies to be applied [15]. Thus, the United Nations globally engaged its 131 country teams serving 162 countries to support governments and develop effective public health preparedness and responsiveness policies against the COVID-19 pandemic [15].

Still, socio-economic changes are observed related to gender inequalities, as women’s work increased due to both childcare as well as remote professional work at home [16]. Vulnerable groups in society have been strongly affected by the COVID-19 pandemic [17] following a study by NIVEL (Netherlands Institute for Health Services Research). The study is based on records from general practitioners and data collected from the Statistics Netherlands (CBS) organization. Low-income families or disadvantaged social groups appeared to be more vulnerable. Also, mental health seems to be seriously affected in several population groups as well as in people having pre-existing mental health problems [17,18]. Subsequently, according to the WHO (World Health Organization), the COVID-19 pandemic caused discriminatory behavior due to the social stigma against several disadvantaged ethnic groups [19] and people affected by the SARS-CoV-2 virus.

The selection of the given airport was based on the traffic, configuration, and area of the airport, as well as the variety of aircraft types operating at the airport. It will also be compared with prevailing environmental conditions, air pollution, regional development, and public health policy-making. Although various prediction models have been proposed by scientists in the field [20], there is still a need for more accurate models to develop effective prevention and control strategies in cases where threshold values rise to unacceptable levels for public health.

Having calculated the pollutant emissions, obtaining an image of their concentration in the areas of interest will be appropriate in two ways: by atmospheric dispersion calculation models and on-site measurements. Thus, we can have snapshots of the concentration of pollutants in the atmosphere at a specific location and time.

As stated previously, our study aims to record the current air pollution in the airport and assess the factors that influence the existing management model, intending to optimize it through a decision support system. While there is a wealth of research on air pollution, there needs to be more information on air pollution issued by airports. The existing studies in our country have mainly focused on “Eleftherios Venizelos”, the largest airport in Greece, which is closer to the data of a standard European airport. The lack of data and data recording in the regional airports, especially those in Eastern Macedonia and Thrace, aroused our interest in the present study. However, the restrictive policy due to the pandemic offers us an ideal model for comparative studies of the impact of pollution on air quality levels.

To summarize, this study aims to assess the air pollution at Alexandroupolis Regional airport in Greece and the influence of meteorological parameters on the dispersion of pollutants related to air transport operations. Furthermore, this study applies machine learning techniques to develop a methodological approach for predicting air pollutants and identifying critical environmental conditions that affect air pollution. Specifically, it will be assessed whether aviation emissions contribute significantly to local air pollution, with NOx, CO₂, and PM_2.5 being the dominant pollutants. Furthermore, we assume that meteorological conditions may impact the concentration levels of contaminants. Lastly, we present various machine learning models intending to increase the predictive ability for estimating pollutant levels, offering a valuable tool for air quality management, mainly at regional airports.

2. Materials and Methods

As we stated previously, our interest was focused on a study of air traffic in a regional Greek airport of Eastern Macedonia and Thrace, which is the airport of Alexandroupolis (Figure 1).

The Alexandroupolis “Democritus” civil airport is approximately 7.0 km east of Alexandroupolis in Evros Prefecture, Thrace (Northeastern Greece). The airport pays tribute to the ancient atomic philosopher Democritus, who hails from Avdira near Xanthi in Thrace.

Completed in 2011, the airport comprises terminal and administrative buildings covering over 8500 m². Its coordinates place it at Latitude: 40°51′21″ North and Longitude: 25°57′22″ East. It comprises one terminal building, an administration building, a control tower, and a fire brigade station. At the same time, it holds a Category VI (6) rating for Airport Fire Fighting, providing four (4) parking positions tailored for medium-sized aircraft (http://www.ypa.gr/en/our-airports/kratikos-aerolimenas-alejandroypolhs-dhmokritos-kaald, accessed on 1 December 2023).

Our study was conducted from January 2019 to December 2020. Due to the containment measures, the impact of aircraft pollutants during the COVID-19 global pandemic, as a reduction in flight numbers, was registered. The Hellenic Civil Aviation Authority (CAA) collected all air traffic and fleet composition data.

Air pollutants from aircraft operations disperse based on several meteorological factors, including wind speed, temperature, and atmospheric stability. The dispersion patterns determine the extent to which pollutants such as NOx, CO, SOx, and PM_2.5 reach populated areas near airports. Wind direction and speed significantly affect the transport of contaminants, while temperature inversions can trap emissions near the ground, leading to higher exposure levels in nearby communities. Manisalidis (2023) highlights that exposure to these pollutants is directly linked to respiratory and cardiovascular diseases, increased hospital admissions, and long-term health complications [21].

The emissions have been calculated using each aircraft’s emission factors, following the standard LTO emission factor methodology and the analytical methodology incorporated in the EMEP/EEA Air Pollutant Emission Inventory Guidebook, which includes emissions released at ground level and up to an altitude of 3000 feet, following the International Civil Aviation Organization (ICAO) guidelines. Specifically, pollutants such as NOx, CO, and PM_2.5 were primarily assessed at ground level, where aircraft taxiing, takeoff, and landing emissions occur. However, some dispersion of pollutants into the lower atmosphere is expected, influenced by meteorological conditions such as wind speed, precipitation, temperature, and atmospheric stability [1]. By incorporating these factors, the study comprehensively assesses aviation emissions and their potential impact on local air quality.

Briefly, the emissions in this study were calculated using each aircraft’s emission factors according to the standard LTO emission factor methodology and the methodological approach described in the EMEP/EEA Air Pollutant Emission Inventory Guidebook [22]. The total emissions E_m,a,p,I of a given pollutant p from aircraft type I at airport a over a specific period T were estimated using the simplified approach:

E_m,a,p,I = 10⁻⁶ × EF_p,i × Δ_a,i

where:

E_m,a,p,I = Emissions of pollutant p from aircraft type i at airport a for time period T (t/T);
EF_p,i = Emission factor for pollutant p for aircraft type i (g/LTO);
Δ_a,i = Number of LTO cycles for aircraft type i at the airport a (LTO/T).

Factor 10⁻⁶ is applied to convert emissions from grams (g) to metric tons (t), ensuring compliance with international emission reporting standards. This activity-based approach ensures that emissions are estimated based on real-time aircraft operations, particularly within the Landing and Take-Off (LTO) cycle, which includes approach, taxi-in, taxi-out, takeoff, and climb-out up to 3000 feet. By adopting this standardized methodology, the study provides a robust and internationally recognized framework for evaluating aviation-related air pollution [23,24].

Meteorological parameters: Weather parameters, comprising monthly average temperature (temperature °C), rain (mm), average sunshine duration (INST), and maximum wind (Beaufort), were acquired from the nearby meteorological station (https://w1.meteo.gr/Gmap.cfm accessed on 1 April 2023). The automatic airport station measured all basic meteorological parameters in the area and represented the weather conditions and the respective climate data from the airport area.

The dataset consists of monthly aviation emissions and meteorological data collected for Alexandroupolis airport in 2019 and 2020. It contains the following fields: Year: the calendar year of data collection; Total Traffic: the total number of aircraft movements recorded monthly; Aircraft Type: the specific aircraft models operating during the month; Month: the corresponding month of data collection; CO₂ (kg): the total carbon dioxide emissions from aircraft operations; NO_x (kg): the nitrogen oxide emissions from aircraft engines; CO (kg): the volume of carbon monoxide emissions; HC (kg): hydrocarbon emissions from aircraft fuel combustion; SO_x (kg): sulfur oxide emissions attributed to aviation activities; Fuel Consumption (kg): the total fuel burned during operations; PM_2.5 (kg): fine particulate matter (PM_2.5) emissions from aircraft operations. Meteorological parameters (temperature, rainfall, sunshine, and wind speed) are monthly averages representing prevailing weather conditions; the emissions data are based on average emission factors for different aircraft types, calculated according to monthly total aircraft traffic.

The aviation emissions dataset consists of monthly cumulative values, meaning that for each pollutant (e.g., CO₂, NOx, PM_2.5), the total monthly emissions are recorded based on the sum of emissions from all aircraft operations within the month. Accordingly, the meteorological data consist of monthly averages, with temperature, wind speed, precipitation, and sunshine duration averaged over the corresponding month for 2019 and 2020. This distinction ensures emissions reflect the total aviation activity while meteorological data represent prevailing atmospheric conditions. To enhance clarity, Supplementary Materials Table S1 presents a sample of the dataset used in the study, demonstrating how emissions and meteorological parameters are structured for analysis.

This research’s set of measurements consists of 168 measurements for two years, 2019 and 2020. This dataset records aviation emissions as monthly cumulative values and meteorological parameters as monthly averages. As mentioned above, limitations in data availability and the operational characteristics of the border regional airport of Alexandroupolis limited the study period to two years. Unfortunately, the recording of meteorological data before this period presents some inconsistencies, making it difficult to ensure reliable long-term data. However, the choice of this period provides the study of the impact of the pandemic crisis on air traffic in 2020, namely pre-pandemic vs. pandemic emissions. In particular, the sharp decline in aviation activity due to restrictions imposed on air travel at national and international levels provides a unique opportunity to examine pre-pandemic emissions versus the evolution of the pandemic, offering valuable insights into how operational disruptions affect air quality.

This study provides a substantial snapshot of aviation-related emissions. Future research efforts should focus on a larger dataset, e.g., 2015–2025, to decipher long-term trends, the rate of air traffic recovery, and the corresponding impacts. However, such a large-scale study would require consistent methodologies for collecting all data over many years to ensure its comparability and reliability.

To summarize, 168 measurements were catalogued for all parameters, air pollutants from aviation emissions, and meteorological variables in 2019 and 2020. Descriptive statistics and the Pearson correlation coefficient were applied to the meteorological and pollutant variables at a 0.01 confidence level, except otherwise stated.

Data Description, Machine Learning Models, and Evaluation Metrics

Machine learning, a subset of Artificial Intelligence, focuses on granting computers the capacity to acquire the skills needed to execute particular tasks without explicit human programming. It revolves around creating models that can absorb knowledge from data and subsequently use it to make informed decisions or predictions when confronted with new available data (Figure 2) [25].

Linear Regression is a straightforward choice for basic predictive tasks, performing well on high-dimensional, sparse datasets. Decision trees are non-parametric models that efficiently navigate data using simple tests and are great for nonlinear decision boundaries. In regression, an ensemble of decision trees creates a combined Gaussian distribution prediction. Gradient boosting is a powerful technique for regression, incrementally building trees while minimizing error. It excels in handling complex problems with a stepwise approach. Bayesian inference aids data analysis and learning. Fields like medicine need to assess prediction uncertainty. Neural networks can be used for Regression, offering adaptability in modelling nonlinear functions, especially in complex scenarios [26,27,28,29,30,31].

In the study proposed here, the air pollution database for the city of Alexandroupolis, Greece, was considered, and an attempt was made to predict the emissions levels of PM_2.5 using various machine learning methods. Five basic algorithms, Bayesian Linear Regression, Boosted Decision Tree, Linear Regression, Decision Forest Regression, and Neural Network Regression, were used for regression analysis using machine learning methods. The machine learning algorithms used in this research are presented here briefly: Decision trees are non-parametric models that efficiently navigate data using simple tests and are great for nonlinear decision boundaries. In regression, an ensemble of decision trees creates a combined Gaussian distribution prediction. Gradient boosting is a powerful technique for regression, incrementally building trees while minimizing error. It excels in handling complex problems with a stepwise approach. Linear Regression is a straightforward choice for basic predictive tasks, performing well on high-dimensional, sparse datasets. Bayesian inference aids data analysis and learning. Finally, neural networks can be used for Regression, offering adaptability in modelling nonlinear functions, especially in complex scenarios (https://learn.microsoft.com/en-us/azure/machine-learning/component-reference/boosted-decision-tree-regression?view=azureml-api-2, accessed on 1 March 2024) [2,32].

Classical regression-based algorithms, especially machine learning ones like Decision Trees and Random Forest, have been widely applied in forecasting air quality levels and characteristics. Regarding predictor variables, three main categories were discerned: variables associated with pollutant concentrations, meteorological parameters, and variables about temporal and spatial characteristics [33]. In the same survey, PM_2.5 was the most predicted pollutant among the analyzed documents. Consequently, a combination of variables from the three categories mentioned above was chosen to anticipate PM_2.5 concentration in this research study. Figure 3 illustrates the research workflow of the proposed system.

The evaluation metrics, such as Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Relative Absolute Error (RAE), Relative Squared Error (RSE), and Coefficient of Determination (R²), were used to assess all the machine learning methods. Evaluation metrics are valuable tools for determining the performance of machine learning models, and they can be categorized into two groups. On the one hand, range-dependent metrics are used to compare different models on the same dataset. On the other hand, percentage metrics facilitate model comparisons independently of the dataset. Some of the most commonly used metrics in the analyzed studies in regression are R² and MAPE (20.1% each). RMSE/MSE and MAE are prevalent among the range-dependent metrics, appearing in 68.45% and 46.3% of the publications, respectively [34].

The robust evaluation framework of the models can be briefly explained as follows: A lower value signifies a more robust prediction ability regarding the mean absolute error. Likewise, a smaller value indicates more substantial predictive capabilities for root mean squared error. In the case of mean absolute error, better model performance is displayed by lower values. On the contrary, the coefficient of determination (R²) assesses the model’s predictive capacity, which is revealed to have higher values [34]. Supplementary Materials Figure S1 presents all modelling parameters extracted from the Microsoft Azure Studio Classic for regression machine learning algorithms.

Finally, it is also essential to understand the predictive ability of machine learning models, apart from their evaluation framework, through specific metrics as mentioned above. In this vein, the Permutation Importance method (PIM) was used to interpret the best machine learning model that will result [35,36]. The process’s function is simple and practically based on the fact that if a variable is essential for the model’s predictive ability, rearranging its values will significantly reduce its accuracy. On the contrary, if rearranging the values of the variable leaves the predictive ability of the model indifferent, then this variable does not significantly impact the predictive ability of the model. Furthermore, this method can be applied to any machine learning model, and thus, its application does not require retraining the model but only rearranging the input values. Consequently, the researcher is provided with a direct estimate of the importance of all features based on the modification in the model performance [37,38,39].

3. Results

As stated, all information about air traffic and fleet composition, depicted in Figure 4, was gathered from the Hellenic Civil Aviation Authority (CAA).

Figure 4 shows the aircraft that used Alexandroupolis airport in 2019 and 2020. The A320 had the largest percentage share, followed by the AT43, with 35% and 22%, respectively. The remaining aircraft types for 2019 shared percentages of 16%, 14% for the AT45 and AT 72, while the A319 and DH8D aircraft had percentages below 10%. A similar picture is presented in the percentage shares by aircraft type in 2020 at Alexandroupolis airport. Specifically, the A320 held the most significant percentage with 34%, while the AT72 share increased to 24%. The AT45 (15%) and AT43 (12%) aircraft types had slight changes compared to the previous year. The share of DH8D aircraft increased to 10%, and other aircraft accounted for 2%, indicating a slight diversification in the fleet composition.

Figure 5 illustrates the two pie charts representing the percentage contribution of the pollutants under study for 2019 and 2020.

In 2019, carbon dioxide had the most extensive percentage distribution, accounting for 75.5% of total emissions, followed by fuels, which accounted for 24%, and the remaining pollutants, such as NO_x, CO, HC, SO_x, and PM_2.5, had more minor impacts. In 2020, the same pattern is presented. Namely, carbon dioxide represents the most significant percentage of pollutants, while the rate of fuels, although decreasing compared to 2019, still has the second percentage distribution among them.

Furthermore, although the proportions of the remaining pollutants represent approximately the exact percentages, it is evident that an overall decrease is observed, which can be attributed to the sharp reduction in air transport and activity due to the restrictive measures imposed during the pandemic crisis. The most significant decrease was observed for hydrocarbons (62.2%), followed by PM_2.5 (35.4%), CO (33.4%), and CO₂ (28.1%), NO_x (26.5%), SO_x (25.3%), and fuel consumption, which decreased by 22.9%. The above highlights the significant influence of air traffic volume on emission levels and emphasizes the need for further mitigation strategies to control aviation-related pollution. In conclusion, carbon dioxide is the pollutant with the most considerable percentage contribution to emissions.

Figure 6 depicts the monthly comparison of pollutant emissions between 2019 and 2020 through seven bar graphs. Each graph describes the variation of a specific pollutant per month, with yellow representing 2019 emissions and orange representing 2020 emissions.

Figure 6 shows the monthly carbon monoxide emissions, with a lower concentration in 2020. The exact figure shows a decrease in carbon monoxide emissions in both years in 2020. Moreover, hydrocarbon emissions in 2020 are significantly lower, mainly at the beginning of the year. In Figure 6, which depicts the monthly emissions per year of nitrogen oxides, decreasing trends appear in 2020 compared to 2019. The PM_2.5 concentrations also decreased in 2020 compared to 2019, with the difference being less pronounced than in the other pollutants. Also, monthly sulfur oxide emissions show less change, decreasing in 2020. Finally, fuel emissions also follow a downward trend in 2020, indicating a decrease in consumption. Generally speaking, pollutant emissions are lower in 2020 due to changes in activities that affect fuel combustion and gas emissions, such as restrictive measures due to the COVID-19 pandemic (Supplementary Materials Figure S2 visualizes all monthly trends of pollutant emissions in 2019 and 2020).

Table 1 and Figure 7, respectively, give the descriptive statistics and the values of the Pearson correlation coefficient between the meteorological and pollutant variables. The correlations between meteorological parameters, the pollutant variables, and the respective descriptive statistics are presented in the table below (Table 1).

The above table shows the descriptive characteristics of the measurements for the variables that determine air pollution and the measurements of the collected meteorological parameters. It shows the range, minimum, and maximum values of the parameters, the average, the variance, the standard deviation, and the standard error of the variable’s values.

Nitrogen oxides (NO_x) show a wide range of values and high standard deviations, indicating large fluctuations in the recorded values of their presence. Hydrocarbons (HC) and sulfur oxides (SO_x) show variability in their concentrations, with lower values. Regarding PM_2.5, occasional peaks in their presence are observed from the values of their descriptive statistics. Regarding fuel and carbon dioxide emissions, fuel consumption has the most extensive range (0–48904.8) and an average value of 9026.7, indicating significant fuel consumption and usage.

From the recording of meteorological parameters, the average temperature appears to have a value of 16.9 °C, and precipitation has an average value of 45.45 mm, indicating fluctuating weather conditions. The total traffic based on the recorded flights reaches an average of 16, with an average sunshine duration of 228 min. In conclusion, the high variation of pollutants such as NO_x, CO, and CO₂ indicates the seasonal variation of flights. Concurrently, the relatively low levels of PM_2.5 values against the background of the fluctuation in the values of meteorological parameters (wind, rain) indicate the variability in the dispersion levels of pollutants.

Figure 7 outlines the visual representation of the Pearson correlation coefficients between different pollutant concentrations (e.g., NO_x, CO, SO_x, PM_2.5, CO₂) and meteorological parameters (e.g., mean temperature, rain precipitation, wind speed). Briefly, the red tones imply positive correlations (closer to +1), the blue tones indicate negative correlations (closer to −1), while white or light colors demonstrate weak or absent correlations between the variables.

Strong positive correlations are shown between CO₂ and NO_x, SO_x and NO_x, and fuel with CO₂ and NO_x, possibly due to familiar sources of fuel combustion and air traffic emissions. Fuel consumption also contributes to carbon dioxide emissions. Finally, PM_2.5 is strongly associated with NO_x and SO_x.

Conversely, moderate correlations between total traffic, PM_2.5, and CO₂ are shown, as more air traffic leads to increased suspended particles and carbon dioxide emissions.

Rainfall shows negative correlations with PM_2.5 and sunshine duration since it is known that rainfall reduces the concentrations of these particles. In conclusion, the meteorological parameters, temperature, and wind speed do not significantly affect pollutant concentrations, with the corresponding Pearson correlation coefficients ranging at levels that indicate weak or no correlation. In terms of statistical significance, airplane fuel consumption and total traffic appear to be the primary drivers of air pollution, significantly affecting the effect of emissions of the above pollutants.

In Figure 8, the comparison of the proposed algorithms is represented. Bayesian Linear Regression and Linear Regression performed better than the other algorithms. These two had almost the same Coefficient of determination metric (R²) score. Linear Regression had the lowest value of Relative Squared Error. In contrast, the Bayesian Linear Regression algorithm had the lowest value for the metrics Mean Absolute Error, Root Squared Error and Relative Squared Error. The remaining algorithms show mixed trends in their evaluation metrics. Some excel in one metric and others in another.

The extant algorithms exhibit disparate trends in their respective evaluation metrics, manifesting prowess in distinct domains. Notably, Neural Network Regression encountered suboptimal error rate performance while attaining a commendable Coefficient of Determination value. This is evident in its elevated values across multiple error metrics, including Mean Absolute Error, Root Mean Squared Error, Relative Absolute Error, and Relative Squared Error.

Conversely, the Boosted Decision algorithm demonstrated moderate performance across most metrics, except the Coefficient of Determination, where it ranked second-lowest compared to its algorithmic counterparts. In a final analysis, the Decision Forest Regression emerged as the third-best performer among the ensemble of algorithms under consideration. This conclusion is substantiated by its superior performance in Absolute Error, Root Mean Square Error, Relative Absolute Error, Relative Mean Square Error, and Coefficient of Determination compared to the remaining machine learning algorithms, as illustrated in Figure 8.

Figure 9 shows the effect of features on the prediction of PM_2.5 levels for the Bayesian Linear Regression model. The analysis is based on the Permutation Importance Method, and the importance of the features is depicted in two different ways.

Panel A visualizes the features’ importance at their variation level in predicting PM_2.5 levels. Each point shows the degree to which it contributes to the model. Thus, it is noted that carbon monoxide, NO_x, FUEL, CO₂, SO_x, HC, and TOTAL TRAFFIC concentrations have different levels of influence. The TOTAL TRAFFIC feature shows the most negligible dispersion, in contrast to CO and NO_x, which show the most considerable variability. Figure 9B shows the mean value importance of the features. TOTAL TRAFFIC emerges as the most critical factor influencing the prediction of PM_2.5 levels, followed by the concentrations of HC, SO_x, CO₂, and FUEL pollutants. On the contrary, CO shows minor importance, which demonstrates that it is not a critical factor for the change in PM_2.5 levels.

4. Discussion

The European Green Deal prioritizes addressing air pollution, recognizing its critical impact on public health and the environment. Proactive measures aim to pave the way for all European residents’ healthier and cleaner future. The risks posed by air pollution are severe, contributing significantly to respiratory illnesses, cardiovascular complications, and premature mortality. To combat this pressing issue, the European strategy revolves around comprehensive actions, including reducing transport, industry, and agriculture emissions [40]. In addition, air quality has begun to be investigated in terms of its impact on mental disorders, with studies attempting to elucidate the role of PM_2.5, for example, concerning the development of depression, schizophrenia, anxiety, and bipolar disorder [41].

Furthermore, it extends to enforcing stringent air quality standards, advocating for cleaner technologies, and promoting sustainable practices across various sectors. Moreover, the European Green Deal underscores the significance of collaborating internationally and forging partnerships fostering a unified global effort against air pollution (https://ec.europa.eu/commission/presscorner/detail/en/ip_22_6278 (Last accessed 24 December 2023), https://environment.ec.europa.eu/topics/air_en (Last accessed 24 December 2023)).

In the present study, the Bayesian and Linear Regression models yielded high metric performances to predict PM_2.5 pollutants related to aviation emissions, with R² of 0.96 and 0.97, respectively. This shows high accuracy when considering the concentration of other pollutants and meteorological factors.

Also, the above prediction models effectively captured the impact of aviation activity NO_x and CO₂ emissions, with Pearson correlation coefficients of 0.92 and 0.89, respectively (Figure 7). This highlights the critical role of aviation activity and the corresponding fuel consumption in the concentration of these pollutants. This finding also aligns with the existing literature on the impact of aviation activity on the concentration levels of various pollutants [42,43,44].

It should also be emphasized that the predictive ability of the models is captured in the right direction with the actual measurements of a sharp reduction in emissions in 2020, the year of the start of the pandemic crisis. Specifically, the 28.1% reductions for CO₂ and 26.5% for NO_x (Figure 5) also reflect the actual picture of the reductions resulting from the corresponding air traffic reduction during this period. Other studies which evaluated the impact of the pandemic on air activity and air quality have confirmed such a pattern [45,46].

The impact on air quality due to lockdown restrictions has been observed, and concentrations of air pollutants have decreased significantly during the pandemic, mainly due to reduced anthropogenic activities. In Greece, a significant drop in urban air pollution has also been reported, for example, a reduction in NO emissions of up to 78% in urban stations and by 45% at Athens International Airport, while NO₂ levels decreased by 73% in the two largest cities of Greece, Athens and Thessaloniki. This decrease is also due to the general reduction in aviation activity and, by extension, emissions. Lastly, it was observed that pollutants such as NO₂ showed a sharp decrease. In contrast, pollutants such as PM_2.5 and PM₁₀ showed more variable trends influenced by meteorological variables such as wind dispersion and dust transport [47,48].

Despite machine learning models’ high predictive ability, the models do not take into account small deviations that should be attributed to factors such as meteorological fluctuations and local pollution sources. These are generally the recommendations for improving air forecast models, namely incorporating more variables that reflect weather conditions [49,50,51].

Artificial intelligence models for forecasting environmental pollution are a previously introduced concept. Investigations into employing artificial intelligence in the context of atmospheric pollution have experienced a notable surge since 2017. Within the domain of air pollution, machine learning models, with a specific emphasis on regression techniques, stand out as widely adopted approaches for scrutinizing and deciphering the distributions of air pollutants, mainly when focusing on PM_2.5 concentration and its implications for public health [52].

Another study [53] compared several algorithms (MLR, KNN, M5P, RF, SVM, or MLP) to predict various pollutant concentrations in Valencia, Spain. Notably, RF achieved the highest accuracy [53]. Ameer et al. (2019) performed a similar comparison involving four models (RF, DT, MLP, Boosting) to predict PM_2.5 levels in several Chinese cities, with RF demonstrating superior accuracy [54]. Li et al. (2019) pitted Logistic Regression against RF for forecasting AQI in California, and RF emerged as the more accurate predictor [55]. Pasupuleti et al. (2020) considered three algorithms (RF, DT, MLR) to forecast the concentration of various pollutants in Spain, with RF again showing the highest accuracy [56].

In a different context, Kaur Bamrah et al. (2020) compared various regressor methods (MLP, RF, DT, and SVR) for predicting AQI in India, incorporating terrain features [57]. In these studies, RF consistently achieved the highest accuracy. Yarragunta et al. (2021) compared six regression algorithms (DT, KNN, SVR, MLR, RF, and Naive Bayes) to predict AQI in Delhi, and the DT algorithm secured the highest accuracy in this particular case [58]. Chakradhar Reddy et al. (2021) conducted a comprehensive comparison of six supervised ML models (LR, RF, DT, SVR, KNN, and Naive Bayes) for forecasting AQI in New Delhi [58], and the results indicated that DT achieved notably high accuracy, approaching 100% [59,60].

In this current research, Bayesian Linear Regression and Linear Regression algorithms were the most accurate. Particulate matter concentrations, specifically PM_2.5, are predominantly influenced by pollution emissions and prevailing weather conditions [61]. Over four years, Kou et al. (2021) [61] scrutinized the meteorological impact on PM_2.5-related air quality in China between 2016 and 2019, utilizing a high-resolution atmospheric composition reanalysis dataset [62]. The correlation between weather patterns and air quality was further investigated. The results indicated that, in tandem with China’s stringent enforcement of its clean air policy from 2016 to 2019, meteorological conditions played a constructive role in enhancing air quality [20]. In a separate investigation, Alpan and Sekeroglu (2020) employed machine learning algorithms to predict six pollutant levels, integrating meteorological data such as precipitation and temperature [62]. The Random Forest algorithm demonstrated a high predictive capability across two distinct datasets. The authors asserted that accurate forecasts of pollutant concentrations could be achieved solely by utilizing meteorological data [63].

Ambient air pollution is a significant global health concern, contributing to over 3 million premature deaths worldwide, with Low- and Middle-Income Countries (LMICs) bearing the majority of this burden. In these countries, facing air pollution levels classified as public health hazards, megacities resort to emergency measures like red alerts and vehicle-rationing interventions (VRIs). Even during interventions, both cities experienced increased cardiopulmonary mortality, emphasizing the need for short- and long-term strategies to manage the health impacts of air pollution [64].

Analyzing the dynamics behind fine particulate matter (PM_2.5) and ozone (O₃) pollution across key regions in China, extensive studies employed the Weather Research and Forecasting/Community Multiscale Air Quality (WRF/CMAQ) system from 2013 to 2019. The model demonstrated high accuracy, evaluating against observed pollutants in significant areas like the North China Plain, Yangtze River Delta, Pearl River Delta, Chengyu Basin, and Fenwei Plain, slightly overestimating PM_2.5 in one region. Notably, nitrate (NO₃⁻) and ammonium (NH₄⁺) emerged as vital PM_2.5 components in heavily polluted zones. This analysis highlighted negative correlations between PM_2.5 and O₃ in most areas, underscoring the model’s ability to simulate China’s long-term air quality trends, which is crucial for effective emission control strategies [65].

Furthermore, understanding pollutant emission sources is crucial for effective mitigation. Air quality data from urban, suburban, industrial, and rural areas in Jining, Shandong Province, China, were compared for characteristics and health risks associated with air pollutants. Variances in PM_2.5, PM₁₀, SO₂, NO₂, and CO concentrations between 2017 and 2018 were observed, with O₃ concentrations increasing. Functional areas exhibited similar seasonal variations and diurnal patterns, with O₃ contributing significantly to exposure excess risks (ERs). Premature deaths attributable to air pollutants were calculated, highlighting O₃ as the significant contributor. Pollution transport from industrial areas to urban and suburban regions played a crucial role in determining air quality, emphasizing urgent measures to reduce O₃ pollution, particularly considering the prevalent ozone formation regime in industrial areas [66].

Air pollution and climate change exhibit intricate interdependencies, where climate fluctuations impact air pollution dynamics and vice versa. This relationship is complex, with emissions of air pollutants affecting climate through radiative forcing and climate changes altering the physical, chemical, and biological processes linked to air pollution. High-pressure weather conditions tend to be associated with elevated PM_2.5 and O₃ levels. Seasonally, PM_2.5 concentrations are higher during the winter, while O₃ concentrations are higher during the summer [67]. Uncertainties persist despite recognizing these interactions, requiring deeper insights to comprehend their mechanisms and consequences. Additionally, the co-emission of greenhouse gases (GHGs) with air pollutants suggests the potential for synergistic mitigation strategies. Yet, the existing literature needs an in-depth understanding of these co-benefits [68].

Notably, research has shown a link between long-term exposure to PM_2.5 and child mortality, with studies confirming this pattern in countries in Asia, Africa, and Latin America and an additional decline in living standards due to air pollution [69,70,71,72]. Given the impact of air pollution on the levels and severity of respiratory diseases, especially among elders and children, machine learning methods were applied to link air pollutants, seasonal variation, and climate data. A study in Taizhou, China, utilized various machine-learning models, including Linear Regression, Random Forest (RF), AdaBoost, and Neural Networks, to investigate the relationship between air pollutant concentrations and pediatric respiratory diseases. The findings reveal significant seasonal fluctuations in both the numbers of pediatric respiratory outpatients and the concentrations of air pollutants. NO₂, CO, particulate matter (PM₁₀ and PM_2.5), and outpatient numbers peak during the winter, indicating a substantial impact of air pollution on pediatric respiratory diseases. Regression models demonstrate that ML methods capture clinic visit trends and turning points, with nonlinear models outperforming their linear counterparts. Notably, the RF model emerged as the most effective [73].

The burden of air pollution is disproportionately related to factors such as age and gender. An additional study showed that short-term exposure to air pollutants, mainly gaseous pollutants such as NO₂ and CO, is linked with an escalated risk of hospital visits for AD in a city in southern China with low pollution concentrations. The age group of women between the ages of 45 and 64 seems to be most affected, providing evidence that the level of air pollution may be a risk factor, even for anxiety disorders [74].

The influence of meteorological factors such as wind is significant for the dispersion of pollutants, specifically PM_2.5, and air masses at a local level. The fluctuation of air masses at a seasonal level can also affect the transport of these particles and alter the level of air quality [75]. In our research, four meteorological parameters were considered when estimating air quality in Alexandroupolis. Meteorology, atmospheric reactivity, and emissions at the regional level are, among other factors, the most contributing factors in the temporal variability of PM_2.5 concentration and air quality, as revealed in a study that applied statistical methods to consider PM_2.5 daily measures and meteorological parameters in India [26]. The airport in Alexandroupolis primarily caters to domestic flights and experienced minimal alterations in flight frequency and fleet composition from 2019 to 2020. Nevertheless, in both years, there was a notable rise in emission concentrations—including fuel, NO_x, CO, HC, SO_x, and PM—during the summer and New Year seasons, coinciding with increased travel activity.

Temperature is also essential to CO emissions, as low temperatures reduce aircraft fuel evaporation due to inefficient combustion, resulting in increased carbon monoxide emissions. Humidity was also positively correlated with the above aircraft exhaust emissions [74,75,76].

Comparing the present study with other approaches to emissions and pollutants at other airports in Greece, with a different methodological approach, it was found that NO₂ concentrations exceeded regulatory limits by almost 30% of the cases under specific meteorological conditions. In the same study, although the PM₁₀ and SO₂ concentration levels were within limits for air quality standards, in the present study, the maximum value recorded for PM_2.5 was 5.6 and for SO_x was 41, indicating that in smaller and regional airports, there is a different dynamic and distribution of these pollutants, possibly due to, among other things, local emissions from aircraft, ground vehicles. Finally, although the approach in the above study involves static dispersion modelling, the machine learning models here offer a slight advantage because they consider the real-time estimation of pollutant levels based on changing meteorological conditions and airport activity. This dynamic capability benefits proactive air quality management, while dispersion models mainly provide ex-post emission assessments [77,78].

In closing, we will also refer to a study that concerns the impact of lockdowns during the pandemic crisis at the two largest airports in Greece. What was observed is that at the airport of the capital of Greece, Athens, NO₂ and CO concentrations decreased by 45% and 30%, respectively, highlighting the dominant role of air transport and aviation in urban air pollution. Although there are no data yet for Alexandroupolis airport to make a comparison of pollutant concentrations before, during, and immediately after the pandemic crisis, we can speculate that other phenomena, such as extreme weather phenomena such as the dust transport observed in Greece and agro-industrial activities around the airport, may also contribute to the sources of atmospheric air pollution at airports and PM [47,48,79,80].

5. Limitations

The present study on applying machine learning algorithms for predicting PM_2.5 concentrations in Alexandroupolis bears certain methodological constraints. Notably, the temporal scope of the investigation, limited to a relatively abbreviated period, suggests the potential for enriched insights through an extension across multiple years. A refinement of the study’s comprehensiveness could be achieved by incorporating a broader array of meteorological variables and epidemiologically relevant medical data, thereby enhancing the contextual richness of the predictive modelling.

Despite these acknowledged limitations, this study represents a seminal contribution as one of the initial endeavors to systematically examine the interplay between air pollutant concentrations, meteorological parameters, and the influence of aircraft flights. Understanding these intricate relationships is pivotal in devising effective mitigation measures and policies.

6. Conclusions

This work advances our understanding of air quality and pollution dynamics. It establishes a precedent for subsequent empirical inquiries in Greece, laying the groundwork for more comprehensive studies in the emerging field of air pollution research. The perspectives from this investigation contribute to the growing body of knowledge at the intersection of environmental science, air pollution, epidemiology, meteorology, climate change, and public health.

Analysis and prediction of air pollution levels at airports are essential topics in atmospheric and environmental research due to air pollution’s impact on human health and quality of life. Predicting the maximum concentration of the above parameters in the atmospheric air is of great importance for controlling and improving the quality of the atmosphere. The ultimate goal is the sustainable development of the airport region concerning public health issues.

Due to the increased choice of aircraft as a means of transport and the growth of the aviation industry, aircraft emissions are skyrocketing to be ignored, and policymakers, including environmental, legal, regulatory, and public health aspects, may propose practical strategies for minimizing the effects of global warming and climate changes on health.

This research advocates for a dynamic approach to deploying effective policies and strategies to underscore the imperative of sustaining prevention and control measures for air pollution in airport environments. By addressing the specific challenges associated with air quality management in airports, the study aims to contribute to developing comprehensive and adaptable measures for controlling and mitigating pollution in these settings.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/toxics13030217/s1: Figure S1. Modeling parameters are extracted from the Microsoft Azure Studio Classic for regression machine learning algorithms. Figure S2. Monthly trends of pollutant emissions in 2019 and 2020: (a) CO; (b) CO₂; (c) HC; (d) NO_x; (e) PM_2.5; (f) SO_x; (g) Fuel. Table S1. Dataset sample.

Author Contributions

Conceptualization, I.M. and E.B.; methodology, T.C.C., C.S. and C.T.; writing—original draft preparation, E.S.; writing—review and editing, C.T. and C.V.; visualization, A.S.; supervision, T.C.C., E.B. and C.T.; project administration, T.C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

This work was supported by the Master’s program in “Food, Nutrition and Microbiome” of the Medical School, Democritus University of Thrace, Greece.

Conflicts of Interest

Author Ioannis Manisalidis was employed by the company Delphis S.A. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Manisalidis, I.; Stavropoulou, E.; Stavropoulos, A.; Bezirtzoglou, E. Environmental and Health Impacts of Air Pollution: A Review. Front. Public Health 2020, 8, 14. [Google Scholar] [CrossRef] [PubMed]
Liu, J.; Attari, M.U.; Akhtar, M.; Amin, M.; Yu, Z.; Janjua, L. Investigating the Impact of Transport Services and Renewable Energy on Macro-Economic and Environmental Indicators. Front. Environ. Sci. 2022, 10, 916176. [Google Scholar] [CrossRef]
Di, Q.; Dai, L.; Wang, Y.; Zanobetti, A.; Choirat, C.; Schwartz, J.D.; Dominici, F. Association of Short-term Exposure to Air Pollution with Mortality in Older Adults. JAMA 2017, 318, 2446–2456. [Google Scholar] [CrossRef] [PubMed]
European Environment Agency. Air Pollution. Available online: https://www.eea.europa.eu/themes/air (accessed on 21 September 2022).
Cromar, K.R.; Gladson, L.A.; Hicks, E.A.; Marsh, B.; Ewart, G. Excess Morbidity and Mortality Associated with Air Pollution above American Thoracic Society Recommended Standards, 2017–2019. Ann. Am. Thorac. Soc. 2022, 19, 603–613. [Google Scholar] [CrossRef]
Samek, L. Overall Human Mortality and Morbidity Due to Exposure to Air Pollution. Int. J. Occup. Med. Environ. Health 2016, 29, 417–426. [Google Scholar] [CrossRef]
Briz-Redón, Á.; Serrano-Aroca, Á. The Effect of Climate on the Spread of the COVID-19 Pandemic: A Review of Findings, and Statistical and Modelling Techniques. Prog. Phys. Geogr. Earth Environ. 2020, 44, 591–604. [Google Scholar] [CrossRef]
World Health Organization. First Global Conference on Air Pollution and Health. Available online: https://www.who.int/news-room/events/detail/2018/10/30/default-calendar/air-pollution-conference (accessed on 21 September 2022).
Baldasano, J.M. COVID-19 Lockdown Effects on Air Quality by NO₂ in the Cities of Barcelona and Madrid (Spain). Sci. Total Environ. 2020, 741, 140353. [Google Scholar] [CrossRef]
Ji, H.; Tong, H.; Wang, J.; Yan, D.; Liao, Z.; Kong, Y. The Effectiveness of Travel Restriction Measures in Alleviating the COVID-19 Epidemic: Evidence from Shenzhen, China. Environ. Geochem. Health 2022, 44, 3115–3132. [Google Scholar] [CrossRef]
Bekbulat, B.; Apte, J.S.; Millet, D.B.; Robinson, A.L.; Wells, K.C.; Presto, A.A.; Marshall, J.D. Changes in Criteria Air Pollution Levels in the US Before, During, and After COVID-19 Stay-at-Home Orders: Evidence from Regulatory Monitors. Sci. Total Environ. 2021, 769, 144693. [Google Scholar] [CrossRef]
González-Pardo, J.; Ceballos-Santos, S.; Manzanas, R.; Santibáñez, M.; Fernández-Olmo, I. Estimating Changes in Air Pollutant Levels Due to COVID-19 Lockdown Measures Based on a Business-as-Usual Prediction Scenario Using Data Mining Models: A Case Study for Urban Traffic Sites in Spain. Sci. Total Environ. 2022, 823, 153786. [Google Scholar] [CrossRef]
Filonchyk, M.; Hurynovich, V.; Yan, H. Impact of COVID-19 Lockdown on Air Quality in Poland, Eastern Europe. Environ. Res. 2021, 198, 110454. [Google Scholar] [CrossRef] [PubMed]
Chen, Z.; Hao, X.; Zhang, X.; Chen, F. Have Traffic Restrictions Improved Air Quality? A Shock from COVID-19. J. Clean. Prod. 2021, 279, 123622. [Google Scholar] [CrossRef]
United Nations Development Programme. Socio-Economic Impact of COVID-19. Available online: https://www.undp.org/coronavirus/socio-economic-impact-covid-19 (accessed on 19 October 2022).
Munir, K.A. Inequality in the Time of Coronavirus. J. Manag. Stud. 2021, 58, 607–610. [Google Scholar] [CrossRef]
National Institute for Public Health and the Environment (RIVM). Vulnerable Groups in Society Have Been Hit Harder by the COVID-19 Pandemic. Available online: https://www.rivm.nl/en/news/vulnerable-groups-in-society-have-been-hit-harder-by-covid-19-pandemic (accessed on 19 October 2022).
Uphoff, E.P.; Lombardo, C.; Johnston, G.; Weeks, L.; Rodgers, M.; Dawson, S.; Seymour, C.; Kousoulis, A.A.; Churchill, R. Mental Health Among Healthcare Workers and Other Vulnerable Groups During the COVID-19 Pandemic and Other Coronavirus Outbreaks: A Rapid Systematic Review. PLoS ONE 2021, 16, e0254821. [Google Scholar] [CrossRef] [PubMed]
World Health Organization. A Guide to Preventing and Addressing Social Stigma Associated with COVID-19. Available online: https://www.who.int/publications/m/item/a-guide-to-preventing-and-addressing-social-stigma-associated-with-covid-19 (accessed on 19 October 2022).
Lin, L.; Liu, X.; Zhang, T.; Cao, Y. A Prediction Model to Forecast Passenger Flow Based on Flight Arrangement in Airport Terminals. Energy Built Environ. 2023, 4, 680–688. [Google Scholar] [CrossRef]
Manisalidis, I. Comparative Analysis of Air Pollutant Emissions in Greek Airports and Its Impacts on Public Health. Doctoral Thesis, Democritus University of Thrace, School of Health Sciences, Department of Medicine, Alexandroupolis, Greece, 2023. [Google Scholar]
EMEP/EEA Air Pollutant Emission Inventory Guidebook 2023. Available online: https://www.eea.europa.eu/en/analysis/publications/emep-eea-guidebook-2023 (accessed on 1 March 2025).
Zou, R.; Wang, B.; Wang, K.; Shang, W.L.; Xue, D.; Ochieng, W.O. A pathway to sustainable aviation: Modeling aircraft takeoff mass for precise fuel consumption and aircraft emission calculations. Energy 2025, 319, 135074. [Google Scholar] [CrossRef]
Zhu, C.; Hu, R.; Liu, B.; Zhang, J. Uncertainty and its driving factors of airport aircraft pollutant emissions assessment. Transp. Res. Part D Transp. Environ. 2021, 94, 102791. [Google Scholar] [CrossRef]
Dataversity. A Brief History of Machine Learning. Available online: https://www.dataversity.net/a-brief-history-of-machine-learning/ (accessed on 8 November 2023).
Noorbakhsh-Sabet, N.; Zand, R.; Zhang, Y.; Abedi, V. Artificial Intelligence Transforms the Future of Healthcare. Am. J. Med. 2019, 132, 795–801. [Google Scholar] [CrossRef]
Pyayt, A.L.; Mokhov, I.I.; Lang, B.; Krzhizhanovskaya, V.V.; Meijer, R.J. Machine Learning Methods for Environmental Monitoring and Flood Protection. Int. J. Comput. Inf. Eng. 2011, 5, 549–554. [Google Scholar] [CrossRef]
Hino, M.; Benami, E.; Brooks, N. Machine Learning for Environmental Monitoring. Nat. Sustain. 2018, 1, 583–588. [Google Scholar] [CrossRef]
May, T.O.; Livas-García, A.; Jiménez-Torres, M.; Cruz May, E.; López-Manrique, L.M.; Bassam, A. Artificial Intelligence Techniques for Modeling Indoor Building Temperature Under Tropical Climate Using Outdoor Environmental Monitoring. J. Energy Eng. 2020, 146, 04020004. [Google Scholar] [CrossRef]
Masood, A.; Ahmad, K. A Review on Emerging Artificial Intelligence (AI) Techniques for Air Pollution Forecasting: Fundamentals, Application, and Performance. J. Clean. Prod. 2021, 322, 129072. [Google Scholar] [CrossRef]
Hashimoto, D.A.; Witkowski, E.; Gao, L.; Meireles, O.; Rosman, G. Artificial Intelligence in Anesthesiology: Current Techniques, Clinical Applications, and Limitations. Anesthesiology 2020, 132, 379–394. [Google Scholar] [CrossRef] [PubMed]
Sun, S.; Lu, H.; Tsui, K.L.; Wang, S. Nonlinear Vector Auto-Regression Neural Network for Forecasting Air Passenger Flow. J. Air Transp. Manag. 2019, 78, 54–62. [Google Scholar] [CrossRef]
Méndez, M.; Merayo, M.G.; Núñez, M. Machine Learning Algorithms to Forecast Air Quality: A Survey. Artif. Intell. Rev. 2023, 56, 10031–10066. [Google Scholar] [CrossRef]
Mogoș, R.I.; Petrescu, I.; Chiotan, R.A.; Crețu, R.C.; Troacă, V.A.; Mogoș, P.L. Greenhouse Gas Emissions and Green Deal in the European Union. Front. Environ. Sci. 2023, 11, 1141473. [Google Scholar] [CrossRef]
Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation importance: A corrected feature importance measure. Bioinformatics 2010, 26, 10. [Google Scholar] [CrossRef] [PubMed]
Tian, Z.; Wei, J.; Li, Z. How Important Is Satellite-Retrieved Aerosol Optical Depth in Deriving Surface PM_2.5 Using Machine Learning? Remote Sens. 2023, 15, 3780. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Fisher, A.; Rudin, C.; Dominici, F. All Models are Wrong, but Many are Useful: Learning a Variable’s Importance by Studying an Entire Class of Prediction Models Simultaneously. J. Mach. Learn. Res. 2019, 20, 177. [Google Scholar]
Miller, T. Explanation in artificial intelligence: Insights from the social sciences. Artif. Intell. 2019, 267, 1–38. [Google Scholar] [CrossRef]
Li, X.; Zhang, X. A Comparative Study of Statistical and Machine Learning Models on Carbon Dioxide Emissions Prediction of China. Environ. Sci. Pollut. Res. 2023, 30, 117485–117502. [Google Scholar] [CrossRef]
Bai, Y.; Liang, X.; Xia, L.; Yu, S.; Wu, F.; Li, M. Association between air pollutants and four major mental disorders: Evidence from a Mendelian randomization study. Ecotoxicol. Environ. Saf. 2024, 283, 116887. [Google Scholar] [CrossRef]
Gao, Z.; Mavris, D.N. Statistics and Machine Learning in Aviation Environmental Impact Analysis: A Survey of Recent Progress. Aerospace 2022, 9, 750. [Google Scholar] [CrossRef]
Brodzik, Ł.; Prokopowicz, W.; Ciupek, B.; Frąckowiak, A. Minimizing the Environmental Impact of Aircraft Engines with the Use of Sustainable Aviation Fuel (SAF) and Hydrogen. Energies 2025, 18, 472. [Google Scholar] [CrossRef]
Wang, J.; Zu, L.; Zhang, S.; Jiang, H.; Ni, H.; Wang, Y.; Zhang, H.; Ding, Y. Recent Advances and Implications for Aviation Emission Inventory Compilation Methods. Sustainability 2024, 16, 8507. [Google Scholar] [CrossRef]
Chung, Y.; Sunwoo, Y. Impact of Aviation Emissions and its Changes Due to the COVID-19 Pandemic on Air Quality in South Korea. Atmosphere 2022, 13, 1553. [Google Scholar] [CrossRef]
Rybarczyk, Y.; Zalakeviciute, R. Assessing the COVID-19 impact on air quality: A machine learning approach. Geophys. Res. Lett. 2021, 48, e2020GL091202. [Google Scholar] [CrossRef]
Avdoulou, M.M.; Golfinopoulos, A.G.; Kalavrouziotis, I.K. Monitoring Air Pollution in Greek Urban Areas During the Lockdowns, as a Response Measure of SARS-CoV-2 (COVID-19). Water Air Soil Pollut. 2023, 234, 13. [Google Scholar] [CrossRef]
Koulidis, A.G.; Progiou, A.G.; Ziomas, I.C. Air Quality Levels in the Vicinity of Three Major Greek Airports. Environ. Model. Assess. 2020, 25, 749–760. [Google Scholar] [CrossRef]
Subramaniam, S.; Raju, N.; Ganesan, A.; Rajavel, N.; Chenniappan, M.; Prakash, C.; Pramanik, A.; Basak, A.K.; Dixit, S. Artificial Intelligence Technologies for Forecasting Air Pollution and Human Health: A Narrative Review. Sustainability 2022, 14, 9951. [Google Scholar] [CrossRef]
Olawade, D.B.; Wada, O.Z.; Ige, A.O.; Egbewole, B.I.; Olojo, A.; Oladapo, B.I. Artificial intelligence in environmental monitoring: Advancements, challenges, and future directions. Hyg. Environ. Health Adv. 2024, 12, 100114. [Google Scholar] [CrossRef]
Samad, A.; Garuda, S.; Vogt, U.; Yang, B. Air pollution prediction using machine learning techniques—An approach to replace existing monitoring stations with virtual monitoring stations. Atmos. Environ. 2023, 310, 119987. [Google Scholar] [CrossRef]
Guo, Q.; Ren, M.; Wu, S.; Sun, Y.; Wang, J.; Wang, Q.; Ma, Y.; Song, X.; Chen, Y. Applications of Artificial Intelligence in the Field of Air Pollution: A Bibliometric Analysis. Front. Public Health 2022, 10, 933665. [Google Scholar] [CrossRef]
Ochando, L.C.; Julián, C.I.F.; Ochando, F.C.; Ferri, C. AirVLC: An Application for Real-Time Forecasting Urban Air Pollution. In Proceedings of the 2nd International Workshop on Mining Urban Data, Lille, France, 11 July 2015; Volume 1392, pp. 72–79. [Google Scholar]
Ameer, S.; Shah, M.A.; Khan, A.; Song, H.; Maple, C.; Islam, S.U.; Asghar, M.N. Comparative Analysis of Machine Learning Techniques for Predicting Air Quality in Smart Cities. IEEE Access 2019, 7, 128325–128338. [Google Scholar] [CrossRef]
Li, J.; Shao, X.; Sun, R.; Visioli, A. A DBN-Based Deep Neural Network Model with Multitask Learning for Online Air Quality Prediction. J. Control Sci. Eng. 2019, 2019, 5304535. [Google Scholar] [CrossRef]
Pasupuleti, V.R.; Uhasri; Kalyan, P.; Srikanth; Reddy, H.K. Air Quality Prediction of Data Log by Machine Learning. In Proceedings of the 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 6–7 March 2020; pp. 1395–1399. [Google Scholar] [CrossRef]
Kaur Bamrah, S.; Saiharshith, K.R.; Gayathri, K.S. Application of Random Forests for Air Quality Estimation in India by Adopting Terrain Features. In Proceedings of the 2020 4th International Conference on Computer, Communication and Signal Processing (ICCCSP), Chennai, India, 28–29 September 2020; pp. 1–6. [Google Scholar] [CrossRef]
Yarragunta, S.; Nabi, M.A.; Jeyanthi, P.; Revathy, S. Prediction of Air Pollutants Using Supervised Machine Learning. In Proceedings of the 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 6–8 May 2021; pp. 1633–1640. [Google Scholar] [CrossRef]
Chakradhar Reddy, K.; Nagarjuna Reddy, K.; Brahmaji Prasad, K.; Selvi Rajendran, P. The Prediction of Air Quality Using Supervised Learning. In Proceedings of the 2021 6th International Conference on Communication and Electronics Systems (ICCES), Coimbatre, India, 8–10 July 2021; pp. 1–5. [Google Scholar] [CrossRef]
Saminathan, S.; Malathy, C. Ensemble-Based Classification Approach for PM_2.5 Concentration Forecasting Using Meteorological Data. Front. Big Data 2023, 6, 1175259. [Google Scholar] [CrossRef]
Kou, X.; Peng, Z.; Zhang, M.; Zhang, N.; Lei, L.; Zhao, X.; Miao, S.; Li, Z.; Ding, Q. Assessment of the Meteorological Impact on Improved PM_2.5 Air Quality Over North China During 2016–2019 Based on a Regional Joint Atmospheric Composition Reanalysis Dataset. J. Geophys. Res. Atmos. 2021, 126, e2020JD034382. [Google Scholar] [CrossRef]
Alpan, K.; Sekeroglu, B. Prediction of Pollutant Concentrations by Meteorological Data Using Machine Learning Algorithms. Int. Arch. Photogr. Remote Sens. Spat. Inf. Sci. 2020, 44, 21–27. [Google Scholar] [CrossRef]
Ji, Y.; Zhi, X.; Wu, Y.; Zhang, Y.; Yang, Y.; Peng, T.; Ji, L. Regression Analysis of Air Pollution and Pediatric Respiratory Diseases Based on Interpretable Machine Learning. Front. Earth Sci. 2023, 11, 1105140. [Google Scholar] [CrossRef]
Sinha, J.; Kumar, N. Mortality and Air Pollution Effects of Air Quality Interventions in Delhi and Beijing. Front. Environ. Sci. 2019, 7, 434511. [Google Scholar] [CrossRef]
Mao, J.; Li, L.; Li, J.; Sulaymon, I.D.; Xiong, K.; Wang, K.; Zhu, J.; Chen, G.; Ye, F.; Zhang, N.; et al. Evaluation of Long-Term Modeling Fine Particulate Matter and Ozone in China During 2013–2019. Front. Environ. Sci. 2022, 10, 872249. [Google Scholar] [CrossRef]
Yuan, Y.; Zhang, X.; Zhao, J.; Shen, F.; Nie, D.; Wang, B.; Wang, L.; Xing, M.; Hegglin, M.I. Characteristics, Health Risks, and Premature Mortality Attributable to Ambient Air Pollutants in Four Functional Areas in Jining, China. Front. Public Health 2023, 11, 1075262. [Google Scholar] [CrossRef]
Xu, L.; Wang, B.; Wang, Y.; Zhang, H.; Xu, D.; Zhao, Y.; Zhao, K. Characterization and Source Apportionment Analysis of PM_2.5 and Ozone Pollution over Fenwei Plain, China: Insights from PM_2.5 Component and VOC Observations. Toxics 2025, 13, 123. [Google Scholar] [CrossRef] [PubMed]
Zhu, S.; Yu, H.; Zhang, Y.; Zhang, Y.; Kinnon, M.M. Editorial: Air Pollution and Climate Change: Interactions and Co-Mitigation. Front. Environ. Sci. 2022, 10, 1105656. [Google Scholar] [CrossRef]
Lien, W.-H.; Owili, P.O.; Muga, M.A.; Lin, T.-H. Ambient particulate matter exposure and under-five and maternal deaths in Asia. Int. J. Environ. Res. Public Health 2019, 16, 3855. [Google Scholar] [CrossRef]
Gouveia, N.; Junger, W.L.; Romieu, I.; Cifuentes, L.A.; de Leon, A.P.; Vera, J.; Strappa, V.; Hurtado-Díaz, M.; Miranda-Soberanis, V.; Rojas-Bracho, L.; et al. Effects of air pollution on infant and children respiratory mortality in four large Latin-American cities. Environ. Pollut. 2018, 232, 385–391. [Google Scholar] [CrossRef]
Sarkodie, S.A.; Strezov, V.; Jiang, Y.; Evans, T. Proximate determinants of particulate matter (PM_2.5) emission, mortality and life expectancy in Europe, Central Asia, Australia, Canada and the US. Sci. Total Environ. 2019, 683, 489–497. [Google Scholar] [CrossRef]
Amnuaylojaroen, T.; Parasin, N. Future Health Risk Assessment of Exposure to PM_2.5 in Different Age Groups of Children in Northern Thailand. Toxics 2023, 11, 291. [Google Scholar] [CrossRef]
Ravindra, K.; Vakacherla, S.; Singh, T.; Upadhya, A.R.; Rattan, P.; Mor, S. Long-Term Trend of PM_2.5 over Five Indian Megacities Using a New Statistical Approach. Stoch. Environ. Res. Risk Assess. 2023, 38, 715–725. [Google Scholar] [CrossRef]
Zhong, X.; Guo, T.; Zhang, J.; Wang, Q.; Yin, R.; Wu, K.; Zou, Q.; Zheng, M.; Hall, B.J.; Renzaho, A.M.N.; et al. Short-Term Effect of Air Pollution on Daily Hospital Visits for Anxiety Disorders in Southern China with Low Pollution Concentrations. Toxics 2025, 13, 45. [Google Scholar] [CrossRef] [PubMed]
Liang, Q.; Zhang, X.; Miao, Y.; Liu, S. Multi-Scale Meteorological Impact on PM_2.5 Pollution in Tangshan, Northern China. Toxics 2024, 12, 685. [Google Scholar] [CrossRef]
Zhao, J.; Mao, Z.; Han, B.; Fan, Z.; Ma, S.; Li, J.; Wang, R.; Yu, J. Characterizing Aircraft Exhaust Emissions and Impact Factors at Tianjin Binhai International Airport via Open-Path Fourier-Transform Infrared Spectrometer. Toxics 2024, 12, 782. [Google Scholar] [CrossRef]
Theophanides, M.; Anastassopoulou, J. Air pollution simulation and geographical information systems (GIS) applied to Athens International Airport. J. Environ. Sci. Health Part A 2009, 44, 758–766. [Google Scholar] [CrossRef] [PubMed]
Matthaios, V.N.; Triantafyllou, A.G.; Koutrakis, P. PM₁₀ episodes in Greece: Local sources versus long-range transport—Observations and model simulations. J. Air Waste Manag. Assoc. 2016, 67, 105–126. [Google Scholar] [CrossRef] [PubMed]
Aygun, A.; Dursun, O.O.; Toraman, T. Machine learning based approach for forecasting emission parameters of mixed flow turbofan engine at high power modes. Energy 2023, 271, 127026. [Google Scholar] [CrossRef]
Chetan, S.; Seema, S.; Sowmya, B.J.; Rajesh, N.; Supreeth, S.; Dayananda, P.; Rohith, S.; Ranjan, R.; Goud, V. A Machine Learning Approach for Environmental Assessment on Air Quality and Mitigation Strategy. J. Eng. 2024, 2893021. [Google Scholar] [CrossRef]

Figure 1. Study area—Alexandroupolis airport, Greece.

Figure 2. Machine learning as a subfield of Artificial Intelligence.

Figure 3. Research workflow of the proposed methodology.

Figure 4. (a) Fleet composition in Alexandropoulis airport—2019. (b) Fleet composition in Alexandropoulis airport—2020. Abbreviations: A320—Airbus A320, A319—Airbus A319, AT43—ATR 42-300/320, AT45—ATR 42-500, AT72—ATR 72-200/500, DH8D—De Havilland Canada Dash 8 Q400.

Figure 5. Percentage contribution of pollutants in 2019 and 2020.

Figure 6. Comparison of monthly emissions for various pollutants in 2019 and 2020: CO; CO₂; HC; NO_x; PM_2.5; SO_x; fuel.

Figure 7. Heatmap of Pearson correlations between meteorological and pollutant variables. ** Correlation is significant at the 0.01 level (two-tailed), * moderate correlation, *** strong correlation.

Figure 8. Evaluation of the regression machine learning algorithms and the performance heat map.

Figure 9. Feature importance analysis for PM_2.5 prediction.

Table 1. Descriptive statistics of the meteorological and pollutant variables.

Parameter **	N	Range	Minimum	Maximum	Mean	Std. Error	Std. Deviation	Variance
NO_x	168	602.9	1.9	604.8	103.8	11.5	146.4	21,445.3
CO	168	415.7	2.3	418	82.8	6.5	83.4	6967.6
HC	168	87.9	0.1	88.0	8.1	1.26	12.75	162.79
SO_x	168	40.9	0.2	41.1	7.58	0.77	9.87	97.55
Fuel	168	48,904.8	199.4	48,705.4	9026.7	925.8	11,748.1	138,017,958.8
PM_2.5	168	5.52	0.07	5.6	1.69	0.22	1.84	3.4
CO₂	168	153,411.2	628	154,039.2	28,433.02	2916.33	37,004.19	1,369,310,238.3
Total Traffic	168	55.0	1	56	16.12	1.2	15.04	226.43
Mean T (C⁰)	168	22.8	5.7	28.5	16.9	0.6	7.6	58.4
Rain (mm)	168	131.0	0.0	131.0	45.45	2.75	35.73	1276.7
INST (min)	168	316.0	76.0	392.0	228.0	7.21	93.48	8740.1
Max Wind (Bf)	168	3.0	3.0	6.0	4.2	0.04	0.57	0.33

** Parameters: Fuel, NO_x, CO_x, SO_x, PM_2.5, SO_x: Total traffic emissions [Kg].

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Stefanis, C.; Manisalidis, I.; Stavropoulou, E.; Stavropoulos, A.; Tsigalou, C.; Voidarou, C.; Constantinidis, T.C.; Bezirtzoglou, E. Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning. Toxics 2025, 13, 217. https://doi.org/10.3390/toxics13030217

AMA Style

Stefanis C, Manisalidis I, Stavropoulou E, Stavropoulos A, Tsigalou C, Voidarou C, Constantinidis TC, Bezirtzoglou E. Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning. Toxics. 2025; 13(3):217. https://doi.org/10.3390/toxics13030217

Chicago/Turabian Style

Stefanis, Christos, Ioannis Manisalidis, Elisavet Stavropoulou, Agathangelos Stavropoulos, Christina Tsigalou, Chrysoula (Chrysa) Voidarou, Theodoros C. Constantinidis, and Eugenia Bezirtzoglou. 2025. "Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning" Toxics 13, no. 3: 217. https://doi.org/10.3390/toxics13030217

APA Style

Stefanis, C., Manisalidis, I., Stavropoulou, E., Stavropoulos, A., Tsigalou, C., Voidarou, C., Constantinidis, T. C., & Bezirtzoglou, E. (2025). Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning. Toxics, 13(3), 217. https://doi.org/10.3390/toxics13030217

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Assessing the Impact of Aviation Emissions on Air Quality at a Regional Greek Airport Using Machine Learning

Abstract

1. Introduction

2. Materials and Methods

Data Description, Machine Learning Models, and Evaluation Metrics

3. Results

4. Discussion

5. Limitations

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI