The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections

Franczyk, Anna; Twardosz, Robert

doi:10.3390/app152010994

Open AccessArticle

The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections

by

Anna Franczyk

^1,*

and

Robert Twardosz

²

¹

Department of Geoinformatics and Applied Computer Science, AGH University of Science and Technology, al. Mickiewicza 30, 30-059 Kraków, Poland

²

Faculty of Geography and Geology, Jagiellonian University, ul. Gronostajowa 7, 30-387 Kraków, Poland

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(20), 10994; https://doi.org/10.3390/app152010994

Submission received: 9 September 2025 / Revised: 9 October 2025 / Accepted: 13 October 2025 / Published: 13 October 2025

(This article belongs to the Section Environmental Sciences)

Download

Browse Figures

Versions Notes

Abstract

This study presents an analysis of patterns in mean monthly air temperature increases in Poland using the deep learning model Neural Basis Expansion Analysis for Time Series (N-BEATS) algorithm. The dataset comprises mean monthly temperatures recorded between 1951 and 2024 at eight meteorological stations across Poland. The research was conducted in two phases. In the first phase, the 74-year period was divided into two distinct intervals: one characterized by relative temperature stability, and the other by a marked upward trend. In the second phase, the N-BEATS neural network was employed to extract temporal patterns directly from the data and to forecast future temperature values. The results confirm the capacity of machine learning methods to identify persistent climate trends and demonstrate their utility for long-term monitoring and prediction.

Keywords:

temperature rise; machine learning; N-BEATS method

1. Introduction

One major area of scientific research today is climate change and climate variability, which is clearly in response to the incontestable phenomenon of global warming that has been widely addressed in IPCC publications [1]. Although records show that air temperatures have been increasing since the end of the Little Ice Age, this pattern became considerably more pronounced at the end of the 20th century. While during the 1880–2019 period as a whole the air temperatures rose by approximately 0.7 °C/100 years, in the final 30 years of this timeframe (1990–2019), the increase in temperature has actually been in the region of 2.0 °C/100 years [2]. Trenberth [3] demonstrated that this warming trend has actually been characterized by distinct spatial variations: it has been more significant in land than over the oceans. In Europe, warming has accelerated at a faster rate than in other continents [4,5], and this has manifested itself in a higher incidence of heat waves of greater intensity, in which temperatures significantly surpassed the long-term average of whole months, or even seasons, in which temperatures significantly surpassed the long-term average [6,7]. According to agreed-upon prognoses, rapid warming will continue in the future [8].

On the other hand, data from different sources and years show quite clearly that rising air temperatures in Europe and its nearby surrounding areas vary significantly in terms of both time and place [9,10,11]. This means that the warming trend of recent years is actually a complex and spatially diversified phenomenon [12,13,14]. Accurate evaluation of its potential social and economic impacts requires sustained monitoring that relies on a consistent dataset.

According to the latest research [11], the current upward trend in air temperatures in Europe began in the second half of the 1980s. Previously, temperatures had remained more or less stable. Conclusions were drawn from a classic statistical analysis of an area-average series of air temperature values over a period of 70 years, between 1951 and 2020, using the linear regression goodness of fit method as the basis for their calculations. This is because the slope of the linear regression of average annual temperatures and seasonal temperatures in the 1986–2020 timeframe turned out to be approximately twice as great as for the entire 70-year period as a whole. The results of this analysis provided the stimulus for further research based on data with a greater temporal resolution, i.e., average monthly air temperature values in tandem with objective and more advanced techniques based on artificial intelligence (AI), i.e., machine learning. Such new approaches are finding increasing applications in many scientific areas and are also being applied in a variety of ways in climate studies [15,16,17,18].

Previous studies have examined the time periods of significant air temperature changes across Europe [19]. These findings provided a direct impetus for the present study, which offers a detailed analysis of changes in mean monthly air temperatures in Poland using a neural network model. According to an analysis of mean monthly air temperatures [19] over a 70-year period, the annual temperature record can be divided into two distinct phases: an earlier period of relative stability and a subsequent phase marked by a clear increase in temperature. The present study focuses on this latter period of warming, aiming to (i) develop a time series model for monthly mean temperatures recorded after 1999 and (ii) generate forecasts of monthly average temperature values for Poland.

Time series forecasting has traditionally been dominated by linear models, such as the autoregressive integrated moving average (ARIMA) model [20]. With recent advances in artificial intelligence, hybrid approaches combining conventional statistical methods with artificial neural networks have been increasingly applied to time series analysis [21,22]. Such methodologies enable the identification of both linear and nonlinear dependencies within time series data, thereby improving predictive accuracy and model robustness. In the present study, a pure deep learning approach, the Neural Basis Expansion Analysis for Interpretable Time Series (N-BEATS) model [23], was employed. The N-BEATS model was benchmarked against state-of-the-art forecasting frameworks, including a Transformer-based architecture and an XGBoost variant, in order to assess its methodological advantages in modeling air temperature variability. Model parameters were optimized through a grid search procedure, ensuring that the resulting forecasts reflect the most accurate configuration for the analyzed dataset.

The article first provides an exploratory overview of the data, followed by a methodological framework combining clustering of annual temperature cycles with time series modeling based on machine learning. The subsequent analysis presents and evaluates the forecasting results.

2. Materials and Methods

2.1. Data Used and Their Preliminary Analysis

The input data for the study consisted of mean monthly air temperatures collected from eight meteorological stations across Poland. Five of these stations—located in Kraków, Warszawa, Poznań, Koszalin, and Kasprowy Wierch—are part of a broader network of 210 stations distributed as evenly as possible across Europe and its immediate surroundings (Figure 1a) [19]. The remaining three stations, Olsztyn, Kołobrzeg, and Rzeszów, represent a lake district, a coastal lowland, and a Carpathian foreland, respectively, and were included to minimize potential spatial bias in the analysis. The data covered a 74-year timeframe between January 1951 and December 2024. Such a dataset provided the basis for numerous earlier studies, including research on changing patterns in annual and seasonal temperatures in Europe [11]. It was constructed on the basis of publicly accessible databases used for climate studies. One dataset of particular importance for the authors was the European Climate Assessment & Dataset [24].

The location of the stations (seven cities and one mountain—Kasprowy Wierch) together with their latitudes and longitudes are included in Table 1. The locations of the Polish meteorological stations against the background of the network of European meteorological stations as a whole are presented in Figure 1b.

The data from the eight Polish stations covering a total of 888 air temperature values (74 years × 12 months) for each of the meteorological stations. One example of a time series showing changes in monthly air temperature values in the period 1951–2024 (Warsaw) is presented in Figure 2.

2.2. Cluster Analysis of Annual Cycles of Monthly Mean Air Temperature

To identify characteristic patterns of thermal variability, the analyzed dataset of mean monthly air temperatures covering the period 1950–2024 was divided into annual cycles, each representing a complete year of observations. This decomposition yielded a total of 74 annual curves. To visualize the long-term evolution of annual temperature changes, the data were displayed as a sequence of annual cycles. In order to minimize the influence of local fluctuations and enhance the clarity of inter-annual variability, the cycles were further smoothed using five-year moving averages. The methodology applied in these calculations, together with the corresponding mean values for the periods 1951–1955, 1952–1956, 1953–1957, 1954–1958, and 2020–2024, is presented in Figure 3.

For the 74 curves, which were broken down into yearly periods, hierarchical clustering was applied to identify groups exhibiting similar temporal patterns. The method of hierarchical clustering is based on the successive formation of new clusters from previously defined ones, ultimately producing a hierarchical structure of data organization. In this study, Ward’s criterion [25] was employed as the linkage method. This approach determines clusters by minimizing the total within-cluster variance. At each step of the procedure, the merger of two clusters is selected so as to yield the smallest possible increase in the overall sum of squared errors. The outcome of the procedure is a cluster tree (dendrogram), which graphically represents the hierarchical relationships between the groups, thereby facilitating the interpretation of similarities and differences among the analyzed time series.

2.3. Deep Learning Analysis Method

To generate forecasts of monthly mean air temperature values for Poland, the N-BEATS (Neural Basis Expansion Analysis for Interpretable Time Series) model was employed. N-BEATS is a deep neural architecture based on multilayer perceptrons (MLPs) with backward and forward residual connections (Figure 4).

The fundamental building block of the model is a multi-layer, fully connected network that utilizes ReLU activation functions. The input from the lookback period is initially processed through four fully connected layers. Subsequently, the output is stacked, divided into two components, and further processed by means of additional fully connected layers, which estimate the expansion coefficients for both forward (forecast) and backward (backcast) projections.

A single stack comprises multiple basic blocks arranged in accordance with the doubly residual stacking principle. Within this framework, the input to each block is obtained by subtracting the output of the previous block from its respective input. The final forecast output of a stack is obtained by adding up the forecast outputs of all the constituent blocks. Forecasts are aggregated in a hierarchical manner, thereby facilitating the construction of a deep neural network capable of capturing complex temporal dependencies. In the present study, interpretable configuration of the N-BEATS architecture was employed. Unlike the generic variant, which consists of at least 30 stacks, the interpretable model is designed with two stacks. This configuration sequentially breaks down the input signal into polynomial and harmonic bases with the aim of capturing trend and seasonality components. Each stack comprises three blocks, whereas in the generic version, each stack contains only a single block. Furthermore, the basis layer weights are shared at the stack level, thus enhancing the model’s interpretability and structural coherence.

Model evaluation was performed separately for each meteorological station using a 20% validation subset. Forecast accuracy was assessed using three complementary metrics: MAPE, RMSE, and R². The Mean Absolute Percentage Error (MAPE) is defined as:

M A P E = \frac{1}{N} \sum_{i = 1}^{N} |\frac{y_{i} - {\hat{y}}_{i}}{y_{i}}| \cdot 100 %,

(1)

where

y_{i}

is the actual value,

{\hat{y}}_{i}

is the predicted value and N is the number of observations. MAPE provides a percentage-based measure of error but is sensitive to values near 0 °C. The Root Mean Square Error (RMSE), which quantifies error in the original units, is calculated as

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}

(2)

The coefficient of determination (R²), indicating the proportion of variance explained by the model, is computed as:

R^{2} = 1 - \frac{\sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{N} {(y_{i} - {\bar{y}}_{i})}^{2}}

(3)

where

{\bar{y}}_{i}

is the mean of the observed values. Together, these metrics provide a concise and complementary evaluation of forecast performance.

3. Results

The annual variability plots presented in Figure 5, calculated using five-year moving windows for all eight meteorological stations, consistently reveal an overall increase in temperature throughout the study period.

A steady rise in temperature, illustrated as a sequence of colors ranging from blue to purple, has been especially pronounced since the final decades of the previous century (the 1980s and the 1990s). No distinct rise on this scale can be observed between the 1950s and the 1970s. However, the rate of temperature change varied across individual months, which is reflected in the differing thickness of the plotted trajectories. In particular, the winter months (January–March) exhibit greater variability and a more complex structure compared to the smoother and more consistent trends observed for the remaining months of the year. This pattern suggests that the warming process in Poland has not been uniform throughout the annual cycle but rather exhibits distinct seasonal differences in both magnitude and temporal dynamics.

Figure 6 presents the course of mean monthly temperature values for stations located in Poland (surface mean temperature values) between 1951 and 2024. The mean temperature values were prepared individually for particular months in a calendar year.

The temperature values shown in Figure 6 align with the average temperature values calculated for the entire European dataset [19]. The patterns in mean temperature values for particular months indicate variability. After an initial period of fluctuations around the mean temperature value each month a consistent upward trend is noticeable in mean monthly temperature values. This change is especially evident in mean temperatures in the spring and summer months (March–July). Mean temperature curves calculated for the autumn and winter months (August–February) also indicate an increase in the mean temperature; however, such a rise does not point to such a strong linear trend as was noted in the summer months. Moreover, in recent years, a distinct downward tendency in mean temperatures has been noted for the period from August to November, suggesting a localized cooling effect during the early autumn season.

The results obtained from the clustering of mean annual cycles, performed for data recorded for stations located in Poland, are presented in Figure 7. Similarly to the clustering of mean annual cycles of temperature values conducted for the whole of Europe [19], a distinct separation was observed in the distribution of annual temperature profiles. Furthermore, the analysis confirmed that these differences are primarily driven by variations in temperatures recorded during the summer months, which exert the greatest influence on the overall structure of the annual thermal cycle.

The post-1999 period, corresponding to the phase of intensified warming, was subsequently utilized to develop forecasts of monthly mean air temperature values for Poland using the N-BEATS deep learning model. All computations were performed with the Darts open-source Python library (v3.11.9), which provides a unified framework for time series analysis and forecasting. The dataset, comprising monthly mean air temperature records from meteorological stations distributed across Poland, was normalized and subsequently partitioned into training and validation subsets according to an 80:20 ratio. The training set was used to train an interpretable version of the model. The input_chunk_length parameter, which specifies the number of past time steps provided to the model as input, and the output_chunk_length parameter, defining the forecasting horizon, were determined through a grid search procedure. The hyperparameter ranges explored in this procedure were: input_chunk_length = [12, 24, 36, 48] and output_chunk_length = [6, 12, 18]. The number of training epochs was set to 100, as no further improvement in the loss function was observed beyond this point. Forecast performance for each combination of these parameters was assessed using MAPE, RMSE, and R², with the results presented in Table 2.

The combination of input_chunk_length = 24 and output_chunk_length = 12 was selected for the final model. Although the performance metrics were very similar to the shorter forecasting horizon (output_chunk_length = 6), the choice of 12 months naturally aligns with predicting the entire annual cycle.

For the selected hyperparameter configuration of the N-BEATS model (input_chunk_length = 24, output_chunk_length = 12), its predictive performance was benchmarked against two state-of-the-art alternatives: a Transformer-based model and an XGBoost variant. The results of this comparison are presented in Table 3.

The results clearly indicate that N-BEATS outperformed both benchmark models across all evaluation metrics. In particular, it achieved the lowest MAPE (12.2%), reflecting superior accuracy in relative terms, as well as the lowest RMSE, confirming a reduced magnitude of absolute forecast errors. Moreover, the coefficient of determination (R² = 0.90) demonstrates that N-BEATS explained a higher proportion of variance in the observed data compared to XGBoost (R² = 0.88) and the Transformer model (R² = 0.86).

Given these results, the N-BEATS model was selected for generating the final forecasts. A global model for Poland was trained using 80% of the combined dataset from all eight meteorological stations, comprising a total of 1996 data points. The use of N-BEATS made it possible to simultaneously analyze multiple time series, which was particularly advantageous given the relatively limited number of observations available for each individual station. By adopting a modeling approach capable of jointly utilizing several series during the training phase, the effective size of the training dataset was increased approximately eightfold, thereby enhancing the model’s ability to generalize and capture shared temporal dynamics across different locations. The values of RMSE, MAPE, and R² obtained for the validation sets of all eight stations are presented in Table 4.

The forecast evaluation results are relatively consistent across most of the meteorological stations. For the majority of locations, MAPE values remain close to 14%, with RMSE values ranging between 0.08 and 0.09 °C and R² values around 0.85–0.87, indicating a stable level of accuracy and good model fit. The lowest error values were obtained for Rzeszów (MAPE = 13.37%, RMSE = 0.08, R² = 0.87), suggesting that forecasts for this location are particularly reliable. In contrast, the mountain station at Kasprowy Wierch exhibits a noticeably higher MAPE of 21.58% and a lower R² of 0.83, which may be attributed to the greater variability and complexity of temperature dynamics in high-altitude environments. Slightly elevated errors are also observed for the coastal station in Kołobrzeg (MAPE = 16.30%), reflecting the influence of maritime climatic factors. Overall, the results confirm the robustness of the forecasting framework, while also highlighting that prediction accuracy may vary depending on local climatic and topographical conditions.

The predicted results for two meteorological stations—Rzeszów and Kasprowy Wierch—together with the actual values from the validation sets (rescaled to their original real-world units) are presented in Figure 8. These two stations were selected as they represent the best and worst forecast performance, respectively, within the group of analyzed locations in terms of MAPE, RMSE, and R². The monthly average temperatures were forecast for a future time span of ten years, covering the period from January 2025 to December 2034.

The predicted results differ not only in terms of local variations (predictions for urban areas) but also within the range of forecasted temperatures (e.g., the difference between the predicted value for the meteorological station at Kasprowy Wierch and the predicted values for the other seven stations located in different cities in Poland). The forecasts presented in the previous figure, which cover the period corresponding with the validation set (from September 2019 to December 2024), are to a considerable extent in harmony with the actual validation data.

Figure 9 shows the residuals between the predictions and the validation set for all five meteorological stations.

Residual diagnostics confirmed that the residuals obtained from N-BEATS forecasts were, in most cases, normally distributed, uncorrelated, and homoscedastic. The Shapiro–Wilk test indicated no significant deviations from normality for any of the eight stations, confirming that the model captured the main patterns of thermal variability without systematic bias. The Ljung–Box test results suggested that residuals were free from serial correlation at most stations, with the exception of Warszawa and Kasprowy Wierch, where weak autocorrelation was detected (p < 0.05). However, this effect was marginal and did not materially affect overall forecast performance. Furthermore, the ARCH test showed no evidence of heteroscedasticity across all analyzed stations (p > 0.05), implying a stable variance of forecast errors over time.

In the case of the predicted values for the period between January 2025 and December 2034, it should be emphasized that, based on this type of graphical representation, no distinct global trend or noticeable local shifts can be identified in the projected monthly mean temperatures (Figure 10). The forecasts presented for Poland in Figure 9 clearly illustrate the differing dynamics that characterize the winter and summer months. Furthermore, the rate of change, expressed as a linear trend fitted to the data, is summarized in Table 5 which presents the estimated slopes (°C decade⁻¹) for each month, together with their 95% confidence intervals. The results presented in Table 5 indicate pronounced seasonal variability in the projected temperature trends across the forecast period. Statistically significant positive trends were identified for March, April, and May, suggesting a potential warming tendency during the spring months. In contrast, late summer and autumn months (August–November) display statistically significant negative trends, implying a possible cooling pattern during this part of the year.

The magnitude of these trends ranges from approximately –3.6 °C decade⁻¹ in November to +3.3 °C decade⁻¹ in March, reflecting substantial inter-monthly differences in the forecasted temperature evolution. For the winter months (December–February), the estimated slopes are small and statistically insignificant (p > 0.05), which may indicate that temperature variations during this period are driven by more complex, potentially nonlinear processes rather than by a consistent linear trend.

4. Discussion

The results obtained in this study confirm the occurrence of long-term warming trends in Poland, consistent with broader European and global climate patterns. Analysis of 74 years of temperature observations from five meteorological stations revealed a marked shift around 1999, when a stable climatic phase gave way to accelerated warming. This transition is particularly visible in the summer months, which have shown stronger linear increases in mean temperatures compared with the winter season. Such a pattern mirrors findings for other European regions, suggesting that Poland’s climate dynamics are part of a wider continental-scale process.

The clustering analysis further demonstrated that the key driver of the observed changes lies in summer temperature values. This seasonal asymmetry is of particular importance for environmental and socio-economic systems: rising summer temperatures are associated with an increased frequency of heatwaves, water shortages, and higher energy demand for cooling, while warmer winters may influence agricultural cycles, biodiversity, and the hydrological balance.

From a methodological perspective, the use of the N-BEATS deep learning model proved effective for forecasting monthly average temperatures. The model achieved relatively low error rates (MAPE values between 14.0% and 21.6%), which are comparable to results reported in previous climate-focused time series studies using hybrid statistical–neural approaches. The highest forecasting error was recorded for the Kasprowy Wierch station, which can be attributed to its mountainous location. At higher altitudes, temperature patterns are strongly influenced by orographic effects, atmospheric circulation, and local variability, making predictions more challenging compared with urban lowland stations.

The forecasts for the decade 2025–2034 suggest a continuation of the observed warming, though with pronounced seasonal variability. The increase in spring and early summer temperatures (March–July) aligns with projections of intensified continental warming under climate change scenarios.

5. Conclusions

The analysis of monthly mean air temperatures from eight meteorological stations in Poland for the period 1951–2024 appears to indicate a general warming tendency, with some evidence of an acceleration after 1999. The results highlight pronounced seasonal asymmetry: the strongest increases were observed in the spring and early summer months, while winter temperatures showed a more moderate rise. Clustering analyses confirmed that long-term changes in annual cycles are primarily driven by summer variability, in line with observations made at the European scale.

The application of the N-BEATS deep learning model demonstrated its effectiveness in forecasting short- to medium-term temperature dynamics. The model achieved satisfactory predictive accuracy across all stations, with residuals normally distributed, which confirms that the underlying temporal patterns were adequately captured. Forecasts indicate a continuation of the warming trend, with the most pronounced increases expected in the spring and summer months, while autumn and winter temperatures are projected to remain relatively stable.

These findings confirm that Poland’s temperature changes are embedded within broader continental and global warming processes and CMIP6 ensemble projections. At the same time, the observed and predicted intensification of summer warming has critical implications for ecosystems, agriculture, and socio-economic systems. Future research should aim to integrate higher-resolution spatial data and hybrid approaches that combine statistical learning with process-based climate models in order to improve predictive reliability and extend the forecast horizon.

Author Contributions

Conceptualization, A.F.; methodology, A.F.; software, A.F.; validation, A.F. and R.T.; formal analysis, A.F.; investigation, A.F.; resources, R.T.; data curation, R.T.; writing—original draft preparation, A.F.; writing—review and editing, A.F. and R.T.; visualization, A.F.; supervision, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research project was partly supported by program “Excellence initiative—research university” for the AGH University. Research project supported by program implementation doctorate financed by Ministry of Science and Higher Education and by a grant from the Faculty of Geography and Geology under the Strategic Program Excellence Initiative at Jagiellonian University.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

IPCC. Climate Change 2021: The Physical Science Basis; Cambridge University Press: Cambridge, UK, 2023. [Google Scholar]
Kundzewicz, Z.W.; Pińskwar, I.; Koutsoyiannis, D. Variability of global mean annual temperature is significantly influenced by the rhythm of ocean-atmosphere oscillations. Sci. Total Environ. 2020, 747, 141256. [Google Scholar] [CrossRef] [PubMed]
Trenberth, K.E.; Jones, P.D.; Ambenje, P. Observations: Surface and Atmospheric Climate Change. In Climate Change 2007: The Physical Science Basis; Solomon, S., Qin, S.D., Manning, M., Chen, Z., Marquis, M., Averyt, K.B., Tignor, M., Miller, H.L., Eds.; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2007. [Google Scholar]
Van der Schrier, G.; van den Besselaar, E.J.M.; Klein Tank, A.M.G.; Verver, G. Monitoring European average temperature based on E-OBS gridded data set. J. Geophys. Res. Atmos. 2013, 118, 5120–5135. [Google Scholar] [CrossRef]
Luterbacher, J.; Werner, J.P.; Smerdon, J.E.; Fernández-Donado, L.; González-Rouco, F.J.; Barriopedro, D.; Ljungqvist, F.C.; Büntgen, U.; Zorita, E.; Wagner, S.; et al. European summer temperatures since Roman times. Environ. Res. Lett. 2016, 11, 024001. [Google Scholar] [CrossRef]
Liu, X.; He, B.; Guo, L.; Huang, L.; Chen, D. Similarities and differences in the mechanisms causing the European summer heatwaves in 2003, 2010, and 2018. Earth’s Future 2020, 8, e2019EF001386. [Google Scholar] [CrossRef]
Twardosz, R.; Kossowska-Cezak, U. Large-area thermal anomalies in Europe (1951–2018). Temporal and spatial patterns. Atmos. Res. 2021, 251, 105434. [Google Scholar] [CrossRef]
Vautard, R.; Gobiet, A.; Sobolowski, S.; Kjellström, E.; Stegehuis, A.; Watkiss, P.; Mendlik, T.; Landgren, O.; Nikulin, G.; Teichmann, C.; et al. The European climate under a 2 °C warming. Environ. Res. Lett. 2014, 9, 034006. [Google Scholar] [CrossRef]
Chen, D.; Walther, A.; Moberg, A. European Trend Atlas of Extreme Temperature and Precipitation Records; Springer: Dordrecht, The Netherlands, 2015. [Google Scholar]
Kaufman, L.; Rousseeuw, P.J. Finding Groups in Data: An Introduction to Cluster Analysis; Wiley: New York, NY, USA, 1990. [Google Scholar]
Twardosz, R.; Walanus, A.; Guzik, I. Warming in Europe: Recent Trends in Annual and Seasonal Temperatures. Pure Appl. Geophys. 2021, 178, 4021–4032. [Google Scholar] [CrossRef]
Ji, F.; Wu, Z.; Huang, J.; Chassignet, E.P. Evolution of land surface air temperature trend. Nat. Clim. Change 2014, 4, 462–466. [Google Scholar] [CrossRef]
Hegerl, G.; Brönnimann, S.; Schurer, A.; Cowan, T. The early 20th century warming: Anomalies, causes, and consequences. WIREs Clim. Change 2018, 9, e522. [Google Scholar] [CrossRef] [PubMed]
Krauskopf, T.; Huth, R. Temperature trends in Europe: Comparison of different data sources. Theor. Appl. Climatol. 2020, 139, 1305–1316. [Google Scholar] [CrossRef]
Zhou, B.; Erell, E.; Hough, I.; Rosenblatt, J.; Just, A.C.; Novack, V.; Kloog, I. Estimating near-surface air temperature across Israel using a machine learning based hybrid approach. Int. J. Climatol. 2020, 40, 6106–6121. [Google Scholar] [CrossRef]
Zennaro, F.; Furlan, E.; Simeoni, C.; Torresan, S.; Aslan, S.; Critto, A.; Marcomini, A. Exploring machine learning potential for climate change risk assessment. Earth-Sci. Rev. 2021, 220, 103752. [Google Scholar] [CrossRef]
Chen, L.; Han, B.; Wang, X.; Zhao, J.; Yang, W.; Yang, Z. Machine Learning Methods in Weather and Climate Applications: A Survey. Appl. Sci. 2023, 13, 12019. [Google Scholar] [CrossRef]
Bussalleu, A.; Hoek, G.; Kloog, I.; Probst-Hensch, N.; Röösli, M.; de Hoogh, K. Modelling Europe-wide fine resolution daily ambient temperature for 2003–2020 using machine learning. Sci. Total Environ. 2024, 928, 172454. [Google Scholar] [CrossRef] [PubMed]
Franczyk, A.; Twardosz, R.; Walanus, A. Structure of rise in monthly temperature in Europe as estimated by machine learning. Pure Appl. Geophys. 2025, 182, 2631–2653. [Google Scholar] [CrossRef]
Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; Holden-Day: San Francisco, CA, USA, 1970. [Google Scholar]
Zhang, G.P. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 2003, 50, 159–175. [Google Scholar] [CrossRef]
Elseldi, M. Forecasting temperature data with complex seasonality using time series methods. Model. Earth Syst. Environ. 2023, 9, 2553–2567. [Google Scholar] [CrossRef]
Smyl, S. A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting. Int. J. Forecast. 2020, 36, 75–85. [Google Scholar] [CrossRef]
Klein Tank, A.M.G.; Wijngaard, J.B.; Können, G.P.; Böhm, R.; Demarée, G.; Gocheva, A.; Mileta, M.; Pashiardis, S.; Hejkrlik, L.; Kern-Hansen, C.; et al. Daily dataset of 20th-century surface air temperature and precipitation series for the European Climate Assessment. Int. J. Climatol. 2002, 22, 1441–1453. [Google Scholar] [CrossRef]
Ward, J.H. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 1963, 58, 236–244. [Google Scholar] [CrossRef]
Oreshkin, B.N.; Carpov, D.; Chapados, N.; Bengio, Y. N-BEATS: Neural Basis Expansion Analysis for Interpretable Time Series Forecasting. In Proceedings of the International Conference on Learning Representations 2020 (ICLR), Addis Ababa, Ethiopia, 30 April 2020. [Google Scholar]

Figure 1. The Polish weather stations included in the study (diagram (b)) form part of a group of 210 meteorological stations distributed as evenly as possible across Europe and its immediate surroundings (a) [20], supplied with air temperature records of lake district (Olsztyn), a coastal-lowland (Kołobrzeg), and a Carpathian foreland (Rzeszow) stations (b).

Figure 2. Time series of monthly temperature values at the Warszawa station in the period 1950–2024.

Figure 3. Time series of mean monthly air temperatures for the period 1950–2024 (a) and the calculation scheme for five-year mean temperature cycles (b). The colored sections in the figure represent five-year intervals used for calculating the smoothed cycle of temperature changes.

Figure 4. The top-level architecture of N-BEATS, source: [26].

Figure 5. Annual course of temperature change recorded at Polish stations smoothed out using a 5-year window.

Figure 6. Monthly variability in mean temperatures in Poland during the period 1951–2024 (grey line). The diagrams also include values averaged using Gaussian window size 7 and with a standard deviation of 2 (blue line).

Figure 7. Clustering results based on mean temperature values for summer and winter months, performed using hierarchical clustering.

Figure 8. Monthly average temperature forecasts for the period 2019–2034 generated using the N-BEATS neural network, trained on individual time series for Rzeszów and Kasprowy Wierch. Forecasted values are depicted in red, while actual validation set observations are shown in blue.

Figure 9. Residual plot showing the differences between predicted values and actual monthly average temperatures from the validation set for all five meteorological stations.

Figure 10. Variability of mean monthly temperatures in Poland from 1951 to 2024 (grey line), including predicted future trends over the next 10 years (red line). The diagrams also include values averaged using a size 7 Gaussian window and a standard deviation of 2 (blue line).

Table 1. Location, longitude and latitude of the meteorological stations in Poland.

Location	Longitude	Latitude
Koszalin	54.194° N	16.172° E
Warszawa	52.2297° N	21.0122° E
Poznań	52.4069° N	16.929° E
Kraków	50.0647° N	19.945° E
Kasprowy Wierch	49.2614° N	19.9872° E
Kołobrzeg	54.176° N	15.583° E
Rzeszów	50.041° N	21.999° E
Olsztyn	53.770° N	20.490° E

Table 2. Forecast performance metrics (MAPE, RMSE, R²) for different combinations of input and output chunk lengths in the N-BEATS model.

Input Chunk Length	Output Chunk Length	MAPE	RMSE	R²
12	6	22.6	0.14	0.64
12	12	15.2	0.10	0.82
24	6	12.2	0.07	0.90
24	12	12.2	0.07	0.90
24	18	14.2	0.09	0.87
36	6	12.4	0.08	0.89
36	12	13.5	0.08	0.88
36	18	14.0	0.08	0.88
48	6	14.5	0.08	0.87
48	12	19.6	0.11	0.78
48	18	13.2	0.08	0.88

Table 3. Forecast accuracy metrics (MAPE, RMSE, R²) for N-BEATS, Transformer, and XGBoost models.

Model	MAPE	RMSE	R²
N-BEATS	12.2	0.07	0.90
XGBoost	14.4	0.08	0.88
Transformer	15.22	0.09	0.86

Table 4. Forecast performance metrics (MAPE, RMSE, R²) calculated separately for each meteorological station in Poland.

Location	MAPE	RMSE	R²
Koszalin	14.30	0.09	0.85
Warszawa	14.26	0.08	0.87
Poznań	14.30	0.09	0.86
Kraków	13.94	0.09	0.86
Kasprowy Wierch	21.58	0.10	0.83
Rzeszow	13.37	0.08	0.87
Olsztyn	14.08	0.09	0.86
Kołobrzeg	16.30	0.09	0.83

Table 5. Estimated linear trends in forecasted mean monthly temperatures (°C decade⁻¹) for the period 2025–2034, together with their 95% confidence intervals and corresponding p-values.

Month	Linear Trend (°C Decade⁻¹)	95% Confidence Interval (°C Decade⁻¹)	p-Value
January	−0.23	±1.17	0.67
February	0.50	±2.00	0.54
March	3.29	±1.32	0.00
April	2,14	±0.94	0.00
May	0.62	±0.88	0.00
June	0.63	±1.29	0.33
July	0.14	±0.36	0.41
August	−3.13	±1.80	0.00
September	−3.52	±0.86	0.00
October	−2.25	±1.13	0.00
November	−3.62	±1.21	0.00
December	−2.64	±1.29	0.00

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Franczyk, A.; Twardosz, R. The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections. Appl. Sci. 2025, 15, 10994. https://doi.org/10.3390/app152010994

AMA Style

Franczyk A, Twardosz R. The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections. Applied Sciences. 2025; 15(20):10994. https://doi.org/10.3390/app152010994

Chicago/Turabian Style

Franczyk, Anna, and Robert Twardosz. 2025. "The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections" Applied Sciences 15, no. 20: 10994. https://doi.org/10.3390/app152010994

APA Style

Franczyk, A., & Twardosz, R. (2025). The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections. Applied Sciences, 15(20), 10994. https://doi.org/10.3390/app152010994

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Phenomenon of Temperature Increase in Poland: A Machine Learning Approach to Understanding Patterns and Projections

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Used and Their Preliminary Analysis

2.2. Cluster Analysis of Annual Cycles of Monthly Mean Air Temperature

2.3. Deep Learning Analysis Method

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI