Next Article in Journal
Modelling Agroforestry’s Contributions to People—A Review of Available Models
Previous Article in Journal
Crop Allelopathy for Sustainable Weed Management in Agroecosystems: Knowing the Present with a View to the Future
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modelling the Common Agricultural Policy Impact over the EU Agricultural and Rural Environment through a Machine Learning Predictive Framework

by
Dragos Sebastian Cristea
1,*,
Sarina Rosenberg
1,
Adriana Pustianu Mocanu
1,
Ira Adeline Simionov
2,
Alina Antache Mogodan
2,
Stefan Mihai Petrea
2,* and
Liliana Mihaela Moga
1
1
Faculty of Economics and Business Administration, “Dunărea de Jos” University of Galaţi, 800008 Galați, Romania
2
Food Science, Food Engineering, Biotechnology and Aquaculture (SAIABA), Faculty of Food Science and Engineering, “Dunărea de Jos” University of Galaţi, 800201 Galati, Romania
*
Authors to whom correspondence should be addressed.
Agronomy 2021, 11(11), 2105; https://doi.org/10.3390/agronomy11112105
Submission received: 10 September 2021 / Revised: 12 October 2021 / Accepted: 14 October 2021 / Published: 20 October 2021
(This article belongs to the Section Farming Sustainability)

Abstract

:
This research provides an analytical and predictive framework, based on state-of-the-art machine-learning (ML) algorithms (random forest (RF) and generalized additive models (GAM)), that can be used to assess and improve the Common Agricultural Policy (CAP) impact/performance over the agricultural and rural environments, easing the identification of proper instruments that can be used by EU policy makers in CAP’s financial management. The applied methodology consists of elaborating a custom-developed analytical framework based on a dataset containing 22 relevant indicators, considering four main dimensions that describe the intricacies of the EU agricultural and rural environment, in the CAP context: rural, emissions, macroeconomic, and financial. The results highlight that an increase of the agricultural research and development funding, as well as the agriculture employment rate, negatively influence the degree of rural poverty. The rural GDP per capita is influenced by the size of the employment rate in agriculture. It seems that environmental sustainability, identified by both fertilizers used and emissions from agriculture parameters, significantly influences the GDP per capita. In predicting emissions in agriculture, the direct payment, degree of rural poverty, fertilizer use, employment in agriculture, and agriculture labor productivity are the main independent parameters with the highest future importance. It was found that when predicting direct payments, the rural employment rate, employment in agriculture, and gross value added must be considered the most. The agricultural, entrepreneurial income prediction is mainly influenced by the total factor productivity, while agricultural research and development investments depend on gross value added, direct payments, and gross value added in the agricultural sector. Future research, related to prediction models based on CAP indicators, should also consider the marketing dimension. It is recommended for direct payments to be used to invest in upgrading the fertilizers technologies, since environmental sustainability will influence economic growth.

1. Introduction

One of the most important components of the European Union budget is represented by agricultural expenditures, the Common Agricultural Policy (CAP) being the only policy financed from the budget of the European Union (EU). Still, this situation seems to be changing, as can be observed in the recent EU financial exercises. Thus, if at the beginning of the 1980s the amounts allocated for supporting the CAP represented 66% of the budget of the Union, the projected expenses for the period between 2014–2020 decreased, reaching 37.8%. The CAP budgetary costs, in relation to the EU gross national income (GNI), changed from 0.54% in 1990 to 0.34% in 2020 (European Parliament, 2019).
According to Eurostat data, the importance of CAP can be emphasized, as in 2017, over 12 million farmers were registered in the EU, and agriculture and the agri-food industry represent 6% of the EU GDP and generate over 46 million jobs. Rural areas represent over 77% of the EU territory (47% agricultural land and 30% forest) and shelter about half of its population (agricultural communities and other inhabitants) (https://ec.europa.eu/commission/presscorner/detail/ro/MEMO_13_631, accessed on 28 August 2021), with 9.7 million people working in the agricultural domain in 2016.
The objectives of the CAP facilitate, at the branch level, the following benefits: it helps farmers to invest in farm modernization, guarantees food safety and security, and creates and preserves jobs in the agri-food industry. Additionally, it supports diversified, rural communities and protects environment and animal welfare.
At the macroeconomic level, CAP aims to balance the participation of the parties involved in world trade, while protecting the internal markets. Additionally, the aspects through which to adjust the possible imbalances in the agricultural field are considered in the process of developing integration mechanisms for producers, which slows down the economic development.
The enlargement of the EU has led to increased pressure on measures to support and develop the agricultural sector, as net contributing states have become more reluctant to fund such programs. The main problem arises from the need to adapt the set of community policies to the reality of the Eastern European states.
In [1], the authors analyzed the aspects by which the CAP proves that it is not adapted to the way of organizing the agricultural sectors in the new member states of Central and Eastern Europe. The authors considered that the application of a financial support scheme, developed on the model of agricultural systems in Western European countries, is not an optimal solution for former communist countries, where historical heritage and the economic transition period have led to the configuration of significantly different agricultural systems, compared to those in Western Europe. The divergent trend of the distribution of financial allocations between net contributing states and those benefiting from the implementation of the Common Agricultural Policy generated debates in the run-up to the 2014–2020 programming period.
As presented in [2,3,4,5,6,7], CAP policies strongly influence national economies, with some authors [2] emphasizing that CAP plays one of the most important roles in the EU’s overall construction. In a recent study [3], while demonstrating that the partial equilibrium model (CAPRI) and the theory of moves (TOM) can be jointly used to test socio-political acceptance of the CAP’s reforms, the authors also emphasize that CAP is also one of the most controversial EU policy frameworks involving continuous reforms and adjustments.
To assess the performance of the CAP throughout the EU, the use of modern machine learning techniques, predicting certain scenarios, could represent a solution. Hence, modern technological software tools could be useful in providing a better understanding over what should be changed to make it more efficient.
For example, some authors [4] presented both the structure and results of a comprehensive model of European community agricultural markets and the associated CAP support mechanisms, identifying a public controversy over the large amount of public expenditure for the agriculture field under CAP, derived from how the farm systems are financed. A research paper [5] models the impact of policy measures within the CAP on farm production, income, and prices, as well as farmland biodiversity. The study [4] describes how CAP subsidies and regulations are conducting changes in land use. This methodology links the impacts of economic policy instruments with both changes in agricultural production, prices, and incomes, as well as with biodiversity effects, having land use and land-use intensity as the connecting drivers. The author uses the Common Agricultural Policy Regionalized Impact Modelling System (CAPRI) to conduct the economic analysis and Dyna-CLUE (Conversion of Land Use and its Effects) to model land use changes, defining an indicator expressing biodiversity in the agricultural areas.
A modelling procedure targeting the impact of CAP payments on the crop diversity of agricultural holdings can be found in another author’s research [6], which uses an econometric approach to identify irrigation, geographical location (including the soil quality), total crop area, and the legal structure of agricultural holdings as the most important factors in influencing the crop diversity and, conversely, considers the manager’s education, distance of the farm from the city, and membership in farmers’ associations as unimportant factors.
The author presents that some CAP payments influence crop diversity; specifically, single area payment scheme (SAPS) payments, displaying a negative effect, and payments for areas facing natural constraints were found to have positive effects.
In [7], Midler investigated the behavioral economic motives of the current policy approaches for integrating environmental objectives into the Common Agricultural Policy. The author analyzed the impact of several policy design features on farmers’ decisions to adopt sustainable agricultural practices: (i) framing of the policy: whether farmers perceive themselves as being part of the problem or the solution, (ii) degree of control: mandatory vs. voluntary policy, and (iii) framing of incentives as either losses or gains. All of the tested policy designs resulted in a significant increase in the hectares conserved, compared to a baseline scenario without policy. Moreover, behavioral factors do significantly affect farmers’ behavior at the individual level. Only framing is found to have a significant effect upon policy effectiveness
The current study complements the ideas presented above, by showing how the CAP impact on EU agricultural and rural environment development can be modelled by using machine learning modelling techniques.
Currently, machine learning (ML) and artificial intelligence techniques are successfully used to approach many agricultural economic subjects.
For example, some authors [8] emphasize that ML offers great potential for expanding the applied economist’s toolbox. There are many papers proving the ML potential in performing broad economic analysis [9,10,11,12,13] or for assessing different farm management strategies [14,15,16]. An important aspect, when considering the development of a ML model, is to create both a good predictive and interpretable model. For this, it is important to be able to measure the influence/effect of the explanatory variables on the resulting prediction. Thus, in studies, such as John [17], Fisher [18], and Kim [19], it is shown how the importance of each predictor variables can be assessed in various ML algorithms.
The ML model complexity depends on the studied phenomenon and agricultural economic; due to various economic, social, or biophysical processes, many of these phenomena are non-linear [20,21,22,23]. Econometric tools can provide accurate predictive non-linear models; however, they may not be equally interpretable as the ML ones. Thus, it is possible to accurately predict without understanding the existing feature relations.
Thus, in [24], the author emphasizes that it is important to identify the models able to capture non-linearities, while Storm [8] shows that ML models are flexible and useful for various contexts where econometric models fail to deliver relevant interpretability.
Several studies [8,25,26,27,28] describe three relevant methods for a non-linear problem; specifically, gradient boosting, trees ensemble (e.g., random forests (RF)), and neural networks. The first two methods are flexible for a large variety of tasks, as well as providing feature interpretability, while the third one provides good predictions but in the absence of parameters affecting interpretation.
In [29], the authors used ensemble methods for farmland rental rate predictions. Some authors [30] also use ensemble trees, respectively, random forests, to provide accurate prediction models for crop yield and enhancing food policies at the regional and global scales. Thus, in some research [30], RF models, used for predicting crop yield responses to climate and biophysical variables, at global and regional scales, in wheat, maize, and potato, were evaluated. The study’s results proved that random forests represent an effective method for predicting crop yield, at regional and global scales, due to its precision, ease of use, and utility in data analysis.
Choosing ensemble trees and generalized additive models over a neural network approach is also sustained in some research [25], a study that analyses the most important differences between neural networks and tree-based methods, such as random forest. Besides interpretability, the number of required samples for a feasible model is high in the case of neural networks. Furthermore, in several studies, the random forests outperformed the neural networks.
The research at hand can offer guidelines for CAP decision makers, especially in the context of the new EU financial perspective. Therefore, the aim is to provide an analytical and predictive framework based on state-of-the-art ML algorithms (random forest and generalized additive models) and multiple linear regression, in order to assess and improve the C.A.P impact on agricultural and rural environments development, as well as to identify and develop proper instruments. This can be used more effectively and efficiently by EU policy makers in Common Agricultural Policy financial management) and to offer suitable information necessary for developing agriculture supporting programs based on digital technologies, in order to offer continuously monitoring of rural, emissions, macroeconomic, and financial dimensions for assuring real time quality predictions. This study is substantially valuable, as it aims to explain how a series of individual indicators are interlinked with each other in an extended vision, considering each of the four mentioned dimensions. Thus, the identification of new prediction equations will offer the possibility to better evaluate the effects of CAP policies and to level up their positive impact, assuring monitoring and offering methods of control for each of the predicted parameter.

2. Materials and Methods

2.1. Dataset Description

To assess and improve the C.A.P impact on agricultural and rural environments and to identify proper instruments that can be used by the EU in Common Agricultural Policy financial management, the current study developed a machine learning analytical framework based on 22 parameters, as follows:
  • Gross domestic product per capita in rural areas—[GDP] (thousand dollars) (source: European Commission): the gross domestic product per capita, registered only for rural areas.
  • Gross value added—[GVA] (billion dollars) (source: theglobaleconomy.com): the total gross value added.
  • Gross value added for agriculture—[GVA_Agriculture] (billion dollars) (source: theglobaleconomy.com): the gross value added of the agriculture sector.
  • Gross value added in rural areas—[GVA_Rural] (billion dollars) (source: theglobaleconomy.com): the gross value added registered only for rural areas.
  • Percentage of gross value added for agriculture in total gross value added—[GVA_Percentage_Agriculture] (%) (source: theglobaleconomy.com): the percentage of the gross value added of the agriculture sector, from total values of gross value added.
  • Percentage of gross value added in rural areas in total gross value added (%) (source: theglobaleconomy.com): the percentage of the gross value added registered only for rural areas, from total values of gross value added.
  • Direct payments—[DP] (euros/ha) (source: European Commission): total value of direct payments.
  • Consumer price evolution of food products—[CPEFP] (%) (source: European Commission): the price change between the current and reference periods of an average basket of goods and services purchased by households.
  • Agricultural factor income per annual work unit in real terms—[AFI] (euros/annual work unit) (source: European Commission): the value of agricultural factor income.
  • Agriculture employment rate—[employment_agriculture] (%) (source: theglobaleconomy.com): the percentage of people employed in agriculture sector, from total working age population.
  • Rural employment rate—[RER] (%) (source: European Commission): the percentage of people employed in rural areas, from total working age population.
  • Rural employment rate for males—[RER_M] (%) (source: European Commission, 2017): the percentage of males, from total working population employed in rural areas.
  • Rural employment rate for females—[RER_F] (%) (source: European Commission): the percentage of females, from total working population employed in rural areas.
  • Degree of rural poverty—[DRP] (%) (source: European Commission): the weight of the average income in the agricultural sector, as compared to the wage in the whole economy.
  • Agricultural entrepreneurial income—[AEI] (euros/annual work unit) (source: European Commission): the total agricultural entrepreneurial income.
  • Agriculture research and development—[ARD] (mil. euros) (source: Eurostat database): government support for agriculture research and development.
  • Labour productivity in agriculture—[LPA] (euros/annual work unit) (source: European Commission, 2017): the labour productivity in agriculture sector.
  • Total factor productivity in agriculture—[TFP] (euros/annual work unit) (source: European Comission): total factor productivity within agricultural sector, as an efficiency indicator of agricultural systems.
  • Cereal crop yield—[Crop_Yield] (kg) (source: theglobaleconomy.com): the total cereal crop production.
  • Fertilizers use in agriculture—[Fertilizer_Use] (kg/ha) (source: theglobaleconomy.com): the quantity of fertilizers used within the agricultural sector.
  • Ammonia emissions from agriculture—[EA] (thousand tons) (source: European Commission, 2017): the amount of ammonia (NH3) emissions as a result of the agricultural production.
  • CO2 Emissions from agriculture—[CO2] (thousand tons) (source: Eurostat): the amount of CO2 emissions as a result of the agricultural production.
The financial period from 2014–2020 has not been completely reported yet; therefore, not all of the latest series of data are available (only some indicators from 2014–2016). Thus, the data used in the analytical framework of the present study were collected from the previous financial period (2007–2013). In addition, it is considered that, for the comparability and relevance of the present paper’s main aim, the research must be performed on data series covering an entire financial period, with respect to the availability of all the data needed to carry out the study for the 2007–2013 programming period.
The structure of the data set used in the present study is innovative, in terms of analysing the CAP impact over the EU agricultural and rural environment. However, the data set structure is generally defined by the aim of the analytical frameworks. Thus, since the present study targets to assess and, furthermore, improve the impact of CAP on the EU agriculture and rural environment, the main indicators, related to agriculture’s technical performance and productivity, as well as the environmental, economic, and social sustainability of the rural communities, were considered. By analysing the scientific literature, different studies applied various data set structure for determining various aspects of social, environmental, and economic dimensions, in relation to the agriculture sector [31,32,33,34,35,36,37,38,39,40,41].
Thus, in [31], the authors analyzed the land use impacts under CAP direct support, considering a series of indicators as agricultural land, forest, natural vegetation, build-up area, and abandoned land. Additionally, the analytical framework implied the evaluation of the CAP subsidy abolition impact, considering the following indicators values as a data set: agriculture GVA, pesticide use, N and P surplus, fossil energy demand area, fossil energy demand husbandry, methane emission, N2O emission, global warming potential, farmland birds, dead wood, and unemployment rate.
Another study [32] targeted the monitoring and evaluation of environmental objectives of the CAP, by considering farmland habitats and biodiversity and nitrogen pollution themes. Therefore, a series of indicators, such as the N inputs, grasslands mowing intensity, drainage of grasslands, land use changes, grasslands plant species richness, insects abundance and richness, area of agricultural habitats, area of semi-natural landscape elements, crop diversity, landscape openness, field size, connectivity, abundance and spatial distribution of targeted farmland species, crop pollination, natural pest control, contribution to biomass production, and cultural interactions with farmland biodiversity were used to enhance the CAP monitoring and evaluation framework, in terms of farmland habitats and biodiversity. At the same time, the N pollution analytical framework considered NH3 emissions to the atmosphere, net N surplus, N load to surface water, N load to deep groundwater, N concentration in shallow groundwater in agro-ecosystems, N concentration in drinking water wells, N concentration in (semi-)natural aquatic ecosystems, water transparency, phytoplankton growth, NH3 concentration in the atmosphere, PM concentration in the atmosphere, wetlands and (semi-)natural riparian vegetation, N retention, and recreation and tourism, in order to contribute to the effectiveness of the assessments of both agri-environmental measures and results-based payments to farmers.
Some authors [33] combined factor analysis and linear regression models to link the local determinants of development of agriculture and rural areas with the frequency of the applications for CAP rural development measures. Thus, a number of 59 indexes were considered, which covers different dimensions of regional features of agriculture and rural areas (organic agriculture, demography, animal production, entrepreneurship, and agricultural land greening). Thus, among the important data set used variables, the following can be highlighted: farm size, share of organic farms, permanent grassland within agriculture land, agriculture production space, share of pig, cattle, and poultry farms, NPK fertilizers use, length of water supply network, real population increase, and number of children (in comparison to elderly people).
Other studies [34] focused on investigating the relationship between EU agricultural subsidies and the outflow of labor from agriculture and identified that CAP subsidies reduced the outflow of labor from agriculture, due to decoupled Pillar I payments, while coupled Pillar I payments had no impact on reducing labor outflow from agriculture. However, in order to reveal those findings, the following data set components were adopted: growth rate, non-agricultural/agricultural GVA per worker, population density, unemployment rate, family farm labor force, regional GDP, subsidy rates (includes total CAP payments, Pillar I payments/coupled payments/decoupled payments, Pillar II payments/human, and physical capital/agri-environmental).
Some research [35] targeted the sustainability concept of agriculture that was included in a complex analytical framework, which considered three dimensions (economic, social, and environmental). Thus, the used methodology focused on a series of variables, such as the income of agricultural producers, contribution of agriculture to GDP, insured area, agricultural employment, stability of work-force, risk of abandonment of agricultural activity, economic dependence on agricultural activity, N and P inputs, pesticide risk, use of irrigation water, energy balance, and agri-environmental subsidy areas.
Other authors [36] had focused on analytical frameworks that targeted the identification of possible mechanisms that could be used for monitoring and controlling agriculture employment. These were variables, such as agricultural production, crop production, animal production, employment in agriculture, rural population, labor demand in agriculture, wage in agriculture, agriculture GDP, and the net export of agricultural products were used.
However, there were studies [37] that focused on the identification of various perspectives that could be used in the elaboration of agriculture economic sustainable development strategies. Thus, in order to achieve this, variables, such as agriculture credits, agriculture GDP, agriculture GVA, farms subsidies, and agriculture production were considered the main variables [37].
The structure of the data set used to generate the analytical framework was highly dependent on the specific aim of the study. Thus, a framework that targeted the analysis agriculture environmental performance could include data related to greenhouse gas emissions, soil, carbon, or pests [38]. However, if targeting yield prediction, in order to optimize marketing strategies [39], variables such as annual rainfall, food price index, and cultivation area must be considered.
Agriculture is also involved in the rural youth (not in employment, education, or training—NEETs) dimension, and several authors [40] developed frameworks involving variables, such as youth population (aged 15–24), as well as the share of the total population, labor productivity, GVA, number of employed people, or risk of poverty rate, in order to control the social impact of agriculture sector on rural areas. Additionally, other authors [41] used linear regression models to determine whether the rural NEET indicator depends on other factors such as rural unemployment, rural employment, GDP, or rural demography.
The developed analytical/predictive framework, based on modeling the data synthetically, presented in Table 1, provides insights into various perspectives that can indicate the proper instruments to be used by the EU to assess and improve the CAP impact on the agricultural sector and rural environment.
Therefore, to identify the economic importance of the agricultural sector and rural environment, the GDP per capita in rural areas, total GVA, the GVA of the agriculture sector, and GVA registered (only for rural areas) were included among the analyzed parameters of the present study. GDP per capita describes the economic output from the consumers side, while GVA elucidates the state of economic activity from the producers’ or supply side [37,42].
A fundamental objective of the EU CAP (Article 39 of the Treaty on the Functioning of the European Union, TFEU) is to increase agricultural productivity. This will lead to fair standard of living for the agricultural communities, increasing their individual earnings.
According to Prado [43], in the agricultural sector, specifically rural areas, the economic situation is strongly related to sustainability, as sustainable agriculture should come from an agricultural production that is economically viable and does not degrade the environment in the long run. Additionally, social sustainability, a multi-dimensional concept, includes the quality of life, revealed especially by the employment rate. Thus, in order to characterize the agriculture sector and rural areas, in terms of sustainability (economic, environmental, and social sustainability), the following indicators were considered: agriculture employment rate, rural employment rate, rural employment rate for males, rural employment rate for females, degree of rural poverty, agriculture research and development, consumer price evolution of food products, labor productivity in agriculture, total factor productivity in agriculture, cereal crop yield, fertilizers use in agriculture, ammonia emissions from agriculture, and CO2 Emissions from agriculture.
Additionally, farmer income is characterized as a key element in EU agriculture policy. Thus, this paper’s analytical framework considers the value of direct payments, agricultural factor income, and agricultural entrepreneurial income as key indicators in evaluating farmer income.

2.2. Development Technologies

The current predictive framework, developed for analyzing the CAP impact over the EU agricultural and rural environment, was implemented by using the Python and R programming languages, with various specific libraries: Seaborn, SciPy, Folium, Choropleth maps, and scikit-learn, based on the technological flow presented in Figure 1.
Seaborn library uses the main matplotlib graphic system, which offers statistical graphics, an API for viewing existing relationships between variables, linear regression models plotting, support for factor variables, univariate/bivariate distributions visualization, and color palettes for identifying data patterns. SciPy is a component of an entire Python ecosystem for engineering, mathematics, and science, comprised of six core libraries: NumPy (base N-dimensional array package), SciPy (library for scientific computing), Matplotlib (2-D plotting), IPython (an enhanced interactive console), SymPy (symbolic mathematics), and pandas (data structures & analysis). The SciPy library that was used is a core package of the SciPy stack, providing user-friendly numerical routines for integration, interpolation, optimization, linear algebra, and statistics. The research analysis used specifically the functions used to minimize objective functions for nonlinear curve fitting problems. Folium represents a map visualization library built on top the Python ecosystem, as well as the mapping strengths of the leaflet.js library. As such, Folium provided data visualization using the interactive leaflet maps. Additionally, it enabled both the binding of data to a map for choropleth visualizations, as well as passing rich vector/raster/HTML visualizations as markers on the map. The library has a number of built-in tilesets from OpenStreetMap, Mapbox, and Stamen, and it supports custom tilesets with the Mapbox or Cloudmade API keys. Folium supports the image, video, GeoJSON, and TopoJSON overlays. The choropleth map is a type of thematic map with shaded/patterned areas in proportion to various statistical variables that represent aggregating summaries of a geographic characteristics within each area, such as population density or per-capita income. By using choropleth maps, how a measurement varies across a geographic area can easily be identified or the level of variability within a region can be shown. These types of maps are most used by governmental bodies for publishing statistical data that describes well-known geographic units, such as countries, states, provinces, and counties. The scikit-learn represents one of the Python libraries suites for predictive data analysis, based on NumPy, SciPy, and matplotlib libraries, offering various algorithm implementations (such as ensemble trees and random forests) for a variety of situations: regression, classification, clustering, dimensionality reduction, model selection, and pre-processing,

2.3. Linear Modelling—Multiple Linear Regression and Model Selection

The current research involves two approaches towards data modelling: linear and non-linear. The first was based on a multiple linear regression approach for all the linear cases, providing, in addition, a quantification of the existing relationship between the dependent variable and various predictors. The dataset was used in fitting linear models, according to a linear equation, as presented below:
Y = 1 X 1 + 2 X 2 + + p X p + β + e
In the above equation, Y is the dependent variable, while X1, X2, …, Xp are the p independent variables (predictors). The β represents the intercept indicating the Y value. When all the predictors are zeros, α1, α2, …, αp are the coefficients of the predictors that reflect the contribution of each independent variable in predicting the dependent variable; e is the residual term indicating the difference between the actual and fitted response values.
For determining the optimal linear models, we used techniques such as stepwise regression and best subsets selection. Both were used to identify useful predictors, in an OLS (ordinary least regression) context. Stepwise regression adds or removes individual predictors, one at a time, based on their statistical significance, while stepwise technique is based on the significance level that determines if a parameter is accepted in the model or not. ‘Best subsets’ are methods which compare all possible models, starting with a specified set of predictors and displaying the best fitting models that contains one, two, or all the predictors. The result is not a single model but several models that must be compared and chosen among them. As can be observed in Figure 2, the ‘best subsets’ method requires some interpretation for choosing the model with the highest R2 for one predictor, two predictors, and so on.
Selecting the final models required, as well as the residual plots and Durbin–Watson interpretation (Figure 3), to ensure the model meets all criteria. Thus, Appendix B also presents the parameter coefficients (each with a p-value less than 0.05 and variance inflation factor (VIF) <10) of each identified multiple linear regression models.
By using the residuals histogram, the assumption that the residuals are normally distributed is checked, the ‘residuals versus fits’ checks the constant variance of the residuals, and the residuals versus order of data verifies that the residuals are uncorrelated.
The presence of autocorrelation in the errors of a regression model is verified with the Durbin–Watson statistical test, used to identify if adjacent observations present correlated errors. If correlated, an ordinary least squares (OLS) approach can underestimate the standard error of the coefficients, making the predictors seem more significant than they really are.
Thus, for identifying the best model, a series of best subsets regression related metrics were analyzed, as follows: R-sq, R-sq (adj), R-sq (pred), Mallows’ Cp, and S value. The R-sq represents the percentage of variation in the response that is explained by the model. A higher R-sq value means a better data fitting. The R-sq (0–100) will increase when adding additional predictors. As such, R-sq was used only for comparing models of the same size. Small samples do not offer good precision estimation, regarding the strength of the relationship between response and predictors, which was considered. Nevertheless, large samples (163) were used, making the assumptions reliable. Even when a model has a high R2, the residual plots ought to be checked to verify that the model meets the model assumptions. The R-sq (adj) was used to compare models with different numbers of predictors. If R-sq increases when adding predictors, without model improvements, the R-sq (adj) incorporates the number of predictors in the model. The R-sq (pred) was used to have a glimpse in regard to how well our models could predict the response for new observations. A high R-sq (pred) implies a model with better predictive abilities. Additionally, we considered that an R-sq (pred) much lower than R-sq could indicate over-fitting of the model, when unimportant predictors were added. We consider that R-sq (pred) is more useful than R-sq (adj) (when comparing models), as its value is based on observations that were not included in the model calculation. The Mallows’ Cp represents a value obtained by comparing the precision and bias of the full model to models with the best subsets of predictors. The Mallows’ Cp value was used to differentiate among different models, by choosing the one that is closer to the predictors’ number, plus the constant, indicating that the model produces precise and unbiased estimates. The S value was used to assess how well the model describes the response. An important aspect is that the S value is measured in the units of the response variable, representing how far the actual data are related with the fitted values. The lower the value of S, the better the model describes the response.

2.4. Non-Linear Modelling—Random Forest (RF) and Generalized Additive Models (GAM)

The second approach was needed for all the situations where linear models could not significantly describe and predict the relation between different parameters. Moving into the non-linearity data realm, we used tree-based ensemble learning models (random forests) and GAM models, two methods that provided excellent predicted results and parameter importance insights. When developing ML models, it is important to select the optimal features to be used by the data modeling process. This will lead to a better performance of both the supervised and unsupervised learning approaches and reduced calculation costs involved by training phases; additionally, in the case of high-dimensional input, it will handle dimensionality curse. Feature importance scores can be also used for model interpretation. Selecting the optimal features is even more important when the number of features is high. Usually, it is not necessary to use each available feature in developing a predictive algorithm, as the model should be built mainly with the important features that are able to explain dependent variable variance. The feature selection techniques were used in the current research to reduce the complexity of the model, provide an interpretation for the predictors (in relation to the dependent variable), model accuracy improvement, and reduce the overfit.
The RF represents an ensemble learning method that evolved from decision trees. As described in another paper [44], a RF consists of many decision trees, where each of the trees provides a classifier/regressor for the input data. A RF will collect each tree classification, and it will choose the one that was most voted as the result. Each tree uses sampled data from the original dataset as input, in which a subset of features is randomly selected from the optional features to grow the tree at each node based on one selected input feature value. The RF uses two concepts, respectively, bagging and random selection of features, producing a number of regression trees, with the input data selected through bootstrap sampling and the features a random subset of the original features.
The RF regression is a part of the larger ensemble learning model’s suite, as it is based on ensembles of T un-pruned regression trees, generated based on bootstrap sampling from the original training data. At each node, the optimal node splitting feature is selected from a set of m features that are randomly picked from the M features. For mM, the correlation between the different trees is significantly reduced and, thus, the average response of multiple regression trees is expected to have lower variance than individual regression trees.
Larger m improves the predictive capabilities of individual trees, but usually leads to an increased correlation between trees. The bootstrap resampling of the data for training each tree also increases the independence between the trees.
Even if the RF models are able to offer high accuracy predictions, the interpretability of the generated model is not as easy to accomplish as it is with linear regression models. An ensemble of regression trees is hard to analyze, while linear regression models provide straightforward interpretability of individual features through the regression coefficients.
However, an additional analysis of the generated RF models can assess the importance of the individual features. Determining variable (feature) importance with a random forest model can be performed in several ways. The first is the naive approach, where variable importance assignment is based on counting the number of times the xm variable has been selected by all the trees in the ensemble. This approach could prove problematic, as it does not incorporate the effect of the variable selection for each tree. The mean decrease impurity method evaluates the importance of the variable xm by averaging the weighted reduction in cost for all nodes where xm is selected, over all T trees in the forest (Gini importance). The permutation importance method, proposed by other authors [45], is based on measuring the decrease in the prediction accuracy when the values of xm are randomly permuted in the out-of-bag samples. Permutation importance performs better than the naive and mean decrease impurity approaches but can be extremely expensive, computationally.
According to Probst et al. [46], the RF algorithm has several hyperparameters that must be set by the user, such as the number of variables for each split, number of observations drawn randomly for each tree, the splitting rule, minimum number of samples for a node, and number of trees. The RF works well even with the default values, still a hyperparameter tuning could lead to an improvement of its predictive power [44,47,48].
It is important to mention that a RF approach is usually used for two different purposes when assessing their contribution to the prediction of the response variable: the construction of accurate classification or regression rules (intended for usage as prediction tools for future data) and investigation of the relevance of predictor variables for the prediction problem.
The current research uses the random forest approach to provide predictive models both for quantifying the CAP impact on the EU agricultural and rural environment and to identify the features importance, outside the linear regression scenarios.
The RF hyperparameters can be tuned either to enhance the predictive power of the model or to reduce the training times. There are three user-available features that can be fine-tuned for improving the model predictive power: max_features, n_estimators, and min_sample_leaf. The max_features represent the maximum number of features that are to be tried in individual tree. Assigning the maximum number of features can be achieved via the following options: Auto/None—take all the features making sense in every tree without restrictions; sqrt—take square root of the total number of features for an individual run; and 0.2—option that permits using 20% of variables in individual run. Because each node will have many options to consider, the overall performance of the model will likely increase. Still, the algorithm speed will decrease, so an optimal balance should be achieved. The n_estimators is the number of trees you want to build before taking the maximum voting or averages of predictions. A higher number of trees give you better performance but makes your code slower. High values of this hyperparameter leads to stronger and more stable predictions. The min_sample_leaf is a leaf that represents the end node of a decision tree. A smaller leaf makes the model more prone to capturing noise in train data. Usually, a leaf size should be more than 50; still multiple leaf sizes should be tried to find the optimum value.
While using a RF regression approach, there are also several parameters that could be fine-tuned to increase the model training speed. For example, n_jobs tells the engine how many processors it is allowed to use. A value of “−1” means there is no restriction, whereas a value of “1” means it can only use one processor. Another group of parameters is random_state, a parameter used for solution replication. A definite value of random_state will always produce the same results if given with same parameters and training data. The third group of parameters is oob_score, the standard RF cross validation method, similar to leave one out validation technique, still much faster.
As mentioned above, besides using RF regressors as a non-linear regression method, the purpose of our research was also to use random forests in determining parameter importance for each regression case. Determining what features are important for inferring the dependent variable, even if it does not provide numeric interpretability, would still be useful, as it is possible to pinpoint the most important variables to be known when predicting the value of another one. Thus, the endeavor of the study was to identify the models that would include the most important features. There are three main benefits of this approach: the model will be easier to interpret, the variance of the model will be reduced (and, therefore, the general overfitting), and the computational cost (and time) of training a model will be reduced. Because tree-based strategies, used by RF naturally, rank by how well they improve the purity of the node, RF are often used to perform feature selection by calculating the decrease in impurity over all trees (gini impurity). Python libraries (scikit-learn) can be used to determine the feature importance based on the gini impurity or by permutation importance (our current research), where each feature importance is calculated by identifying its effect over the model accuracy, while randomly shuffling the predictor variables. According to Breiman [49], the permutation importance technique is most appropriate, as it does not rely on internal model parameters. As Breiman shows, the more accurate a model is, the more relevant the process of the feature selection is. When using a permutation algorithm, the importance of a feature derives from analyzing R-sq values (regressor) by using a validation set of samples through the RF. The algorithm permutes the column values of a single predictor feature and then passes all test samples back through the RF and recomputes the R-sq. The feature importance represents the difference between the baseline and the drop in R-sq values caused by permuting the column. There is also a caveat, which is that the permutation algorithm is computationally expensive, compared with the decrease in impurity mechanism, but the results are more reliable.
For measuring the performance of a random forest model, the current study followed several steps, as presented above in Figure 1.
According to Figure 1, the first step was to define and implement the accuracy function that calculates the MAPE (mean absolute percentage value) metric, the accuracy being defined as 100-MAPE value. Subsequently, the remaining steps aimed to maximize the accuracy value and, for that, the base model accuracy (the model with the default hyperparameter values), and its ten most important feature weights were identified (Appendix A, Table A1, Table A2, Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11, Table A12, Table A13, Table A14, Table A15, Table A16 and Table A17) afterwards, based on two hyperparameter tuning techniques (Randomized Search CV, Grid Search), to establish the best-fitted model. Grid Search and Randomized Search are the two most popular methods for the hyper-parameter optimization of any model. In both cases, the aim is to test a set of parameters whose range has been specified by the users and observe the outcome, in terms of the metric used (accuracy, in our case).
Randomized search represents a tuning technique where random combinations of the hyperparameters are used to find the best solution for the built model. It is similar to grid search, yet, has proven to yield better results, comparatively.
On the other hand, grid search attempts to compute the optimum values of hyperparameters, from a list of provided values. It is an exhaustive search that is performed on specific parameter values of a model.
Importantly, the model’s validation was not only performed based on the test data set, which is normally 20–30% of the original dataset, but also on parameter values coming from the next year (year 2017), which is totally unknown data for the models.
For several non-linear cases with a lower RF accuracy, this research used a generalized additive modelling approach (GAM). A big advantage of GAM models stems from its interpretability, the contribution of each predictor being clearly presented. A GAM model, instead of considering the relationship between predictors and dependent variable as simple weighted sum, models the outcome as a sum of arbitrary functions of each feature by replacing the beta coefficients from linear regression, with flexible functions (splines) that allow for nonlinear relationships to be modeled.
The GAM modeling [50] was used, so far, in several studies related to agricultural activities/agricultural economic. For example, it has been applied for problems targeting ecology, land allocation, and climatology [51,52,53], as well as for predicting field-level agricultural traits, such as wheat yield [54] or pest use assessment [55].

3. Results and Discussion

3.1. Comparative Analysis between EU Countries Considering Parameters within All Four Main Dimensions Used for CAP Analysis: Rural, Emissions, Macroeconomic, and Financial

The analysis of the total GVA, the GVA of the agriculture sector and the GVA registered for rural areas, revealed significant disparities between EU countries (Figure 4). Thus, in countries, such as BE, BG, EE, HU, LT, and LV, the GVA of both rural areas and agriculture have a low representativeness of total GVA value (Figure A1). This emphasizes that, for these states, significant future changes, in terms of agricultural policy, will probably be observed at the level of national economy, over a long-term period. Additionally, in most cases, the GVA registered for rural areas exceeds the agriculture sector GVA (AT, DE, EL, FI, FR, IT, PL, RO, PT, SE, SI, and UK), a result which emphasizes that the rural areas are not entirely dependent on agricultural activities (Figure 4). The differences between GVA values among all analyzed countries were significant. This encourages the establishment of a general EU analytical framework, which considers the peculiarities of each of the member countries. The primary sector GVA (agriculture, forestry, and fishing) influence over the national economy is considered as moderate (Figure 4) for most EU countries.
The lowest values of GDP per capita, registered only for rural areas, were recorded in Eastern European countries (Figure A1). However, Eastern European countries register a superior percentage of the GVA of the agricultural sector, in relation to the total GVA (Figure A2). Furthermore, a high percentage of GVA, registered only for rural areas, in relation to total GVA, were recorded among Eastern and Central European countries (Figure A3).
Considering the situation of several Western EU countries, it can be stated that, although the percentage of rural GVA is high, the agriculture GVA is relatively low, in relation to the total GVA, emphasizing that not all EU countries’ rural economies are based on the performance of the agricultural sector.
The best results, in terms of cereal crop yield are, generally, recorded in Central and Western EU member countries (Figure A4). This can be due to crop production systems intensity and the efficiency of culture technologies, which may increase crop yields.
Also, in [56], it is confirmed that the land productivity of European agriculture is high, especially in Western Europe. Thus, the potential to enhance cereal production in Europe through the reduction of the yield gap is largest in Eastern Europe, where approximately half of the production increase could occur in Romania and Poland [57]. However, according to Coelli [58], yield alone as a measure of productivity, provides misleading indications of the degree of productivity improvement in agriculture. According to the International Food Policy Research Institute (IFPRI), the total factor productivity within the agriculture sector is considered an efficiency indicator of agricultural systems. In [59], TFP was characterized as a measure of productivity, efficiency, and technological change. The EU member countries TFP in agriculture sector (Figure A5) revealed generally higher values for Eastern European countries. Those results (Figure A5), correlated with cereal crop yield situation (Figure A4), may be explained by low production factors prices recorded in Eastern Europe, compared to Western Europe countries. Additionally, in [60], the authors showed that, while GDP, employment rate, and capital stock utilization decreased, the TFP growth held up relatively well. Other studies [61,62,63] emphasized the positive influence of technical and technological progress on TFP growth. However, according to some studies [52], the TFP must be correlated with capital stock to limit the impact on capacity utilization, while unemployment rates reduce the contribution of capital utilization. The estimates of TFP growth will incorporate changes in capacity utilization if the capital stock is not adjusted to match changes in the utilization of it. According to Eaae [50], the TFP growth reduces employment of unskilled labor, as well as food prices. Thus, the increase of TFP, because of the implementation of new technologies, will increase farmers profit, agriculture areas, agriculture demand for labor, and will develop agriculture-related industries. According to other authors [64], differences in the productivity between EU members are the result of differentiated levels of support, mainly due to DP, since in the new member states, the yield level is lower; hence, the rate of subsidies was also lower. Therefore, CAP must target a more even distribution of DP in the new financial perspective for 2021–2026.
Agriculture is a key sector for Eastern European countries and for Portugal, since it covers a significant percentage of total employment rate (Figure A6). This reveals the importance of agriculture for regional economic development, also considering the sectors closely associated with farming, such as food processing, services, retail trade, education, and research. However, considering the rural employment rate situation (Figure A7), it can be that North and Western European countries registered higher values, compared to the rest of EU countries.
This may emphasize that the rural communities, from Northern and Western European countries, have a wider range of employment variants towards other economic sectors, outside of agriculture. This situation may be also justified by the technical progress achieved in agriculture sector, that can assure large agriculture production with low number of employees.
However, regarding employment, the EU primary sector can be even more important, rather than from the productivity point of view. The EU labor productivity in the agriculture sector (Figure A8) registered low values in Eastern EU countries, compared to the rest of member countries. According to previous studies (Dorward, 2013), agricultural labour productivity plays a fundamental role within wider economic development processes, since higher agricultural labour productivity releases agricultural labour from food production to production of other goods and services.
However, according to a previous study [65], the agricultural labour productivity may be underestimated if rural agricultural workers obtain substantial proportions of their incomes from non-farm activities.
High labour and land productivity are also related to fertilizer productivity. Thus, a high fertilizer use will lead to high labour productivity and, respectively, high income. Therefore, several studies [66,67,68] emphasized the need for specific attention to possible challenges that may appear when targeting agricultural labour productivity. However, a high agricultural labour productivity will generate better economic development and also, will decrease food prices.
Considering the quantity of fertilizers used in the agricultural sector by the EU countries (Figure A14), it can be observed low values in Eastern EU countries, except Poland, compared to the rest of member countries.
Thus, this may be considered as a possible influencing factor for low labor productivity results, recorder in Eastern EU countries. However, since agriculture is characterized as a climate sensitivity sector [69], the impact of climate change can be linked to fertilizers administration intensity, applied in each of the EU countries. Thus, to maintain high crop production, considering present climate change effects, the intensity of fertilizers will increase.
The situation related to fertilizers used in the agricultural sector for each of the EU member countries is generally synchronized with both the level of ammonia (NH3) (Figure A14) and CO2 (Figure A15) emissions, because of the agricultural production.
However, from the Eastern EU countries, Poland and Romania report the highest amount of ammonia (NH3) emissions, but even so, significantly less than France, Germany, Italy, and Spain (Figure A14). Poland registered the highest amount of CO2 emissions (Figure A15) from the Eastern EU countries, akin to the fertilizer situation (Figure A9).
Regarding Ireland, the fertilizer usage value is considerably higher (Figure A9), although both the ammonia (NH3) (Figure A10) and CO2 (Figure A11) emissions registered low values. This may indicate a possible better sustainable fertilizer use technology, compared to rest of the EU countries.
Direct payments are considered the main instruments, which support farm income and, therefore, contribute to rural vitality. However, according to the European Commission, despite the increase in size and the improvement of productivity, many farms depend heavily on direct payments. However, concomitantly, direct payments ensure sustainable farming through economic viability.
The largest amount of direct payments is attracted by agriculture farmers from Western EU countries, such as Germany, France, Spain, Italy, and United Kingdom (Figure A12). In Eastern EU countries, Poland and Greece records the highest values of direct payments (Figure A13).
This situation is generally reflected by the registered status of agricultural entrepreneurial income, where Eastern EU countries registered the lowest results (Figure A13). The direct payments support farm income and also, eliminates possible uncertainties related to agriculture economic activity.
Thus, a possible increase of these countries capacity to attract direct payments will possibly lead to more consistent agricultural entrepreneurial income, increase agricultural entrepreneurial market competitiveness, and facilitate the intention of farmers to adapt, considering present situation related to climate change. Additionally, ensuring a fair income for farmers is one of the objectives proposed in the CAP reform.
According to [70], economic development theorists and practitioners have long focused attention on the question of the impact of food prices on poverty in developing countries. However, long periods of low agriculture prices can reduce return and increase rural poverty. Additionally, in low-income countries, heavy taxation of agriculture through lower commodity prices might have contributed to extensive rural poverty [71]. However, it is difficult to estimate the effect of changes in food prices over poor households, since this is influenced by the level of dependence on agriculture production as a source of income.
Both the degree of rural poverty (Figure A14) and the consumer price evolution of food products (Figure A15) records the highest values, generally, in Eastern EU countries. This reveals that the rural economy is strongly related to agricultural production, as high food prices may affect affordability of consumption basket components, return of agriculture activity, as well as labor demand and wages in the agriculture sector.
The highest differences between rural employment rate for males versus females are recorded in Italy and Greece (Figure A16 and Figure A17). However, the agriculture employment rate is generally influenced by the economic performances of the sector. Thus, in order to improve production and performance and to decrease poverty, research and development activities are important. A previous study [72] emphasized that the returns to agricultural research and development have been large. The underinvestment in public research and development activities can eventually lead to the impossibility of satisfying of aggregate demand for food, increasing food prices and rural poverty. In Eastern EU, the research and development investments are considerable low, compared to Western EU countries (Figure A18). If public investments in research and development will increase in Eastern EU, these countries will growth in importance as producers of agricultural innovations and will improve their agriculture production performance.
The current research developed analytical framework took into consideration four main dimensions for describing the intricacies of the EU Agricultural and Rural Environment in the CAP context: (a) rural, (b) emissions, (c) macroeconomic, and (d) financial.
Each dimension is described by a set of parameters, respectively a set of predictive models. The developed models aimed to accurately predict the targeted parameters, but also to provide a glimpse in formally understanding, which are the most relevant parameters when determining targeted indicators.

3.2. The ‘Rural’ Dimension

The rural dimension is described by the following six parameters: DRP, GVA_Rural, RER_M, RER_F, RER, and GDP (rural GDP per capita). Predicting these parameters involved the development of both linear models and non-linear random forest models as not all parameters could be described through linear regression modelling.
According to the Ministry of Public Works, Development and Administration—MLPDA (Strategy for the development of the agri-food sector in the medium and long term 2020–2030), the level of national incomes in rural areas represents approximately 16.6% of the European average, in the context of an average income ranked lower than the European one.
The phenomenon of rural poverty can be explained in terms of the advanced age of the rural population, a low degree of employment, a limited employment opportunity and the low wages. An important influence on the level of rural poverty has the investments in the agricultural sector, through European and national financing instruments, also, the research—development activity is stimulated by the European regional development policies. Direct payments could positively stimulate the rural economic development, the GDP growth, with implicit effects on reducing the rural degree of poverty.
The degree of rural poverty is an important aspect to be understood, its reduction bringing benefits for the deceleration of the rural-urban migration process, conducting to a lower pressure on government to provide additional spending on services, such as sanitation, health, and education in urban areas.
In this research, it was possible to model the degree of rural poverty indicator through linear modelling, by using best subset selection technique to choose the set of independent variables. The equation describing the DRP indicator is the following:
L n D R P = 9.07 + 0.22 L n E m p l o y m e n t   A g r i c u l t u r e 0.13 L n G V A   A g r i c u l t u r e + 0.07   L n D P 0.57 L n G D P   C a p i t a   R u r a l + 0.13 L n   A R D 1.11 L n R E R
The linear regression model presented above shows good accuracy metrics, having the S-value at 0.18, the adjusted R-sq at 77.99% and the predicted R-sq at 76.66%. The result of the analysis shows that the degree of rural poverty decreases with the increase of: The Gross Domestic Product per capita in rural areas (−0.57%), the employment rate (−1.11%), the of the gross value added (−0.13%), with the values of 0.57, 1.11, respectively 0.13%.
It can be noticed that the ‘Rural Employment’ parameter was identified as an important predictor of the model, an increase of its value by 1% resulted in a 1.11% decrease in the rural poverty Equation (2). Additionally, gross domestic product per capita is considered the second parameter, regarding its importance in determining the degree of rural poverty Equation (2). Thus, this emphasizes that a sustainable economic growth will be achieved by increasing both gross domestic product per capita and gross value added of the agriculture sector Equation (2), since they significantly influence the degree of rural poverty Equation (2). Therefore, rural poverty is responsive to economic growth in rural areas. Dollar and Kraay (2001) [73] emphasized that economic growth may reduce poverty by raising the incomes of everyone in society, including the poor. Additionally, another study [74] revealed the importance of economic growth in reducing poverty, pointing out, also, the importance of the inequality variable.
In [75], the authors, after analyzing the determinants of rural poverty in 32 developing countries, also identified that the gross agricultural value added, and rural employment are important factors contributing to alleviating poverty in rural areas. Besides these two parameters, the authors also identified the government policies for reducing systemic discrimination against ethnic minorities, the encouraging tourism where possible and the programs designed to assist the irrigation of croplands as important variables that influence rural poverty degree. However, considering Equation (2), the agriculture research and development and agriculture employment rate negatively influence the degree of rural poverty. This may be due to bad agriculture research management, which targets the fundamental area, rather than applicative area. Additionally, a high employment rate in the agricultural sector can decrease the average income, as well as labor productivity. A poor agriculture research management will decrease the productivity because of low degree of technical and technological development and, therefore, increase the employment rate.
The reduced DP importance can be justified by the fact that CAP does not clearly define their amount for each region, according to a large framework of specifications. This is mentioned by other authors [76], which underline those specific guidelines should be define separately, for each geographic region, to increase the efficiency of CAP towards a better sustainable development. Other studies [77] emphasize that agricultural payment tools improve the production capacity of cultivated land, a situation which will lead to the increase of labor productivity and a decrease of RER. Thus, models that imply payment tools should be considered together with flexible policies, which can relax the decision-making process related to various specific aspects within limited time-periods. However, there are studies [77] which emphasize that payment tools should be used together with regulatory tools in order to establish a hybrid system, which aims to maximize the positive impact.
According to [78,79], over 80% of farms in the EU are represented by family farms; therefore, DP are being infused into local economies on a much higher scale, compared to the financial support received by industrial corporations, which in a form of dividends may be allocated out of EU. Therefore, DP can be considered as tools of fostering the national economic growth and can accentuate the differences between the EU member states. Hence, prediction models, which imply the DP as an independent variable, are important to for considering the new CAP programming period, since, according to other authors [80], they are considered the main funding issue and, therefore, creating fairer criteria for the distribution of financial envelopes of DP between the EU member states is proprietary. Thus, prediction models will offer superior performance, compared to classical arithmetic-based rules standardized by EC [81].
GDP per capita in rural areas shows how much economic production value can be attributed to each individual citizen living in rural settlements, being also a prosperity measure, since it can be translated as a measure of wealth. The GDP per capita in rural areas is an important parameter to be measured and forecasted. Indeed, it is also part of the explanation why many rural populations flow to non-agricultural sectors and urban areas [82].
For the current study, the ‘GDP per capita in rural areas’ (GDP) indicator modelled using a multiple linear regression, with the following obtained model:
L n G D P = 7.18 + 0.08   L n F e r t i l i z e r   U s e + 0.18 L n E m p l o y m e n t   A g r i c u l t u r e 0.17   L n C P E F P + 0.18 L n L P A 0.12 L n   E A + 0.16 L n A R D 0.37 L n D R P
The above model has good accuracy metrics: S value 0.14, adjusted R-sq 87.19%, and predicted R-Sq 85.84%. The model shows how the set variables influence the rural Gross Domestic Product per capita in rural areas.
Thus, a positive impact for the use of fertilizers in agriculture (0.08%), the employment agriculture rate (0.18%), the labor productivity in agriculture (0.18%), and the agriculture research and development (0.16%) variables, in addition to a negative impact of the consumer price evolution of food products (0.17%), ammonia emissions from agriculture (0.12%), and the degree of rural poverty (0.37%) variables on the gross domestic Product per capita in rural areas, can be observed.
The rural GDP per capita is influenced by the size of employment rate in agriculture—Equation (3). With the increase in the number of people employed in rural areas, there is also an increase in the value of rural GDP and, implicitly, a reduction in the degree of rural poverty. Several authors [64,83,84,85,86] have emphasized the importance of increasing employment in rural areas and encourage CAP to increase this initiative as a remedy for social exclusion, depopulation of these areas, and the income gap, in order to achieve high sustainable agriculture. Additionally, according to Equation (3), the ARD and EA directly influence the GDP situation, which can confirm the finding of other authors [87,88,89] related to the relation between human capital, as well as its qualification and education and income of agricultural population.
The Agriculture Research and Development activities, strongly stimulated and financed by the EU, in the period between 2014–2020, together with efficient and well-implemented public policies and good agricultural practices will stimulate the efforts for obtaining new technologies, optimal production systems, in relation to local characteristics, the know-how transfer, promoting cooperation and innovation, increased agricultural productivity (influenced also by the efficient use of fertilizers), and implicitly the increase of rural GDP per capita.
However, considering the relationship between GDP per capita and fertilizer use, as well as ammonia emissions from agriculture, it seems that environmental sustainability significantly influences the gross domestic product per capita. Thus, the use of fertilizers promotes the increase of crop productivity and, therefore, the economic growth. This can only be possible if sustainable fertilizer administration technologies are used (ammonia depth soil injections), limiting, therefore, ammonia emissions.
The rural GDP formation is also conditioned by the consumer price evolution of food products. While incomes of the rural population remain constant and the prices increase (regardless of the form of inflation, i.e., increases in production costs or demand for products and services), consumption automatically decreases, as well as the incomes of companies offering products and services, and there is a decrease in their financial and economic performance, with direct negative effects in the formation of the national GDP. The effects can be even stronger in terms of lower wages in rural areas.
The above findings are confirmed in studies, such as [90,91] where the authors investigate the relationship between both ‘relative GDP share of rural areas’ and ‘relative GDP share of agriculture’ versus the agricultural employment rate. In [91], based on a sample of ten Central and Eastern European countries, the authors identify a strong relationship between the GDP share of agriculture and agricultural employment rates, their findings suggest that agriculture may play a role in lowering the prevailing rates of high unemployment. Moreover, in [90], the authors present the main aspects of the Common Agricultural Policy (CAP) emphasizing on the strong relation existing between agricultural employments and rural areas GDP share, inside the European Union (EU) member states
As presented in Figure A5, there are many European countries where rural environment added value (GVA_Rural) has a significant share in the overall country GVA, e.g., Austria, Germany, France, Italy, Portugal, Romania, Slovenia. In [92] the authors present the economic importance of the private rural sector, such as the contribution of businesses to Gross Domestic Product. Their findings reveal that, besides the contribution to the overall gross value added, the rural businesses are improving the local resilience in both direct and indirect ways. As direct contributions, the creation of local employment and local product and service delivery can be mentioned, while for indirect contributions, the added value of primary business activities can be mentioned.
The current research models the ‘GVA_Rural’ parameter through a random forest model with base model (BM) accuracy at 89.27%, random search accuracy (RS) at 92.02%, grid search accuracy (GS) at 92.28% and model validation (MV) accuracy at 65.07%. After performing the feature selection analysis, it was concluded that, according to their weights, the following five parameters were the most important (contributed the most) to the prediction of the Rural GVA indicator: 0.24 for EA, 0.23 for DP, 0.11 for CO2, 0.05 for GVA Agriculture, and 0.01 for Employment Agriculture.
As it can be observed, the GVA_Rural prediction depends mostly on EA, DP, CO2 and GVA Agriculture. According to the above weights, it can be stated that there is a connection between the CAP direct payments and the gross value added of the rural environments. The ‘DP’ indicator has second weight in the model, its contribution to the dependent variable prediction being significant. The ‘Ammonia Emissions’ parameter displays the higher weight in the model. This confirms that agriculture plays an important role in the development of rural communities as most of the ammonia emissions are related to agricultural activities. Besides ammonia emissions, it can be observed that CO2 agricultural emissions ranks third, enforcing the hypothesis that for the European rural environment development, agriculture plays a pivotal role. An intensive agriculture, which implies intensive fertilizer administration, will have a direct and instant effect on rural GVA, through improving agriculture productivity. However, considering Equation (3) and the above weights, it can be recommended to use direct payments to invest in upgrading the fertilizer technologies, since environmental sustainability will influence economic growth.
Rural areas are different in accordance with social and economic dimensions- they are not homogenous. Traditionally, the population density was used for the definitions of rural areas in Europe. As defined by EUROSTAT, rural areas are defined as those with a population density of less than 100 inhabitants/km2.
The EC communication entitled “The Future of Rural Society” (EC, 1998:28-29) identifies three standard problems of rural areas: the pressure of modern life, rural decline and job finding/rural employment and difficulty to reach very marginal areas. Additionally, according to their extent of integration within the national economy, rural areas are distinguished as: integrated rural areas—with a growing population, an employment basis in the secondary and tertiary sectors, but with farming as key activity of use of land, intermediate rural areas—relatively distant from urban centres, with a varying mix of primary and secondary sectors—larger scale farming operations and remote rural areas—with the lowest population densities, often the lowest incomes, and an older population, which depends heavily on agricultural employment.
The current research presents a set of three models describing the European rural employment rates based on the available dataset containing CAP related indicators. Thus, based on the dataset presented above, this study considered the following rural employment related parameters: (a) the overall European rural employment rate, (b) male rural employment rate, (c) women rural employment rate.
The following figure displays a six-year average of the aforementioned parameters for each of the European states:
As it can be noticed from Figure 5, the rural employment rate for women in rural environments is noticeably lower than the male one. Among all analyzed countries, Greece, Italy or Romania present the highest difference between the previous mentioned indicators. Additionally, it is easy to spot the countries with the highest rural employment rate, namely: Germany, Denmark, The Netherlands, Sweden, and the UK.
According to [93], 40 years ago throughout Europe, women in rural areas had fewer employment opportunities, compared to both men in rural areas and women in urban areas. These days it is still difficult to provide a picture of the average employment situation of rural women, as employment prospects differ enormously by age, marital status, family stage or country. However, currently, the rural employment gap between man and women narrowed a lot for several EU countries Denmark, Finland, Netherlands, and Sweden.
The present study aimed to formalize, based on the available dataset, the various existing perspectives on rural employment rates. Therefore, this research modelled all three rural employment rate parameters (RER, RER_M, RER_F) through a non-linear random forest approach, obtaining three different models with the following characteristics:
  • The overall ‘Rural employment rate’ (RER) model has the base (BM) accuracy at 97.79%, random search accuracy (RS) at 97.86%, grid search accuracy (GS) at 97.82% and model validation (MV) accuracy at 93.53%. The feature selection analysis shows that, according to their weights, the following five parameters were the most important for the prediction of the Rural_GVA indicator: 0.68 for DRP, 0.41 for GDP per capita in rural areas, 0.20 for RER_M, 0.18 for EA, and 0.14 for GVA_Agriculture.
  • The model describing RER_M parameter has a base model (BM) accuracy at 98.14%, random search accuracy (RS) at 97.87%, grid search accuracy (GS) at 97.84% and model validation (MV) accuracy at 97.65%. By performing the feature selection analysis, it was identified that, according to their weights, the following five parameters were the most important for the RER_M indicator prediction: 0.49 RER_F, 0.21 for GDP, 0.20 for DRP, 0.10 for GVA Rural, and 0.01 for CPEFP
  • The RER_F random forest model also displays very high accuracy metrics. Its base model (BM) accuracy is at 98.60%, random search accuracy (RS) at 98.68%, grid search accuracy (GS) at 98.60% and model validation (MV) accuracy at 96.01%. The most important parameters are described by the following weights: 0.68 for DRP, 0.41 for GDP capita in rural environment, 0.20 for RER_M, 0.18 for EA and 0.14 for GVA_Agriculture.
As it can be noticed, for the prediction of the overall rural employment rate all five parameters contributed significantly, especially the degree of rural poverty and GDP per capita in rural areas (as implied by the weights presented above). The description of the relationship between the degree of rural poverty and rural employment rate was found in [94], where the authors show that the rapid growth of the agricultural production and income of the farm employees is the dominant means of reducing rural poverty. Additionally, according to the authors, large-scale commercial farms show little impact on rural poverty reduction, as compared to areas dominated by small commercial farms that are providing jobs for more communities.
Regarding the male rural employment rate, the most significant parameters were the women employment rate and the rural gross domestic product per capita. This implies that a possible relation exists between the rural rate of employment for man and the rural rate of employment for women.
The women rural employment rate model parameters are all contributing significantly to predicting the dependent variable (RER_F), especially the degree of rural poverty and GDP per capita in rural environments. As the model implies, the value for the degree of rural poverty strongly influences the accuracy of the model when predicting women rural employment rate. In [95], the author, after having analyzed the change in trends of women’s unemployment and poverty rates, also noticed their interrelationship. Thus, there are studies [95,96] which clearly highlights that unemployment is one of the main reasons of poverty and the women unemployment rate in 28 countries of the EU, is lower than men; however, their risk of poverty rate outweighs the indicator of men.

3.3. The ‘Environmental’ Dimension

In the context of the current research, the ‘Environmental dimension was described by using three parameters: EA (ammonia emissions in agriculture), CO2 emissions, and Fertilizer_Use, based on studies, such as [97], which come from agriculture. Additionally, in [98], it is shown that agricultural activities represent the major CH3 source and effective methods are needed to quantify and mitigate these emissions, while also understanding the current factors that are influencing it. European progress in reducing agricultural emissions is not as high as needed to be [99,100], the air pollution is constantly increasing in countries with developed agricultural systems [101,102].
The current research emphasizes that the agricultural sector’s ammonia reduction methods seem to be directly affected by the agricultural investments, such as CAP direct payments. Based on the GAM model presented in Equation (4), these investments have a significant impact over the ammonia emissions Equation (4). Besides direct payments, predictors, such as employment in agriculture, and fertilizer use, gross value added of the rural environment or GDP display an interpretable influence over the agricultural emissions. For the current research, after applying the generalized additive model technique (GAM), the emissions in agriculture parameter were explained by the following model, where s represents the spline function associated to each predictor:
E A   ~   s D P + s D R P + s E m p l o y m e n t _ A g r i c u l t u r e + s F e r t i l i z e r _ U s e + s   G D P + s G V A + s G V A _ R u r a l + s L P A
Initially, the model was trained with all the available predictors, but the final version included only the significant ones (presented in Figure 6).
The model presented in Equation (4) is described by excellent accuracy metrics, having the adjusted R-sq at 98.20% and as it can be observed, the direct payments parameter fit values shows that higher values of DP have a clear negative influence over the ammonia emissions, after exceeding a specific threshold.
For lower direct payments, the influence over the emissions in agriculture is positive, but as the DP values increase, the effect over the EA starts to reverse. It could be assumed that when the direct payments are higher, they are also used for purchasing improved fertilizers. All the significant parameters have a positive effect over the EA, but with several peculiarities.
For example, the influence of the employment in agriculture parameter over EA presents a sudden increase for values between 10–20%. The GDP’s influence decreases for high GDP values (highly developed countries being able to use the latest agricultural research products), while for the gross value added by the rural sector or for the fertilizer use, the model presents a direct ascending trend of their positive influence over the EA. In [103], the authors emphasize that the mitigation of the greenhouse ammonia emissions from the agriculture would be possible if a technological path is followed.
Technology leads to a higher productivity and, as Figure 6 presents, even a small increase in agricultural labor productivity is associated with lower ammonia emissions, similar with the high LPA values.
The environmental dimension includes the CO2 emissions in the agriculture as several studies showed that agricultural activities are responsible for the overall increase of the CO2. In [104], the conversion of different ecosystem for the agricultural usage is presented as one of the causes for CO2 emissions.
Also, [105] shows that agricultural activities are sources of greenhouse gases, such as CO2, CH4, and N2O and reliable quantifications should be obtained to assess the necessary budgets for land-use management and for climate research.
Like ammonia emissions, there are parameters related to industrialization, agricultural research and development, rural development, specific policies, or know-how improvements that are influencing CO2 emissions. According to the current research, CO2 emissions were also modelled by using a generalized additive model.
Thus, Equation (5) displays the most significant predictors over the CO2 agricultural emissions.
C O 2   ~   s A R D + s D P + s E m p l o y m e n t _ A g r i c u l t u r e + s F e r t i l i z e r _ U s e + s   G D P + s G V A + s G V A _ A g r i c u l t u r e + s G V A _ R u r a l + s L P A + s R E R + s T F P
The above model has excellent accuracy metrics (adj R-sq: 97.45%); as it can be observed (Figure 7), each predictor has a positive influence on the dependent variable, meaning they all contribute to the overall increase of the CO2. However, their influence is not linear. The direct payments display a slow increase for low DP values and a stronger one for high DP values. The ‘fertilizer use’ and ‘gross value added’ parameters clearly influence the CO2 emissions, higher values also meaning higher CO2 values.
We can observe three cases (GDP, GVA_Agriculture, LPA) where the influence of predictors over CO2 emission decreases as their value increases. The high GDP values reflects the developed countries, which are continuously aiming to reduce greenhouse gas emissions, while high values of GVA_Agriculture and LPA reflect the countries with developed agricultural systems, which are using modern greener technologies to increase agricultural productivity.
Fertilizers are used extensively by farmers, still, their use is expensive and can harm the environment. In [106] the authors describe how the excessive use of fertilizer generated environmental degradation and how fertilizer use decreased with the increase of farm size, as in small farms a very low machinery level inhibited the precise fertilization.
In the current research, the fertilizer use modelling was performed by using a non-linear generalized additive model Equation (6), as the linear modelling displayed low accuracy metrics.
F e r t i l i z e r _ U s e ~   s E m p l o y m e n t _ A g r i c u l t u r e + s G V A _ A g r i c u l t u r e + s G V A _ R u r a l + s R E R + s   T F P + s   C r o p _ Y i e l d
The ‘Fertilizer Use’ prediction model showed high accuracy metrics, with an Adj-Rsq value of 0.97 and according to the determined feature significance, the most important parameters for explaining the variance of the ‘fertilizers use’ dependent variable are: Crop_Yield, Employment_Agriculture, GVA_Agriculture, GVA_Rural, rural employment rate, and total factor productivity.
As noticed from Figure 8, the total factor productivity (TFP) and the Crop_Yield predictors display the existence of a clear relation relative to the dependent variable ‘fertilizer use’.
TFP represents the ratio between output to inputs [107], more specifically the rate at which the ratio of outputs to inputs changes over time [108]. If the production changes came from a greater input use (e.g., more fertilizers, workers, and/or land), the change in TFP would be zero. Thus, as noticed in the above figure, a higher TFP value is related to lower values of fertilizer use. Higher TFP values are related to technical change and efficiency improvements, being usually associated with agricultural systems based on improved plant varieties, agricultural mechanics advances or irrigation expansion and less on quantity of used fertilizers. Therefore, if a country has higher total factor productivity values, it can yield higher output with the same resources. Market imperfections could induce an increase of flexibility related to agriculture products, determining the decrease of prices if market offer for agriculture products increase and, therefore, if productivity increases, as was reported in other study [64]. This hypothesis is also confirmed by other authors [109] who emphasize that constant effort to improve factor productivity by adopting new technologies might not induce positive results in terms of farms profitability. However, according to other authors [110], most agrarian transformation is focused on improving farming productivity and, on long-term, this will generate agriculture labor transfer through other economic sectors. Therefore, according to [111], high TFP generates low agriculture employment rate. Additionally, another study [112] concluded that moderate cuts in unconditional DP would have no negative farm employment effects, underlying the low impact of these payments on agriculture employment rate and revealing that economic developments outside agriculture have, via the general wage level, the most pronounced effect on labor use in the farm sector. However, since the impact of subsidies on agricultural employment rate was found to be either positive [113], mixed [112], or negative [114], a strong need for using prediction modeling techniques is needed to define specific frameworks for proper efficient DP inputs, considering indicators related to both direct and indirect effects.
In predicting the ‘Fertilizers Use’ value, the fit values of the spline function associated to ‘Crop Yield’ parameter shows that for most of the cases the crop yield increase is associated with an increase of the ‘fertilizer use’ parameter.
The importance of the ‘Fertilizer Use’—‘Crop Yield’ relation is presented in several studies. In [115], the authors show that the use of pesticides and mineral fertilizers has improved crop yields but also contaminated food and the environment. Based on their findings, the fertilizer and pesticide consumption continuously increased. Additionally, in [116], a research that estimates the role of agronomic inputs in cereal yield improvements, the authors clearly show the role of fertilizer and modern seeds in boosting yields. Besides crop yield and TFP, the ‘rural employment rate’ and the ‘employment in agriculture’ fit spline values describe a situation similar to the finding presented in [117], where the authors showed that farmers use less fertilizer in a context where the rural employment rate is high.

3.4. The ‘Macroeconomic’ Dimension

The ‘Macroeconomic’ dimension, presented in this study, is described by the following five parameters: GVA_Agriculture, Crop_Yield, TFP, GVA, LPA.
C.A.P. represents the starting point for the planned policies on climate and energy for 2030. Through them are imposed the objectives of reducing emissions by 40% by 2030 and the participation of the agricultural sector in achieving this goal.
Increasing the added value in agriculture, at national and regional level, together with the transition to a sustainable agricultural production, are aspects intensely monitored by the European Union. European funds are dedicated, managed by the Agency for Financing Rural Investments Institution, subordinated to the Ministry of Agriculture and Rural Development (MADR), with attributions of technical and financial implementation of the National Rural Development Program 2014–2020.
According to [118], in Europe, agriculture accounts for more than 47% of the total territory. In recent decades, European agriculture has experienced a continuous decrease in the number of farms, while the farm size shows a tendency towards larger holdings. According to the authors, a similar tendency was observed also for the share of agriculture in the gross value added (GVA), but with some differences between European regions, see Figure A5.
The growth of the gross value added in agriculture (GVA Agriculture) depends largely on a country’s level of development and it is correlated with various exogenous factors. A stable economic environment, for example, which favors the investments, will create the premises for the development of the agricultural sector. In other words, the conditions in which the existing economic agents in agriculture carry out their activity in stable conditions from an economic, political, legislative point of view, they can participate in increasing of the added value at national level.
The current research identifies an accurate multiple linear regression model (S-value 0.45, adjusted R-Sq 85.44%, predicted R-sq 85.09%) for determining the gross value added of the agricultural sector, see Figure A7.
L n G V A   A g r i c = 4.22 + 0.32   L n D P + 0.14 L n L P A + 0.54   L n E A
An important aspect, revealed by the above model, is the importance of the direct payment’s indicator, that is a 1% increase of the direct payments would lead to a 0.32% increase of the GVA agriculture.
As the above model shows, the GVA_Agriculture parameter is positively influenced by the rest of the variables included in the model: Labour Productivity in Agriculture (LPA) by 0.14% and the Ammonia emissions from agriculture by 0.54%.
In [118], the authors investigate the factors that govern the differential performance of agriculture across the EU-27 countries by using a machine learning technique, respectively k-means clustering method, identifying a strong relation between agricultural labour productivity, specifically agricultural investments, and the economic performance of the sector.
The development of agriculture sector implies high production intensities, possibly due to extra-investments, as direct payments, made for both improving labour productivity and increasing the use of fertilizers, which will promote growth.
The overall Gross Value-Added parameter can be expressed through a multiple linear regression model with very good accuracy (S value 0.20, adjusted R-sq 97.83%, predicted R-sq 97.67%)—see Equation (8).
L n G V A = 4.37 + 0.09   L n C O 2 + 0.10 L n G V A   r u r a l + 0.31   L n F e r t i l i z e r   U s e 0.35 L n E m p l o y m e n t   A g r i c u l t u r e + 0.67 L n   G V A   A g r i c u l t u r e + 0.17 L n D P + 1.30 L n R E R
According to the above model, the overall gross value added is positively influenced by several parameters. According to these, an increase in the rural employment rate, or agricultural CO2 emissions, or agricultural gross value added or fertilizer use or direct payments would lead to an increase of the gross value-added indicator. The only negative influence comes from the ‘Employment in Agriculture’ parameter, that could be explained by a significant agricultural employee reconversion to services or industrial field. This explanation is sustained also in [119], where the authors identifies that the number of agricultural co-operatives has been continuously decreasing, even they are still providing job opportunities.
The labor productivity is determined by comparing the labor cost with the total labor efficiency. In agriculture, labor productivity (LPA) refers to the number of units of output(s) produced per unit of labor used in the process of production. It is an indicator that is determined by dividing the quantity of output by the total units of labor used. Assessing the quantity of labor input involves the number of active workers and the number of time units (hours, days, and months) worked.
According to [120] agricultural labour productivity plays an important role in defining the competitiveness of the sector, contributing to the overall economic development. Thus, the authors examine labour productivity differences in the Greek agricultural sector, by analysing the factors that determine the productivity of the sector and investigating the relation between productivity and its determinant factors, determining two basic factor categories: (a) factors linked to the structure of the agricultural enterprises—used productive (human and fixed) capital, technology, productive dynamism, scale returns, (b) factors related to the land and the natural environment.
In [121], the authors position LPA as one of the methods that could be applied by farmers to profit from the economic growth in their society: (a) increase the yields of their crops and animals, (b) switch to the production of high value products (for which there is an increasing demand in the market), (c) increase the labour productivity on their farm, and (d) find non-farm sources of income for some or all of their family members. Increasing the LPA is also important as many farmers are aware that with the increasing average income in their country the proportion of the labour force, which can find employment in agriculture decreases.
Also, as presented in [122] high and unstable food and agricultural commodity prices, environmental constraints and concerns about population growth pushed the agriculture production on national policy and research agendas that requires relevant and accessible monitoring indicators—LPA being one of them.
The current study presents a linear modelling describing the LPA parameter. The model Equation (8) has very good accuracy metrics having S value at 0.24, adjusted R-sq 92.67% and predicted R-sq at 92.01%.
L n L P A = 1.89 + 0.14   L n G V A 0.06   L n G V A   r u r a l + 0.86   L n A F I + 0.59 L n R E R
According to the above equation the Agricultural Factor Income parameter displays a strong influence, as a 1% AFI increase will lead to a 0.86% LPA increase, with the rest of the parameters remaining constant. Additionally, the rural employment rate displays a strong positive influence as the value of LPA will increase by 0.59% for a 1% RER increase.
The relationship between remuneration and productivity of labor factor (at the level of the agricultural sector) is presented in detail, both analytically and empirically by [123]. Based on an optimal solution of the income maximization problem, they derived the determinants of remuneration of the labor factor, i.e., labor productivity and subsidies.
However, considering Equation (8), it can be stated that the GVA influence positively the labor productivity in agriculture, compared to GVA Agriculture. Since agriculture is considered a complex sector, this implies the establishment of a series of direct relations with other sectors. Therefore, the labor productivity is mostly related to technological factors, but can be also influenced by the technical progress of the economy. Therefore, using automatized equipment for technological operations related to different crop production stages can involve low number of employees and, respectively high labor productivity. However, the automatization of agriculture production cycle may raise problems related to rural employment and, therefore, may affect rural GVA. This can explain the negative relation between LPA and rural GVA, revealed by Equation (8).
The crop yield represents a measurement of the amount of agricultural production harvested—yield of a crop—per unit of land area, being an important parameter to understand because it helps optimizing food security programs, also explaining cost variation for different years.
Modelling crop yield and crop production is necessary at EU and member state level for providing to CAP decision makers timely information needed for a rapid decision-making during the growing season. Furthermore, estimating the crop production is useful in relation to trade, development policies and humanitarian assistance linked to food security.
In [124], the authors also propose an analytical framework allowing estimation of the effects of the new instrumentation of the CAP in the arable crop sector—crop yield, price cuts and compensatory payments.
The ‘Crop_Yield’ parameter could not be modelled with linear modelling techniques and a random forest model was used.
The model describing Crop_Yield parameter has a base model (BM) accuracy at 92.20%, random search accuracy (RS) at 91.80%, grid search accuracy (GS) at 91.85% and model validation (MV) accuracy at 88.14%. By performing the feature selection analysis, it was identified that, according to their weights, the following parameters were the most important for the Crop_Yield prediction: 0.36 LPA, 0.30 Fertilizer Use, 0.12 for GVA_rural, 0.04 for DRP, and 0.02 for CO2. However, crop production is not related to DP. This is also reported by other authors that emphasized the low efficiency of CAP support regulations about the financial flow volume, pointing out the impact of relatively high support for large holdings on discouraging the boosting of productivity [125]. There are studies [126] which characterize subsidies as both negative and positive mechanisms for improving agriculture farm behavior, as they can encourage farmers to expand production through activities that they would consider too risky in the absence of guaranteed income from DP, while, at the same time, as negative aspect, farms production structures distortions can appear. However, since farm overinvesting can be considered a risk of DP, an entrepreneurial training for transferring know-how within facilities owners must be considered to be integrated in future CAP.
According to the above weights, the most important factors in determining crop yield value are LPA and the fertilizer use. Both factors contribute most to the model accuracy, this current study confirming previous models. Such an example can be found in [127], where the authors apply a different machine learning approach, that is a multilevel logistic regression for investigating the labor productivity of the six main agricultural systems across European NUTS2 regions, mentioning the strong relationship existing between LPA and crop yield. In [128] the authors emphasize that the benefits of nutrient inputs are often minimized in discussions of potential risk. The purpose of their research was to examine existing data and approximate the effects of nutrient inputs, specifically from commercial fertilizers, on crop yield. A total of 362 seasons of crop production were included in the long-term study evaluations and the results of their investigation indicated that at least 30 to 50% of crop yield is attributable to commercial fertilizer nutrient inputs.
Hence, intensive agriculture, associated to high values of crop yield, will generate high LPA, due to a high automatization degree of all processes included in crop culture technology and, also, will involve the use of considerable quantity of fertilizers, a situation confirmed by the above model feature importance weights.
Agricultural productivity is a necessary condition for the sector development as well as for the economy. Because partial productivity does not fully reflect whether the growth is due to an increased efficiency or due to the technological improvements, the ‘Total Factor Productivity’ (TFP) indicator can be used to measure the net growth of output per unit of total inputs.
Due to its importance for the sector, TFP indicator is debated in the subject. Thus, in the CAP context, the issue was to determine if direct payments impact in any way the TFP value. In [129], after a three-stage spatial analysis, the authors identified that the productivity of resources in agriculture is not affected by the total amount of subsidies; but rather, by their structure. Their analysis confirmed that the qualitative predictor “structure of the agricultural support”, is a significant factor influencing the capital productivity y over the whole of the period.
Also, in [130], the authors investigate the impact of the Common Agricultural Policy (CAP) subsidies on farm total factor productivity (TFP) in the European Union (EU). Based on a structural semi-parametric estimation algorithm directly incorporating the effect of subsidies into a model of unobserved productivity, their research showed that the subsidies negatively impacted farm productivity in the period before the decoupling reform and only later the effect of subsidies on productivity turned positive.
Based on the current research available data, the TFP parameter could not be linearly modelled; still, by using a random forest approach, the resulting model showed very good accuracies: base model accuracy at 98.02%, RS accuracy at 98.15%, GS accuracy at 98.15%, and model validation at 93.93%.
The feature selection analysis identified that, according to their weights, the following parameters were the most important for the TFP parameter prediction: 0.39 Crop_Yield, 0.36 ARD, 0.15 Fertilizer_Use, 0.08 DP and 0.05 RER_M.
In the current research, based on the identified weights, the direct payments have a four rank, according to its importance in predicting the TFP value, so its relationship with TFP should not be disregarded. However, what should be emphasized is its ARD importance in predicting TFP. Thus, funding the agricultural research and development clearly influences TFP value.
Investments in proper ARD activities will improve the technical and technological performance of agriculture sector and, if fertilizers quantity will increase, a higher production and TFP, respectively, will be assured.

3.5. The ‘Financial’ Dimension

For the present study, the ‘Financial’ dimension is described by the following four parameters: AFI, DP, AEI, ARD.
The ‘Agricultural Factor Income’ (AFI) parameter is a partial labor productivity measure of the agricultural sector. Agricultural factor income measures the income generated by farming, which is used to remunerate borrowed or rented factors of production (capital, wages and land rents) as well as own production factors (own labor, capital and land).
In previous research, [131] analyzed the relationship between farm income and influential factors. Results indicate that prices paid (PP) and received by farmers, technological change, interest, and exchange rates (ERs), gross domestic product (GDP) and land prices all influence farm income.
In the current study, AFI was modelled using a multiple linear regression model, described by the following equation:
L n A F I = 2.30 0.18   L n E m p l o y m e n t   A g r i c u l t u r e + 0.47 L n L P A + 0.30   L n G D P + 0.18 L n A E I
The four parameters composing the model were able to describe more than 90% of the AFI variation. The model adjusted R-sq is at 94.83%, the predicted R-sq is 93.96%, and the S value 0.17. According to the above equation, the ‘Labor Productivity in Agriculture’ parameter displays a strong influence, as a 1% LPA increase will lead to a 0.47% AFI increase, with the rest of the parameters remaining constant. Similarly, with [131], the above model shows that the ‘GDP per capita in rural areas’ displays a strong positive influence, as the value of AFI will increase by 0.30% for a 1% GDP increase.
One of the most important indicators of the CAP program is ‘Direct Payments’ (DP). In their study [132], the authors state that Europe’s Common Agricultural Policy has constantly evolved since its foundation in the 1960s and one of its principal components is the Pillar 1 Direct Payments. The CAP supports the development of the agricultural sector through a combination of direct payments to farmers, financial assistance towards investments in rural development, and environmental protection measures. Direct payments are meant to support EU farmers, being the main component of the Common Agricultural Policy (CAP) and receiving special attention during the new 2014–2020 reform. First introduced in 1992, after the MacSharry reform, these payments (payments per hectare and animal head) were designed to support farmers income and, moreover, to support a faster transition towards green agriculture [133]. In another example [134], the authors examine the role of the governments and direct payments for the financial efficiency of the organic farms, their results showing that direct payments play an important role in the financial viability of organic farms in both Western and Eastern European countries. Additionally, in [135], it is shown that coupled and decoupled direct payments have different impacts on agricultural rental values, because of the different production responses associated with these payments, influencing the agricultural entrepreneurial income.
Still, there are also contradictory opinions about the role of direct payments. Many economists agree that direct payments are a needed basic income support for farmers, while others consider that the direct payments should provide a compensation for the public goods farmers deliver [130]. At the same time, some economists affirm that there is no need to apply direct payments for farmers, as we should not distinguish the agricultural sector from other economic sectors [136,137].
Based on the previously described dataset, the current research modelled the DP parameter through a linear approach, by using the best subset selection technique to choose the set of independent variables. As such, the equation describing the direct payments is as follows:
L n D P = 9.32 + 0.63   L n G V A + 0.24 L n C O 2 + 0.14   L n G V A   r u r a l + 0.41 L n E m p l o y m e n t   A g r i c u l t u r e + 0.50 L n   A F I 2.87 L n R E R 0.55 L n G D P 0.17 L n C P E F P
According to another study [138], the DP can also generate negative factors, with regards to rural sustainability. Therefore, significant financial influx into unprepared economies, decreasing the diversification of crops or causing a deficit of skilled labor forces (generated by DP inputs) may generate lower RER, CPEFP, and, eventually, cause a decrease of GDP.
The DP may promote agriculture as the main employer in rural regions. According to [138], this may create an illicit assumption of better employment possibilities and create competitiveness. As a result, the unemployment rate will increase, since many workers with agricultural studies will compete for a low number of working places and a deficit of specialists in other sectors (complementary to agriculture).
The linear regression model presented above shows excellent accuracy metrics, having the S-value at 0.34, adjusted R-sq at 93.18%, and predicted R-sq at 92.62%. The result of the analysis shows that the DP value increases with the increase of gross value added (0.63%), CO2 emissions from agriculture (0.24%), gross value added in rural areas (0.14%), agriculture employment rate (0.41%), and agricultural factor income per annual work unit in real terms (0.50%).
The ‘Agricultural Entrepreneurial Income’ (AEI) includes income from agriculture and forestry, as well as various subsidies and compensations, such as agricultural subsidies, European Union agricultural aid, and compensation for harvest losses.
Modelling the AEI parameter could reveal which are the most influencing parameters for predicting its value and to what extent their influence goes. The AEI indicator is an important metric found in the larger ‘agricultural entrepreneurship’ field and, as this study [139] shows, even though the mainstream entrepreneurship research had neglected the agricultural sector, this scenario seems to have changed in the previous years, with three main approaches being identified, each of which consider AEI an important indicator: entrepreneurial skills and behavior, entrepreneurial strategies and community, and entrepreneurial activity.
The current research describes the Agricultural Entrepreneurial Income (AEI) parameter, based on a linear approach, after applying the best subset selection technique. The following linear model was identified for describing AEI:
L n A E I = 6.17 + 0.09   L n G V A   A g r i c u l t u r e + 0.95 L n A F I + 1.34   L n T F P
The AEI model presented above has good accuracy (S value 0.45, adjusted R-sq 74.52%, and predicted R-sq 73.45 %), the independent variables manage to explain more than 70% of the AEI variation. As it can be observed from Equation (11), AEI can be determined by using only three independent variables: GVA_Agriculture, AFI, and TFP. Noticeably, the most prominent variables are the AFI and TFP. This model shows that for a 1% percent increase of the ‘Total Factor Productivity’ indicator, the AEI value will increase by 1.34%. According to [140,141], total factor productivity (TFP) has become the choice measure of productivity, as it is calculated by dividing the total production by the weighted average of inputs, i.e., labor and capital, representing the growth in real output, which is more than the growth in inputs such as labor and capital. Therefore, a higher economic efficiency formally leads to higher entrepreneurial income.
The ‘Agricultural Research and Development’ funding importance is emphasized in different research studies. For example, in [142], the authors describe the agricultural innovation platforms as ‘promising vehicles to foster a paradigm shift in agricultural research for development’. Digital farming, presented by [143], also represents a mandatory direction for the modern agricultural systems and technologies, such as sensors, robotics, and data analysis, which aid in moving from tedious operations to continuously automated processes. The authors highlighted task planning algorithms, digitalization, sensor optimization, multi-robots, human-robot collaboration, environment reconstruction (from aerial images), and ground-based sensors for the creation of virtual farms (facing challenges in the context of digital farming).
The ARD parameter could not be modelled by using a linear approach; nevertheless, the random forest approach showed a good model accuracy: BM accuracy at 86.01%, RS accuracy at 81.52%, GS accuracy at 82.18%, and model validation at 70.18%.
The feature selection analysis identified that, according to their weights, the following parameters were the most important for the ARD parameter prediction: 0.23 GVA, 0.16 DP, 0.15 GVA_Agriculture, 0.06 CO2, and 0.03 RER_M.
Based on these numbers, it can be noticed that direct payments, GVA, and agricultural gross value added are important parameters in ARD prediction. As such, it seems that direct payments overall managed to influence the dynamics of ARD investments. This would make sense, especially these days, when, as the authors in [144] showed, the agricultural R&D is critical for ensuring sufficient food for the world in the coming decades.
Therefore, although CAP uses instruments that cover the full range of policy options, from information provision to regulation and economic incentives [145], there are issues regarding policy implementation, which led to agriculture’s low economic, environmental, and social sustainability [146,147]. Therefore, the information provided by the prediction models, emphasized within the current study, are performed to minimize the criticisms revealed in previous research [148], related to the low incapacity of CAP to effectively monitor and enforce its policies by using specific indicators or methods. The use of prediction models techniques must also be supported by implementing digital technologies, which can offer real-time data, in order to support the modeling process. The synergic approach of applying digitalization and modeling technique assures the ability to measure the policy inputs and outcomes, facilitate the process of considering specific times and location variables (when evaluating CAP), and, according to other studies [145], reduce information asymmetry and transaction costs of policy implementation. Therefore, by using prediction model instruments, CAP could develop frameworks that consider local, in addition to farm-level, variables. According to [149], the challenge for policy makers and practitioners at different administrative levels is to appreciate the heterogeneous territorial context in the EU, as well as to obtain inspiration for including an explicit spatial dimension in further policy development. Moreover, other authors [150] consider that subsidies should be aligned as much as possible with the characteristics of the local agricultural sector, in order to maximize the outcomes of intervention mechanisms.
Therefore, the synergic use of digitalization [145] and machine learning techniques will enhance the ability to establish strong relationships between CAP-specific input and outcome indicators, based on dataset collection, and offer the possibility to consider specific attributes, such as location, in evaluation policy dimension. This will assure the possibility of adjusting policy variables, such as DP or regulated quantities, and offering flexibility control. Therefore, this will be useful for the CAP to manage, assuring a degree of equilibrium among the agriculture farms. There are authors [151] that emphasize that the EU subsidies on incomes gained by farms was diverse—the strongest units benefited the most, even though they are the least vulnerable to market fluctuations.
According to other authors [h], modeling outcomes increases feasibility, depending on the degree of digitalization and, therefore, the indicators of measurement costs.
The use of prediction model targets can also find effectiveness mechanisms, in order to change the economic situation of farms in new EU members, considering that, according to several studies [150,152], the CAP instruments were more frequently the cause of these gaps between countries. This was possible considering that member countries, according to [150], differ in how long they have been covered by CAP instruments (depending on when they joined the EU). Thus, prediction models could assure the improvement of fixed assets in agriculture farms, due to better agriculture investment policies [x18], which also contributes to the renewal and development of technical facilities production assets. However, in order to maintain the prediction precision of the models, considering that agriculture investments are characterized by the long-term rate of return and the outcomes of investments generally come to light with different delays [150], it is required for the data used in the prediction models to cover a longer period of time. Therefore, the application of the methodology developed in the research at hand (also considering the following CAP implemented period) will increase models’ accuracy and improve the analytical capacity of current developed framework.
Future research related to prediction models based on CAP indicators should also consider the marketing dimension, since the mechanisms that imply DP reported to sold products volume could be a solution of increasing the sustainability and efficiency of agriculture. Thus, this will better manage the possible CAP issues related to farm’s overproduction [153] and rural region resilience [154,155].

4. Conclusions

The current study presented models, both linear and random forest, characterized by very good to excellent performance metrics (adjusted and predicted R-sq, as well as accuracy). Thus, when predicting previously unseen data, the result was conclusive, accurately describing reality. Since there is not enough published data for the 2014–2020 period, the current research results were based on the data acquired for a six-year period between 2007–2013. In this context, future iterations of this research should consider retraining the framework models based on an extended dataset, in order to enhance their predictive strength. Otherwise, current models can easily be used to assess insights over the dataset parameters, in terms of parameter relations and importance.
The models emphasize that EA and CO2 dynamics will indicate the sustainable use of DP and, therefore, directly influence the DRP. A restrictive CAP regulation which, therefore, limits the use of DP solely for technical investments and discourages the excessive use of fertilizers is recommended. Furthermore, the management of ARD activity should target both environmental and economic sustainable development.
Similar to most studies, the design of the current study is subject to limitations. Therefore, firstly, there are limitations in terms of dataset components used for testing and validation of predictive models. The numbers of 22 parameters introduced into the prediction models were selected from a multitude of parameters that were directly or indirectly interacting with CAP. Therefore, since CAP has wide, multidimensional implications, in order to establish highly complex prediction models, considering a larger number of indicators is recommended. However, this could imply a decrease of the accuracy of prediction models. Although the dataset used in the present study had a reduced number of indicators, the resulted prediction models had a high accuracy and could easily be used for monitoring CAP impact and establishing frameworks for fast intervention measurements, if necessary.
The second limitation is related to location specificity. Thus, the results are unifying models, developed for the EU. As presented previously, it is important to consider the CAP impact, in relation to the specificity of each area. However, the developed models can be extended, and the methodology can be used for the elaboration of regional models. The models developed in current paper intend to have a general specificity, since they are categorized as fast tools and monitoring methods that were based on prediction.

Author Contributions

Conceptualization, D.S.C., S.M.P., L.M.M. and A.P.M.; data curation, S.M.P., A.A.M. and D.S.C.; formal analysis, D.S.C., S.M.P. and I.A.S.; investigation, D.S.C., S.M.P. and A.A.M.; methodology, D.S.C., S.M.P. and S.R.; project administration, S.M.P., L.M.M., A.P.M. and S.R.; software, D.S.C., S.M.P. and I.A.S.; supervision, D.S.C., S.M.P. and A.A.M.; visualization, D.S.C., S.M.P. and A.A.M.; writing—original draft, D.S.C., S.M.P., S.R. and I.A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNCS/CCCDI-UEFISCDI, project PN-III-P2-2.1-PTE-2019-0697, within PNCDI III.

Institutional Review Board Statement

This article does not contain any studies with human participants or laboratory animals performed by any of the authors.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This work was supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNCS/CCCDI-UEFISCDI, project PN-III-P2-2.1-PTE-2019-0697, within PNCDI III. This paper was supported by the project CNFIS-FDI-2021-0443. Active measures to increase and streamline the capacity for research, development, innovation, and technology transfer at the “Dunărea de Jos” University of Galați—CEREX-UDJG_2021.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses or interpretation of data; in the writing of the manuscript or in the decision to publish the results.

Appendix A. Random Forest Models—Features’ Weights

Table A1. ARD.
Table A1. ARD.
WeightFeature
0.38 ± 0.17GVA
0.23 ± 0.06GVA_Agriculture
0.06 ± 0.03DP
0.03 ± 0.00CO2
0.02 ± 0.01RER_M
0.02 ± 0GDP
0.01 ± 0.01EA
0.01 ± 0LPA
0.01 ± 0RER_F
0.01 ± 0.00AEI
Table A2. Rural GVA.
Table A2. Rural GVA.
WeightFeature
0.25 ± 0.10DP
0.24 ± 0.07EA
0.16 ± 0.02CO2
0.01 ± 0.01Employment_Agriculture
0.01 ± 0.00GVA_Agriculture
0.01 ± 0.00GVA
0.01 ± 0.00Crop_Yield
0.01 ± 0.00LPA
0.01 ± 0.00GDP
0.01 ± 0.00RER_F
Table A3. TFP.
Table A3. TFP.
WeightFeature
0.33 ± 0.12Crop_Yield
0.21 ± 0.05ARD
0.04 ± 0.01Fertilizer_Use
0.03 ± 0.02DP
0.01 ± 0.01RER_M
0.01 ± 0.02GVA_Rural
0.01 ± 0.01CO2
0.01 ± 0.01GVA_Agriculture
0.01 ± 0.00GDP
0.01 ± 0.00GVA
Table A4. RER_F.
Table A4. RER_F.
WeightFeature
0.49 ± 0.21DRP
0.44 ± 0.13GDP
0.24 ± 0.04RER_M
0.09 ± 0.03EA
0.04 ± 0.03GVA_Agriculture
0.03 ± 0.01Employment_Agriculture
0.02 ± 0.00GVA_Rural
0.02 ± 0.01GVA
0.02 ± 0.01Crop_Yield
0.01 ± 0.01DP
Table A5. RER_M.
Table A5. RER_M.
WeightFeature
0.63 ± 0.19RER_F
0.15 ± 0.08DRP
0.05 ± 0.02GDP
0.02 ± 0.01GVA_Rural
0.02 ± 0.01CPEFP
0.01 ± 0.01TFP
0.01 ± 0.00DP
0.01 ± 0.00EA
0.01 ± 0.00GVA_Agriculture
0.01 ± 0.00CO2
Table A6. Crop Yield.
Table A6. Crop Yield.
WeightFeature
0.32 ± 0.06Fertilizer_Use
0.25 ± 0.04LPA
0.04 ± 0.02GVA_Rural
0.01 ± 0.01DRP
0.01 ± 0.00CO2
0.01 ± 0.00Employment_Agriculture
0.01 ± 0.01GDP
0.01 ± 0.01RER_F
0.00 ± 0.01GVA
0.00 ± 0.01EA
Table A7. RER.
Table A7. RER.
WeightFeature
1.16 ± 0.28DRP
0.09 ± 0.06DP
0.07 ± 0.04GDP
0.03 ± 0.03Employment_Agriculture
0.01 ± 0.01LPA
0.01 ± 0.01Crop_Yield
0.01 ± 0.00CO2
0.01 ± 0.01GVA
0.00 ± 0.01GVA_Agriculture
0.00 ± 0.01Fertilizer_Use

Appendix B. Multiple Linear Regression Models—Predictors Coefficients, p-Value, VIF

Table A8. DRP Model Coefficients.
Table A8. DRP Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant9.070.8410.820.00
LN_EmploymentAgriculture0.230.037.090.001.63
LN_GVA_Agriculture−0.130.04−3.240.006.99
LN_DP0.080.042.120.047.46
LN_GDP−0.570.08−7.550.002.79
LN_ARD0.130.033.830.007.15
LN_RER−1.110.22−5.150.002.06
Table A9. DRP Model Coefficients.
Table A9. DRP Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant7.181.295.560.00
LN_Fertilizer_Use0.080.032.570.011.63
LN_EmploymentAgriculture0.180.036.250.002.48
LN_CPEFP−0.180.03−5.600.001.43
LN_LPA0.180.035.500.004.65
LN_EA−0.130.03−4.450.005.83
LN_ARD0.160.035.930.007.89
LN_TFP−0.880.26−3.440.001.53
LN_DRP−0.370.05−7.340.002.38
Table A10. EA Model Coefficients.
Table A10. EA Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant1.731.231.410.16
LN_Crop_Yield0.470.123.920.002.08
LN_DP0.460.0410.310.004.46
LN_AFI−0.600.12−4.780.009.44
LN_LPA0.250.112.180.038.79
LN_ARD0.490.0510.480.005.36
LN_DRP−0.250.12−2.110.043.08
LN_GDP−0.530.14−3.720.004.07
Table A11. CO2 Model Coefficients.
Table A11. CO2 Model Coefficients.
TermCoefficientSE CoefT-Valuep-ValueVIF
Constant−21.353.39−6.300.00
LN_DP0.170.062.630.017.06
LN_ARD0.210.063.410.007.24
LN_TFP2.830.604.720.001.46
LN_DRP0.530.124.400.002.38
LN_EA0.460.095.140.009.72
LN_RER2.500.425.930.002.46
LN_Fertilizer_Use0.310.074.200.001.50
Table A12. GVA Agriculture.
Table A12. GVA Agriculture.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant−4.220.45−9.300.00
LN_DP0.330.093.800.007.33
LN_LPA0.150.062.530.011.47
LN_EA0.550.105.700.006.50
Table A13. GVA Model Coefficients.
Table A13. GVA Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant−4.380.97−4.520.00
LN_CO20.090.042.240.035.37
LN_GVA_Rural0.100.024.370.002.82
LN_Fertilizer_Use0.310.056.890.001.47
LN_EmploymentAgriculture−0.350.03−10.400.001.47
LN_GVA_Agriculture0.670.0415.180.006.81
LN_DP0.180.053.510.0011.68
LN_RER1.300.225.810.001.82
Table A14. LPA Model Coefficients.
Table A14. LPA Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant−1.890.82−2.310.02
LN_GVA0.150.035.030.003.28
LN_GVA_Rural−0.070.02−2.820.012.08
LN_AFI0.870.0519.010.002.62
LN_RER0.600.232.550.011.52
(13)
Table A15. AFI Model Coefficients.
Table A15. AFI Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant2.300.317.400.00
LN_Employment_Agriculture−0.190.04−5.390.002.17
LN_LPA0.480.059.800.006.18
LN_GDP0.300.074.010.003.04
LN_AEI0.190.035.810.002.96
Table A16. AEI Model Coefficients.
Table A16. AEI Model Coefficients.
TermCoefficientSE CoefT-Valuep-ValueVIF
Constant−6.173.64−1.700.09
LN_GVA_Agriculture0.090.052.040.041.54
LN_AFI0.950.0713.060.001.75
LN_TFP1.350.721.860.071.17
Table A17. GVA Model Coefficients.
Table A17. GVA Model Coefficients.
TermCoefficientSE CoefficientT-Valuep-ValueVIF
Constant9.321.466.390.00
LN_GVA0.640.079.180.009.33
LN_CO20.250.073.760.004.86
LN_GVA_Rural0.150.043.800.003.05
LN_Employment_Agriculture0.420.085.220.003.07
LN_AFI0.500.114.530.007.13
LN_RER−2.870.37−7.820.001.81
LN_GDP−0.550.20−2.820.015.68
LN_CPEFP−0.160.09−1.850.071.71

Appendix C. Choropleth Maps

Figure A1. The average value of GDP per capita in rural areas.
Figure A1. The average value of GDP per capita in rural areas.
Agronomy 11 02105 g0a1
Figure A2. The average % of the GVA of the agriculture sector, in relation to total GVA.
Figure A2. The average % of the GVA of the agriculture sector, in relation to total GVA.
Agronomy 11 02105 g0a2
Figure A3. The average % of the GVA registered for rural areas, in relation to total GVA.
Figure A3. The average % of the GVA registered for rural areas, in relation to total GVA.
Agronomy 11 02105 g0a3
Figure A4. The average cereal crop yield.
Figure A4. The average cereal crop yield.
Agronomy 11 02105 g0a4
Figure A5. The average TFP in the agricultural sector.
Figure A5. The average TFP in the agricultural sector.
Agronomy 11 02105 g0a5
Figure A6. The average agricultural employment rate.
Figure A6. The average agricultural employment rate.
Agronomy 11 02105 g0a6
Figure A7. The average rural employment rate.
Figure A7. The average rural employment rate.
Agronomy 11 02105 g0a7
Figure A8. The average labor productivity in the agricultural sector.
Figure A8. The average labor productivity in the agricultural sector.
Agronomy 11 02105 g0a8
Figure A9. The average quantity of fertilizers used in the agricultural sector.
Figure A9. The average quantity of fertilizers used in the agricultural sector.
Agronomy 11 02105 g0a9
Figure A10. The average amount of ammonia (NH3) emissions, as a result of the agricultural production.
Figure A10. The average amount of ammonia (NH3) emissions, as a result of the agricultural production.
Agronomy 11 02105 g0a10
Figure A11. The average amount of CO2 emissions, as a result of the agricultural production.
Figure A11. The average amount of CO2 emissions, as a result of the agricultural production.
Agronomy 11 02105 g0a11
Figure A12. The average value of direct payments.
Figure A12. The average value of direct payments.
Agronomy 11 02105 g0a12
Figure A13. The average value of AEI.
Figure A13. The average value of AEI.
Agronomy 11 02105 g0a13
Figure A14. The average percentage of the degree of rural poverty.
Figure A14. The average percentage of the degree of rural poverty.
Agronomy 11 02105 g0a14
Figure A15. The average value of the consumer price evolution of food products.
Figure A15. The average value of the consumer price evolution of food products.
Agronomy 11 02105 g0a15
Figure A16. The average % of rural employment rate for men.
Figure A16. The average % of rural employment rate for men.
Agronomy 11 02105 g0a16
Figure A17. The average % of the rural employment rate for women.
Figure A17. The average % of the rural employment rate for women.
Agronomy 11 02105 g0a17
Figure A18. The average value of the agricultural research and development activities.
Figure A18. The average value of the agricultural research and development activities.
Agronomy 11 02105 g0a18

References

  1. Petrovici, D.A.; Gorton, M. An Evaluation of the Importance of Subsistence Food Production for Assessments of Poverty and Policy Targeting: Evidence from Romania. Food Policy 2005, 30, 205–223. [Google Scholar] [CrossRef]
  2. Roederer-Rynning, C.; Roederer-Rynning, C. The Common Agricultural Policy: A Case of Embedded Liberalism. In Oxford Research Encyclopedia of Politics; Oxford University Press: Oxford, UK, 2019. [Google Scholar]
  3. Kiryluk-Dryjska, E.; Baer-Nawrocka, A. Reforms of the Common Agricultural Policy of the EU: Expected Results and Their Social Acceptance. J. Policy Modeling 2019, 41, 607–622. [Google Scholar] [CrossRef]
  4. Buckwell, A.E.; Harvey, D.R.; Thomson, K.J.; Parton, K.A. The Costs of the Common Agricultural Policy; Routledge: London, UK, 2019; ISBN 9781000681628. [Google Scholar]
  5. Overmars, K.P.; Helming, J.; van Zeijts, H.; Jansson, T.; Terluin, I. A Modelling Approach for the Assessment of the Effects of Common Agricultural Policy Measures on Farmland Biodiversity in the EU27. J. Environ. Manag. 2013, 126, 132–141. [Google Scholar] [CrossRef] [PubMed]
  6. Lazíková, J.; Bandlerová, A.; Rumanovská, L.; Takáč, I.; Lazíková, Z. Crop Diversity and Common Agricultural Policy-The Case of Slovakia. Sustainability 2019, 11, 1416. [Google Scholar] [CrossRef] [Green Version]
  7. Thomas, F.; Midler, E.; Lefebvre, M.; Engel, S. Greening the Common Agricultural Policy: A Behavioural Perspective and Lab-in-the-Field Experiment in Germany. Eur. Rev. Agric. Econ. 2019, 46, 367–392. [Google Scholar] [CrossRef]
  8. Storm, H.; Baylis, K.; Heckelei, T. Machine Learning in Agricultural and Applied Economics. Eur. Rev. Agric. Econ. 2019, 47, 849–892. [Google Scholar] [CrossRef]
  9. Einav, L.; Levin, J. Economics in the Age of Big Data. Science 2014, 346. [Google Scholar] [CrossRef]
  10. Bajari, P.; Nekipelov, D.; Ryan, S.P.; Yang, M. Machine Learning Methods for Demand Estimation. Am. Econ. Rev. 2015, 105, 481–485. [Google Scholar] [CrossRef] [Green Version]
  11. Grimmer, J. We Are All Social Scientists Now: How Big Data, Machine Learning, and Causal Inference Work Together. PS-Political Sci. Politics 2014, 48, 80–83. [Google Scholar] [CrossRef] [Green Version]
  12. Monroe, B.L.; Pan, J.; Roberts, M.E.; Sen, M.; Sinclair, B. No! Formal Theory, Causal Inference, and Big Data Are Not Contradictory Trends in Political Science. PS-Political Sci. Politics 2014, 48, 71–74. [Google Scholar] [CrossRef] [Green Version]
  13. Susan, A. The Impact of Machine Learning on Economics. In The Economics of Artificial Intelligence; University of Chicago Press: Chicago, IL, USA, 2019. [Google Scholar]
  14. Shekhar, S.; Schnable, P.; Lebauer, D.; Baylis, K.; Vanderwaal, K. Agriculture Big Data (AgBD) Challenges and Opportunities from Farm to Table: A Midwest Big Data Hub Community Whitepaper. In Proceedings of the Machine Learning: Farm-to-Table Workshop, Champaign, IL, USA, 18–19 April 2017. [Google Scholar]
  15. Coble, K.H.; Mishra, A.K.; Ferrell, S.; Griffin, T. Big Data in Agriculture: A Challenge for the Future. Appl. Econ. Perspect. Policy 2018, 40, 79–96. [Google Scholar] [CrossRef] [Green Version]
  16. Kamilaris, A.; Prenafeta-Boldú, F.X. Deep Learning in Agriculture: A Survey. Comput. Electron. Agric. 2018, 147, 70–90. [Google Scholar] [CrossRef] [Green Version]
  17. John Lu, Z.Q. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. J. R. Stat. Soc. 2010, 173, 83–85. [Google Scholar] [CrossRef]
  18. Fisher, A.; Rudin, C.; Dominici, F. All Models Are Wrong but Many Are Useful: Variable Importance for Black-Box, Proprietary, or Misspecified Prediction Models, Using Model Class Reliance. arXiv 2018, arXiv:1801.01489. Available online: https://arxiv.org/abs/1801.01489 (accessed on 11 March 2021).
  19. Kim, B.; Khanna, R.; Koyejo, O. Examples Are Not Enough, Learn to Criticize! Criticism for Interpretability. In Proceedings of the 30th International Conference on Neural Information Processing Systems, Barcelona, Spain, 5–10 December 2016. [Google Scholar]
  20. Schlenker, W.; Roberts, M.J. Nonlinear Temperature Effects Indicate Severe Damages to U.S. Crop Yields under Climate Change. Proc. Natl. Acad. Sci. USA 2009, 106, 15594–15598. [Google Scholar] [CrossRef] [Green Version]
  21. Burness, H.S.; Brill, T.C. The Role for Policy in Common Pool Groundwater Use. Resour. Energy Econ. 2001, 23, 19–40. [Google Scholar] [CrossRef]
  22. Zivin, J.G.; Neidell, M. Environment, Health, and Human Capital. J. Econ. Lit. 2013, 51, 689–730. [Google Scholar] [CrossRef]
  23. Shimshack, J.P.; Ward, M.B.; Beatty, T.K.M. Mercury Advisories: Information, Education, and Fish Consumption. J. Environ. Econ. Manag. 2007, 53, 158–179. [Google Scholar] [CrossRef] [Green Version]
  24. Signorino, C.S.; Yilmaz, K. Strategic Misspecification in Regression Models. Am. J. Political Sci. 2003, 47, 551–566. [Google Scholar] [CrossRef]
  25. Jones, S.; Johnstone, D.; Wilson, R. Predicting Corporate Bankruptcy: An Evaluation of Alternative Statistical Frameworks. J. Bus. Financ. Account. 2017, 44, 3–34. [Google Scholar] [CrossRef]
  26. Lessmann, S.; Baesens, B.; Seow, H.V.; Thomas, L.C. Benchmarking State-of-the-Art Classification Algorithms for Credit Scoring: An Update of Research. Eur. J. Oper. Res. 2015, 247, 124–136. [Google Scholar] [CrossRef] [Green Version]
  27. Xia, Y.; Liu, C.; Li, Y.Y.; Liu, N. A Boosted Decision Tree Approach Using Bayesian Hyper-Parameter Optimization for Credit Scoring. Expert Syst. Appl. 2017, 78, 225–241. [Google Scholar] [CrossRef]
  28. Fenske, N.; Kneib, T.; Hothorn, T. Identifying Risk Factors for Severe Childhood Malnutrition by Boosting Additive Quantile Regression. J. Am. Stat. Assoc. 2011, 106, 494–510. [Google Scholar] [CrossRef] [Green Version]
  29. März, A.; Klein, N.; Kneib, T.; Musshoff, O. Analysing Farmland Rental Rates Using Bayesian Geoadditive Quantile Regression. Eur. Rev. Agric. Econ. 2016, 43, 663–698. [Google Scholar] [CrossRef]
  30. Jeong, J.H.; Resop, J.P.; Mueller, N.D.; Fleisher, D.H.; Yun, K.; Butler, E.E.; Timlin, D.J.; Shim, K.M.; Gerber, J.S.; Reddy, V.R.; et al. Random Forests for Global and Regional Crop Yield Predictions. PLoS ONE 2016, 11, e0156571. [Google Scholar] [CrossRef]
  31. Sieber, S.; Amjath-Babu, T.S.; Jansson, T.; Müller, K.; Tscherning, K.; Graef, F.; Pohle, D.; Helming, K.; Rudloff, B.; Saravia-Matus, B.S.; et al. Sustainability Impact Assessment Using Integrated Meta-Modelling: Simulating the Reduction of Direct Support under the EU Common Agricultural Policy (CAP). Land Use Policy 2013, 33, 235–245. [Google Scholar] [CrossRef]
  32. Grondard, N.; Hein, L.; van Bussel, L.G.J. Ecosystem Accounting to Support the Common Agricultural Policy. Ecol. Indic. 2021, 131, 108157. [Google Scholar] [CrossRef]
  33. Kiryluk-Dryjska, E.; Beba, P.; Poczta, W. Local Determinants of the Common Agricultural Policy Rural Development Funds’ Distribution in Poland and Their Spatial Implications. J. Rural. Stud. 2020, 74, 201–209. [Google Scholar] [CrossRef]
  34. Garrone, M.; Emmers, D.; Olper, A.; Swinnen, J. Jobs and Agricultural Policy: Impact of the Common Agricultural Policy on EU Agricultural Employment. Food Policy 2019, 87, 101744. [Google Scholar] [CrossRef] [Green Version]
  35. Gómez-Limón, J.A.; Sanchez-Fernandez, G. Empirical Evaluation of Agricultural Sustainability Using Composite Indicators. Ecol. Econ. 2010, 69, 1062–1075. [Google Scholar] [CrossRef]
  36. Patyka, N.; Gryschenko, O.; Kucher, A.; Hełdak, M.; Raszka, B. Assessment of the Degree of Factors Impact on Employment in Ukraine’s Agriculture. Sustainability 2021, 13, 564. [Google Scholar] [CrossRef]
  37. Cristea, D.S.; Magdalena, T.R.M.; Zamfir, C.G.; Adrian, T.R.; Adrian, Z.G.; Stefan-Mihai, P.; Dumitru, N. Perspectives of the Moldavian Agricultural Sector by Using a Custom-Developed Analytical Framework. Sustainability 2020, 12, 4671. [Google Scholar] [CrossRef]
  38. Holzworth, D.P.; Snow, V.; Janssen, S.; Athanasiadis, I.N.; Donatelli, M.; Hoogenboom, G.; White, J.W.; Thorburn, P. Agricultural Production Systems Modelling and Software: Current Status and Future Prospects. Environ. Model. Softw. 2015, 72, 276–286. [Google Scholar] [CrossRef]
  39. Sellam, V.; Poovammal, E. Prediction of Crop Yield Using Regression Analysis. Indian J. Sci. Technol. 2016, 9, 1–5. [Google Scholar] [CrossRef]
  40. Manual for the Classification of Intervention Best-Practices with Rural NEETs-Rural NEET Youth Observatory-COST Action CA18213. Available online: https://rnyobservatory.eu/web/manual-best-practices-rural-neets/ (accessed on 12 October 2021).
  41. Mujčinović, A.; Nikolić, A.; Tuna, E.; Stamenkovska, I.J.; Radović, V.; Flynn, P.; McCauley, V. Is It Possible to Tackle Youth Needs with Agricultural and Rural Development Policies? Sustainability 2021, 13, 8410. [Google Scholar] [CrossRef]
  42. Telega, I.; Telega, A. Driving factors of material consumption in European countries-spatial panel data analysis. J. Environ. Econ. Policy 2020, 9, 269–280. [Google Scholar] [CrossRef]
  43. Prado, S. Growing Unequal? Income Distribution and Poverty in OECD Countries; Organisation for Economic Co-operation and Development: Paris, France, 2008; ISBN 9789264044197. [Google Scholar]
  44. Criminisi, A.; Shotton, J.; Konukoglu, E. Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning. Found. Trends Comput. Graph. Vis. 2011, 7, 81–227. [Google Scholar] [CrossRef]
  45. Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; CRC Press: Boca Raton, FL, USA, 2017; ISBN 9781351460491. [Google Scholar]
  46. Probst, P.; Wright, M.N.; Boulesteix, A.L. Hyperparameters and Tuning Strategies for Random Forest. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2019, 9, e1301. [Google Scholar] [CrossRef] [Green Version]
  47. Ziegler, A.; König, I.R. Mining Data with Random Forests: Current Options for Real-World Applications. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2014, 4, 55–63. [Google Scholar] [CrossRef]
  48. Biau, G.; Scornet, E. A Random Forest Guided Tour. Test 2016, 25, 197–227. [Google Scholar] [CrossRef] [Green Version]
  49. Breiman, L. Statistical Modeling: The Two Cultures. Stat. Sci. 2001, 16, 199–231. [Google Scholar] [CrossRef]
  50. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. Moving Beyond Linearity. In An Introduction to Statistical Learning; Springer: New York, NY, USA, 2013. [Google Scholar]
  51. Roberts, M.J.; Key, N. Agricultural Payments and Land Concentration: A Semiparametric Spatial Regression Analysis. Am. J. Agric. Econ. 2008, 90, 627–643. [Google Scholar] [CrossRef]
  52. Lawler, J.J.; White, D.; Neilson, R.P.; Blaustein, A.R. Predicting Climate-Induced Range Shifts: Model Differences and Model Reliability. Glob. Chang. Biol. 2006, 12, 1568–1584. [Google Scholar] [CrossRef] [Green Version]
  53. Stauffer, R.; Mayr, G.J.; Messner, J.W.; Umlauf, N.; Zeileis, A. Spatio-Temporal Precipitation Climatology over Complex Terrain Using a Censored Additive Regression Model. Int. J. Climatol. 2017, 37, 3264–3275. [Google Scholar] [CrossRef] [Green Version]
  54. Chen, K.; O’Leary, R.A.; Evans, F.H. A Simple and Parsimonious Generalised Additive Model for Predicting Wheat Yield in a Decision Support Tool. Agric. Syst. 2019, 173, 140–150. [Google Scholar] [CrossRef]
  55. Rosenheim, J.A.; Cass, B.N.; Kahl, H.; Steinmann, K.P. Variation in Pesticide Use across Crops in California Agriculture: Economic and Ecological Drivers. Sci. Total Environ. 2020, 733, 138683. [Google Scholar] [CrossRef]
  56. Olesen, J.E.; Trnka, M.; Kersebaum, K.C.; Skjelvåg, A.O.; Seguin, B.; Peltonen-Sainio, P.; Rossi, F.; Kozyra, J.; Micale, F. Impacts and Adaptation of European Crop Production Systems to Climate Change. Eur. J. Agron. 2011, 34, 96–112. [Google Scholar] [CrossRef]
  57. Schils, R.; Olesen, J.E.; Kersebaum, K.C.; Rijk, B.; Oberforster, M.; Kalyada, V.; Khitrykau, M.; Gobin, A.; Kirchev, H.; Manolova, V.; et al. Cereal Yield Gaps across Europe. Eur. J. Agron. 2018, 101, 109–120. [Google Scholar] [CrossRef]
  58. Coelli, T.J. Measurement of Total Factor Productivity Growth and Biases in Technological Change in Western Australian Agriculture. J. Appl. Econom. 1996, 11, 77–91. [Google Scholar] [CrossRef]
  59. Dimelis, S.P.; Dimopoulou, M. Evaluating Productivity Growth Measures in the EU. Econ. Plan. 2002, 35, 161–181. [Google Scholar] [CrossRef]
  60. Fernald, J.; Matoba, K. Growh Accounting, Potential Output, and the Current Recession. FRBSF Econ. Lett. 2009, 26. [Google Scholar]
  61. Eaae, J.I.; September, S. Studies on the Agricultural and Food Sector in Central and Eastern Europe; Leibniz Institute of Agricultural Development in Transition Economies: Halle, Germany, 2007; ISBN 9783938584316. [Google Scholar]
  62. International Bank for Reconstruction and Development. Global Economic Prospects: Technology Diffusion in the Developing World 2008; Word Bank: Washington, DC, USA, 2008. [Google Scholar]
  63. Levenko, N.; Oja, K.; Staehr, K. Total Factor Productivity Growth in Central and Eastern Europe before, during and after the Global Financial Crisis. Post-Communist Econ. 2019, 31, 137–160. [Google Scholar] [CrossRef]
  64. Guth, M.; Smędzik-Ambroży, K.; Czyżewski, B.; Stępień, S. The Economic Sustainability of Farms under Common Agricultural Policy in the European Union Countries. Agriculture 2020, 10, 34. [Google Scholar] [CrossRef] [Green Version]
  65. Haggblade, S.; Hazell, P.; Reardon, T. The Rural Non-Farm Economy: Prospects for Growth and Poverty Reduction. World Dev. 2010, 38, 1429–1441. [Google Scholar] [CrossRef]
  66. Foley, J.A.; Ramankutty, N.; Brauman, K.A.; Cassidy, E.S.; Gerber, J.S.; Johnston, M.; Mueller, N.D.; O’Connell, C.; Ray, D.K.; West, P.C.; et al. Solutions for a Cultivated Planet. Nature 2011, 478, 337–342. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Beddington, S. The Future of Food and Farming. Int. J. Agric. Manag. 2011, 1, 2–6. [Google Scholar]
  68. Naylor, R. Expanding the Boundaries of Agricultural Development. Food Secur. 2011, 3, 233–251. [Google Scholar] [CrossRef]
  69. Erbas, B.C.; Solakoglu, E.G. In the Presence of Climate Change, the Use of Fertilizers and the Effect of Income on Agricultural Emissions. Sustainability 2017, 9, 1989. [Google Scholar] [CrossRef] [Green Version]
  70. Matthews, A. Food Prices and Rural Poverty. Eur. Rev. Agric. Econ. 2011, 38, 617–619. [Google Scholar] [CrossRef]
  71. Schiff, M.; Valdés, A. Chapter 27 Agriculture and the Macroeconomy, with Emphasis on Developing Countries. Handb. Agric. Econ. 2002, 2, 1421–1454. [Google Scholar]
  72. Pardey, P.G.; Chan-Kang, C.; Dehmer, S.; Beddow, J.M.; Hurley, T.M.; Rao, X.; Alston, J.M. Investments in and the Economic Returns to Agricultural and Food R&D Worldwide. In Encyclopedia of Agriculture and Food Systems; Elsevier: Amsterdam, The Netherlands, 2014; ISBN 9780080931395. [Google Scholar]
  73. Dollar, D.; Kraay, A. Growth Is Good for the Poor. J. Econ. Growth 2002, 7, 195–225. [Google Scholar] [CrossRef]
  74. Ravallion, M.; Squire, L.; Bruno, M. Equity and Growth in Developing Countries: Old and New Perspectives on the Policy; World Bank: Washington, DC, USA, 1998; ISBN 0262024144. [Google Scholar]
  75. Dao, M.Q. Rural Poverty in Developing Countries: An Empirical Analysis. J. Econ. Stud. 2004, 31, 500–508. [Google Scholar]
  76. Galán-Martín, Á.; Pozo, C.; Guillén-Gosálbez, G.; Antón Vallejo, A.; Jiménez Esteller, L. Multi-Stage Linear Programming Model for Optimizing Cropping Plan Decisions under the New Common Agricultural Policy. Land Use Policy 2015, 48, 515–524. [Google Scholar] [CrossRef]
  77. Chen, Z.; Zhang, A.; Zhou, K.; Huang, L. Can Payment Tools Substitute for Regulatory Ones? Estimating the Policy Preference for Agricultural Land Preservation, Tianjin, China. Land Use Policy 2021, 100, 104860. [Google Scholar] [CrossRef]
  78. Volkov, A.; Balezentis, T.; Morkunas, M.; Streimikiene, D. In a Search for Equity: Do Direct Payments under the Common Agricultural Policy Induce Convergence in the European Union? Sustainability 2019, 11, 3462. [Google Scholar] [CrossRef] [Green Version]
  79. Agriculture Statistics-Family Farming in the EU-Statistics Explained. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Agriculture_statistics_-_family_farming_in_the_EU (accessed on 9 September 2021).
  80. Sinabell, F.; Schmid, E.; Hofreither, M.F. Exploring the Distribution of Direct Payments of the Common Agricultural Policy. Empirica 2013, 40, 325–341. [Google Scholar] [CrossRef] [Green Version]
  81. European Council. Cover Note Conclusions (Multiannual Financial Framework). 2013. Available online: http://data.consilium.europa.eu/doc/document/ST-37-2013-INIT/en/pdf (accessed on 8 August 2021).
  82. Chen, M.; Zhang, H.; Liu, W.; Zhang, W. The Global Pattern of Urbanization and Economic Growth: Evidence from the Last Three Decades. PLoS ONE 2014, 9, e103799. [Google Scholar] [CrossRef] [Green Version]
  83. Chatzinikolaou, P.; Manos, B.D.; Bournaris, T. Classification of Rural Areas in Europe Using Social Sustainability Indicators. In Proceedings of the Italian Association of Agricultural and Applied Economics (AIEAA), Trento, Italy, 4–5 July 2012. [Google Scholar] [CrossRef]
  84. Subić, J.; Jeločnik, M.; Jovanović, M. Evaluation of Social Sustainability of Agriculture within the Carpathians in the Republic of Serbia. Sci. Pap. Ser.-Manag. Econ. Eng. Agric. Rural. Dev. 2013, 13, 411–415. [Google Scholar]
  85. Basiago, A.D. Economic, Social, and Environmental Sustainability in Development Theory and Urban Planning Practice. Environmentalist 1998, 19, 145–161. [Google Scholar] [CrossRef]
  86. Weingaertner, C.; Moberg, Å. Exploring Social Sustainability: Learning from Perspectives on Urban Development and Companies and Products. Sustain. Dev. 2014, 22, 122–133. [Google Scholar] [CrossRef] [Green Version]
  87. Wolz, A.; Fritzsch, J.; Reinsberg, K. The Impact of Social Capital on Farm and Household Income: Results of a Survey among Individual Farmers in Poland. In Proceedings of the European Association of Agricultural Economists (EAAE), Ashford, UK, 9–10 April 2005. [Google Scholar] [CrossRef]
  88. Latruffe, L.; Diazabakana, A.; Bockstaller, C.; Desjeux, Y.; Finn, J.; Kelly, E.; Ryan, M.; Uthes, S. Measurement of Sustainability in Agriculture: A Review of Indicators. Stud. Agric. Econ. 2016, 118, 123–130. [Google Scholar] [CrossRef]
  89. Flora, C.; Roesch-McNally, G.E. Sustainable Agriculture and Social Justice: A Conversation with Dr. Cornelia Flora. J. Crit. Thought Prax. 2018, 3. [Google Scholar] [CrossRef]
  90. Papadopoulos, A.G. The Impact of the CAP on Agriculture and Rural Areas of EU Member States. Agrar. South: J. Political Econ. 2015, 4, 22–53. [Google Scholar] [CrossRef]
  91. Bein, M.A.; Ciftcioglu, S. The Relationship between the Relative GDP Share of Agriculture and the Unemployment Rate in Selected Central and Eastern European Countries. Agric. Econ. 2017, 63, 308–317. [Google Scholar] [CrossRef] [Green Version]
  92. Steiner, A.; Atterton, J. Exploring the Contribution of Rural Enterprises to Local Resilience. J. Rural. Stud. 2015, 40, 30–45. [Google Scholar] [CrossRef] [Green Version]
  93. Bock, B. It Still Matters Where You Live: Rural Women’s Employment throughout Europe. In Women in the European Countryside; Routledge: London, UK, 2017; ISBN 9781351142885. [Google Scholar]
  94. Mellor, J.W.; Malik, S.J. The Impact of Growth in Small Commercial Farm Productivity on Rural Poverty Reduction. World Dev. 2017, 91, 1–10. [Google Scholar] [CrossRef]
  95. Kiaušienė, I. Comparative Assessment of Women Unemployment and Poverty in European Union. Intellect. Econ. 2015, 9, 91–101. [Google Scholar] [CrossRef] [Green Version]
  96. Mansi, E.; Hysa, E.; Panait, M.; Voica, M.C. Poverty—A Challenge for Economic Development? Evidences from Western Balkan Countries and the European Union. Sustainability 2020, 12, 7754. [Google Scholar] [CrossRef]
  97. Giannakis, E.; Kushta, J.; Bruggeman, A.; Lelieveld, J. Costs and Benefits of Agricultural Ammonia Emission Abatement Options for Compliance with European Air Quality Regulations. Environ. Sci. Eur. 2019, 31, 1–13. [Google Scholar] [CrossRef] [Green Version]
  98. Velthof, G.L.; van Bruggen, C.; Groenestein, C.M.; de Haan, B.J.; Hoogeveen, M.W.; Huijsmans, J.F.M. A Model for Inventory of Ammonia Emissions from Agriculture in the Netherlands. Atmos. Environ. 2012, 46, 248–255. [Google Scholar] [CrossRef]
  99. Backes, A.; Aulinger, A.; Bieser, J.; Matthias, V.; Quante, M. Ammonia Emissions in Europe, Part I: Development of a Dynamical Ammonia Emission Inventory. Atmos. Environ. 2016, 131, 55–66. [Google Scholar] [CrossRef] [Green Version]
  100. Velthof, G.L.; Lesschen, J.P.; Webb, J.; Pietrzak, S.; Miatkowski, Z.; Pinto, M.; Kros, J.; Oenema, O. The Impact of the Nitrates Directive on Nitrogen Emissions from Agriculture in the EU-27 during 2000–2008. Sci. Total Environ. 2014, 468, 1225–1233. [Google Scholar] [CrossRef]
  101. Schulze, E.D.; Ciais, P.; Luyssaert, S.; Schrumpf, M.; Janssens, I.A.; Thiruchittampalam, B.; Theloke, J.; Saurat, M.; Bringezu, S.; Lelieveld, J.; et al. The European Carbon Balance. Part 4: Integration of Carbon and Other Trace-Gas Fluxes. Glob. Chang. Biol. 2010, 16, 1451–1469. [Google Scholar] [CrossRef]
  102. Hutchings, N.J.; Sommer, S.G.; Andersen, J.M.; Asman, W.A.H. A Detailed Ammonia Emission Inventory for Denmark. Atmos. Environ. 2001, 35, 1959–1968. [Google Scholar] [CrossRef]
  103. Valin, H.; Havlík, P.; Mosnier, A.; Herrero, M.; Schmid, E.; Obersteiner, M. Agricultural Productivity and Greenhouse Gas Emissions: Trade-Offs or Synergies between Mitigation and Food Security? Environ. Res. Lett. 2013, 8, 035019. [Google Scholar] [CrossRef]
  104. Paustian, K.; Cole, C.V.; Sauerbeck, D.; Sampson, N. CO2 Mitigation by Agriculture: An Overview. Clim. Chang. 1998, 40, 135–162. [Google Scholar] [CrossRef]
  105. Oertel, C.; Matschullat, J.; Zurba, K.; Zimmermann, F.; Erasmi, S. Greenhouse Gas Emissions from Soils-A Review. Geochemistry 2016, 76, 327–352. [Google Scholar] [CrossRef] [Green Version]
  106. Ju, X.; Gu, B.; Wu, Y.; Galloway, J.N. Reducing China’s Fertilizer Use by Increasing Farm Size. Glob. Environ. Chang. 2016, 41, 26–32. [Google Scholar] [CrossRef]
  107. Hulten, C.R. Total Factor Productivity: A Short Biography; University of Chicago Press: Chicago, IL, USA, 2001; Volume 1. [Google Scholar]
  108. Ruttan, V.W. Productivity Growth in World Agriculture: Sources and Constraints. J. Econ. Perspect. 2002, 16, 161–184. [Google Scholar] [CrossRef] [Green Version]
  109. Levins, R.A.; Cochrane, W.W. The Treadmill Revisited. Land Econ. 1996, 72, 550–553. [Google Scholar] [CrossRef]
  110. Kiryluk-Dryjska, E.; Baer-Nawrocka, A. Regional Differences in Benefits from the Eu Common Agricultural Policy in Poland and Their Policy Implications. Agriculture 2021, 11, 288. [Google Scholar] [CrossRef]
  111. Kosmaczewska, J.; Poczta, W. Tourism and Socio-Economic Transformation of Rural Areas: Evidence from Poland; Routledge: London, UK, 2021; p. 372. [Google Scholar]
  112. Petrick, M.; Zier, P. Common Agricultural Policy Effects on Dynamic Labour Use in Agriculture. Food Policy 2012, 37, 671–678. [Google Scholar] [CrossRef] [Green Version]
  113. Olper, A.; Raimondi, V.; Cavicchioli, D.; Vigani, M. Do CAP Payments Reduce Farm Labour Migration? A Panel Data Analysis across EU Regions. Eur. Rev. Agric. Econ. 2014, 41, 843–873. [Google Scholar] [CrossRef]
  114. Berlinschi, R.; Swinnen, J.; van Herck, K. Trapped in Agriculture Credit Constraints, Investments in Education and Agricultural Employment. Eur. J. Dev. Res. 2014, 26, 490–508. [Google Scholar] [CrossRef]
  115. Liu, Y.; Pan, X.; Li, J. A 1961–2010 Record of Fertilizer Use, Pesticide Application and Cereal Yields: A Review. Agron. Sustain. Dev. 2014, 35, 83–93. [Google Scholar] [CrossRef] [Green Version]
  116. McArthur, J.W.; McCord, G.C. Fertilizing Growth: Agricultural Inputs and Their Effects in Economic Development. J. Dev. Econ. 2017, 127, 133–152. [Google Scholar] [CrossRef]
  117. Lamb, R.L. Fertilizer Use, Risk, and off-Farm Labor Markets in the Semi-Arid Tropics of India. Am. J. Agric. Econ. 2003, 85, 359–371. [Google Scholar] [CrossRef] [Green Version]
  118. Giannakis, E.; Bruggeman, A. The Highly Variable Economic Performance of European Agriculture. Land Use Policy 2015, 45, 26–35. [Google Scholar] [CrossRef]
  119. Schwarcz, P.; Mura, L.; Pätoprstý, M.; Schwarczová, L. The Impact of Employment in Agriculture on Overall Employment and Development: A Case Study of the District of Topol’čany, Slovakia. SEER 2012, 15, 483–492. [Google Scholar] [CrossRef]
  120. Polyzos, S.; Arabatzis, G. Labor Productivity of the Agricultural Sector in Greece: Determinant Factors and Interregional Differences Analysis. Mediterr. J. Econ. Agric. Environ. 2006, 5, 58–64. [Google Scholar]
  121. Van den Ban, A. Increasing Labour Productivity in Agriculture and Its Implications. J. Agric. Educ. Ext. 2011, 17, 401–409. [Google Scholar] [CrossRef]
  122. Dorward, A. Agricultural Labour Productivity, Food Prices and Sustainable Development Impacts and Indicators. Food Policy 2013, 39, 40–50. [Google Scholar] [CrossRef] [Green Version]
  123. Rembisz, W.; Sielska, A. Relationship between Labour Productivity and Its Remuneration. The Case of Agriculture. Glob. J. Humanit. Soc. Sci. 2016, 2, 262–269. [Google Scholar] [CrossRef]
  124. Guyomard, H.; Baudry, M.; Carpentier, A. Estimating Crop Supply Response in the Presence of Farm Programmes: Application to the CAP. Eur. Rev. Agric. Econ. 1996, 23, 401–420. [Google Scholar] [CrossRef]
  125. Gołębiewski, J. Review of the Book by Bazyli Czyżewski “Market Treadmill in European Agriculture”, PWN, Warsaw 2017, P. 227. Econ. Reg. Stud. 2019, 12, 336–342. [Google Scholar] [CrossRef] [Green Version]
  126. Vozárová, I.K.; Kotulič, R.; Vavrek, R. Assessing Impacts of CAP Subsidies on Financial Performance of Enterprises in Slovak Republic. Sustainability 2020, 12, 948. [Google Scholar] [CrossRef] [Green Version]
  127. Giannakis, E.; Bruggeman, A. Exploring the Labour Productivity of Agricultural Systems across European Regions: A Multilevel Approach. Land Use Policy 2018, 77, 94–106. [Google Scholar] [CrossRef]
  128. Stewart, W.M.; Dibb, D.W.; Johnston, A.E.; Smyth, T.J. The Contribution of Commercial Fertilizer Nutrients to Food Production. Agron. J. 2005, 97, 1–6. [Google Scholar] [CrossRef] [Green Version]
  129. Czyzewski, B.; Smedzik-Ambrozy, K. The Regional Structure of the CAP Subsidies and the Factor Productivity in Agriculture in the EU 28. Agric. Econ. 2017, 63, 149–163. [Google Scholar] [CrossRef] [Green Version]
  130. Rizov, M.; Pokrivcak, J.; Ciaian, P. CAP Subsidies and Productivity of the EU Farms. J. Agric. Econ. 2013, 64, 537–557. [Google Scholar] [CrossRef]
  131. Beckman, J.; Schimmelpfennig, D. Determinants of Farm Income. Agric. Financ. Rev. 2015, 75, 385–402. [Google Scholar] [CrossRef]
  132. Buckwell, A.; Matthews, A.; Baldock, D.; Mathijs, E. CAP: Thinking out of the Box: Further Modernisation of the CAP-Why, What and How? RISE Foundation: Brussels, Belgium, 2017. [Google Scholar]
  133. Nazzaro, C.; Marotta, G. The Common Agricultural Policy 2014–2020: Scenarios for the European Agricultural and Rural Systems. Agric. Food Econ. 2016, 4, 1. [Google Scholar] [CrossRef] [Green Version]
  134. Offermann, F.; Nieberg, H.; Zander, K. Dependency of Organic Farms on Direct Payments in Selected EU Member States: Today and Tomorrow. Food Policy 2009, 34, 273–279. [Google Scholar] [CrossRef]
  135. Patton, M.; Kostov, P.; McErlean, S.; Moss, J. Assessing the Influence of Direct Payments on the Rental Value of Agricultural Land. Food Policy 2008, 33, 397–405. [Google Scholar] [CrossRef]
  136. Zhu, X.; Lansink, A.O. Impact of CAP Subsidies on Technical Efficiency of Crop Farms in Germany, the Netherlands and Sweden. J. Agric. Econ. 2010, 61, 545–564. [Google Scholar] [CrossRef]
  137. Zhu, X.; Demeter, R.M.; Lansink, A.O. Competitiveness of Dairy Farms in Three Countries: The Role of CAP Subsidies. In Proceedings of the 12th Congress of the European Association of Agricultural Economist, Ghent, Belgium, 26–29 August 2008. [Google Scholar]
  138. Morkunas, M.; Labukas, P. The Evaluation of Negative Factors of Direct Payments under Common Agricultural Policy from a Viewpoint of Sustainability of Rural Regions of the New EU Member States: Evidence from Lithuania. Agriculture 2020, 10, 228. [Google Scholar] [CrossRef]
  139. Dias, C.S.L.; Rodrigues, R.G.; Ferreira, J.J. What’s New in the Research on Agricultural Entrepreneurship? J. Rural. Stud. 2019, 65, 99–115. [Google Scholar] [CrossRef]
  140. Hulten, C.R.; Dean, E.R.; Harper, M.; Hulten, C.R. Total Factor Productivity. In New Developments in Productivity Analysis; University of Chicago Press: Chicago, IL, USA, 2013. [Google Scholar]
  141. Erken, H.; Donselaar, P.; Thurik, R. Total Factor Productivity and the Role of Entrepreneurship. J. Technol. Transf. 2018, 43, 1493–1521. [Google Scholar] [CrossRef] [Green Version]
  142. Schut, M.; Klerkx, L.; Sartas, M.; Lamers, D.; Campbell, M.M.C.; Ogbonna, I.; Kaushik, P.; Atta-Krah, K.; Leeuwis, C. Innovation Platforms: Experiences with Their Institutional Embedding in Agricultural Research for Development. Exp. Agric. 2016, 52, 537–561. [Google Scholar] [CrossRef] [Green Version]
  143. Shamshiri, R.R.; Weltzien, C.; Hameed, I.A.; Yule, I.J.; Grift, T.E.; Balasundram, S.K.; Pitonakova, L.; Ahmad, D.; Chowdhary, G. Research and Development in Agricultural Robotics: A Perspective of Digital Farming. Int. J. Agric. Biol. Eng. 2018, 11, 1–14. [Google Scholar] [CrossRef]
  144. Alston, J.M.; Beddow, J.M.; Pardey, P.G. Agricultural Research, Productivity, and Food Prices in the Long Run. Science 2009, 325, 1209–1210. [Google Scholar] [CrossRef]
  145. Ehlers, M.H.; Huber, R.; Finger, R. Agricultural Policy in the Era of Digitalisation. Food Policy 2021, 100, 102019. [Google Scholar] [CrossRef]
  146. Navarro, A.; López-Bao, J.V. Towards a Greener Common Agricultural Policy. Nat. Ecol. Evol. 2018, 2, 1830–1833. [Google Scholar] [CrossRef]
  147. Pe’er, G.; Zinngrebe, Y.; Moreira, F.; Sirami, C.; Schindler, S.; Müller, R.; Bontzorlos, V.; Clough, D.; Bezák, P.; Bonn, A.; et al. A Greener Path for the EU Common Agricultural Policy. Science 2019, 365, 449–451. [Google Scholar] [CrossRef]
  148. Pe’er, G.; Bonn, A.; Bruelheide, H.; Dieker, P.; Eisenhauer, N.; Feindt, P.H.; Hagedorn, G.; Hansjürgens, B.; Herzon, I.; Lomba, Â.; et al. Action Needed for the EU Common Agricultural Policy to Address Sustainability Challenges. People Nat. 2020, 2, 305–316. [Google Scholar] [CrossRef]
  149. Alexiadis, S.; Ladias, C.; Hasanagas, N. A Regional Perspective of the Common Agricultural Policy. Land Use Policy 2013, 30, 665–669. [Google Scholar] [CrossRef] [Green Version]
  150. Czubak, W.; Pawłowski, K.P. Sustainable Economic Development of Farms in Central and Eastern European Countries Driven by Pro-Investment Mechanisms of the Common Agricultural Policy. Agriculture 2020, 10, 93. [Google Scholar] [CrossRef] [Green Version]
  151. Bournaris, T.; Manos, B. European Union Agricultural Policy Scenarios’ Impacts on Social Sustainability of Agricultural Holdings. Int. J. Sustain. Dev. World Ecol. 2012, 19, 426–432. [Google Scholar] [CrossRef]
  152. Kutkowska, B. Wspieranie Dochodów Rolniczych Przez Dopłaty Bezpośrednie W Gospodarstwach Dolnego Śląska. J. Agribus. Rural. Dev. 2009, 12, 101–109. [Google Scholar]
  153. Grant, W. The Common Agricultural Policy; Macmillan International Higher Education: London, UK, 1997. [Google Scholar] [CrossRef]
  154. Pandey, B.; Bandyopadhyay, P.; Kadam, S.; Singh, M. Bibliometric Study on Relationship of Agricultural Credit with Farmer Distress. Manag. Environ. Qual. Int. J. 2018, 29, 278–288. [Google Scholar] [CrossRef]
  155. Morkūnas, M.; Volkov, A.; Pazienza, P. How Resistant Is the Agricultural Sector? Economic Resilience Exploited. Econ. Sociol. 2018, 11, 321–332. [Google Scholar] [CrossRef]
Figure 1. Data modelling technological flow.
Figure 1. Data modelling technological flow.
Agronomy 11 02105 g001
Figure 2. Best subset result for determine Labor Productivity in Agriculture best model.
Figure 2. Best subset result for determine Labor Productivity in Agriculture best model.
Agronomy 11 02105 g002
Figure 3. Model selection residual plots example.
Figure 3. Model selection residual plots example.
Agronomy 11 02105 g003
Figure 4. The average values of total GVA, GVA of the agriculture sector, and GVA registered for rural areas in the previous financial period (2007–2013) for each of the EU member countries.
Figure 4. The average values of total GVA, GVA of the agriculture sector, and GVA registered for rural areas in the previous financial period (2007–2013) for each of the EU member countries.
Agronomy 11 02105 g004
Figure 5. Rural employment rates comparison for European countries—an average between 2007–2013(%).
Figure 5. Rural employment rates comparison for European countries—an average between 2007–2013(%).
Agronomy 11 02105 g005
Figure 6. GAM predictors influence over the Emissions in Agriculture.
Figure 6. GAM predictors influence over the Emissions in Agriculture.
Agronomy 11 02105 g006
Figure 7. GAM predictors influence over the CO2.
Figure 7. GAM predictors influence over the CO2.
Agronomy 11 02105 g007
Figure 8. GAM predictors influence over the fertilizer use.
Figure 8. GAM predictors influence over the fertilizer use.
Agronomy 11 02105 g008
Table 1. Dataset descriptive statistics.
Table 1. Dataset descriptive statistics.
Descriptive Statistics
NMinimumMaximumMeanStd. Deviation
GVA17517.153361.48636.99892.21
CO21751366.0898,865.9619,935.6224,333.36
GVA_Rural1751.25452.8063.1498.20
Fertilizer_Use17544.30495.20160.1798.64
Employment Agriculture1751.0631.016.635.93
Crop_Yield1751545.009710.005109.891776.97
GVA_Agriculture1750.4947.2210.7312.67
DP16635.138167.001663.132049.99
CPEFP1750.7110.412.701.51
AFI1752025.5360,715.9617,245.1112,542.00
RER17549.1277.9463.627.54
RER_M17556.4983.1769.445.93
RER_F17539.9174.0659.237.45
LPA1752523.4062,877.2217,227.4214,909.76
EA1759.03692.49147.21186.68
DRP1726.5064.1027.5811.66
AEI175598.1548,318.5114,119.2310,983.96
GDP17527.90127.4471.5926.49
TFP17592.13125.03103.636.57
ARD1750.00770.70123.74184.83
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cristea, D.S.; Rosenberg, S.; Pustianu Mocanu, A.; Simionov, I.A.; Mogodan, A.A.; Petrea, S.M.; Moga, L.M. Modelling the Common Agricultural Policy Impact over the EU Agricultural and Rural Environment through a Machine Learning Predictive Framework. Agronomy 2021, 11, 2105. https://doi.org/10.3390/agronomy11112105

AMA Style

Cristea DS, Rosenberg S, Pustianu Mocanu A, Simionov IA, Mogodan AA, Petrea SM, Moga LM. Modelling the Common Agricultural Policy Impact over the EU Agricultural and Rural Environment through a Machine Learning Predictive Framework. Agronomy. 2021; 11(11):2105. https://doi.org/10.3390/agronomy11112105

Chicago/Turabian Style

Cristea, Dragos Sebastian, Sarina Rosenberg, Adriana Pustianu Mocanu, Ira Adeline Simionov, Alina Antache Mogodan, Stefan Mihai Petrea, and Liliana Mihaela Moga. 2021. "Modelling the Common Agricultural Policy Impact over the EU Agricultural and Rural Environment through a Machine Learning Predictive Framework" Agronomy 11, no. 11: 2105. https://doi.org/10.3390/agronomy11112105

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop