3.1. Data Sources
This paper selects panel data from 285 prefecture-level cities in China during the period 2006–2017 as a sample, with full coverage of policy time points. The YRDECC has experienced four expansions in 2010, 2013, 2018, and 2019, respectively, of which the 2010 expansion and 2013 expansions were the largest and most influential. Thus, the period selected in this paper not only covers the two major expansions, leaving sufficient time intervals for studying the long-term effects of the policy, but also avoids the disruption of the following expansion in 2018 and 2019. The expansion policy will also avoid confounding the results of the study. Moreover, different tiers of cities are covered to ensure the robustness of the analysis.
The data of carbon emissions were obtained from the CEADS database compiled by Tsinghua University [
40] and collated by summing the county-level data. The CEADs Centre used a particle swarm optimization-back propagation (PSO-BP) algorithm to unify the scale of DMSP/OLS and NPP/VIIRS satellite imagery to estimate carbon emissions from 2735 counties in China since 1997, with data currently available until 2017.
This paper discusses the impact of the expansion of urban agglomerations on the carbon emissions from both incumbent and new cities. In order to meet the requirements of empirical analysis, the county-level data are summed up into city-level. The control variables in this paper include gross urban product and its index, output value and number of people employed in the three types of industries, total population of urban residents, road paved area, number of foreign direct investment, fiscal expenditure from local government, amount of fixed asset investment, amount of investment in real estate development, and PM2.5 emissions, which were all originally obtained from the China Statistical Yearbook of previous years, the China Urban Statistical Yearbook, and the China Population and Employment Statistical Yearbook. Meanwhile, energy data are collected from the China Industrial Statistical Yearbook. The missing data are supplemented by extracts from provincial statistical yearbooks or the statistical bulletins, statistical yearbooks, and local chronicles of each city.
3.2. Variables
- (1)
Dependent variable
This paper focuses on the impact of the expansion of the Yangtze River Delta urban agglomeration on carbon emissions in 2010 and 2013. Total carbon emissions and carbon emissions efficiency are the key indicators of interest in the carbon field [
41], which are discussed as explained variables in this paper, and the indicators of total carbon emissions and intensity are taken in logarithmic form. The mentioned GDP in this paper is deflated using 2006 as the base year to remove the effects of inflation, and it is also deflated in the subsequent ratio data related to GDP.
- (2)
Independent variable
The independent variable in this paper is the YRD expansion policy, which is divided into three categories according to the sub-category of whole city, incumbent city, and new city.
- (3)
Control variable
rdgp denotes urban affluence, which is characterized by total urban GDP and a quadratic term of GDP to test the hypothesis of the environmental Kuznets curve which indicates that urban affluence affects the environment. Wage denotes income levels, which reflects the standard of living and spending power of urban residents and is a fundamental expression of economic development. Zhang et al. (2021) found that the growth of residents’ income drives the use of clean energy, which in turn affects the level of carbon emissions [
42]. pden denotes the population density, which is measured according to the size of the resident population within each square kilometer of land area. Zhang et al. (2021) found that the migration of population produces a population density effect, which in turn affects the level of urban carbon emissions. tech indicates the amount of science and technology expenditure of each local government, which represents the innovative input. patent denotes the number of patents granted for inventions, which represents the innovative output. Albitar et al. (2022) found that corporate environmental technology innovation drives cleaner production and reduces carbon emissions, and that government environmental management plays an important role in the carbon reduction process [
43]. FDI indicates the level of foreign direct investment of each city. Apergis et al. (2022) confirmed the pollution haven hypothesis of FDI flows on developing countries [
44]. We use foreign direct investment to represent the level of openness of cities to the outside world to address the missing data of total import and export of some cities. Road denotes the infrastructure construction, which is used to represent the paved road area per capita of urban built-up areas. fai denotes the fixed asset investment per city. rei denotes property development investment per city. energy denotes the level of urban energy consumption. Sun et al. (2020) describe the trend of simultaneous growth of energy consumption and carbon emissions, which are strongly correlated [
45]. mkt denotes the level of marketization, which reflects the degree of market activity in the city and is characterized by the Fan‘s Marketization Index, which is broadly used to measure the marketization level in China [
46].
- (4)
Mechanism variable
Connect denotes economic linkages referring to Liu et al. (2017) [
14] and Hou et al. (2009) [
47], which is measured by applying the modified gravity model and calculated by the formula
/
, where
,
denotes population and
denotes the distance between cities, measured by the shortest road distance between the centers of the two cities. The linkage intensity of resources and economy is denoted by
, which represents the intensity of linkages between city
and other cities within the expansion of urban agglomeration (21 cities in total after 2010 and 29 after 2013) at the level of
, population size, and road distance in year
.
denotes industrial upgrade, which measures the advanced industrial structure, is calculated by the share of tertiary industry output in secondary one
denotes air quality of cities, which is the measure of how many micrograms of
per cubic meter of air is found in cities. Dong et al. (2019) confirmed the synergistic variation between
and CO
2 emissions, and that the synergistic variation trends differed across regions [
48]. Exploring the variation trends of
and CO
2 in the expansion process can help us to understand the environmental benefits of expansion in depth.
Similarly, SO
2 denotes the level of local air pollutant emission. Zheng et al. (2011) found a stable long-term equilibrium relationship between SO
2 and CO
2 emissions [
49], but this equilibrium relationship can be deviated by external shocks. We can therefore observe the synergistic effects of capacity expansion shocks on SO
2 and CO
2.
The descriptive statistics for the variables in this paper are in
Table 1.
3.3. Baseline Regression Model
The STRIPAT model proposed by Dietz et al. (1994) [
50], which states that population size, level of economic development, and technological progress are the three main factors influencing the environment, has been widely used in the study of various environmental indicators, especially in the extended analysis of the factors influencing carbon emissions. The standard form of the model is as follows:
where
is the environmental variable,
refers to the population size,
refers to the level of economic development,
refers to the level of technology, and
represents other technological variables. Taking logarithms of both sides at the same time, we can obtain the influence coefficients of
,
,
, etc. The economic significance of
,
,
is the elasticity of carbon emission level (total, intensity) to the population size, economic development level, and technological progress of a city, respectively.
Based on the STRIPAT model, this paper introduces the policy variable of the expansion in the YRD urban agglomeration to enrich the model.
In recent years, the “Durbin counterfactual framework” has been commonly used as the analytical framework in the field of policy evaluation. The difference between the two is the “treatment effect”. The difficulty of this analysis is to find a control group that is highly consistent with the treatment group before the policy occurs. The difference-in-differences method is the most classic and mature policy evaluation method under the Durbin counterfactual framework. Its basic idea is to use exogenous interventions to divide the treatment and control groups, to differentiate the differences between the groups before and after the policy intervention, in order to solve the problem of heterogeneity between the experimental and control groups. If the policy occurs at different times for each sample within the treatment group, the staggered DID method is applied. Since the cities in the Yangtze River Delta in this paper experienced two expansions within the time window, each city was not subject to the disposition at the same time. In order to mitigate the potential bias due to omitted variables that vary with individuals and time on the analysis results, and to solve the problem of nuisance terms clustering, we use a two-way fixed staggered DID model and add control variables to solve the problem of heteroskedasticity, autocorrelation, and clustering of nuisance terms.
The model is set up as follows.
where the subscript
represents the city (
= 1, 2, …, 285) and
represents the
year of policy implementation.
is the explanatory variable and is characterised by total carbon emissions, and carbon efficiency (carbon emissions of GDP per unit) (expressed as
).
is the constant term.
is the core explanatory variable, which represents the staggered DID variable of the multi-period expansion policy.
=
, where
denotes the treatment indicator.
equals 1 if city
is included in the YRD agglomeration in year
, or else it equals 0.
refers to the timing of policy implementation, where yes = 1 and no = 0.
The regression coefficient reflects the carbon reduction effect of expansion and is the coefficient we focus on. The size of total carbon emissions and efficiency are not major factors in the city joining the YRDECC since there is no reverse causality issue. Thus, the causality expressed by the model holds.
denotes other control variables that affect total carbon emissions as well as carbon efficiency and varies with changes in time and cities. Considering that the expansion policy is gradually rolled out at the city level, the order of new cities entering the YRD urban agglomeration may be endogenous to the socioeconomic factors of the cities, so this paper controls as much as possible for socioeconomic variables that may affect the order of city entrance. According to the STRIPAT model proposed by Dietz et al. (1994), population size, technology level, and economic development level are the main factors affecting the environment. In addition, combined with the existing literature, investment and energy variables are also the main factors influencing carbon emissions. Based on the above argument, this paper controls for several categories of factors such as population, technology, economy, investment, and energy. First, the population variable is set as population agglomeration (). Secondly, the technological variables are set as the level of urban technology () and innovation (). Thirdly, economic variables are set as urban affluence (), income level (), and marketisation level (). Fourthly, we use energy consumption () to characterize the variable of energy. Fifth, the investment variable includes real estate development investment (), fixed asset investment (), and foreign investment (. The specific meanings of variables have been described above.
In addition, represents city fixed effects, while represents year fixed effects. represents the random error term. We report robust standard errors with city clustering.
3.4. Moderating Effect Model
Based on the theoretical analysis section, this paper adds an interaction term to verify the two types of impact mechanisms of expansion on carbon reduction. The model is set up as follows.
where
is an indicator of mechanism variables, and also a more specific and detailed description of the expansion policy variables, including economic linkages, industrial upgrade, etc. Economic linkages (
) refers to Liu et al. (2017) and Hou et al. (2009), measured by applying the modified gravity model and calculated by the formula
/
, where
,
denotes population, and
denotes the distance between cities, measured by the shortest road distance between the centers of the two cities. The linkage intensity of resources and economy is denoted by
, which represents the intensity of linkages between city
and other cities within the expansion of urban agglomeration (21 cities in total after 2010 and 29 after 2013) at the level of
, population size, and road distance in year
. Industrial upgrade (
) is calculated according to the method proposed by Gan et al. (2011) to measure the advanced industrial structure, i.e., the share of tertiary industry output in the secondary one.
captures the effect of carbon emissions reductions triggered solely by the urban agglomeration expansion policy itself when other mechanisms do not exist. is the coefficient of the interaction term, which measures the effect of expansion on carbon emissions through the mechanism variables. Meanings of the other symbols are the same as in Equation (2).