1. Introduction
Environmental problems affect a country’s economic development, and the economic development level also restricts its ability to improve environmental quality. Based on empirical data, the Environmental Kuznets Curve shows an inverted U-shaped relationship between environmental quality and economic development [
1]. In the early stage of economic development, environmental pollution is increasing with the increase in incomes. However, as the economy rises to a higher level, the income growth will reduce the pollution level and improve environmental quality. Andreoni et al. (2001) considered that consumers in countries with higher economic development level tend to have a higher preference for environmental goods and higher investment on the environment, which contribute to an inverted U-shaped relationship between economy and environment [
2]. The curve emphasizes the unidirectional influence of economic development level on environmental quality. However, environmental quality also restricts the development of society and economy in reverse [
3] because the economic development is established and developed on the basis of natural environment. Production activities are inseparable from natural resources. A good natural environment can reduce the cost of economic development and provide power support for sustainable economic development. Especially for many countries that have entered the middle-income stage of economic development, the control of the environmental pollution is not only related to the improvement of people’s living environment, but is also affecting further economic growth, which in turn promotes the improvement of environmental quality. As a result, environmental pollution has become an urgent problem for many countries whose economic development has reached the middle-income level.
In recent years, governments and researchers around the globe searched relentlessly for novel ways, including marketization, government initiatives, and third-party treatment to curb pollution. The first and the second models experienced frequent failures since only one of the players is responsible for the treatment, and the cost involved is prohibitive [
4]. The third model, which engages multiple players, including governments, polluters, and treatment services providers, demonstrated remarkable advantages in solving issues related to environmental pollution [
5]. The success of the model is attributable to its ability to promote marketization, specialization, regulatory compliance, and industrialization while preventing failures in government initiatives and marketization [
6]. In fact, the model has already gained increasing popularity, and its effectiveness in curbing pollution is quite satisfactory [
7]. In China, the opening-up policy has been very successful in realizing rapid economic development. However, it comes with great cost to the environment [
8,
9,
10,
11]. The issue was initially tackled by government initiatives [
12]. In September 2017, the State Council issued the Opinions on Promoting the Third-Party Treatment of Environmental Pollution, which encourages and promotes the new pollution treatment model, where polluters entrust environmental services providers (ESPs) with certain payments stipulated in contracts [
13]. This action indicates the significance attached to the third-party treatment model (TPTM) by the Chinese government and a starting point for its wide implementation across China.
In many developed countries, however, the model’s effectiveness in curbing pollution has been quite satisfactory. Its success in China depends on the meaningful cooperation of multiple players in the model that has just been put into practice in the country [
14,
15]. Key players in the model include local governments, polluters, and ESPs. The outsourcing contracts between polluters and ESPs affect not only the polluters’ outsourcing cost but also ESPs’ payoffs and the effectiveness of treating pollution. Local governments, as environmental regulators, are responsible for monitoring this effectiveness. In case the standards are not met, local governments would impose penalties on ESPs and polluters. The behaviors and payoffs of the three players interact. Evidently, the game among the three players would determine whether the TPTM can effectively solve issues related to industrial pollution.
Recently, the evolutionary game framework has been increasingly applied to topics regarding the treatment of environmental pollution. Most studies addressed environmental pollution games between governments and polluters [
16,
17,
18,
19]. Others investigated whether the games among regions or states could lead to cooperation in treating environmental pollution [
20,
21,
22,
23]. In contrast, Jiang et al. (2019) [
24] examined how the games among governments at various levels and enterprises affect the performance in environmental pollution treatment. A number of researchers introduced consumers into the games among governments and enterprises by employing the three-player evolutionary game model for governments, consumers, and enterprises to examine the effects of consumers’ behaviors on environmental pollution [
25,
26,
27,
28,
29]. Only a few researchers have addressed the TPTM in the area of environmental pollution by applying game theories. Among them, Xu et al. (2018) [
30] and Zhou et al. (2019) [
13] constructed an evolutionary game model that involved three players, including local governments, ESPs, and polluters, to study the development of the third-party treatment systems in China. They argued that local governments must adjust penalties on noncompliance dynamically, according to the development of the treatment systems. In addition, they found that fiscal subsidies granted to ESPs could improve the performance in environmental treatment. Even fewer researchers, also employing the evolutionary game model, examined the effects of a whistleblowing regime for environmental issues on the performance of environmental treatment [
28].
The above literature review shows that only a few studies have investigated the TPTM in the evolutionary game framework. Only Xu et al. (2018) [
30] and Zhou et al. (2019) [
13] addressed the issues of environmental treatment by employing a three-player evolutionary game model by incorporating governments, polluters, and ESPs. Moreover, they focused on whether the governments have fulfilled their duties of supervision and whether polluters and ESPs have complied with relevant regulations. In their studies, not much attention had been paid to how the outsourcing contracts between ESPs and polluters had been reached. Therefore, by building on their research results, this study made the following improvements by capturing the practices of third- party treatment of environmental issues. First, local governments in China may adopt the strategy of strict regulation or loose regulation in environmental supervision instead of regulating or not regulating. The strict and loose strategies are distinguished by the probabilities of being detected and punished by governments due to ESPs’ unsatisfactory performance in treating pollution. Second, the responsibilities of the polluters and ESPs are also considered since the decisions about the outsourcing contracts between the polluters and ESPs can be incorporated into the model. With these two improvements, this study constructed an evolutionary game model for the three players to represent the treatment process under the TPTM. This approach is also the biggest difference between this study and previous studies.
2. Methods
2.1. Model Hypothesis
This study proposes the following hypotheses for building the three-player evolutionary game model involving local governments, polluters, and environmental services providers.
(1) Bounded rationality. It implies that players initially do not know the strategies at the Nash equilibrium of the three-player game. Therefore, they cannot adopt the corresponding best strategy for the Nash equilibrium to be reached at once. They can only choose any of the available strategies with a certain probability. By playing the game repeatedly, they continuously adjust the probability for each strategy until it converges to a stable Nash equilibrium strategy.
(2) Replication dynamics. Players of the game with bounded rationality adjust their strategies by using the replicator dynamics equation, which represents changes of the probability or frequency of a strategy being adopted by a player with a dynamic differential equation as below [
31]:
where
denotes the probability of a player choosing strategy
,
is the expected payoff for the player with strategy
adopted, and
is the average payoff from adopting all the possible strategies.
(3) Hypotheses regarding the behaviors of local governments. Local governments, mostly responsible for regulating the performance of ESPs, are assumed to have two options: strict regulation or loose regulation. It is also assumed that the probability of local governments applying strict regulation is x . With strict regulation, the probability of finding an unsatisfactory treatment performance is , while that of loose regulation is . With strict regulation, the economic and time cost incurred by local governments is , and the cost with loose regulation is . Meanwhile, strict regulation adopted by local governments would reduce economic output in the short term and result in cost . However, satisfactory performance in treating pollution would improve the living environment. The welfare of residents would, thus, be increased by . Moreover, rewards from the central government are . When the performance is unsatisfactory, local governments impose a penalty on polluters and ESPs.
(4) Hypotheses regarding the behaviors of polluters. Polluters entrust ESPs with treating pollution by outsourcing contracts that transfer the obligations of treating pollution to ESPs. There are mainly three issues to be agreed upon in the contracts: fees , the share of responsibilities , and liquidated damages . The fees are to be paid by polluters to ESPs as compensation for treating pollution. However, ESPs are responsible for paying liquidated damages to polluters in case the treatment performance is unsatisfactory. If local governments impose penalties for identified noncompliance, the penalties should be shared. Specifically, the penalty to be paid by polluters is , and that which is to be paid by ESPs is . There are two types of contracts that can be awarded to ESPs by polluters. The first are the contracts with higher fees and a larger share of responsibilities . That is, polluters pay higher fees to ESPs for ESPs to treat pollution satisfactorily. In case local governments impose penalties for unsatisfactory treatment, the ESPs pay a larger share of the penalties. The second type of contract are those with lower fees and a smaller share of responsibilities . That is, polluters pay low fees to ESPs. In case governments impose penalties for unsatisfactory treatment, the ESPs pay a smaller share of the penalties, where . It is assumed that the probability of polluters choosing a high price contract is . When ESPs conduct a satisfactory treatment, the polluters can also obtain payoff for a successful treatment.
(5) Hypotheses related to the behavior of ESPs. ESPs are responsible for treating pollution after being entrusted by polluters and can choose from two options: treating pollution satisfactorily and unsatisfactorily. It is assumed that ESPs treat pollution satisfactorily at the probability of . It is also assumed that ESPs treat pollution satisfactorily at the cost of and unsatisfactorily at the cost of , where .
Regarding the above hypotheses related to the behaviors of local governments, polluters, and ESPs, we have described the relevant parameters in
Table 1.
2.2. Model Building
In the three-player game played by local governments, polluters, and ESPS, the probability of the local governments (a) choosing the strict regulation strategy is x; that of the polluters (b) choosing a high price contract is y, and that of the ESPs (c) choosing the strategy of treating pollution satisfactorily is z. The three players will adjust their probabilities continuously as the game is repeated. There are eight decision combinations in the ultimate outcomes for each time the game is played (as shown in
Table 2). Based on the above hypotheses of player behavior, it is possible to calculate the payoffs for the three players under different combinations of strategies, as shown in
Table 3.
4. Simulations and Discussions
As shown by the analysis of the replicator dynamics equations for the three-player evolutionary game involving local governments, polluters, and ESPs, strategies of the three players vary over time. Thus, to promote the TPTM’s effectiveness, it is possible to control or adjust relevant parameters to drive the strategic choices of local governments and ESPs toward the direction expected by the general public. That is, local governments choose strict regulation, and ESPs choose to treat pollution satisfactorily (i.e., ). Based on replicator dynamics Equations (4), (8) and (12) for three-player decisions involving local governments, polluters, and ESPs, this study simulates, mostly by Python 3.4, the three-player game’s evolutionary dynamics and the effects of changes in relevant parameters on the outcomes.
4.1. Evolution of Strategies of Local Governments, Polluters, and ESPs Under an Initial State
Setting
respectively denotes the probabilities of local governments choosing strict regulation, polluters choosing high price contracts, and ESPs choosing to treat pollution satisfactorily. Based on the values assigned to the relevant parameters in related studies [
27,
28], and considering the critical conditions determined by F′(x) = 0, F′(y) = 0, and F′(z) = 0, the following values are assigned to the relevant parameters in this study:
Moreover, assuming that players’ initial strategies are
respectively, with the Python simulation, we obtain the following equilibrium outcomes
from the three-player game evolution (as shown in
Figure 4).
4.2. Effects of the Penalty Amounts Imposed by Local Governments on the Stable state of the Three-Player Game
From the stability analysis on three-player game equilibrium strategies in
Section 3.2, it is evident that penalties imposed by local governments on ESPs and polluters affect the ultimate stable state of the three-player evolutionary game. When the probability of ESPs treating pollution satisfactorily is
, then
. Moreover, as
, the larger the
, the larger the
. When the probability of local governments choosing strict regulation is
, then
. This situation indicates that the larger the penalty amount imposed by local governments for the unsatisfactory treatment of pollution, the stronger the motivation of local governments to obtain fiscal income through penalties, and the more likely local governments would adopt strict regulation, which further pushes ESPs to treat pollution satisfactorily.
By simulating three-player game evolution with different penalty amounts (as shown in
Figure 5), the ultimate outcomes, as shown in
Figure 5a–c, for smaller amounts are as follows. Local governments adopt loose regulation, ESPs choose to treat pollution unsatisfactorily, and polluters choose a high price contract to assume a smaller share of responsibilities for pollution. The ultimate outcomes, as shown in
Figure 5d–f for a larger amount, are as follows. The local governments adopt strict regulations, ESPs choose to treat pollution satisfactorily, and polluters choose low price contracts to assume a larger share of responsibilities for pollution. Moreover, a larger penalty amount speeds up convergence in the three-player game.
Although a larger penalty amount imposed on noncompliance can motivate the pursuit of fiscal income by the local governments and promote strict regulation, such pursuit can also lead to abusive regulations. In other words, local governments may subject treatment performance to excessive scrutiny to increase fiscal income. Such practice distorts the behaviors of ESPs and polluters in treating pollution, thus leading to lower productivity and reduced overall economic output. Therefore, the increase in the penalty amount regarding noncompliance is constrained by economic loss.
4.3. Effects of the Amount of Environmental Awards from the Central Government to Local Levels on the Speed of Convergence of the Three-Player Game
According to the analysis on stable strategies for the three-player evolutionary game in
Section 3.2, the critical values
,
and
do not relate to environmental awards
from the central government to local levels. That is, the size of
does not change the ultimate stable state of the three-player game. However,
have effects on the convergence speed of the game. By simulating the three-player game evolution with different amounts of awards from the central government to local levels (as shown in
Figure 6), we find that the speed of convergence increases with a higher award amount. With a larger award or penalty amount from the central government at the local levels, local governments have stronger motivation to adopt strict regulation on the treatment performance of ESPs to obtain awards or avoid penalties from the central government. When local governments adopt strict regulations, the unsatisfactory treatment of ESPs is more likely to be spotted. With higher expected penalty amounts for unsatisfactory treatment, ESPs will tend to choose the strategy of treating pollution satisfactorily while the best strategy for polluters is choosing the low-price contract.
4.4. Effects of the Regulation Cost of Local Governments on the Stable State of the Three-Player Game
From the stability analysis on three-player game equilibrium strategies in
Section 3.2, it is evident that the size of the critical value
is related to the regulation cost of local governments. Moreover,
. Therefore, the higher the strict regulation cost
, the smaller the critical value
. It means governments are more likely to adopt strict regulation, while changes to
will lead to further changes to the critical values
and
. Simulating the three-player game evolution with different strict regulation costs
(as shown in
Figure 7), we find that the three-player evolutionary game converges to the equilibrium state
, which is expected by the general public with a lower regulation cost. The lower the strict regulation cost, the higher the convergence speed. With a higher strict regulation cost, the three-player evolutionary game converges to the equilibrium state
, which is not expected by the general public. The higher the strict regulation cost, the higher the convergence speed. It demonstrates that lower strict regulation cost motivates local governments to adopt strict regulation, which, in turn, promotes ESPs to treat pollution satisfactorily to improve environmental quality. The lower the strict regulation cost, the shorter the time needed to reach the equilibrium outcomes, thereby benefiting the rapid improvement of environmental quality.
5. Conclusions
At present, China has entered as a middle-income developing country. Environmental pollution has become an obstacle to further economic development; the third-party governance model of environmental pollution has become an important choice for the Chinese government. This study primarily investigates whether TPTM can effectively solve the issues regarding industrial pollution in China by constructing a three-player evolutionary game model involving local governments, polluters, and ESPs. The stability analysis on equilibrium states of the three-player evolutionary game shows that the game can converge to different equilibrium states under various critical conditions. It is possible to control or adjust relevant parameters to drive the behaviors of local governments and ESPs toward the direction that the general public expects. That is, local governments choose strict regulation, and ESPs choose to treat pollution satisfactorily. The above analysis is further verified with a numerical simulation test to illustrate the effects of different parameters on the convergence of the three-player evolutionary game. The following conclusions are reached in the verification process:
(1) A larger penalty amount can motivate local governments to enhance regulation and ESPs to choose to treat pollution satisfactorily. Moreover, a larger penalty amount also facilitates the convergence of the two players’ behavior toward the above state. A larger penalty amount motivates local governments to adopt strict regulations to increase fiscal income. It also increases the expected noncompliance cost of ESPs. This result encourages ESPs to treat pollution satisfactorily, which is an outcome expected by the general public.
However, with a larger penalty amount, local governments may abuse regulation in the pursuit of fiscal incomes. Such practice distorts the behaviors of ESPs and polluters in treating pollution, thereby leading to lower productivity and reduced overall economic output. Therefore, local governments should develop clear and detailed criteria for imposing penalties to control the excessive regulation of local governments.
(2) The environmental awards from the central government do not affect the ultimate stable state of the three-player game; however, they affect the convergence speed. The awards are not a determining factor to the effectiveness of the TPTM but can speed up the evolution of the three-player game toward outcomes expected by the general public or shorten the time needed for the TPTM to reach the valid state.
(3) Regulation cost of local governments has substantial effects on whether the TPTM can effectively solve issues regarding industrial pollution. The lower regulation cost incurred by local governments means that, given a certain amount of the total cost, more resources can be directed to regulating the treatment process of ESPs and its outcomes. Meanwhile, the lower regulation cost can facilitate the emergence of the TPTM’s effectiveness.
Based on the above conclusions, this study proposes the following recommendations for improving the effectiveness of the TPTM in China:
(1) Penalties on noncompliance should be increased further by the local governments, and rewards from the central government to local levels for satisfactory performance in treating pollution should also be increased. The promotional mechanisms for local government officials should evaluate their performance comprehensively by incorporating indicators in several dimensions (including the economic and environmental indicators) to motivate local governments to regulate pollution treatment strictly. Increasing environmental awards alone cannot effectively improve performance. The measure must be implemented together with other policies. While increasing environmental awards, imposing a larger penalty amount by the local governments on polluters can effectively improve the performance of the TPTM.
(2) A more robust regulatory system can reduce regulation costs and increase the probability of spotting pollution problems by local governments. Building on the existing regulatory system by introducing public whistleblowing mechanisms can engage private sectors. Specifically, the measures include leveraging big data systems and mobile network platforms for motivating the general public with appropriate economic awards to report noncompliance. They can lower local regulation costs of governments, increase the efficiency in spotting pollution problems by the local governments, and push ESPs to solve issues related to industrial pollution effectively.