1. Introduction
Since the State Council approved the implementation of the National Land Remediation Plan (2016–2020), land remediation has been used nationwide to help implement ecological civilization, rural revitalization, and poverty eradication strategies. The plan has played an essential role in promoting cultivated land protection and land conservation, promoting the integrated development of urban and rural areas, and improving the ecological environment. Data from the Rural Land Remediation Monitoring and Supervision System shows that China has built 232 million mu of high-standard farmland during the 13th Five-Year Plan period (1 mu equals 0.165 acres). Of this land, 110 million mu was built by the former Ministry of Land and Resources and 122 million mu by other departments; about 87,000 land remediation projects have been completed nationwide, replenishing 18,279,800 mu of cultivated land [
1]. The average quality grade of cultivated land nationwide changed from 5.09 in 2017 to 4.76 in 2021, but only 31.24% of the total cultivated land area is in Grades 1 to 3. The pollution situation of China’s cultivated land is still serious. The results of a detailed survey of the soil pollution status of agricultural land show that the primary pollutants affecting the soil environmental quality of agricultural land are heavy metals, among which cadmium is the primary pollutant [
2,
3].
Heavy metal pollution of cultivated land not only endangers the ecological safety of cultivated land, resulting in crop yield reduction [
4,
5], cultivated land biodiversity decline [
6,
7], and other consequences, but also leads to “cadmium rice”, “blood lead”, and other events, directly posing a serious threat to human life [
8,
9,
10]. Under this pressure, implementing heavy metal contaminated cultivated land remediation cannot wait. To this end, China has promulgated the Soil Pollution Prevention and Control Action Plan and the Soil Pollution Prevention and Control Law and has carried out remediation and governance attempts in typical areas [
11]. In March 2014, the Ministry of Agriculture and the Ministry of Finance jointly launched the “Pilot Project on Heavy Metal Contaminated Cultivated land and Crop Planting Structure Adjustment in Hunan Province”, which explored a comprehensive restoration and governance approach based on agricultural measures for 1.7 million mu of heavy metal contaminated cultivated land in the Changsha–Zhuzhou–Xiangtan area [
12]. During the first year, the government-led pattern of cultivated land remediation was ineffective. Then, the pilot area began to explore the “government + third-party remediation enterprise” model in 2015, in which the government became the developer of remediation policies, the purchaser of remediation services, and the supervisor of remediation implementation. However, on the one hand, the contaminated cultivated land is numerous and widely distributed. On the other hand, due to a lack of supervision funds [
13], technology, and manpower, it is difficult for the government to effectively implement the whole supervision process to third-party governance companies [
14]. Therefore, the pilot area began to explore the joint governance model of “government + third-party governance enterprise + third-party supervision enterprise” in 2016 [
15], forming a situation of “ladder multiple supervision” by the government and third-party supervision enterprise.
Although studies have shown that in general environmental governance, the third-party governance model can effectively improve environmental governance, research has found that third-party remediation of heavy metal pollution on cultivated land is not ideal [
11]. Due to the imperfections of the multi-participation system, the third-party remediation of heavy metal-polluted cultivated land still suffers from the absence and weakening of the main body of remediation, especially since the central position of farmers cannot be manifested; the main body of responsibility is thus difficult to identify, the principle of responsibility allocation is invalid, and the communication of information between multiple participating bodies is poor [
16,
17,
18]. More seriously, due to the large number of participating subjects and the difficulty of coordinating the interests of those heterogeneous subjects, cultivated land remediation is caught in a “collective action dilemma”, which seriously threatens the remediation effect [
11]. Therefore, the key to ensuring the remediation effect of cultivated land is coordinating the conflicting interests of all participants.
Most of the existing studies about the conflicts of interests of third-party governance participants are based on a three-party game among the government, emission enterprises, and governance enterprises. Due to the existence of the institutional basis for conspiring [
19,
20], emission enterprises will have the incentive to conspire with governance companies to falsify EIA materials and falsify monitoring data. This conspiracy can lead to government regulatory risks [
21]. Currently, due to the different technical levels of each enterprise, it is easy to cause adverse selection and for bad money drives out goods, so the third-party market mechanism needs to be further enhanced [
22]. Moreover, the level of pollution control and monitoring efforts is private information that is difficult to access [
23]. Third parties may seek illicit profits by cutting investments in pollution control in the name of profit maximization [
24,
25]. Studies show that the influence of local government will gradually weaken if the emission enterprises and third-party governance companies cooperate well. However, without the advice and support of the local government, emission enterprises and third-party governance companies will eventually fall into the dilemma of “remediating while pollution”. The key factors that influence the evolutionary stabilization strategy (ESS) of the three parties include the rent-seeking cost, the violation cost, and the social credibility of the emission enterprises; the compensation for non-compliance, the cost of remediation, the government incentives, and the rent-seeking revenue of the governance companies; and the fines and regulatory costs of the governments. In response to the opportunistic behavior caused by information asymmetry, the government can distinguish governance companies at different levels by setting reasonable subsidy intervals and disguised costs [
26]. At the same time, the government can coordinate conflicts of interests by improving the legal and regulatory system, establishing a perfect monitoring system, designing incentive mechanisms for rewards and punishments based on the “principle of shared responsibility”, reducing participation and remediation costs, improving the reputation gains of both enterprises [
27], and adopting cooperative games.
At present, studies about the third-party governance of cultivated land remediation and how to coordinate the conflict among the government, governance companies, and supervisory companies are relatively lacking. According to the practice of cultivated land remediation in the Changsha–Zhuzhou–Xiangtan area, this paper constructs a game model based on the special governance structure of “ladder multiple supervision”, as shown in
Figure 1.
The evolutionary game mainly studies the participants’ long-term game under imperfect rationality [
28,
29]. The strategy choices of the grassroots government, the governance enterprise, and the supervising enterprise are never achieved overnight. These choices need to be stabilized through a long period of evolution, so it is very suitable to study the behavior choices of the grassroots government, the governance enterprise, and the supervising enterprise under a third-party remediation model of heavy metal-contaminated cultivated land through the evolutionary game. Compared with previous studies, this paper proposes the following three innovations: Firstly, the evolutionary game approach is used to clarify the conflicts of interests of the three parties in the third-party remediation model of heavy metal polluted-cultivated land, and the influence of sampling by the higher-level government is considered. Third-party supervisory enterprises are also included as game subjects, which has strong practical significance. Secondly, few scholars in the existing evolutionary game literature have discussed in detail the changes and jumps in the equilibrium point under different parameter conditions, and the situation where parameter changes lead to a leap of the equilibrium point often implies profound policy insights, which is remedied in this paper. Thirdly, a simulation analysis was conducted using MATLAB 2020b to verify the validity of the model analysis under different initial conditions, and based on the analysis’s findings, countermeasures and suggestions were proposed for coordinating the conflicting interests between the three parties and to improve the efficiency of cultivated land governance.
3. Model Analysis
3.1. Evolutionary Stability Analysis of the Strategy of Grassroots Government
The expected and average benefits (U
x1, U
x2) of the choice of “action” or “inaction” by the grassroots government in cultivated land governance are:
The replication dynamic equation for the choice of strategy of the grassroots government is:
The first derivative of F(x) and the set G(y) are respectively:
According to the stability theorem of the differential equation, the probability of the grassroots government choosing “action” in a stable state must satisfy F(x) = 0 and . Since , G(y) is an increasing function. Therefore, when , G(y) = 0, at which time and the grassroots government cannot determine the stabilization strategy; when y < y*, G(y) < 0, at which time , x = 1 is the evolutionary stabilization strategy (ESS) of the grassroots government; conversely, x = 0 is the ESS.
The evolutionary phase diagram of the strategy of the grassroots government is shown in
Figure 2.
Proposition 1. The probability of grassroots governments choosing “action” in cultivated land remediation is positively correlated with the frequency of spot checks and the amount of incentives received by higher-levels of government and negatively correlated with the cost saved by “inaction”.
Proof. Since
, it follows that y* gradually increases when J
g, β gradually increases, or C
gh − C
gl gradually decreases. At this time, the volume of A1 in
Figure 2 gradually increases, proving that the grassroots government tends to choose the probability of “action” gradually increases. □
Proposition 1 suggests that increasing the amount of incentives and the frequency of random inspections by higher-level governments can effectively motivate grassroots governments to choose the “action” strategy. It also suggests that the incremental benefits of grassroots governments choosing “action” should be greater than their incremental costs based on the cost–benefit principle. Furthermore, grassroots governments can be motivated to adopt an “action” strategy by strengthening publicity to enhance corporate awareness of compliance with the law, by strengthening the use of modern information technology, and by building and improving the mutual supervision mechanism of enterprises to reduce the cost of supervision.
Proposition 2. In the evolutionary process, the probability of government departments choosing “action” in cultivated land remediation decreases as the probability of “active harness pollution” and “active supervision” of governance and supervisory companies increases.
Proof. From the evolutionary stability analysis of strategy selection in the government sector, it is clear that x = 1 is the ESS when y < y*, i.e., . When y > y*, i.e., , x = 0 is the ESS; therefore, as y and z increase, the government’s stabilization strategy changes from x = 1 to x = 0. □
Proposition 2 shows that the strategy choice of governance and supervision enterprises will influence the behavior strategy of the grassroots government. The choice of “active harness pollution” and “active supervision” by governance and supervision enterprises can reduce the work intensity and difficulty of the grassroots government to a certain extent. Still, it will eventually lead to a lack of supervision by the grassroots government. In the long run, the enterprises will maximize their interests, as the principle of “Cutting corners” will affect the effect of cultivated land remediation. Therefore, the “green GDP” criterion should be included in the officials’ performance appraisal system to urge the grassroots government not to slacken the supervision of cultivated land remediation regardless of the behavioral strategies chosen by the governance and supervision enterprises [
30,
31].
3.2. Evolutionary Stability Analysis of the Strategy of Governing Companies
The expected and average benefits (U
y1, U
y2) of choosing “active harness pollution” or “negative harness pollution” in cultivated land remediation are
The replication dynamic equation for the choice of strategy of the governing companies is:
The first derivative of F(y) and the set J(z) are, respectively:
According to the stability theorem of the differential equation, the probability that the governing enterprise chooses “active harness pollution” in a stable state must satisfy F(y) = 0 and . Since , J(z) is a decreasing function. Therefore, when , J(z) = 0, at which time and the governing firm cannot determine the stabilization strategy; when z < z*, J(z) > 0, at which time , y = 0 is the governing firm’s evolutionary stabilization strategy (ESS); conversely, y = 1 is the ESS.
The evolutionary phase diagram of the strategy of the governance companies is shown in
Figure 3.
Proposition 3. The probability of governance companies choosing “active harness pollution” in cultivated land remediation is positively correlated with the amount of deduction by the grassroots government and the probability of random inspection by the higher-level government. It is negatively correlated with the cost saved by “negative harness pollution”.
Proof. Since
, z* decreases when F
m, β gradually increases or C
mh − C
ml gradually decreases, and the volume of B1 in
Figure 3 gradually increases. It proves that the probability of governance companies choosing “active harness pollution” gradually improves. □
Proposition 3 shows that the choice of strategy for governing enterprises is positively correlated with the intensity of punishment and the frequency of random inspections by higher-level governments. It is negatively correlated with the cost saved by “negative harness pollution”. The government can urge governance enterprises to choose “active harness pollution” by establishing a “routine inspection” system to increase the frequency of random inspections. By setting up a third-party integrity system, such as a “blacklist”, the market access and exit mechanisms of third-party governance can be improved, in turn improving the non-compliance and trust-breaking costs of the “negative harness pollution” strategy of enterprises. Reducing the cost saved by “negative harness pollution” strategy encourages enterprises to tend toward “active harness pollution”. Currently, the environmental third-party governance market still needs to mature, and the pressure of higher fines may increase industry barriers, which is not conducive to developing the third-party market. Therefore, an incentive system such as a “tax benefit” for third-party enterprises can be appropriately introduced to increase the governance enterprise’s gain [
32]. The incentive system can compensate for the “slack behavior” caused by insufficient punishment, which not only makes enterprises tend to implement positive behaviors but also contributes to the stable and healthy development of the third-party market.
Proposition 4. In the evolutionary process, the probability of governance companies choosing “active harness pollution” in cultivated land remediation increases as the probability of government and supervisory companies choosing “action” and “active supervision” increases.
Proof. The evolutionary stability analysis of the strategy choice of the governing firm shows that y = 0 is the ESS when z < z*, i.e., . When z > z*, i.e., , y = 1 is the ESS; therefore, the stabilization strategy of the governing firm changes from y = 0 to y = 1 with the increase in x and z. □
Proposition 4 suggests whether the governance companies choosing “active harness pollution” is influenced by the strategic choices of the government and the supervisory firm. When the government and supervisors choose “action” and to “actively supervise”, the governance companies will tend toward “active harness pollution”. Therefore not only can the effectiveness of cultivated land remediation be ensured by strengthening government supervision and information disclosure, but also by strengthening the impartiality of supervisory agencies, such as by improving the social responsibility of supervisory agencies, giving full play to the effectiveness of social forces in monitoring the efficacy of cultivated land remediation, and building a cultivated land remediation situation of shared governance by all people [
33,
34] to urge the governance enterprises to remediate actively.
3.3. Evolutionary Stability Analysis of the Strategy of Supervisory Enterprises
The expected and average benefits (U
z1, U
z2) of the supervisory companies choosing “active supervision” or “negative supervision” in cultivated land remediation are:
The replication dynamic equation for the choice of strategy of the supervisory enterprises is:
The first derivative of F(z) and the set H(x) are, respectively:
According to the stability theorem of the differential equation, the probability that the supervisory enterprise chooses “active supervision” in a stable state must satisfy F(z) = 0 and , because H(x) is a decreasing function. Therefore, when , H(x) = 0, at which time , the supervisory enterprise cannot determine the stability strategy; when x < x*, H(x) > 0, at which time , z = 0 is the supervisory enterprises’ evolutionary stabilization strategy (ESS); conversely, z = 1 is the ESS.
The evolutionary phase diagram of the strategy of the supervisory enterprises is shown in
Figure 4.
Proposition 5. The probability of supervision enterprises choosing “active supervision” in cultivated land remediation is positively correlated with the amount of deduction by the grassroots government and the probability of random inspection by the higher-level government. It is negatively correlated with the cost saved by “negative supervision”.
Proof. Since
, x* gradually decreases when F
s, β gradually increases, or C
sh − C
sl gradually decreases. The volume of C2 in
Figure 4 gradually increases, indicating that the probability of supervising enterprises tends to “actively supervise” gradually increases. □
Proposition 5 shows that the government cannot only curb the “negative supervision” of supervision enterprises by increasing the punishment and sampling probability of supervision enterprises but can also increase the cost of “negative supervision” to regulate the supervision enterprises’ behavior, for example, by establishing a credit file and credit accumulation system, strengthening information disclosure, and effectively promoting external public supervision.
Proposition 6. In the evolutionary process, the probability of supervisory enterprise choosing “active supervision” in cultivated land remediation increases as the probability of government choosing “action” rises.
Proof. The evolutionary stability analysis of the strategy choice of the supervisory enterprise shows that z = 0 is the ESS when x < x*. When x > x*, z = 1 is the ESS, so the stable strategy of the supervisory enterprise changes from z = 0 to z = 1 with the increase in x. □
Proposition 6 shows that the “active supervision” strategy of supervision enterprises is influenced by the government’s strategic choice, and supervision will tend to be active when the government chooses “action”. Therefore, to guarantee the effect of cultivated land remediation, it is necessary for the government supervisory department to strictly supervise and adopt a zero-tolerance attitude towards the “negative supervision” of enterprises to curb the implementation of negative behaviors by supervisory enterprises.
3.4. System Equilibrium Point Stability Analysis
According to the scholar Friedman [
35], the stability of the system of differential equations can be judged based on the positive and negative Jacobi matrix. According to scholars RitzbeRger and Weibull [
30], the case of a three-party game requires only the analysis of pure strategy points. The Jacobi matrix can be derived from the replicated dynamic equations of the three-party subject. From the Lyapunov method, it is known that in the stability analysis of differential systems, stability can be judged according to the positive and negative characteristics root of the equilibrium point—when all the characteristics root of the equilibrium point are negative, the point is an evolutionarily stable strategy (i.e., an asymptotically stable point). The eight pure strategy points are substituted into the Jacobi matrix in turn, and the corresponding characteristic roots of the equilibrium points are obtained. The results are shown in the following
Table 2.
The conditions of the relative sizes of the parameters assumed in the previous section are as follows: C
mh − C
ml < F
m, C
sh − C
sl < F
s. According to the condition that the characteristic roots are all negative, it is known that E2, E4, E5, and E6 are not evolutionarily stable points. Further analysis shows the conditions for the other four equilibrium points to become stable points and the positive and negative characteristic roots in the following
Table 3.
In the existing literature on three-party evolutionary games, few papers discuss the transformation and variation of equilibrium points under different parameter variations in depth. To more intuitively discover the change in the equilibrium point under the influence of various parameters and to find more managerial insights, the simulation of the equilibrium evolution under the scenario of parameter variation will be simulated next using numerical simulation techniques.
4. Computational Experimental Simulation
In this paper, we refer to the parameter settings of Lin Ze-Jian [
15] to conduct numerical simulation with the help of MATLAB 2020B software under the previous assumptions. We establish arrays 1 to 4, which satisfy the equilibrium point condition, as follows:
Array 1: Rgs = 1, Cgh = 100, Cgl = 0, Fg = 15, Bm = 830, Cmh = 670, Cml = 470, Fm = 210, Bs = 21, Csh = 17, Csl = 4, Fs = 15, Jg = 15, β = 0.5, α = 0.2, Rge = 1435;
Array 2: Rgs = 1, Cgh = 100, Cgl = 0, Fg = 15, Bm = 830, Cmh = 670, Cml = 470, Fm = 410, Bs = 21, Csh = 17, Csl = 4, Fs = 15, Jg = 15, β = 0.5, α = 0.2, Rge = 1435;
Array 3: Rgs = 1, Cgh = 100, Cgl = 0, Fg = 15, Bm = 830, Cmh = 670, Cml = 470, Fm = 210, Bs = 21, Csh = 17, Csl = 4, Fs = 15, Jg = 15, β = 0.91, α = 0.2, Rge = 1435;
Array 4: Rgs = 1, Cgh = 100, Cgl = 0, Fg = 200, Bm = 830, Cmh = 670, Cml = 470, Fm = 210, Bs = 21, Csh = 17, Csl = 4, Fs = 15, Jg = 15, β = 0.5, α = 0.2, Rge = 1435.
The simulation results are consistent with the system evolution stability analysis’s conclusions (see
Table 3 for details), which verifies the model’s validity. Considering the limited space of this paper, we only conduct further numerical simulations for E1(0, 0, 0) to verify the previous analysis of system evolutionary stability and to explore the influence of critical parameters on the evolution of the game subjects’ behavior. Meanwhile, to make later studies more rational, the parameters to be simulated in this paper are divided into two categories: the first category is reward and punishment parameters, and the second category is probability parameters. The effects of the changes in the two types of parameters on the equilibrium evolution are discussed separately in the latter part.
4.1. Effects of Changes in Reward and Punishment Class Parameters on Equilibrium Evolution
4.1.1. The Effect of Changes in the Amount of Deductions by the Grassroots Government on the Equilibrium Evolution of Governance Enterprises
From
Figure 9, it can be seen that increasing the amount of deductions to governance firms and increasing the frequency of random inspections by the higher-level government can shift the equilibrium. The combination of strategies (“inaction”, “negative harness pollution”, “Negative supervision”) changes to the combination of strategies (“action”, “active harness pollution”, “Negative supervision”). However, when the amount of withholding or the frequency of spot checks by the higher-level government is too low, the behavior of the governance enterprises cannot be effectively regulated by these two methods. Therefore, while the higher-level government establishes a “regular” inspection system, setting up incentive and punishment measures compatible with the governance enterprise is the key to guaranteeing the standard behavior of governance enterprises.
4.1.2. The Effect of Changes in the Amount of Deductions by the Grassroots Government on the Equilibrium Evolution of Supervisory Enterprises
From
Figure 10, increasing the amount of deduction for supervisory enterprises and increasing the frequency of random inspections by the higher-level government can shift the equilibrium. The combination of strategies (“inaction”, “negative harness pollution”, “negative supervision”) changes to the combination of strategies (“inaction”, “active harness pollution”, “active supervision”). However, when the amount of deductions or the frequency of spot checks by the higher-level government is too low, it cannot effectively regulate the behavior of supervision enterprises in these two ways. Compared with
Figure 9, we can find that strict supervision of supervision enterprises can be “double-pronged”. Not only can supervision enterprises actively supervise, but governance companies can also choose to actively remediate due to the pressure of “multiple supervision in a ladder”, which can improve the effect of cultivated land remediation more efficiently. Therefore, the supervisory enterprises can be based on setting a reasonable amount of deduction and a reasonable frequency of random inspection, improving the reward mechanism through a differentiated reward and punishment system, etc., to enhance their motivation and ability to implement positive behavior.
4.1.3. The Effect of Changes in Rewards and Punishments from Higher Levels of Government on the Equilibrium Evolution of Grassroots Government
Comparing
Figure 11a and
Figure 11b, it can be seen that increases in the incentives and penalties for grassroots government cause the equilibrium to shift from a combination of strategies (“inaction”, “negative harness pollution”, “negative supervision”) to a strategic combination (“action”, “active harness pollution”, “active supervision”). The reward and punishment system of the grassroots government is crucial to the third-party governance model of cultivated land. Constructing an incentive-compatible reward and punishment system for the grassroots government is an effective way to solve the third-party governance dilemma. In the real situation, the punishment for the grassroots government is shown in the form of interviews and accountability, withdrawal of project funds, transfer from the post, demotion, etc. Since this model collects the punishment in one parameter, strengthening the means mentioned above can play a certain incentive role from the perspective of the grassroots government. Comparing
Figure 11a and
Figure 11b, we can see that, compared with punishment, rewarding the grassroots government can transform the equilibrium point more efficiently. Therefore, we should optimize the assessment of the grassroots government, introduce the green ecological concept to establish an incentive-compatible assessment and evaluation system, and enhance the power of the grassroots government’s “action” on political performance.
4.2. Effect of Changes in Probability Class Parameters on Equilibrium Evolution
4.2.1. The Effect of the Frequency of Spot-Checking by Higher Levels of Government on the Evolution of Equilibrium
As can be seen from
Figure 12, the probability of spot checks by higher levels of government gradually increases, which causes the equilibrium to shift from a combination of strategies (“inaction”, “negative harness pollution”, “negative supervision”) to the strategy combination (“inaction”, “active harness pollution”, “active supervision”). Although the effectiveness of cultivated land remediation can be equally guaranteed in the short term when stabilized in the (“inaction”, “active harness pollution”, “active supervision”) strategy, the complex principal–agent relationship of “ladder multiple supervision” will increase the risk of “moral hazard” and “adverse selection”. Suppose the grassroots government’s long-term supervision is absent. In that case, governance enterprises and supervision enterprises will tend to negatively harness pollution and reduce supervision based on their interests. It is difficult to form a long-term guarantee mechanism for the quality of cultivated land governance. Therefore, the accountability system of the grassroots government on the governance of cultivated land should be improved [
36], the expected return of “action” and the weight in the performance appraisal should be increased, the sense of responsibility of government departments should be strengthened, and adequate supervision should be ensured.
4.2.2. A Spot Check of Governance Firms by a Higher-Level Government Can Detect the Effect of the Probability of Negative Harness Pollution on the Evolution of Equilibrium
From
Figure 13, we can see that increasing α alone does not move the equilibrium point. Still, with the gradual increase in α, the arrival at the equilibrium point (“inaction”, “active harness pollution”, “active supervision”) can be gradually accelerated. Comparing
Figure 12 with
Figure 13, in
Figure 12, where α = 0.2 and β = 0.5, the strategy tends toward (“inaction”, “negative harness pollution”, “negative supervision”). In contrast, in
Figure 13, α = 0.1 and β = 0.9 tends to (“inaction”, “active harness pollution”, “active supervision”), indicating that, to a certain extent, enhancing the frequency of spot checks can compensate for the limitations of the technical cognition and workforce of the upper-level government spot checkers.
5. Conclusions
5.1. Results and Suggestions
Based on the current practice of heavy metal-contaminated cultivated land remediation, this paper constructs a three-party evolutionary game model involving the grassroots government, governance companies, and supervision enterprises. This study analyzes the stability of each party’s strategy choice, the stability of the equilibrium strategy combination of the game system, and the influence relationship of each element; verifies the validity of the analysis’s conclusions through simulation analysis; and proposes relevant countermeasures and suggestions according to the influence relationship and stability conditions of each factor.
First, the grassroots government’s choice of “action” is a prerequisite for consistent action by the participants in cultivated land remediation. The higher probability of the government choosing “action” can increase the likelihood of the supervisory enterprises’ choice of “active supervision”. In turn, the higher probability of “action” and “active supervision” by the government and supervisory companies can increase the probability of “active harness pollution” by governance companies. However, as the probability of governance enterprises and supervisory enterprises choosing “active harness pollution” and “active supervision” increases, the probability of government departments to choose “action” will decrease. This will easily lead to the lack of supervision by the grassroots government and make it difficult to establish a long-term guarantee mechanism for the quality of cultivated land. Therefore, we can strengthen the sense of responsibility of grassroots government supervisory departments, improve the accountability system of cultivated land remediation, and establish a mutual supervision mechanism among various grassroots government departments to reduce information asymmetry and conflicts of interest between the higher-level and grassroots governments due to the commissioning agency. At the same time, the assessment of the grassroots government should be optimized. The green ecological concept was introduced to establish an incentive-compatible assessment and evaluation system to enhance the effect of the grassroots government’s “action” on political performance.
Second, the incentives of rewards and subsidies can be used to enhance the motivation and ability of governance and supervision enterprises to implement positive behaviors. The evaluation system of government-purchased services should be optimized, a relevant credit evaluation system should be built, and enterprises with high levels and a good reputation can issue proof medals and publicize them to enhance the positive incentive effect of reputation benefits. The incentive system for governance and supervision enterprises should also be improved. The enthusiasm of enterprises with differentiated reward and punishment mechanisms can be mobilized. Furthermore, enterprises can be encouraged and guided to increase research and development into vital common technologies for cultivated land pollution remediation and to support enterprises to train high-end talents in cultivated land pollution remediation, such as through scientific and technological innovation and technical services to reduce the cost of governance and supervision. By combining reputational, monetary, and tax incentives, we can save government regulation costs and enhance the endogenous motivation of enterprises to choose “active harness pollution” and “active supervision” strategies.
Third, a regular spot check system should be established, and the results of these spot checks should be published. the risk of information asymmetry of the entrusted agent should also be resolved. The public participation mechanism and information disclosure system should be improved. Enterprises with weak technical service capability, low operation and remediation levels, and poor comprehensive credit should be exposed publicly and punished. These actions would make third-party enterprises magnify the perceived value of loss through the deterrent effect of punishment to reduce their risk appetite.
Fourth, the costs of business violations and breaches of trust should be increased to prevent opportunistic behavior. The probability of implementing positive enterprise behaviors is negatively related to the cost savings from practicing a negative behavior. The government should strengthen the qualification reviews of third-party enterprises, establish third-party market access and exit mechanisms, label enterprises which have a lifetime system of severe violations, and implement a market entry ban system. These measures would increase the cost of implementing negative behaviors and regulating corporate behavior.
5.2. Limitations and Future Research
Several limitations are worth noting, which may provide ideas for the future research direction. First, one of the most common assumptions of the third-party remediation model of heavy metal-contaminated cultivated land constructed in this paper is that the participants are finite rational subjects. However, in real cases, there are inevitably non-finite rational participants due to the differences in culture and environment. Future research can further refine the model’s parameters to consider the behavior of non-finite rational participants and narrow the discrepancy between the theoretical model and the actual case. Second, this study did not consider the influence of agricultural producers, who have a strong subjective initiative in supervising the remediation of heavy metal-contaminated cultivated land. Therefore, our subsequent research direction will be constructing a dynamic game model with the participation of agricultural producers and making innovative suggestions for optimizing the remediation of heavy metal-contaminated cultivated land.