1. Introduction
In recent years, the importance of environmental protection has been recognized. Used products may cause various environmental issues. However, through remanufacturing, the residual values of used products can bring economic benefits. For example, used products such as auto and home appliances contain a large amount of recyclable metal, plastics, and other resources. By recycling those products, the resources can be effectively reused to promote the sustainability of a society. Also, new products are more environment-friendly than old products. For example, compared with ordinary vehicles, electric vehicles use clean energy and are more energy-efficient.
A new concept, the closed-loop supply chain, has been proposed to enhance collecting and remanufacturing. Different from the typical supply chain that only focuses on the forward direction, a closed-loop supply chain consists of both forward and backward activities which involve the movement of used products from customers to upstream suppliers and improves economic and environmental performance [
1,
2]. Therefore, it is important to promote closed-loop supply and remanufacturing processes for both firms and governments.
The government plays a vital role in the promotion and operation of closed-loop supply chains [
3]. In a closed-loop supply chain under government regulation, the firms’ decisions are unavoidably influenced by the government’s policies. For example, when the government allocates subsidies to the remanufactured products produced by the manufacturers. Subsidy level will directly affect the manufacturing costs, and moreover the wholesale price, retail price, and the return rate indirectly. Similarly, the firm’s price policy and remanufacture policy are also influenced by the tax policy of the government. Also, the government’s policy is normally a long-term decision. For those reasons, it is very important for the government to make the “right decision”.
In practical, governments have many measures to promote closed-loop supply chains. To spur domestic consumption, curb pollution and save resources, China has implemented the “old-for-new” policy for auto and home appliance replacements. In 2009, the “old-for-new” started from Beijing, Shanghai and other 7 cities as a pilot program. Customers receive 10% of the sale price as a subsidy on five kinds of new appliances including TV sets, refrigerators, washing machines, air-conditioners and computers [
3,
4]. By the end of 2011, the program received around 84 million units of used home appliances and recycled about 97 tons of steel, non-ferrous metal, and other resources. In January 2019, China announced an implementation guidance which indicates that green products and home appliance replacements will still be supported with an appropriate subsidy.
In Japan, a series of laws have been enacted to promote the recycling of end-of-life vehicles (ELVs), construction material, home appliances, etc. The home appliance recycling law was enacted in June 1998 and was enforced in April 2001; this law detailed stipulates the obligations of each player in the recycling process [
5]. Consumers need to pay for collection and recycling. The retailers collect the used appliances and transfer them to a designated collection site. Then, the manufacturers collect the used appliances from the collection site and take charge of recycling. In this process, the obligations of the government are to provide necessary information on recycling and to impose a penalty on the business entities that make improper claims.
In this paper, we consider the government as the Stackelberg leader. A Stackelberg competition model-based optimization model is designed to find the best tax and subsidy policies for the government. By using evolutionary game theory, we develop a repeated and dynamic population game which consists of multiple manufacturers and retailers, so that the government can observe the whole market to make long-term decisions. Also, in each local market, the Stackelberg competition model is considered between a manufacturer and a retailer to find the optimal price policy and return policy, while the manufacturer is the supply chain leader and the retailer is the follower. The supply chain model considered in this paper consists of a government and a supply chain consisting of multiple manufacturers which remanufacture used products and multiple retailers in multiple local markets. The proposed model includes one government, multiple manufacturers and retailers. Based on the proposed model, the following questions are mainly discussed:
What are the best tax and subsidy policies when the government faces different scenarios and how do the government regulations influence the wholesale price, retail price and the return rate?
Which closed-loop supply chain structure is preferred for the government and will the structure be selected by the market?
What kind of products are worth allocating a high subsidy to?
By analyzing existing studies, we conclude that prior research has mainly focused on the influence of government subsidy policy on the behavior of a few firms. To find the best subsidy policy for the government considering environmental impacts and public revenue, and influences of the policies in the whole market that includes the population dynamics of manufacturers and retailers, are still open questions.
To address those gaps, in this paper, a government optimization model is proposed to decide the optimal tax and subsidy policies in a closed-loop supply chain. The government model is based on the results of an evolutionary game analysis that allows us to analyze how the policy influences the whole market. Also, the closed-form Stackelberg equilibriums of the basic models are solved, so that we can develop the evolutionary game. Finally, numerical experiments are conducted to illustrate the selected policies when the government faces different scenarios.
The rest of this paper is organized as follows.
Section 2 reviews relevant literature.
Section 3 presents the proposed game models which include one-shot Stackelberg competition models between the manufacturer and the retailer, evolutionary game model and a Stackelberg competition-based government optimization model. Numerical experiments in different scenarios are conducted in
Section 4. Finally, the conclusions are summarized in
Section 5.
2. Literature Review
2.1. Closed-Loop Supply Chain Design Problem
Channel design is one of the important research streams in closed-loop supply chains [
6]. Stackelberg competition models are widely used to describe the relationship between manufacturers and retailers [
7,
8,
9,
10,
11,
12,
13,
14,
15,
16]. Savaskan et al. (2004) [
7] studied the problem of choosing the appropriate closed-loop supply chain structure for collecting the used products. To address this problem, a two-echelon supply chain that includes a manufacturer and a retailer was considered. In the supply chain, the used products can be collected by the manufacturer, or the retailer, or the third-party. Using a Stackelberg model, the optimal wholesale price, retail price and product return rate of the manufacturer and the retailer were obtained. They found that the retailer collection was the most effective supply chain structure for the manufacturer. In the same way, Savaskan and Van Wassenhove (2006) [
8] extended Savaskan et al.’s (2004) [
7] work to a competitive retailing environment. In their paper, a supply chain that includes two competing retailers was considered. The manufacturer can either directly collect the used products form the customers, or indirectly collect via retailers. Continually, De Giovanni and Zaccour (2014) [
9] extended Savaskan el al.’s (2004) [
7] work from single-period to a two-period closed-loop supply chain model, in which a part of the new products that sold in the first period are recycled from the customers in the second period. Based on the two-period model, Genc and De Giovanni (2017) [
17] and Xu and Wang (2018) [
10] developed closed-loop supply chain models considering technology level and emission reduction, respectively. Sarkar et al. (2017) [
18] examined the impacts of carbon emission from production and transportation in a closed-loop supply chain which adopts returnable transport items for transportation. Sarkar et al. (2019) [
19] developed a multi-objective model to minimize carbon emissions and maximize total profit in a closed-loop supply chain. In their model, a self-healing polymer-based returnable transport packing is considered in reverse logistics.
The similar closed-loop supply chain structurers in Savaskan et al.’s (2004) [
7] paper are adopted in this paper. Our model is an extension of Savaskan et al.’s (2004) [
7] work by considering the dynamic process, government regulation and multiple manufacturers and retailers.
Different from the former studies, we investigate the closed-loop supply chain design problem from both the government’s perspective and the market’s perspective. The structure that the government preferred and the structure that the market selected are respectively illustrated in the current study. Moreover, by adopting an evolutionary game model, a multi-period closed-loop supply chain model is considered.
2.2. Closed-Loop Supply Chain with Government Tax and Subsidy
The government plays an important role in the closed-loop supply chain. Ma et al. (2013) [
3] studied the influence of the government consumption-subsidy program on a closed-loop supply chain, in which the manufacturer produces new products and one retailer and one e-retailer sell the products to the consumers. They found that all the consumers could benefit from the government consumption-subsidy program. Xiong et al. (2013) [
20] considered a closed-loop supply chain that consists of a supplier and a manufacturer. They illustrated that remanufacturing was beneficial to the environment when the government allocates subsidy to the integrated manufacturer. Wang et al. (2014) [
21] addressed the channel design problem for the remanufacturer who can either sell the remanufactured products to the customers via the manufacturer or directly sell to the customers. They found that the subsidy can promote remanufacturing activities. Wang et al. (2017) [
22] examined a reward-penalty mechanism in a closed-loop supply chain that includes two sequential competing manufacturers and one retailer. They assumed that the government can implement the reward-penalty mechanism to the closed-loop supply chain. By calculating the equilibrium solutions, the supply chains with or without government regulation were both investigated. Saha et al. (2016) [
23] analyzed a reward-driven policy in a closed-loop supply chain. The pricing and remanufacturing strategies are obtained in both the non-cooperative and centralized model.
Recently, Nielsen et al. (2019) [
24] investigated two government incentive policies in a green supply chain. One policy is that the government provides incentives on the R&D investment, and the other policy is that the government offers incentives on the unit product. The social welfare function is concave with respect to government incentive rates in this model. Wan and Hong (2019) [
25] examined the effects of transfer pricing policies and subsidy policies in a closed-loop supply chain. Two recyclers, one retailer and one third-party, are considered in their model. He et al. (2019) [
26] obtained the best channel structure and pricing strategies for the manufacturer and the optimal subsidy policies for the government. Their supply chain model consists of one manufacturer, one retailer and one third-party.
Most of the literature focused on the influence of government regulation on several manufacturers and retailers. In this paper, we analyze the government’s influences on the whole market that includes multiple manufacturers and retailers by developing the dynamic population game via evolutionary game theory.
2.3. Evolutionary Game Theory in Supply Chains
Li et al. (2014) [
27] developed an evolutionary game model with a two-echelon closed-loop supply chain to study evolutionarily stable strategies (ESS) of manufacturers and retailers. They found that pricing is an important factor to determine the evolutionarily stable strategies and government subsidy can support the growth of the remanufacturing industry. Ji et al. (2015) [
28] focused on the recycling process in green supply chain management. In their paper, an evolutionary game model was established to observe the cooperation tendency of the suppliers and manufacturers. The results showed that recycling capability was a key factor in a green supply chain. Esmaeili et al. (2016) [
29] developed an evolutionary game model-based on Stackelberg models in the closed-loop supply chain. The short-term decisions of the manufacturer and retailer were studied by Stackelberg models, and the long-term decisions were investigated by an evolutionary game model. Ding et al. (2018) [
30] studied the coopetition relationship between a manufacturer and a collector in the reverse supply chain. The evolutionary game model was adopted to investigate the evolutionary mechanism and the internal reward-penalty mechanism for their collection strategies.
Although some papers addressed the influences of government subsidy in a closed-loop or reverse supply chain, few of them consider the government as a player. To fill the gap, a government optimization model is developed to investigate the optimal subsidy and tax policy in the closed-loop supply chain which is based on the results of the evolutionary game model and the Stackelberg model in our study.
The contributions of the previous studies and our paper are shown in
Table 1.
3. Model Formulation
3.1. Model Description and Assumption
3.1.1. Notation
Indices:
index of supply chain members, which denotes the manufacture and the retailer.
index of supply chain structures, which denotes the model C, M, R, N.
index of strategies, which denotes the collecting and non-collecting strategy.
Parameters:
unit cost of manufacturing a new product of the manufacturer.
unit cost of remanufacturing a returned product of the manufacturer, .
residual value of a used product, .
unit price of a returned product that a manufacturer pays to a retailer, .
profit allocation rate of manufacturer in the centralized supply chain, . Therefore, for retailer.
importance of environmental factor, .
environmental impact of production unit new product.
environmental impact of remanufacturing unit returned product, .
number of manufacturers in the whole market.
lower bound of expected profit for supply chain member .
Decision variables for Stackelberg competition model:
retail price of unit product in supply chain structure , .
demand of products in supply chain structure ,
fraction of remanufactured products in supply chain structure , .
unit wholesale price in supply chain structure .
profit function for supply chain member in supply chain structure .
total investment in closed loop supply chain structure ,
demand of new products in supply chain structure .
demand of remanufactured products in supply chain structure .
Variables in evolutionary game model:
expected profit of supply chain member who choose strategy .
average expected payoff of manufacturer.
average expected payoff of retailer.
expected demand of new products when the manufacturer selects strategy.
expected demand of remanufactured products when the manufacturer selects strategy.
average demand of new products.
average demand of remanufactured products.
total new products quantity in the whole market.
total remanufactured products quantity in the whole market.
proportion of manufacturers who collect the used products, .
proportion of the manufacturers who don’t recycle the used products.
proportion of the retailers who collect the used products, .
proportion of the retailers who do not recycle the used products.
Decision variables in government regulation model:
subsidy that allocated to unit remanufactured product, .
tax that collected from unit product, .
public revenue.
environmental impact.
3.1.2. Model Description
Figure 1 shows an economic system consisting of
supply chains considered in this study. The government can regulate the market at a limited level by taxation and subsidization. Tax and subsidy policies are considered as long-term and one-shot decisions. The policies are decided based on the analysis of the whole market. In this model, a trade-off between economic development and environmental impacts is considered. The government considers both economic development and environmental impacts trying to find the balance between those two. The government collects tax from the manufacturer for all the products and allocates a subsidy for the remanufactured products. Depending on the situation, the government decides the best tax and subsidy policies.
The whole market consisting of a sufficient number of supply chains is considered. Each supply chain consists of one manufacturer, one retailer and many customers. In this paper, manufacturers and retailers are considered in the whole market. It is assumed that each supply chain belongs to an independent market that is located in different locations.
Each manufacturer and retailer are randomly matched in a certain local market. This assumption is commonly used in evolutionary game models [
31,
32,
33]. The basic assumption of replicator dynamics is that the probability of a player meeting another player that adopts a certain strategy is equal to the fraction of the population that uses a certain strategy [
34]. Therefore, it is usual to assume random pairwise matchings in a large population in evolutionary game theory.
In an evolutionary dynamic process, the frequency distribution of populations that adopt different strategies change over time, in which populations that are associated with better-than-average strategies grow, but those strategies associated with worse-than-average strategies decline [
34,
35]. At the beginning of the game, the manufacturers and the retailers select strategies based on their preferences. Continually, in each round, they observe the behaviors of other manufacturers and retailers and compare their expected profit with the average profit in the whole market. Through the evolutionary dynamics, strategies that with higher profit will be selected by more and more manufacturers and retailers.
In the local market, the manufacturer produces new products and remanufactures used products. The retailer sells those products to the customers and the customers return a certain number of used products to the retailer or manufacturer. Eventually, the returned products are transferred to the manufacturer for remanufacturing. In each supply chain, the manufacturer announces the wholesale price, and then the retailer decides the retail price. Because we assume that each local market is separate from each other, pricing strategies are independent of the change of population. Therefore, pricing strategy and return policy are considered as one-shot decisions.
Both the manufacturer and the retailer have two strategies; is to collect the used product, and is not to collect the used products. The strategies of the manufacturer and the retailer are defined as:
: strategy that the manufacturer collects the used products.
: strategy that the manufacturer does not collect the used products.
: strategy that the retailer collects the used products.
: strategy that the retailer does not collect the used products.
The population of manufacturers that adopt and and retailers that adopt and , changes over time.
In each closed-loop supply chain, four kinds of channel structures are considered which includes model C, M, R and N as shown in
Figure 2. Model C is the centralized supply chain structure, in which the manufacturer and the retailer cooperate with each other to find the optimal pricing and return policies. In model M, the manufacturer directly collects the used products from the customer. In model R, the retailer collects the used products from the customers and recycles those items to the manufacturer. In model N, the recycling and the remanufacturing process are not conducted.
, , , , , , , , , are the given parameters. With the returned products, the remanufacturing cost is less than the manufacturing cost (). For simplicity, the residual value of a used product is calculated by . In model R, the retailer takes charge of collecting used products and transfers those products to the manufacturer. the payment of collection and transportation of the manufacturer is . The manufacturer is profitable when . In model C, one decision-maker optimizes the total profit for both manufacturer and retailer. The profit allocation rate of the manufacturer is . Therefore, the profit of the manufacturer in model C is , and the profit of the retailer is . The manufacturing and remanufacturing processes inevitably have environmental impacts, for example, pollution and raw material consumption. The environmental impacts of the remanufacturing process are equal or less than the manufacturing process (). is adopted to describe the individual rationality of the manufacturer and the retailer. The players are willing to enter the market only when the average expected profit is larger than their expectations.
Players, models and corresponding decision variables are introduced in
Table 2.
Figure 3 illustrates the relationship between the proposed models. The profit, wholesale price, retail price and return policies of the manufacturer and the retailer in different channel structures are solved by the Stackelberg game model. Based on the profit analysis in the Stackelberg game model, the evolutionary game model is proposed to investigate the total quantity of new and remanufactured products and population behaviors in the whole market. Finally, a government optimization model is proposed to decide the optimal tax and subsidy policies in a closed-loop supply chain.
3.1.3. Assumptions
Assumption 1. Demand is a linear function of retail price. The demand function is given by:whereandare positive parameters.represents the market potential andis the consumer’s sensitivity to price. Assumption 2. The manufacturer and the retailer can increase the return rate by investing capital in the market, such as advertising. is a parameter that is used to describe how investment affects the return rate. The return rate is given by: In practical, firms such as Apple, Hewlett-Packard and Xerox Corporation provide some promotional activities to raise the return rate [
36]. Therefore, we assume the return rate depends on the collector’s investment. Similar assumption is wildly used in the study of closed-loop supply chains [
7,
8,
9].
By considering the return rate, the average cost of unit product is given by , where . So, the average cost can be also written as . The demand of new products can then be written as , and demand of remanufactured products can be written as .
Assumption 3. In order to make this model more practical, we assume thatshould be large enough so that the optimal return rate is smaller than 1, namely,. That is, investing loads of money to make the return rate reach 100% is not profitable for the collectors. Similar assumptions can be also found in [7,37,38]. Assumption 4. In a decentralized closed-loop supply chain model, the manufacturer is the supply chain pricing leader and the retailer is the pricing follower. Commonly, the manufacturer is considered as the supply chain leader and the retailer is the follower, which means that the manufacturer decides the wholesale price first, and then the retailer decides the retail price [7,9,17]. It is also a practical assumption in the real world. Assumption 5. We assume perfect substitution between the new and remanufactured products which means that customers cannot tell the difference of quality between those two kinds of products and the new and remanufactured products have the same retail price. This is commonly assumed in the literature related to a closed-loop supply chain [6,7,23,25]. Assumption 6. The government can influence other decision-maker’s decision at a certain level by financial regulation. We assume that the government can collect tax from new and remanufactured products and allocate subsidy to remanufactured products.
3.2. One-Shot Stackelberg Competition Models in Each Local Market
In a local market, we consider the Stackelberg competition between a manufacturer and a retailer. As shown in
Figure 2, Three decentralized closed-loop supply chain structures and one centralized structure (model C) are analyzed. Decentralized structures include manufacturer collection (model M), retailer collection (model R) and non-recycling (model N). Savaskan et al. (2004) [
7] proposed the model C, M and R and analyzed the profit functions with one-shot games. In order to compare all the situations, model N is also considered in this paper.
3.2.1. Model C (Centralized)
Model C is a centralized supply chain decision system in which a manufacturer and a retailer cooperate with each other to determine the integrated optimal price policy and return policy together. The decision-maker maximizes the profit for both the manufacturer and retailer. Apart from Savaskan et al.’s (2004) [
7] work, government regulation is introduced in our supply chain model. The government collects tax from all the products and allocates subsidy to the remanufactured products. We also assume that the government considers this model as a benchmark model when it makes the tax and subsidy policy. The optimization problem of the decision-maker in the centralized supply chain is expressed as follows:
The total profit consists of profit of new products and profit of remanufactured products, where profit of remanufactured products include investment and subsidy. In a centralized supply chain, there is only one decision-maker. Therefore, the wholesale price is not considered in this model.
The profit function can be composed as follows:
According to assumption 3, should be smaller than 0. For simplicity, we write this condition as Because model C is a benchmark in this paper, this condition holds for all the models.
The Hessian matrix of
is as follows:
For the concavity of the , the Hessian matrix should be negative definite. Clearly, and . Hence, we need . According to the assumption above, this last condition is satisfied. Therefore, is strictly concave in and .
Calculating the first-order condition, the optimal price and return rate policies are given by:
The optimal profit is as follows:
3.2.2. Model M (Manufacturer Collection)
In the following three decentralized models, we assume that the manufacturer is the supply chain leader and the retailer is the follower. The manufacturer sells the products to the retailer at wholesale price , and the retailer sales the products to the customers at price .
In model M, the manufacturer collects the used products and invests money to promote the return rate. Hence, the manufacturer firstly decides the wholesale price and return rate, and then the retailer decides the sales price to optimize their own profit separately.
The Stackelberg competition problem is given by:
s.t.
The Stackelberg competition problem can be composed as follows:
s.t.
In this model, the manufacturer is the supply chain leader and takes charge of collecting the used products. The investment is considered in the manufacturer’s problem.
The Stackelberg equilibrium is obtained by the backward reduction. First, by calculating the best response function of the retailer, we obtain:
Substituting the price policy
in
, the wholesale price and return rate is given by:
The Stackelberg equilibrium outcomes for the manufacturer and the retailers are as follows:
is strictly concave in , because .
To proof the concavity of
, the Hessian matrix is derived:
is negative definite and hence is strictly concave in and , because , and . The third inequality is satisfied by the assumption of .
3.2.3. Model R (Retailer Collection)
In this model, the retailer collects the used products and sales of those items to the manufacturer at price . Similar to the previous model, the manufacturer firstly decides the wholesale price , then the retailer decides the retail price and investment, since the retailer is in charge of the collection.
The optimization problems are formulated as follows:
s.t.
The Stackelberg competition problem can be written as follows:
s.t.
By the idea of backward reduction, we derive the sale price and return rate from the retailer’s best-response function firstly as follows:
Substituting
and
in
, the optimal wholesale price of the manufacturer is given by:
The Stackelberg equilibrium for the manufacturer and the retailers are as follows:
To proof the concavity of
, the Hessian matrix is derived:
is negative definite and hence is strictly concave in and , because , and . The third inequality is satisfied by the assumption of .
is also concave since .
3.2.4. Model N (Non-Recycling)
In order to conduct a comparative study on the closed-loop supply chain, in model N, we consider a special case neither the manufacturer nor the retailer collects the used products. So, return rate is irrelevant. Still, the manufacturer is the supply chain leader and firstly announces the wholesale price, then the retailer decides the sale price.
The Stackelberg competition problem is given by:
s.t.
Calculating the first-order condition, the best-response of the retailer is as follows:
Substituting
in
, and the optimal wholesale price of the manufacturer is derived by
The Stackelberg equilibrium outcomes for the manufacturer and the retailers are as follows:
is concave in , because . is concave in , because .
3.3. Evolutionary Game Model
3.3.1. Model Description
In this section, the dynamic population game is used to describe the whole market and how it changes over time and is developed by adopting an evolutionary game theory. A big market that consists of many local markets is considered and in each local market, the manufacturer and the retailer play the one-shot Stackelberg models.
In this game, each manufacturer and retailer can choose to collect the used products or not collect the used products.
and
are respectively used to describe those two strategies of each manufacturer, and
,
for retailers. Then, four strategy combinations are obtained, which correspond to the results of Stackelberg competition between one manufacturer and one retailer in the previous section. The payoff matrix of the evolutionary game is shown in
Table 3. The expression of each payoff can be found in
Section 3.2.
To describe the change of population over time, and are adopted. Let x represent the proportion of manufacturers who select strategy . Then, is the fraction of the manufacturers who choose strategy . Similarly, we use to represent the proportion of the retailers who select strategy , and use to represent who choose . Accordingly, . During the evolutionary game process, manufacturers and retailers adjust their strategies, and will change over time until the system achieves a steady state.
The expected profit of manufacturers that choose
and
are set as
and
, and it can be expressed as
The average expected payoff of manufacturer is:
The expected profit of retailers that choose
and
are set as
and
, and it can be expressed as:
Then, the average expected payoff of retailer is:
The demand matrix of new products and remanufactured products in each closed-loop supply chain model is shown in
Table 4. Note that the demand of remanufactured products in model N equals 0.
and
are the expected demand of new products when the manufacturer selects
and
strategies, respectively. The expressions of
and
are shown as follows:
Therefore, the average demand of new products of manufacturers
can be calculated by:
and
are the expected demand of the remanufactured products when the manufacturer selects
and
strategies, respectively. The expressions of
and
are shown as follows.
Therefore, the average demand for remanufactured products of manufacturers
is shown as follows.
We consider that there are
manufacturers in the whole market.
is the total quantity of new products and
is the total quantity of remanufactured products.
and
can be expressed as:
According to the analysis above, the replicator dynamic equations of the closed-loop supply chain model are as follows:
Equations in (49) and (50), respectively illustrate the rate of proportion change over time between manufacturers and retailers.
3.3.2. Stability Analysis
In this section, the steady-states and the stability of strategies are analyzed. In an evolutionary game model, a steady-state does not change in time, and the stability means that if a solution starts from near the stationary solution, then it will stay near the solution as time progresses. Moreover, an asymptotic stability means that if a solution starts from a near enough to the stationary solution, then it will eventually converge to the solution [
39]. A strategy is an evolutionarily stable strategy (ESS), if and only if the strategy is an asymptotically stable stationary solution [
39]. Therefore, the asymptotic stability is analyzed in this paper.
Proposition 1. (1, 1), (1, 0), (0, 1) and (0, 0) are steady-states.
Proof of Proposition 1. The dynamic system is stable when the differential equations satisfy:
Obviously, (1, 1), (1, 0), (0, 1) and (0, 0) are steady-states. □
Proposition 2. The steady-state (1, 1) is asymptotic stable.
Proof of Proposition 2. The Jacobian matrix is adopted to analyze the asymptotic stability of those steady-states. Based on the replicator dynamic equations, the Jacobian matrix of this closed-loop supply chain model is obtained as follows:
A steady-state is asymptotic stable if the trace of the Jacobian matrix is less than 0 (
) and the determinant of the Jacobian matrix is larger than 0 (
), where
According to the model description, and , then we can get .
According to the assumption,
, we obtain,
Exist
that makes
,
satisfy
Therefore, (1, 1) is an ESS. □
Proposition 3. The steady-state (0, 0) is unstable.
Proof of Proposition 3. We have , hence, .
We have , hence, .
Therefore, (0, 0) is unstable point. □
Proposition 4. The steady-state (1, 0) is unstable.
Proof of Proposition 4. Because and , clearly, .
Therefore, (1, 0) is unstable. □
Proposition 5. The steady-state (0, 1) is unstable.
Proof of Proposition 5. Because and , clearly, .
Therefore, (0, 1) is unstable. □
According to the propositions, the results are summarized in
Table 5. (1, 1) is the evolutionarily stable strategy (ESS).
3.4. Government Model
A government model based on the results of the Stackelberg competition model in the local market and the evolutionary analysis in the whole market is developed. By collecting taxes from all products and allocating subsidies to remanufacturing products, the government can regulate the market to a certain level. To obtain the optimal policies, the trade-off between public revenue and environment impacts is analyzed in the optimization model.
The total public revenue
is calculated by tax and subsidies:
The environmental impact
is calculated as:
The government’s revenue consists of public revenue and environmental impact.
represents how much the government cares about the environment. The government’s optimization problem is formulated as follows:
The objective function (57) consists of the weighted profit of the total public revenue and the reduction of the environmental impact. Constraints (58) represent the fact that the sale price should be larger than the cost of producing a new product. In
Section 3.2, the pricing strategies in four models have been derived and these strategies depend on the government tax and subsidy policies. These constraints prevent the situation of the sale price being larger than the cost under government regulations. Constraints (59) represent the recycle rate should between 0 and 1. These constraints ensure that the quantity of returned products is smaller than the quantity of new products and the return rate should be positive.
Figure 4 shows the standard value-added tax (VAT)/sales tax rates around the world in 2014. In most areas of the world, the standard tax rates are lower than 20%, and the global average rate is 15.8%. Hence, in constraints (60), the tax of unit product is limited to 20% of the basic sale price. In 2009, China’s subsidy for home appliance replacements is 10% of the price. So, in constraints (61), subsidy of unit remanufactured product is limited to 10% of the sale price. Constraints (60) and (61) ensure that the government can regulate the market at a reasonable level. Constraints (62) and (63) represent the fact that the expected profits of the manufacturers and the retailers should be larger than the lower bounds of profit under government regulation. These constraints indicate that when the government makes decisions, it should consider the expected profit of the manufacturers and the retailers. Constraints (64) are the variable boundary conditions which indicate that the fraction of the manufacturer population and the retailer population should be larger than 0 and smaller than 1. These constraints are derived from the evolutionary game model.
4. Numerical Experiments
To illustrate the proposed model, numerical experiments are conducted in this section. Based on data from the literature related to closed-loop supply chain [
8] and practical assumptions. The assumed values of the baseline parameter are shown in
Table 6. The values of
,
,
,
are gathered from [
8]. The reason that we assume
is according to assumption 3, the scale parameter
should be large enough so that the optimal return rate is lower than 1. Also, to have enough of a population in the evolutionary process, 100 manufacturers and retailers are assumed. According to the proof in stability analysis, there exist
that makes the centralized supply chain structurer stable. Based on the preliminary experiments, we assume
. The values for
and
are based on [
26]. In their paper, the environmental impact of remanufactured products
is calculated by
, where
is a parameter that is larger than 0 and equal or less than 1. Hence, in our paper, the environmental impact of remanufacturing unit used product should be smaller or equal to the environmental impact of producing a new product. Therefore, we assume that
and
.
is the price that the manufacturer pays to the retailers for collecting used products and we assume
. The profit of the manufacturer in model R is increased with respect to
. To maximize their profit, in the following examples, we assume that
.
and
are the lower bound of the expected profits. We assume that
and
, indicating that the manufacturers and retailers are willing to enter the market as long as they do not lose money in the market. We assume
, which means that the government cares more about environmental issues than public revenue. In order to examine the effects of the main parameters, the different values for
,
,
,
and
will be taken in the numerical experiments.
4.1. Analysis of Tax and Subsidy
To examine how tax influences the return rates and demand for new products in different models, we set
in
Figure 5 and
Figure 6.
Figure 5 demonstrates that the return rates decrease in model C, M and R with the tax, while the return rate in model N keeps at 0. Although taxes are collected from all products, it has impacts on return rates in some models. In assumption 2 and assumption 3, we assume that the investment to return rate is
and
is sufficiently large so that the optimal return rate is smaller than 1. Therefore, collectors should invest relatively higher investment to increase the return rate. In this example, the higher tax leads to higher retail price and lower profits. With lower profit, the firm will decrease the investment to return rate. As a result, when governments increase the tax, the collectors choose to decrease the investment and the return rates also decrease.
Figure 6 shows that the demand for new products decreases with the increase of tax in all channel structures. The reason is that high tax leads to high retail price and low demand. Therefore, by increasing tax, the demand of new products is decreased and the environmental impact of production is also decreased. Also, as shown in
Figure 6, the demand of new products in model C is less sensitive to the change of tax compared with other models M, R and N.
Figure 7 and
Figure 8 show, how subsidy influences the return rates and demand for remanufactured products when we set
.
In
Figure 7, as the subsidy increases, the return rates in model C and M increase. However, the return rate in model R is not sensitive to the change of subsidy. The subsidy is allocated to the manufacturer to improve the remanufacturing process. However, in model R, the retailers take charge of investment and collecting the used products. To maximize their profit, the manufacturers choose to keep the subsidy instead of reallocating the subsidy to the retailers. As a result, the subsidy cannot effectively enhance the return rate in model R.
In
Figure 8, the demand for remanufactured products increases with the increase of subsidy in model C and R. The demand of used products in model C is more sensitive to the change of subsidy comparing with other models. In this paper, we assume that the environmental impacts of the remanufacturing process are less than the manufacturing process
. Therefore, by increasing subsidy, the government can collect more tax to reduce the environmental impacts of production.
4.2. Analysis of Evolutionary Dynamics
In
Section 3.3.2, we have proved that only the centralized supply chain is a stable strategy in this model. Here, we use an example to illustrate the evolutionary dynamics of the manufacturers and retailers with four different channel structures. This example also provides a benchmark case to study the effects of the main parameters of our model. In
Section 4.3, the results derived from this example will be compared with other scenarios, in which the main parameters take different values. The values of parameters in this example are listed in
Table 6.
To analyze the evolutionary process, we solve the government optimization model to derive the optimal tax and subsidy policies. Then, the results of the government model are substituted into the Stackelberg competition models to obtain the price and return rate policies of the manufacturers and retailers in different channel structures. The results of the government model and Stackelberg competition model are listed in
Table 7 and
Table 8.
Under these parameters, the Hessian for this constrained optimization problem is negative definite. The eigenvalues of this matrix are . Moreover, all the examples in this paper have been examined. The results show that the Hessian matrices are negative definite and the eigenvalues are negative. However, because the number of terms in the objective function is too large (more than 846 terms in expanded expression), it is extremely hard to analytically derive the concavity condition of the government’s optimization problem under other parameters.
In this example, the government chooses to collect high taxes from all products and allocate medium subsidy to remanufactured products. The centralized closed-loop supply chain is preferred for the government. The centralized structure has the highest return rate , which is about two times the return rate in model M and four times the return rate in model R. The calculation results are based on the static perspective. In order to find which strategy is preferred for the market from a dynamic perspective, we conduct the simulation to observe the evolutionary process.
The phase diagram which was created using Dynamo [
41] is shown in
Figure 9. The result illustrates the evolutionary process and how the system converges to a stable strategy.
In
Figure 9, points represent the steady-states, in which black one represents the sable point and white points represent the unstable points. The colors in the phase diagram represent the speeds of motion under the dynamic: red is fast and blue is slow. The arrows show that the trajectories will converge to the steady-state (1, 1) no matter where the initial condition is. That indicates that the centralized strategy is evolutionarily stable from a long-term perspective. This result is corresponding to the propositions that are proposed in this paper.
4.3. Effects of Main Parameters
The effects of residual value
, the environmental attention
, environmental impact of remanufacturing process
, firm’s expected profit
and
on the tax and subsidy policies of government and pricing and return strategies of firms are studied in this section. Three values of
,
,
and
are considered to be interpreted as high, medium and low, respectively. The values are shown as follows:
The nonlinear optimization problems of the government model in different scenarios are solved with MATLAB R2014a.
4.3.1. Residual Value
The residual value is calculated by
. Hence, the higher the residual value of remanufactured products, the lower the cost of remanufacturing processes, which means that products are more profitable. The results of the government model and the Stackelberg model with different values for
are listed in
Table 9 and
Table 10.
We consider
as high, medium and low profit remanufactured products, respectively. In
Table 9, as the residual value of the remanufactured products decreases, the subsidy that is allocated to the manufacturers increases. By reducing the residual value from 15 to 5, the subsidy is raised from 13.1639 to 15.8391.
The reason is that firms have enough motivation to remanufacture when the residual value of used products is high. For low profit remanufactured products, the government allocates a higher subsidy to promote environmental preference. Although the government allocates a higher subsidy for the low-profit products, the return rates are still less than the return rates of high-profit products. Therefore, besides increasing the subsidy, it is also important to develop technology so that we can recycle more material from used products.
By considering the government regulations, our results are different from past literature. Savaskan et al. (2004) [
7] found that in decentralized channels, it is the most effective that the retailer takes the charge of collection activity. Our results show that when residual values are low, the manufacturer can obtain more profit by directly collecting the used products from customers. The reason is that model M is more sensitive to the change of subsidy, has a higher return rate and eventually receives more subsidy from the government than model R. Therefore, manufacturers and retailers that adopt model M have higher profit than adopt model R.
From this example, we found that the government prefers to allocate subsidy to low profit remanufactured products.
4.3.2. Environmental Impact of Remanufacturing Process
Although the environmental impacts of the remanufacturing process are lower than the manufacturing process
, the remanufacturing process still have environmental impacts. The effects of
on decisions of government, manufacturers and retailers are illustrated by numerical experiments. The results of the government model and the Stackelberg model with different values for
are listed in
Table 11 and
Table 12.
By increasing the environmental impact of remanufactured products, the subsidy is decreased. That means that the government is more willing to support the low-pollution, low-consumption remanufactured products. In practice, governments also have the same behaviors that promote products that have better environmental preferences. For example, in China, the government promotes green home appliances by introducing subsidy with proposed subsidies of CNY 26.5 billion [
42].
Moreover, as shown in
Table 11, the remanufactured products that have low environmental impacts receive more subsidy which leads to the increase of return rates.
4.3.3. Environmental Attention
The government needs to consider the trade-off between public revenue and environmental issues. We set different values for environmental attention to illustrate the situations when the government pays different attention to environmental issues. The results of the government model and the Stackelberg model with different values for
are listed in
Table 13 and
Table 14.
As shown in
Table 13, the subsidy policy is very sensitive to the environmental attention of the government. When the government pays a low attention to the environmental issues
, no subsidy will be allocated to the remanufactured products. By increasing
, the government will allocate more subsidies to the manufacturers. When
, the subsidy is raised to 13.1639. Observing
Table 13 and
Table 14, as
decreases, the retail prices increase, which leads to a decrease in return rates and total demand, and eventually, the profits of retailers and manufacturers decrease.
4.3.4. Firm’s Expected Profit
In the previous examples, we set
and
, which means that the manufacturers and retailers are willing to enter the market as long as they don’t lose money. The result is that the government collects a relatively higher tax from the firm in all the examples. Here, we consider that the manufacturers and retailers are willing to enter the market only when the average profit is high enough. The results of the government model and the Stackelberg model with different values for
and
are listed in
Table 15 and
Table 16.
When we set and , the government increases the subsidy from 13.1639 to 18.0214 and also decreases the tax from 37.3194 to 32.1401 to make the manufacturers and retailers willing to enter the market. According to those results, the expected revenues of the manufacturer and retailer are satisfied. We found that the policy can increase the return rate up to 0.75843 in the centralized model and the rates are slightly increased in model M and model R. In this example, the centralized supply chain is more sensitive to the tax and subsidy policy. Also, with higher subsidy and lower tax, the profits of manufacturers and retailers in all channel structures are increased. However, the subsidy is decreased slightly from 18.0214 to 17.8515 when we continually increase the expected profits of the manufacturers and retailers. The reason is that in this model, the government reallocates the subsidy which comes from tax. Therefore, when tax is decreased, the subsidy that the government can allocate will also be reduced.
5. Discussion
The results from the numerical experiments show that tax and subsidy policies play an important role in a closed-loop supply chain that influences the pricing and investment decisions of firms. The centralized supply chain model is preferred for the government and it is also chosen by the market. Considering the balance between public revenue and environmental issues, the government’s best policy is to allocate a high subsidy to low-pollution and low-profit remanufactured products.
The tax and subsidy have complex impacts on demands and return rates in a dynamic closed-loop supply chain. Firstly, we analyze the influence of tax. By conducting the numerical experiments, we find that the higher the tax, the lower the return rate. The reason is that the collectors need to invest a lot of money to promote the return rate in this model. When the government increases the tax on new products, the total profits of the manufacturers and retailers decrease. As a result, to keep their profits, the best strategy for them is to decrease the investment for the return rate. Therefore, high tax leads to a low return rate and low total demand.
Secondly, we illustrate the influence of subsidy. By increasing the subsidy to the remanufactured products, both the demands of remanufactured products and return rates are increased in model C and M. Model C is more sensitive than model M which means that the subsidy is more “effective” in model C. However, models R and N are not sensitive to the change of subsidy. Because of the centralized structure, manufacturers and retailers should share their information to make an integrated decision. By the integrated channel, manufacturers and retailers are willing to make more investment in return rate compared with model M. In our model, the subsidy is allocated to the manufacturer. In the decentralized model R, to maximize their own profits, the manufacture’s best strategy is to keep the subsidy instead of sharing it with the retailer who is in charge of collecting the used products. In model N, the return rate keeps at 0, because manufacturers and retailers do not recycle the products. We suggest that the government should be careful with the high tax, because it may lead to bad influences on the market demand and remanufacturing industry. Because centralized supply chains are more sensitive than decentralized ones, the government should allocate the subsidy to the manufacturers that have good cooperation relationship with their upstream or downstream partners. The government should also have a sound allocation system to make sure that the allocation is spent on remanufacturing.
Thirdly, by analyzing the evolutionary dynamics, we find that the centralized structure is more competitive in the market than decentralized ones. The results show that in the centralized structure the products are sold at a lower price. Therefore, they have more market demand. The centralized supply chain is always preferred in the market because of those natures.
Finally, the effects of several main parameters on the decisions of the government, manufacturers and retailers are analyzed. The results show that low-pollution and low-profit remanufactured products are worth allocating a high subsidy to. The reason is that the firms want to maximize their profit in the competition. Although the remanufactured products have low pollution, the firms will not promote remanufacturing when the products have a low profit. By allocating a high subsidy to these products, the government could improve the environmental performance of society. For the high-profit products, the firms have enough motivation to remanufacture the used products. Therefore, the government does not need to allocate high subsidy to this type of products.
6. Managerial Insights
The resulting managerial insights indicate that when the government increases the tax, both return rate and demand for new products are decreased. The reason is that higher tax leads to lower investment to return rate and lower total market demand. By allocating subsidies to the remanufactured products, both return rates and demand for remanufactured products are increased in model C and model M. Also, subsidy policy is more successful in model C. By contrast, a subsidy is not effective in model R and model N. This is because in model R the subsidy for remanufactured products are allocated to the manufacturers, and to optimal their own profits, the subsidy is not transferred to the retailers. The stability of each structure is investigated. We have found that the centralized closed-loop supply chain is always preferred in the market because the cooperation leads to low retail price, high demand and high profits. Also, the centralized structure is preferred for the government because the manufacturers and retailers are sensitive to the government policies and have high return rate. Numerical experiments in different scenarios have shown that the low pollution, high profit remanufactured products are worth allocating subsidy to. When we set a lower bound of the firm’s revenue, the government will increase the subsidy and decrease the tax.
7. Conclusions
In this paper, we have focused on the government tax and subsidy policies in closed-loop supply chains. In the local markets, four supply chain structures have been considered and the optimal wholesale price, retail price and return rate are obtained for each structure based on the Stackelberg model. To describe the whole market, an evolutionary game model has been adopted. Furthermore, we have provided proofs that the centralized structure is the evolutionarily stable strategy. Then, a government model has been developed to obtain the optimal tax and subsidy policies considering both environmental issues and public revenue. Finally, numerical experiments have been conducted to illustrate the proposed model.
There are several future research directions. First, we assumed that demand is a linear function of the retail price. Further research might explore other demand functions, for example, the non-linear function. Second, the manufacturer is considered to be the supply chain leader and the retailer is the follower. A further study could investigate the effects of leadership structure in this model. Third, the perfect substitution between new products and remanufactured products is considered in this paper. In a future study, it would be interesting to consider the substitutability concept in the closed-loop supply chain by developing a new demand function. Fourth, in the proposed model, the government has only one possible policy toward different channel structures. More flexible tax and subsidy policies can be considered in future studies. Furthermore, the return rate is a function of investment in this paper. Further research could allow the return rate to be affected by other factors. Finally, the third party plays an increasingly important role in reverse logistics. Therefore, the model can be extended to third-party scenarios.