1. Introduction
It is well known that spillovers arise in corporate finance through firm competition and geographical agglomeration; see, for example, [
1], BRS21 hereon. Firm-level outcomes, such as sales or investments, depend on firms’ own treatment assignment in a given intervention and on the fraction of firms treated in the same industry and/or geographical region. As an example of these spillovers, consider the following illustration taken from BRS21. Assume that some coffee shops (i.e., the treated shops) in a given neighborhood are subject to a rise in the price of coffee beans (the treatment). The rise in the input price leads to a rise in the final price per cup of coffee for treated shops, and consequently, to a reduction in their volume of sales, i.e, the direct effect of the treatment. This is not the only effect on sales. It is likely that due to the increase in price, some consumers switch coffee shops. For coffee shops in the same neighborhood whose price has not changed, this implies an increase in the volume of sales, i.e, the spillover effect on the untreated. This spillover is an example of interference in causal inference; see, for example, [
2].
Spillovers can lead to a complicated bias in the estimation of causal effects; see, for example, BRS21. Spillover bias arises when the coverage of an intervention is omitted from the analysis. In terms of the above illustration, the coverage of the intervention is the proportion of coffee shops in a neighborhood that were affected by the increase in their costs. The reduction in the volume of sales suffered by shops in the treated group could be lower if more shops in the neighborhood were affected by the increase in costs. Omitting the proportion of affected coffee shops in a given neighborhood would induce a bias in the estimation of the direct effect. The coverage of an intervention can be measured either by the group-level average or by the leave-one-out average proxy. There is a The group-level average proxy is the average number of firms in a group subject to the treatment including the firm itself. The leave-one-out average proxy is the average number of firms subject to the treatment excluding the firm itself. Little is known about which of the two proxies one should use when controlling for spillovers.
The objective of this paper is to compare the implications for the spillover bias when using the leave-one-out average or the group-level average proxy. We show that choosing the leave-one-out average proxy has two advantages. First, it simplifies the formula for the spillover bias, which facilitates its diagnosis. Second, it clarifies the definition of the average indirect effects, thereby facilitating its interpretation. These advantages justify the use of the leave-one-out average as the preferred proxy.
The leave-one-out average proxy suggests a straightforward statistical test for the diagnosis of the spillover bias. The test is a heteroskedastic-robust Wald test for the null hypothesis of equal average indirect effects on the treated and untreated groups. If this null hypothesis is rejected, the ordinary least square estimator of the average direct effect omitting the spillovers is biased. We illustrate the implementation of this test in the context of measuring the effect of credit supply contractions on firms’ employment decisions.
The rest of the paper proceeds as follows: In
Section 2, we describe the two proxies to model spillovers in empirical research in corporate finance.
Section 3 contains the main result and a discussion of the advantages of using the leave-one-out average as the preferred proxy.
Section 4 presents results from a Monte Carlo study exploring the bias of alternative estimators of the average direct effect.
Section 5 presents an illustration of the implications of our results.
Section 6 concludes.
Appendix A contains auxiliary calculations.
2. Framework and Graphical Representation
Let
denote an outcome, such as investments, debts, sales, or employment, for firm
i belonging to group
g. Group
g typically represents an industry or region. Following BRS21, we assume that
is determined by
where
is an unknown function,
is a treatment indicator variable, and
is the group-level treatment coverage (or intensity). The treatment indicator variable is equal to one if firm
i receives the treatment, and is equal to zero otherwise. The group-level treatment coverage takes values between zero and one. The available data are a sample of size
n, where the group variable
records firm
i’s group. The object of study in this paper is the empirical specification of
.
Causal estimands of interest include the average direct, indirect, total, and overall effects. The average direct effect is the difference between the average outcome for treated and untreated firms given all other things being equal. Following the illustration in the introduction, the average direct effect is the average change in the sales of coffee shops that experienced an increase in the price of coffee beans in the absence of spillovers. Formally, the average direct effect is
The average indirect effects are those due to treatment coverage. They can be defined by comparing the outcomes in the treated or untreated firms. Following the illustration, the average indirect effect on the treated firms is the difference in average sales for a treated coffee shop between two hypothetical situations for the group: the group is fully treated vs. the group is not treated at all. Formally, the average indirect effect on the treated is
The average indirect effect on the untreated is defined similarly:
The average total and overall effects provide summary measures combining direct and indirect effects. The average total effect is the sum of the average direct effect and the average indirect effect on the untreated:
The average overall effect is
Finally, for later reference, we define the average effect at the a-coverage as
where
a is a number between zero and one. Following the illustration, the average effect at the a-coverage is the difference in average sales for treated and untreated coffee shops when the group is treated at the coverage level
a.
We consider two alternative models to estimate the causal estimands of interest. In the first model, the treatment assignment
is independent of the group assignment variable
. This model is a special case of the setting delineated by [
2].
Spillover Model with Leave-one-out Average:
where
is the leave-one-out average,
is the number of firms in group
g, and
is the indicator function, taking a value of one when the condition in parentheses is satisfied, and zero otherwise. In this model, the treatment is allocated as in the simple random treatment assignment assumption, i.e.,
is mean independent of
for any
,
is independent of
for any
, and
and
are independent and identically distributed for any
. In particular, Assumption (11) restricts the dependence between the treatment indicator variable
and the group variable
. Since
and
are both observed, this restriction is testable and hence should not be taken as a disadvantage of the model. This model uses
as a proxy for the coverage
. It delivers the approximations
The approximations for and coincide if the average indirect effects are homogeneous, i.e., .
In the second model, the treatment indicator and the group variable can be related. Treatment in this model may not be assigned as in the simple random treatment assignment assumption. This model has been postulated by BRS21.
Spillover Model with Group-Level Average:
where
is the group-level average treatment. This model uses
as a proxy for
. BRS21 shows, under additional assumptions replicated in the appendix, that this model delivers the approximations:
To facilitate the comparison between the two models, we now represent them using causal graphs.
Two differences arise when comparing the models. First, the proxies for coverage, and consequently, the approximations of the estimands of interest, do not coincide. While and the group-level average are correlated, and the leave-one-out average are not. Notice the absence of an arrow connecting the nodes ‘Treatment’ and ‘Coverage’ in Causal Graph I. Second, while the treatment variable in the model with the leave-one-out average is assumed to follow the simple random assignment assumption, in the model with the group-level average, it is not clear whether treatment is allocated as in a more sophisticated experimental procedure. Notice the presence of the bi-directed arrow connecting the nodes ‘Treatment’ and ‘Group’ in Causal Graph II. Little is known about whether these differences are relevant and, if they are, whether one should use the group-level average or the leave-one-out average proxy. The next section spells out two advantages of using the leave-one-out proxy. These advantages illustrate, first, the relevance of the choice of proxy for the coverage and, second, the benefits obtained from the rigorous modeling of the treatment allocation procedure.
3. Main Results
To proceed, we compare the spillover bias arising from estimating using a baseline model ignoring spillovers.
If there are no spillovers, the OLS estimator of is an unbiased estimator of the average direct effect. BRS21 (Proposition 1) proves the following result:
Lemma 1. The spillover bias of the baseline estimator for the estimand is: Proposition 1. The spillover bias of the baseline estimator for the estimand is: The expression in Proposition 1 is simpler to interpret than the one in Lemma 1: is an unbiased estimator of if and only if the indirect effects on the treated and the untreated groups are homogeneous, i.e., . This is the first advantage of choosing the leave-one-out average proxy.
From the characterization of the spillover bias in Proposition 1, the following statistical test can statistically infer if the baseline estimator is a biased estimator of the average direct effect.
Corollary 1. Empirical researchers can check that the baseline estimator is biased for the average direct effect by performing a heteroskedastic-robust Wald test for the null hypothesis versus the alternative based on the ordinary least squares estimator of .
This check complements the heuristic guidance suggested by BRS21 by providing a test for statistically inferring the presence of spillover bias. If is independent of the treatment indicator variables and the group indicator variables, the homoskedastic-only Wald test is an alternative to perform this check.
What is the baseline estimator unbiased for? Since
one has
and the following corollary holds.
Corollary 2. The baseline estimator is unbiased for the average effect at the average coverage .
The average effect at the average coverage is not equal to the sum of the average direct effect and the average indirect effect on the treated firms, which should prevent one from interpreting the baseline estimator as an unbiased estimator of the aggregation of the average direct effect and the average indirect effects (see, for example, [
3]).
The second advantage of choosing the leave-one-out average proxy comes from the interpretation of the approximation . Consider the case of a group g with two firms. Only i is treated, so and . In this case, there is no indirect effect on the treated firm, which is not reflected in the difference . Compare this result with , obtained using the leave-one-out average. This suggests that approximates the average indirect effect on the treated that we are looking for, while approximates something else. Another way of interpreting this difference is that the group-level average counts ‘twice’ the effect of : by including it, first, in , and, second, in in . The leave-one-out average counts only once the effect of : by including it in and excluding it from in .
4. Monte Carlo Exercises
To explore the finite sample properties of the estimator using the leave-one-out average proxy, we carry out a Monte Carlo study. We consider the specification:
where
and
has a normal distribution with mean 0 and variance 2. The design
has homogeneous average indirect effects and
has heterogeneous average indirect effects. These values are taken from the illustration in BRS21. The treatment variable
follows a Bernoulli distribution with mean
. The group variable
follows from the specification
where
, and
is a standard normal random variable. The group variable
is independent of the treatment variable
. The disturbance term
is independent of the covariates.
Table 1 reports the results for the bias of different estimators of
. ‘Baseline’ labels the ordinary least squares estimator from the specification (15) and ‘leave-one-out’ the ordinary least squares estimator from (8). As predicted by the theory for these experiments, the bias of the baseline estimator of the average direct effect is approximately
, which is half of the difference between the average indirect effect on the treated and untreated firms.
5. Illustration
We now illustrate the use of the previous results in the context of applications conducted in the empirical literature. The aim is to show the advantages of using the leave-one-out average proxy to diagnose the spillover bias on the baseline estimator.
There is a growing body of empirical literature seeking to incorporate spillovers in baseline models. These papers differ in their modeling of spillovers in two dimensions. They either use the group-level average or the leave-one-out average as a proxy for the treatment coverage, and they either assume homogeneous or heterogeneous average indirect effects.
Table 2 below summarizes these differences among already published papers.
Our results apply to any of these papers. We choose the application in BRS21 because the careful execution of the study lends itself to extension by applying the result in Proposition 1 (and its corollaries).
The estimand of interest is the average direct effect of a bank-lending cut (the bank in the database is Commerzbank) on German firms’ employment growth. Here, is the symmetric growth employment rate over the 2008 to 2012 period for firm i located in county g; is a dummy variable that equals one if the fraction of the firm’s relationship banks that are Commerzbank branches is greater or equal than 0.5, and is zero otherwise ( in BRS21’s notation); is the average Commerzbank dependence calculated based on of all other firms in the county g, excluding firm i itself ( in BRS21’s notation). For the convenience of the reader, we reproduce the estimates in the table below (see BRS21, Table 5, Columns (4) and (6)).
By comparing the baseline estimate
with
in
Table 3, BRS21 infers that ignoring spillovers causes the baseline estimator to be biased for the average direct effect. This comparison, however, does not take into account sampling variability, which, as we are going to show below, can change the above inference.
The estimate
is 0.025, whereas the estimate
is −0.115. To verify that this difference is not only due to sampling variability, Corollary 1 proposes a Wald test. Performing this test is straightforward. It requires the Wald test statistic to be computed:
where
and
are the OLS estimators for the estimands
and
,
and
are their respective standard errors, and
is the covariance estimator. The asymptotic null distribution of the Wald statistic is a chi-squared distribution with one degree of freedom, from which we can compute critical values. The Wald test suggests rejecting the null hypothesis (and statistically inferring that the baseline estimator is biased for the average direct effect) if the realized value of the Wald test statistic is greater than or equal to the critical value.
Table 3 contains all of the values to compute the realized value of the Wald test statistic, except for
. For illustrative purposes, we take two values: a lower bound of zero and an upper bound from the Cauchy–Schwarz Inequality. In the case of the upper bound, the realized value of the Wald statistic is
while the critical value at the
confidence level is
. Since the realized value of the statistic (
) is greater than the
critical value (
), the test indicates that the baseline estimator is biased for the average direct effect. However, the baseline estimator is still an unbiased estimator for the average effect at the average coverage (Corollary 2). In the case of the lower bound, the realized value of the statistic (
) is smaller than the
critical value (
). In such a case, Proposition 1 indicates that there is no evidence that the baseline estimator is a biased estimator of the average direct effect. We conclude, from the estimates in
Table 3, that one cannot infer that ignoring spillovers causes the baseline estimator to be biased for the average direct effect. We remark that these results are not immediately available if one chooses the group-level average as a proxy for the coverage.
6. Conclusions
Competitive interactions and agglomeration among firms generate spillovers after a shock, a change in regulation, or any kind of intervention affecting firms. Ignoring these spillovers when estimating causal effects leads to biased estimation. This paper discusses the choice between two alternative proxies for modeling spillovers. We show that this choice is relevant for diagnosing the existence of spillover bias. The leave-one-out average proxy has two advantages over the group-level average proxy. First, it simplifies the formula for the spillover bias, thereby facilitating its diagnosis. The baseline estimator is unbiased for the average direct effect if and only if the average indirect effects are homogeneous. Second, it clarifies the definition of the average indirect effect on the treated firms, thereby facilitating its interpretation. These advantages justify the use of the leave-one-out average as the preferred proxy and suggest a straightforward test to statistically infer the existence of spillover bias.
One natural extension is to investigate how to define the coverage proxy when the treatment is continuous instead of binary. This extension is outside of the scope of this paper and is left for future research.