Next Article in Journal
On the Interpretation of Instrumental Variables in the Presence of Specification Errors: A Reply
Previous Article in Journal
Using a Theory-Consistent CVAR Scenario to Test an Exchange Rate Model Based on Imperfect Knowledge
 
 
Reply published on 19 July 2017, see Econometrics 2017, 5(3), 32.
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Comment

On The Interpretation of Instrumental Variables in the Presence of Specification Errors: A Causal Comment

Economic Studies Division, Central Bank of Austria, Otto-Wagner-Platz 3, Vienna 1090, Austria
The views expressed in this paper do not necessarily reflect those of the Central Bank of Austria or the Eurosystem.
Econometrics 2017, 5(3), 31; https://doi.org/10.3390/econometrics5030031
Submission received: 22 March 2017 / Revised: 9 June 2017 / Accepted: 11 July 2017 / Published: 18 July 2017

Abstract

:
Swamy et al. (2015) argue that valid instruments cannot exist when a structural model is misspecified. This note shows that this is not true in general. In simple examples valid instruments can exist and can help to estimate parameters of interest.
JEL Classification:
C18; C26

1. Introduction

Does an efficient financial system enhance growth? Does uncertainty depress private investment? Do private schools provide better education than public schools? In answering these and many other empirical questions instrumental variables play a central role.
In a recent paper Swamy et al. (2015) argue that valid instruments cannot exist in the presence of any model misspecification. Such mis-specifications include wrong functional form, omission of relevant explanatory variables, and the presence of measurement error in explanatory variables. As a consequence, instrumental variables (IV) and generalized method of moments (GMM) would not work.
No doubt, instruments may be hard to find or difficult to justify in empirical applications. The claim that instruments cannot exist appears to be too strong, however. This note discusses three simple examples where valid instruments can exist and where IV methods will work.
This note proceeds as follows. The next section reproduces the derivations in Swamy et al. (2015) for a simple linear model. This example helps to explain the logic of their nonexistence result for instrumental variables.
Section 3 briefly discusses the nature of structural models. The aim is to clarify some misconceptions that bedevil the interpretation of structural models.
Section 4 reexamines the basic arguments of Swamy et al. (2015) in the context of three extremely simple structural models. All three regressions for estimating the structural parameter of interest are misspecified in these cases. In the first two cases one of the two explanatory variables has been omitted. In the third case the single explanatory variable contains measurement error.
The following will be shown. In the first case the existence of a valid instruments depends on the purpose of the analysis. In the second case instruments can exist and will produce consistent estimates of the structural parameter of interest. In the third example instruments can solve the measurement error problem.

2. Non-Existence of Instruments

The central argument in Swamy et al. (2015) is that the explanatory variables that must be instrumented in an empirical model to obtain consistent estimates are at the same time also part of the error term of the model. All potential instruments are therefor necessarily correlated with the error term. The requirement that instruments must be uncorrelated with the error term is always violated. Hence, valid instruments cannot exist.
Swamy et al. (2015) start with a very general theoretical relationship
y t * = f t ( x 1 t * , ... , x L ( t ) t * )
with unknown functional form, where a possibly time dependent number L ( t ) of variables x i t * determine y t * . This theoretical relationship is assumed to be exact. Thus there is no need for an error term.
The derivations in Swamy et al. (2015) are now repeated for a simple linear model where y t * is exactly determined by two variables x 1 t * and x 2 t * . This model can be written as
y t * = α 0 + α 1 x 1 t * + α 2 x 2 t * .
Equation (2) is the true model.
Let us assume that we can only observe the measurements y t = y t * + ν 0 t and x 1 t = x 1 t * + ν 1 t where ν 0 t and ν 1 t are measurement errors. Thus, the model that we can estimate has the correct functional form, but suffers from measurement error and an omitted variable.
Swamy et al. (2015) argue that the relationship between the unobserved and the observed determinant can always be written as
x 2 t * = λ 0 t + λ 1 t x 1 t *
where λ 0 t is the portion of x 2 t * that remains after the effect of x 1 t * is removed. Substituting (3) into (2) yields
y t * = α 0 + α 2 λ 0 t + ( α 1 + α 2 λ 1 t ) x 1 t * .
Accounting for the measurement errors in (4) gives
y t = α 0 + α 2 λ 0 t + ν 0 t + ( α 1 + α 2 λ 1 t ) ( 1 ν 1 t x 1 t ) x 1 t
This equation includes the bias arising from omitting x 2 t * , the bias from measurement error in x 1 t , and the measurement error in y t . It is a simple version of the fully general expression (8) in Swamy et al. (2015). This model can be written as y t = γ 0 t + γ 1 t x 1 t where the parameters γ 0 t = α 0 + α 2 λ 0 t + ν 0 t and γ 1 t = ( α 1 + α 2 λ 1 t ) ( 1 ν 1 t x 1 t ) are time-varying.
Let us now consider two of the cases discussed in Swamy et al. (2015). In the first case the model is linear. Adding and subtracting the constant parameter model β 0 + β 1 x 1 t to (5) yields
y t = β 0 + β 1 x 1 t + ( α 0 + α 2 λ 0 t + ν 0 t β 0 + ( α 1 + α 2 λ 1 t ) ( 1 ν 1 t x 1 t ) β 1 ) x 1 t )
where the last two terms become the error term in the model to be estimated. A valid instrument must be correlated with x 1 t and uncorrelated with the error term. But x 1 t is also in the error term of (6). Thus, every instrument for x 1 t must be correlated with the error. No valid instrument exists in model (6).
The other case is a simple example of a model with a measurement error in the explanatory variable. This model is
y t * = β x t *
where the measured value of the explanatory variable is x t = x t * + ν t . The model that can be estimated is
y t * = β x t β ν t
where β ν t is the error term. Written as time-varying coefficient model (8) becomes
y t * = β t x t
where β t = β ( 1 ν t x t ) . Adding and subtracting a fixed coefficient model where β * * β yields
y t * = β * * x t + ( β t β * * ) x t
and x t is again also in the error term of Equation (8). No instrument for x t would work.

3. Structural Models

Before we reexamine the arguments in Section 2 we need to clarify what structural models are and what kind of assumptions they encode. This note cannot give a full exposition, of course. Pearl (2009a) provides a comprehensive treatment of structural models and causal inference in general. In particular, Chapter 5 in Pearl (2009a) discusses the interpretation of structural parameters and the error term in structural models in great detail. This section draws heavily on Pearl (2009b) which gives an excellent overview of the foundations of causal modeling with structural equations.
Let us consider the simple linear structural equation
y = β x + ν 0 .
where y depends on x and ν 0 , a variable that stands for all other factors that affect y when x is held constant. Particular values of x and ν 0 assign a particular value y = β x + ν 0 .
Equation (11) encodes the causal assumption that changing or manipulating x causes y to vary. The strength of the this effect is β . Note that the interpretation of β does not depend on ν 0 . The equation says that the effect of a unit change in x on y is β , regardless of the values taken by the other variables in the model. Whether or not x is correlated with ν 0 plays no role. Equation (11) describes a causal mechanism, not statistical associations. The correlation between x and ν 0 becomes important, however, when one attempts to estimate the causal parameter β from observational data.
Furthermore, the relation between x and y is asymmetric. The equality sign is therefore somewhat misleading. Rewriting (11) as
x = ( y ν 0 ) / β
would lead to the miss-interpretation that y causes x. For example, when y is a symptom of a disease x than (12) would imply that the symptom causes the disease. This makes of course no sense.
The graph in Figure 1 makes the causal relationship between x and y explicit. The arrow that points from x to y shows the direction of causality between these variables. The solid nodes indicate that x and y are observable variables. The hollow node indicates that the variable ν 0 is unobserved. The absence of a link between x and ν 0 in this graph indicate the these two variables are assumed to be independent.
Another important assumption in Equation (11) is the invariance of the target parameter β . This is an identifying assumption. The assumption implies that the causal link between x and y is stable. One could of course assume that β changes over time in some way. But this would imply a very different structural model.
Until now nothing was said about estimating Equation (11). The question is whether the causal effect β can be estimated from observational data. When x is uncorrelated with ν 0 then β is identified and consistently estimable from data on y and x. When x and ν 0 are dependent, however, then we need additional information to consistently estimate β . This information may come from an instrumental variable.

4. Non-Existence Revisited: Instruments Do Exist

Let us now turn again to our simple model given by Equation (2) and let us assume that we can only observe y * and x 1 * . We cannot observe x 2 * . For simplicity the intercept α 0 is set to zero. The time subscript t is superfluous and therefore dropped.
Figure 2 shows two possible structural models. In model (a) x 1 * affects y * directly and indirectly via its effect on x 2 * . In model (b) the variable x 2 * is a confounding variable that jointly affects y * and x 1 * . In both graphs z is an instrumental variable.
The regression model that is actually estimated is
y * = β x 1 * + ϵ
where the error term ϵ = α 2 x 2 * . Thus, the model is misspecified because x 2 * is omitted. In both cases x 1 * is correlated with the error. Let us now compute the ordinary least squares (OLS) and IV estimates for both cases.
Case (a): Easy computations show that OLS yields
β ( a ) O L S = C o v ( y * , x 1 * ) V a r ( x 1 * ) = ( α 1 + α 2 λ 12 ) V a r ( x 1 * ) V a r ( x 1 * ) = α 1 + α 2 λ 12
This is the total (i.e., direct + indirect) effect of x 1 * on y * . The IV estimate for β is
β ( a ) I V = C o v ( y * , z ) C o v ( x 1 * , z ) = ( α 1 + α 2 λ 12 ) δ V a r ( z ) δ V a r ( z ) = α 1 + α 2 λ 12
and therefore the same as the OLS estimate. When the goal is to estimate only the direct effect of x 1 * on y * no instrument for x 1 * works as argued in Swamy et al. (2015). If one is interested in the total effect no instrument is needed anyway.
Case (b): Now x 2 * is a confounding variable. As already mentioned, this model is miss-specified and Swamy et al. (2015) would conclude that a valid instrument cannot exist. But in fact a valid instrument can exist. OLS wont work but IV estimation will.
OLS yields
β ( b ) O L S = α 1 λ 21 2 V a r ( x 2 * ) + α 2 λ 21 V a r ( x 2 * ) λ 21 2 V a r ( x 2 * ) = α 1 + α 2 ( 1 / λ 21 )
where the second term is the well known omitted variable bias. The term ( 1 / λ 21 ) is the coefficient from a regression of x 2 * on x 1 * . Note that this auxiliary regression has no causal content. The regression just measures the statistical association between x 2 * and x 1 * . Some simple algebra shows that IV estimation yields now
β ( b ) I V = α 1 δ V a r ( z ) δ V a r ( z ) = α 1
which is the structural parameter that we wanted to estimate.
Why can an instrument work in case (b) but not in case (a)? In case (a) the error term ϵ = α 2 x 2 * = α 2 λ 12 x 1 * is caused by the included variable. The error term is thus indeed a function of x 1 * . No instrument for x 1 * will therefore work if one wants to estimate the direct effect of x 1 * on y * .
Case (b) is quite different. in Figure 2b the error is not a function of the included explanatory variable. Equation (3) in Section 2 is thus not consistent with the underlying structural model. Varying x 1 * does not affect x 2 * . Instead the omitted variable x 2 * in the error term is a cause of the explanatory variable x 1 * . The instrument z is another cause of x 1 * that is independent of x 2 * . Thus the error and the instrument are uncorrelated. Moreover, the instrument affects the dependent variable y * only via x 1 * . IV estimation works with a proper instrument.
Let us now turn to the second example in Section 2 where y t * = β x t * and the explanatory variable x t = x t * + ν t is measured with error. If y t * = β x t * is the true structural model then the causal effect of x t * on y t * is stable. The constant parameter β reflects this. Hence, we cannot simply transform this model into a time-varying coefficient model. Such a model states that the causal effect is unstable and changes over time. The transformed model is therefore inconsistent with the true structural model that we want to estimate. The transformed model has very different implications for the causal link between x t * and y t * . It is a different structural model.
To be consistent with the original structural model one should estimate the model y t * = β x t β ν t . Here the error is not a function of x t and instruments can in principle be found. For example, any cause of x t * that is unrelated to the measurement error ν t and does not directly cause y t * would be a valid instrument for x t .

5. Conclusions

The examples presented in this note demonstrate that instruments can exist. The arguments in Swamy et al. (2015) hold when the structural error is indeed a function of the included explanatory variables. But this is rarely the case in a structural model. For instance, omitted confounding variables are not functions of included variables. The three variable model (b) in Figure 2 provides a simple example. The model is misspecified, but a valid instrument exists since the omitted confounding variable is not a function of the included variable.
Furthermore, Swamy et al. (2015) assume that any model can be expressed as a time varying coefficient (TVC) model. The true model would then be a special case of this general model.
Of course, a constant parameter model is a special case of a TVC model. A true structural model with constant parameters cannot be turned into an equivalent TVC model, however. A structural model with constant parameters cannot at the same time be expressed as a model where the parameters vary over time. The later model has very different causal implications.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Swamy, Paravastu A.V.B., George S. Tavlas, and Stephen G. Hall. 2015. On the Interpretation of Instrumental Variables in the Presence of Specification Errors. Econometrics 3: 55–64. [Google Scholar] [CrossRef]
  2. Pearl, Judea. 2009a. Causality: Models, Reasoning and Inference, 2nd ed. Cambridge: Cambridge University Press. [Google Scholar]
  3. Pearl, Judea. 2009b. Causal inference in statistics: An overview. Statistics Surveys 3: 96–146. [Google Scholar] [CrossRef]
Figure 1. Graph of the structural relationship (11).
Figure 1. Graph of the structural relationship (11).
Econometrics 05 00031 g001
Figure 2. Two possible structural relationships.
Figure 2. Two possible structural relationships.
Econometrics 05 00031 g002

Share and Cite

MDPI and ACS Style

Raunig, B. On The Interpretation of Instrumental Variables in the Presence of Specification Errors: A Causal Comment. Econometrics 2017, 5, 31. https://doi.org/10.3390/econometrics5030031

AMA Style

Raunig B. On The Interpretation of Instrumental Variables in the Presence of Specification Errors: A Causal Comment. Econometrics. 2017; 5(3):31. https://doi.org/10.3390/econometrics5030031

Chicago/Turabian Style

Raunig, Burkhard. 2017. "On The Interpretation of Instrumental Variables in the Presence of Specification Errors: A Causal Comment" Econometrics 5, no. 3: 31. https://doi.org/10.3390/econometrics5030031

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop