Instrumental Variables Estimation
What is Instrumental Variables Estimation?
Instrumental Variables (IV) estimation is a method used in statistics and econometrics to address the problem of endogeneity, which occurs when an independent (explanatory) variable is correlated with the error term in a regression model. The approach uses instruments—variables that are correlated with the endogenous explanatory variable but uncorrelated with the error term—to obtain consistent estimates. IV estimation is particularly useful in causal inference when randomized experiments are not feasible.
The Basic Idea
Instrumental Variables (IV) estimation is a tool economists and social scientists use when they want to deduce a cause-and-effect relationship between two variables—specifically when there's a problem, and hidden factors are messing with results.
Imagine you're trying to see how pet ownership affects people’s long-term health. The challenge is that factors like income level or personality traits, for example, could influence both an individual’s likelihood to adopt a pet and their long-term health. This creates a problem because these hidden factors make it difficult to know whether it’s actually the pet ownership that’s affecting health, or if baseline health and wealth impact whether someone is more likely to own a pet in the first place.
That’s where IV estimation comes into play. Instead of directly trying to link pet ownership to health, researchers use an ‘instrument’—a third variable that’s related to pet ownership but doesn’t directly affect health. The key is finding something that influences whether someone adopts a pet, without being influenced by the same hidden factors.
For example, one potential instrument could be the presence of pet adoption campaigns in a specific neighborhood. A greater number of pet adoption advertisements won’t have an impact on the health of the people in the area, but it would likely increase someone’s willingness to adopt. Using this outside variable of adoption campaigns as an instrument helps to isolate the effect of pet adoption on health, stripping away the noise from the hidden factors.
In more technical terms, the number of pet adoption campaigns is considered an exogenous variable because it’s unrelated to the health outcomes of people in the area (they influence pet adoption rates but don't directly affect health). On the other hand, if we tried to use something like income as a predictor of pet adoption, we might face endogeneity because income could also influence health outcomes, creating a correlation with both adoption and health which leads to biased results.
To recap, the instrumental variable (the adoption campaigns) helps overcome this bias by isolating the effect of pet adoption on health. Thus, exogeneity ensures the variable only impacts the predictor (adoption rates) and not the outcome (health) and endogeneity occurs when a variable impacts both, confounding the analysis.
About the Author
Annika Steele
Annika completed her Masters at the London School of Economics in an interdisciplinary program combining behavioral science, behavioral economics, social psychology, and sustainability. Professionally, she’s applied data-driven insights in project management, consulting, data analytics, and policy proposal. Passionate about the power of psychology to influence an array of social systems, her research has looked at reproductive health, animal welfare, and perfectionism in female distance runners.