Measurement Error and Proxy Variables
Measurement error in the dependent variable raises the noise and inflates standard errors but, under the classical assumption, leaves OLS unbiased. Classical measurement error in an explanatory variable is more damaging: it causes attenuation bias, pulling the estimated coefficient toward zero. A proxy variable is an observable stand-in for an unobserved factor, such as IQ for innate ability, and including a good proxy reduces omitted variable bias. Like OVB, mismeasurement of a regressor threatens consistency, not just precision.
Why it matters
If the ruler you use to measure the outcome is noisy, your predictions are fuzzier but not systematically off. If instead the ruler is on a key input, the noise blurs the link between that input and the outcome, so the estimated effect looks weaker than it really is, dragged toward zero. Proxies fight a related battle: when you cannot observe ability or quality directly, a decent stand-in soaks up part of what would otherwise be hidden and bias your other coefficients.
Formulas
Worked examples
Self-reported income (the regressor) is recorded with survey error in a savings equation.
If true income is measured with classical noise, `regress saving income` produces an attenuated slope, biased toward zero, so the marginal propensity to save looks smaller than it is. The remedy is a better income measure or an instrumental variable, since adding controls alone does not undo errors-in-variables attenuation.
A wage equation cannot observe innate ability, which is correlated with education.
Run `regress lwage educ exper IQ`, using `IQ` as a proxy for ability. Because ability raises both schooling and wages, omitting it biases the return to education; a reasonable proxy absorbs part of that effect and brings the educ coefficient closer to its true value, reducing omitted variable bias.
Common mistakes
- ✗Assuming all measurement error biases coefficients. Classical error in alone inflates the error variance but keeps OLS unbiased; error in a regressor is what causes attenuation.
- ✗Thinking attenuation makes effects look larger. Classical errors-in-variables shrinks the coefficient toward zero, understating the true effect.
- ✗Believing a bigger sample cures measurement error. Attenuation is a consistency problem; the bias persists no matter how large becomes.
- ✗Treating any available variable as a valid proxy. A useful proxy must be genuinely related to the unobserved factor and, ideally, redundant once that factor is controlled.
Revision bullets
- •Classical error in : more noise, larger SEs, still unbiased.
- •Classical error in : attenuation bias toward zero.
- •Mismeasured regressors threaten consistency, not just precision.
- •A proxy stands in for an unobserved factor to reduce OVB.
- •Attenuation does not disappear as the sample size grows.
Quick check
Classical measurement error in an explanatory variable causes:
Including IQ as a proxy for unobserved ability in a wage equation is intended to:
Connected topics
Sources
- Wooldridge (2019), §9.4Wooldridge, Jeffrey M. Introductory Econometrics: A Modern Approach. 7th ed. Cengage, 2019.Derives classical errors-in-variables attenuation for a mismeasured regressor and the benign case of error in .
- Wooldridge (2019), §9.2Wooldridge, Jeffrey M. Introductory Econometrics: A Modern Approach. 7th ed. Cengage, 2019.Develops the proxy variable solution to omitted variables, using IQ for unobserved ability.