Skip to content

Omitted Variable Bias

Omitted variable bias (OVB) arises when you leave out a variable that belongs in the model and that variable correlates with an included regressor. The included slope then absorbs part of the omitted effect, so OLS is biased and stays biased as the sample grows, which means it is inconsistent for the true coefficient. The bias direction follows sign(βomitted)×sign(corr)\text{sign}(\beta_{\text{omitted}})\times\text{sign}(\text{corr}) between the omitted and included variables.

Try it yourself

Omitted variable bias

True model y = β₀ + β₁x₁ + β₂x₂ + u. Drop the relevant x₂ and the short regression hands x₂'s credit to x₁. The bias is β₂·δ, where δ = ρ·(σ₂/σ₁) is the auxiliary slope of x₂ on x₁.

Assumption (held fixed): β₁ = 1.00, σ₁ = σ₂ = 1, so δ = ρ. This is a population (plim) result, no sampling noise.

Short-regression slope on x₁1.40bias +0.40
-2-101234Estimated coefficient on x₁bias = β₂·ρ = +0.40long: β₁ = 1.00short: 1.40Long (includes x₂): unbiasedShort (omits x₂): biased
Auxiliary slope δ = ρ·(σ₂/σ₁) 0.50Bias = β₂·δ +0.40
Omitted effect β₂+0.80
Correlation ρ = corr(x₁, x₂)+0.50
Omitting x₂ pushes the slope on x₁ above the truth: the short estimate 1.40 misses β₁ = 1.00 by +0.40. The sign follows sign(β₂)·sign(ρ); only ρ = 0 or β₂ = 0 closes the gap. More data will not help, OLS is inconsistent here.

Why it matters

If ability raises wages and also rises with schooling, a wage regression that omits ability hands ability’s credit to educ. The schooling coefficient then overstates the true return. Because the problem is built into the population relationship, more data does not fix it.

Formulas

Bias of the short-regression slope
E(β~1)=β1+β2δ1E(\tilde{\beta}_1)=\beta_1+\beta_2\,\delta_1
β2\beta_2 is the omitted variable’s effect; δ1\delta_1 is its regression slope on x1x_1.
Direction of bias
Bias=β2δ1\text{Bias}=\beta_2\,\delta_1
Sign of bias = sign of β2\beta_2 times sign of corr(x1,x2)(x_1,x_2).

Worked examples

Scenario

You regress lwage on educ but omit ability, which you cannot observe.

Solution

Since ability plausibly raises wages (β2>0\beta_2>0) and correlates positively with schooling (δ1>0\delta_1>0), the bias is positive, so `regress lwage educ` overstates the return to schooling. Adding a proxy such as IQ via `regress lwage educ IQ` shrinks the educ coefficient.

NoteUpward bias here is the classic ability-bias story in labor economics.

Common mistakes

  • Thinking a bigger sample cures OVB. The bias survives in the limit, so OLS is inconsistent, not merely imprecise.
  • Believing only variables correlated with the outcome cause bias. The omitted variable must correlate with the included regressor, not just with yy.
  • Assuming the bias is always upward. Its sign depends on the product of the omitted effect and the correlation, so it can pull the estimate either way.
  • Treating OVB as a small-sample or measurement quirk. It is a population specification problem that violates the zero conditional mean assumption.

Revision bullets

  • OVB needs a relevant omitted variable that correlates with an included regressor.
  • It biases the coefficient and persists in large samples, so OLS is inconsistent.
  • Sign of bias = sign(βomitted\beta_{\text{omitted}}) times sign(corr between omitted and included).
  • A proxy or control for the omitted factor reduces the bias.
  • OVB is the failure mode behind the zero conditional mean assumption MLR.4.

Quick check

Omitting a relevant variable biases OLS only when that variable is correlated with what?

Ability raises wages and is positively correlated with schooling. Omitting ability makes the educ coefficient:

Why is omitted variable bias described as a threat to consistency?

Connected topics

Sources

  1. Wooldridge, Introductory Econometrics, Ch. 3
    Wooldridge (2019), Introductory Econometrics: A Modern Approach, 7th ed., Sec. 3.3
  2. Angrist & Pischke, Mostly Harmless Econometrics
    Angrist & Pischke (2009), Mostly Harmless Econometrics, Ch. 3 (omitted variables bias formula)
How to cite this page
Dr. Phil's Quant Lab. (2026). Omitted Variable Bias. Derivatives Atlas. https://phucnguyenvan.com/concept/efm-omitted-variable-bias