Variance of the OLS Estimators
Adding the homoskedasticity assumption SLR.5, (the error variance is constant across ), gives the sampling variance of the slope as . The variance shrinks with a smaller error variance and with more spread in , and it inflates when is large or barely varies. Because is unknown, it is estimated by , whose square root is the standard error of the regression, and the reported standard error of is the square root of the estimated variance.
Why it matters
How precise the slope is depends on three things, the noise in , the spread of , and the sample size. A noisier outcome (large ) makes the slope harder to pin down. A wider spread of acts like a longer lever, anchoring the line more firmly, so more variation in lowers the variance. More data points in the denominator help too. Standard errors translate this variance into the everyday measure of how much you should trust the coefficient.
Formulas
Worked examples
After `regress wage educ`, Stata reports a coefficient on `educ` of 0.54 with a standard error of 0.05, and a "Root MSE" of 3.4.
The standard error 0.05 is , computed as divided by the square root of the total variation in `educ`. The Root MSE 3.4 is , the standard error of the regression, estimating the standard deviation of . If education had been less spread out in the sample, the denominator would shrink and the standard error would rise, making the slope less precise.
Common mistakes
- ✗The standard error of the regression is the same as the standard error of . estimates the spread of the errors around the line. The standard error of measures the precision of the slope estimate and equals divided by .
- ✗The variance formula holds without homoskedasticity. It relies on SLR.5, . Under heteroskedasticity this formula is wrong and robust standard errors are needed instead.
- ✗More variation in makes estimates worse. The opposite is true. Greater spread in enlarges the denominator and lowers , so the slope is estimated more precisely.
- ✗Homoskedasticity is needed for OLS to be unbiased. Unbiasedness needs only SLR.1 to SLR.4. SLR.5 is required for the simple variance formula and for OLS to be the best linear unbiased estimator, not for unbiasedness itself.
Revision bullets
- •SLR.5 homoskedasticity: , constant error variance
- •
- •Variance falls with smaller , more spread in , larger
- •; is the standard error of the regression (Root MSE)
- •, reported by Stata
Quick check
Under SLR.1 to SLR.5, the variance of the OLS slope decreases when:
What does the assumption of homoskedasticity, SLR.5, state?
Connected topics
Sources
- Wooldridge (2019), Ch. 2.5Wooldridge, Jeffrey M. Introductory Econometrics: A Modern Approach. 7th ed. Cengage Learning, 2019. ISBN 978-1-337-55886-0.Section 2.5 adds the homoskedasticity assumption SLR.5 and derives the sampling variance of the OLS estimators and the standard error of the regression.
- Wooldridge (2019), §2.5 (Theorem 2.2)Wooldridge, Jeffrey M. Introductory Econometrics: A Modern Approach. 7th ed. Cengage Learning, 2019.Theorem 2.2 gives and the unbiased estimator .