Sampling Distribution and the Classical Linear Model
Adding MLR.6 normality of the errors, independent of the regressors, to the Gauss-Markov set defines the classical linear model (CLM). Under the CLM the OLS estimators are themselves normally distributed, so the standardized statistics follow exact and distributions in any sample size. This exact-distribution result is what licenses the textbook t test, confidence interval, and F test.
Try it yourself
Unbiased means the estimator centres on the truth across MANY samples — not that one sample is right.
Fix a known process y = β₀ + β₁·x + u with u ~ Normal(0, σ). We draw M fresh samples of size n, and for each one compute the OLS slope β̂₁. The blue histogram is the sampling distribution of those estimates; the gold line marks the true β₁. The mean of the estimates lands on it even though no single sample does.
Setup: x is a fixed evenly-spaced grid on [1, 11] (so Sxx = Σ(xᵢ−x̄)² is known), errors are i.i.d. Normal(0, σ), and the estimator is β̂₁ = Σ(xᵢ−x̄)(yᵢ−ȳ)/Σ(xᵢ−x̄)². Larger n widens Sxx and shrinks the SE; larger σ widens it.
Set n small and watch the histogram spread out while its centre stays on the true β₁. If you only ever collected ONE sample, your single estimate could land far out in that spread. So how can an estimator be "unbiased" and yet be wrong in the one sample you actually have? What does unbiasedness promise you, and what does it not?
Why it matters
Normality of passes straight through to the estimators because each is a linear combination of the errors. That gives clean, exact small-sample distributions rather than approximations. When normality is doubtful you lean on the large-sample asymptotic results instead.
Formulas
Worked examples
You run `regress lwage educ exper` with and want to trust the reported t values exactly.
With such a small sample the exact distribution relies on MLR.6 normality, so a normal-looking residual histogram or a passing `sktest` of residuals supports using the textbook t critical values rather than asymptotic ones.
Common mistakes
- ✗Believing OLS estimation requires normal errors. Estimation and unbiasedness do not; only the exact small-sample t and F distributions do.
- ✗Thinking MLR.6 is needed in large samples. Asymptotic normality replaces it once is big.
- ✗Reading MLR.6 as a claim about the distribution of unconditionally. It is about the error conditional on the regressors.
- ✗Assuming a high implies the errors are normal. Goodness of fit says nothing about the shape of the error distribution.
Revision bullets
- •MLR.6 adds normal errors and defines the classical linear model.
- •Under the CLM, OLS estimators are normal and t, F statistics have exact t, F distributions.
- •Exact small-sample inference relies on MLR.6.
- •Large samples replace MLR.6 with asymptotic normality.
- •MLR.6 concerns the error , not the unconditional distribution of .
Quick check
What does the normality assumption MLR.6 add beyond the Gauss-Markov assumptions?
In the classical linear model, the statistic follows:
Connected topics
Sources
- Wooldridge, Introductory Econometrics, Ch. 4Wooldridge (2019), Introductory Econometrics: A Modern Approach, 7th ed., Sec. 4.1 (CLM)
- Hill, Griffiths & Lim, Ch. 3Hill, Griffiths & Lim (2018), Principles of Econometrics, 5th ed., Ch. 3