Kupiec Unconditional-Coverage Test
The Kupiec (1995) proportion-of-failures test is the formal version of the exception count. It asks whether the observed exception rate equals the model’s promised tail probability . It uses a likelihood-ratio statistic that compares the model’s probability with the empirical failure frequency . Under the null that the VaR is correctly calibrated, is distributed chi-squared with one degree of freedom. A value above the critical level (about 3.84 at 5%) rejects the model. Its weakness, which motivates Christoffersen, is that it tests only the frequency of exceptions and is blind to whether they cluster in time.
Try it yourself
A correct VaR is breached on a known fraction of days, p = 1 − α. An exception is a day whose loss exceeds the −VaR line. Count them, compare to the expected p·T, and the Kupiec test asks whether the gap is just luck. Push the model below true risk and exceptions pile up.
Why it matters
Counting exceptions is informal; Kupiec makes it a hypothesis test. It compares "the model says 1% should fail" against "actually 4% failed" and asks whether that gap is too large to be luck. The likelihood ratio measures how much better the data fit the observed rate than the promised rate. If that improvement is big enough, the model is rejected. The one thing Kupiec cannot see is timing: ten exceptions spread out and ten exceptions all in one panic week give the same count and the same Kupiec verdict.
Formulas
Worked examples
A 99% VaR () is backtested over days and shows exceptions. Does the Kupiec test reject the model at 5%?
The observed rate is against a promised 1%. Plugging , , into the LR formula gives , which exceeds the chi-squared(1) critical value of 3.84. So the model is rejected: it produces significantly more exceptions than 1% calibration allows, evidence that it understates tail risk.
Common mistakes
- ✗Kupiec tests whether exceptions cluster. It tests only the frequency (unconditional coverage); detecting clustering requires the independence component in Christoffersen’s test.
- ✗The Kupiec statistic follows a chi-squared with two degrees of freedom. has one degree of freedom; the two-degree statistic is the joint Christoffersen .
- ✗Failing the count test by eye is the same as the Kupiec test. Kupiec turns the count into a likelihood-ratio hypothesis test with a precise critical value, so it can accept counts that look high but are within sampling noise.
- ✗A model that passes Kupiec is fully validated. Passing only confirms the right number of exceptions; the model can still fail because those exceptions bunch in time.
Revision bullets
- •Kupiec (1995) proportion-of-failures = unconditional-coverage test
- •Tests whether observed exception rate N/T equals promised p = 1 - alpha
- •Statistic LR_UC is chi-squared with 1 degree of freedom
- •Reject calibration if LR_UC > 3.84 at the 5% level
- •Blind to clustering of exceptions in time (its key limitation)
Quick check
What does the Kupiec proportion-of-failures test assess?
The Kupiec statistic is compared against a chi-squared distribution with how many degrees of freedom?
Connected topics
Sources
- Kupiec, P. H. "Techniques for Verifying the Accuracy of Risk Measurement Models." Journal of Derivatives, 3(2), 73-84, 1995. Also Federal Reserve Board, Finance and Economics Discussion Series 95-24.Introduces the proportion-of-failures (unconditional coverage) likelihood-ratio test for VaR.
- Jorion (2007), Ch. 6Jorion, P. Value at Risk: The New Benchmark for Managing Financial Risk. 3rd ed. McGraw-Hill, 2007.Presents the Kupiec test within the VaR backtesting framework.