The Central Limit Theorem says: for i.i.d. random variables with finite mean and variance , the standardized sum
converges in distribution to a standard normal as . This holds regardless of the original distribution of the , as long as the mean and variance exist.
Why it matters
Many phenomena are sums of many small independent effects — measurement errors, biological traits, financial returns. The CLT says their distributions tend toward normal even if the underlying mechanism is wildly non-normal. This is why "assume Gaussian noise" is so often a defensible default.
How fast does it converge
The Berry-Esseen theorem gives a bound: , where . Convergence is faster for more-symmetric, lighter-tailed underlying distributions; slower for heavy tails or strong skew. As a rule of thumb, is "enough" for the sample mean to look approximately normal — but tails take longer.
When it fails
The CLT requires finite variance. Distributions with infinite variance (Cauchy, Pareto with shape ) violate the assumption — their sample means don't concentrate. Heavy-tailed processes look "almost normal" in moderate samples but fail dramatically in the tails.
Practical implications
- Hypothesis tests built on the assumption "the sample mean is normal" (z-test, t-test) work for any underlying distribution provided is large.
- Confidence intervals based on the normal approximation are reliable for when the data isn't too skewed.
- For small samples or heavy-tailed data, use the t-distribution or bootstrap methods instead of the normal.