The Law of Large Numbers (LLN) is the mathematical justification for "more data is better." For independent identically distributed random variables with finite mean , the sample average converges to :
There are two flavors. The weak LLN says convergence holds in probability — for any , . The strong LLN says it holds almost surely — the random sequence converges with probability . The strong version is genuinely stronger; the weak follows from it.
LLN is what makes Monte Carlo simulation work, what justifies A/B testing, and what underpins empirical risk estimation. Average enough samples and you get the expectation, with the precision improving like (per the CLT).
Watch out: LLN requires finite mean. For Cauchy and other distributions where doesn't exist, sample averages never stabilize — they keep jumping by orders of magnitude as grows.