Measures of Variability — Section 2: Descriptive Statistics

Two distributions can share the same mean but have very different spreads. Measures of variability quantify how concentrated or dispersed the data is.

Range

Max minus min. Simple but extremely sensitive to outliers — a single freak value dominates. Rarely used as a serious summary.

Interquartile range (IQR)

$Q_3 - Q_1$ , the spread of the middle 50%. Robust to outliers; commonly displayed in box plots. Useful for data with long tails where you care about the bulk.

Variance and standard deviation

$\text{Var}(X) = \frac{1}{n} \sum (x_i - \bar{x})^2$ (or $\frac{1}{n-1}$ for Bessel's correction in samples). The standard deviation $\sigma = \sqrt{\text{Var}}$ is in the same units as the data — more interpretable.

For approximately normal data, ~68% of values lie within 1 $\sigma$ of the mean, ~95% within 2 $\sigma$ , ~99.7% within 3 $\sigma$ (the "68-95-99.7 rule").

Mean absolute deviation (MAD)

$\frac{1}{n} \sum |x_i - \bar{x}|$ . More robust to outliers than variance because it doesn't square the residuals. Less mathematically convenient (no closed-form gradients) which is why variance dominates in practice.

Coefficient of variation

$\sigma / |\mu|$ . Dimensionless. Useful for comparing variability across distributions with different scales — "noise level relative to signal."

Why N-1 for sample variance

Dividing by $n-1$ (Bessel's correction) makes the sample variance an unbiased estimator of the population variance. Dividing by $n$ gives a biased estimator that systematically underestimates. The difference vanishes as $n$ grows — relevant only for small samples.