Conditional probability is the probability of given that has occurred. Definition: , requiring .
Independence
and are independent iff — knowing tells you nothing about . Equivalently, . Most "i.i.d." assumptions in statistics rest on independence; violating it (autocorrelation, network effects) breaks standard inference.
Bayes' theorem
Read: posterior = likelihood × prior / evidence. Bayes lets you flip the conditioning — from "probability of evidence given hypothesis" (often easy) to "probability of hypothesis given evidence" (what you want).
Classic example: medical test
A disease has prevalence . A test has sensitivity and specificity . If a random person tests positive, what's the probability they have the disease?
Despite a "99% accurate" test, half of positives are false alarms — because the disease is rare. Base rates matter.
Naive Bayes classifier
The simplest probabilistic classifier. Assume features are conditionally independent given the class: . Apply Bayes' rule: . Pick the class with the highest posterior. The independence assumption is rarely true but the classifier works surprisingly well — a good baseline.
Prior shapes inference
In Bayesian inference, the prior encodes your beliefs before seeing data. As data accumulates, the posterior is dominated by the likelihood and the prior matters less. With small samples, the prior matters a lot — choosing it carefully (or analyzing sensitivity to it) is part of the modeling job.