pagesxyz
JobsCompaniesBlogResourcesCommunity
FeedbackContact
JobsCompaniesResourcesBlogContactFeedback

Foundations of Probability

  • What is Probability?
  • Theoretical vs Empirical Probability
  • Three Views of Probability
  • Sample Space and Events
  • Axioms of Probability
  • Independence and Expectation
  • Variance and Standard Deviation
  • Covariance and Correlation
  • Key Inequalities

Set Theory & Combinatorics

  • Set Operations in Probability
  • Counting Methods
  • Advanced Counting

Conditional & Bayesian Probability

  • Conditional Probability
  • Bayes' Theorem
  • Law of Total Probability

Random Variables & Distributions

  • What is a Random Variable?
  • Discrete vs Continuous
  • PDFs and CDFs
  • Expectation, Variance, and Moments

Discrete Distributions

  • Bernoulli and Binomial
  • Poisson and Geometric
  • Negative Binomial and Hypergeometric

Continuous Distributions

  • Uniform and Normal
  • Exponential, Gamma, Beta
  • Heavy-Tailed Distributions

Limit Theorems

  • Law of Large Numbers
  • Central Limit Theorem
  • Convergence in Probability vs Distribution

Frequentist Inference

  • Confidence Intervals
  • Hypothesis Testing
  • p-values and Statistical Decisions
  • Type I and Type II Errors
  • Power and Effect Size
  • Bootstrapping and Resampling

Advanced Probability Tools

  • Law of the Unconscious Statistician
  • Moment Generating Functions
  • Characteristic Functions
  • Markov Chains
  • Stationary Distributions

Bayesian Inference

  • Bayesian Philosophy
  • Prior, Likelihood, Posterior
  • Conjugate Priors
  • MCMC and Modern Computation

Regression Analysis

  • Ordinary Least Squares
  • Multiple Linear Regression
  • Regression Diagnostics
  • Regularization
  • Logistic and Generalized Linear Models

Multivariate Statistics

  • Joint, Marginal, and Conditional
  • Multivariate Normal
  • Covariance Matrices
  • Correlation vs Causation
  • Principal Component Analysis

Stochastic Processes

  • Random Walks
  • Poisson Processes
  • Brownian Motion
  • Itô's Lemma
  • Martingales
  • Geometric Brownian Motion

Simulation & Approximation

  • Monte Carlo Simulation
  • Variance Reduction
  • Bootstrapping for Finance
  • Quasi-Monte Carlo

Time Series

  • Stationarity and Autocorrelation
  • AR, MA, and ARIMA
  • GARCH and Volatility Clustering
  • Cointegration and Pairs Trading
  • Kalman Filters

Information Theory

  • Shannon Entropy
  • Kullback–Leibler Divergence
  • Mutual Information
  • Maximum Entropy

Linear Algebra

  • Vectors, Norms, and Inner Products
  • Matrix Operations
  • Eigenvalues and Eigenvectors
  • Singular Value Decomposition
  • Positive Definite Matrices
  • Numerical Stability

Calculus & Optimization

  • Multivariate Calculus
  • Lagrange Multipliers
  • Convex Optimization
  • Gradient Descent and Variants
  • Stochastic Calculus Primer

Machine Learning Fundamentals

  • Supervised vs Unsupervised
  • Bias–Variance Trade-off
  • Cross-Validation
  • Tree-Based Methods
  • Support Vector Machines
  • Clustering and Dimensionality Reduction
  • Classification Metrics

Deep Learning

  • Feedforward Networks
  • Backpropagation
  • Optimizers and Schedules
  • Regularization in DL
  • Architectures for Finance
  • Loss Functions

Options Pricing

  • Payoffs and Put–Call Parity
  • Risk-Neutral Valuation
  • Binomial Trees
  • Black–Scholes
  • The Greeks
  • Volatility Smile and Surface
  • Exotic Options

Portfolio Theory

  • Mean–Variance Optimization
  • CAPM and Factor Models
  • Sharpe, Sortino, and Information Ratio
  • Black–Litterman
  • Risk Parity

Trading & Risk Applications

  • Value-at-Risk
  • Expected Shortfall
  • Backtesting
  • Market Making Basics
  • Execution and Market Microstructure
  • Statistical Arbitrage
Study Guide/Bayesian Inference
Section 10 · Lesson 10.41

Prior, Likelihood, Posterior

Combining initial belief with evidence to form an updated belief.

Bayes' Rule applied to parameters reads

P(θ∣x)=P(x∣θ) P(θ)P(x)P(\theta \mid x) = \frac{P(x \mid \theta)\, P(\theta)}{P(x)}P(θ∣x)=P(x)P(x∣θ)P(θ)​

In words, posterior is proportional to likelihood times prior:

posterior∝likelihood×prior\text{posterior} \propto \text{likelihood} \times \text{prior}posterior∝likelihood×prior

The proportionality drops the normalizing constant P(x)=∫P(x∣θ)P(θ) dθP(x) = \int P(x \mid \theta) P(\theta)\, d\thetaP(x)=∫P(x∣θ)P(θ)dθ, which is often hard to compute and unnecessary if you only care about relative posterior values.

A worked example: you want to estimate the bias θ\thetaθ of a coin. Pick a uniform prior θ∼Beta(1,1)\theta \sim \mathrm{Beta}(1, 1)θ∼Beta(1,1). Observe 777 heads in 101010 flips. The likelihood is Binomial. Because Beta is conjugate to Bernoulli/Binomial, the posterior is just Beta(1+7,1+3)=Beta(8,4)\mathrm{Beta}(1 + 7, 1 + 3) = \mathrm{Beta}(8, 4)Beta(1+7,1+3)=Beta(8,4), with mean 8/12≈0.678/12 \approx 0.678/12≈0.67. Updating Bayesian beliefs is often this simple — when you pick conjugate priors.

Starting with a Beta(2,2)\mathrm{Beta}(2, 2)Beta(2,2) prior on a coin's bias and observing 555 heads and 333 tails, what is the posterior?

Previous
Bayesian Philosophy
Next
Conjugate Priors