Multiple linear regression generalizes simple regression to many predictors:
Each is the effect of on holding the other predictors fixed. This conditional interpretation is what makes multiple regression powerful, and it's also what gets confused most often.
Three traps to know:
- Multicollinearity. If predictors are highly correlated, individual coefficient variances explode and standard errors balloon. The model still predicts well, but you can't reliably interpret individual coefficients.
- Omitted variable bias. Leaving out a relevant predictor that correlates with included ones biases the included coefficients.
- Interaction effects. Adding lets the effect of depend on the value of — important whenever effects are non-additive.
In quant trading, multivariate regressions on factor returns are the foundation of factor models. The coefficients are the loadings — how much your portfolio moves with the market, with size, with value, and so on.