Fitting
- Bayes risk
- Minimal expected risk over set of all functions
- If minimized → Best possible function
- Capacity of hypothesis space
- It is essentally all possible things. In reg, all possible affine linear fns. In neural networks, all possible specific connection structure.
- If low, is large : Underfitting (Huge difference between best risk and current risk)
- If high, is small : Overfitting (Tiny difference between best risk and current risk)