Emperical Risk
- TRAINING ERROR. Mean loss computed over training examples
- R(f)=E(X,Y)∼P(X,Y)[l(y,f(x))]
- Remp(h)=N1Σi=1NL(h(xi),yi)
- joint prob distribution P(X∈A,Y=c) is unknown
- Learning set L is finite
- Need an estimator to evaluate it
- Supervised Learning
- Compute Ltrain
- Risk train = (1/M)(sum of loss values for (y, f(x)))
- This is an unbiased estimator, so we can use it to approximate the optimal function f* that minimizes R
- This means that we find argminf∈FR^(f,LTrain) (out of all the possible functions)
- limM→∞(fLTrain∗)=f∗ : converges to the fn that minimizes emprical risk
- Ordinary least squares regression