Chapter 6 - Fitting Models

 
  • Minimizing the loss (see Chapter 5 - Loss functions)

  • Basic idea: compute the derivatives (the gradient) of the loss wrt the params, then adjust the params in the direction opposite to the gradient so that the loss decreases

  • Goal is to minimize the loss
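
The update loop described above can be sketched in a few lines. This is a minimal illustration and not a method taken from the chapter: the straight-line model, the mean squared error loss, the toy data, and the names `loss`, `gradient`, `gradient_descent`, `learning_rate` and `steps` are all assumptions chosen for the example.

```python
def loss(params, xs, ys):
    """Mean squared error of the line y = params[0] + params[1] * x."""
    b, w = params
    return sum((b + w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gradient(params, xs, ys):
    """Derivatives of the loss above wrt the intercept b and the slope w."""
    b, w = params
    n = len(xs)
    db = sum(2 * (b + w * x - y) for x, y in zip(xs, ys)) / n
    dw = sum(2 * (b + w * x - y) * x for x, y in zip(xs, ys)) / n
    return [db, dw]


def gradient_descent(params, xs, ys, learning_rate=0.1, steps=500):
    """Repeatedly step the params against the gradient to reduce the loss."""
    params = list(params)
    for _ in range(steps):
        grads = gradient(params, xs, ys)
        params = [p - learning_rate * g for p, g in zip(params, grads)]
    return params


# Toy data (assumed for illustration) generated from y = 1 + 2x
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

start = [0.0, 0.0]
fitted = gradient_descent(start, xs, ys)
print(fitted, loss(fitted, xs, ys))
```

With a fixed learning rate, each step moves the params by the gradient scaled by that rate. A rate that is too large makes the loss oscillate or diverge, and one that is too small makes training slow.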

Gradient Descent

Gabor Model

Qs

  • Why do we need to modify the learning rate while training?
  • How would you decide a learning rate depending on batch size?
  • What optimizer would you use for inference?