Recipe for constructing loss functions

 
  • Using Maximum Likelihood
  • For training data
    • Choose a probability distribution defined over the domain of the predictions y with distribution parameters
    • Choose an ML model where and
    • Training Find the parameters that minimize the Negative Log Likelihood over the training data
    • Inference Either return or the value where this distribution is minimized
  • If data is differently distributed and there is no loss associated, just transform the distribution beforehand