Softmax

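  • Output: a vector of probabilities (non-negative entries that sum to 1)
  • A softer version of argmax: each output lies in the open interval (0, 1) instead of being exactly 0 or 1
  • The outputs are the parameters of a multinoulli (categorical) distribution over the classes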
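
The three properties above can be checked with a minimal sketch. The function name softmax and the example scores are illustrative assumptions, not part of the original notes; subtracting the maximum is a common numerical-stability trick and does not change the result.

```python
import math

def softmax(logits):
    """Map a list of real-valued scores to a probability vector."""
    m = max(logits)  # shift by the max so exp() cannot overflow
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for three classes.
probs = softmax([2.0, 1.0, 0.1])
print(probs)       # every entry is strictly between 0 and 1
print(sum(probs))  # the entries sum to 1 (up to rounding)
```

The largest score still receives the largest probability, as with argmax, but the other classes keep some non-zero mass.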

Entropy

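  • The scale of the input y determines the entropy of the output distribution
  • If the scale is 0, the output is the uniform distribution (maximum entropy). In the limit as the scale goes to infinity, the output approaches a binary (one-hot) vector that is 0 everywhere except at the position i where y_i is maximal (entropy 0)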
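
A rough sketch of the two limits, assuming the quantity being varied is a scalar multiplier c applied to the input before the softmax. The names c, y, softmax and entropy are illustrative assumptions, and entropy is measured in nats.

```python
import math

def softmax(y):
    m = max(y)  # shift by the max for numerical stability
    exps = [math.exp(v - m) for v in y]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(p):
    """Shannon entropy in nats; entries equal to 0 contribute nothing."""
    return -sum(q * math.log(q) for q in p if q > 0.0)

y = [1.0, 2.0, 3.0]  # hypothetical input; the maximum is at index 2

for c in (0.0, 1.0, 100.0):
    p = softmax([c * v for v in y])
    print(c, [round(q, 3) for q in p], round(entropy(p), 3))
```

With c = 0 every entry equals 1/3 and the entropy is log(3), the largest value possible for three classes. With c = 100 almost all of the mass sits on index 2 and the entropy is close to 0.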

Temperature

  • Higher T: softer distribution (closer to uniform), i.e. the model is less confident in its prediction
  • Lower T: harder distribution (closer to one-hot), i.e. the model is more confident
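
A small sketch of the effect, assuming the usual convention of dividing the scores by T before the softmax. The function name softmax_with_temperature and the example scores are illustrative assumptions.

```python
import math

def softmax_with_temperature(logits, T=1.0):
    """Softmax of logits / T; T must be positive."""
    if T <= 0:
        raise ValueError("temperature T must be positive")
    scaled = [z / T for z in logits]
    m = max(scaled)  # shift by the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]  # hypothetical scores

# The top probability acts as a simple proxy for confidence.
for T in (0.5, 1.0, 5.0):
    p = softmax_with_temperature(logits, T)
    print(T, [round(q, 3) for q in p], "max:", round(max(p), 3))
```

The top probability shrinks as T grows, so the same scores yield a less confident prediction at high temperature and a more confident one at low temperature.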