Softmax
- Output : probabilities
- Softer argmax (0,1)
- Multinoulli
Entropy
- determines Entropy
- If it is 0, and Uniform Distribution and limit to infinity → binary vector which is 0 everywhere except at position i when y is maximal
Temperature
- Higher the T → Softer it the distribution. Aka less confident about distribution
- Lower → Harder. More confident