toc: true title: Distillation Token

tags: [‘temp’]


Distillation Token

  • A learned vector that flows through the network along with the transformed Image Data
  • cues the model for its distillation output, which can differ from its class output
  • Specific to Transformers