Subhaditya's KB

Big Bird

Sep 18, 2024

  • architecture

  • Big Bird: Transformers for Longer Sequences
  • A core limitation of Transformer-based models is that full self-attention has quadratic complexity in the sequence length
  • Big Bird uses a sparse attention mechanism that reduces this quadratic complexity to linear
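
To make the linear-cost claim concrete, here is a minimal sketch of a sparse attention mask that combines the three patterns described in the Big Bird paper: a sliding window, a few global tokens, and a few random keys per query. The function names, parameter names, and default values are assumptions chosen for illustration, and the mask is built per token rather than per block as in the paper. The dense boolean matrix is only there so the pattern can be inspected; an efficient implementation would never materialize it.

```python
import numpy as np


def sparse_attention_mask(seq_len, window=3, num_global=2, num_random=2, seed=0):
    """Return a boolean mask where mask[i, j] is True if query i may attend to key j.

    Illustrative token-level sketch only; names and defaults are hypothetical.
    """
    rng = np.random.default_rng(seed)
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    idx = np.arange(seq_len)

    # Sliding window: each token sees neighbours within window // 2 positions.
    mask |= np.abs(idx[:, None] - idx[None, :]) <= window // 2

    # Global tokens: the first num_global tokens attend to every position
    # and are attended to by every position.
    mask[:num_global, :] = True
    mask[:, :num_global] = True

    # Random attention: each query additionally sees num_random random keys.
    for i in range(seq_len):
        mask[i, rng.choice(seq_len, size=num_random, replace=False)] = True

    return mask


def count_attended_pairs(seq_len, **kwargs):
    """Count the (query, key) pairs that the sparse mask allows."""
    return int(sparse_attention_mask(seq_len, **kwargs).sum())


if __name__ == "__main__":
    for n in (64, 128, 256):
        print(n, count_attended_pairs(n), n * n)
```

With fixed window, global, and random budgets, every non-global row allows at most `window + num_global + num_random` keys, so the number of allowed pairs grows in proportion to the sequence length, while full attention grows with its square.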
