Bag of N-grams

  • consider word phrases of length n to represent documents as fixed-length vectors to capture local word order
  • suffer from data Sparsity and high dimensionality.