Moment in Time

  • large balanced and diverse dataset
  • video under-
  • standing
  • 1 million video clips that cover 339 classes, and each video lasts around 3 seconds
  • average number of video clips for each class is 1, 757 with a median of 2, 775
  • videos that capturing visual and/or audible actions, produced by humans, animals, objects or nature