Moments in Time
- large, balanced, and diverse dataset for video understanding
- 1 million video clips that cover 339 classes, and each video lasts around 3 seconds
- average number of video clips per class is 1,757, with a median of 2,775
- videos capture visual and/or audible actions produced by humans, animals, objects, or nature