- 1D-ALVINN
- AudioSet classification
- AudioSet
- BUCC
- Benchmark LLM
- Billion Word
- BooksCorpus
- Broden
- CIFAR
- COCO
- CUB-200-2011
- Cityscapes
- CommonCrawl
- DrawBench
- English Wikipedia
- Europarl-ST
- FGVC Aircraft
- FGVCx
- Fashion MNIST
- Fine grained datasets
- Fisher Spanish-English
- Flickr30K
- GLUE
- GTA5
- Google Conceptual Captions
- Google voice search task
- HMDB51
- IDRiD
- ILSVRC
- IMDB
- ISIC 2018
- KITTI
- Kinetics
- Kvasir Dataset
- Labeled Faces in the Wild
- LibriSpeech
- MILANNOTATIONS
- MIT1003
- MIT300
- MLDoc
- MMLU
- MNIST
- MSCOCO
- MUSAN
- Moments in Time
- NIST 2008 Speaker Recognition Evaluation dataset
- NIST SRE 2016 Cantonese
- NLVR2
- OSIE
- PASCAL VOC
- PASCAL-S
- People Art Dataset
- Picasso Dataset
- Places
- Places365
- PlantCLEF
- RACE
- SBU Captions
- SQuAD
- STL-10
- SUNCG
- SYNTHIA
- SALICON dataset
- SceneNet RGB-D
- Shapes Dataset
- Speakers in the Wild
- Stanford Dogs
- Switchboard
- UCF101
- VGGFace2
- VQAv2
- Visual Commonsense Reasoning
- Visual Genome
- WMT14
- Wall Street Journal task
- XLSR
- YFCC100M
- iNaturalist