![](./assets/images/nobg.png)
DataComp
Welcome to DataComp, the machine learning benchmark where the models are fixed and the
challenge is to find the best possible data! Select a setting to learn more about how to participate
CLIP
Contrastive Language Image Pre-training
Select the best subset of image/text pairs from a large pool to train a CLIP model. Evaluate your training set by testing the model on downstream vision tasks
![Left Image](./assets/images/clip3.png)
LM
Language Modeling
Select the best subset of text data from a large pool to train a language model. Evaluate your training set by testing the model on downstream language tasks
![Right Image](./assets/images/lang3.png)