Datasets

Search Datasets

all

verified

More Filters

XQuAD

This dataset is a great resource for researchers who want to evaluate cross-lingual question answering performance.

cc-by-sa-4.0

10k<n<100k

Question Answering

English

Arabic

German

Greek

Spanish

Hindi

Romanian

Russian

Thai

Turkish

Chinese

Vietnamese

by @AIOZNetwork

CommonGen

Building machines with commonsense to compose realistically plausible sentences is challenging. CommonGen is a constrained text generation task, associated with a benchmark dataset, to explicitly test machines for the ability of generative commonsense reasoning. Given a set of common concepts; the task is to generate a coherent sentence describing an everyday sce- nario using these concepts.

mit

10k<n<100k

Text2Text Generation

English

by @AIOZNetwork

TAL-SCQ5K

TAL-SCQ5K are high-quality mathematical competition datasets created by TAL Education Group.

apache-2.0

10k<n<100k

Image-to-Text

Image Classification

English

Chinese

Japanese

Polish

by @AIOZNetwork

X-CSR

To create these datasets, the authors automatically translated the original CSQA and CODAH datasets, originally available only in English, into 15 other languages.

mit

10k<n<100k

Question Answering

Arabic

German

English

Spanish

French

Hindi

Italian

Japanese

Dutch

Polish

Portuguese

Russian

Swahili

Chinese

Vietnamese

Urdu

by @AIOZNetwork

BLiMP

The Benchmark of Linguistic Minimal Pairs, a challenge set for evaluating the linguistic knowledge of language models (LMs) on major grammatical phenomena in English, finds that state-of-the-art models identify morphological contrasts related to agreement reliably, but they struggle with some subtle semantic and syntactic phenomena.

cc-by-4.0

10k<n<100k

Text Classification

English

by @AIOZNetwork

testtest

mit

n<1k

by @skikes123

DOCCI

The DOCCI dataset consists of comprehensive descriptions on 15k images specifically taken with the objective of evaluating T2I and I2T models. These cover a lot of key details in the images, as illustrated below.

cc-by-4.0

10k<n<100k

Text-to-Image

Image-to-Text

English

by @AIOZNetwork

AI2 Reasoning Challenge

The ARC dataset consists of 7,787 science exam questions drawn from a variety of sources, including science questions provided under license by a research partner affiliated with AI2.

cc-by-sa-4.0

1k<n<10k

Question Answering

English

by @AIOZNetwork

MNIST

MNIST is used to train and evaluate image classification models in complex tasks.

mit

10k<n<100k

Image Classification

English

by @AIOZNetwork

NIH Chest X-Ray

NIH Chest X-Ray is a large dataset containing chest X-ray images of patients collected by the National Institutes of Health (NIH) of the United States.

mit

100k<n<1M

Image Classification

English

by @AIOZNetwork

PLOD: An Abbreviation Detection Dataset

This is the repository for PLOD Dataset subset being used for CW in NLP module 2023-2024 at University of Surrey.

cc-by-sa-4.0

1k<n<10k

Token Classification

English

by @AIOZNetwork