AI Power Progress iA
Resource detail

SQuAD

Access: open (check license). Reading comprehension QA dataset

dataset open-data rag

Resource Metadata

Category

RAG

Provider

huggingface.co

Type

dataset

Level

unknown

Topic

RAG

Track

n/a

Section

Open Data Directory

Format

n/a

Status

publishable

Commercial

unknown

Featured

no

Fast start

no

Sequence

n/a

Priority

n/a

Primary source

website_existing

Sources

website_existing

ID

f0cfade70061e3bb

Open Resource

Fallback Access

Continue Learning

Keep momentum with nearby resources and structured tracks.

Tags: dataset open-data rag

Related Resources

Similar items by topic, tags, and provider (metadata-only).

datasethuggingface.co

TriviaQA

huggingface.co

Access: open (check license). Trivia QA dataset for retrieval + reading comprehension

datasethuggingface.co

Natural Questions

huggingface.co

Access: open (check license). Open-domain QA dataset (long + short answers)

datasethuggingface.co

HotpotQA

huggingface.co

Access: open (check license). Multi-hop QA for RAG evaluation

datasethuggingface.co

FEVER

huggingface.co

Access: open (check license). Fact verification dataset

datasetmicrosoft.github.io

MS MARCO

microsoft.github.io

Access: research-only. Large IR/QA dataset

repogithub.com

BEIR

github.com

Access: open. IR benchmark datasets

datasetBuildphysionet.org

PhysioNet

physionet.org

Canonical source for ECG, ICU, waveform, and related biomedical datasets.

datasetBuildopenneuro.org

OpenNeuro

openneuro.org

Best open hub for MRI/EEG/MEG/iEEG style data.

datasetnandumps.wikimedia.org

Wikimedia Dumps

dumps.wikimedia.org

Strong encyclopedic backbone for general knowledge and factual style.

datasetnanHugging Face

FineWeb

Hugging Face

Huge cleaned English web corpus; best raw breadth for LLM pretraining.