AI Power Progress iA

MS MARCO

Access: research-only. Large-scale information retrieval (IR) and question answering (QA) dataset

Tags: dataset open-data rag research

Resource Metadata

Category: RAG
Provider: microsoft.github.io
Type: dataset
Level: unknown
Topic: RAG
Track: n/a
Section: Open Data Directory
Format: n/a
Status: publishable
Commercial: unknown
Featured: no
Fast start: no
Sequence: n/a
Priority: n/a
Primary source: website_existing
Sources: website_existing
ID: 2939309e984b4365

Related Resources

Similar items by topic, tags, and provider (metadata-only).

dataset

TriviaQA

huggingface.co

Access: open (check license). Trivia QA dataset for retrieval and reading comprehension

dataset

SQuAD

huggingface.co

Access: open (check license). Reading comprehension QA dataset

dataset

Natural Questions

huggingface.co

Access: open (check license). Open-domain QA dataset (long and short answers)

dataset

HotpotQA

huggingface.co

Access: open (check license). Multi-hop QA for RAG evaluation

dataset

FEVER

huggingface.co

Access: open (check license). Fact verification dataset

repo

BEIR

github.com

Access: open. IR benchmark datasets

dataset Build

PhysioNet

physionet.org

Canonical source for ECG, ICU, waveform, and related biomedical datasets.

dataset Build

OpenNeuro

openneuro.org

Open hub for MRI, EEG, MEG, and iEEG data.

dataset

Wikimedia Dumps

dumps.wikimedia.org

Strong encyclopedic backbone for general knowledge and factual style.