AI Power Progress
Resource detail

DocVQA

Access: open (login required). Document visual question answering (VQA) datasets.

Tags: dataset multimodal-data open-data

Resource Metadata

Category: Multimodal Data
Provider: docvqa.org
Type: dataset
Level: unknown
Topic: Multimodal Data
Track: n/a
Section: Open Data Directory
Format: n/a
Status: publishable
Commercial: unknown
Featured: no
Fast start: no
Sequence: n/a
Priority: n/a
Primary source: website_existing
Sources: website_existing
ID: 5298629ffd3e1f85

Related Resources

Similar items by topic, tags, and provider (metadata-only).

Re-LAION-5B
laion.ai (dataset)
Still an index of URLs/alt-text; reconstructed images have separate rights considerations.

MMMU
mmmu-benchmark.github.io (dataset)
Access: open. Multimodal reasoning benchmark

HowTo100M
huggingface.co (dataset)
Access: research-only. Video-text dataset

PhysioNet
physionet.org (dataset, Build)
Canonical source for ECG, ICU, waveform, and related biomedical datasets.

OpenNeuro
openneuro.org (dataset, Build)
Best open hub for MRI/EEG/MEG/iEEG-style data.

Wikimedia Dumps
dumps.wikimedia.org (dataset)
Strong encyclopedic backbone for general knowledge and factual style.

Open Images V7
storage.googleapis.com (dataset)
Large supervised vision dataset with labels, boxes, masks, relations, narratives.