Similar items by topic, tags, and provider (metadata-only).
datasetre3data.org
re3data.org
Access: open. Registry of research data repositories
datasetdata.europa.eu
data.europa.eu
European Union open data
datasetdata.gov
data.gov
US government open data
datasetBuildphysionet.org
physionet.org
Canonical source for ECG, ICU, waveform, and related biomedical datasets.
datasetfoundationMIT
MIT
Excellent lecture notes, exams, and videos across advanced technical topics.
datasetUCI
UCI
Classic and modern ML datasets that are ideal for education, benchmarking, and tabular experiments.
datasetHugging Face
Hugging Face
Massive public dataset hub spanning NLP, code, vision, audio, robotics, and benchmarks.
datasetMozilla
Mozilla
Large multilingual speech dataset project for ASR, speech research, and voice tooling.
datasetsynbiohub.org
synbiohub.org
Access: open. Standards-based synthetic biology repository
datasetgithub.com
github.com
ic_design / layout / retrieval
datasetlaion.ai
laion.ai
Still an index of URLs/alt-text; reconstructed images have separate rights considerations.
datasetpaperswithcode.com
paperswithcode.com
Access: open. Dataset index across ML tasks