Similar items by topic, tags, and provider (metadata-only).
datasetUCI
UCI
Classic and modern ML datasets that are ideal for education, benchmarking, and tabular experiments.
datasetnanopenslr.org
openslr.org
Classic open English speech corpus.
datasetMozilla
Mozilla
Large multilingual speech dataset project for ASR, speech research, and voice tooling.
datasetzephyrproject.org
zephyrproject.org
Access: open. RTOS and samples
datasetyosyshq.net
yosyshq.net
Access: open. Synthesis suite
datasetriot-os.org
riot-os.org
Access: open. RTOS
datasetBuildphysionet.org
physionet.org
Canonical source for ECG, ICU, waveform, and related biomedical datasets.
datasetBuildopenneuro.org
openneuro.org
Best open hub for MRI/EEG/MEG/iEEG style data.
datasetfoundationMIT
MIT
Excellent lecture notes, exams, and videos across advanced technical topics.
datasetnandumps.wikimedia.org
dumps.wikimedia.org
Strong encyclopedic backbone for general knowledge and factual style.
datasetnanHugging Face
Hugging Face
Top open code corpus; huge language coverage.
datasetnanstorage.googleapis.com
storage.googleapis.com
Large supervised vision dataset with labels, boxes, masks, relations, narratives.