Similar items by topic, tags, and provider (metadata-only).
datasetopencores.org
opencores.org
hdl_corpus / retrieval
datasetHugging Face
Hugging Face
Massive public dataset hub spanning NLP, code, vision, audio, robotics, and benchmarks.
datasetMozilla
Mozilla
Large multilingual speech dataset project for ASR, speech research, and voice tooling.
datasetdata.worldbank.org
data.worldbank.org
Access: open. Development indicators
datasettonic.readthedocs.io
tonic.readthedocs.io
Access: open. Event-based vision datasets
datasethuggingface.co
huggingface.co
Access: open. Large code corpus
datasetarchive.org
archive.org
Access: open. Q&A dumps
datasetsigmf.org
sigmf.org
Access: open. Standard for RF recordings + metadata (includes examples and tooling)
datasettogether.ai
together.ai
Access: open. Pretraining mix
datasetre3data.org
re3data.org
Access: open. Registry of research data repositories
datasetrcsb.org
rcsb.org
structure_retrieval / molecular_engineering
datasetbuildPHM Society
PHM Society
Public PHM benchmark datasets; practice anomaly detection + remaining useful life.