Similar items by topic, tags, and provider (metadata-only).
datasetarxiv.org
arxiv.org
Access: open. Quantum physics papers
datasetnaninfo.arxiv.org
info.arxiv.org
Open bulk access for research papers across math, physics, CS, etc.
datasetarxiv.org
arxiv.org
Research corpora + metadata pipelines; use via official bulk data guidance.
datasetarxiv.org
arxiv.org
retrieval_index
datasetlib.ncsu.edu
lib.ncsu.edu
Access: open. arXiv metadata/full text access options
resourceAdvancedOpenAlex / arXiv
OpenAlex / arXiv
Use OpenAlex for metadata and citation graph work and arXiv for current papers and technical preprints.
datasetBuildphysionet.org
physionet.org
Canonical source for ECG, ICU, waveform, and related biomedical datasets.
datasetfoundationMIT
MIT
Excellent lecture notes, exams, and videos across advanced technical topics.
datasetMozilla
Mozilla
Large multilingual speech dataset project for ASR, speech research, and voice tooling.
datasetre3data.org
re3data.org
Access: open. Registry of research data repositories
datasetlaion.ai
laion.ai
Still an index of URLs/alt-text; reconstructed images have separate rights considerations.
datasetpaperswithcode.com
paperswithcode.com
Access: open. Dataset index across ML tasks