Similar items by topic, tags, and provider (metadata-only).
repogithub.com
github.com
Access: open. Scholarly corpus
datasethuggingface.co
huggingface.co
Access: open (check license). Trivia QA dataset for retrieval + reading comprehension
datasethuggingface.co
huggingface.co
Access: open (check license). Reading comprehension QA dataset
datasethuggingface.co
huggingface.co
Access: open (check license). Open-domain QA dataset (long + short answers)
datasetmicrosoft.github.io
microsoft.github.io
Access: research-only. Large IR/QA dataset
datasethuggingface.co
huggingface.co
Access: open (check license). Multi-hop QA for RAG evaluation
datasethuggingface.co
huggingface.co
Access: open (check license). Fact verification dataset
repogithub.com
github.com
Access: open. Toy conveyor/valve data for acoustic anomaly detection
repogithub.com
github.com
Open web text
repogithub.com
github.com
Access: open. Embedding benchmark suite; task/dataset licenses vary
repogithub.com
github.com
Access: open. MicroPython source/examples
repogithub.com
github.com
Access: open. Compiler test suite