Similar items by topic, tags, and provider (metadata-only).
video | OpenCV
Video supplement for image processing and computer vision workflows.
docs | Build | OpenCV
Reference docs for computer vision building blocks, APIs, and modules.
docs | Foundation | OpenCV
Still the fastest practical path to vision preprocessing, classical CV, and real-world image pipelines.
dataset | Mozilla
Large multilingual speech dataset project for ASR, speech research, and voice tooling.
dataset | LAION
Web-scale image-text corpus for multimodal research; use with strong filtering and license review.
repo | GitHub
Excellent multilingual image-text corpus from Wikipedia/Wikimedia.
dataset | storage.googleapis.com
Large supervised vision dataset with labels, boxes, masks, relations, narratives.
dataset | commonvoice.mozilla.org
Best open multilingual voice dataset.
dataset | openslr.org
Classic open English speech corpus.
dataset | Build | Microsoft / Google
High-value public corpora for detection, captions, grounding, and multimodal experimentation.
resource | Foundation | PyImageSearch
Project-driven vision learning for detection, OCR, face pipelines, and deployment patterns.
repo | Build | OpenAI
Great practical base for local transcription pipelines and speech experiments.