AI Power Progress iA
All Resources / Topics / Topic / Mozilla Common Voice + LibriSpeech bundle
Resource detail

Mozilla Common Voice + LibriSpeech bundle

Best public speech base for ASR experimentation and evaluation.

audio build computer-vision dataset datasets intermediate learning-paths mozilla multimodal openslr speech-dataset-bundle

Resource Metadata

Category

Computer Vision / Multimodal / Audio

Provider

Mozilla / OpenSLR

Type

dataset

Level

Build

Topic

Computer Vision / Multimodal / Audio

Track

Computer Vision / Multimodal / Audio

Section

Learning path

Format

Datasets

Status

publishable

Commercial

link-only

Featured

no

Fast start

no

Sequence

6.0

Priority

Standard

Primary source

direct_links_master

Sources

direct_links_master, mega_open_hub, training_data_stack

ID

b81bfcad5d3ab92a

Open Resource

Fallback Access

Continue Learning

Keep momentum with nearby resources and structured tracks.

Learning placement: track: Computer Vision / Multimodal / Audio ยท stage: Build

Tags: audio build computer-vision dataset datasets intermediate learning-paths mozilla multimodal openslr speech-dataset-bundle

Related Resources

Similar items by topic, tags, and provider (metadata-only).

datasetMozilla

Common Voice

Mozilla

Large multilingual speech dataset project for ASR, speech research, and voice tooling.

datasetnanstorage.googleapis.com

Open Images V7

storage.googleapis.com

Large supervised vision dataset with labels, boxes, masks, relations, narratives.

datasetnanimage-net.org

ImageNet

image-net.org

Still useful for vision classification baselines.

docsFoundationOpenCV

OpenCV Tutorials

OpenCV

Still the fastest practical path to vision preprocessing, classical CV, and real-world image pipelines.

reponanGitHub

WIT

GitHub

Excellent multilingual image-text corpus from Wikipedia/Wikimedia.

repoBuildOpenAI

OpenAI Whisper

OpenAI

Great practical base for local transcription pipelines and speech experiments.