AI Power Progress iA
All Resources / Topics / Topic / Common Voice
Resource detail

Common Voice

Large multilingual speech dataset project for ASR, speech research, and voice tooling.

audio audio-data common-voice computer-vision-multimodal-audio dataset open-data research speech tool training-data

Resource Metadata

Category

Speech

Provider

Mozilla

Type

dataset

Level

unknown

Topic

Computer Vision / Multimodal / Audio

Track

Computer Vision / Multimodal / Audio

Section

Open data

Format

Dataset

Status

publishable

Commercial

candidate

Featured

yes

Fast start

no

Sequence

nan

Priority

A

Primary source

mega_open_hub

Sources

mega_open_hub, training_data_stack, website_existing

ID

4cbeb163750602dd

Open Resource

Fallback Access

Continue Learning

Keep momentum with nearby resources and structured tracks.

Learning placement: track: Computer Vision / Multimodal / Audio

Tags: audio audio-data common-voice computer-vision-multimodal-audio dataset open-data research speech tool training-data

Related Resources

Similar items by topic, tags, and provider (metadata-only).

datasetnanstorage.googleapis.com

Open Images V7

storage.googleapis.com

Large supervised vision dataset with labels, boxes, masks, relations, narratives.

datasetLAION

LAION-5B

LAION

Web-scale image-text corpus for multimodal research; use with strong filtering and license review.

datasetnanimage-net.org

ImageNet

image-net.org

Still useful for vision classification baselines.

reponanGitHub

WIT

GitHub

Excellent multilingual image-text corpus from Wikipedia/Wikimedia.

docsFoundationOpenCV

OpenCV Tutorials

OpenCV

Still the fastest practical path to vision preprocessing, classical CV, and real-world image pipelines.