AI Power Progress iA
All Resources / Topics / Topic / LibriSpeech
Resource detail

LibriSpeech

Classic open English speech corpus.

asr audio audio-data computer-vision course dataset multimodal open-data openslr-org speech text training-data

Resource Metadata

Category

Speech

Provider

openslr.org

Type

dataset

Level

unknown

Topic

Computer Vision / Multimodal / Audio

Track

Computer Vision / Multimodal / Audio

Section

Open data

Format

Dataset

Status

manual_review

Commercial

manual-review

Featured

yes

Fast start

no

Sequence

nan

Priority

A

Primary source

direct_links_master

Sources

direct_links_master, mega_open_hub, training_data_stack, website_existing

ID

1c655e107cad3396

Open Resource

Fallback Access

Continue Learning

Keep momentum with nearby resources and structured tracks.

Learning placement: track: Computer Vision / Multimodal / Audio ยท stage: nan

Tags: asr audio audio-data computer-vision course dataset multimodal open-data openslr-org speech text training-data

Related Resources

Similar items by topic, tags, and provider (metadata-only).

datasetnanstorage.googleapis.com

Open Images V7

storage.googleapis.com

Large supervised vision dataset with labels, boxes, masks, relations, narratives.

datasetMozilla

Common Voice

Mozilla

Large multilingual speech dataset project for ASR, speech research, and voice tooling.

datasetnanimage-net.org

ImageNet

image-net.org

Still useful for vision classification baselines.

docsFoundationOpenCV

OpenCV Tutorials

OpenCV

Still the fastest practical path to vision preprocessing, classical CV, and real-world image pipelines.

reponanGitHub

WIT

GitHub

Excellent multilingual image-text corpus from Wikipedia/Wikimedia.

datasetLAION

LAION-5B

LAION

Web-scale image-text corpus for multimodal research; use with strong filtering and license review.

videonanOpenCV

OpenCV

OpenCV

Video supplement for image processing and computer vision workflows.