AI Power Progress iA
Resource detail

WIT

Excellent multilingual image-text corpus from Wikipedia/Wikimedia.

audio computer-vision dataset github image-text multimodal repo training-data vision-language vlm-pretraining

Resource Metadata

Category

Vision-language

Provider

GitHub

Type

repo

Level

unknown

Topic

Computer Vision / Multimodal / Audio

Track

Computer Vision / Multimodal / Audio

Section

Open data

Format

Dataset

Status

manual_review

Commercial

manual-review

Featured

yes

Fast start

no

Sequence

nan

Priority

A

Primary source

direct_links_master

Sources

direct_links_master, mega_open_hub

ID

ebe875d50b320bde

Open Resource

Fallback Access