Resource detail

Dolma

Large open corpus spanning web, academic works, code, books, and encyclopedic material.

course dataset

Resource Metadata

Category: base_weights
Provider: allenai.org
Type: dataset
Level: unknown
Topic: general
Track: n/a
Section: n/a
Format: n/a
Status: publishable
Commercial: unknown
Featured: no
Fast start: no
Sequence: n/a
Priority: n/a
Primary source: training_data_stack
Sources: training_data_stack
ID: 26f116e3ff39b6b3
