AI Power Progress iA
Wikimedia Dumps

Public dumps of Wikipedia and other Wikimedia projects; a strong encyclopedic backbone for general knowledge and factual style.

ai-training-data cpt dataset dumps-wikimedia-org open-data rag reference-corpus research-workflow-contribution text training-data

Resource Metadata

Category

Reference corpus

Provider

dumps.wikimedia.org

Type

dataset

Level

unknown

Topic

Research Workflow & Contribution

Track

Research Workflow & Contribution

Section

Open data

Format

Dataset

Status

manual_review

Commercial

manual-review

Featured

yes

Fast start

no

Sequence

nan

Priority

A

Primary source

direct_links_master

Sources

direct_links_master, mega_open_hub, training_data_stack, website_existing

ID

5fd9503fbb5a317a

Continue Learning

Keep momentum with nearby resources and structured tracks.

Learning placement: track: Research Workflow & Contribution · stage: nan

Related Resources

Similar items by topic, tags, and provider (metadata-only).

dataset · nan · docs.openalex.org

OpenAlex

docs.openalex.org

CC0 research graph with snapshot updates; ideal for research retrieval and paper routing.

dataset · nan · info.arxiv.org

arXiv bulk data

info.arxiv.org

Open bulk access for research papers across math, physics, CS, etc.

resource · Build · Zenodo

Zenodo

Zenodo

Repository for datasets, software, and other research outputs with persistent identifiers and open sharing workflows.

course · Zero · OpenStax

OpenStax Subjects

OpenStax

Peer-reviewed, openly licensed textbooks spanning math, science, social science, and more.

resource · Build · Papers with Code

Papers with Code

Papers with Code

Connects papers to code, tasks, benchmarks, and leaderboards so you can move from theory to reproduction.