Khan Academy
Khan Academy
Math/science/CS foundations from zero.
No single site can turn someone into a “leading contributor in everything.” This page is a curated learning graph for becoming a strong contributor across software, systems, and AI — using open and free resources with official links and licensing notes.
Khan Academy
Math/science/CS foundations from zero.
LibreTexts
Large open subject libraries with remixable content.
University of Minnesota
Discover and adopt openly licensed textbooks by subject.
Yale
Full lecture courses with transcripts, readings, and exams.
The Open University
Short free courses and learning resources across many domains.
OpenStax
High-quality, peer-reviewed textbooks with permissive licensing per title.
Saylor
Self-paced courses (some credit pathways).
MIT
University-level depth with syllabi, problem sets, and readings.
NCBI
Biomedical and life-science reference books and documents.
Harvard (CS50)
First serious CS experience with real assignments and mental models.
GitHub
Hands-on Git/GitHub workflows with guided repositories.
Khan Academy
Math/science/CS foundations from zero.
Linux Journey
Linux fundamentals for dev, ops, and cloud.
SQLBolt
Fast interactive SQL basics for backend work.
MIT
Terminal, editors, Git, debugging, profiling, and tooling fluency.
freeCodeCamp
Daily coding reps + project-based certifications.
Harvard (CS50)
First serious CS experience with real assignments and mental models.
MIT OCW
Discrete math, proofs, probability, graphs, and recurrences.
MIT
University-level depth with syllabi, problem sets, and readings.
UC Berkeley
A rigorous intro sequence that builds strong programming fundamentals.
Nand2Tetris
Build a computer from first principles: logic gates → OS.
OSSU
An open self-taught CS spine with strong course sequencing.
UW–Madison
Virtualization, concurrency, and persistence with a systems mindset.
MIT Press
Mental models for abstraction, interpreters, and computation.
teachyourselfcs.com
Gap-filling map after bootcamps/self-teaching.
Linux Journey
Linux fundamentals for dev, ops, and cloud.
SQLBolt
Fast interactive SQL basics for backend work.
MIT
Terminal, editors, Git, debugging, profiling, and tooling fluency.
freeCodeCamp
Daily coding reps + project-based certifications.
The Odin Project
Project-focused full-stack web development curriculum.
roadmap.sh
Role-based roadmaps (frontend, backend, DevOps, AI, etc).
University of Helsinki
Modern React/Node/TypeScript/full-stack path.
Kubernetes
First deployment mental model once you're shipping services.
Made With ML
Design, develop, deploy, and iterate ML systems.
Full Stack Deep Learning
Building AI-powered products across the full lifecycle.
MIT
University-level depth with syllabi, problem sets, and readings.
Hugging Face
LLMs + tooling: transformers, agents, evaluation, and more.
PyTorch
Official beginner-to-advanced training in the PyTorch ecosystem.
roadmap.sh
Role-based roadmaps (frontend, backend, DevOps, AI, etc).
d2l.ai
Interactive deep learning book with code + math.
Made With ML
Design, develop, deploy, and iterate ML systems.
fast.ai
Practical deep learning for coders with a bias toward shipping.
deeplearningbook.org
Classic reference for deep learning fundamentals.
Full Stack Deep Learning
Building AI-powered products across the full lifecycle.
mlsysbook.ai
Machine learning systems engineering depth.
Hugging Face
Discover, load, and compare open datasets via dataset cards.
Hugging Face
LLMs + tooling: transformers, agents, evaluation, and more.
cocodataset.org
Major object detection/segmentation/captioning benchmark dataset.
OpenSLR
~1,000 hours of English read speech.
OpenSLR
Multi-speaker TTS corpus derived from LibriSpeech/LibriVox.
Mozilla
Crowdsourced speech dataset/platform.
Computer vision dataset with boxes, segmentation, and relationships.
Project Gutenberg
Public-domain books (verify country-specific copyright rules).
Common Crawl
Broad web crawl data; requires careful filtering + dedup.
Allen Institute for AI (AI2)
Large open corpus spanning web, academic works, code, books, and encyclopedic material.
Hugging Face
Cleaned and deduplicated English web data derived from Common Crawl.
Hugging Face
Multilingual extension of FineWeb.
Hugging Face
Educational subset of FineWeb.
LAION
Large-scale image-text training data discovery and reconstruction.
OpenSLR
Large multilingual speech corpus spanning multiple languages.
OpenAlex
Research discovery + metadata for filtered downloads and linking.
NCBI
Reuse-permitted subset of PubMed Central open access content.
LAION
Reproducible iteration of LAION-style indexes with transparent rebuild steps.
Together Computer
Open LLM dataset built from Common Crawl snapshots with quality signals and dedup metadata.
EleutherAI
Large, diverse open-source language-modeling dataset.
BigCode
Permissively licensed source code dataset across many languages.
BigCode
Large code dataset with billions of files across programming and markup languages.
Wikimedia
Monthly dumps of Wikipedia and other Wikimedia projects.
arXiv
Research corpora + metadata pipelines; use via official bulk data guidance.
firstcontributions
Make your first PR with a safe, guided workflow.
GitHub
Hands-on Git/GitHub workflows with guided repositories.
goodfirstissue.dev
Find beginner-friendly issues by language and repository.
GitHub
Contributor/maintainer onboarding, best practices, and community norms.
MIT
Terminal, editors, Git, debugging, profiling, and tooling fluency.
GitHub
Proposing, reviewing, and collaborating on changes.
AI Power Progress iA
A short, structured plan to start shipping progress immediately.
AI Power Progress iA
Plan staged learning tracks and import your workbook package.
AI Power Progress iA
Generate project ideas and build tracks from your catalog and goals.
paperswithcode.com
Find papers + leaderboards + code implementations by task.
MLCommons
Systems and model benchmarking reference points.
mlsysbook.ai
Machine learning systems engineering depth.
OpenAlex
Research discovery + metadata for filtered downloads and linking.
arXiv
Research corpora + metadata pipelines; use via official bulk data guidance.