Similar items by topic, tags, and provider (metadata-only).
repoFoundationggml-org
ggml-org
Core local inference stack for CPU / GPU quantized deployment and experimentation.
docsBuildLlamaIndex
LlamaIndex
Conceptual and practical entry point for ingestion, indexing, and retrieval-augmented generation workflows.
repoAdvancedMeta
Meta
Library for efficient similarity search and clustering of dense vectors at large scale.
repoBuildpgvector
pgvector
PostgreSQL extension for embedding similarity search when you want one operational database for app + vectors.
docsBuildOpen WebUI
Open WebUI
Offline-first self-hosted AI interface that works well as a local front-end for models and knowledge tools.
docsAdvancedvLLM
vLLM
High-throughput server for serving open models behind an OpenAI-compatible API.
docsFoundationOllama
Ollama
Official documentation for running and integrating local models with a simple developer workflow.
docsBuilddeepset
deepset
Solid framework for retrieval pipelines, agents, evaluation, and production patterns.
docsBuildQdrant
Qdrant
Simple, strong choice for local and hybrid vector search systems.
docsAdvancedHugging Face
Hugging Face
Useful for supervised fine-tuning, preference tuning, and training experiments.
docsBuildOpen WebUI
Open WebUI
Fastest path to standing up Open WebUI with Docker for a self-hosted local AI stack.
docsBuildOllama
Ollama
API docs for integrating local generation into apps, automations, and RAG pipelines.