A production-ready, Docker-powered RAG system that understands the difference between code and prose. Ingest your codebase and documentation, then query them with full privacy and zero configuration.
Comments
2dogsanerd•25m ago
Hi HN,
I built this "Community Edition" kit because I wanted a clean, self-hosted way to chat with my documents AND my codebase without sending data to the cloud.
It's a Docker Compose setup that wires together:
Docling for parsing (handles PDFs/Tables really well)
ChromaDB for vector storage
FastAPI backend + Simple UI
Ollama for local embeddings/LLM
The main feature: It separates "Code" and "Docs" into different collections with optimized chunking strategies, so you can ask "How does the auth middleware work?" and get actual code snippets back.
It's open source (MIT). I stripped out the heavy async worker queues from my enterprise version to keep this kit lightweight enough for a standard VPS or local machine.
Repo: https://github.com/2dogsandanerd/Knowledge-Base-Self-Hosting...
I'd love your feedback on the architecture or the parsing quality!
2dogsanerd•25m ago