Show HN: KektorDB – Lightweight, Embeddable Vector+Graph Database Written in Go

3•san0n•1mo ago

Comments

san0n•1mo ago

Hi HN, author here.

I started KektorDB as a personal challenge to learn Go and database internals. I got hooked quickly, though, and wanted the project to be more than a simple "toy project".

I didn’t follow a rigid roadmap; I iterated based on what felt right. I started by implementing caching and a semantic firewall, and from there, the step towards an integrated RAG pipeline was natural.

To be honest, the choice to integrate RAG came from laziness. I tried building a system using Python and LangChain, but I hated managing external scripts and dependencies just to make data talk to the LLM. I wanted a "batteries-included" solution.

However, the first results of my "naive" RAG were disappointing. That’s why I decided to integrate a lightweight graph (to semantically link chunks) and techniques like HyDE directly into the engine, all while keeping one fixed constraint: it must remain a single binary, easily embeddable as a Go library.
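To make the chunk-linking idea concrete, here is a minimal sketch of how graph neighbors can widen a retrieval result. It is not KektorDB's actual code or API: `ChunkGraph` and `ExpandWithNeighbors` are hypothetical names, and a plain in-memory adjacency list stands in for the real graph layer.

```go
package main

import "fmt"

// ChunkGraph is a hypothetical adjacency list: chunk ID -> IDs of linked chunks.
type ChunkGraph map[string][]string

// ExpandWithNeighbors returns the seed hits first, then their direct graph
// neighbors, skipping duplicates and stopping once limit results are collected.
func ExpandWithNeighbors(g ChunkGraph, seeds []string, limit int) []string {
	if limit <= 0 {
		return nil
	}
	seen := make(map[string]bool, len(seeds))
	out := make([]string, 0, limit)
	add := func(id string) {
		if !seen[id] && len(out) < limit {
			seen[id] = true
			out = append(out, id)
		}
	}
	// Seeds keep their original ranking from the vector search.
	for _, s := range seeds {
		add(s)
	}
	// Then append one hop of linked chunks as extra context.
	for _, s := range seeds {
		for _, n := range g[s] {
			add(n)
		}
	}
	return out
}

func main() {
	g := ChunkGraph{
		"c1": {"c2", "c3"},
		"c4": {"c3", "c5"},
	}
	// Pretend "c1" and "c4" came back from a vector search.
	fmt.Println(ExpandWithNeighbors(g, []string{"c1", "c4"}, 10))
}
```

The point of the sketch is that vector hits alone can miss adjacent context; following one hop of links pulls in related chunks before they reach the LLM.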

While KektorDB is a general-purpose embeddable Vector + Graph database, its RAG pipeline is intentionally designed as a practical default. It's not a replacement for complex, heavily customized RAG infrastructures, but a way to get a local system working quickly.

Here is a quick overview of the features:

- HNSW Indexing: With support for Float32, Float16, and Int8 quantization.

- Hybrid Search: Combines vector similarity with BM25 keyword scoring for better accuracy.

- Graph Layer: Maintains a generic adjacency graph alongside vectors. Although the RAG pipeline uses it to link chunks, the system also exposes APIs to define arbitrary relationships, which enables semantic traversal.

- Persistence: AOF (Append-Only File) + Snapshot.

- RAG Features: Background worker for document ingestion + integrated proxy for query rewriting and Grounded HyDE (OpenAI-compatible).
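As an illustration of the hybrid search item above, this is one common way to combine the two signals: a weighted sum of vector similarity and max-normalized BM25. It is only a sketch under assumptions, not necessarily the fusion formula KektorDB uses; `Hit`, `FuseScores`, and the `alpha` weight are hypothetical names.

```go
package main

import (
	"fmt"
	"sort"
)

// Hit is a hypothetical result record: a document ID and its fused score.
type Hit struct {
	ID    string
	Score float64
}

// FuseScores combines vector similarity (assumed to be in [0, 1]) with BM25
// scores (unbounded, so divided by the maximum) as
// alpha*vector + (1-alpha)*keyword. A missing score counts as 0.
func FuseScores(vec, bm25 map[string]float64, alpha float64) []Hit {
	maxBM := 0.0
	for _, s := range bm25 {
		if s > maxBM {
			maxBM = s
		}
	}
	// Collect the union of IDs seen by either ranker.
	ids := make(map[string]bool, len(vec)+len(bm25))
	for id := range vec {
		ids[id] = true
	}
	for id := range bm25 {
		ids[id] = true
	}
	hits := make([]Hit, 0, len(ids))
	for id := range ids {
		kw := 0.0
		if maxBM > 0 {
			kw = bm25[id] / maxBM
		}
		hits = append(hits, Hit{ID: id, Score: alpha*vec[id] + (1-alpha)*kw})
	}
	// Highest score first; ties broken by ID for deterministic output.
	sort.Slice(hits, func(i, j int) bool {
		if hits[i].Score != hits[j].Score {
			return hits[i].Score > hits[j].Score
		}
		return hits[i].ID < hits[j].ID
	})
	return hits
}

func main() {
	vec := map[string]float64{"a": 0.9, "b": 0.6}
	bm25 := map[string]float64{"b": 8, "c": 4}
	for _, h := range FuseScores(vec, bm25, 0.5) {
		fmt.Printf("%s %.2f\n", h.ID, h.Score)
	}
}
```

With `alpha = 0.5`, document "b" ranks first because it matches on both signals, even though "a" has the higher vector similarity.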

Current Limitations:

1. It is currently RAM-bound (graph and vectors live in memory). I am working on a hybrid disk-storage engine.

2. Ingestion parsing can be improved (especially regarding tables in PDFs).

The code is pure Go (with optional Rust kernels for specific SIMD operations), all contained in a single binary.

The project started out of a desire to learn, but I would like to continue developing it seriously. For this reason, I would appreciate any kind of technical advice or feedback.

Thanks for reading.

Repository: https://github.com/sanonone/kektordb
