Show HN: GitDB – GPU-accelerated vector database with Git-style version control

3•vkaufmann•2h ago
GitDB is a version-controlled vector database. Branch, merge, diff, and time-travel your embeddings the same way git handles source code. No server, no Docker — pip install gitdb-vectors and go.
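For readers new to the idea, here is a minimal sketch of what "commits and time-travel for embeddings" can mean. It is a toy in-memory model, not GitDB's API or storage format: the class `ToyVersionedStore`, its methods, and the tag names are all hypothetical, and it uses brute-force cosine similarity on plain Python tuples.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ToyVersionedStore:
    """Toy illustration only; hypothetical, not GitDB's API."""

    def __init__(self):
        self._working = {}  # doc id -> vector (uncommitted state)
        self._commits = []  # list of (message, snapshot dict)
        self._tags = {}     # tag name -> commit index

    def add(self, doc_id, vector):
        self._working[doc_id] = tuple(vector)

    def commit(self, message, tag=None):
        # Copy the working set so later edits cannot alter this snapshot.
        self._commits.append((message, dict(self._working)))
        index = len(self._commits) - 1
        if tag is not None:
            self._tags[tag] = index
        return index

    def query(self, vector, k=1, at=None):
        # `at` names a tag; None means the latest commit.
        index = len(self._commits) - 1 if at is None else self._tags[at]
        snapshot = self._commits[index][1]
        ranked = sorted(snapshot, key=lambda d: cosine(snapshot[d], vector), reverse=True)
        return ranked[:k]

    def diff(self, old, new):
        # Compare two tagged snapshots: ids added, removed, and changed.
        a = self._commits[self._tags[old]][1]
        b = self._commits[self._tags[new]][1]
        added = sorted(set(b) - set(a))
        removed = sorted(set(a) - set(b))
        changed = sorted(d for d in set(a) & set(b) if a[d] != b[d])
        return added, removed, changed


store = ToyVersionedStore()
store.add("revenue-2023", [1.0, 0.0])
store.commit("init", tag="v1.0")
store.add("revenue-2024", [0.9, 0.1])
store.add("hiring", [0.0, 1.0])
store.commit("add 2024 docs", tag="v2.0")

latest_hit = store.query([0.9, 0.1])          # searches the newest commit
old_hit = store.query([0.9, 0.1], at="v1.0")  # searches the tagged snapshot
changes = store.diff("v1.0", "v2.0")
```

Copying the whole working set on every commit keeps the sketch short; a real system would presumably store deltas or share unchanged data between commits.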

  What makes it different:

  - Version control is native: git log, git diff, git branch, git merge for vectors. Roll back to any commit. Cherry-pick by meaning.
  - Time-travel queries: db.query_text("revenue", at="v1.0") searches an old snapshot.
  - Ceph CRUSH placement: deterministic data routing. Scales horizontally by adding peers, no coordinator.
  - P2P distributed: peers sync over SSH like git remotes. Each node is a full shard.
  - FoundationDB-style features: hooks, transactions, watches, secondary indexes, schema enforcement.
  - Universal ingest: swallow entire SQLite databases, MongoDB exports, CSV, Parquet, and PDF files with a single call, db.ingest("legacy.db").
  - CLI-first: works from the terminal like git, e.g. gitdb init && gitdb add --text "doc" && gitdb commit -m "init"
  - Embedded: no server process. Import and query in 3 lines of Python.
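To illustrate what "deterministic data routing, no coordinator" can look like, here is a small sketch using rendezvous (highest-random-weight) hashing. This is a deliberately simpler stand-in, not CRUSH itself (CRUSH adds a weighted device hierarchy and placement rules), and it is not taken from GitDB's code. The function names `score` and `place` and the peer names are hypothetical.

```python
import hashlib


def score(peer, key):
    """Pseudo-random but reproducible score for a (peer, key) pair."""
    digest = hashlib.sha256(f"{peer}|{key}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def place(key, peers, replicas=1):
    """Return the peers that should hold `key`, highest score first.

    Every node that knows the same peer list computes the same answer,
    so no central coordinator has to be asked.
    """
    ranked = sorted(peers, key=lambda p: score(p, key), reverse=True)
    return ranked[:replicas]


peers = ["peer-a", "peer-b", "peer-c"]
keys = [f"doc-{i}" for i in range(200)]

before = {k: place(k, peers)[0] for k in keys}
after = {k: place(k, peers + ["peer-d"])[0] for k in keys}

# Keys whose owner changed after adding peer-d.
moved = [k for k in keys if before[k] != after[k]]
```

With this scheme, adding a peer only pulls keys onto the new peer; keys never shuffle between the existing ones, which is the property that makes horizontal scaling cheap.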

  21 modules, 13,150 lines, 394 tests. GPU-accelerated on MPS and CUDA; also runs on CPU.

  GitHub: https://github.com/VincentKaufmann/GitDB

  PyPI: pip install gitdb-vectors