The idea came from my frustration with existing vector DBs that were either too heavy for experimentation or too opaque to modify. I wanted something simple, modular, and extensible — so I built it.
What it does:
Lets you store, index, and search high-dimensional vectors
Supports multiple indices (Flat, HNSW, IVF, LSH, Annoy)
Has memory, disk, and hybrid storage backends
Includes a full document processing pipeline (parsing, cleaning, chunking, embedding)
Offers quantization, persistence, and plugin-based extensibility
All written in Python, integrated with NumPy, and production-tested with logging and monitoring built in.
Install:
pip install valori
GitHub: https://github.com/varshith-Git/valori
PyPI: https://pypi.org/project/valori
I’d love to hear your thoughts —
What’s missing for you in current vector DBs?
If you’ve built LLM or RAG systems, what do you wish a lightweight, pure Python DB like this handled better?
Would you prefer tighter integrations (LangChain, Haystack, etc.) or a more “build-it-yourself” style?
Feedback, criticism, or collaboration ideas are all welcome. — Varshith (varshith.gudur17@gmail.com )
bendtb•8h ago
steffann•1h ago