Zvec: A lightweight, fast, in-process vector database

https://github.com/alibaba/zvec

37•dvrp•1d ago

https://zvec.org/en/

Comments

clemlesne•2h ago

Did someone compared with uSearch (https://github.com/unum-cloud/USearch)?

skybrian•57m ago

Are these sort of similarity searches useful for classifying text?

OutOfHere•39m ago

It altogether depends on the quality and suitability of the provided embedding vector that you provide. Even with a long embedding vector using a recent model, my estimation is that the classification will be better than random but not too accurate. You would typically do better by asking a large model directly for a classification. The good thing is that it is often easy to create a small human labeled dataset and estimate the error confusion matrix via each approach.

CuriouslyC•27m ago

Embeddings are good at partitioning document stores at a coarse grained level, and they can be very useful for documents where there's a lot of keyword overlap and the semantic differentiation is distributed. They're definitely not a good primary recall mechanism, and they often don't even fully pull weight for their cost in hybrid setups, so it's worth doing evals for your specific use case.

esafak•17m ago

You could assign the cluster based on what the k nearest neighbors are, if there is a clear majority. The quality will depend on the suitability of your embeddings.

_pdp_•18m ago

I thought you need memory for these things and CPU is not the bottleneck?

simonw•15m ago

Their self-reported benchmarks have them out-performing pinecone by 7x in queries-per-second: https://zvec.org/en/docs/benchmarks/

I'd love to see those results independently verified, and I'd also love a good explanation of how they're getting such great performance.

News publishers limit Internet Archive access due to AI scraping concerns

uBlock filter list to hide all YouTube Shorts

My smart sleep mask broadcasts users' brainwaves to an open MQTT broker

Zvec: A lightweight, fast, in-process vector database

Ooh.directory: a place to find good blogs that interest you

IBM tripling entry-level jobs after finding the limits of AI adoption

Instagram's URL Blackhole

5,300-year-old 'bow drill' rewrites story of ancient Egyptian tools

Flood Fill vs. The Magic Circle

Amsterdam Compiler Kit

The consequences of task switching in supervisory programming

Discord: A case study in performance optimization

Launching Interop 2026

Ask HN: How to get started with robotics as a hobbyist?

Show HN: Sameshi – a ~1200 Elo chess engine that fits within 2KB

Unicorn Jelly

A review of M Disc archival capability with long term testing results (2016)

Show HN: MOL – A programming language where pipelines trace themselves

Descent, ported to the web

Colored Petri Nets, LLMs, and distributed applications

A header-only C vector database library

An AI agent published a hit piece on me – more things have happened

Windows NT/OS2 Design Workbook

Vim 9.2

YouTube as Storage

A method and calculator for building foamcore drawer organisers

Zig – io_uring and Grand Central Dispatch std.Io implementations landed

How many registers does an x86-64 CPU have? (2020)

Fun with Algebraic Effects – From Toy Examples to Hardcaml Simulations

Show HN: Arcmark – macOS bookmark manager that attaches to browser as sidebar

Zvec: A lightweight, fast, in-process vector database

Comments

News publishers limit Internet Archive access due to AI scraping concerns

uBlock filter list to hide all YouTube Shorts

My smart sleep mask broadcasts users' brainwaves to an open MQTT broker

Zvec: A lightweight, fast, in-process vector database

Ooh.directory: a place to find good blogs that interest you

IBM tripling entry-level jobs after finding the limits of AI adoption

Instagram's URL Blackhole

5,300-year-old 'bow drill' rewrites story of ancient Egyptian tools

Flood Fill vs. The Magic Circle

Amsterdam Compiler Kit

The consequences of task switching in supervisory programming

Discord: A case study in performance optimization

Launching Interop 2026

Ask HN: How to get started with robotics as a hobbyist?

Show HN: Sameshi – a ~1200 Elo chess engine that fits within 2KB

Unicorn Jelly

A review of M Disc archival capability with long term testing results (2016)

Show HN: MOL – A programming language where pipelines trace themselves

Descent, ported to the web

Colored Petri Nets, LLMs, and distributed applications

A header-only C vector database library

An AI agent published a hit piece on me – more things have happened

Windows NT/OS2 Design Workbook

Vim 9.2

YouTube as Storage

A method and calculator for building foamcore drawer organisers

Zig – io_uring and Grand Central Dispatch std.Io implementations landed

How many registers does an x86-64 CPU have? (2020)

Fun with Algebraic Effects – From Toy Examples to Hardcaml Simulations

Show HN: Arcmark – macOS bookmark manager that attaches to browser as sidebar