frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Vector database that can index 1B vectors in 48M

https://www.vectroid.com/blog/why-and-how-we-built-Vectroid
42•mathewpregasen•2h ago

Comments

ge96•1h ago
M is minutes
ikanade•1h ago
Legend
HarHarVeryFunny•1h ago
I was starting to think this was impressive, if not impossible. 1B vectors in 48 MB of storage => < 1 bit per vector.

Maybe not impossible using shared/lossy storage if they were sparsely scattered over a large space ?

But anyways - minutes. Thanks.

Edit: Gemini suggested that this sort of (lossy) storage size could be achieved using "Product Quantization" (sub vectors, clustering, cluster indices), giving an example of 256 dimensional vectors being stored at an average of 6 bits per vector, with ANN being one application that might use this.

l5870uoo9y•1h ago
Thankfully not months.
softwaredoug•1h ago
Oh the horrors of search indexing Ive seen... including weeks / months to rebuild an index.
stevemk14ebr•1h ago
Thank you, title needs edited.
OutOfHere•1h ago
Proprietary closed-source lock-in. Nothing to see here.
CuriouslyC•1h ago
Seriously. The amount of lift a SaaS product needs to give me is insane for me to even bother evaluating it, and there's a near zero percent chance I'll use it in my core.
kcb•9m ago
Especially a product that demands access to large quantities of your most sensitive data to be useful.
stronglikedan•1h ago
Nothing for you to see here. Surely you just aren't their target customer.
OutOfHere•54m ago
So who is? Who really needs to index 1 billion new vectors every 48 minutes, or perhaps equivalently 1 million new vectors every 3 seconds?
HEmanZ•51m ago
What do you think an alternative is for someone who:

1. Has a technical system they think could be worth a fortune to large enterprises, containing at least a few novel insights to the industry.

2. Knows that competitors and open source alternatives could copy/implement these in a year or so if the product starts off open source.

3. Has to put food on the table and doesn’t want to give massive corporations extremely valuable software for free.

Open source has its place, but it is IMO one of the ways to give monopolies massive value for free. There are plenty of open source alternatives around for vector DBs. Do we (developers) need to give everything away to the rich

softwaredoug•1h ago
Not trying to be snarky, just curious -- How is this different from TurboPuffer and other serverless, object storage backed vector DBs?
ashvardanian•1h ago
Very curious about the hardware setup used for this benchmark!
esafak•23m ago
By the creator of the real-time data platform https://en.wikipedia.org/wiki/Hazelcast.

UTF-8 is a brilliant design

https://iamvishnu.com/posts/utf8-is-brilliant-design
69•vishnuharidas•1h ago•30 comments

EU court rules nuclear energy is clean energy

https://www.weplanet.org/post/eu-court-rules-nuclear-energy-is-clean-energy
236•mpweiher•1h ago•122 comments

QGIS is a free, open-source, cross platform geographical information system

https://github.com/qgis/QGIS
97•rcarmo•2h ago•25 comments

Many hard LeetCode problems are easy constraint problems

https://buttondown.com/hillelwayne/archive/many-hard-leetcode-problems-are-easy-constraint/
290•mpweiher•4h ago•206 comments

Rust: A quest for performant, reliable software [video]

https://www.youtube.com/watch?v=k_-6KI3m31M
31•raphlinus•11h ago•0 comments

The treasury is expanding the Patriot Act to attack Bitcoin self custody

https://www.tftc.io/treasury-iexpanding-patriot-act/
479•bilsbie•7h ago•370 comments

How FOSS Projects Handle Legal Takedown Requests

https://f-droid.org/2025/09/10/how-foss-projects-handle-legal-takedown-requests.html
35•mkesper•2h ago•4 comments

3D modeling with paper

https://www.arvinpoddar.com/blog/3d-modeling-with-paper
175•joshuawootonn•5h ago•28 comments

Humanely dealing with humungus crawlers

https://flak.tedunangst.com/post/humanely-dealing-with-humungus-crawlers
39•freediver•2h ago•6 comments

Vector database that can index 1B vectors in 48M

https://www.vectroid.com/blog/why-and-how-we-built-Vectroid
44•mathewpregasen•2h ago•15 comments

Qwen3-Next

https://qwen.ai/blog?id=4074cca80393150c248e508aa62983f9cb7d27cd&from=research.latest-advancement...
479•tosh•13h ago•188 comments

Advanced Scheme Techniques (2004) [pdf]

https://people.csail.mit.edu//jhbrown/scheme/continuationslides04.pdf
75•mooreds•3h ago•7 comments

Windows-Use: an AI agent that interacts with Windows at GUI layer

https://github.com/CursorTouch/Windows-Use
72•djhu9•3d ago•12 comments

Oq: Terminal OpenAPI Spec Viewer

https://github.com/plutov/oq
63•der_gopher•4h ago•9 comments

How to Become a Pure Mathematician (Or Statistician)

http://hbpms.blogspot.com/
27•ipnon•3d ago•3 comments

Power series, power serious (1999) [pdf]

https://www.cambridge.org/core/services/aop-cambridge-core/content/view/19863F4EAACC33E1E01DE2A21...
7•signa11•2d ago•1 comments

Building a Deep Research Agent Using MCP-Agent

https://thealliance.ai/blog/building-a-deep-research-agent-using-mcp-agent
44•saqadri•2d ago•9 comments

Doom-ada: Doom Emacs Ada language module with syntax, LSP and Alire support

https://github.com/tomekw/doom-ada
58•tomekw•4h ago•5 comments

Why do browsers throttle JavaScript timers?

https://nolanlawson.com/2025/08/31/why-do-browsers-throttle-javascript-timers/
16•vidyesh•2h ago•11 comments

VaultGemma: The most capable differentially private LLM

https://research.google/blog/vaultgemma-the-worlds-most-capable-differentially-private-llm/
40•meetpateltech•3h ago•10 comments

Racintosh Plus – Rackmount Mac Plus

http://www.identity4.com/2025-racintosh-plus/
103•zdw•3d ago•19 comments

K2-Think: A Parameter-Efficient Reasoning System

https://arxiv.org/abs/2509.07604
9•mgl•2h ago•2 comments

Groundbreaking Brazilian Drug, Capable of Reversing Spinal Cord Injury

https://www1.folha.uol.com.br/internacional/en/scienceandhealth/2025/09/groundbreaking-brazilian-...
11•_aleph2c_•28m ago•0 comments

Show HN: DWS OS, a Plan 9 Inspired Web “OS”

https://dws.rip
38•tdubey•4h ago•8 comments

Chat Control faces blocking minority in the EU

https://twitter.com/TutaPrivacy/status/1966384776883142661
329•miohtama•6h ago•104 comments

A beginner's guide to extending Emacs

https://blog.tjll.net/a-beginners-guide-to-extending-emacs/
114•ibobev•4h ago•13 comments

Show HN: I made a generative online drum machine with ClojureScript

https://dopeloop.ai/beat-maker/
146•chr15m•10h ago•27 comments

Ships are sailing with fake insurance from the Norwegian Ro Marine

https://www.nrk.no/vestland/xl/over-100-ships-have-sailed-without-legitimate-insurance-from-the-n...
190•aregue•5h ago•86 comments

Show HN: An MCP Gateway to block the lethal trifecta

https://github.com/Edison-Watch/open-edison
32•76SlashDolphin•4h ago•14 comments

Debian 13, Postgres, and the US time zones

https://rachelbythebay.com/w/2025/09/11/debtz/
255•move-on-by•17h ago•132 comments