
ANN v3: 200ms p99 query latency over 100B vectors

https://turbopuffer.com/blog/ann-v3
29•_peregrine_•3d ago

Comments

jascha_eng•3d ago
This is legitimately pretty impressive. I think the rule of thumb is now: go with Postgres (pgvector) for vector search until it breaks, then go with turbopuffer.
_peregrine_•2d ago
seems like a good rule of thumb to me! though i would perhaps lump "cost" into the "until it breaks" equation. even with decent perf, pgvector's economics can be much worse, especially in multi-tenant scenarios where you need many small indexes (this is true of any vector DB that builds indexes primarily on RAM/SSD)
shayonj•1h ago
v cool and impressive!
lmeyerov•49m ago
Fun!

I was curious given the cloud discussion - a quick search suggests default AWS SSD bandwidth is 250 MB/s, and you can pay more for 1 GB/s. Similar for S3: one HTTP connection is < 100 MB/s, and you can pay for more parallel connections. So the hot binary quantized search index is doing a lot of work to minimize both, for the initial hot queries and when pruning later fetches. Very cool!
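
To put rough numbers on that, here is a back-of-envelope sketch using the bandwidth figures above. The embedding dimension and the number of vectors scanned per query are assumptions chosen only for illustration; they are not taken from the blog post. It compares how long it would take to stream the same candidates as float32 vectors versus 1-bit-per-dimension codes.

```python
# Back-of-envelope sketch. DIMS and CANDIDATES are assumptions for
# illustration, not figures from the blog post or this thread.

def vector_bytes(dims: int, bits_per_dim: int) -> int:
    """Size of one vector in bytes, rounded up to a whole byte."""
    return (dims * bits_per_dim + 7) // 8


def read_seconds(num_bytes: int, bandwidth_mb_per_s: float) -> float:
    """Time to stream num_bytes at the given bandwidth (1 MB = 10**6 bytes)."""
    return num_bytes / (bandwidth_mb_per_s * 1_000_000)


DIMS = 1024            # assumed embedding dimension
CANDIDATES = 100_000   # assumed number of vectors scanned per query

full = vector_bytes(DIMS, 32) * CANDIDATES    # float32 vectors
binary = vector_bytes(DIMS, 1) * CANDIDATES   # binary quantized codes

# Bandwidths quoted in the comment above; 100 MB/s is the stated upper
# bound for a single S3 HTTP connection.
tiers = [
    ("S3, one connection (upper bound)", 100),
    ("SSD, default", 250),
    ("SSD, paid tier", 1000),
]

for label, mb_per_s in tiers:
    print(
        f"{label}: float32 {read_seconds(full, mb_per_s):.3f}s, "
        f"binary {read_seconds(binary, mb_per_s):.3f}s"
    )
```

Under these assumptions the binary codes are 32x smaller than the float32 vectors, so the same scan needs 32x less bandwidth at every tier. That is the general reason a quantized hot index helps; the real system's layout and numbers may differ.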

kgeist•30m ago
Are there vector DBs with 100B vectors in production that work well? There was a paper which showed that there's a 12% loss in accuracy at just 1 million vectors. Maybe some kind of logical sharding is another option.
_peregrine_•13m ago
the solution described in the blog post is currently in production at 100B vectors

This paper has been cited more than 6k times. It's fatally flawed.

https://statmodeling.stat.columbia.edu/2026/01/22/aking/
312•timr•5h ago•134 comments

Deutsche Telekom is violating Net Neutrality

https://netzbremse.de/en/
397•tietjens•6h ago•200 comments

Show HN: Bonsplit – Tabs and splits for native macOS apps

https://bonsplit.alasdairmonk.com
66•sgottit•3h ago•10 comments

Iran Protest Death Toll Could Top 30k, According to Local Health Officials

https://time.com/7357635/more-than-30000-killed-in-iran-say-senior-officials/
97•mhb•1h ago•19 comments

Google confirms 'high-friction' sideloading flow is coming to Android

https://www.androidauthority.com/google-sideloading-android-high-friction-process-3633468/
343•_____k•5d ago•311 comments

Introduction to PostgreSQL Indexes

https://dlt.github.io/blog/posts/introduction-to-postgresql-indexes/
126•dlt•6h ago•5 comments

Doom has been ported to an earbud

https://doombuds.com
36•arin-s•2h ago•8 comments

Show HN: TUI for managing XDG default applications

https://github.com/mitjafelicijan/xdgctl
39•mitjafelicijan•3h ago•12 comments

BirdyChat becomes first European chat app that is interoperable with WhatsApp

https://www.birdy.chat/blog/first-to-interoperate-with-whatsapp
668•joooscha•19h ago•416 comments

Jurassic Park - Tablet device on Nedry's desk? (2012)

https://www.therpf.com/forums/threads/jurassic-park-tablet-device-on-nedrys-desk.169883/
74•exvi•5h ago•29 comments

Nango (YC W23, Dev Infrastructure) Is Hiring Remotely

https://jobs.ashbyhq.com/Nango
1•bastienbeurier•2h ago

Web-based image editor modeled after Deluxe Paint

https://github.com/steffest/DPaint-js
6•bananaboy•2h ago•0 comments

Bridging the Gap Between PLECS and SPICE

https://erickschulz.dev/posts/plecs-spice/
6•eschu•4h ago•1 comment

Adoption of EVs tied to real-world reductions in air pollution: study

https://keck.usc.edu/news/adoption-of-electric-vehicles-tied-to-real-world-reductions-in-air-poll...
466•hhs•14h ago•419 comments

BU-808: How to Prolong Lithium-based Batteries (2023)

https://www.batteryuniversity.com/article/bu-808-how-to-prolong-lithium-based-batteries/
34•eswat•2d ago•6 comments

A Lament for Aperture

https://ikennd.ac/blog/2026/01/old-man-yells-at-modern-software-design/
148•firloop•4d ago•32 comments

The Rebirth of Pennsylvania's Infamous Burning Town

https://www.atlasobscura.com/articles/centralia-pennsylvania-rebirth
24•pbshgthm•5d ago•3 comments

Alarm overload is undermining safety at sea as crews face thousands of alerts

https://www.lr.org/en/knowledge/press-room/press-listing/press-release/2026/alarm-overload-is-und...
36•geox•2h ago•19 comments

I built a 2x faster lexer, then discovered I/O was the real bottleneck

https://modulovalue.com/blog/syscall-overhead-tar-gz-io-performance/
67•modulovalue•4d ago•33 comments

Show HN: AutoShorts – Local, GPU-accelerated AI video pipeline for creators

https://github.com/divyaprakash0426/autoshorts
50•divyaprakash•7h ago•20 comments

Hands-On with Two Apple Network Server Prototype ROMs

http://oldvcr.blogspot.com/2026/01/hands-on-with-two-apple-network-server.html
22•todsacerdoti•6h ago•0 comments

David Patterson: Challenges and Research Directions for LLM Inference Hardware

https://arxiv.org/abs/2601.05047
89•transpute•12h ago•10 comments

Sony Data Discman

https://huguesjohnson.com/random/sony-ebook/
28•naves•6h ago•1 comment

Two Weeks Until Tapeout

https://essenceia.github.io/projects/two_weeks_until_tapeout/
160•client4•13h ago•14 comments

Show HN: LangGraph architecture that scales (hexagonal pattern, 110 tests)

https://github.com/cleverhoods/sagecompass
3•cleverhoods•5d ago•0 comments

Article on the History of Spot Instances: Analyzing Spot Instance Pricing Change

https://spot.rackspace.com/blogs/history-of-spot-instances
7•aleroawani•4d ago•0 comments

Postmortem: Our first VLEO satellite mission (with imagery and flight data)

https://albedo.com/post/clarity-1-what-worked-and-where-we-go-next
193•topherhaddad•18h ago•63 comments

Claude Code's new hidden feature: Swarms

https://twitter.com/NicerInPerson/status/2014989679796347375
463•AffableSpatula•1d ago•301 comments

Intrinsically stretchable 2D MoS2 transistors

https://www.nature.com/articles/s41467-026-68504-2
16•bookofjoe•4d ago•0 comments