frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: TurboQuant for vector search – 2-4 bit compression

https://github.com/RyanCodrai/py-turboquant
37•justsomeguy1996•5d ago

Comments

justsomeguy1996•5d ago
I built a Python implementation of Google's TurboQuant paper (ICLR 2026) for vector search. The key thing that makes this different from PQ and other quantization methods: it's fully data-oblivious. The codebook is derived from math (not trained on your data), so you can add vectors online without ever rebuilding the index. Each vector encodes independently in ~4ms at d=1536.

The repo reproduces the benchmarks from Section 4.4 of the paper — recall@1@k on GloVe (d=200) and OpenAI embeddings (d=1536, d=3072). At 4-bit on d=1536, you get 0.967 recall@1@1 with 8x compression. At 2-bit, 0.862 recall@1@1 with ~16x compression.

Paper: https://arxiv.org/abs/2504.19874

pidtom•1h ago
I built TurboQuant+ (https://github.com/TheTom/llama-cpp-turboquant), the llama.cpp implementation of this paper with extensions: asymmetric K/V compression, boundary layer protection, sparse V dequant, and this week weight compression (TQ4_1S) that shrinks models 28-42%% on disk with minimal quality loss. 5k+ stars, 50+ community testers across Metal, CUDA, and AMD HIP.

Cool to see the same WHT + Lloyd-Max math applied to vector search. The data-oblivious codebook property is exactly what makes it work for online KV cache compression too. No calibration, no training, just quantize and go.

If anyone is running local LLMs and wants to try it: https://github.com/TheTom/turboquant_plus/blob/main/docs/get...

iNaturalist

https://www.inaturalist.org/
77•bookofjoe•58m ago•18 comments

Solana Drift Protocol drained of $285M via fake token and governance hijack

https://anonhaven.com/en/news/drift-protocol-hack-285-million-solana/
36•anonhaven•51m ago•7 comments

Show HN: I built a frontpage for personal blogs

https://text.blogosphere.app/
454•ramkarthikk•5h ago•132 comments

Why the heck are we still using Markdown?

https://bgslabs.org/blog/why-are-we-using-markdown/
11•veqq•17m ago•9 comments

We replaced RAG with a virtual filesystem for our AI documentation assistant

https://www.mintlify.com/blog/how-we-built-a-virtual-filesystem-for-our-assistant
33•denssumesh•23h ago•15 comments

Go on Embedded Systems and WebAssembly

https://tinygo.org/
23•uticus•1h ago•0 comments

Understanding young news audiences at a time of rapid change

https://reutersinstitute.politics.ox.ac.uk/understanding-young-news-audiences-time-rapid-change
24•giuliomagnifico•5d ago•9 comments

Samsung Magician disk utility takes 18 steps and two reboots to uninstall

https://chalmovsky.com/2026/03/29/samsung-magician.html
259•chalmovsky•4d ago•136 comments

Big-Endian Testing with QEMU

https://www.hanshq.net/big-endian-qemu.html
63•jandeboevrie•4h ago•47 comments

Show HN: TurboQuant for vector search – 2-4 bit compression

https://github.com/RyanCodrai/py-turboquant
37•justsomeguy1996•5d ago•2 comments

April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini

https://gist.github.com/greenstevester/fc49b4e60a4fef9effc79066c1033ae5
228•greenstevester•8h ago•93 comments

The Technocracy Movement of the 1930s

https://donotresearch.substack.com/p/welcome-to-the-technocracy
21•lazydogbrownfox•15h ago•7 comments

A Recipe for Steganogravy

https://theo.lol/python/ai/steganography/seo/recipes/2026/03/27/a-recipe-for-steganogravy.html
95•tbrockman•5d ago•23 comments

Build your own Dial-up ISP with a Raspberry Pi

https://www.jeffgeerling.com/blog/2026/build-your-own-dial-up-isp-with-a-raspberry-pi/
25•arjunbajaj•3h ago•4 comments

SSH certificates: the better SSH experience

https://jpmens.net/2026/04/03/ssh-certificates-the-better-ssh-experience/
139•jandeboevrie•8h ago•55 comments

Show HN: Apfel – The free AI already on your Mac

https://apfel.franzai.com
550•franze•9h ago•124 comments

What Category Theory Teaches Us About DataFrames

https://mchav.github.io/what-category-theory-teaches-us-about-dataframes/
145•mchav•5d ago•44 comments

ESP32-S31: Dual-Core RISC-V SoC with Wi-Fi 6, Bluetooth 5.4, and Advanced HMI

https://www.espressif.com/en/news/ESP32_S31_Release
163•topspin•5d ago•91 comments

TDF ejects its core developers

https://meeksfamily.uk/~michael/blog/2026-04-02-tdf-ejects-core-devs.html
123•janvdberg•6h ago•79 comments

If you're running OpenClaw, you probably got hacked in the last week

https://old.reddit.com/r/sysadmin/comments/1sbdw29/if_youre_running_openclaw_you_probably_got_hac...
127•kykeonaut•1h ago•59 comments

Mercurial Dyson – a plan for the disassembly of planet Mercury

https://github.com/RokoMijic/MercurialDyson/blob/main/written_report.md
24•indy•1h ago•12 comments

What we learned building 100 API integrations with OpenCode

https://nango.dev/blog/learned-building-200-api-integrations-with-opencode/
71•rguldener•3d ago•16 comments

NHS staff refusing to use FDP over Palantir ethical concerns

https://www.freevacy.com/news/financial-times/nhs-staff-refusing-to-use-fdp-over-palantir-ethical...
260•chrisjj•8h ago•109 comments

Solar and batteries can power the world

https://nworbmot.org/blog/solar-battery-world.html
210•edent•3h ago•302 comments

Category Theory Illustrated – Types

https://abuseofnotation.github.io/category-theory-illustrated/06_type/
49•boris_m•8h ago•1 comments

Intel Assured Supply Chain Product Brief

https://www.intel.com/content/www/us/en/content-details/850997/intel-assured-supply-chain-product...
44•aw-engineer•4d ago•10 comments

Tailscale's new macOS home

https://tailscale.com/blog/macos-notch-escape
532•tosh•23h ago•286 comments

Google releases Gemma 4 open models

https://deepmind.google/models/gemma/gemma-4/
1686•jeffmcjunkin•1d ago•447 comments

Cursor 3

https://cursor.com/blog/cursor-3
517•adamfeldman•1d ago•377 comments

Critics say EU risks ceding control of its tech laws under U.S. pressure

https://www.politico.eu/article/fatal-decision-eu-slammed-for-caving-to-us-pressure-on-digital-ru...
190•nickslaughter02•7h ago•121 comments