frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

KV-Cache Compression from First Principles – 10 strategies benchmarked in C

https://anne.lat/blog/kv-cache-compress
2•nadeko123•1h ago

Comments

nadeko123•1h ago
We implemented 10 KV-cache compression strategies from scratch in C and benchmarked them against each other on quality, memory, and throughput. Everything from symmetric INT8 to H2O eviction to a pyramid scheme (the good kind) that assigns different precision based on token age.

The pyramid approach ended up being the most interesting finding – recent tokens stay FP32, middle-aged go INT8, old tokens drop to INT4. Gets you 2.8x memory reduction at 0.996 cosine similarity to the FP32 baseline. Turns out tokens age out of relevance and precision should follow.

All code is pure C, no dependencies, ~2,400 lines. Every figure in the article is reproducible.

Fun crafts done by AI agents – humans subscribe to read

https://crafthunt.ai/
1•forest0xia•1m ago•1 comments

FAA Ends Use of 'See and Avoid' for Helicopters Near Busy Airports

https://www.nytimes.com/2026/03/18/us/politics/faa-visual-separation-policy.html
1•bookofjoe•3m ago•1 comments

Talking with Robin Brooks

https://paulkrugman.substack.com/p/talking-with-robin-brooks
1•paulpauper•4m ago•0 comments

GitHub – DrBradStanfield/Roadmap

https://github.com/DrBradStanfield/roadmap
1•evo_9•8m ago•0 comments

Build knowledge agents without embeddings

https://vercel.com/blog/build-knowledge-agents-without-embeddings
1•ahamez•9m ago•0 comments

AI's impact on private equity – based on AI deep research

https://ai-transformation.fyi/
1•Dagulf795•9m ago•1 comments

Show HN: Playra

https://playra-app.vercel.app/
1•Allenboyy•12m ago•0 comments

ClawMem

https://github.com/yoloshii/ClawMem
1•flippyhead•13m ago•0 comments

Alpha release: HunterPrey, a persistent SSH PvP world

https://hunterprey.com/
1•kicksent•14m ago•0 comments

Show HN: Association rule mining on 21.6M poker hands

https://pattern.poker
1•et9797•16m ago•0 comments

The Environment Will Be Saved If We Just Avoid Doing Icky Things (2024)

https://branchfree.org/2024/11/25/the-environment-will-be-saved-if-we-just-avoid-doing-icky-things/
2•usdogu•17m ago•0 comments

When the sun sets, batteries rise: 24/7 solar in California

https://pv-magazine-usa.com/2026/02/17/when-the-sun-sets-batteries-rise-24-7-solar-in-california/
1•toomuchtodo•18m ago•0 comments

Show HN: Three deployable open source platforms from a solo builder

1•BrainDAnderson•19m ago•0 comments

New OllamaMQ Version v0.2.5

https://github.com/Chleba/ollamaMQ
1•chleba•21m ago•0 comments

MoaV: Why Anti-Censorship Needs Infrastructure, Not Just Tools

https://medium.com/@sbetamc/moav-16-protocols-one-server-why-anti-censorship-needs-infrastructure...
1•shayanbahal•23m ago•0 comments

Why Tech Bros Are Now Obsessed with Taste

https://www.newyorker.com/culture/infinite-scroll/why-tech-bros-are-now-obsessed-with-taste
2•rmason•23m ago•1 comments

Microbenchmarking Chipsets for Giggles

https://chipsandcheese.com/p/microbenchmarking-chipsets-for-giggles
2•zdw•32m ago•0 comments

AI Is Garbage and a Bubble

https://www.richardcarrier.info/archives/38652
20•mastabadtomm•33m ago•8 comments

Show HN: Inner Warden, self-defending server with eBPF and AI

https://www.innerwarden.com/
2•maiconburn•34m ago•0 comments

Describe an app in one sentence, get it built and shipped within minutes

https://bubbling.dev/
1•georgesmith9914•35m ago•2 comments

GrapheneOS will remain usable by anyone without requiring personal information

https://grapheneos.social/@GrapheneOS/116261301913660830
4•nothrowaways•36m ago•0 comments

Show HN: SYNX – a new config format with active mod. Built for AI and humans

https://synx.aperturesyndicate.com/
2•Kaiserrberg•37m ago•0 comments

Show HN: Discover Indie Version of Popular SaaS Products

https://indiehustles.com/
2•devarifhossain•38m ago•1 comments

Mutual Party Extremism

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6425039
2•neehao•39m ago•0 comments

The Slow Collapse of MkDocs

https://fpgmaas.com/blog/collapse-of-mkdocs/
1•zdw•40m ago•0 comments

Show HN: I built a paper-based vault with m-of-n keys

https://papervault.xyz
2•boazeb•40m ago•0 comments

Solod: Go can be a better C

https://antonz.org/solod/
6•ibobev•42m ago•0 comments

Show HN: EnvMaster – inject encrypted env variables from your terminal

https://www.envmaster.dev/
2•selixe_•43m ago•0 comments

Onefiling

https://bumbershootsoft.wordpress.com/2026/03/21/onefiling/
2•ibobev•43m ago•0 comments

Learning Rust

https://alexene.dev/2018/09/09/Learning-rust.html
3•ibobev•45m ago•0 comments