frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Hoeren – Local-only meeting transcription and voice dictation

1•dimaberlin•1m ago•0 comments

Chinese electric truck maker Windrose makes first US delivery

https://www.reuters.com/business/autos-transportation/chinese-electric-truck-maker-windrose-makes...
1•java-man•4m ago•1 comments

Show HN: Go-Bt: Minimalist Behavior Trees for Go

https://github.com/rvitorper/go-bt
1•rvitorper•5m ago•0 comments

Neon Vector Animation

https://davidhampgonsalves.com/neon-animated-eye/
1•dh-g•5m ago•0 comments

BlueHammer – Windows 0day LPE

https://github.com/Nightmare-Eclipse/BlueHammer
1•asimops•7m ago•0 comments

The AI coding agent is a new kind of contributor, and Git isn't made for it

https://agent-undo.com/blog/the-shift/
2•peaktwilight•9m ago•0 comments

Adam Jacob – Communication Breakdown

https://www.adamhjk.com/blog/communication-breakdown/
1•rmoff•9m ago•0 comments

Jitter – Prove human authorship in Google Docs without sharing content

https://scalisos.com
2•theborat•10m ago•0 comments

Show HN: Real-time deepfake in the browser, no GPU needed

https://www.callstrike.ai/deepfake-security-training
1•gyanchawdhary•11m ago•0 comments

Private AI Document Analysis (even in the browser)

https://www.pdflayout.ai/
2•floathub•12m ago•0 comments

WordTrail – Daily Word Puzzle

https://play.google.com/store/apps/details?id=com.wordtrail.app&hl=en_US
1•talha_dev•12m ago•0 comments

Stellar Broadcast colony ship roguelike with on-device neural net planet naming

https://stellarbroadcast.org
1•FaustCircuits•13m ago•0 comments

How much Brits worry about the automation of their work

https://yougov.com/en-gb/trackers/how-much-brits-worry-about-the-automation-of-their-work
1•alphabetatango•14m ago•0 comments

Alicization Town – A decentralized pixel MMO for AI agents

1•LokYu•14m ago•0 comments

AIngram – SQLite shared reasoning memory for agent loops, no API keys

https://github.com/bozbuilds/AIngram
1•bozbuilds•14m ago•2 comments

The day you get cut out of the economy

https://geohot.github.io//blog/jekyll/update/2026/04/08/the-day-you-get-cut-out.html
1•lebek•14m ago•0 comments

OpenAI's AI New Deal [pdf]

https://cdn.openai.com/pdf/561e7512-253e-424b-9734-ef4098440601/Industrial%20Policy%20for%20the%2...
1•devInCamelCase•15m ago•0 comments

Show HN: ZeroID – Open-source identity for AI agents based on OIDF standards

https://github.com/highflame-ai/zeroid
1•jalbrethsen•18m ago•1 comments

Tech industry lays off nearly 80k employees in the first quarter of 2026

https://www.tomshardware.com/tech-industry/tech-industry-lays-off-nearly-80-000-employees-in-the-...
3•speckx•18m ago•0 comments

Git-fire – back up every local repo in one command

https://github.com/git-fire/git-fire
2•bschell•18m ago•1 comments

Deterministic Primality Testing for Limited Bit Width

https://www.jeremykun.com/2026/04/07/deterministic-miller-rabin/
1•ibobev•19m ago•0 comments

Pentagonal numbers are truncated triangular numbers

https://www.johndcook.com/blog/2026/04/01/truncated-triangular-numbers/
1•ibobev•19m ago•0 comments

Artemis II: Integrity completes its first return trajectory correction burn

https://www.nasa.gov/blogs/missions/2026/04/07/artemis-ii-flight-day-7-first-return-correction-bu...
1•Bubble1296•20m ago•1 comments

Two Years of Valkey

https://redmonk.com/sogrady/2026/04/06/valkey-at-two/
1•j03b•20m ago•0 comments

Claude Code Usage Dashboard

https://github.com/phuryn/claude-usage
2•neamar•21m ago•0 comments

An interactive map of river basins and watersheds of North and South America

https://python-maps-vis.vercel.app
2•alphabetatango•21m ago•0 comments

Show HN: Embenx – agentic memory layer for AI agents

https://github.com/adityak74/embenx
1•akarnam37•23m ago•0 comments

The Abbey That Fueled Scotland's Street Violence and Saved the Bees

https://www.governance.fyi/p/the-abbey-that-fueled-scotlands-street
1•bigbobbeeper•23m ago•0 comments

Thoughts on Slowing the Fuck Down

https://mariozechner.at/posts/2026-03-25-thoughts-on-slowing-the-fuck-down/#toc_5
3•lebek•23m ago•0 comments

Giving AI coding agents a Principal Engineer's mind and cost of intern

https://github.com/juyterman1000/entroly
1•abbyedd•24m ago•2 comments
Open in hackernews

LLM inference engine from scratch in C++ – why output tokens cost 5x

https://www.anirudhsathiya.com/blog/transformer
9•ani17•1h ago

Comments

ani17•1h ago
Author here. A bit more context: By day I'm a systems engineer building AI networking infrastructure. So I kept ending up in conversations where I'm not exactly able to wrap my head on the latest inference magic trick.

Like when someone mentioned vLLM's paged attention, I knew virtual memory paging, but had no idea someone had applied the same idea to KV cache allocation on GPUs.

Github link to the project: https://github.com/Anirudh171202/WhiteLotus

ani17•1h ago
The blog walks through why your first token is always the slowest, why output tokens cost 5x more, and how stuff like speculative decoding and chunked prefill actually work, from the perspective of a systems engineer!
brownianmotion1•1h ago
> float bodyWeight = 67.5f; // who needs 32 bits to store a weight??

UHHHH...