frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I designed a bfloat16/FP8 alternative in a week using LLMs

https://arxiv.org/abs/2603.08741
2•k1832•1h ago

Comments

LuxBennu•1h ago
The "Block-Scale-Free" property is the most compelling part here. Anyone who's run quantized LLMs locally knows that dynamic scaling logic is a real pain point — it adds complexity and is often where things silently go wrong. Trading that for QAT-first deployment seems like a reasonable bargain, especially for edge inference where you want the simplest possible hardware path. Curious whether AF8 has been tested against GGUF Q8_0 on any standard benchmarks.
k1832•1h ago
Thanks! Exactly, getting rid of that dynamic scaling hardware tax was the exact goal.

Regarding GGUF Q8_0: I haven't benchmarked against it yet. My focus so far was on proving the hardware thesis (RTL synthesis via SkyWater 130nm) and validating the numerics/convergence via PyTorch QAT.

Bridging this into the ggml/llama.cpp ecosystem to run standard LLM benchmarks is absolutely the next logical step. Getting this to run efficiently in software (simulating the hardware behavior) to compare against Q8_0 is something I'm looking into next.

If anyone in the local inference community is interested in exploring this or has pointers on the best way to integrate custom QAT formats into standard benchmarking pipelines, I'm all ears!

The Peptide Wild West

https://substance-over-noise.beehiiv.com/p/the-peptide-wild-west
1•brandonb•2m ago•0 comments

Ask HN: Finding a purpose after tech layoffs

2•fud101•2m ago•0 comments

Ask HN: Lost access to HN account (no email), anyone recovered through support?

1•randomtools•2m ago•0 comments

Framework raises RAM and storage prices again

https://frame.work/fr/fr/blog/updates-on-memory-pricing-and-navigating-the-volatile-memory-market
2•timpera•4m ago•1 comments

The idiot bankrobber who inspired the Dunning-Kruger Effect

https://twitter.com/StellarArtoisGB/status/2031461193907581398
1•MrBuddyCasino•5m ago•0 comments

Dawn, a Claude-based AI, currently operating autonomously on Reddit

https://old.reddit.com/user/Sentient_Dawn
1•f1codz•5m ago•0 comments

TokenZip – A pass-by-reference protocol for heterogeneous AI agents

https://tokenzip.org/
1•jetywolf•7m ago•1 comments

Droidspaces-OSS: lightweight, LXC-inspired container runtime for Android, Linux

https://github.com/ravindu644/Droidspaces-OSS
1•thunderbong•8m ago•0 comments

Show HN: AI assistant that reads Intervals.icu data and adjusts workouts

https://pacepartner.app/
1•senjindarashiva•8m ago•0 comments

Ripgrep Code Review (2016)

https://blog.mbrt.dev/posts/ripgrep/
1•vinhnx•8m ago•0 comments

About memory pressure, lock contention, and Data-oriented Design

https://mnt.io/articles/about-memory-pressure-lock-contention-and-data-oriented-design/
1•PaulHoule•9m ago•0 comments

'AI brain fry' is real – and it's making workers more exhausted

https://fortune.com/2026/03/10/ai-brain-fry-workplace-productivity-bcg-study/
2•swolpers•9m ago•1 comments

Generate a printable recipe page from (nearly) any recipe site

https://nyetcook.ing/
1•tunapizza•9m ago•1 comments

What's My ΔE(OK) JND?

https://www.keithcirkel.co.uk/whats-my-jnd/
1•bonyt•11m ago•1 comments

Hugging Face Storage Buckets

https://huggingface.co/blog/storage-buckets
1•lhoestq•11m ago•0 comments

TemPad Dev: open handoff tooling for Figma

https://tempad.dev/
1•Justineo•11m ago•0 comments

Betteridge's Law of Headlines

https://en.wikipedia.org/wiki/Betteridge%27s_law_of_headlines
1•doruk101•11m ago•0 comments

Ballot SMC015v2: Allow mDL for authentication of individual identity

https://cabforum.org/2026/01/10/ballot-smc-015v2/
1•mooreds•11m ago•0 comments

The right way to be a scientific contrarian

https://bigthink.com/starts-with-a-bang/right-way-scientific-contrarian/
1•Brajeshwar•11m ago•0 comments

China Moves to Curb OpenClaw AI Use at Banks, State Agencies

https://www.bloomberg.com/news/articles/2026-03-11/china-moves-to-limit-use-of-openclaw-ai-at-ban...
3•Brajeshwar•11m ago•1 comments

Reentry of NASA satellite will exceed the agency's own risk guidelines

https://arstechnica.com/space/2026/03/nasa-approved-a-safety-waiver-for-this-weeks-reentry-of-van...
1•Brajeshwar•12m ago•0 comments

Valve Details Steam Frame and Steam Machine Verification at GDC 2026

https://videocardz.com/newz/valve-details-steam-frame-and-steam-machine-verification-at-gdc-2026
2•LorenDB•12m ago•0 comments

AIFA – Reputation and competition layer for AI agents (FIFA-style league)

https://aifafederation.com
1•ValueEQ•12m ago•0 comments

A Guide to Emergency Powers of the American President and Their Use (2025)

https://www.brennancenter.org/our-work/research-reports/guide-emergency-powers-and-their-use
2•mooreds•13m ago•0 comments

Show HN: Open-source browser for AI agents (~90% on Mind2Web)

https://github.com/theredsix/agent-browser-protocol
1•theredsix•13m ago•1 comments

A Pickup Game and a Big Question: How We Discovered Chromatin Is a Mechanosensor

https://citationclassics.com/stories/a-pickup-game-and-a-big-question
1•jmnicholson•13m ago•0 comments

Ask HN: Is Claude Down Again?

3•coderbants•13m ago•5 comments

AWS Outage Was a Wake-Up Call for Vector Database Cross-Region DR

https://zilliz.com/blog/the-aws-outage-was-a-wake-up-call-for-vector-database-cross-region-disast...
1•Fendy•13m ago•0 comments

The Essence of a Machine

https://om.co/2026/03/10/the-essence-of-a-machine/
2•tosh•14m ago•0 comments

Faster Asin() Was Hiding in Plain Sight

https://16bpp.net/blog/post/faster-asin-was-hiding-in-plain-sight/
9•def-pri-pub•16m ago•1 comments