frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

https://twitter.com/fguzmanai/status/2065832668172845209
26•laxmena•1h ago

Comments

amelius•1h ago
See also:

https://rits.shanghai.nyu.edu/ai/karpathys-microgpt-on-fpga-...

TL;DR: The CPU implementation was 71x faster than the FPGA.

Note: model has only 4192 parameters.

cyanydeez•1h ago
yeah, then theres prompt loading too.

but anyone who can fit QWEN-3.6 35B with a sustained ~30 token/s and ~100k context with cache could print money as a hardware vendor.

wmf•53m ago
That just sounds like a 3090.
hedgehog•43m ago
That post is uninteresting both because they miss the point, and it's not clear a human was even involved to perceive a point to miss. Sure, with an unlimited transistor budget, power budget, and a design clocked at 4GHz fabbed on 5nm one of the best CPU design teams in the world can make a thing that is straight line faster than a one-person project running at 80MHz on a 20 year old 65nm FPGA. Any other answer would be extremely surprising.

Now, there are a bunch of interesting things about this project. Seeing the example of a tiny transformer running on FPGA is informative, and that it was apparently a pretty quick project for one person + robot assistance. Probably some transferable lessons for anyone else doing robo-FPGA development.

https://github.com/fguzman82/gateGPT/tree/main/

genxy•1h ago
The context window is 16 characters. Talking about tokens per second is meaningless.
dominotw•36m ago
its not meaningless. there could be usecases like spell correction.
cadamsdotcom•49m ago
Transformers scale poorly vs. context window size and parameter count.

Which means really impressive when those N’s are small!

I’m but a pundit in this area so don’t know much. But one wonders if there’s a future in burning larger models to FPGAs - whether big enough FPGAs exist (or can be built), and whether locating specialized compute right with the memory it needs can speed things up.

Likely would need a lot of algorithm parallelism work that’d translate back to CPUs/GPUs.

T-A•12m ago
Related: https://www.spheron.network/blog/etched-ai-sohu-vs-nvidia-tr...

Running local models is good now

https://vickiboykis.com/2026/06/15/running-local-models-is-good-now/
510•jfb•3h ago•252 comments

Claude: Elevated errors across many models

https://status.claude.com/incidents/xmhsglsz3h3w
79•forks•36m ago•51 comments

SpaceX to buy Cursor for $60B

https://www.reuters.com/legal/transactional/spacex-buy-anysphere-60-billion-2026-06-16/
484•itsmarcelg•7h ago•831 comments

Mechanical Watch (2022)

https://ciechanow.ski/mechanical-watch/
505•razin•6h ago•94 comments

TIL: You can make HTTP requests without curl using Bash /dev/TCP

https://mareksuppa.com/til/bash-dev-tcp-http-without-curl/
53•mrshu•1h ago•26 comments

Gamers beware: malicious wallpapers on Steam found stealing accounts

https://securelist.com/dozens-of-malicious-wallpapers-found-on-steam-workshop/120186/
27•speckx•52m ago•8 comments

But yak shaving is fun

https://parksb.github.io/en/article/32.html
82•parksb•3h ago•21 comments

Making ast.walk 220x Faster

https://reflex.dev/blog/why-ast-walk-when-you-can-ast-sprint/
37•palashawas•1h ago•8 comments

SubQ 1.1 Small

https://subq.ai/subq-1-1-small-technical-report
68•EDM115•3h ago•31 comments

After AI Takes Everything

https://ursb.me/en/posts/after-ai-takes-everything/
44•speckx•2h ago•13 comments

Apple's weird anti-nausea dots cured my car sickness

https://www.theverge.com/tech/942854/apple-vehicle-motion-cues-review-really-work
171•neilfrndes•1h ago•61 comments

Correlated randomness in Slay the Spire 2

https://tck.mn/blog/correlated-randomness-sts2/
220•rdmuser•8h ago•65 comments

I admire Fabrice Bellard. He is almost certainly a better overall programmer

https://twitter.com/ID_AA_Carmack/status/2064095424420487226
746•apitman•13h ago•358 comments

Formal Methods and the Future of Programming

https://blog.janestreet.com/formal-methods-at-jane-street-index/
17•nextos•4d ago•1 comments

The time the x86 emulator team found code so bad they fixed it during emulation

https://devblogs.microsoft.com/oldnewthing/20260615-00/?p=112419
448•paulmooreparks•13h ago•142 comments

Why is Meta destroying its engineering organization?

https://newsletter.pragmaticengineer.com/p/why-is-meta-destroying-its-engineering
68•throwarayes•1h ago•33 comments

The octopus architecture for AI agents

https://blog.goodman.dev/blog/octopus-agent-architecture/
6•joshbetz•42m ago•1 comments

10Gb/s Ethernet: switching to a Broadcom SFP+ module

https://www.gilesthomas.com/2026/06/10g-ethernet-switching-to-broadcom-sfp-plus
4•gpjt•18m ago•0 comments

Qwen-Robot Suite: A Foundation Model Suite for Physical World Intelligence

https://qwen.ai/blog?id=qwen-robotsuite
43•ilreb•4h ago•1 comments

An interview with an Apple emoji designer

https://shadycharacters.co.uk/2026/06/ollie-wagner/
70•nate•3d ago•35 comments

Specs Augmented Reality Glasses

https://newsroom.snap.com/introducing-specs-augmented-reality-glasses
16•haberdasher•1h ago•4 comments

'Ghost jobs' could soon be illegal in New York

https://www.fastcompany.com/91558427/ghost-jobs-could-soon-be-illegal-in-new-york
34•toomuchtodo•1h ago•8 comments

Unicorn – The Ultimate CPU Emulator

https://www.unicorn-engine.org/
66•tosh•6h ago•19 comments

Getting Creative with Perlin Noise Fields

https://sighack.com/post/getting-creative-with-perlin-noise-fields
128•0x000xca0xfe•2d ago•20 comments

Banned book library in a wi-fi smart light bulb

https://www.richardosgood.com/posts/banned-book-library/
545•sohkamyung•19h ago•324 comments

Feds freaked over Fable 5 after 'fix this code', not jailbreak, say researchers

https://www.theregister.com/security/2026/06/15/feds-freaked-over-fable-5-after-simple-fix-this-c...
475•_tk_•8h ago•287 comments

The Manhoff Archives: Color photos of Stalin-era USSR taken by a US diplomat

https://www.rferl.org/a/the-manhoff-archive/28359558.html
146•Cider9986•2d ago•50 comments

GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

https://twitter.com/fguzmanai/status/2065832668172845209
27•laxmena•1h ago•9 comments

I hacked into the worst e-bike and fixed it [video]

https://www.youtube.com/watch?v=hPrtVGimBYs
161•alexis-d•6d ago•80 comments

Making espresso with ultrasound

https://www.unsw.edu.au/newsroom/news/2026/06/New-way-making-espresso
58•darktoto•9h ago•60 comments