frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: GPT-2 implemented using graphics shaders

https://github.com/nathan-barry/gpt2-webgl
228•nathan-barry•9mo ago
Back in the old days, people used to do general-purpose GPU programming by using shaders like GLSL. This is what inspired NVIDIA (and other companies) to eventually create CUDA (and friends). This is an implementation of GPT-2 using WebGL and shaders. Enjoy!

Comments

throwaway314155•9mo ago
This is pretty rad. I've built and run the project on my MacBook and can't seem to get the visualizations to show as they appear in the GitHub demonstration. I'm running Firefox but I think that should support WebGL features you're using?

Happy to file an issue on the repository as well if you'd prefer that.

nathan-barry•9mo ago
Yeah it'd be great if you could file an issue and show what you see! I was using Safari but I'm gonna check other browsers to test if it works on them
throwaway314155•9mo ago
Great! I've filed an issue.
SamosaGuru•9mo ago
Would be really cool to have it hosted on a website if it can be a static page so we could try it out
nathan-barry•9mo ago
Working on getting it up on Github pages, will let you know. Not sure if there will be an issue with hosting the weights, I'll look into it
Philpax•9mo ago
You may be able to fetch the weights directly from Hugging Face. I'd try that first.
ianand•9mo ago
Checkout https://github.com/jseeio/gpt2-tfjs fetches the weights for GPT2 from huggingface on the fly.
jasonjmcghee•9mo ago
I haven't looked at your implementation, but if you can read ONNX weights... https://huggingface.co/Xenova/gpt2
Philpax•9mo ago
I'm reminded of this VRChat world, which runs Qwen2-0.5B in a shader: https://vrchat.com/home/launch?worldId=wrld_e919c1f4-8399-43...
ianand•9mo ago
As the guy who did GPT2 in Excel, very cool and kudos!!

Curious why you chose WebGL over WebGPU? Just to show it can be done?

(Also see my other comment about fetching weights from huggingface)

ronsor•9mo ago
Probably because WebGPU support is still rather iffy.
nathan-barry•9mo ago
This was a final project for a graphics class where we used WebGL a lot. Also I was just more familiar with OpenGL and haven't looked that much into webGPU
littlestymaar•9mo ago
> Curious why you chose WebGL over WebGPU? Just to show it can be done?

For a WebGPU implementation, one can use transformers.js directly (or many other libraries actually), maybe WebGL is more original.

[1]: https://huggingface.co/docs/transformers.js/index

85392_school•9mo ago
Transformers.js wraps the ONNX runtime which is rather versatile (WASM, WebGL, WebGPU, and WebNN). It's not the backend that makes it novel.
_dijs•9mo ago
ianand, I immediately thought of you when I saw this post. Miss you friend.
ianand•9mo ago
Dude, been forever. Thanks. Will DM you.
divan•9mo ago
Someone needs to implement Excel using graphics shaders now.
ianand•9mo ago
Lol. You joke but... https://www.reddit.com/r/GraphicsProgramming/comments/pg9dw8...
helloplanets•9mo ago
Would absolutely read a 'making of' post on this project!
jasonjmcghee•9mo ago
Incredible. Was going to suggest tossing up a github page with it, but looks like one is currently in-progress? (https://nathan-barry.github.io/gpt2-webgl/) but doesn't seem to load for me currently.

Looks like it's trying to load the typescript directly / hosting the source files: `<script type="module" src="src/main.ts"></script>` - instead of hosting the dist folder.

nathan-barry•9mo ago
Yeah, right now the weights aren't even pushed to the repo so there's no way for the github page to load them in. I've looked a tiny bit into it and might end up making a fix for it, having it load the weights by fetching them from somewhere else on load, but probably not today.

There is a link to the zipped up folder with the weights here: https://github.com/nathan-barry/gpt2-webgl/releases/download...

If anyone wants to make a PR which fetches the file from there, unzips it, then loads in the weights from there that would be greatly appreciated.

badmonster•9mo ago
Super cool project! How much of the inference time is actually GPU-bound vs CPU-bound in the browser? Curious if WebGPU would offer a big perf boost over WebGL2 here.
Narishma•9mo ago
Now do it with the fixed function pipeline so I can run it on my 3dfx Voodoo card.
gitroom•9mo ago
Nice, that takes some serious grinding to pull off in shaders. Makes me kinda wonder - does pushing old tools like WebGL actually force deeper understanding of how these models work compared to just grabbing a new library?
supuun•9mo ago
> Back in the old days, people used to do general-purpose GPU programming by using shaders like GLSL. This is what inspired NVIDIA (and other companies) to eventually create CUDA (and friends).

I learned about this at the lecture last week. So fun to see this project.

Tiny C Compiler

https://bellard.org/tcc/
52•guerrilla•1h ago•20 comments

You Are Here

https://brooker.co.za/blog/2026/02/07/you-are-here.html
37•mltvc•1h ago•32 comments

SectorC: A C Compiler in 512 bytes

https://xorvoid.com/sectorc.html
148•valyala•5h ago•25 comments

The F Word

http://muratbuffalo.blogspot.com/2026/02/friction.html
76•zdw•3d ago•31 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
82•surprisetalk•5h ago•89 comments

LLMs as the new high level language

https://federicopereiro.com/llm-high/
19•swah•4d ago•12 comments

Software factories and the agentic moment

https://factory.strongdm.ai/
119•mellosouls•8h ago•232 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
157•AlexeyBrin•11h ago•28 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
864•klaussilveira•1d ago•264 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
113•vinhnx•8h ago•14 comments

GitBlack: Tracing America's Foundation

https://gitblack.vercel.app/
17•martialg•50m ago•3 comments

FDA intends to take action against non-FDA-approved GLP-1 drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
29•randycupertino•58m ago•29 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
21•mbitsnbites•3d ago•1 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/
73•thelok•7h ago•13 comments

First Proof

https://arxiv.org/abs/2602.05192
75•samasblack•7h ago•57 comments

Brookhaven Lab's RHIC concludes 25-year run with final collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
36•gnufx•4h ago•40 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
253•jesperordrup•15h ago•82 comments

I write games in C (yes, C) (2016)

https://jonathanwhiting.com/writing/blog/games_in_c/
156•valyala•5h ago•136 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/
532•theblazehen•3d ago•197 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
38•momciloo•5h ago•5 comments

Reinforcement Learning from Human Feedback

https://rlhfbook.com/
98•onurkanbkrc•10h ago•5 comments

Selection rather than prediction

https://voratiq.com/blog/selection-rather-than-prediction/
19•languid-photic•3d ago•5 comments

Italy Railways Sabotaged

https://www.bbc.co.uk/news/articles/czr4rx04xjpo
68•vedantnair•1h ago•54 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
212•1vuio0pswjnm7•12h ago•323 comments

72M Points of Interest

https://tech.marksblogg.com/overture-places-pois.html
42•marklit•5d ago•6 comments

A Fresh Look at IBM 3270 Information Display System

https://www.rs-online.com/designspark/a-fresh-look-at-ibm-3270-information-display-system
52•rbanffy•4d ago•14 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/
129•videotopia•4d ago•40 comments

Coding agents have replaced every framework I used

https://blog.alaindichiappari.dev/p/software-engineering-is-back
273•alainrk•10h ago•452 comments

France's homegrown open source online office suite

https://github.com/suitenumerique
648•nar001•9h ago•284 comments

Microsoft account bugs locked me out of Notepad – Are thin clients ruining PCs?

https://www.windowscentral.com/microsoft/windows-11/windows-locked-me-out-of-notepad-is-the-thin-...
51•josephcsible•3h ago•67 comments