Ask HN: Build Your Own LLM?

15•retube•4mo ago

The best way to really understand how something works is to build it yourself. So I am wondering if there are any good tutorials on building your own LLM from scratch. I.e. implementing tokenisation, embeddings, attention and so on. I am not suggesting one could replicate chatGPT, but more a toy model that implements the core features but based on a much smaller corpus and training data.

Comments

2ro•4mo ago

How about this?

https://mathstodon.xyz/@empty/115088095028020763

retube•4mo ago

thanks

pm2222•4mo ago

https://www.amazon.com/Build-Large-Language-Model-Scratch/dp...

retube•4mo ago

thanks. looks potential

ryanchants•4mo ago

I'd get it straight from Manning and save a few bucks and take out the middle man: https://www.manning.com/books/build-a-large-language-model-f...

sfmz•4mo ago

Andrej Karpathy: Let's build GPT: from scratch, in code, spelled out. https://www.youtube.com/watch?v=kCc8FmEb1nY

beardyw•4mo ago

Andrej Karpathy's Nano GPT is reasonably accessible and easy to run.

https://github.com/karpathy/nanoGPT

runjake•4mo ago

Since you're posting here, you're looking for the shortcut.

The shortcut is Karpathy's "Let's Build GPT: from scratch, in code, spelled out" video:

https://www.youtube.com/watch?v=kCc8FmEb1nY

Then there is a good video that dives into LLMs and how they work that is quite approachable:

https://www.youtube.com/watch?v=7xTGNNLPyMI

From there, flesh out knowledge with his other videos, where he goes both extremely light and extremely deep:

https://www.youtube.com/@AndrejKarpathy/videos

Anyway, I really like's Karpathy's video because he's very good at explaining LLMs at every level.

khamidou•3mo ago

Sorry to self-promote but I did exactly that a few months back: https://khamidou.com/gpt2/

Generally, I think the Karpathy tutorials are a good starting point but they're very mathy (despite people telling you you only need high school math to understand llms, a lot of the abstractions and concepts he uses are a bit foreign to programmers).

I found out rebuilding inference of a known model taught me a lot more than passively sitting through the videos and maybe retyping his code. You should try it with something simple, like a model from a few years back!

liqilin1567•3mo ago

There is a new repo of karpathy: https://github.com/karpathy/nanochat. It's a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase.

Ask HN: Why doesn't Netflix add a “Theater Mode” that recreates the worst parts?

Ask HN: Opus 4.6 ignoring instructions, how to use 4.5 in Claude Code instead?

Ask HN: Anyone Using a Mac Studio for Local AI/LLM?

Ask HN: Ideas for small ways to make the world a better place

Ask HN: Non AI-obsessed tech forums

Ask HN: 10 months since the Llama-4 release: what happened to Meta AI?

Ask HN: Who wants to be hired? (February 2026)

LLMs are powerful, but enterprises are deterministic by nature

Ask HN: Who is hiring? (February 2026)

AI Regex Scientist: A self-improving regex solver

Ask HN: Non-profit, volunteers run org needs CRM. Is Odoo Community a good sol.?

Tell HN: Another round of Zendesk email spam

Ask HN: Is Connecting via SSH Risky?

Ask HN: Has your whole engineering team gone big into AI coding? How's it going?

Ask HN: How does ChatGPT decide which websites to recommend?

Ask HN: Why LLM providers sell access instead of consulting services?

Ask HN: Is there anyone here who still uses slide rules?

Ask HN: Mem0 stores memories, but doesn't learn user patterns

Ask HN: Is it just me or are most businesses insane?

Kernighan on Programming

Ask HN: What is the most complicated Algorithm you came up with yourself?

Ask HN: Anyone Seeing YT ads related to chats on ChatGPT?

Ask HN: Does global decoupling from the USA signal comeback of the desktop app?

We built a serverless GPU inference platform with predictable latency

Ask HN: Does a good "read it later" app exist?

Ask HN: Have you been fired because of AI?

Ask HN: Anyone have a "sovereign" solution for phone calls?

Ask HN: Cheap laptop for Linux without GUI (for writing)

Ask HN: Any International Job Boards for International Workers?

Ask HN: How Did You Validate?