I’m a C/C++ architect focused on low-latency systems. Last year, I tried building agentic pipelines with SLMs/LLMs and hit the usual wall: latency and orchestration overhead killed real-time edge use cases.
Initial research video: https://www.youtube.com/watch?v=q-rs9VZ1-0I
So I asked: how far can you push specialised logic at <1M parameters with nothing but a local CPU?
MicroGPT-C is a from-scratch C99 port of Karpathy’s microgpt.py (https://gist.github.com/karpathy/8627fe009c40f57531cb1836010...). Zero deps, single-header, localised KV cache. Speed was never the goal (Andrej’s nanoGPT and llm.c already showed what’s possible). The real experiment was orchestration.
Organelle Pipeline Architecture (OPA): Agile-style Planners, Workers and Judges talking through tiny structured strings (board=XO_|valid=1,3) parsed by a safety-gated stack VM (3.7–5.8M ops/s). A 64K model needed 181 manual interventions; a 460K model trained on those traces internalised everything and needed zero.
Beyond the research: fully auditable AI, great for education (~3k lines of readable C), rapid prototyping, and embedded targets.
Personal itch: fraud/risk engines. I want agents that hunt “unknown-unknowns” in a sandbox where every decision is inspectable.
Happy to talk implementation, the 97 tests, 22 benchmarks, or anything else. FAQ: https://github.com/enjector/microgpt-c/blob/main/FAQ.md
Quick try (macOS/Linux/Windows):

  git clone https://github.com/enjector/microgpt-c && cd microgpt-c
  mkdir build && cd build
  cmake .. -DCMAKE_BUILD_TYPE=Release && cmake --build .
  ./connect4_demo   # 460K params, ~21 min train, 88% win rate vs random
Performance (Apple M2 Max):

- 4.2K-param names model: 685k tok/s train, 110k tok/s infer
- 841K-param Shakespeare (char-level): 28k tok/s train, 16k tok/s infer
- 510K-param Shakespeare (word-level): 12.5k tok/s train, 40k tok/s infer
Full leaderboard (11 games), the market-regime experiment (57% holdout accuracy, 2.8× baseline), and the book PDF: https://github.com/enjector/microgpt-c/blob/main/docs/book/M...

GitHub: https://github.com/enjector/microgpt-c