
Show HN: Create-LLM – Train your own LLM in 60 seconds

https://github.com/theaniketgiri/create-llm
38•theaniketgiri•16h ago
https://medium.com/@theaniketgiri/three-months-ago-i-wanted-...

Comments

kk58•10h ago
Does this work on Mac?
theaniketgiri•10h ago
Yep, works fine on Mac. Try the nano or tiny templates if you want quicker training runs.
efilife•9h ago
2 questions: how much of this project is AI generated and how much of only the readme is AI generated?
theaniketgiri•8h ago
Mostly the repetitive stuff like README generation and pushing code with meaningful commit messages was handled by AI. The actual work and logic were done by me.
joshribakoff•7h ago
What about the commit that added tens of thousands of lines of markdown claiming to be an AI summary?

Or the meaningful commit message of “.”

And the commit editing 1,000s of lines of python code mislabeled as a docs change?

theaniketgiri•7h ago
Totally fair question!

Docs / Markdown: AI handled repetitive stuff like READMEs and summaries.

Core logic / Python: fully written by me.

Commit messages: some minimal ones just for quick iterations — the real work is in the code.

AI helped with boilerplate so I could ship faster; all functionality is hand-crafted.

joshribakoff•5h ago
If the AI did the boilerplate that implies it was not fully written by you.

The “meaningful commit messages” — again are a single period as the message for a single commit for the entire python portion of the codebase.

My question was rhetorical. Whether the AI did it or a human did, it burns credibility to refer to things that don’t exist (like “meaningful commit messages”)

teruakohatu•3h ago
Hacker News is a better place when we don’t attack people sharing their work. Your point was made.

Well done to the author for shipping code. I look forward to trying it out.

Grimblewald•3m ago
> for sharing their work

If it was their work your point would hold.

darepublic•6h ago
I don't quite understand how you get from this:

> I wanted to understand how these things work by building one myself.

Directly to this:

> What if training an LLM was as easy as npx create-next-app?

I mean that the second thought seems to be the opposite of the first (what if the entirety of training an LLM was abstracted behind a simple command).

theaniketgiri•5h ago
Great question - I should've been clearer.

When I started, I wanted to understand LLMs deeply. But I hit a wall: tutorials were either "hello world" toys or "here's 500 lines of setup before you start."

What I needed was: "give me working code quickly, THEN let me modify and learn."

That's what create-llm does. It scaffolds the boilerplate (like create-next-app), so you can spend time learning the interesting parts:

- Why does vocab size matter? (adjust config, see results)
- What causes overfitting? (train on small data, see it happen)
- How do different architectures perform? (swap templates, compare)

It's "easy to start, deep to master." The abstraction gets you running in 60 seconds, then you dig into the code.
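
(Illustration, not from create-llm itself: one rough way to see why vocab size matters is to count the parameters in the token embedding table, which is vocab size times model width. The function name, the width of 256, and the vocab sizes below are hypothetical choices for this sketch, not values taken from the project's templates.)

```python
def embedding_params(vocab_size: int, d_model: int, tied: bool = True) -> int:
    """Count parameters in the token embedding table.

    If the output projection is not tied to the input embedding,
    a second table of the same shape is needed, so the count doubles.
    """
    table = vocab_size * d_model
    return table if tied else 2 * table


if __name__ == "__main__":
    # Hypothetical vocab sizes and model width, chosen only for illustration.
    for vocab in (5_000, 32_000, 50_000):
        print(vocab, embedding_params(vocab, d_model=256))
```

For a small model, the embedding table can be a large share of the total parameter count, so changing the vocab size in a config may change model size and training time noticeably.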

seg_lol•5h ago
The blogpost is some of the best LLM greentext I have seen for targeting the hn hivemind. Everything about this is :chefs kiss:
3abiton•47m ago
How does this differ from nanochat?
