fp.

We Mourn Our Craft

https://nolanlawson.com/2026/02/07/we-mourn-our-craft/

70•ColinWright•1h ago•41 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode

21•surprisetalk•1h ago•17 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/

121•AlexeyBrin•7h ago•24 comments

U.S. Jobs Disappear at Fastest January Pace Since Great Recession

https://www.forbes.com/sites/mikestunson/2026/02/05/us-jobs-disappear-at-fastest-january-pace-sin...

99•alephnerd•2h ago•52 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/

824•klaussilveira•21h ago•248 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html

56•vinhnx•4h ago•7 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/

53•thelok•3h ago•6 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/

103•1vuio0pswjnm7•8h ago•118 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...

1057•xnx•1d ago•608 comments

Reinforcement Learning from Human Feedback

https://rlhfbook.com/

76•onurkanbkrc•6h ago•5 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/

478•theblazehen•2d ago•175 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/

204•jesperordrup•11h ago•69 comments

France's homegrown open source online office suite

https://github.com/suitenumerique

547•nar001•5h ago•253 comments

Coding agents have replaced every framework I used

https://blog.alaindichiappari.dev/p/software-engineering-is-back

215•alainrk•6h ago•334 comments

Selection Rather Than Prediction

https://voratiq.com/blog/selection-rather-than-prediction/

8•languid-photic•3d ago•1 comments

A Fresh Look at IBM 3270 Information Display System

https://www.rs-online.com/designspark/a-fresh-look-at-ibm-3270-information-display-system

35•rbanffy•4d ago•7 comments

72M Points of Interest

https://tech.marksblogg.com/overture-places-pois.html

28•marklit•5d ago•2 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/

113•videotopia•4d ago•30 comments

Where did all the starships go?

https://www.datawrapper.de/blog/science-fiction-decline

73•speckx•4d ago•74 comments

Software factories and the agentic moment

https://factory.strongdm.ai/

68•mellosouls•4h ago•73 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo

273•isitcontent•21h ago•38 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en

199•limoce•4d ago•111 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty

285•dmpetrov•22h ago•153 comments

Making geo joins faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes

155•matheusalmeida•2d ago•48 comments

Show HN: Kappal – CLI to Run Docker Compose YML on Kubernetes for Local Dev

https://github.com/sandys/kappal

21•sandGorgon•2d ago•11 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/

555•todsacerdoti•1d ago•268 comments

Ga68, a GNU Algol 68 Compiler

https://fosdem.org/2026/schedule/event/PEXRTN-ga68-intro/

43•matt_d•4d ago•18 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/

424•ostacke•1d ago•110 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/

473•lstoll•1d ago•313 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/

348•eljojo•1d ago•215 comments

Open in hackernews

Teaching an LLM a Niche Diagraming Language

https://www.huy.rocks/everyday/12-01-2025-ai-teaching-an-llm-a-niche-diagraming-language

30•todsacerdoti•2mo ago

Comments

thomascountz•2mo ago

   ...I heard many good and bad things about [using RL for training] and I must give it a try.

Great article and great ethos. Thanks for sharing! I had no idea how LLM worked before and now I know a bit more.

robot-wrangler•2mo ago

Big thank you to author and OP. This is exactly the kind of homebrew recipe post I've been waiting for. I knew it had to be basically cookbook by now but really simple examples like this with no fluff are surprisingly hard to find. (Anyone got others?)

I've been thinking about similar experiments with some obscure esolang for a long time, so more detail on total time/cost would be nice. Also.. if it's correct that this size model is about the right minimal choice for starting such efforts.. what are the next steps if you wanted to shrink it to only specialize in the target? Should you go for distillation or ablation?

huydotnet•2mo ago

Hey, I'm the author of the post. Thank you so much for the kind feedback!

Speaking about total time/cost, this experiment cost me just $1.01 for 2h30 on a rental GPU. But the actual successful run was less than 10 minutes for both phases. The rest of the time I was spending fixing the code, tuning the params, train, and retrain. It took me about 6 hours to build and clean the two datasets, though.

For the next step, I'm thinking of improving the model accuracy, maybe with RL, but I would not go about shrinking the model size any lower. Prior to this, I've tried a lot of different model sizes on different kinds of tasks, from 135M to 4B. I'm not sure I like the performance of these small models for code generation :D