frontpage.

I built a tiny physics solver LLM that performs surprisingly well on easy-to-medium difficulty physics problems. Most LLMs today still struggle with physics QA (as PhyBench recently highlighted), so I wanted to see how far I could push a small model with careful data and minimal compute.

Model: Qwen3-1.7B

Supervised Finetuning: ~1500 curated examples spanning kinematics, EM, acoustics, and more

RL Fine-tuning: GRPO, 1-shot RLVR style (single example, 70 steps)

Total cost: ~$5 on H100

It started with a cold-start SFT (~3 epochs, loss to 0.3), then I ran RL with accuracy reward that climbed from 0.1 → 0.8.

Goal: Create a lightweight physics solver that’s small enough to deploy anywhere—think of it as a "pocket tutor" for foundational physics.

Still working on evaluations—most benchmarks focus on very hard problems, while I want something that evaluates basic correctness, reasoning, and unit sense on easy/medium problems. If anyone has suggestions, I’d love to hear them.

Brute Force Colors (2022)

Google Translate apparently vulnerable to prompt injection

(Bsky thread) "This turns the maintainer into an unwitting vibe coder"

Software development is undergoing a Renaissance in front of our eyes

Can you beat ensloppification? I made a quiz for Wikipedia's Signs of AI Writing

Spec-Driven Design with Kiro: Lessons from Seddle

Agents need good developer experience too

The Dark Factory

Free data transfer out to internet when moving out of AWS (2024)

Interop 2025: A Year of Convergence

Prejudice Against Leprosy

Slint: Cross Platform UI Library

AI and Education: Generative AI and the Future of Critical Thinking

Maple Mono: Smooth your coding flow

Moltbook isn't real but it can still hurt you

Take Back the Em Dash–and Your Voice

Show HN: 289x speedup over MLP using Spectral Graphs

Teaching Mathematics

3D Printed Microfluidic Multiplexing [video]

Abstractions Are in the Eye of the Beholder

Show HN: Routed Attention – 75-99% savings by routing between O(N) and O(N²)

We didn't ask for this internet – Ezra Klein show [video]

The Real AI Talent War Is for Plumbers and Electricians

Show HN: MimiClaw, OpenClaw(Clawdbot)on $5 Chips

I Maintain My Blog in the Age of Agents

The Fall of the Nerds

Show HN: I'm 15 and built a free tool for reading ancient texts.

How close is AI to taking my job?

You are the reason I am not reviewing this PR

Show HN: FamilyMemories.video – Turn static old photos into 5s AI videos