Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template

1•killerstorm•4mo ago

I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•4mo ago

If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

Clustering Nvidia DGX Spark and M3 Ultra Mac Studio for 4x Faster LLM Inference

Rogue: Open-source AI agent evaluation framework

Agentic AI: Why Evaluation Is the Make-or-Break Factor

Pentagon Imposes Pre-Publication Censorship – All Major U.S. Media Walk Out

See your product's CO₂ impact from the concept phase

Major network vendors team to advance Ethernet for scale-up AI networking

Alcohol use and risk of dementia in diverse populations

Hybrid War Threat Looms over Sweden's Cashless Society

Waymo plans to bring its taxis to London in 2026

So you want to build a data mesh

The Sovereign Tech Fund invests $450k in R Foundation to Enhance R

Why Most Apps Should Start as Monoliths

Sadiq Khan holds birthday bash on £268M superyacht

LLMs Reproduce Human Purchase Intent via Semantic Similarity of Likert Ratings

The Testability of Pure Functions

Sora 2 AI Video Generator – Create AI Videos from Text and Images

What the Eurostack Is Missing

Show HN: maptail – Tail GeoIP data on a world map in realtime

China's Rare Earth Restrictions Aim to Beat U.S. at Its Own Game

Inverse Collatz's Tape

Technology is my leverage, not design

The Economic Cost of Antisemitism

The present and potential future of progressive image rendering

Sora2 AI Video Generator

Don't Stop Believin' in OpenAI

The Slack I Loved Is Slipping Away

Where's the AI Design Renaissance?

Haskell Weekly – Issue 494

MuPDF Explored (2022)

Why does collapsing a bubble with a sound wave produce light?