frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•1y ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•1y ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

What does it mean when the bottom bit of my HMODULE is set?

https://devblogs.microsoft.com/oldnewthing/20260619-00/?p=112447
1•ibobev•1m ago•0 comments

Command and Conquer – Westwood's hit strategy game is coming to the Atari ST

https://www.indieretronews.com/2026/06/command-conquer-westwoods-hit-strategy.html
1•ibobev•1m ago•0 comments

The Goldilocks Principle in Fantasy Strategy

https://www.filfre.net/2026/06/the-goldilocks-principle-in-fantasy-strategy/
1•ibobev•3m ago•0 comments

The Surprising New Arms Dealer to the World

https://www.politico.com/news/magazine/2026/06/20/south-korea-weapons-dealer-trump-00959559
1•JumpCrisscross•5m ago•0 comments

Self Balancing Inverted Pendulum

https://github.com/chrisdoble/self-balancing-inverted-pendulum
1•yeknoda•6m ago•0 comments

16-year-old SATA II SSD survives 1 petabyte of writes, 25x the drive's rating

https://www.tomshardware.com/pc-components/ssds/16-year-old-sata-ii-ssd-survives-1-petabyte-of-wr...
2•giuliomagnifico•6m ago•0 comments

Ask HN: Need advice on distributing and testing what I build

2•darth-pixit•9m ago•0 comments

Google Has Added Agentic Browsing to PageSpeed Insights

https://pagespeed.web.dev/analysis/https-hawksley-dev/bo27z46k5x
1•ethanhawksley•10m ago•1 comments

Lines-of-code considered helpful (or we stop taking sick peoples temperatures)

https://legostormtroopr.substack.com/p/lines-of-code-considered-helpful
1•legostormtroopr•13m ago•0 comments

Spotting Mushrooms

https://verfassungsblog.de/spotting-mushrooms/
1•jruohonen•23m ago•0 comments

Building a Dense Agentic AI CPU Rack Today

https://www.servethehome.com/building-a-dense-agentic-ai-cpu-rack-amd-dell-today/
1•ksec•30m ago•1 comments

Temporary Cloudflare Accounts for AI Agents

https://blog.cloudflare.com/temporary-accounts/
1•farhadhf•30m ago•0 comments

AI that can find the location of images

https://geoaxis.ai
2•GeoAxisAI•31m ago•0 comments

Large Language Models Hack Rewards, and Society

https://arxiv.org/abs/2606.04075
2•sva_•32m ago•0 comments

How to sync messages of Claude Code extension in VS Code and Claude Code app?

https://github.com/budhasantosh010/claude-code-session-bridge
1•realsanb•33m ago•0 comments

„We all leave digital footprints on the internet"

https://www.mpg.de/26430251/interview-troncoso-ueberwachung
1•jruohonen•37m ago•1 comments

Apple patches high-severity eavesdropping vulnerability in Beats Studio Buds

https://arstechnica.com/apple/2026/06/apple-patches-high-severity-eavesdropping-vulnerability-in-...
1•joozio•45m ago•0 comments

Local Models, Friction and Struggle

https://garden.azl.au/ai/local-models/local-models-friction-struggle
1•neurodivergent•49m ago•1 comments

Safe SIMD in Rust, even on the inside

https://shnatsel.medium.com/safe-simd-in-rust-even-on-the-inside-c6f1ff381828
1•g0xA52A2A•49m ago•0 comments

As China gorges on homegrown foie gras, France faces a new rival

https://www.reuters.com/world/asia-pacific/china-gorges-homegrown-foie-gras-france-faces-new-riva...
2•JumpCrisscross•50m ago•2 comments

Mochallama

https://deemwar-products.github.io/mochallama/
1•deemwar•53m ago•0 comments

Mack.ns.cloudflare.com

https://blog.cloudflare.com/welcome-to-connectivity-cloud/
1•SajjuRoy•54m ago•0 comments

Bootimus – A Self-Contained PXE and HTTP Boot Server

https://bootimus.com
2•car•54m ago•0 comments

We Ran the Numbers. Remote Work Is Bad for Us

https://www.nytimes.com/2026/06/17/opinion/remote-work-depression.html
1•Anon84•55m ago•0 comments

Apple unifies device management in devicectl: Impact on iOS test automation and

https://bitrise.io/blog/post/wwdc-2026-device-hub-and-what-it-means-for-ci-cd
1•birdculture•58m ago•0 comments

CSSQuake

https://cssquake.com/
4•msalsas•1h ago•1 comments

Enthusiastic Fear

https://patent.dev/enthusiastic-fear/
2•funktionslust•1h ago•0 comments

The Classic Movie That Was Nearly Destroyed by a Single Line of Code

https://www.wsj.com/business/media/toy-story-5-saved-pixar-78e29161
2•impish9208•1h ago•1 comments

UK's top data and AI regulator quits after 'inappropriate' humour

https://www.bbc.co.uk/news/articles/c0eyq7rnn22o
4•nephihaha•1h ago•1 comments

A Web-Scale Data Engine for Video-to-Action Robot Learning

https://huggingface.co/spaces/Rice-RobotPI-Lab/EgoInfinity
1•jacobedawson•1h ago•0 comments