frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•12mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•12mo ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

The Morale of Tech Workers Is Plunging as Layoffs Mount

https://www.nytimes.com/2026/05/19/business/tech-layoffs-blind.html
1•bookofjoe•4m ago•2 comments

Cache – Meal plans from your local store's weekly sales

https://www.cache.fit/
1•blaughlin•4m ago•1 comments

A Unified Theory of Alignment in Layered Systems

https://a-unified-theory-of-alignment-in-layered-systems.tiiny.site/
1•CitizenOfEarth•5m ago•0 comments

The quiet grief of adult friendship

https://timesofindia.indiatimes.com/blogs/civil-irony/the-quiet-grief-of-adult-friendship/
1•crcastle•5m ago•0 comments

Show HN: SaveNeighbor – food delivery through your own personal network

https://www.saveneighbor.com
1•JJonesRatio•7m ago•0 comments

Canonical to shut Ubuntu Pastebin after 18 years of service

https://www.omgubuntu.co.uk/2026/05/canonical-ubuntu-pastebin-shutdown
1•colinprince•9m ago•0 comments

I built an online leather goods store focused on making gift buying less painful

https://www.vintageleather.com.au/
1•vickeycool•11m ago•1 comments

Tfdraw.dev – turn Terraform plan JSON into an editable architecture diagram

https://tfdraw.dev/demo
1•spoosh•11m ago•0 comments

Show HN: The first (free) podcast ad blocker

https://apps.apple.com/us/app/drea-podcast-ad-blocker/id6759070798
1•hamza_q_•20m ago•0 comments

Fatherhood Dramatically Rewires Your Brain

https://www.sciencealert.com/fatherhood-dramatically-rewires-your-brain-scans-reveal
2•Gaishan•22m ago•1 comments

How AI Talks People Out of Conspiracy Theories–and What We Can Learn from That

https://www.wsj.com/tech/ai/ai-debunks-conspiracy-theories-92eff2c5
2•MilnerRoute•32m ago•1 comments

Honopinion

https://honopinion.com
1•mroshani20•37m ago•0 comments

We Built Secure, Scalable Agent Sandbox Infrastructure

https://twitter.com/larsencc/status/2027225210412470668
1•gmays•38m ago•1 comments

Mvm – a fast virtual machine for Go

https://mvm.sh/
1•birdculture•40m ago•0 comments

Teaching Codex to Test a Voice-First Calendar App

https://www.elicited.blog/posts/teaching-codex-to-test-a-voice-first-calendar
1•justanotheratom•42m ago•1 comments

What were your favorite classic iPod games?

1•wompapumpum•44m ago•0 comments

'What Matters Most'–Google Is Changing Your Gmail Inbox

https://www.forbes.com/sites/zakdoffman/2026/05/23/what-matters-most-google-is-changing-your-gmai...
1•healsdata•53m ago•0 comments

Lessons I Learned from Creating Searx

https://hister.org/posts/lessons-i-learned-from-creating-searx
1•xosc•55m ago•0 comments

How Google's Beta Tester Requirement Created a Fiverr Grey Market

https://danunparsed.com/p/googles-beta-tester-requirement
3•sambellll•1h ago•0 comments

The Black Hole Scientists Say Is Growing Too Fast

https://substack.com/profile/512907875-hamza-ashkar/note/c-264627457
2•hamzaashkar•1h ago•0 comments

Agent evals should feel like real work

https://www.zohaib.cc/blog/agent-evals
1•zed_labs_dev•1h ago•0 comments

Verifying a Caliptra Boot-FSM Bug with Mununu

https://marianocerrutti.substack.com/p/verifying-a-caliptra-boot-fsm-bug
1•hasheddan•1h ago•0 comments

The Densest (Urban) Environment in the World

https://oldurbanist.blogspot.com/2011/09/densest-urban-environment-in-world.html
3•Neuronaut•1h ago•1 comments

Poll: Test

1•sillysaurusx•1h ago•0 comments

The Green Side of the Lua

https://arxiv.org/abs/2601.16670
2•radiator•1h ago•0 comments

Star Citizen game has reached $1B in funding

https://robertsspaceindustries.com/en/funding-goals
8•speckx•1h ago•0 comments

Show HN: JavaScript Crossword – a crossword where the clue = eval(answer)

https://lyra.horse/fun/jscrossword/
1•rebane2001•1h ago•0 comments

No Asterisk Products Manifesto: hardware that works when the servers go down

https://noasteriskproducts.org/
2•brooklyntom•1h ago•0 comments

Built a small PR guardrail for token bloat, worth maintaining?

https://github.com/unloopedmido/contextlevy
1•nonlooped•1h ago•0 comments

Test

1•sillysaurusx•1h ago•0 comments