frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•12mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•12mo ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

Microsoft reports AI is more expensive than paying human employees

https://fortune.com/2026/05/22/microsoft-ai-cost-problem-tokens-agents/
3•nreece•3m ago•0 comments

Most EVM systems have no memory

https://blog.bridgexapi.io/most-evm-systems-have-no-memory
1•Bridgexapi•4m ago•0 comments

Agent Substrate

https://github.com/agent-substrate/substrate
1•ahmedtd•5m ago•0 comments

Comparisons as Predictable as the Sunrise

https://pudding.cool/2026/05/similes/
1•newtwentysix•6m ago•0 comments

What Is an Air Gap?

https://www.ibm.com/think/topics/air-gap
1•teleforce•12m ago•0 comments

Ask HN: How concerned are you about a debt crisis?

3•bhag2066•13m ago•1 comments

Laravel Lang Compromised with RCE Backdoor Across 700 Versions

https://socket.dev/blog/laravel-lang-compromise
1•gpi•14m ago•0 comments

New kind of dark tourism emerging in online 'Backrooms,' study shows

https://phys.org/news/2026-05-kind-dark-tourism-emerging-online.html
1•WaitWaitWha•18m ago•0 comments

How the Library of Congress is using both AI and volunteers to unlock history

https://www.nextgov.com/artificial-intelligence/2026/05/how-library-congress-using-both-ai-and-vo...
1•WaitWaitWha•22m ago•0 comments

The Verification Tree: Turning AI bug report floods into a confidence signal

https://zenodo.org/records/20349904
2•yellow_glovez•25m ago•0 comments

Thousands evacuated in Garden Grove worry toxic chemicals in tank could explode

https://ktla.com/news/orange-county/thousands-evacuated-again-after-toxic-tank-leak-in-garden-grove/
1•Bender•26m ago•1 comments

Systemd 261-Rc1 Released with OS Installer, IMDS Subsystem and New Storagectl

https://www.phoronix.com/news/systemd-261-rc1
1•Bender•29m ago•0 comments

Feds unwittingly leak pilots' pre-crash conversation

https://www.theregister.com/science/2026/05/23/feds-unwittingly-leak-pilots-pre-crash-conversatio...
1•Bender•31m ago•0 comments

Kagi Search API is now in public preview

https://kagi.com/changelog#10622
1•mroche•32m ago•0 comments

A Visual Tour of Classical Electromagnetism (1960)

https://web.mit.edu/8.02t/www/802TEAL3D/visualizations/guidedtour/Tour.htm#_Toc27302365
2•nill0•32m ago•0 comments

My Hermes and Obsidian Setup and Use Cases

https://metedata.substack.com/p/013-my-hermes-and-obsidian-set-up
1•thunderbong•34m ago•0 comments

New rule requires most green-card applicants to apply from outside U.S.

https://www.washingtonpost.com/immigration/2026/05/22/new-rule-requires-most-green-card-applicant...
4•michaelsbradley•37m ago•0 comments

Shocker: Docker implemented in ~300 lines of bash

https://github.com/traceypooh/shocker
2•traceypooh•41m ago•0 comments

Any positive sides of LLM there?

2•artturibyte•42m ago•1 comments

WebFinder for Tailscale

https://zeulewan.github.io/web-finder/index.html
1•zeulewan•45m ago•0 comments

The Web Is About to Get a Second Door

https://medium.com/@vektormemory/the-web-is-about-to-get-a-second-door-5f9fa0fd0d0f
1•vektormemory•49m ago•0 comments

ngn-k tutorial

https://razetime.github.io/ngn-k-tutorial/
1•1vuio0pswjnm7•52m ago•1 comments

The commencement speech that shook the world

https://idiallo.com/blog/the-commencement-speech-that-shook-the-world
1•shaunpud•55m ago•0 comments

Family Abundance | Why Childcare is So Expensive in Silicon Valley

https://proliberal.substack.com/p/family-abundance
2•mtsteiner•58m ago•0 comments

Trying to preserve other peoples code

https://github.com/Essenceia/CRC_generator/tree/main
3•random__duck•1h ago•0 comments

Why Russian Propaganda Works – and How to Stop Falling for It

https://economicsofpower.substack.com/p/why-russian-propaganda-works-and
7•mariuz•1h ago•0 comments

Ask HN: Has anyone solved Amazon's animated GIF captcha programmatically?

3•jrejaud•1h ago•0 comments

Let me AI that for you

https://let-me-ai.com/
2•NordStreamYacht•1h ago•2 comments

Cognitive architecture AI weighted memory, and a falsifiable continuity metric

https://zenodo.org/records/20350249
1•timeless-hayoka•1h ago•0 comments

Authoritative DNS over encrypted transport at OARC 45

https://blog.apnic.net/2026/05/20/authoritative-dns-over-encrypted-transport-at-oarc-45/
1•8organicbits•1h ago•0 comments