Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•11mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage people who want to run some ML experiment but don't have time to code it to actually give it a try
* provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare-bones: it does not come with niceties for monitoring experiments, conducting experiments at scale, etc.

The script assumes that CUDA, Python, and PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/, where all of that is pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.
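To make those prerequisites concrete, here is a minimal preflight sketch (not part of the template). It only checks whether the tools look present. The `preflight` function name is hypothetical, and it assumes the Claude Code executable is called `claude` and that `nvidia-smi` on the PATH is a reasonable proxy for a working NVIDIA driver.

```python
import importlib.util
import shutil
import sys


def preflight() -> dict:
    """Report which prerequisites appear to be present on this machine.

    Hypothetical helper: it only looks for executables on PATH and for an
    importable torch package; it does not verify that any of them work.
    """
    return {
        "python": sys.version.split()[0],
        # find_spec returns None when the package cannot be imported
        "torch_installed": importlib.util.find_spec("torch") is not None,
        # assumption: nvidia-smi on PATH means an NVIDIA driver is installed
        "nvidia_smi_on_path": shutil.which("nvidia-smi") is not None,
        "npm_on_path": shutil.which("npm") is not None,
        # assumption: the Claude Code CLI is installed as `claude`
        "claude_on_path": shutil.which("claude") is not None,
    }


if __name__ == "__main__":
    for name, value in preflight().items():
        print(f"{name}: {value}")
```

If any entry comes back `False`, that is the piece to install before starting an experiment.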

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only a few hours on this; obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•11mo ago
If you actually want to conduct some experiment, I'd suggest:

* first iterate on the idea with o3 (best choice) or another big model (Opus 4, Gemini 2.5 Pro, Grok 3): ask it whether it was done before, how to improve it, what the expected outcome is, etc. o3 is really smart, it can explain the intuition behind different choices, etc.
* Python packages are hard. Using a virtual environment (venv) is recommended. `uv` is probably the modern way to manage a venv, but installing torch with CUDA support via uv is a pain. What I found works is:
  * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch)
* lambda.ai provides a high-quality environment, but it might lack cheaper GPU options.
* as I mentioned in the README, there's no sandboxing, Claude can do pretty much arbitrary stuff...
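After installing torch into the venv, it may be worth confirming that you actually got a CUDA build and not a CPU-only wheel. A minimal sketch, assuming nothing beyond the public `torch` attributes used below; the `torch_cuda_report` name is hypothetical:

```python
def torch_cuda_report() -> dict:
    """Summarize the installed torch build (hypothetical helper)."""
    try:
        import torch
    except ImportError:
        # torch is not installed in the active environment
        return {"torch": None, "cuda_build": None, "cuda_available": False}
    return {
        "torch": torch.__version__,
        # CUDA version the wheel was built against; None for CPU-only builds
        "cuda_build": torch.version.cuda,
        # True only if a usable GPU and driver are visible at runtime
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(torch_cuda_report())
```

A `cuda_build` of `None` suggests a CPU-only wheel was installed, in which case reinstalling with the `--torch-backend` option mentioned above is the thing to try.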
