frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•1y ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•1y ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

The Vanta AI Quality Eval Maturity Model

https://www.vanta.com/resources/vanta-ai-quality-evaluation-maturity-model
1•hamelj•5s ago•0 comments

Show HN: Automated Outbound in Your Terminal

https://posthorn.sh/
1•ejcho623•17s ago•0 comments

D-Wave Riding the Dual-Rail for Its Gate-Model Quantum Ambitions

https://www.nextplatform.com/compute/2026/06/10/d-wave-riding-the-dual-rail-for-its-gate-model-qu...
1•rbanffy•42s ago•0 comments

DataPav. Click a DataFrame column, see where it came from

https://datapav.lpavs.com/
1•PaveLuchkov•1m ago•0 comments

Looking Inside Chromium's On-Device AI Stack

https://www.island.io/blog/looking-inside-chromiums-on-device-ai-stack
1•wild_pointer•2m ago•0 comments

Agentic Code Must Be Human Auditable

https://dockyard.com/blog/2026/06/10/it-has-to-be-human-auditable
2•bcardarella•2m ago•0 comments

Anthropic's Fable 5 Is Opus on a Good Day

https://www.williamangel.net/blog/2026/06/10/anthropic-fable.html
1•datadrivenangel•3m ago•0 comments

Bridger Is Building an Osint Dossier in a Cute Font

https://ethanplant.ca/writing/bridger/
1•ethanplant•4m ago•0 comments

Paramount accuses Netflix of "scorched-earth campaign" against WBD merger

https://arstechnica.com/tech-policy/2026/06/netflix-trying-to-poison-regulators-about-wbd-merger-...
1•rbanffy•5m ago•0 comments

Global watchdog calls for tighter controls on agentic AI in finance

https://www.reuters.com/legal/transactional/global-watchdog-calls-tighter-controls-agentic-ai-fin...
1•1vuio0pswjnm7•5m ago•0 comments

Why the blockbuster SpaceX IPO may spell more bad news for crypto

https://www.reuters.com/legal/government/why-blockbuster-spacex-ipo-may-spell-more-bad-news-crypt...
1•JumpCrisscross•8m ago•0 comments

Frost: Disk Drive Is the Snitch

https://protonprivacy.substack.com/p/frost-your-disk-drive-is-the-snitch
2•daesorin•8m ago•0 comments

The Lockdown Dissidents (A WSJ Documentary)

https://www.youtube.com/watch?v=O87Et-w3vdg
1•mudil•9m ago•0 comments

CastIn2007: A 2007 styled YouTube clone I built out of boredom

https://cast-in2007.edgeone.app/
1•colinnW•10m ago•0 comments

AEO: Getting Started

https://hedge-ops.com/posts/answer-engine-optimization-playbook/
1•mooreds•12m ago•0 comments

Linux Foundation's Latest AI Effort Is Around AI Asset and Data Exchange

https://www.phoronix.com/news/Linux-Foundation-OpenSharing
1•daesorin•13m ago•0 comments

Object-Level Explanations for Image Geolocation Models: A GeoGuessr Use-Case

https://arxiv.org/abs/2605.00912
1•PaulHoule•13m ago•0 comments

Virtual Mailbox vs. Lawyer for Incorporating

1•svenv•14m ago•1 comments

AMA: I'm a Random HN User, ask me anything (and I might respond)

2•SpyCoder77•16m ago•11 comments

Show HN: AgentCarousel – behavioral tests for AI agents, with signed evidence

https://github.com/agentcarousel/agentcarousel
1•neemsio•16m ago•0 comments

Social Security Now Expects Shortfall Earlier, in Late 2032

https://www.wsj.com/politics/policy/social-security-trust-insolvency-2032-d26bf25e
4•JumpCrisscross•16m ago•2 comments

An Early Step on the Long Road to Photosynthesis

https://www.quantamagazine.org/an-early-step-on-the-long-strange-road-to-photosynthesis-20260610/
1•daesorin•16m ago•0 comments

New Anthropic privacy policy: age/identity verification for consumer accounts

https://www.anthropic.com/legal/privacy
2•vhantz•16m ago•1 comments

Air Canada pilot accused of flying over 900 flights without valid license

https://www.cbsnews.com/news/air-canada-pilot-arrested-hundreds-flights-no-valid-license/
1•naturalmovement•17m ago•0 comments

DiffusionGemma: 4x Faster Text Generation

https://blog.google/innovation-and-ai/technology/developers-tools/diffusion-gemma-faster-text-gen...
12•meetpateltech•18m ago•2 comments

Show HN: Extend UI – open-source UI kit for modern document apps

https://www.extend.ai/ui
1•kbyatnal•18m ago•0 comments

Detection of animal sounds using data augmentation and transfer learning

https://www.nature.com/articles/s41598-026-48308-6
1•PaulHoule•19m ago•0 comments

Show HN: AI watched my screen for a year. Weather beat sleep

https://donethat.ai/blog/dogfooding-donethat
1•christoph123•20m ago•0 comments

It Is the Nature of Desire Not to Be Satisfied

https://kammartinez.wordpress.com/2012/03/03/it-is-the-nature-of-desire-not-to-be-satisfied-a-rev...
1•jruohonen•20m ago•0 comments

Anthropic support does not exist

https://mg0x7be.github.io/anthropic-support-does-not-exist.html
1•VimEscapeArtist•21m ago•0 comments