Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•1y ago
I made a template that can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code writes the code; you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage people who want to run an ML experiment but don't have time to code it to actually give it a try
* provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones: it does not come with niceties for monitoring experiments, conducting experiments at scale, etc.

The script assumes that CUDA, Python, and PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/, where they come pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.
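To make those assumptions concrete, here is a rough preflight sketch (not part of the template) that checks whether the tools mentioned above are visible on a fresh instance. The executable names are my assumptions: `nvidia-smi` for the NVIDIA driver, `npm`, and `claude` for the Claude Code CLI. The function name `preflight` is made up for illustration.

```python
import importlib.util
import shutil

# Executable names are assumptions, not taken from the template:
# `nvidia-smi` ships with the NVIDIA driver, `claude` is the Claude Code CLI.
REQUIRED_TOOLS = ("nvidia-smi", "npm", "claude")


def preflight(tools=REQUIRED_TOOLS, modules=("torch",)):
    """Return {name: bool} for each required executable and Python module."""
    # An executable counts as present if it can be found on PATH.
    report = {tool: shutil.which(tool) is not None for tool in tools}
    # A module counts as present if Python can locate it (it is not imported).
    for module in modules:
        report[module] = importlib.util.find_spec(module) is not None
    return report


if __name__ == "__main__":
    for name, ok in preflight().items():
        print(f"{'ok     ' if ok else 'MISSING'}  {name}")
```

Anything reported as MISSING needs to be installed before handing the environment over to Claude Code. This only checks that things are findable, not that the GPU actually works.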

As I mentioned in the README, the most advanced experiment I've tried so far is injecting a sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only get AI coding agents that can do ML experiments in 2026, but it seems to be possible already. (I spent only a few hours on this; obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•1y ago
If you actually want to conduct some experiment, I'd suggest:

* first iterate on the idea with o3 (best choice) or another big model (Opus 4, Gemini 2.5 Pro, Grok 3): ask whether it has been done before, how to improve it, what the expected outcome is, etc. o3 is really smart; it can explain the intuition behind different choices, etc.
* Python packages are hard. Using a virtual environment (venv) is recommended. `uv` is probably the modern way to manage a venv, but installing torch with CUDA support via uv is a pain. What I found works is:
  * `uv pip install torch --torch-backend=cu126` (`uv pip uninstall torch`)
* lambda.ai provides a high-quality environment, but it might lack cheaper GPU options.
* as I mentioned in the README, there's no sandboxing, so Claude can do pretty much arbitrary stuff...
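After the install, it is worth confirming that the torch build in the venv actually sees CUDA. A minimal sketch, assuming nothing beyond standard PyTorch attributes; the helper name `torch_cuda_report` is made up for illustration:

```python
def torch_cuda_report():
    """Summarize the installed torch build, or say that torch is missing."""
    try:
        import torch
    except ImportError:
        # torch is not installed in the active environment.
        return {"installed": False}
    return {
        "installed": True,
        "torch_version": torch.__version__,
        # CUDA version the wheel was built against; None for CPU-only builds.
        "built_for_cuda": torch.version.cuda,
        # True only if a usable GPU and driver are visible at runtime.
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    for key, value in torch_cuda_report().items():
        print(f"{key}: {value}")
```

If `built_for_cuda` comes back as `None`, a CPU-only wheel got installed, which is the failure mode the `--torch-backend` flag is meant to avoid.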
