frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•11mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•11mo ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

Commodity Markets Outlook [pdf]

https://thedocs.worldbank.org/en/doc/f3138644a1e8e2bb631399ae11d6c408-0050012026/original/CMO-Apr...
1•gmays•23s ago•0 comments

Adobe's 'Modern' User Interface Is Just Webpages – Pixel Envy

https://pxlnv.com/linklog/adobe-modern-user-interface/
1•tambourine_man•55s ago•0 comments

Apple Explores Using Intel and Samsung to Build Main Device Chips in the US

https://www.bloomberg.com/news/articles/2026-05-05/apple-explores-using-intel-and-samsung-to-buil...
2•tambourine_man•3m ago•0 comments

A constrained approach to coding agents

https://github.com/brainless/nocodo
1•brainless•10m ago•1 comments

Ask HN: Best Embedding Models?

1•devstein•11m ago•0 comments

Biscuit

https://github.com/yattsu/biscuit
2•unixfg•16m ago•0 comments

An Analysis of the PocketOS Debacle

https://thedailywtf.com/articles/empty-pockets
2•pseudohadamard•18m ago•1 comments

Musk Settles SEC Suit for Underpaying Twitter Investors by $150M for Just $1.5M

https://www.law.com/corpcounsel/2026/05/04/musk-settles-sec-suit-accusing-him-of-underpaying-twit...
3•1vuio0pswjnm7•18m ago•2 comments

The 90-year-old idea behind JEPA models: Canonical Correlation Analysis (CCA)

https://shonczinner.github.io/posts/embedding-prediction/
1•kjshsh123•20m ago•0 comments

Meta, TikTok Recv Personal Data from Health Exchanges Alarming Privacy Experts

https://www.bloomberg.com/features/2026-healthcare-advertising-trackers-privacy/
3•1vuio0pswjnm7•21m ago•0 comments

An LLM agent that runs on any Linux box

https://getclaw.site/#demo
2•kilian-ai•25m ago•0 comments

Continually improving our agent harness

https://cursor.com/blog/continually-improving-agent-harness
1•gmays•26m ago•0 comments

Show HN: A minimalist personal homepage I designed from scratch

https://olzhasshaikenov.com/
1•olzhas23•30m ago•0 comments

Tokens and Dreams

https://charlesleifer.com/blog/tokens-and-dreams/
3•xngbuilds•30m ago•0 comments

AI and the Danger of Cognitive Surrender

https://www.economist.com/business/2026/04/30/ai-and-the-danger-of-cognitive-surrender
4•1vuio0pswjnm7•34m ago•1 comments

Linux, Windows or macOS: Which Operating System to Use in 2026?

https://www.lucasaguiar.xyz/posts/linux-windows-macos-qual-usar-2026/
2•isfttr•40m ago•3 comments

File Approved – File approvals without the back-and-forth

https://fileapproved.com
2•vannventures•41m ago•0 comments

Echon – A Discord alternative built in Tauri/Rust

https://echon-voice.com
2•highest678•46m ago•0 comments

The Art of Operating Systems (2019)

https://denninginstitute.com/pjd/ArtOS2/
3•aragonite•49m ago•0 comments

Amp's GPT 5.5 Model Analysis

https://ampcode.com/models/gpt-5.5
3•goranmoomin•50m ago•0 comments

Pulitzer Prize Winner in International Reporting

https://www.pulitzer.org/winners/dake-kang-garance-burke-byron-tau-aniruddha-ghosal-and-yael-grau...
10•jay_kyburz•1h ago•2 comments

The artful way of the stack-machine

https://www.pepnom.org/post/post.5.may.2026.html
2•mjbq•1h ago•1 comments

Why AI Agents Need Proof Chains, Not Just Logs

https://github.com/rodriguezaa22ar-boop/atlas-trust-infrastructure
3•astra_omnia•1h ago•0 comments

Process-Level Reward Modeling for Agentic Data Analysis

https://arxiv.org/abs/2604.24198
3•gmays•1h ago•0 comments

You can get dragged into a police investigation by proximity alone – for now

https://www.theverge.com/report/919664/chatrie-v-united-states-supreme-court-arguments-fourth-ame...
2•Cider9986•1h ago•0 comments

What I'm Hearing About Cognitive Debt (So Far)

https://margaretstorey.com/blog/2026/02/18/cognitive-debt-revisited/
42•raphaelcosta•1h ago•8 comments

White House considers government reviews for AI models

https://www.reuters.com/world/white-house-considers-vetting-ai-models-before-they-are-released-ny...
2•AlexDragusin•1h ago•0 comments

The Car That Watches You Back: The Advertising Infrastructure of Modern Cars

https://nobodyaskedforthis.lol/posts/connected-car/
2•cadito•1h ago•1 comments

Nobody Here – 'The Story of Vaporwave' (2026) – Full Movie

https://www.youtube.com/watch?v=6kNqw7UdENg
3•fallinditch•1h ago•2 comments

Astronomers uncover > 1k radio galaxies with 'wings,' expanding a rare class

https://phys.org/news/2026-05-astronomers-uncover-radio-galaxies-wings.html
1•wglb•1h ago•1 comments