frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•10mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage to people who want to run some ML experiment but don't have time t code it to actually give it a try * provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones, it does not come with niceties for monitoring experiments, conduct experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only be able to get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only few hours on this, obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•10mo ago
If you actually want to conduct some experiment, I'd suggest:

* fist iterate on the idea with o3 (best choice) or other big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what is the expected outcome, etc. o3 is really smart, it can explain intuition between different choices, etc. * Python packages are hard. Using virtual environment (venv) is recommended. `uv` is probably the modern way to manage venv, but installing torch with CUDA support via uv is pain, what I found works is: * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch) * lambda.ai provides high-quality environment, but it might lack cheaper GPU options. * as I mentioned in README, there's no sandboxing, Claude can do pretty much arbitrary stuff...

We Built a Metric Simulator

https://simpleobservability.com/blog/metric-simulator
1•khazit•48s ago•0 comments

NSA is using Anthropic's Mythos despite blacklist

https://www.reuters.com/business/us-security-agency-is-using-anthropics-mythos-despite-blacklist-...
1•Palmik•1m ago•0 comments

The first open-weights Large Transaction Model, EWE-1

https://sistemalabs.com/blog/introducing-ewe-1
1•0xideas•14m ago•0 comments

What Makes Docs Beautiful?

https://passo.uno/what-makes-docs-beautiful/
1•theletterf•14m ago•0 comments

How I made a budget tracker for my gf because she kept complaining about Sheets

https://edm115.dev/blog/2026/02/15/how-i-made-spendly/
1•EDM115•16m ago•0 comments

Show HN: MyKana, a Japanese learning app I built for my own study

https://mykana.app/
1•zerratar•19m ago•0 comments

ACM CCS 2026 Between-Cycle Transparency Report

https://github.com/ACM-CCS-2026/Transparency-Report
1•jruohonen•21m ago•0 comments

Bun v1.3.13

https://bun.com/blog/bun-v1.3.13
3•Erenay09•25m ago•0 comments

ShannonBase is database agent platform

https://medium.com/@shannon.data.tech/shannonbase-is-databas-agent-platform-2e914ccfc45e
1•shannon-data-ai•31m ago•1 comments

Architecture is all you need (How to think about agentic design)

https://x.com/compose/articles/edit/2046045421844455424
1•Kushal6070•32m ago•0 comments

Kindle E-Readers Released in 2012 or Earlier

https://www.amazon.com/gp/help/customer/display.html?nodeId=TRXsYxKJr4WTdsVs2P
1•bandwitch•34m ago•1 comments

The AI-Ready Product Data Framework for B2B Commerce

https://virtocommerce.com/assets/ai-ready-pim-framework
2•lizzieyo•34m ago•0 comments

How (and why) we rewrote our production C++ front end infrastructure in Rust

https://blog.nearlyfreespeech.net/2026/04/17/how-and-why-we-rewrote-our-production-c-frontend-inf...
1•birdculture•34m ago•0 comments

Show HN: Busybee - a FIFO build queue for multi-agent dev workflows

https://github.com/githappens/busybee
1•playfultones•35m ago•1 comments

WhatsApp Plus is rolling out new premium features

https://wabetainfo.com/whatsapp-plus-is-rolling-out-new-premium-features/
1•fwn•36m ago•0 comments

DuckDB Now Speaks Dutch

https://duckdb.org/2026/04/01/duckdb-now-speaks-dutch
2•saeedesmaili•38m ago•0 comments

Understanding the Go Runtime: The Network Poller

https://internals-for-interns.com/posts/go-netpoller/
1•valyala•39m ago•0 comments

Salesforce Stopped Paying for Salesforcefoundation.org

1•october8140•39m ago•1 comments

Smartphones, Online Music Streaming, and Traffic Fatalities

https://www.nber.org/papers/w34866
1•nixass•44m ago•0 comments

Controlling the secondary fan on Minisforum AI Pro HX 370

https://github.com/MiniPcThinker/minisforum_ai_pro_hx_370_aux_fan_controller/blob/main/INVESTIGAT...
1•minipcthinker•44m ago•0 comments

Prediction Markets: Last Week Tonight with John Oliver [video]

https://www.youtube.com/watch?v=ZN4njIQcSR4
3•Topfi•54m ago•0 comments

File System Wars

https://bytearchitect.io/macos-security/theory/Filesystem-Wars-Why-Your-Choice-of-Storage-is-Actu...
3•rantingdemon•55m ago•0 comments

Email Newsletter Management

https://gemvoyage.net/
1•princesauro•55m ago•0 comments

Bloomberg Terminal is ugly and clunky, but everyone uses it. Even their enemies

https://twitter.com/mb_ghalibaf/status/2045986841220772123
1•haebom•56m ago•0 comments

Neuro-Symbolic Ode Discovery with Latent Grammar Flow

https://arxiv.org/abs/2604.16232
1•ahsillyme•58m ago•0 comments

ZeusHammer – Built an AI Agent That "Thinks Locally"

https://github.com/pengrambo3-tech/ZeusHammer
1•RamboZeusHammer•59m ago•0 comments

New Debian Project Leader Elected for 2026

https://www.phoronix.com/news/Debian-DPL-Sruthi-Chandran
2•axbyte•1h ago•0 comments

Dentavive Legit or Scam in 2026? ( Hype or Trusted Choice?) [pdf]

https://fsc.org/sites/default/files/webform/problem_with_unacceptable_activi/_sid_/Dentavive1Guid...
1•hauzlapy•1h ago•0 comments

Show HN: I Recreated Encarta's MindMaze

https://medium.com/@laurentiu.raducu/i-recreated-encartas-mindmaze-and-added-it-to-select-supply-...
6•laurentiurad•1h ago•4 comments

Show HN: Keshro, plan and execute migrations with AI agents

https://keshro.com
1•jlewitt1•1h ago•1 comments