Show HN: Run automated ML experiments using Claude Code

https://github.com/killerstorm/claude-torch-template
1•killerstorm•8mo ago
I made a template which can be used to conduct (basic) ML experiments in a fully automated mode: Claude Code will write the code, you only need to provide a working environment and the idea.

The goal was largely to demonstrate that this is possible, specifically to:

* encourage people who want to run some ML experiment but don't have time to code it to actually give it a try
* provide evidence that LLM recursive self-improvement is not "science fiction"

The template is bare bones: it does not come with niceties for monitoring experiments, conducting experiments at scale, etc.

The script assumes that CUDA, Python, PyTorch are already set up. This is quite easy if you rent an instance from https://lambda.ai/ - that's pre-installed. You'd only need to install Claude Code (which itself requires npm) to get it going.
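A quick way to confirm these prerequisites before starting is a small check script. This is only a sketch and not part of the template: the function name `check_environment` is made up for illustration, and it assumes the Claude Code CLI is on PATH as `claude`.

```python
import importlib.util
import shutil
import sys


def check_environment() -> dict:
    """Report which prerequisites mentioned in the post appear to be present."""
    report = {
        "python": sys.version.split()[0],
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
        "npm": shutil.which("npm") is not None,
        # Assumption: the Claude Code CLI is installed under the name "claude".
        "claude": shutil.which("claude") is not None,
    }
    if report["torch_installed"]:
        try:
            # Imported lazily so the check still runs on machines without PyTorch.
            import torch

            report["cuda_available"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken install counts as "CUDA not available" for this check.
            report["cuda_available"] = False
    return report


if __name__ == "__main__":
    for name, value in check_environment().items():
        print(f"{name}: {value}")
```

If `torch_installed` or `cuda_available` comes back `False`, the environment probably needs fixing before Claude Code is pointed at it.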

As I mentioned in the README, the most advanced experiment I tried so far is injection of sentence-embedding memory into a pre-trained transformer.

The timeline on https://ai-2027.com/ assumes that we'll only get AI coding agents which can do ML experiments in 2026, but it seems like it is already possible now. (I spent only a few hours on this; obviously proper AI labs can spend whole days on infrastructure, scaffolding, prompting, fine-tuning, etc.)

Comments

killerstorm•8mo ago
If you actually want to conduct some experiment, I'd suggest:

* first iterate on the idea with o3 (best choice) or another big model (Opus 4, Gemini 2.5 Pro, Grok 3) -- ask it whether it was done before, how to improve it, what the expected outcome is, etc. o3 is really smart, it can explain the intuition behind different choices, etc.
* Python packages are hard. Using a virtual environment (venv) is recommended. `uv` is probably the modern way to manage a venv, but installing torch with CUDA support via uv is a pain; what I found works is:
  * `uv pip install torch --torch-backend=cu126` (uv pip uninstall torch)
* lambda.ai provides a high-quality environment, but it might lack cheaper GPU options.
* as I mentioned in the README, there's no sandboxing, Claude can do pretty much arbitrary stuff...
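For the virtual-environment step, here is a minimal sketch using Python's standard-library `venv` module instead of `uv`. It is a swapped-in illustration rather than the workflow described above: the helper name `create_experiment_venv` and the `.venv` directory name are made up, and torch would still need to be installed into the environment afterwards.

```python
import tempfile
import venv
from pathlib import Path


def create_experiment_venv(target: Path) -> Path:
    """Create an isolated virtual environment at `target`; return its config file path."""
    # with_pip=False keeps creation fast and offline; packages can be
    # installed into the environment afterwards with uv or pip.
    venv.create(target, with_pip=False)
    return target / "pyvenv.cfg"


if __name__ == "__main__":
    # Demonstrate in a throwaway directory so nothing is left behind.
    with tempfile.TemporaryDirectory() as tmp:
        cfg = create_experiment_venv(Path(tmp) / ".venv")
        print(f"created: {cfg.exists()}")
```

Keeping each experiment in its own environment makes it easier to throw away whatever an unsandboxed agent installs there.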
