frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Auto-Architecture: Karpathy's Loop, pointed at a CPU

https://github.com/FeSens/auto-arch-tournament/blob/main/docs/auto-arch-tournament-blog-post.md
35•fesens•11h ago

Comments

sho_hn•1h ago
Salient on the value of the verifier. Matches my experience in the last two quarters.

Nice detail on the encountered failures. Very similar experiences with my own loops against testsuites.

Great post. A snapshot in time.

pteetor•57m ago
In case you are unfamiliar with Karpathy's Loop[1], it is a genetic algorithm[2] where the genetic "mutations" are clever-but-random ideas generated by an LLM agent, aimed at improving a system.

  (1) Let the LLM randomly perturbate the system.
  (2) Measure the system's performance.
  (3a) If the perturbation improved performance, keep the change.
  (3b) Otherwise, don't.
  (4) Repeat
[1] https://github.com/karpathy/autoresearch

[2] https://en.wikipedia.org/wiki/Genetic_algorithm

fc417fc802•53m ago
Extremely interesting but I don't understand why it was written by an LLM. Either the frontier models are far better than I realized or else writing this document required a lot of manual work regardless at which point why not keep it in your own voice?

> The agent did not know that would also halve the LUT count. It found out by doing it and watching the synthesizer.

So I guess this is an example of an LLM anthropomorphizing and making wild conjectures about the internal workings of a different LLM.

thin_carapace•42m ago
> "If you can write the rules down, an agent will satisfy them faster than your team will."

a fantastic opportunity to become the next next big thing and write a verifier verifier.

at the hypothesized inflexion point where AI instantly performs exactly as commanded, what happens to heavily regulated industries like medical? do we get huge leaps and bounds everywhere EXCEPT where it matters, or is regulation going to be handed over to a verifier verifier?

outside1234•33m ago
Has anyone actually written a verifier for a business / project?
sho_hn•29m ago
I'd say "a verifier" here is a loose term. A great testsuite is a verifier. I've done reverse-engineering projects that involved generating trace logs from the object under test, having a reimplementation emit the same logs, and running strict comparisons.

OP's post is basically pointing out what certainly many others have independently discovered: Your agent-based dev operation is as good as the test rituals and guard rails you give the agents.

Show HN: Auto-Architecture: Karpathy's Loop, pointed at a CPU

https://github.com/FeSens/auto-arch-tournament/blob/main/docs/auto-arch-tournament-blog-post.md
36•fesens•11h ago•7 comments

Show HN: Drive any macOS app in the background without stealing the cursor

https://github.com/trycua/cua
71•frabonacci•12h ago•25 comments

Show HN: Live Sun and Moon Dashboard with NASA Footage

https://www.lumara-space.app/
169•beeswaxpat•14h ago•58 comments

Show HN: GeoTraceroute – Traceroutes on a 3D globe and submarine cables

https://geotraceroute.com
2•Himred•2h ago•0 comments

Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview

https://github.com/dirac-run/dirac
367•GodelNumbering•1d ago•141 comments

Show HN: I built another to do list. But it does a lot

https://apps.apple.com/us/app/rotation-list-shared-to-do/id6758746324
3•toddh•5h ago•1 comments

Show HN: ClusterdOS – Kubernetes without the platform team

https://gitlab.com/aranya-tech/public/clusterdos
2•druid•5h ago•1 comments

Show HN: Utilyze – an open source GPU monitoring tool more accurate than nvtop

https://www.systalyze.com/utilyze
114•ManyaGhobadi•1d ago•28 comments

Show HN: A terminal spreadsheet editor with Vim keybindings

https://github.com/garritfra/cell
104•garritfra•1d ago•49 comments

Show HN: Effected Keyboard 2 – Effects as You Type

2•vitalipom•7h ago•0 comments

Show HN: Turning a Gaussian Splat into a videogame

https://blog.playcanvas.com/turning-a-gaussian-splat-into-a-videogame/
236•yak32•5d ago•63 comments

Show HN: I wrote a DOOM clone in my own programming language

https://spectrelang.org/log/devlog#cubedoom
7•pizza_man•16h ago•3 comments

Show HN: A TUI for Markdown view an editing

https://mdee.bkh.dev
3•cloked•8h ago•0 comments

Show HN: Waiting for LLMs Suck – Give your user a game

https://github.com/ftaip/waiting-game
21•dalemhurley•1d ago•12 comments

Show HN: I mapped the latest UK fuel prices by county

https://fuelfox.uk/regional
4•sircipher•9h ago•0 comments

Show HN: Open Bias – proxy that enforces agent behavior at runtime

https://github.com/open-bias/open-bias/
11•algomaniac•9h ago•3 comments

Show HN: Devicons, +1300 logos and icons in React, SVG, and icon format

https://devicons.io/
8•vorillaz•19h ago•2 comments

Show HN: Ragnerock, an AI data analysis tool

https://www.ragnerock.com
8•mmahowald27•11h ago•4 comments

Show HN: SyncVibe – Code with friends in the terminal, each with your own AI

https://syncvibe.online/
9•curious1008•13h ago•4 comments

Show HN: VoiceGoat – A vulnerable voice agent for practicing LLM attacks

https://github.com/redcaller/voice-goat
6•xmhatx•13h ago•1 comments

Show HN: AgentSwift – Open-source iOS builder agent

https://github.com/hpennington/agentswift
47•hpen•1d ago•9 comments

Show HN: The Unix Magic poster, annotated (updated)

https://github.com/drio/unixmagic
60•drio•2d ago•7 comments

Show HN: Unusual Wikipedia

https://unusualwiki.nk412.com/
22•grilledchickenw•1d ago•3 comments

Show HN: How much of the Linux kernel is written by AI?

https://assisted-by.dev/
7•snek14•14h ago•3 comments

Show HN: Tiao, A two-player turn-based board game

https://playtiao.com
62•trebeljahr•2d ago•29 comments

Show HN: Free textbook on engineering thermodynamics

https://thermodynamicsbook.com/
174•2DcAf•2d ago•47 comments

Show HN: PrePrompt – rewrites vague prompts before they reach the LLM

https://preprompt.org/
8•yashdeeptehlan•1d ago•4 comments

Show HN: Implementing Patio11's "Dangerous Professional" as a Claude Code Plugin

https://playground.tetraresearch.io/p/implementing-patio11s-dangerous-professional
3•tawb•16h ago•1 comments

Show HN: Startup Equity Adventure Game

https://options-game-polymathrobotics.pythonanywhere.com/
35•iliabara•2d ago•26 comments

Show HN: Gova – The declarative GUI framework for Go

https://github.com/NV404/gova
143•aliezsid•4d ago•29 comments