Solving a Million-Step LLM Task with Zero Errors

https://arxiv.org/abs/2511.09030
32•Anon84•1h ago

Comments

LMKIIW•41m ago
I dunno, even though the authors address its use, choosing Tower of Hanoi as the task doesn't live up to the excitement of the title.
cs702•40m ago
Nice!

Briefly, the idea is to recursively decompose tasks into the simplest possible steps, recursively call (relatively small) LLMs as agents to execute one step at a time, and use a clever voting scheme to choose how to execute each step. The authors use this technique to get a relatively small LLM to solve Towers of Hanoi with 20 rings (1M steps), all of it in natural language.
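To make the decompose-then-vote idea concrete, here is a minimal sketch, not the paper's implementation. It assumes a plain plurality vote as a stand-in for the paper's voting scheme (which isn't detailed here), and `noisy_agent` is a hypothetical placeholder for an LLM call that is handed the correct move and deliberately corrupts some samples.

```python
from collections import Counter
from typing import Callable, Iterator, List, Tuple

Move = Tuple[int, int]  # (source peg, destination peg)


def hanoi_moves(n: int, src: int = 0, aux: int = 1, dst: int = 2) -> Iterator[Move]:
    """Yield the optimal moves for n disks, one tiny step at a time (2**n - 1 in total)."""
    if n == 0:
        return
    yield from hanoi_moves(n - 1, src, dst, aux)  # park the n-1 smaller disks on aux
    yield (src, dst)                              # move the largest disk
    yield from hanoi_moves(n - 1, aux, src, dst)  # bring the smaller disks onto it


def noisy_agent(step: int, sample: int, correct: Move) -> Move:
    """Hypothetical stand-in for an LLM call: some samples return the reversed move."""
    if (step + sample) % 4 == 0:
        return (correct[1], correct[0])
    return correct


def vote(candidates: List[Move]) -> Move:
    """Plain plurality vote (an assumption): return the most frequent candidate."""
    return Counter(candidates).most_common(1)[0][0]


def solve_with_voting(
    n: int, agent: Callable[[int, int, Move], Move], samples: int = 5
) -> List[List[int]]:
    """Run every step through `samples` agent calls and apply the winning move."""
    pegs = [list(range(n, 0, -1)), [], []]
    for step, correct in enumerate(hanoi_moves(n)):
        a, b = vote([agent(step, s, correct) for s in range(samples)])
        # Reject moves from an empty peg or onto a smaller disk.
        if not pegs[a] or (pegs[b] and pegs[b][-1] < pegs[a][-1]):
            raise ValueError(f"illegal move {(a, b)} at step {step}")
        pegs[b].append(pegs[a].pop())
    return pegs


print(2**20 - 1)                          # steps for 20 rings: 1048575
print(solve_with_voting(6, noisy_agent))  # [[], [], [6, 5, 4, 3, 2, 1]]
```

With five samples per step the corrupted answers are always outvoted in this toy setup; with `samples=1` the very first step already goes wrong.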

The most obvious question is whether other tasks, more interesting -- less "rote" -- than Towers of Hanoi, can similarly be recursively decomposed into simple steps. I'm not sure that's always possible.

adastra22•13m ago
Why not? That's basically how NASA manages large projects.
zer00eyz•28m ago
On the surface this is an interesting concept...

The paper, however, meh...

No mention of MoE. One would think this is a logical evolution of that but not a mention (that I saw). Its own rubric for the task, Towers of Hanoi, was admittedly weak.

LLM papers are starting to look like the last decade of JS frameworks and tools, only with less code and more academics. That's disappointing, because I think a lack of pragmatism and grounding is now holding the field back...

awei•21m ago
One issue I see is when steps in a plan depend on one another: you cannot know all the next steps exactly before seeing the results of the previous ones, and you may sometimes have to backtrack.
htrp•13m ago
> The approach relies on an extreme decomposition of a task into subtasks, each of which can be tackled by focused microagents. The high level of modularity resulting from the decomposition allows error correction to be applied at each step through an efficient multi-agent voting scheme.

It's a big if whether the decomposition and the voting happen accurately for anything other than toy problems.
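For a rough sense of why per-step error correction matters at this scale, here is a back-of-the-envelope sketch. It assumes every step fails independently with the same probability, which is a simplification and not a claim from the paper; the function names are made up for illustration.

```python
import math


def success_probability(p_error: float, steps: int) -> float:
    """Chance of zero errors over `steps` independent steps: (1 - p_error) ** steps."""
    return math.exp(steps * math.log1p(-p_error))


def max_error_rate(steps: int, target: float) -> float:
    """Largest per-step error rate that still finishes error-free with probability `target`."""
    return -math.expm1(math.log(target) / steps)


steps = 2**20 - 1  # moves needed for 20 rings
print(success_probability(0.001, steps))  # 99.9% per-step accuracy: effectively zero
print(max_error_rate(steps, 0.5))         # roughly 6.6e-07 for a coin-flip chance of success
```

Under that independence assumption, even a one-in-a-thousand slip rate makes an error-free million-step run essentially impossible without some correction at each step.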

andai•12m ago
I have ADHD and the same approach works for me. (In fact, most days it is essential!)
andai•9m ago
Worth opening the pdf just for the graph on page 1.

Cloudflare Global Network experiencing issues

https://www.cloudflarestatus.com/?t=1
2094•imdsm•6h ago•1355 comments

Gemini 3 for developers: New reasoning, agentic capabilities

https://blog.google/technology/developers/gemini-3-developers/
330•janpio•2h ago•104 comments

Gemini 3 Pro Preview Live in AI Studio

https://aistudio.google.com/prompts/new_chat?model=gemini-3-pro-preview
406•preek•3h ago•170 comments

Pebble, Rebble, and a Path Forward

https://ericmigi.com/blog/pebble-rebble-and-a-path-forward/
72•phoronixrly•58m ago•12 comments

A Day at Hetzner Online in the Falkenstein Data Center

https://www.igorslab.de/en/a-day-at-hetzner-online-in-the-falkenstein-data-center-insights-into-s...
73•speckx•2h ago•19 comments

5 Things to Try with Gemini 3 Pro in Gemini CLI

https://developers.googleblog.com/en/5-things-to-try-with-gemini-3-pro-in-gemini-cli/
78•keithba•2h ago•26 comments

Gemini 3

https://blog.google/products/gemini/gemini-3/
326•meetpateltech•2h ago•95 comments

Solving a Million-Step LLM Task with Zero Errors

https://arxiv.org/abs/2511.09030
33•Anon84•1h ago•8 comments

Strix Halo's Memory Subsystem: Tackling iGPU Challenges

https://chipsandcheese.com/p/strix-halos-memory-subsystem-tackling
25•PaulHoule•1h ago•9 comments

Google Brings Gemini 3 AI Model to Search and AI Mode

https://blog.google/products/search/gemini-3-search-ai-mode/
71•CrypticShift•2h ago•5 comments

How Quake.exe got its TCP/IP stack

https://fabiensanglard.net/quake_chunnel/index.html
366•billiob•10h ago•75 comments

Nearly all UK drivers say headlights are too bright

https://www.bbc.com/news/articles/c1j8ewy1p86o
461•YeGoblynQueenne•4h ago•448 comments

Do Not Put Your Site Behind Cloudflare If You Don't Need To

https://huijzer.xyz/posts/123/do-not-put-your-site-behind-cloudflare-if-you-dont
332•huijzer•5h ago•249 comments

Show HN: Guts – convert Golang types to TypeScript

https://github.com/coder/guts
7•emyrk•26m ago•0 comments

Google Antigravity

https://antigravity.google/
182•Fysi•2h ago•130 comments

Show HN: Optimizing LiteLLM with Rust – When Expectations Meet Reality

https://github.com/neul-labs/fast-litellm
19•ticktockten•1h ago•3 comments

Google Antigravity, a New Era in AI-Assisted Software Development

https://antigravity.google/blog/introducing-google-antigravity
181•meetpateltech•2h ago•138 comments

The Miracle of Wörgl

https://scf.green/story-of-worgl-and-others/
98•simonebrunozzi•7h ago•55 comments

Gemini 3 Pro Model Card

https://pixeldrain.com/u/hwgaNKeH
399•Topfi•6h ago•262 comments

A squeaky nail, or the wheel that sticks out

https://prashanth.world/squeaky-nail/
4•mangoman•6d ago•2 comments

Beauty in/of mathematics: tessellations and their formulas

https://www.tandfonline.com/doi/full/10.1080/00036811.2025.2510472
16•QueensGambit•5d ago•0 comments

Short Little Difficult Books

https://countercraft.substack.com/p/short-little-difficult-books
89•crescit_eundo•3h ago•44 comments

Mathematics and Computation (2019) [pdf]

https://www.math.ias.edu/files/Book-online-Aug0619.pdf
44•nill0•5h ago•9 comments

Ruby 4.0.0 Preview2 Released

https://www.ruby-lang.org/en/news/2025/11/17/ruby-4-0-0-preview2-released/
152•pansa2•4h ago•51 comments

Looking for Hidden Gems in Scientific Literature

https://elicit.com/blog/literature-based-discovery
10•ravenical•5d ago•1 comments

How many video games include a marriage proposal? At least one

https://32bits.substack.com/p/under-the-microscope-ncaa-basketball
308•bbayles•5d ago•74 comments

GoSign Desktop RCE flaws affecting users in Italy

https://www.ush.it/2025/11/14/multiple-vulnerabilities-gosign-desktop-remote-code-execution/
45•ascii•5h ago•19 comments

I've Wanted to Play That 'Killer Shark' Arcade Game Briefly Seen in 'Jaws'

https://www.remindmagazine.com/article/15694/jaws-arcade-video-game-killer-shark-atari-sega-elect...
23•speckx•4d ago•8 comments

Langfuse (YC W23) Hiring OSS Support Engineers in Berlin and SF

https://jobs.ashbyhq.com/langfuse/5ff18d4d-9066-4c67-8ecc-ffc0e295fee6
1•clemo_ra•11h ago

Azure hit by 15 Tbps DDoS attack using 500k IP addresses

https://www.bleepingcomputer.com/news/microsoft/microsoft-aisuru-botnet-used-500-000-ips-in-15-tb...
457•speckx•1d ago•287 comments