frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

94% zero-shot in a shifting gridworld, no retraining

1•heavymemory•32m ago
Key door puzzle. Grid: 8 by 11. Two rooms. One key somewhere in the left room. Two coloured doors in the wall. Goal on the far side.

Train PPO or DQN on one layout and it solves that layout. Shift the key, add or move a wall passage, alter the distractor key setup and performance collapses. The usual story of memorising geometry instead of the rules.

Instead, I train a small set of skills, like find the correct key, go to the passage, open the correct door, reach the goal. Each skill trained once, then frozen. When the layout changes, nothing updates. It retrieves the right skills from longterm memory and composes them.

State space is already large if treated symbolically. Roughly 50 reachable cells for the agent, 50 for the key, 4 door configurations, multiple passage layouts, 3 inventory values, 4 headings. Around 360,000 distinct logical states from conservative counting.

At composition time, the system only reuses states it actually encountered during skill training. No gradients. No online policy adaptation.

Benchmark: 2500 zeroshot episodes with randomised keys and randomised passages. No retraining. Solve rate about 94%.

Frozen skills. New layouts. Still works.

So here's the real question: If hierarchical RL should solve this, why does it still struggle with such a tiny, structured world unless you train it across every variation? Or am I wrong?

And what’s actually being learned when a system generalises to layouts it has never seen?

I'm interested in that discussion. The gap between, this looks trivial and most agents don't generalise, feels like the interesting thing here.

Show HN: I Wrote a Field Manual on Self-Hosting(Immich,ZFS,Docker)Free on Kindle

https://www.amazon.com/dp/B0FY3XXPNV
1•devmicrosystems•9m ago•0 comments

Make It Easy for Humans

https://tombedor.dev/make-it-easy-for-humans/
1•jjfoooo4•10m ago•0 comments

Gemini Apps limits and upgrades for Google AI subscribers

https://support.google.com/gemini/answer/16275805?hl=en
1•doener•10m ago•0 comments

Compiler Explorer now supports Racket

https://godbolt.org/z/z3WffbzaY
1•azhenley•12m ago•0 comments

It's mathematically highly likely that there is life elsewhere in the universe

https://www.sciencedirect.com/science/article/pii/S0094576525006599?via%3Dihub
3•Rogach•14m ago•2 comments

Token Visualizer

https://github.com/PeterHdd/token-visualization
1•peterhddcoding•14m ago•0 comments

Zenroom – No-code cryptographic virtual machine

https://zenroom.org/
1•smartmic•23m ago•1 comments

94% zero-shot in a shifting gridworld, no retraining

1•heavymemory•32m ago•0 comments

Mint Is Not TeX

https://mint.ubavic.rs/
3•ubavic•33m ago•2 comments

The Fastest Image Diffing Engine You've Never Heard Of

https://vizzly.dev/blog/honeydiff-vs-odiff-pixelmatch-benchmarks/
2•Robdel12•35m ago•0 comments

Eraser: A Dynamic Data Race Detector for Multithreaded Programs (1997) [pdf]

https://web.stanford.edu/class/archive/cs/cs240/cs240.1054/readings/Tocs97.pdf
1•todsacerdoti•38m ago•0 comments

He Wants a New Start. So He Is Taking the Hardest Driving Test in the World

https://www.nytimes.com/2025/11/24/world/europe/london-black-cab-taxi-driving-test.html
1•bookofjoe•44m ago•1 comments

Get Your Kid a Watch

https://www.theatlantic.com/technology/2025/11/smartwatch-kids-screen-time/684975/
4•fortran77•44m ago•1 comments

Pinball Shopify

https://bfcm.shopify.com/
3•SnaKeZ•46m ago•0 comments

Americans no longer see four-year college degrees as worth the cost

https://www.nbcnews.com/politics/politics-news/poll-dramatic-shift-americans-no-longer-see-four-y...
23•jnord•49m ago•11 comments

Memory-Graph – Knowledge Graph Memory for Claude Code with SQLite/Neo4j/Memgraph

https://github.com/gregorydickson/memory-graph
2•gregorydickson•51m ago•1 comments

Nobara Project: Fedora Linux with user-friendly fixes added to it

https://nobaraproject.org/
2•doener•57m ago•0 comments

Braids Osu Article [pdf] (go state)

https://people.math.osu.edu/chmutov.1/wor-gr-05-20/wor-gr-su20/braids-2020.pdf
1•marysminefnuf•58m ago•0 comments

Human3R: Everyone Everywhere All at Once

https://fanegg.github.io/Human3R/
1•pcooper•1h ago•0 comments

AI Teddy Bear That Talked Fetishes and Knives Is Back on the Market

https://gizmodo.com/ai-teddy-bear-that-talked-fetishes-and-knives-is-back-on-the-market-2000691509
3•gnabgib•1h ago•0 comments

Show HN: I Recreated the Windows Longhorn (2004) Aurora Effect in HTML5 Canvas

https://github.com/brainvine/longhorn-aurora
2•AntonioEritas•1h ago•0 comments

Retro RenderMan: shading food for 'Ratatouille' (2020)

https://beforesandafters.com/2020/07/22/retro-renderman-shading-food-for-ratatouille/
1•HL33tibCe7•1h ago•0 comments

Lobste.rs

https://lobste.rs/
3•dtj1123•1h ago•0 comments

Show HN: Xlerb – A Compiled "Forth" for the Beam

2•shawa_a_a•1h ago•0 comments

Show HN: I made a free log anonymizer in the browser

https://www.getloglens.com/tools/log-sanitizer
2•wazzaaaa•1h ago•1 comments

Show HN: Raytha v1.5 – open-source .NET CMS with a new visual page builder

https://github.com/RaythaHQ/raytha
1•apexdodge•1h ago•0 comments

Show HN: ClearHearAI-The Essential App for Hearing Impaired and Deaf Communities

https://clearhearai.com/
1•justinos•1h ago•0 comments

New bill would revive single-room occupancy apartments in NYC

https://www.6sqft.com/new-bill-would-revive-single-room-occupancy-apartments-in-nyc/
8•geox•1h ago•7 comments

Bazzite: The next generation of Linux gaming

https://bazzite.gg/
28•doener•1h ago•3 comments

Show HN: Browser Based Softbody Physics

https://www.maanraket.nl/experiments/peachy-keen/
2•cowboy_henk•1h ago•0 comments