frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: ChartGPU – WebGPU-powered charting library (1M points at 60fps)

https://github.com/ChartGPU/ChartGPU
430•huntergemmer•7h ago•134 comments

Spotify won court order against Anna's Archive, taking down .org domain

https://arstechnica.com/tech-policy/2026/01/annas-archive-said-spotify-scrape-didnt-cause-domain-...
28•voxadam•1h ago•11 comments

Claude's new constitution

https://www.anthropic.com/news/claude-new-constitution
195•meetpateltech•6h ago•129 comments

Challenges in join optimization

https://www.starrocks.io/blog/inside-starrocks-why-joins-are-faster-than-youd-expect
24•HermitX•5h ago•1 comments

OpenAI API Logs: Unpatched data exfiltration

https://www.promptarmor.com/resources/openai-api-logs-unpatched-data-exfiltration
25•takira•2h ago•10 comments

Show HN: Rails UI

https://railsui.com/
81•justalever•3h ago•58 comments

Skip is now free and open source

https://skip.dev/blog/skip-is-free/
213•dayanruben•7h ago•68 comments

Jerry (YC S17) Is Hiring

https://www.ycombinator.com/companies/jerry-inc/jobs/QaoK3rw-software-engineer-core-automation-ma...
1•linaz•1h ago

The WebRacket language is a subset of Racket that compiles to WebAssembly

https://github.com/soegaard/webracket
61•mfru•4d ago•13 comments

Three types of LLM workloads and how to serve them

https://modal.com/llm-almanac/workloads
14•charles_irl•6h ago•1 comments

TeraWave Satellite Communications Network

https://www.blueorigin.com/news/blue-origin-introduces-terawave-space-based-network-for-global-co...
97•T-A•3h ago•62 comments

Letting Claude play text adventures

https://borretti.me/article/letting-claude-play-text-adventures
39•varjag•5d ago•16 comments

eBay explicitly bans AI "buy for me" agents in user agreement update

https://www.valueaddedresource.net/ebay-bans-ai-agents-updates-arbitration-user-agreement-feb-2026/
66•bdcravens•1h ago•33 comments

Waiting for dawn in search: Search index, Google rulings and impact on Kagi

https://blog.kagi.com/waiting-dawn-search
177•josephwegner•5h ago•119 comments

What if AI is both good and not that disruptive?

https://deadneurons.substack.com/p/what-if-ai-is-both-really-good-and
13•nr378•1h ago•9 comments

TrustTunnel: AdGuard VPN protocol goes open-source

https://adguard-vpn.com/en/blog/adguard-vpn-protocol-goes-open-source-meet-trusttunnel.html
24•kumrayu•5h ago•2 comments

Setting Up a Cluster of Tiny PCs for Parallel Computing

https://www.kenkoonwong.com/blog/parallel-computing/
9•speckx•3h ago•0 comments

Slouching Towards Bethlehem – Joan Didion (1967)

https://www.saturdayeveningpost.com/2017/06/didion/
44•jxmorris12•4h ago•1 comments

Show HN: UltraContext – A simple context API for AI agents with auto-versioning

https://ultracontext.ai/
15•ofabioroma•7h ago•15 comments

Tell HN: 2 years building a kids audio app as a solo dev – lessons learned

9•oliverjanssen•8h ago•0 comments

SIMD programming in pure Rust

https://kerkour.com/introduction-rust-simd
24•randomint64•2d ago•5 comments

Show HN: TerabyteDeals – Compare storage prices by $/TB

https://terabytedeals.com
10•vektor888•1h ago•9 comments

Scientists find a way to regrow cartilage in mice and human tissue samples

https://www.sciencedaily.com/releases/2026/01/260120000333.htm
213•saikatsg•4h ago•60 comments

Open source server code for the BitCraft MMORPG

https://github.com/clockworklabs/BitCraftPublic
17•sfkgtbor•5h ago•3 comments

Show HN: Semantic search engine for Studio Ghibli movie

https://ghibli-search.anini.workers.dev/
10•aninibread•8h ago•2 comments

Show HN: RatatuiRuby wraps Rust Ratatui as a RubyGem – TUIs with the joy of Ruby

https://www.ratatui-ruby.dev/
7•Kerrick•4d ago•1 comments

Show HN: Amazon shopping automation without vision

4•tonyww•7h ago•1 comments

Nested code fences in Markdown

https://susam.net/nested-code-fences.html
168•todsacerdoti•9h ago•57 comments

Can you slim macOS down?

https://eclecticlight.co/2026/01/21/can-you-slim-macos-down/
141•ingve•14h ago•189 comments

JPEG XL Test Page

https://tildeweb.nl/~michiel/jxl/
148•roywashere•5h ago•105 comments
Open in hackernews

Letting Claude play text adventures

https://borretti.me/article/letting-claude-play-text-adventures
39•varjag•5d ago

Comments

skybrian•1h ago
It seems like asking Claude to keep notes somehow would work better. An AGENTS file and a TODO file? An issue tracker like beads? Lots of things to try.
pflenker•1h ago
For a game like anchorhead, which is famous in its niche, shouldn’t Claude already know it sufficiently to just solve it right away? I would expect that its data source contained multiple discussions and walkthroughs of the game.
ratg13•57m ago
It's very likely the model didn't stop to question if the game they were playing was something they knew already, and just assumed it was a puzzle created for it.
sfjailbird•36m ago
You can see Claude's responses in the repo. The first one is:

Ah, Anchorhead! One of the most celebrated pieces of interactive fiction ever written

imiric•58m ago
> By the time you get to day two, each turn costs tens of thousands of input tokens

This behavior surprised me when I started using LLMs, since it's so counterintuitive.

Why does every interaction require submitting and processing all data in the current session up until that point? Surely there must be a way for the context to be stored server-side, and referenced and augmented by each subsequent interaction. Could this data be compressed in a way to keep the most important bits, and garbage collect everything else? Could there be different compression techniques depending on the type of conversation? Similar to the domain-specific memories and episodic memory mentioned in the article. Could "snapshots" be supported, so that the user can explore branching paths in the session history? Some of this is possible by manually managing context, but it's too cumbersome.

Why are all these relatively simple engineering problems still unsolved?

iamjackg•55m ago
It's not unsolved, at least not the first part of your question. In fact it is a feature offered by all main LLM providers!

- https://platform.openai.com/docs/guides/prompt-caching

- https://platform.claude.com/docs/en/build-with-claude/prompt...

- https://ai.google.dev/gemini-api/docs/caching

imiric•47m ago
Ah, that's good to know, thanks.

But then why is there compounding token usage in the article's trivial solution? Is it just a matter of using the cache correctly?

StevenWaterman•39m ago
Cached tokens are cheaper (90% discount ish) but not free
moyix•18m ago
Also, unlike OpenAI, Anthropic's prompt caching is explicit (you set up to 4 cache "breakpoints"), meaning if you don't implement caching then you don't benefit from it.
sfjailbird•43m ago
Cool! I would like to see the game sessions.

Edit: they are there in the repo: https://github.com/eudoxia0/claude-plays-anchorhead/tree/mas...

tiahura•41m ago
Claude code, nethack, and tmux are fun to experiment with.
brimtown•32m ago
I’m currently letting Claude build and play its own Dwarf Fortress clone, as an installable plugin in Claude Code

https://github.com/brimtown/claude-fortress

twohearted•29m ago
This is a great idea and great work.

Context is intuitively important, but people rarely put themselves in the LLM's shoes.

What would be eye-opening would be to create an LLM test system that periodically sends a turn to a human instead of the model. Would you do better than the LLM? What tools would you call at that moment, given only that context and no other knowledge? The way many of these systems are constructed, I'd wager it would be difficult for a human.

The agent can't decide what is safe to delete from memory because it's a sort of bystander at that moment. Someone else made the list it received, and someone else will get the list it writes. The logic that went into why the notes exist is lost. LLMs are living the Christopher Nolan film Memento.

lukev•20m ago
This is a great framework to experiment with memory architectures.

Everything the author says about memory management tracks with my intuition of how CC works, including my perception that it isn't very good at explicitly managing its own memory.

My next step in trying to get it to work well on a bigger game would be to try to build a more "intuitive" memory tool, where the textual description of a room or an item would automatically RAG previous interactions with that entity into context.

That also is closer to how human memory works -- we're instantly reminded of things via a glimpse, a sound, a smell... we don't need to (analogously) write in or search our notebook for basic info we already know about the world.

daxfohl•7m ago
I tried something similar, but distilled to "solve this maze" as a text adventure, and while it usually solved it eventually, it almost always backtracked through fully-explored dead ends multiple times before finally getting to the end. I tried raw chat and various approaches of getting it to externalize the state (i.e. prompting it to emit the known state of the maze after each move, but not telling it exactly what to emit or how to format it), and even allowing it to emit code to execute after each turn, but it invariably would get lost at some point.

Of course it had no problem writing an optimal algorithm to solve mazes when prompted. I thought the disparity was interesting.

Note the mazes had the start and end positions inside the maze itself, so they weren't trivially solvable by the "follow wall to the left" algorithm.

This was last summer so maybe newer models would do better. I also stopped due to cost.

sfjailbird•1m ago
Having read through the entire game session, Claude plays the game admirably! For example, it finds a random tin of oily fish somewhere, and later tries (unsuccessfully) to use it to oil a rusty lock. Later it successfully solves a puzzle inside the house by thoroughly examining random furniture and picking up subtle clues about what to do, based on it.

It did so well that I can't not suspect that it used some hints or walkthroughs, but then again it did a bunch of clueless stuff too, like any player new to the game.

For one thing, this would be a great testing tool for the author of such a game. And more generally, the world of software testing is probably about to take some big leaps forward.