frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Polygraph: A Meta-Harness for Maximum Agent Autonomy

https://nx.dev/blog/announcing-polygraph
38•cheald•1h ago

Comments

kstenerud•47m ago
> Space. An agent is stuck in one repo. It can't see how a change fits the wider system, and it can only write to one repo at a time.

Huh? How can it not see multiple repos? They're just directories.

> Time. An agent has no episodic memory. Every session starts blank, so a human carries the memory context.

The memory comes from the research, design, specification, and planning documents.

> We no longer think about where the work happens or what repos are involved. We describe the work in a prompt and let Polygraph figure out what's relevant.

Err... that doesn't sound safe.

> Every decision is on record. So even though our team is distributed, I can ask my agent why a coworker chose one approach over another.

AFTER the fact...

victorsavkin•30m ago
Thank you for your comment.

> Huh? How can it not see multiple repos? They're just directories.

Relevant repos need to be discovered. They have to be set up correctly (some worktrees, most clones), dependencies installed, and the relationships between them made clear, etc.. In a sense, once you've done all that, they do become directories. Turning them into directories, and doing it ergonomically, is the tricky part.

Consider scale: Take the repos you own plus the OSS repos they depend on. It's many thousands. A real team has more. That's a lot to deal with.

> The memory comes from the research, design, specification, and planning documents.

This isn't episodic memory. You'll have high-level documents you can reference, and they're useful for overviews. But only a tiny fraction of decisions ever make it into them. Most decisions get made in the act of implementing something. And the "docs rot, code doesn't" rule applies here too.

> Err... that doesn't sound safe. It just picks the repos (you have access to) and helps you plan the work. Has no efect on safety.

> AFTER the fact...

Yes :) But say I'm reviewing their PR. I can ask my agent why the PR ended up the way it did, and every decision they made along the way is in the session. It's "after the fact", but useful. It doesn't mean every conversation with a human being can be replaced by this :) but a lot of conversations can be.

kstenerud•13m ago
> Relevant repos need to be discovered. They have to be set up correctly (some worktrees, most clones), dependencies installed, and the relationships between them made clear, etc..

This is what Sourcegraph and Github Code Search and Zoekt do, isn't it?

> You'll have high-level documents you can reference, and they're useful for overviews. But only a tiny fraction of decisions ever make it into them. Most decisions get made in the act of implementing something.

Er... In the age of AI the decisions need to be made (and documented) extensively before it starts writing any code. Otherwise you get slop.

> But say I'm reviewing their PR. I can ask my agent why the PR ended up the way it did, and every decision they made along the way is in the session.

That doesn't make the decision set good. And if the only documentation produced came from the implementation phase, then it's going to be self-defending regardless of how good the design actually is (and your review agent, lacking the context, won't know the difference). Multiply that with the many parallel PRs in parallel repos you get with some features, and that's just asking for trouble.

jenniferli23•47m ago
How are you thinking about permissions/revocation if Polygraph’s “memory” becomes a shared layer across repos?
victorsavkin•21m ago
Great question.

Polygraph knows what repos every dev (and therefore their agents) has access to. If a session touches repos you don't have access to, you'll only see the parts you're allowed to: PRs to a repo you can see, for instance. You won't see the logs or high-level descriptions, which can contain info you shouldn't see.

If a dev loses access to a repo, they also lose access to the sessions associated with it.

In other words, although Polygraph has one repo graph and one session graph under the hood, every dev has access to only a subset of each.

jeffbcross•24m ago
lukekarrys, how long would it take you to build this?

Show HN: ToolPalace – 25 free browser tools that work offline, no sign-up

https://toolpalace.online
1•sohilpathan•29s ago•0 comments

Longer daylight linked to 4.4 minutes less sleep per extra hour of light

https://www.nature.com/articles/s44323-026-00092-2
1•kyriakosel•1m ago•1 comments

Show HN: Bsize yet Another Byte Size Crate

https://github.com/fast/bsize
1•tison•1m ago•0 comments

Show HN: Open protocol for agents to book vacation rentals direct from the host

https://vacationrentalprotocol.com/
1•Freezone•2m ago•0 comments

Made a Rust DB run spatial queries on gaming GPU RT cores, beating an H100

https://sedona.apache.org/latest/blog/2026/06/26/sedonadb-04-gpu-accelerated-spatial-joins/
1•dr-jia-yu•3m ago•0 comments

Show HN: Closing the public-key authenticity gap in our E2EE social network

https://mosslet.com/blog/articles/19
1•mosspigletdev•5m ago•0 comments

U.S. government will decide who gets to use latest upgrade to ChatGPT

https://www.washingtonpost.com/technology/2026/06/26/openai-says-us-government-will-vet-users-its...
2•alain94040•5m ago•0 comments

Murmur: Shared communication bus for your coding agents

https://github.com/instavm/murmur
1•handfuloflight•7m ago•0 comments

Poll: What's your primary AI coding agent/orchestrator Claude/Codex/Cursor, etc.?

1•jacobgold•9m ago•1 comments

Malware Insights: macOS Phexia Campaign

https://cookie.engineer/weblog/articles/malware-insights-macos-phexia-campaign.html
1•speckx•15m ago•0 comments

Show HN: AgentBrush – Your coding agent's missing tool: image generation

https://agentbrush.dev/
2•Yan4300•15m ago•0 comments

Ventora Expands Its AI Business Builder to Help Solo Founders

1•emmanol•16m ago•0 comments

Wildfires Are Getting Worse. Patrick Moore Says Otherwise

https://www.notesfromtheroad.com/roam/patrick-moore-wildfires-climate-change.html
1•speckx•22m ago•0 comments

Neural Image Compression with Gemini 3

https://bertolami.com/blog/cascade-neural-image-compression
4•wholenote•22m ago•0 comments

Show HN: I built a hardware quantum RNG and wired it into a Magic 8-Ball

https://dnhkng.github.io/posts/building-the-beam-universe-splitter/
4•dnhkng•22m ago•0 comments

How to Corrupt an SQLite Database File

https://www.sqlite.org/howtocorrupt.html
2•tosh•23m ago•0 comments

Chinese LineShine Supercomputer Debuts at No. 1 in TOP500

https://www.top500.org/news/lineshine-debuts-no-1-top500-enters-new-global-exascale-era/
1•adrian_b•24m ago•1 comments

Pre-Modern Armies for Worldbuilders, Part III: Paying for It

https://acoup.blog/2026/06/26/collections-pre-modern-armies-for-worldbuilders-part-iii-paying-for...
4•jfoucher•24m ago•0 comments

How to Tell We–and AI–Are Choosing the Good

https://deepsub.substack.com/p/how-to-tell-weand-aiare-choosing
1•dsubburam•25m ago•0 comments

Commander's Intent Statement

https://www.votito.com/methods/commanders-intent-statement/
1•adzicg•27m ago•0 comments

Show HN: JSON Viewer Extension for Chrome

https://chromewebstore.google.com/detail/json-viewer-formatter-for/adnplidpgphcdjnebagdiknfjeedhfjp
1•chernikovalexey•27m ago•0 comments

I'm building a Space Cadet Pinball Machine! [video]

https://www.youtube.com/watch?v=lHQ8c8i42VE
3•skibz•27m ago•0 comments

Bigger context windows are the wrong abstraction for coding agents

https://sigilix.ai/blog/bigger-context-windows-are-the-wrong-abstraction-for-coding-agents
2•damartj•28m ago•0 comments

Show HN: No one will beat my hiscore

https://puremint.co.uk/games/git-racer/
1•wonkyfruit•29m ago•0 comments

Webradio server – broadcasts audio source to clients

https://github.com/tau-org/tau-tower
2•modinfo•31m ago•0 comments

Hasp – Local Secret Broker

https://gethasp.com/
1•casca•32m ago•0 comments

Scaling Laws, Carefully

https://lilianweng.github.io/posts/2026-06-24-scaling-laws/
2•tehnub•32m ago•0 comments

Anthropic has hired an economist with interesting views on human survival

https://www.ft.com/content/bb04671c-4377-4231-96ef-0f8e57ed5d1b
3•Jimmc414•33m ago•3 comments

Open Air Chicago

https://www.chicago.gov/city/en/depts/cdph/supp_info/Environment/open-air-chicago.html
1•toomuchtodo•35m ago•1 comments

Human-bench: an eval for "human shaped" agents

https://www.human-bench.com/leaderboard
1•jam0xb797fd•38m ago•1 comments