Show HN: I built a tool to assist AI agents to know when a PR is good to go

https://dsifry.github.io/goodtogo/
11•dsifry•9h ago
I've been using Claude Code heavily, and kept hitting the same issue: the agent would push changes, respond to reviews, wait for CI... but never really know when it was done.

It would poll CI in loops. Miss actionable comments buried among 15 CodeRabbit suggestions. Or declare victory while threads were still unresolved.

The core problem: no deterministic way for an agent to know a PR is ready to merge.

So I built gtg (Good To Go). One command, one answer:

  $ gtg 123
  OK PR #123: READY
  CI: success (5/5 passed)
  Threads: 3/3 resolved

It aggregates CI status, classifies review comments (actionable vs. noise), and tracks thread resolution. Returns JSON for agents or human-readable text.

The comment classification is the interesting part — it understands CodeRabbit severity markers, Greptile patterns, Claude's blocking/approval language. "Critical: SQL injection" gets flagged; "Nice refactor!" doesn't.
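
To make the idea concrete, here is a minimal sketch of rule-based comment classification. It is not gtg's actual code: the pattern lists, the `classify_comment` name, and the `ambiguous` fallback are all assumptions made for illustration, and the real tool's handling of CodeRabbit, Greptile, and Claude comments is likely more involved.

```python
import re

# Hypothetical patterns, for illustration only; these are NOT gtg's real rules.
ACTIONABLE_PATTERNS = [
    re.compile(r"^\s*(critical|blocking|security)\b", re.IGNORECASE),
    re.compile(r"\b(sql injection|must fix|vulnerab\w*)\b", re.IGNORECASE),
]
NOISE_PATTERNS = [
    re.compile(r"^\s*(nice|lgtm|looks good|great)\b", re.IGNORECASE),
]


def classify_comment(body: str) -> str:
    """Return 'actionable', 'noise', or 'ambiguous' for one review comment.

    Actionable patterns are checked first so that a comment containing both
    praise and a blocking issue is never dismissed as noise.
    """
    if any(p.search(body) for p in ACTIONABLE_PATTERNS):
        return "actionable"
    if any(p.search(body) for p in NOISE_PATTERNS):
        return "noise"
    # Anything unmatched is left for a human or a later pass to decide.
    return "ambiguous"


if __name__ == "__main__":
    for text in ["Critical: SQL injection", "Nice refactor!"]:
        print(f"{text!r} -> {classify_comment(text)}")
```

Because the rules are plain pattern matches, the same comment always gets the same label, which is the property that matters for an agent deciding whether it is done.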

MIT licensed, pure Python. I use this daily in a larger agent orchestration system — would love feedback from others building similar workflows.

Comments

mcolley•8h ago
Super interesting, any particular reason you didn't try to solve these prior to pushing with hooks and subagents?
dsifry•7h ago
I did! The issue, however, is having a clear, deterministic method of defining when the code review is 'done'. The hooks can fire off subagents, but those are non-deterministic and often miss vital code review comments - especially ones that are left as inline comments, or are marked as 'Out of PR Scope' or 'Out of range of the file' - which are often the MOST important comments to address!

So gtg builds all of that in and deterministically determines whether there are any actionable comments. You can then block the agent from moving forward until every actionable comment has been thoroughly reviewed, acted upon, or acknowledged, at which point gtg changes state and allows the PR to be merged.
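
A rough sketch of that kind of gate, under stated assumptions: the `PRStatus` fields, `blockers`, and `is_ready` below are hypothetical names chosen for illustration and do not reflect gtg's real JSON schema or internals. The point is only that readiness becomes a pure function of three signals (CI results, thread resolution, open actionable comments).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PRStatus:
    # Hypothetical fields; gtg's actual output format may differ.
    ci_passed: int
    ci_total: int
    threads_resolved: int
    threads_total: int
    actionable_comments: int


def blockers(status: PRStatus) -> List[str]:
    """List every reason the PR is not ready; an empty list means ready."""
    reasons = []
    if status.ci_passed < status.ci_total:
        reasons.append(f"CI: {status.ci_passed}/{status.ci_total} passed")
    if status.threads_resolved < status.threads_total:
        reasons.append(
            f"Threads: {status.threads_resolved}/{status.threads_total} resolved"
        )
    if status.actionable_comments > 0:
        reasons.append(f"Actionable comments: {status.actionable_comments} open")
    return reasons


def is_ready(status: PRStatus) -> bool:
    """Same inputs always give the same answer, so the agent can be gated on it."""
    return not blockers(status)


if __name__ == "__main__":
    print(is_ready(PRStatus(5, 5, 3, 3, 0)))
    print(blockers(PRStatus(4, 5, 3, 3, 1)))
```

An orchestrator could keep the agent working while `blockers` returns anything and hand off to a human (or a merge step) only once it returns an empty list.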

blutoot•1h ago
I thought hooks are always fired if you use them as a PreToolUse event. Wouldn’t that work for the GitHub action tools from the GitHub MCP?
rootnod3•1h ago
Sorry, so the tool is now even circumventing human review? Is that the goal?

So the agent can now merge shit by itself?

Just let the damn thing push into prod by itself at this point.

ljm•1h ago
Someone’s gonna think about wiring all this up to Linear or Jira, and there’ll be a whole new set of vulnerabilities created from malicious bug reports.
literalAardvark•1h ago
In some workflows it's helpful for the full loop to be automated so that the agent can test if what's done works.

And you can do a more exhaustive test later, after the agents are done running amok to merge various things.

danenania•54m ago
I don’t think “ready to merge” necessarily means the agent actually merges. Just that it’s gone as far as it can automatically. It’s up to you whether to review at that point or merge, depending on the project and the stakes.

If there are CI failures or obvious issues that another AI can identify, why not have the agent keep going until those are resolved? This tool just makes that process more token efficient. Seems pretty useful to me.

baxtr•32m ago
I’m not saying this is, but if I were a malicious state actor, that’s exactly the kind of thing I’d like to see in widespread use.
joshuanapoli•1h ago
This looks nice! I like the idea of providing more deterministic feedback and more or less forcing the assistant to follow a particular development process. Do you have evidence that gtg improves the overall workflow? I think that there is a trade-off between risk of getting stuck (iteration without reaching gtg-green) versus reaching perfect 100% completion.

Show HN: What if your menu bar was a keyboard-controlled command center?

https://extrabar.app/
26•pugdogdev•1h ago•12 comments

Show HN: Streaming gigabyte medical images from S3 without downloading them

https://github.com/PABannier/WSIStreamer
108•el_pa_b•10h ago•39 comments

Show HN: HORenderer3: A C++ software renderer implementing OpenGL 3.3 pipeline

https://github.com/Hobanghann/HORenderer3
2•zghdls•1h ago•0 comments

Show HN: Microwave – Native iOS app for videos on ATproto

https://testflight.apple.com/join/cVxV1W3g
32•sinned•4d ago•10 comments

Show HN: Video-to-Grid – Analyze videos with one Vision API call

https://github.com/unhaya/vam-seek-electron-demo
4•haasiy•3h ago•0 comments

Show HN: Fun things to do with your VM/370 machine

https://rbanffy.github.io/fun-with-old-mainframes.github.io/fun-with-vm370.html
14•rbanffy•6d ago•4 comments

Show HN: Tusk Drift – Turn production traffic into API tests

https://github.com/Use-Tusk/tusk-drift-cli
29•jy-tan•2d ago•2 comments

Show HN: 1Code – Open-source Cursor-like UI for Claude Code

https://github.com/21st-dev/1code
65•Bunas•1d ago•38 comments

Show HN: go-stats-calculator, CLI for computing stats:mean,median,variance,etc.

https://github.com/jftuga/go-stats-calculator
2•jftuga•4h ago•0 comments

Show HN: Reddit GDPR Export Viewer – Built After Ban, Unban, Reban

https://github.com/guilamu/reddit-gdpr-export-viewer
3•guilamu•1h ago•2 comments

Show HN: BentoPDF is a privacy first PDF Toolkit

https://github.com/alam00000/bentopdf
2•__alam__•5h ago•2 comments

Show HN: B-IR – An LLM-optimized programming language

https://github.com/ImJasonH/ImJasonH/blob/main/articles/llm-programming-language.md
60•ImJasonH•5d ago•39 comments

Show HN: mdto.page – Turn Markdown into a shareable webpage instantly

https://mdto.page
53•hjinco•1d ago•31 comments

Show HN: Cyber+ – a security-focused programming language

https://github.com/TanmayCzax/CyberPlus-1.0
2•CzaxTanmay•6h ago•7 comments

Show HN: pgwire-replication - pure rust client for Postgres CDC

https://github.com/vnvo/pgwire-replication
44•sacs0ni•6d ago•8 comments

Show HN: TinyCity – A tiny city SIM for MicroPython (Thumby micro console)

https://github.com/chrisdiana/TinyCity
138•inflam52•2d ago•25 comments

Show HN: FileMason – Automate file organization on macOS with custom rules

https://apps.apple.com/us/app/filemason/id6757748498?mt=12
2•edurevilla•8h ago•0 comments

Show HN: Hc: an agentless, multi-tenant shell history sink

https://github.com/alessandrocarminati/hc
40•acarminati•1d ago•2 comments

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR

https://www.tavus.io/post/sparrow-1-human-level-conversational-timing-in-real-time-voice
121•code_brian•3d ago•48 comments

Show HN: I made a TIDAL client that runs in the terminal

https://github.com/results-may-vary-org/ttydal
3•a2nb•8h ago•0 comments

Show HN: Webctl – Browser automation for agents based on CLI instead of MCP

https://github.com/cosinusalpha/webctl
133•cosinusalpha•3d ago•38 comments

Show HN: Gambit, an open-source agent harness for building reliable AI agents

https://github.com/bolt-foundry/gambit
89•randall•1d ago•19 comments

Show HN: CleanCloud – Cloud cleanup that can't delete anything

https://getcleancloud.com
9•sureshcsdp•5d ago•4 comments

Show HN: Tabstack – Browser infrastructure for AI agents (by Mozilla)

126•MrTravisB•3d ago•23 comments

Show HN: OpenWork – An open-source alternative to Claude Cowork

https://github.com/different-ai/openwork
222•ben_talent•3d ago•54 comments

Show HN: BGP Scout – BGP Network Browser

https://bgpscout.io/
26•hivedc•1d ago•11 comments

Show HN: Reversing YouTube’s “Most Replayed” Graph

https://priyavr.at/blog/reversing-most-replayed/
83•prvt•1d ago•22 comments

Show HN: On the edge of Apple Silicon memory speeds

https://github.com/timoheimonen/macOS-memory-benchmark
5•user_timo•11h ago•0 comments

Show HN: A smart camera that detects eye movements during REM sleep

https://github.com/lucidcode/Halovision-INSPEC
3•MichaelCoder•11h ago•0 comments