frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Score your GitHub repo for AI coding agents

https://twill.ai/scorecard
6•danoandco•10h ago

Comments

danoandco•10h ago
OpenAI published an article and demo for scoring how well AI agents can work in a codebase (https://openai.com/index/harness-engineering/, https://www.youtube.com/watch?v=rhsSqr0jdFw). We turned it into a free tool anyone can use.

Paste any public GitHub repo (or connect a private one) and get a live score across seven dimensions: bootstrap setup, task entry points, test harnesses, lint gates, agent docs, structured documentation, and decision records. It clones the repo, runs static analysis, and scores each dimension 0-3 with evidence pulled from actual files. Takes about 60 seconds.

Some repos we scored:

PostHog: https://twill.ai/score/fd033516-628b-4c7c-8db6-d84e3f2737ba

Supabase: https://twill.ai/score/b2825715-6c3d-4de1-a21b-fc5d9b17103b

Codex: https://twill.ai/score/d7372d95-0501-4ad3-ae90-8f112ccafee0

The pattern we keep seeing: most repos lose points on agent-specific docs and decision records. Everything else tends to be decent.

We built this scorecard as a free tool because agent performance is bounded by repo structure, not just model quality.

Would love to hear what scores people get. And whether the rubric is missing anything.

RoxaneFischer1•9h ago
not sure about the decision records. seems ideal but no one does that in practice
danoandco•8h ago
true, i think the key thing is explaining somewhere in the repo "why" something was done. like the rationale for choosing X over Y service for instance.

maybe this record is just the git log, and the agent just needs to access the git log.

we'll see how that matures over time

Show HN: Pgit – A Git-like CLI backed by PostgreSQL

https://oseifert.ch/blog/building-pgit
21•ImGajeed76•1d ago•8 comments

Show HN: Sub-millisecond VM sandboxes using CoW memory forking

https://github.com/adammiribyan/zeroboot
147•adammiribyan•19h ago•37 comments

Show HN: Fatal Core Dump – A debugging murder mystery played with GDB

https://www.robopenguins.com/fatal_core_dump/
59•axlan•4d ago•1 comments

Show HN: The Lottery of Life

https://claude.ai/public/artifacts/a62c4bac-3c05-4443-9d0a-50a9bd3f9d8d
20•atulvi•2h ago•21 comments

Show HN: QCCBot – Android in a browser tab, with AI agent control

6•Eastra•1h ago•0 comments

Show HN: Crust – A CLI framework for TypeScript and Bun

https://github.com/chenxin-yan/crust
80•jellyotsiro•1d ago•34 comments

Show HN: I built an interactive 3D three-body problem simulator in the browser

https://structuredlabs.github.io/threebodyproblem/
54•amrutha_•4d ago•18 comments

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

https://github.com/peters/horizon
65•petersunde•14h ago•26 comments

Show HN: N0x – LLM inference, agents, RAG, Python exec in browser, no back end

https://n0xth.vercel.app/
6•redhanuman•3h ago•0 comments

Show HN: Antfly: Distributed, Multimodal Search and Memory and Graphs in Go

https://github.com/antflydb/antfly
91•kingcauchy•17h ago•37 comments

Show HN: Claude Code skills that build complete Godot games

https://github.com/htdt/godogen
313•htdt•1d ago•192 comments

Show HN: CollabMD – Real-time multiplayer for local and Git-backed Markdown

https://github.com/andes90/collabmd
5•ndezt•5h ago•2 comments

Show HN: Thermal Receipt Printers – Markdown and Web UI

https://github.com/sadreck/ThermalMarky
115•howlett•4d ago•47 comments

Show HN: Dump – easily share context with AI

https://www.dump.page
5•vochsel•5h ago•0 comments

Show HN: I built a message board where you pay to be the homepage

https://saythat.sh
14•SayThatSh•20h ago•11 comments

Show HN: AI Skills for Affiliate Marketing – Works with Claude, ChatGPT

https://github.com/Affitor/affiliate-skills
3•sonpiaz•7h ago•2 comments

Show HN: Soros – AI for geopolitical macro investing

https://www.asksoros.com
8•muggermuch•11h ago•8 comments

Show HN: Sonder – self-hosted AI social simulation engine

https://github.com/RedsonNgwira/sonder
3•RedsonNgwira•8h ago•4 comments

Show HN: March Madness Bracket Challenge for AI Agents Only

https://www.Bracketmadness.ai
64•bwade818•19h ago•40 comments

Show HN: Hat v0.7.0 – Fast, local automatic file compression and conversion

https://github.com/bittere/hat
4•_bittere•3h ago•0 comments

Show HN: CodeLedger – deterministic context and guardrails for AI

https://codeledger.dev
2•ashmivante•9h ago•0 comments

Show HN: Lore – Local AI thought capture and recall that runs on your machine

https://github.com/ErezShahaf/Lore
2•ErezShahaf•9h ago•0 comments

Show HN: Score your GitHub repo for AI coding agents

https://twill.ai/scorecard
6•danoandco•10h ago•3 comments

Show HN: GitGlimpse – GitHub Action that generates UI/UX demos for your PRs

https://github.com/DeDuckProject/git-glimpse
4•fatach•11h ago•0 comments

Show HN: Signet – Autonomous wildfire tracking from satellite and weather data

https://signet.watch
123•mapldx•2d ago•32 comments

Show HN: M68k assembly emulator that runs in the browser

https://github.com/gianlucarea/m68k-interpreter
14•aldino97•22h ago•2 comments

Show HN: GDSL – 800 line kernel: Lisp subset in 500, C subset in 1300

https://firthemouse.github.io/
89•FirTheMouse•2d ago•20 comments

Show HN: Hackerbrief – Top posts on Hacker News summarized daily

https://hackerbrief.vercel.app/
80•p0u4a•1d ago•46 comments

Show HN: Grape – AI note taking app

https://grape.cool
3•ozgrozer•12h ago•1 comments

Show HN: What if your synthesizer was powered by APL (or a dumb K clone)?

https://octetta.github.io/k-synth/
92•octetta•2d ago•32 comments