frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Goal.md, a goal-specification file for autonomous coding agents

https://github.com/jmilinovich/goal-md
2•jmilinovich•2h ago

Comments

jmilinovich•2h ago
I had 30 broken Playwright tests and no way to tell which ones actually mattered. The problem wasn’t “fix the tests” — it was that there’s no coverage tool for test infrastructure trustworthiness. I had to build the ruler before I could measure anything.

So I wrote a file that defined a composite metric (four weighted components → one score), an improvement loop, and constraints. Pointed Claude at it. Went to bed. Woke up to 12 commits, 47 → 83.

The file became GOAL.md. The insight that surprised me: most software doesn’t have a natural scalar metric like val_bpb. You have to construct it. Documentation quality, API trustworthiness, test infrastructure confidence — these things have no pytest –cov equivalent. But once you build the ruler, the autoresearch loop works on them too.

The part I’m most uncertain about: the “dual score” pattern. When the agent is building its own measuring tools, it can game the metric by weakening the instrument. So the docs-quality example has two scores — one for the docs, one for the linter itself. The agent has to improve the telescope before it can use it. I think this is load-bearing but I’d love to hear if others have found different solutions to the same problem.

Easiest way to try it: paste this into Claude Code, Cursor, or any coding agent and point it at one of your repos:

Read github.com/jmilinovich/goal-md — read the template and examples. Then write me a GOAL.md for this repo and start working on it.

Happy to hear what breaks. The scoring script is bash + jq so it’s not exactly production-grade, and the examples are biased toward the kinds of projects I work on. More examples from different domains would make the pattern sharper.

Show HN: GDSL – 800 line kernel: Lisp subset in 500, C subset in 1300

https://firthemouse.github.io/
46•FirTheMouse•5h ago•11 comments

Show HN: Signet – Autonomous wildfire tracking from satellite and weather data

https://signet.watch
97•mapldx•8h ago•27 comments

Show HN: What if your synthesizer was powered by APL (or a dumb K clone)?

https://octetta.github.io/k-synth/
63•octetta•8h ago•27 comments

Show HN: Claude's 2x usage promotion (March 2026) in your timezone

https://edsonroteia.github.io/claude2x/
3•earaujo•53m ago•0 comments

Show HN: HN Skins – Available Skins: Cafe, Courier, London, Midnight, Terminal

https://github.com/susam/hnskins
3•susam•1h ago•0 comments

Show HN: Goal.md, a goal-specification file for autonomous coding agents

https://github.com/jmilinovich/goal-md
2•jmilinovich•2h ago•1 comments

Show HN: Detach – Mobile UI for managing AI coding agents from your phone

https://github.com/salvozappa/detach
2•salvozappa•3h ago•2 comments

Show HN: Han – A Korean programming language written in Rust

https://github.com/xodn348/han
203•xodn348•23h ago•112 comments

Show HN: Ichinichi – One note per day, E2E encrypted, local-first

121•katspaugh•1d ago•45 comments

Show HN: Sway, a board game benchmark for quantum computing

https://shukla.io/blog/2026-03/sway.html
4•BinRoo•4h ago•0 comments

Show HN: AgentMailr – dedicated email inboxes for AI agents

https://www.agentmailr.com/
7•kumardeepanshu•9h ago•3 comments

Show HN: GitAgent – An open standard that turns any Git repo into an AI agent

https://www.gitagent.sh/
128•sivasurend•1d ago•33 comments

Show HN: GrobPaint: Somewhere Between MS Paint and Paint.net

https://github.com/groverburger/grobpaint
53•__grob•22h ago•17 comments

Show HN: RSS tool to remix feeds, build from webpages, and skip podcast reruns

https://sponder.app
4•kristjan•6h ago•0 comments

Show HN: Dialtone watcher – what is my laptop doing and am I normal

5•fcpguru•7h ago•1 comments

Show HN: Channel Surfer – Watch YouTube like it’s cable TV

https://channelsurfer.tv
595•kilroy123•4d ago•174 comments

Show HN: Context Gateway – Compress agent context before it hits the LLM

https://github.com/Compresr-ai/Context-Gateway
92•ivzak•2d ago•57 comments

Show HN: Code Royale – Play and learn poker with Claude Code (skill)

https://github.com/BohdanPetryshyn/code-royale
2•BohdanPetryshyn•8h ago•0 comments

Show HN: Data-anim – Animate HTML with just data attributes

https://github.com/ryo-manba/data-anim
16•ryo-manba•1d ago•6 comments

Show HN: Ink – Deploy full-stack apps from AI agents via MCP or Skills

https://ml.ink/
31•august-•4d ago•5 comments

Show HN: Axe – A 12MB binary that replaces your AI framework

https://github.com/jrswab/axe
223•jrswab•3d ago•122 comments

Show HN: Voice-tracked teleprompter using on-device ASR in the browser

https://github.com/larsbaunwall/promptme-ai
2•lbaune•11h ago•1 comments

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025)

https://88mph.fm/
113•matteocantiello•5d ago•51 comments

Show HN: Learn Arabic with spaced repetition and comprehensible input

https://abjadpro.com
64•adangit•1d ago•22 comments

Show HN: Lengpal – simple video chat for language exchange

https://www.producthunt.com/products/lengpal
4•ayoubdrissi•4h ago•0 comments

Show HN: BurnShot v2- Zero-Knowledge image sharing (Challenge to decrypt this)

2•axaysharma•5h ago•0 comments

Show HN: OneCLI – Vault for AI Agents in Rust

https://github.com/onecli/onecli
160•guyb3•3d ago•50 comments

Show HN: I built Wool, a lightweight distributed Python runtime

https://github.com/wool-labs/wool
13•bzurak•1d ago•4 comments

Show HN: Rudel – Claude Code Session Analytics

https://github.com/obsessiondb/rudel
143•keks0r•3d ago•85 comments

Show HN: s@: decentralized social networking over static sites

http://satproto.org/
411•remywang•3d ago•219 comments