frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Goal.md, a goal-specification file for autonomous coding agents

https://github.com/jmilinovich/goal-md
2•jmilinovich•2h ago

Comments

jmilinovich•1h ago
I had 30 broken Playwright tests and no way to tell which ones actually mattered. The problem wasn’t “fix the tests” — it was that there’s no coverage tool for test infrastructure trustworthiness. I had to build the ruler before I could measure anything.

So I wrote a file that defined a composite metric (four weighted components → one score), an improvement loop, and constraints. Pointed Claude at it. Went to bed. Woke up to 12 commits, 47 → 83.

The file became GOAL.md. The insight that surprised me: most software doesn’t have a natural scalar metric like val_bpb. You have to construct it. Documentation quality, API trustworthiness, test infrastructure confidence — these things have no pytest –cov equivalent. But once you build the ruler, the autoresearch loop works on them too.

The part I’m most uncertain about: the “dual score” pattern. When the agent is building its own measuring tools, it can game the metric by weakening the instrument. So the docs-quality example has two scores — one for the docs, one for the linter itself. The agent has to improve the telescope before it can use it. I think this is load-bearing but I’d love to hear if others have found different solutions to the same problem.

Easiest way to try it: paste this into Claude Code, Cursor, or any coding agent and point it at one of your repos:

Read github.com/jmilinovich/goal-md — read the template and examples. Then write me a GOAL.md for this repo and start working on it.

Happy to hear what breaks. The scoring script is bash + jq so it’s not exactly production-grade, and the examples are biased toward the kinds of projects I work on. More examples from different domains would make the pattern sharper.

Cuneicode – a language where 0.1 and 0.2 == 0.3

https://cuneicode.dev
1•synchan•4m ago•1 comments

Finctory – A node-based visual backtesting platform

https://finctory.netlify.app/
1•np_Poluri•6m ago•1 comments

Data scientist uses AI and ChatGPT to create cancer vaccine for his dying dog

https://www.theaustralian.com.au/business%2Ftechnology%2Ftech-boss-uses-ai-and-chatgpt-to-create-...
3•philangist•7m ago•0 comments

Drone-Mounted Camera for Real-Time MA-RPPG in Smart Mirror Systems

https://www.mdpi.com/2076-3417/16/5/2307
1•PaulHoule•8m ago•0 comments

Show HN: ObservAgent – Observability for Claude Code(cost, tools, subagents)

https://darshannere.github.io/observagent/
2•darshannere•9m ago•0 comments

EIOU, an open source P2P payment protocol

https://eiou.org/
2•Aeium•12m ago•0 comments

Show HN: HypergraphZ – A Hypergraph Implementation in Zig

https://github.com/yamafaktory/hypergraphz
1•yamafaktory•12m ago•0 comments

Seedance 2.0 delayed due to copyright disputes

https://www.scmp.com/tech/big-tech/article/3346654/bytedance-reportedly-suspends-launch-seedance-...
1•stuartmemo•12m ago•0 comments

Faith Claw – Security middleware for autonomous AI agents (OpenClaw)

https://github.com/KirpalS99/Faith-Claw
1•kirpals99•12m ago•0 comments

The AI Boom Has Exploded the San Francisco Housing Market

https://www.wsj.com/economy/housing/san-francisco-housing-market-ai-8c4e3f59
2•randycupertino•17m ago•1 comments

Ask HN: When bored during vibe coding edits I ---?

1•p0d•17m ago•0 comments

Steven Spielberg Thinks Aliens Are Among Us

https://www.slashfilm.com/2122928/steven-spielberg-disclosure-day-aliens-among-us-sxsw/
1•bookofjoe•18m ago•0 comments

Domestication Syndrome

https://en.wikipedia.org/wiki/Domestication_syndrome
1•andersource•18m ago•0 comments

Kuberna Labs – Open-source SDK for autonomous cross-chain AI agents

https://github.com/kawacukennedy/kuberna-labs
1•n3on250•19m ago•1 comments

Multi-agent coordination via timer-based Discord polling (Claude Code)

https://github.com/AetherWave-Studio/autonomous-claude-code
1•Drew-Aetherwave•19m ago•1 comments

Ask HN: AI Browser Automation

2•compootr•21m ago•0 comments

Show HN: Claude's 2x usage promotion (March 2026) in your timezone

https://edsonroteia.github.io/claude2x/
2•earaujo•21m ago•0 comments

Chinese giants use idled foreign plants to fuel global expansion

https://www.scmp.com/business/china-business/article/3346472/carpool-chinese-giants-use-idled-for...
1•gscott•22m ago•0 comments

I built this in an hour with Claude

https://oscars.prakashvenkat.com/
1•dopatraman•24m ago•0 comments

Ask HN: Why is there a lack of useful use cases for OpenClaw?

1•nazbasho•24m ago•0 comments

Olaf: Bringing an Animated Character to Life in the Physical World [video]

https://www.youtube.com/watch?v=-L8OFMTteOo
1•redman25•26m ago•0 comments

Ask HN: What Should I Make?

1•SpyCoder77•26m ago•3 comments

The ArXiv is separating from Cornell University, and is hiring a CEO

https://mathstodon.xyz/@johncarlosbaez/116223948891539024
4•mellosouls•28m ago•0 comments

Integrity-Weighted Citation Metric

https://quinndupont.github.io/CiteIQ/
1•quinndupont•29m ago•0 comments

Why Claude's new 1M context length is a big deal

https://martinalderson.com/posts/why-claudes-new-1m-context-length-is-a-big-deal/
2•martinald•29m ago•0 comments

University of Houston Physicists Break Superconductivity Temperature Record

https://www.uh.edu/news-events/stories/2026/march/03102026-ambient-pressure-superconductivity-rec...
1•bilsbie•30m ago•0 comments

Securing AI Agents

https://fusionauth.io/articles/ai/securing-ai-agents
1•mooreds•36m ago•0 comments

Show HN: Quell, a local security layer to stop AI IDEs leaking your secrets

https://github.com/Sonofg0tham/Quell
1•Sonofg0tham•37m ago•1 comments

Single message billboard. outbid to takeover

https://billboard.today
1•bekdavid893•41m ago•0 comments

Paul R. Ehrlich, Who Alarmed the World with 'The Population Bomb,' Dies at 93

https://www.nytimes.com/2026/03/15/books/paul-r-ehrlich-dead.html
3•igonvalue•43m ago•1 comments