news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Goal.md, a goal-specification file for autonomous coding agents

https://github.com/jmilinovich/goal-md

2•jmilinovich•2h ago

Comments

jmilinovich•1h ago

I had 30 broken Playwright tests and no way to tell which ones actually mattered. The problem wasn’t “fix the tests” — it was that there’s no coverage tool for test infrastructure trustworthiness. I had to build the ruler before I could measure anything.

So I wrote a file that defined a composite metric (four weighted components → one score), an improvement loop, and constraints. Pointed Claude at it. Went to bed. Woke up to 12 commits, 47 → 83.

The file became GOAL.md. The insight that surprised me: most software doesn’t have a natural scalar metric like val_bpb. You have to construct it. Documentation quality, API trustworthiness, test infrastructure confidence — these things have no pytest –cov equivalent. But once you build the ruler, the autoresearch loop works on them too.

The part I’m most uncertain about: the “dual score” pattern. When the agent is building its own measuring tools, it can game the metric by weakening the instrument. So the docs-quality example has two scores — one for the docs, one for the linter itself. The agent has to improve the telescope before it can use it. I think this is load-bearing but I’d love to hear if others have found different solutions to the same problem.

Easiest way to try it: paste this into Claude Code, Cursor, or any coding agent and point it at one of your repos:

Read github.com/jmilinovich/goal-md — read the template and examples. Then write me a GOAL.md for this repo and start working on it.

Happy to hear what breaks. The scoring script is bash + jq so it’s not exactly production-grade, and the examples are biased toward the kinds of projects I work on. More examples from different domains would make the pattern sharper.

Cuneicode – a language where 0.1 and 0.2 == 0.3

https://cuneicode.dev

1•synchan•4m ago•1 comments

Finctory – A node-based visual backtesting platform

https://finctory.netlify.app/

1•np_Poluri•6m ago•1 comments

Data scientist uses AI and ChatGPT to create cancer vaccine for his dying dog

https://www.theaustralian.com.au/business%2Ftechnology%2Ftech-boss-uses-ai-and-chatgpt-to-create-...

3•philangist•7m ago•0 comments

Drone-Mounted Camera for Real-Time MA-RPPG in Smart Mirror Systems

https://www.mdpi.com/2076-3417/16/5/2307

1•PaulHoule•8m ago•0 comments

Show HN: ObservAgent – Observability for Claude Code(cost, tools, subagents)

https://darshannere.github.io/observagent/

2•darshannere•9m ago•0 comments

EIOU, an open source P2P payment protocol

https://eiou.org/

2•Aeium•12m ago•0 comments

Show HN: HypergraphZ – A Hypergraph Implementation in Zig

https://github.com/yamafaktory/hypergraphz

1•yamafaktory•12m ago•0 comments

Seedance 2.0 delayed due to copyright disputes

https://www.scmp.com/tech/big-tech/article/3346654/bytedance-reportedly-suspends-launch-seedance-...

1•stuartmemo•12m ago•0 comments

Faith Claw – Security middleware for autonomous AI agents (OpenClaw)

https://github.com/KirpalS99/Faith-Claw

1•kirpals99•12m ago•0 comments

The AI Boom Has Exploded the San Francisco Housing Market

https://www.wsj.com/economy/housing/san-francisco-housing-market-ai-8c4e3f59

2•randycupertino•17m ago•1 comments

Ask HN: When bored during vibe coding edits I ---?

1•p0d•17m ago•0 comments

Steven Spielberg Thinks Aliens Are Among Us

https://www.slashfilm.com/2122928/steven-spielberg-disclosure-day-aliens-among-us-sxsw/

1•bookofjoe•18m ago•0 comments

Domestication Syndrome

https://en.wikipedia.org/wiki/Domestication_syndrome

1•andersource•18m ago•0 comments

Kuberna Labs – Open-source SDK for autonomous cross-chain AI agents

https://github.com/kawacukennedy/kuberna-labs

1•n3on250•19m ago•1 comments

Multi-agent coordination via timer-based Discord polling (Claude Code)

https://github.com/AetherWave-Studio/autonomous-claude-code

1•Drew-Aetherwave•19m ago•1 comments

Ask HN: AI Browser Automation

2•compootr•21m ago•0 comments

Show HN: Claude's 2x usage promotion (March 2026) in your timezone

https://edsonroteia.github.io/claude2x/

2•earaujo•21m ago•0 comments

Chinese giants use idled foreign plants to fuel global expansion

https://www.scmp.com/business/china-business/article/3346472/carpool-chinese-giants-use-idled-for...

1•gscott•22m ago•0 comments

I built this in an hour with Claude

https://oscars.prakashvenkat.com/

1•dopatraman•24m ago•0 comments

Ask HN: Why is there a lack of useful use cases for OpenClaw?

1•nazbasho•24m ago•0 comments

Olaf: Bringing an Animated Character to Life in the Physical World [video]

https://www.youtube.com/watch?v=-L8OFMTteOo

1•redman25•26m ago•0 comments

Ask HN: What Should I Make?

1•SpyCoder77•26m ago•3 comments

The ArXiv is separating from Cornell University, and is hiring a CEO

https://mathstodon.xyz/@johncarlosbaez/116223948891539024

4•mellosouls•28m ago•0 comments

Integrity-Weighted Citation Metric

https://quinndupont.github.io/CiteIQ/

1•quinndupont•29m ago•0 comments

Why Claude's new 1M context length is a big deal

https://martinalderson.com/posts/why-claudes-new-1m-context-length-is-a-big-deal/

2•martinald•29m ago•0 comments

University of Houston Physicists Break Superconductivity Temperature Record

https://www.uh.edu/news-events/stories/2026/march/03102026-ambient-pressure-superconductivity-rec...

1•bilsbie•30m ago•0 comments

Securing AI Agents

https://fusionauth.io/articles/ai/securing-ai-agents

1•mooreds•36m ago•0 comments

Show HN: Quell, a local security layer to stop AI IDEs leaking your secrets

https://github.com/Sonofg0tham/Quell

1•Sonofg0tham•37m ago•1 comments

Single message billboard. outbid to takeover

https://billboard.today

1•bekdavid893•41m ago•0 comments

Paul R. Ehrlich, Who Alarmed the World with 'The Population Bomb,' Dies at 93

https://www.nytimes.com/2026/03/15/books/paul-r-ehrlich-dead.html

3•igonvalue•43m ago•1 comments