frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

New Research Reassesses the Value of Agents.md Files for AI Coding

https://www.infoq.com/news/2026/03/agents-context-file-value-review/
13•noemit•1h ago

Comments

verdverm•1h ago
That research has been so misinterpreted for headlines and clicks...

AGENTS.md are extremely helpful if done well.

lucketone•59m ago
Everybody thinks they do agents.md well
noemit•1h ago
The research mostly points to LLM-generated context lowering performance. Human-generated context improves performance, but any kind of AGENTS.md file increases token use, for what they say is "fake thinking." More research is needed.
d1sxeyes•1h ago
Agree. Also, sometimes I intentionally want the agent to do something differently to how it would naturally solve the problem. For example, there might be a specific design decision that the agent should adhere to. Obviously, this will lead to slower task completion, higher inference costs etc. because I’m asking the agent not to take the path of least resistance.

This kind of benchmark completely misses that nuance.

stingraycharles•55m ago
I’d say that it needs to be maintained and reviewed by a human, but it’s perfectly fine to let an LLM generate it.
stingraycharles•54m ago
What is going on in this thread and why are all comments downvoted so heavily?
nayroclade•50m ago
I suspect AGENTS.md files will prove to be a short-lived relic of an era when we had to treat coding agents like junior devs, who often need explicit instructions and guardrails about testing, architecture, repo structure, etc. But when agents have the equivalent (or better) judgement ability as a senior engineer, they can make their own calls about these aspects, and trying to "program" their behaviour via an AGENTS.md file becomes as unhelpful as one engineer trying to micro-manage another's approach to solving a problem.
dev_l1x_be•35m ago
I never use these files and give the current guardrails of a specific task to each short run for agents. Have task specific “agents.md” works better for me.
CrzyLngPwd•15m ago
I have a legacy codebase of around 300k lines spread across 1.5k files, and have had amazing success with the agents.md file.

It just prevents hallucinations and coerces the AI to use existing files and APIs instead of inventing them. It also has gold-standard tests and APIs as examples.

Before the agents file, it was just chaos of hallucinations and having to correct it all the time with the same things.

lmeyerov•5m ago
I liked they did this work + its sister paper, but disliked how it was positioned, as it set up the community to misinterpreting it from a quick read (or just as likely, a title or abstract scan). So for the next X months, instead of it helping, we have to deal with the brain damage.

The good: It shows on one kind of benchmark, some kinds of agentically-generated don't help. So naively generating these, for one kind of task, doesn't work - useful to know!

The bad: Some people assume this means in general these don't work, or automation here doesn't work.

The truth: These files help measurably and just a bit of engineering enables you to guarantee that for the typical case. As soon as you have an objective function, you can flip it into an eval, and set an AI coder to editing these files until they work. Ex: We recently released https://github.com/graphistry/graphistry-skills for more easily using graphistry via AI coding, and by having our authoring AI loop a bit with our evals, we jumped the scores from 30-50% success rate to 90%+. As we encounter more scenarios (and mine them from our chats etc), it's pretty straight forward to flip them into evals and ask Claude/Codex to loop until those work well too.

Pushing and pulling: three reactivity algorithms

https://jonathan-frere.com/posts/reactivity-algorithms/
1•fanf2•1m ago•0 comments

Reverse engineering a DOS game with no source code using Codex 5.4

https://github.com/ammaarreshi/SkyRoads-Codex
1•smusamashah•2m ago•0 comments

Show HN: OpenClaw – Self-host OpenClaw in one command

1•congzhangzh•8m ago•0 comments

Money and collateral in an AI-first society

https://adlrocha.substack.com/p/adlrocha-money-and-collateral-in
1•adlrocha•11m ago•0 comments

Ask HN: Can I repurpose a Bluetooth voice remote as input device for a PC?

1•albert_e•13m ago•1 comments

Ask HN: How are you handling persistent memory across local Ollama sessions

1•null-phnix•14m ago•0 comments

Show HN: Spadyum – An Open-Source Civilization Backup Protocol

https://github.com/kivancadiguzel-design/Spadyum-Genesis/blob/main/README.md
1•Spadyum_Genesis•14m ago•0 comments

Julia Snail – An Emacs Development Environment for Julia Like Clojure's Cider

https://github.com/gcv/julia-snail
1•TheWiggles•16m ago•0 comments

Notes on Writing WASM

https://notes.brooklynzelenka.com/Blog/Notes-on-Writing-Wasm
3•vinhnx•18m ago•0 comments

Making Firefox's right-click not suck, more, with userChrome.css

https://joshua.hu/firefox-making-right-click-not-suck-even-more-with-userchrome
3•mmsc•20m ago•0 comments

Run prompts on a schedule with Claude Code

https://code.claude.com/docs/en/scheduled-tasks
1•blacktulip•20m ago•0 comments

Show HN: Open-source self-hosted Intercom and CCTV platform

https://github.com/rosteleset/SmartYard-Server
2•sbca68•22m ago•0 comments

Show HN: Self-Evolving Skill – empirical results from a 5-round experiment

https://github.com/191341025/Self-Evolving-Skill
1•tiansenxu•27m ago•0 comments

What Is AI Reading?

https://generativepulse.ai/report/
1•doener•28m ago•0 comments

Rcarmo/piclaw: An all-in one agent environment with a mobile-first web UI

https://github.com/rcarmo/piclaw
1•rcarmo•32m ago•0 comments

Show HN: Termix – One dashboard for all your AI coding agents

https://github.com/rustykuntz/termix
2•rustykuntz•32m ago•1 comments

Show HN: Cardea, SSH bastion with per-key ACLs, TPM keys and session recording

https://github.com/hectorm/cardea
2•hectorm•37m ago•0 comments

Show HN: Appbun – Turn any URL into desktop app with one command Electrobun+bun

https://github.com/bigmacfive/appbun
1•bigmacfive•39m ago•1 comments

Dinner at When?

https://archive.aramcoworld.com/issue/196902/dinner.at.when.htm
1•pinkmuffinere•39m ago•0 comments

I built a cloud-native, quorum-free RSM for the JVM atop S3

https://github.com/io-s2c/s2c
2•mzazaipsc•44m ago•1 comments

Show HN: Importree – Import Dependency Trees for TypeScript Files

https://importree.js.org
1•alexgrozav•45m ago•0 comments

Show HN: SubstanceWiki – Open-source encyclopedia of psychoactive substances

https://www.substancewiki.org/
1•toprak123•46m ago•0 comments

Therac-25

https://en.wikipedia.org/wiki/Therac-25
3•hubraumhugo•50m ago•0 comments

Llm9p: LLM as a Plan 9 file system

https://github.com/NERVsystems/llm9p
7•mleroy•51m ago•3 comments

BlackRock caps withdrawals amid investor rush

https://www.msn.com/en-us/news/insight/blackrock-caps-exits-as-redemptions-surge/gm-GM1184A108
2•csomar•52m ago•0 comments

Show HN: Bookvoice – convert PDF books into audiobooks

https://github.com/Semtexcz/Bookvoice
1•semtex_cz•58m ago•0 comments

German political spies mistook a random Berlin woman and got her fired

https://www.eugyppius.com/p/how-german-political-spies-mistook
3•Jerry2•58m ago•1 comments

I'm Not Consulting an LLM

https://lr0.org/blog/p/gpt/
9•birdculture•1h ago•2 comments

Game Theory #9: the US-Iran War [video]

https://www.youtube.com/watch?v=jIS2eB-rGv0
1•flawn•1h ago•0 comments

Dotfiles for Consistent AI-Assisted Development – Dylan Bochman

https://dylanbochman.com/blog/2026-01-25-dotfiles-for-ai-assisted-development/
1•ankitg12•1h ago•0 comments