frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Dari-docs – Optimize your docs using parallel coding agents

https://github.com/mupt-ai/dari-docs
21•byhong03•22h ago
It’s well known at this point that documentation needs to be optimized for AI agents - we’re all pointing our Claude Code / Codex / Pi agents at documentation, and expecting the models to figure out how to implement a product.

This, however, changes the entire optimization problem when writing documentation. Good documentation now becomes more objective - you are solving the very concrete problem: can a dumb harness running the dumbest model implement this reliably?

Humans can typically compensate for inconsistent terminology or scattered context across pages, but for agents, this often will waste time (or even just completely confuse the agent).

We’ve been building a small project around this called dari-docs: users can upload their documentation via website or CLI and run agents across different providers to see where they falter. You can upload your documentation, feed a list of tasks, and ask agents with varying intelligence / cost levels to complete those tasks in parallel. When a run is complete, you get back a list feedback markdown files from each agent run and can apply changes based on agent feedback.

Managed service: https://optimize.dari.dev/, repo link: https://github.com/mupt-ai/dari-docs

The agents actually try to use the product end-to-end. They search through the docs, follow instructions, run commands, try examples, and attempt to debug failures. Importantly, this is not a static LLM review of the documentation. The agents are actually attempting the integration.

You can also enable live verification with test credentials so the agents can actually verify workflows against real APIs:

  dari-docs check . --live-verify --secret-env DARI_TEST_API_KEY --task "Create a checkout session"
If you’re building a CLI, API, MCP server, or SDK and actively maintaining docs for humans or agents, we’d love to work with you and test this on real workflows!

Comments

Aleesha_hacker•18h ago
Cool approach actually letting agents test the docs makes debugging way more practical than just reading them
slipheen•17h ago
I read the GitHub repo, but still don't quite understand-

What exactly is the advantage of doing this vs just running a prompt in my existing coding agent?

I don't understand why this is a harness/project vs just for example, a skill?

I'm confident there's a good reason, I just don't understand.

avyvar•17h ago
Totally fair question. If you only want one agent to sanity-check one doc change, a skill/prompt is probably enough.

We actually aren’t rebuilding a harness here, it’s Pi with several LLM options to select from. The reason this is a project is that the useful workflow is more like a docs test suite: run realistic user tasks across multiple models, isolate each run in a greenfield sandbox, keep the transcripts/results, and make failures reproducible in CI.

You could ask an existing coding agent to spawn subagents for every task/model pair, but once that matrix grows, running hundreds of subagents on your computer gets messy. It’s also the wrong isolation boundary: for docs testing, you usually want the agent to start from a clean environment with access only to the docs/product surface you’re testing, not your whole working tree or local setup.

anish_m•16h ago
Nice! I want to use this for my product at ngram.com. Btw, I also created a sample teaser video: https://www.ngram.com/watch/dari-explainer-video-brief-d7991.... Feel free to use it on your social media
darthproton•14h ago
nice, but uploading is quite sensitive to many though
avyvar•13h ago
how so? we have two options - one with a public link url (i.e something like a Mintlify site with llms.txt, and the LLM searches directly). We only expect folks to upload publicly available docs for now, but would love to hear any potential other solutions for more private docs!
hoansdz•3h ago
I think one feature that would make dari-docs significantly more practical for real-world pipelines is a robust, built-in bidirectional converter between Markdown and HTML

What makes a good variable naming convention

https://benharrap.com/post/2025-03-03-variable-naming-convention/
1•leephillips•2m ago•0 comments

Alexander Grothendieck Revolutionized 20th-Century Mathematics

https://www.quantamagazine.org/how-alexander-grothendieck-revolutionized-20th-century-mathematics...
1•Brajeshwar•4m ago•0 comments

Technical Interviews Reject the Wrong Engineers

https://fagnerbrack.com/technical-interviews-reject-the-wrong-engineers-a8e78ca04b2e
1•birdculture•5m ago•0 comments

Show HN: Let agents run any analysis with Mixpanel data, no UI required

https://docs.mixpanel.com/docs/mixpanel-headless
4•ttchen2•6m ago•0 comments

The Unbearable Blandness Of The 2020's [video]

https://www.youtube.com/watch?v=tzvXoss7A3E
1•mindcrime•6m ago•0 comments

French NATO commander: Europe has no alternative to Palantir's warfare tech

https://www.politico.eu/article/nato-commander-europe-no-palantir-alternative/
1•robertkoss•6m ago•0 comments

Leroy's elusive little people: A review on lilliputian hallucinations (2021)

https://www.sciencedirect.com/science/article/pii/S0149763421001068
1•billfor•6m ago•0 comments

What 1,281 agent runs reveal about coding agent failure in large codebases

https://tessl.io/blog/coding-agent-failure-patterns-large-codebases/
1•jdorfman•7m ago•0 comments

Active beam headlights are finally coming to America

https://arstechnica.com/cars/2026/05/these-clever-active-beam-headlights-are-finally-coming-to-am...
1•LorenDB•8m ago•0 comments

How OLTs may have exposed ISP networks

https://blog.quarkslab.com/how-olts-may-have-exposed-entire-isp-networks.html
2•speckx•9m ago•0 comments

Show HN: A demo video of Effected Keyboard 2

https://www.youtube.com/shorts/6aExjM8A9pE
1•vitalipom•11m ago•0 comments

Navox Network – Browser-only CRM built on weak-ties research

https://www.navox.tech/network
1•nahrin•12m ago•0 comments

Build your own green threads library in C

https://github.com/nihiL7331/thrd-ndl
2•nihiL7331•12m ago•0 comments

Show HN: I made the first free ad blocker for podcasts

https://drea.fm/
1•hamza_q_•13m ago•0 comments

PULSELoCo: 17x less trainer-to-trainer bandwidth in distributed RL post-training

https://arxiv.org/abs/2602.03839
1•synapz_org•13m ago•0 comments

Collabora and Flipper: Opening Up the RK3576

https://www.collabora.com/news-and-blog/news-and-events/collabora-flipper-opening-up-the-rk3576.html
1•mfilion•13m ago•0 comments

AI Gateway Production Index

https://vercel.com/blog/ai-gateway-production-index
1•gmays•14m ago•0 comments

TSA Gold+ program for privatizing airport security screening

https://www.tsa.gov/goldplus
3•victorio•14m ago•1 comments

I spent 50 hours drawing a line graph

https://www.dougmacdowell.com/50-hours-to-draw-some-lines.html
1•dougdude3339•15m ago•1 comments

Microsoft warns of new Defender zero-days exploited in attacks

https://www.bleepingcomputer.com/news/security/microsoft-warns-of-new-defender-zero-days-exploite...
1•Brajeshwar•19m ago•0 comments

Show HN: opub, donated compute for open-source

https://opub.dev/blog/introducing-opub
1•goodroot•21m ago•0 comments

A Booming Shadow Market of Sketchy A.I. Investments

https://www.newyorker.com/culture/infinite-scroll/a-booming-shadow-market-of-sketchy-ai-investments
2•mmayberry•21m ago•0 comments

Deepfakes Tore a High School Apart

https://www.404media.co/radnor-high-school-pennsylvania-ai-deepfakes-child-sexual-abuse-material/
3•cdrnsf•23m ago•0 comments

Apparently former Facebook staffers are in high-ranking positions at Mozilla now

https://goblin.band/notes/ak9wrlzwgqsvbj9y
2•speckx•24m ago•0 comments

MCP-safeguard: first automated security scanner for MCP servers

https://github.com/SyedAnas01/mcp-safeguard
1•Anas1371•25m ago•0 comments

I built a tool to stop AI coding agents from leaking my secrets

https://github.com/getveil/veil
1•bcharest_dev•25m ago•0 comments

Realtime pixels-in-actions-out neural agent for Flappy Anna 3D

https://www.youtube.com/watch?v=gssY-ZQx06g
1•guiguan•26m ago•0 comments

I built a small tool to reduce input token costs by 20-30% for agentic tasks

https://bigindexer.com/blog/reduce-input-token-costs-agentic-tasks
1•afxuh•26m ago•0 comments

Morphogenic Systems Lead

http://mailto:architect@creaturealgorithm.com
2•mariuslukas•27m ago•0 comments

Show HN: Six legendary marketers walk into a workflow

https://github.com/conductor-oss/awesome-skills/tree/main/gtm-mavericks
1•opiniateddev•27m ago•0 comments