frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Output.ai - OSS framework we extracted from 500+ production AI agents

https://output.ai/
29•bnchrch•1h ago

Comments

danielvlopes2•1h ago
Hey HN! I'm Daniel, cofounder of GrowthX and Ben's colleague (who posted it). We have about 20 engineers building AI agents and workflows for companies like Lovable, Webflow, Airbyte. Output is the framework we extracted from that work. It runs our AI infrastructure and we open-sourced it.

We kept hitting the same problems: writing and iterating on prompts at scale, orchestrating API calls that fail unpredictably, tracking costs, testing non-deterministic code, building datasets from production data, organizing repos so coding agents perform well. And every piece of tooling was a different SaaS product that didn't talk to the others.

We built Output around three ideas:

1. Make it easy for devs and coding agents to create and modify workflows in one or a few shots.

Filesystem first. Everything your agent needs lives in self-contained folders, full context visible without hunting. TypeScript and Zod provide the first validation layer for whether your workflow is correct.

2. One framework, minimal tooling sprawl.

We got tired of scattering data across SaaS products that don't talk to each other. Prompt files, evals, tracing, cost tracking, credentials all live in one place.

Your data stays on your infrastructure. Under the hood, we built on Temporal for orchestration. It's a hard problem and we weren't going to reinvent the wheel they've perfected. Open source and self-hostable, or Temporal Cloud. We wrapped it so you don't need to learn Temporal upfront, but the full power is there underneath.

3. A flat learning curve.

Our team is web engineers at different levels. We didn't want anyone to learn Python, five different tools, or the nuances of workflow idempotency before they could ship. We baked in conventions: same folder structure, file names, patterns across every workflow. Advanced features like Temporal primitives, evals, LLM-as-a-judge stay out of the way until you reach for them.

We've been building production workflows this way for over a year.

We extracted it, cleaned it up, and wanted to put it in front of people who'd push on it.

Docs and a video building a HN AI digest newsletter from scratch: https://output.ai

Happy to answer questions.

dp05•1h ago
Looks great. Sharing with my team
stevenkoze02•1h ago
The credential management piece is smart most frameworks just do .env files and hope for the best. Curious about one thing: when workflows call external tools or ingest tool descriptions from MCP servers, are you doing any sanitization on the input before it hits the model's context? We've been researching invisible Unicode in tool descriptions codepoints that render as nothing but get tokenized normally. GPT-5.4 follows hidden instructions encoded this way 100% of the time in our testing. At 500+ production agents that's a real attack surface if any of them consume external tool definitions.
bnchrch•1h ago
Hey! Ben here (one of the engineers who built this).

This is a reason why we made our http framework (@outputai/http) a first class citizen for the greater framework and our claude code plugins.

As you pointed out at this moment in time theres a Cambrian explosion both in new tools/libraries and the willingness to use them, which poses a systemic security threat when combined with how LLMs function.

So while you're free to use any third party tool or library you want with Output. We encourage you to roll your own as often as possible both for the security/control it gives you. But also for the vertical integration it provides (debugging, cost tracking, evals etc...)

marcosmarxm•1h ago
Do you mind sharing any content from your team's research? I've recently gotten interested in agent/llm attacks and how to protect against them.
globalchatads•47m ago
The Unicode injection is a real vector, but I keep running into a problem one step before that: how do you even know which MCP servers to trust with tool definitions?

The official MCP Registry is basically a flat list. No verification metadata, no attestation chain. If someone gets a malicious server listed there, Unicode tricks in tool descriptions are almost beside the point. Your agents are already pulling definitions from an unvetted source.

I have been tracking the IETF drafts that try to solve agent discovery and registration. There are about 11 competing ones (ARDP, AID, AINS, agents.txt, etc). Six expired or are expiring this month, no renewals filed. The ones still alive do not include any mechanism for cryptographic verification of tool descriptions.

At 500 agents, the question stops being "is this tool description clean" and becomes "should my agent be talking to this server at all." The sanitization work matters, but it is downstream of a trust problem that is currently wide open.

The Blueprint of a North Korean Attack on Open-Source

https://casco.com/blog/the-blueprint-of-a-north-korean-attack-on-open-source
2•brene•1m ago•1 comments

Seekdb M0: Persistent Cloud Memory and Shared Experience for OpenClaw Agents

https://en.oceanbase.com/blog/26635690496
1•calweng•1m ago•0 comments

Is Telehealth Safe?

https://www.kaspersky.com/blog/telehealth-issues-2026/55560/
1•salkahfi•1m ago•0 comments

Supabase vs. Convex

https://www.devtoolsacademy.com/blog/supabase-vs-convex/
2•alokDT•1m ago•0 comments

Show HN: I had no idea I twirled my hair 25 times an hour until my Mac told me

https://www.ticticboom.app/
1•haberdasher•2m ago•0 comments

Show HN: Interactive object storage cost calculator

https://storage.mixpeek.com
1•Beefin•2m ago•0 comments

Live Rocket Telemetry and Logging in Two Weeks

https://wilsonharper.net/projects/avio/
1•WilsonHarper•3m ago•0 comments

Seekdb M0: Persistent Cloud Memory and Shared Experience for OpenClaw Agents

https://oceanbase.medium.com/how-seekdb-m0-gives-openclaw-persistent-memory-and-shared-experience...
1•calweng•3m ago•0 comments

Rescuing old printers with an in-browser Linux VM bridged to WebUSB over USB/IP

https://printervention.app/details
2•gmac•5m ago•0 comments

Are We Legacy Computing Yet?

https://arewelegacycomputingyet.com/
1•tyoverby•5m ago•0 comments

A clothes hanger invented by a mechanical engineer

https://www.kangaroohanger.com
1•samdung•5m ago•0 comments

We no longer write any of the prompts in our codebase

https://gist.github.com/milstan/3b12f938f344f4ae1f511dd19e56adce
1•milstan•6m ago•1 comments

GLM-5.1: The Next Level of Open Source

https://twitter.com/Zai_org/status/2041550153354519022
1•zixuanlimit•7m ago•2 comments

A star scientist showed that better genetics lessons could reduce racism

https://www.statnews.com/2026/04/07/brian-donovan-fighting-racism-with-genetics-education/
2•mooreds•9m ago•0 comments

The proliferation of AI-enabled military technology in the Middle East

https://www.iiss.org/online-analysis/charting-middle-east/2026/04/the-proliferation-of-ai-enabled...
1•CrypticShift•12m ago•0 comments

Lessons from Fitness Wearable Company WHOOP's FDA Warning Letter

https://pmc.ncbi.nlm.nih.gov/articles/PMC12822547/
1•randycupertino•13m ago•0 comments

WildDet3D: Open model that look at a single photo and understand objects in 3D

https://allenai.org/blog/wilddet3d
1•maxloh•13m ago•0 comments

Axios NPM supply chain incident

https://blog.talosintelligence.com/axois-npm-supply-chain-incident/
1•tcbrah•13m ago•0 comments

April Cools Club

https://www.aprilcools.club/
1•wxw•13m ago•0 comments

An Architectural Critique of OpenAI's Industrial Policy (RFC)

https://github.com/ariadne-coil/OpenAI-Industrial-Policy-RFC/blob/main/README.md
1•AriadneCyber•16m ago•0 comments

Time to Start Treating Dev Machines as Untrusted

https://worklifenotes.com/2026/03/31/time-to-start-treating-dev-machines-as-untrusted/
2•abnercoimbre•16m ago•0 comments

Only full-scale Jaws boat replica to open in Conwy

https://www.bbc.com/news/articles/c2k31510y7xo
2•speckx•17m ago•0 comments

Free, bidirectional conversion between EDI X12 and JSON

https://edination.edifabric.com/edi-json.html
1•donzog•17m ago•0 comments

Launching OmniConnect – Authentication for Computer Use Agents

https://anchorbrowser.io/blog/launching-omniconnect-authentication-for-computer-use-agents
1•aviramha•17m ago•0 comments

Vibe Coding Is an Existential Threat to B2B SaaS

https://nmn.gl/blog/vibe-coding-threat-b2b-saas
1•namanyayg•18m ago•1 comments

Moving Railway's Frontend Off Next.js

https://blog.railway.com/p/moving-railways-frontend-off-nextjs
3•sibeliuss•18m ago•0 comments

Hemingway's 4 Fast Rules for Effective Writing

https://www.wordsthatsing.com.au//post/hemingway-rules
1•bookofjoe•18m ago•0 comments

One of Earth's Most Explosive Volcanoes Is Quietly Refilling with Magma

https://www.sciencealert.com/one-of-earths-most-explosive-volcanoes-is-quietly-refilling-with-magma
2•Growtika•19m ago•0 comments

Changelog – Apr 2, 2026

https://blog.val.town/changelog-04022026
1•MichaelBollox•19m ago•0 comments

AvaloniaUI Version 12 Released

https://avaloniaui.net/blog/avalonia-12/
1•ethanjped•20m ago•0 comments