
The Oats Protocol – Open Agent Tools for Local Coding Agents

3•dsdevjay•53m ago

Recently I was using functiongemma and watched it load and run local source code as a tool call without any training or tuning. A couple of days later I got Qwen35 in Open-WebUI to use the "native" tool-calling. With Open-WebUI I could observe the changes as it ran inside the docker containers, crawling over stuff on its own, but there was no obvious way to observe what functiongemma was doing.

I'm a control freak, so the differences between these two tool-calling approaches got me thinking:

How will open source enable standardized tool-calling for agents so we do not have to build and support custom tool-calling harnesses on our own?

I wanted to share an architecture design pattern we're using to cut down on custom tool-calling code across many components/subsystems. We open sourced our local OATs coding agent on GitHub https://github.com/district-solutions/open-agent-tools-coder. I run coder with a large local model that delegates tool calling to smaller local models. The coder repo includes vLLM deployments in the stack dir https://github.com/district-solutions/open-agent-tools-coder/tree/main/stack for running Qwen36 27B and 35B with tool-calling delegation to functiongemma.

On startup, coder looks for a large, preprocessed JSON index of supported tools. We open sourced the OATs Tool-Calling Prompt Index for >141K Tools on GitHub https://github.com/district-solutions/open-agent-tools#openagent-tools-oats to help everyone use the same patterns (hopefully!). I think of OATs as a "thinking cap". Once that cap is on, the smaller models only process a reduced set of tools. This tool-call guidance lets a large local model delegate "a list of instructions" to one or more smaller models, which can be running on remote devices (I have functiongemma running on laptops with old gpus too, e.g. mobile nvidia 3060). This allows laptops to run local commands with a set of local models: one for the db, one for the api, one for the frontend, one for coding...
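
To make the "thinking cap" idea concrete, here is a minimal Python sketch of narrowing a prompt index down to a few candidate tools before handing them to a smaller model. The index layout (prompt phrase mapped to a `module`/`function` pair), the sample entries, and the names `SAMPLE_INDEX`, `tokenize` and `select_tools` are all assumptions for illustration, not the actual OATs schema or coder's selection logic; check the repo for the real format.

```python
import json
import re

# Hypothetical index layout: prompt phrase -> where the local code lives.
# The real OATs Prompt Index may be structured differently.
SAMPLE_INDEX = json.loads("""
{
  "read an open-webui note": {"module": "tools.webui", "function": "read_note"},
  "read an open-webui knowledge collection": {"module": "tools.webui", "function": "read_collection"},
  "list database tables": {"module": "tools.db", "function": "list_tables"},
  "restart the api container": {"module": "tools.docker", "function": "restart_api"}
}
""")


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def select_tools(index: dict, request: str, limit: int = 3) -> list[str]:
    """Return up to `limit` prompt phrases ranked by word overlap with the request."""
    wanted = tokenize(request)
    scored = []
    for phrase in index:
        overlap = len(wanted & tokenize(phrase))
        if overlap:
            scored.append((overlap, phrase))
    # Highest overlap first; ties broken alphabetically for stable output.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [phrase for _, phrase in scored[:limit]]


# Only the reduced list would be shown to the smaller model.
reduced = select_tools(SAMPLE_INDEX, "please read my open-webui note", limit=2)
print(reduced)
```

Word overlap is a deliberately naive stand-in for whatever matching the real index uses; the point is only that the small model sees 2 candidate tools instead of the full index.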

Here's the demo video with coder calling functiongemma:

https://asciinema.org/a/3ZhMCyUKjr2dmIH1

What else can we reuse?

- We published the OATs Prompt Index JSON to GitHub and the dataset to HuggingFace https://huggingface.co/datasets/open-agent-tools/open-tools as parquet files, which should enable local training and usage with faster tooling than JSON parsers.

Fundamental Trust Issues - Who watches the agent?

Once coder was running 200+ local commands overnight from 1 prompt, we started seeing negative side effects around these use cases:

Change Management

- What did coder change?
- What did it run?
- Why did it choose this tool or that among a sequence of 200+ calls?

Code Reviews

- How do we keep up with changes at this speed?

Things got sketchy fast

- About 6-7 weeks ago, coder dropped the tables in a non-prod db. I can't prove this, but I'm 99% confident.

Shit. How do I stop this? How many other people are going to get wrecked by this?

I hope OATs can help you prevent unexpected tool calls doing unexpected things on your env.

- Monitoring - Coder tracks all tool calls for auditing and review. I run many mattermost instances where agents post tool-call audit logs to specific channels for review by humans/agents. This lets me track stuck agents and watch what they are doing, and I can archive all chats into parquet files for training later.
- Human-curated approved tools - I open sourced the huge prompt index to make a point: with >141,000 tools, which ones are approved by your team and by security? OATs coder uses 1 JSON dictionary Prompt Index file to map prompts to local source code. Whatever you change in that JSON Prompt Index file, coder will support. If you want to link "superhappy" as a prompt to call your already-working local code for "reading an open-webui note" or "reading an open-webui knowledge collection", just edit the file and save.
- Here's a 3 part blog series on how coder works: https://districtsolutions.ai/blog
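
The monitoring and approved-tools points above can be combined in a few lines. This is a rough Python sketch, not coder's actual implementation: `APPROVED_INDEX`, `LOCAL_TOOLS`, `AUDIT_LOG`, `call_tool` and the `read_note` stub are hypothetical names, and the in-memory list stands in for wherever the audit records really go (a file, a mattermost channel, parquet later).

```python
import json
import time


def read_note(note_id: str) -> str:
    """Stub for already-working local code (hypothetical)."""
    return f"note:{note_id}"


# Hypothetical registry of local functions the agent is able to run.
LOCAL_TOOLS = {"read_note": read_note}

# Hypothetical human-curated index: prompt phrase -> approved tool name.
APPROVED_INDEX = {"superhappy": "read_note"}

# Stand-in for a log file or chat channel; one JSON record per attempt.
AUDIT_LOG: list[str] = []


def call_tool(prompt: str, reason: str, **kwargs):
    """Run the tool mapped to `prompt` only if approved; log every attempt."""
    tool_name = APPROVED_INDEX.get(prompt)
    record = {
        "ts": time.time(),
        "prompt": prompt,
        "tool": tool_name,
        "args": kwargs,
        "reason": reason,  # answers "why did it choose this tool?"
        "allowed": tool_name in LOCAL_TOOLS,
    }
    # Log before running, so denied and crashing calls are recorded too.
    AUDIT_LOG.append(json.dumps(record, sort_keys=True))
    if not record["allowed"]:
        raise PermissionError(f"no approved tool for prompt: {prompt!r}")
    return LOCAL_TOOLS[tool_name](**kwargs)


result = call_tool("superhappy", reason="user asked for note 42", note_id="42")
print(result)
```

Anything not listed in the index (say, a prompt that would drop tables) raises `PermissionError` and still leaves an audit record, which is the property I wish I'd had 6-7 weeks ago.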

Thanks for your time!
