frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Making Tool Calling 75% More Efficient via Code

https://github.com/zeke-john/codecall
1•zekejohn•1d ago

Comments

zekejohn•1d ago
Traditional AI agents have EVERY tool loaded into context from the stat, call tools one at a time, each requiring a full inference round trip, for example: "delete all completed tasks," that means: call findTasks, wait, call deleteTask for task 1, wait, call for task 2... each call resends the entire conversation history, so tokens compound fast and there is a lot of wasted tokens and inference.

Codecall is an open source approach that lets agents write and execute TypeScript code in a secured Deno sandbox to orchestrate multiple tools programmatically, like calling an API (which is really all a tool is!)

So instead of 20+ inference passes and 90k+ tokens, the agent can just write and execute:

const tasks = await tools.todoist.findTasks({ completed: true }); for (const task of tasks) { await tools.todoist.deleteTask({ id: task.id }); }

2 inference passes. The code runs in a Deno sandbox, executes all operations programmatically, and returns a result. In our demo, for one example, this reduced tokens by 74.7% and tool calls by 92.3% while being much faster as well.

How it works (high level) ->

1. There are only 2 tools (readFile, executeCode) + a file tree. The agent reads SDK files on demand, so a 30 tool setup is effectively the to a 5 tool setup (only the file tree gets bigger)

2. Multiple tool calls happen in one execution, not N inference calls for N operations... because the agent can write code to execute and orchestrate multiple tools (like API) this significantly reduces the number of passes + tokens per request

3. Models have a 10-50% failure rate searching through large datasets in context. Code like users.filter(u => u.role === "admin") is deterministic and avoids those failure, so not only is it more efficient & cheaper. its also often much more accurate when doing operations with lots of data!

We also generate TypeScript SDK files from MCP tool definitions, so the agent sees clean types and function signatures. It also learns from errors, so when a tool call fails, it updates the SDK file with learned constraints so future agents avoid the same mistake.

Codecall works with any MCP server (stdio/http). Would love feedback from anyone interested in or building more complex agents :)

l1am0•1d ago
This is basically what you learn in the huggingface smolagents course (months ago)...

They call it CodeAct

https://huggingface.co/learn/agents-course/en/unit2/smolagen...

zekejohn•1d ago
Interesting! First time im seeing this course, thanks for the link. From a high level it’s definitely in the same code first agents family then. After reading about smolagents for a bit i think the main things Codecall adds are TypeScript + generated SDKs, progressive tool discovery (readFile + executeCode instead of exposing every tool directly), and the single script sandboxed execution first flow w/ learned constraints, rather than the more of the "multi‑step ReAct loop" that smolagents prioritizes (like in the link below), which is a bit more like traditional tool calling w/ code ->

https://huggingface.co/blog/smolagents

Life Happens at 1x Speed

https://terriblesoftware.org/2026/01/08/life-happens-at-1x-speed/
1•matheusml•2m ago•0 comments

Interview: David Haz, Creator of React Bits

https://motion.dev/magazine/interview-david-haz-creator-of-react-bits
1•SirHound•3m ago•0 comments

The Jeff Dean Facts

https://github.com/LRitzdorf/TheJeffDeanFacts
1•ravenical•5m ago•0 comments

Staging is a wasteful lie: the case for the mono-environment

https://www.tomwphillips.co.uk/2026/01/staging-is-a-wasteful-lie-the-case-for-the-mono-environment/
2•tomwphillips•6m ago•0 comments

How do language models solve Bayesian network inference?

https://ferjorosa.github.io/blog/2026/01/02/llms-probailistic-reasoning.html
1•sebg•6m ago•0 comments

Ghent University rector Petra De Sutter uses AI-fabricated quotes in speech

https://www.vrt.be/vrtnws/en/2026/01/08/ghent-university-rector-petra-de-sutter-uses-fabricated-q...
2•lode•8m ago•1 comments

Nano Banana Pro

https://nano-bananapro.org/
2•letsmkvideo•10m ago•1 comments

Show HN: Do you think this is the best diffchecker and would you switch?

https://diffchecker.dev/
1•subhash_k•11m ago•2 comments

Show HN: Spark – Zero-config IoT deployment tool written in Rust

https://github.com/Velooroo/Spark
1•Kazilsky•14m ago•1 comments

How Much Does Education Improve Intelligence? A Meta-Analysis [pdf]

https://labs.la.utexas.edu/tucker-drob/files/2019/08/Ritchie-Tucker-Drob-2018-Psych-Science-How-M...
1•sebg•14m ago•1 comments

ByteDance local agent is something I might feel safe running

https://github.com/bytedance/UI-TARS-desktop
1•mark_l_watson•14m ago•1 comments

How cybercriminals plot to rob a target within a week

https://www.reuters.com/graphics/SOUTHEASTASIA-SCAMS/MANUALS/klpyjlqelvg/
1•barishnamazov•16m ago•0 comments

Architecture Governance: Capturing What and How

https://tomasjurasek.substack.com/p/architecture-governance-capturing
1•silent715•16m ago•0 comments

Verification-Driven Development (VDD) via Iterative Adversarial Refinement

https://gist.github.com/dollspace-gay/45c95ebfb5a3a3bae84d8bebd662cc25
1•sebg•18m ago•0 comments

Shared State Context for AI Agents [Ask/Show][Looking for Beta]

1•aperi•19m ago•0 comments

The Zcash core dev team has resigned

https://twitter.com/tedpillows/status/2009206637962383809
3•simonebrunozzi•21m ago•2 comments

Testmon – Speed up your test suite in CI

https://testmon.net
1•drcongo•22m ago•0 comments

Execline: A Small Scripting Language

http://skarnet.org/software/execline/
1•fanf2•25m ago•0 comments

I Drilled Holes in My $200 Waterproof Panniers

https://cycletouring.substack.com/p/i-drilled-holes-in-my-200-waterproof
2•djrivard•25m ago•0 comments

Wigner Cat Phases: Transition to Quantum Chaos

https://arxiv.org/abs/2512.22169
1•northlondoner•26m ago•1 comments

Show HN: Analytics for SaaS Founders Connecting Stripe, Google Analytics and GSC

https://busel.ai/
1•stasman•26m ago•0 comments

Is Claude Ret***ed? Website where you vote on Claude's daily stupidity

https://www.isclauderetarded.today/
1•skrabe•27m ago•2 comments

Why Deepfake Technology Forces Courts to Rethink the Reliability of Evidence

https://www.technologylaw.ai/p/deepfake-technology-evidentiary-reliability-courts
1•pcaharrier•27m ago•0 comments

Beyond Training: Enabling Self-Evolution of Agents with Mobimem

https://arxiv.org/abs/2512.15784
1•PaulHoule•31m ago•0 comments

Trend Hacking 2025: The Niche Protocol for Founders

https://blog.vect.pro/trend-hacking-guide
1•WoWSaaS•32m ago•1 comments

One Regulation E, Two Different Regimes

https://www.bitsaboutmoney.com/archive/regulation-e/
1•gmcharlt•32m ago•0 comments

The revolution will be televised with QR codes

https://twitter.com/omid9/status/2009049147786104841
2•tim333•32m ago•0 comments

Dutch Fintech Bunq Reapplies to Become a Bank in the US

https://www.bloomberg.com/news/articles/2026-01-07/dutch-fintech-bunq-reapplies-to-become-a-bank-...
1•teekert•33m ago•1 comments

PCSX2 2.6.0

https://pcsx2.net/blog/2025/pcsx2-2.6/
1•wubin•36m ago•0 comments

Hypocritespy(HTTPS://github.com/Ronny12345-art/Hypocritespy)

1•Ronny12345-art•37m ago•0 comments