Show HN: Toolbase – Build reliable AI teammates by example, not instruction

2•David1238•1y ago
Hey HN, we’re David and Ethan, co-founders of Toolbase.

Toolbase is an AI agent and workflow builder that helps you quickly create production-grade AI automations — batteries included.

Try it without registering here: https://gettoolbase.com/

------- The dev cycle in Toolbase -------

1. Define a rough goal.

2. Connect any API or MCP server (we have thousands, or bring your own).

3. Teach the AI what valid input and output examples look like (these later become your unit tests!).

4. Let Toolbase generate the perfect prompt, code, workflow, or agent.

5. Deploy your project as an API, MCP server, or chat interface!

Coding optional. Sharing encouraged.

Note: If you’re familiar with Cursor/Windsurf and the core concepts behind frameworks like Mastra (which Toolbase runs on), you already know how to use Toolbase. You retain the flexibility of coding but avoid boilerplate and plumbing tasks (integration, validation, context mapping, testing, etc.) unless you explicitly choose to do them.
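
To make step 3 of the dev cycle concrete, here is a minimal sketch of how input/output examples could double as unit tests. It is an illustration only, not Toolbase's actual implementation: `Example`, `run_examples`, `extract_intent`, and the sample data are all hypothetical names chosen for this sketch, and a real step would call a model rather than a keyword check.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Example:
    """One golden input/output pair supplied while teaching a step."""
    input: str
    expected: str


def run_examples(step: Callable[[str], str], examples: list[Example]) -> list[Example]:
    """Run every example through `step` and return the ones that fail."""
    return [ex for ex in examples if step(ex.input) != ex.expected]


# Hypothetical stand-in for a generated step; a real one would call a model.
def extract_intent(query: str) -> str:
    if "address" in query.lower():
        return "validate_address"
    return "unknown"


# Hypothetical golden data: the same pairs used for teaching are reused as tests.
EXAMPLES = [
    Example("Check the billing address for Acme", "validate_address"),
    Example("What's the weather today?", "unknown"),
]

failures = run_examples(extract_intent, EXAMPLES)
print(f"{len(EXAMPLES) - len(failures)}/{len(EXAMPLES)} examples pass")
```

Because the examples are plain data, the same list can be rerun whenever the prompt or code behind a step is regenerated, which is what makes them usable as regression tests.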

------- Demo -------

This simple agent validates company billing addresses in our CRM (Pipedrive) by researching them through Tavily search. If there’s an address mismatch, it asks a human to pick the right one via email (watch on 2x speed):

https://www.loom.com/share/540c61b2c5634996b088ebbb16989cf0?...

Output email for the example shown in the demo: https://gettoolbase.com/assets/demo-screenshot.png

Producing code and more deterministic workflows follows a similar process.
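
The decision at the heart of the demo agent (compare the CRM address with the researched one, and escalate to a human on a mismatch) might look roughly like the sketch below. This assumes a simple normalize-and-compare rule; `normalize`, `Decision`, and `check_address` are hypothetical names, and the Pipedrive lookup, Tavily search, and email step are deliberately left out.

```python
import re
from dataclasses import dataclass


def normalize(address: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = re.sub(r"[^\w\s]", " ", address.lower())
    return " ".join(cleaned.split())


@dataclass
class Decision:
    action: str            # "ok" or "ask_human"
    candidates: list[str]  # addresses a human would choose between


def check_address(crm_address: str, researched_address: str) -> Decision:
    """Accept matching addresses; otherwise hand both candidates to a human."""
    if normalize(crm_address) == normalize(researched_address):
        return Decision("ok", [crm_address])
    return Decision("ask_human", [crm_address, researched_address])


match = check_address("1 Main St., Springfield", "1 main st springfield")
mismatch = check_address("1 Main St", "2 Oak Ave")
print(match.action, mismatch.action)
```

A real agent would likely need fuzzier matching (abbreviations, postal codes, country formats); exact comparison after normalization is just the simplest rule that shows the escalation path.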

------- Why another agent/workflow builder? -------

We started building Toolbase out of frustration with existing frameworks, especially for production use, and the lack of IDE support for MLOps artifacts, such as prompts, golden data, workflows, and evaluations. Since much of the code for agentic systems can be dynamically generated and validated using these artifacts, they often become even more important than the code itself.

After speaking with other builders, we also realized that manually coding workflows, experimenting with prompts through trial-and-error, and setting up infrastructure/integrations all took far more time than they should. Tools like Cursor and Windsurf help, but extracting meaning from AI-generated code is slow. At the other end of the spectrum, chatbots whipping up arcane code potions in tinted chat windows demo really well, but the result isn’t maintainable at all (sorry, vibe-coding). So we went with something in the middle: an AI-assisted visual builder with full code fallback.

------- What do you think? -------

We’re excited for any feedback, thoughts, or questions from the HN community.

Let us know what you think in the comments!

- David & Ethan

Comments

David1238•1y ago
If you want to jump straight to trying it, you can preview it here:

https://gettoolbase.com

Try clicking the play button at the bottom right of the workflow pane that appears next to the big blue text.

Clicking each node after you submit your query shows you its results for the run.

Full-screening the experience (icon on the top right) and expanding a node (same icon within a step of the workflow) let you train that prompt in the same way we did in the demo.

The first node (“User Intent Extractor”) is purposefully vague so you can train it yourself!
