frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Toolbase – Build reliable AI teammates by example, not instruction

2•David1238•1y ago
Hey HN, we’re David and Ethan, co-founders of Toolbase.

Toolbase is an AI agent and workflow builder that helps you quickly create production-grade AI automations — batteries included.

Try it without registering here: https://gettoolbase.com/

------- The dev cycle in Toolbase -------

1. Define a rough goal.

2. Connect any API or MCP server (we have thousands, or bring your own).

3. Teach the AI what valid input and output examples look like (these later become your unit tests!).

4. Let Toolbase generate the perfect prompt, code, workflow, or agent.

5. Deploy your project as an API, MCP server, or chat interface!

Coding optional. Sharing encouraged.

Note: If you’re familiar with Cursor/Windsurf and the core concepts behind frameworks like Mastra (which Toolbase runs on), you already know how to use Toolbase. You retain the flexibility of coding but avoid boilerplate and plumbing tasks (integration, validation, context mapping, testing, etc.) unless you explicitly choose to do them.

------- Demo -------

https://www.loom.com/share/540c61b2c5634996b088ebbb16989cf0?...

This simple agent validates company billing addresses in our CRM (Pipedrive) by researching them through Tavily search. If there’s an address mismatch, it asks a human to pick the right one via email (watch on 2x speed):

Output email for the example shown in the demo: https://gettoolbase.com/assets/demo-screenshot.png

Producing code and more deterministic workflows follows a similar process.

------- Why another agent/workflow builder? -------

We started building Toolbase out of frustration with existing frameworks, especially for production use, and the lack of IDE support for MLOps artifacts, such as prompts, golden data, workflows, and evaluations. Since much of the code for agentic systems can be dynamically generated and validated using these artifacts, they often become even more important than the code itself.

After speaking with other builders, we also realized that manually coding workflows, experimenting with prompts through trial-and-error, and setting up infrastructure/integrations all took far more time than they should. Tools like Cursor and Windsurf help, but extracting meaning from AI-generated code is slow. Chatbots whipping up arcane code potions in tinted chat windows, which is the other end of the spectrum, demos really well but isn’t maintainable at all (sorry, vibe-coding). So we went with something in the middle: an AI-assiststed visual builder with full code fallback.

------- What do you think? -------

We’re excited for any feedback, thoughts, or questions from the HN community.

Let us know what you think in the comments!

- David & Ethan

Comments

David1238•1y ago
If you want to jump straight to trying it, you can preview it here:

https://gettoolbase.com

Try clicking the play button at the bottom right of workflow pane that appears next to the big blue text.

Clicking each node after you submit your query shows you its results for the run.

Full-screening the experience (icon on the top right) and expanding a node (same icon within a step of the workflow) lets you train that prompt in the same way we did in the demo.

The first node (“User Intent Extractor”) is purposefully vague so you can train it yourself!

As AI costs rise, there is little evidence of major utility

https://www.gamesindustry.biz/as-ai-costs-rise-theres-little-evidence-of-major-utility-in-game-de...
1•dude250711•1m ago•0 comments

Why carbon capture and storage won't fix our climate crisis

https://projects.propublica.org/why-carbon-capture-cant-solve-climate-change/
1•world2vec•2m ago•0 comments

Pollen (CEO Negus-Fancey, CTO Wright) tried to remove article, and Google helped

https://blog.pragmaticengineer.com/pollen-tried-to-remove-my-article-about-callum-negus-fancey-an...
2•taubek•3m ago•0 comments

A Modern Inmarsat Decoder

https://github.com/SarahRoseLives/InmarScope
1•SarahRoseLives•6m ago•0 comments

A practical guide to defending your agent memory from attacks

https://medium.com/@vektormemory/a-practical-guide-to-defending-your-agent-memory-attacks-33b91c3...
1•vektormemory•7m ago•0 comments

Inside Consultants' Messy Shift from Hourly Billing

https://www.wsj.com/cfo-journal/inside-consultants-messy-shift-from-hourly-billing-7bd9b802
1•thm•9m ago•0 comments

Show HN: App to support reading foreign language books (on paper)

https://lexiglo.app
1•nikhaldi•12m ago•0 comments

Clever chemistry turns antibiotic-resistant bacteria's own defences against them

https://www.chemistryworld.com/news/clever-chemistry-turns-antibiotic-resistant-bacterias-own-def...
1•visha1v•12m ago•0 comments

The series of tubes filled with enormous amounts of mail, beneath our feet

https://buttondown.com/blog/pneumatic-email
1•maguay•13m ago•0 comments

Sustaining a Shared Reality: How Past Technology Waves Have Impacted Strategy

https://whitneyzim.medium.com/sustaining-a-shared-reality-how-past-technology-waves-have-impacted...
1•BerislavLopac•13m ago•0 comments

Is dbase dead? Customers cannot activate nor contact support

https://delphinightmares.substack.com/p/is-dbase-dead
1•deeaceofbase•15m ago•1 comments

Communicating the Value of Publicly Funded Science

https://cacm.acm.org/opinion/communicating-the-value-of-publicly-funded-science/
1•visha1v•15m ago•0 comments

The Hitchhiker's Guide to Agentic AI: From Foundations to Systems

https://arxiv.org/abs/2606.24937
1•tamnd•16m ago•0 comments

New European Search Engine

https://www.qmay.eu
1•Qmay_Dev•17m ago•0 comments

Anthropic CEO: Open-Source AI is getting dangerous

https://xcancel.com/coinbureau/status/2071330294452666695
4•therein•19m ago•2 comments

Berkshire Hathaway – It's essentially a pre-diversified empire

https://en.wikipedia.org/wiki/Berkshire_Hathaway
1•modinfo•23m ago•0 comments

Show HN: Sidequest is a better /btw for Pi

https://github.com/peterp/pi-sidequest
1•pistoriusp•24m ago•0 comments

LLM-free, layout-aware PDF chunker in pure Rust

https://github.com/matthiasnordwig/pdf-struct-chunker
1•MatthiasNordwig•24m ago•0 comments

Ukraine's newest strike weapon, Balloons

https://www.defensenews.com/global/europe/2026/06/25/ukraines-newest-strike-weapon-drifts-into-ru...
1•garyclarke27•25m ago•0 comments

SpecManager – a full agile team for founders, as a Claude Code plugin

https://github.com/joanseg/specmanager
1•joansg•32m ago•0 comments

Understanding Android's Project Treble, Project Mainline, APK Signature Schemes

https://medium.com/@Max_Sir/understanding-androids-project-treble-project-mainline-and-apk-signat...
1•thunderbong•32m ago•0 comments

Why did this journal retract two 1940s papers by Max Planck?

https://arstechnica.com/science/2026/06/why-did-this-journal-retract-two-1940s-papers-by-max-planck/
59•DR_MING•33m ago•0 comments

Life After Oligarchy

https://www.commonweal.scot/articles/magazine-zrell
1•robtherobber•33m ago•1 comments

War at the Final Frontier

https://medium.com/@firstfromreverse/war-at-the-final-frontier-2f9af096a297
1•WishingWisp•35m ago•0 comments

N8n Docker Compose stack with secrets, TLS, and a 16-check validator

https://github.com/empostigo/n8n-compose-field-guide
1•44_88•37m ago•1 comments

OctoPerf MCP – drive load tests from any LLM (OAuth 2.1, no API key)

https://api.octoperf.com/doc/mcp/
1•Jellly•38m ago•0 comments

Computer Networking: A Top Down Approach (9th Ed): Online Video Presentations

https://gaia.cs.umass.edu/kurose_ross/lectures.php
4•teleforce•43m ago•0 comments

Create sandboxed rich-text telegram agents with a single config file

https://github.com/montyanderson/007
2•montyanderson•46m ago•0 comments

The Race to Reliable Visual Understanding

https://cacm.acm.org/news/the-race-to-reliable-visual-understanding/
2•visha1v•48m ago•0 comments

Show HN: Closedtab: a shared record for human-agent teams

https://www.npmjs.com/package/closedtab
1•omnivore•50m ago•0 comments