Show HN: Toolbase – Build reliable AI teammates by example, not instruction

2•David1238•1y ago
Hey HN, we’re David and Ethan, co-founders of Toolbase.

Toolbase is an AI agent and workflow builder that helps you quickly create production-grade AI automations — batteries included.

Try it without registering here: https://gettoolbase.com/

------- The dev cycle in Toolbase -------

1. Define a rough goal.

2. Connect any API or MCP server (we have thousands, or bring your own).

3. Teach the AI what valid input and output examples look like (these later become your unit tests!).

4. Let Toolbase generate the perfect prompt, code, workflow, or agent.

5. Deploy your project as an API, MCP server, or chat interface!
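
To illustrate step 3, here is a minimal Python sketch of how input/output examples could double as unit tests. It is not Toolbase's API: `Example`, `run_examples`, and the stand-in step `normalize_address` are hypothetical names made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Example:
    """One input/output pair supplied while teaching a step (hypothetical type)."""
    input: str
    expected: str


def normalize_address(raw: str) -> str:
    """Hypothetical stand-in for a generated step: collapse whitespace, title-case."""
    return " ".join(raw.split()).title()


def run_examples(step: Callable[[str], str], examples: list[Example]) -> list[str]:
    """Run the step on every example and return one message per mismatch."""
    failures = []
    for ex in examples:
        actual = step(ex.input)
        if actual != ex.expected:
            failures.append(f"{ex.input!r}: expected {ex.expected!r}, got {actual!r}")
    return failures


# The same examples used for teaching are replayed as regression tests.
EXAMPLES = [
    Example("  12 main   street ", "12 Main Street"),
    Example("5 HIGH ROAD", "5 High Road"),
]

if __name__ == "__main__":
    print(run_examples(normalize_address, EXAMPLES))
```

An empty list means every example still passes. Any regenerated prompt or code for the step can be re-checked against the same examples.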

Coding optional. Sharing encouraged.

Note: If you’re familiar with Cursor/Windsurf and the core concepts behind frameworks like Mastra (which Toolbase runs on), you already know how to use Toolbase. You retain the flexibility of coding but avoid boilerplate and plumbing tasks (integration, validation, context mapping, testing, etc.) unless you explicitly choose to do them.

------- Demo -------

This simple agent validates company billing addresses in our CRM (Pipedrive) by researching them through Tavily search. If there’s an address mismatch, it asks a human to pick the right one via email (watch on 2x speed):

https://www.loom.com/share/540c61b2c5634996b088ebbb16989cf0?...

Output email for the example shown in the demo: https://gettoolbase.com/assets/demo-screenshot.png
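
The decision at the heart of the demo agent (match, or escalate to a human) can be sketched roughly as follows. This is an assumption-laden illustration, not the workflow Toolbase generates: it makes no Pipedrive, Tavily, or email calls, and `canonical`, `Decision`, and `check_address` are hypothetical names.

```python
from dataclasses import dataclass


def canonical(address: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    kept = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in address.lower())
    return " ".join(kept.split())


@dataclass
class Decision:
    status: str            # "ok" or "needs_human" (hypothetical labels)
    candidates: list[str]  # addresses a human would choose between


def check_address(crm_address: str, researched: list[str]) -> Decision:
    """Accept the CRM address if any researched address matches it, else escalate."""
    target = canonical(crm_address)
    if any(canonical(r) == target for r in researched):
        return Decision("ok", [])

    # Mismatch: gather distinct options (CRM value first) for a human to pick from.
    seen: set[str] = set()
    candidates: list[str] = []
    for addr in [crm_address, *researched]:
        key = canonical(addr)
        if key not in seen:
            seen.add(key)
            candidates.append(addr)
    return Decision("needs_human", candidates)


if __name__ == "__main__":
    print(check_address("1 Main St., Springfield", ["1 main st springfield"]))
    print(check_address("1 Main St", ["9 Oak Ave", "9 oak ave", "2 Elm Rd"]))
```

In this sketch an empty research result also escalates, with the CRM address as the only candidate. A real workflow might instead retry the search first.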

Producing code and more deterministic workflows follows a similar process.

------- Why another agent/workflow builder? -------

We started building Toolbase out of frustration with existing frameworks, especially for production use, and the lack of IDE support for MLOps artifacts, such as prompts, golden data, workflows, and evaluations. Since much of the code for agentic systems can be dynamically generated and validated using these artifacts, they often become even more important than the code itself.

After speaking with other builders, we also realized that manually coding workflows, experimenting with prompts through trial and error, and setting up infrastructure and integrations all took far more time than they should. Tools like Cursor and Windsurf help, but extracting meaning from AI-generated code is slow. At the other end of the spectrum, chatbots whipping up arcane code potions in tinted chat windows demo really well, but the result isn’t maintainable at all (sorry, vibe-coding). So we went with something in the middle: an AI-assisted visual builder with full code fallback.

------- What do you think? -------

We’re excited for any feedback, thoughts, or questions from the HN community.

Let us know what you think in the comments!

- David & Ethan

Comments

David1238•1y ago
If you want to jump straight to trying it, you can preview it here:

https://gettoolbase.com

Try clicking the play button at the bottom right of the workflow pane that appears next to the big blue text.

Clicking each node after you submit your query shows you its results for the run.

Full-screening the experience (icon on the top right) and expanding a node (same icon within a step of the workflow) let you train that prompt the same way we did in the demo.

The first node (“User Intent Extractor”) is purposefully vague so you can train it yourself!
