frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Toolbase – Build reliable AI teammates by example, not instruction

2•David1238•1y ago
Hey HN, we’re David and Ethan, co-founders of Toolbase.

Toolbase is an AI agent and workflow builder that helps you quickly create production-grade AI automations — batteries included.

Try it without registering here: https://gettoolbase.com/

------- The dev cycle in Toolbase -------

1. Define a rough goal.

2. Connect any API or MCP server (we have thousands, or bring your own).

3. Teach the AI what valid input and output examples look like (these later become your unit tests!).

4. Let Toolbase generate the perfect prompt, code, workflow, or agent.

5. Deploy your project as an API, MCP server, or chat interface!

Coding optional. Sharing encouraged.

Note: If you’re familiar with Cursor/Windsurf and the core concepts behind frameworks like Mastra (which Toolbase runs on), you already know how to use Toolbase. You retain the flexibility of coding but avoid boilerplate and plumbing tasks (integration, validation, context mapping, testing, etc.) unless you explicitly choose to do them.

------- Demo -------

https://www.loom.com/share/540c61b2c5634996b088ebbb16989cf0?...

This simple agent validates company billing addresses in our CRM (Pipedrive) by researching them through Tavily search. If there’s an address mismatch, it asks a human to pick the right one via email (watch on 2x speed):

Output email for the example shown in the demo: https://gettoolbase.com/assets/demo-screenshot.png

Producing code and more deterministic workflows follows a similar process.

------- Why another agent/workflow builder? -------

We started building Toolbase out of frustration with existing frameworks, especially for production use, and the lack of IDE support for MLOps artifacts, such as prompts, golden data, workflows, and evaluations. Since much of the code for agentic systems can be dynamically generated and validated using these artifacts, they often become even more important than the code itself.

After speaking with other builders, we also realized that manually coding workflows, experimenting with prompts through trial-and-error, and setting up infrastructure/integrations all took far more time than they should. Tools like Cursor and Windsurf help, but extracting meaning from AI-generated code is slow. Chatbots whipping up arcane code potions in tinted chat windows, which is the other end of the spectrum, demos really well but isn’t maintainable at all (sorry, vibe-coding). So we went with something in the middle: an AI-assiststed visual builder with full code fallback.

------- What do you think? -------

We’re excited for any feedback, thoughts, or questions from the HN community.

Let us know what you think in the comments!

- David & Ethan

Comments

David1238•1y ago
If you want to jump straight to trying it, you can preview it here:

https://gettoolbase.com

Try clicking the play button at the bottom right of workflow pane that appears next to the big blue text.

Clicking each node after you submit your query shows you its results for the run.

Full-screening the experience (icon on the top right) and expanding a node (same icon within a step of the workflow) lets you train that prompt in the same way we did in the demo.

The first node (“User Intent Extractor”) is purposefully vague so you can train it yourself!

PCIe 8.0 Spec Draft 0.5 Released for 1TB/S Bi-Directional X16 Bandwidth

https://www.phoronix.com/news/PCIe-8.0-Draft-0.5
1•anonymousiam•42s ago•0 comments

Show HN: Pay.sh – Discover, access, and pay for any API autonomously

https://github.com/solana-foundation/pay
2•fmerian•3m ago•0 comments

Proof of Use against vibe coded software

https://fireharp.com/2026-05-06-proof-of-use-888c47815a
1•fireharp•3m ago•0 comments

EVs now holding their value longer than petrol cars

https://www.thetimes.com/business/companies-markets/article/evs-now-holding-their-value-longer-th...
1•littlexsparkee•3m ago•0 comments

Run multiple versions of the same Python library in one process

https://github.com/claude-at-work/Bubblev2
1•nexusanon•4m ago•0 comments

Recursive grep written in Go benched against a C++ and Rust variant

https://github.com/bep/grrep
1•bjornerik•6m ago•0 comments

MCP Agora open source and local cross-agent persistent memory for AI agents

https://github.com/cioffiAI/mcp-agora
1•cioffiAI•6m ago•0 comments

Lovelace.ai Launches with Context Engine Builder for Mission-Critical AI

https://lovelace.ai/articles/lovelace-emerges-from-stealth-with-industry-defining-context-engine-...
1•tmoertel•6m ago•0 comments

Sim1 – A world where you live alongside AI agents

https://www.sim1.world
1•RoniHenareh•7m ago•0 comments

Zyphra releases the ZAYA1-8B MoE model optimized for intelligence density

https://huggingface.co/Zyphra/ZAYA1-8B
1•mirzap•7m ago•1 comments

Reliable Web App Pattern for .NET

https://learn.microsoft.com/en-us/azure/architecture/web-apps/guides/enterprise-app-patterns/reli...
1•Brysonbw•8m ago•0 comments

Orbit VPS

https://github.com/KenyanRedwoods01/Orbit
1•RedwoodsKenyan•10m ago•0 comments

Data Roles Now Average 24.9 Interview Hours per Hire, Highest Across Tech Roles

https://www.interviewquery.com/p/data-roles-interview-process-2026
1•littlexsparkee•10m ago•3 comments

Mail2github: Send Email and create files in GitHub with Email content

https://github.com/ulrischa/mail2github
1•ulrischa•15m ago•0 comments

GB10 Solution Atlas is now open source, <2min cold start 100 tok/s Qwen3.6-FP8

https://github.com/Avarok-Cybersecurity/atlas
1•azeezish•16m ago•0 comments

Show HN: Vibeguard-dev/local – static AST analysis for AI-generated SQL

https://github.com/MuddySheep/vibeguard-local
2•MuddySheep•17m ago•1 comments

Teleport Contest: Porting NetHack to JavaScript and Dealing with LLM Religion

https://mazesofmenace.ai/announcement/
1•abgruszecki•17m ago•0 comments

Wall Street Millennial: OpenAI Lobbying for Gov Bailout

https://www.youtube.com/watch?v=tiv_LWUzEM8
1•nradov•19m ago•0 comments

China's cyber capabilities now equal to the US, warns Dutch intelligence

https://therecord.media/china-cyber-capabilities-match-us-dutch-intel-says
1•PaulHoule•20m ago•0 comments

Blink – AI Assistant. A knowledge destination

https://blink-oi.vercel.app
2•Pascal1997•21m ago•2 comments

French professor investigated for awarding himself fake prize

https://www.bbc.com/news/articles/c4g8pwjdp6do
3•billybuckwheat•21m ago•0 comments

First-ever 'quadsqueezing' quantum interaction

https://phys.org/news/2026-05-physicists-quadsqueezing-quantum-interaction.html
1•airstrike•22m ago•0 comments

Google's Prompt API

https://wil.to/posts/googles-prompt-api/
2•cdrnsf•22m ago•0 comments

Samsung.com serves lower prices to Archive.org

https://web.archive.org/web/20260506202901/https://www.samsung.com/us/memory-storage/sata-ssd/870...
2•paulnpace•24m ago•0 comments

Show HN: Sqlflow, a SQLite back end layer for Go

https://github.com/avalonbits/sqlflow
1•iccananea•25m ago•0 comments

Galactic Archives:Interactive atlas and timeline of the Star Wars canon universe

https://thegalacticarchive.com/
1•joebig•27m ago•0 comments

It Takes 6 Days to Change 1 Line of Code (2015)

https://edw519.posthaven.com/it-takes-6-days-to-change-1-line-of-code
1•downbad_•28m ago•1 comments

The Work Is Social

https://yusufaytas.com/the-real-work-is-social
7•aura_farmer•29m ago•0 comments

I've made a quick browser extension for screenshot and annotation

https://snap-annotate.netlify.app/
1•finiskyy•29m ago•0 comments

RuneBench: Agent Benchmark on RuneScape Gameplay Tasks

https://maxbittker.github.io/runebench/
2•frozenseven•30m ago•0 comments