Show HN: Toolbase – Build reliable AI teammates by example, not instruction

2•David1238•7mo ago
Hey HN, we’re David and Ethan, co-founders of Toolbase.

Toolbase is an AI agent and workflow builder that helps you quickly create production-grade AI automations — batteries included.

Try it without registering here: https://gettoolbase.com/

------- The dev cycle in Toolbase -------

1. Define a rough goal.

2. Connect any API or MCP server (we have thousands, or bring your own).

3. Teach the AI what valid input and output examples look like (these later become your unit tests!).

4. Let Toolbase generate the perfect prompt, code, workflow, or agent.

5. Deploy your project as an API, MCP server, or chat interface!
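Step 3 says the input/output examples later become unit tests. The snippet below is a minimal sketch of that idea in plain Python, not Toolbase's actual implementation: `Example`, `normalize_address`, and `run_examples` are hypothetical names invented for illustration, and the "generated step" is a hand-written stand-in.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Example:
    """One input/output pair supplied while teaching a step (hypothetical shape)."""
    given: str
    expected: str


def normalize_address(raw: str) -> str:
    """Stand-in for a generated step: collapse whitespace and title-case the text."""
    return " ".join(raw.split()).title()


def run_examples(step: Callable[[str], str], examples: list[Example]) -> list[Example]:
    """Run every example through the step and return the ones it fails."""
    return [ex for ex in examples if step(ex.given) != ex.expected]


# The same examples used for teaching double as regression tests.
EXAMPLES = [
    Example("  12 main   st ", "12 Main St"),
    Example("1 INFINITE LOOP", "1 Infinite Loop"),
]

failures = run_examples(normalize_address, EXAMPLES)
print("all examples pass" if not failures else f"failing examples: {failures}")
```

Keeping the examples as data rather than as hand-written assertions means a regenerated prompt or code step can be re-checked against the same set without touching the tests.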

Coding optional. Sharing encouraged.

Note: If you’re familiar with Cursor/Windsurf and the core concepts behind frameworks like Mastra (which Toolbase runs on), you already know how to use Toolbase. You retain the flexibility of coding but avoid boilerplate and plumbing tasks (integration, validation, context mapping, testing, etc.) unless you explicitly choose to do them.

------- Demo -------

This simple agent validates company billing addresses in our CRM (Pipedrive) by researching them through Tavily search. If there’s an address mismatch, it asks a human to pick the right one via email (watch on 2x speed):

https://www.loom.com/share/540c61b2c5634996b088ebbb16989cf0?...

Output email for the example shown in the demo: https://gettoolbase.com/assets/demo-screenshot.png
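The decision at the heart of the demo (compare the CRM address with researched ones, escalate to a human on mismatch) can be sketched roughly as follows. This is an illustrative assumption, not the agent from the video: it makes no Pipedrive, Tavily, or email calls, and `Decision`, `_canon`, and `check_billing_address` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    action: str            # "ok" or "ask_human"
    candidates: list[str]  # addresses a human would choose from


def _canon(address: str) -> str:
    """Crude canonical form: lowercase, drop commas, collapse whitespace."""
    return " ".join(address.lower().replace(",", " ").split())


def check_billing_address(crm_address: str, researched: list[str]) -> Decision:
    """Accept the CRM address if any researched address matches it; otherwise escalate."""
    crm = _canon(crm_address)
    if any(_canon(found) == crm for found in researched):
        return Decision("ok", [crm_address])
    # Mismatch: collect de-duplicated candidates, CRM address first,
    # so a human can pick the right one (e.g. from an email).
    unique: dict[str, str] = {}
    for address in [crm_address, *researched]:
        unique.setdefault(_canon(address), address)
    return Decision("ask_human", list(unique.values()))


match = check_billing_address("1 Main St, Springfield", ["1 main st springfield"])
mismatch = check_billing_address(
    "1 Main St, Springfield",
    ["5 Oak Ave, Springfield", "5 oak ave springfield"],
)
print(match.action, mismatch.action, mismatch.candidates)
```

A real agent would likely use fuzzier address matching than this string comparison; the point is only that the human is asked when the sources disagree.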

Producing code and more deterministic workflows follows a similar process.

------- Why another agent/workflow builder? -------

We started building Toolbase out of frustration with existing frameworks, especially for production use, and the lack of IDE support for MLOps artifacts, such as prompts, golden data, workflows, and evaluations. Since much of the code for agentic systems can be dynamically generated and validated using these artifacts, they often become even more important than the code itself.

After speaking with other builders, we also realized that manually coding workflows, experimenting with prompts through trial and error, and setting up infrastructure and integrations all took far more time than they should. Tools like Cursor and Windsurf help, but extracting meaning from AI-generated code is slow. At the other end of the spectrum, chatbots whipping up arcane code potions in tinted chat windows demo really well but aren’t maintainable at all (sorry, vibe-coding). So we went with something in the middle: an AI-assisted visual builder with full code fallback.

------- What do you think? -------

We’re excited for any feedback, thoughts, or questions from the HN community.

Let us know what you think in the comments!

- David & Ethan

Comments

David1238•7mo ago
If you want to jump straight to trying it, you can preview it here:

https://gettoolbase.com

Try clicking the play button at the bottom right of the workflow pane that appears next to the big blue text.

Clicking each node after you submit your query shows you its results for the run.

Full-screening the experience (icon on the top right) and expanding a node (same icon within a step of the workflow) let you train that prompt in the same way we did in the demo.

The first node (“User Intent Extractor”) is purposefully vague so you can train it yourself!
