Show HN: Task Orchestrator – Production Safety for Claude Code Agents

https://github.com/TC407-api/task-orchestrator
2•Travis_Cole•1h ago

Comments

Travis_Cole•1h ago
I've been using Claude Code heavily for months. It's great for velocity, but I kept hitting the same problems:

  - Agent hallucinates file paths that don't exist
  - Claims "tests pass" without running them
  - Same errors recurring across sessions
  - No way to catch failures that aren't crashes
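The first two problems can be checked mechanically. Here is a minimal sketch, not taken from the Task Orchestrator repo: `missing_paths`, `command_succeeds`, and `PYTEST_CMD` are hypothetical names, and it assumes the agent's claimed paths and the project's test command are available as plain values.

```python
import subprocess
import sys
from pathlib import Path

# Hypothetical test command; substitute whatever the project actually uses.
PYTEST_CMD = [sys.executable, "-m", "pytest", "-q"]


def missing_paths(claimed_paths, root="."):
    """Return the claimed paths that do not exist under root."""
    base = Path(root)
    return [p for p in claimed_paths if not (base / p).exists()]


def command_succeeds(cmd):
    """Run cmd and report whether it exited with status 0."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    return result.returncode == 0
```

Calling `command_succeeds(PYTEST_CMD)` after the agent says "tests pass" replaces the claim with an exit code, and a non-empty result from `missing_paths` flags paths the agent made up.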

  The tools exist to catch crashes. Nothing exists to catch semantic failures - when the agent confidently gives wrong answers.

  So I built Task Orchestrator - an MCP server that adds an "immune system" to Claude Code:

  1. Semantic failure detection - catches hallucinations, not just crashes
  2. ML-powered learning - remembers failure patterns, warns before similar prompts
  3. Human-in-the-loop - queues high-risk operations for approval
  4. Cost tracking - see exactly what you're spending
  5. Self-healing circuit breakers
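For readers unfamiliar with item 5, this is a rough sketch of the general circuit-breaker pattern, not Task Orchestrator's actual implementation. The class name, the thresholds, and the injectable `clock` parameter are all assumptions made for illustration.

```python
import time


class CircuitBreaker:
    """Stop calling a failing operation until a cool-down has passed."""

    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures  # failures before the breaker opens
        self.reset_after = reset_after    # cool-down in seconds
        self.clock = clock                # injectable so tests can fake time
        self.failures = 0
        self.opened_at = None             # None means the breaker is closed

    def allow(self):
        """Return True if a call may go through right now."""
        if self.opened_at is None:
            return True
        # Once the cool-down has elapsed, trial calls are allowed again.
        return self.clock() - self.opened_at >= self.reset_after

    def record_success(self):
        """A success closes the breaker and clears the failure count."""
        self.failures = 0
        self.opened_at = None

    def record_failure(self):
        """Count a failure; open (or re-open) the breaker at the threshold."""
        self.failures += 1
        if self.failures >= self.max_failures:
            self.opened_at = self.clock()
```

The "self-healing" part in this sketch is the cool-down: after `reset_after` seconds a trial call is allowed, a success closes the breaker, and a failure re-opens it.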

  The math problem: at 95% per-step reliability, a 20-step workflow has only a ~36% success rate (0.95^20 ≈ 0.358, assuming steps fail independently). That's not a bug - it's compound probability.
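The arithmetic is easy to reproduce. This small sketch assumes every step succeeds independently with the same probability; `workflow_success_rate` is a name made up for the example.

```python
def workflow_success_rate(per_step, steps):
    """Probability that all steps succeed, assuming independent steps."""
    return per_step ** steps


rate = workflow_success_rate(0.95, 20)
print(f"{rate:.1%}")  # prints 35.8%
```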

  Technical details:
  - 680+ tests
  - Provider-agnostic (works with any LLM)
  - MCP native for Claude Code
  - MIT licensed

  What features would you want to see that would improve your AI agent workflows?

Flint's Paper Batteries Are Here: Now in Production, Now Available

https://finance.yahoo.com/news/flints-paper-batteries-now-production-180000802.html
1•rguiscard•1m ago•0 comments

Show HN: Qventory – inventory and sales and fulfillment tracking for resellers

https://qventory.com/
1•noakmilo90•4m ago•0 comments

Launching the Handmade Software Foundation

https://handmade.network/blog/p/9106-welcome_to_2026%2521#30623
1•DeathArrow•6m ago•0 comments

Music of the Streets of Rage Series

https://en.wikipedia.org/wiki/Music_of_the_Streets_of_Rage_series
1•nomilk•7m ago•0 comments

Domain-Availability-MCP

https://twitter.com/fullstacktard/status/2012780828414341314
1•fullstacktard•11m ago•0 comments

Tell HN: BunnyPeople

1•fuzzfactor•12m ago•0 comments

Ask HN: Why is Google tolerating impersonation of Gmail from its own domain?

3•dvh•13m ago•1 comments

Iconify: Library of Open Source Icons

https://icon-sets.iconify.design/
1•sea-gold•13m ago•1 comments

Manual Transmission Thwarts Thieves' Attempt to Steal a Woman's (Kia) Soul

https://www.jalopnik.com/2077298/manual-transmission-stops-kia-soul-thieves/
2•t23•16m ago•1 comments

Show HN: GibRAM an in-memory ephemeral GraphRAG runtime for retrieval

https://github.com/gibram-io/gibram
1•ktyptorio•20m ago•0 comments

IBM T560 LCD

http://www.ibmfiles.com/pages/t560.htm
2•starkparker•20m ago•0 comments

Why are websites trying to talk at me?

1•LeratoAustini•25m ago•2 comments

Zencoder: Zenflow

https://zencoder.ai/lp/zenflow-enterprise
1•handfuloflight•27m ago•0 comments

Show HN: I Replaced Vector DBs with Optimal Transport (Open Source Project)

https://github.com/merchantmoh-debug/Remember-Me-AI
1•MohskiBroskiAI•29m ago•0 comments

Brooks on the System 360 and adoption of the 8 bit byte [video]

https://www.youtube.com/watch?v=9oOCrAePJMs
1•ggeorgovassilis•30m ago•0 comments

AgentCraft: RTS for AI Agents

https://www.getagentcraft.com/
1•doppp•31m ago•0 comments

Show HN: Travel itinerary manager (passion project)

https://tripwaffle.com
1•bufferout•38m ago•0 comments

Archaeologists find the oldest-known shell beads (2021)

https://leakeyfoundation.org/archaeologists-find-the-oldest-known-shell-beads/
1•thunderbong•39m ago•0 comments

Spirit of ThinkPad

https://thinknextdesign.com/home.html
2•__patchbit__•40m ago•0 comments

Reverse Engineering the ESP32-C3 Wi-Fi Drivers for Static Worst-Case Analysis

https://arxiv.org/abs/2501.17684
1•timschmidt•40m ago•0 comments

FragCut – AI that turns gaming streams into viral TikTok/Shorts clips in minutes

https://fragcut.io
1•jacobgor502•42m ago•2 comments

Chat, Save, and Blog

https://chatblogr.com
1•vijayst•1h ago•0 comments

Aesthetics Bento microsites with built-in analytics

1•sendnow•1h ago•0 comments

What the future holds for AI – from the people shaping it

https://www.nature.com/immersive/d41586-025-03701-5/index.html
1•XzetaU8•1h ago•0 comments

Disenshittify or die! How hackers can seize the means of computation (2024)

https://www.youtube.com/watch?v=4EmstuO0Em8
1•undeveloper•1h ago•1 comments

'He's an Idiot': Musk and Ryanair's O'Leary Trade Insults in Starlink Wi-Fi Row

https://www.politico.eu/article/elon-musk-ryanair-chief-michael-oleary-trade-insults-starlink-spa...
3•JumpCrisscross•1h ago•0 comments

How to Use Your Claude Code Pro Subscription in Docker

https://foldr.uk/claude-code-pro-subscription-docker/
1•johnny_reilly•1h ago•0 comments

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

https://arxiv.org/abs/2510.01171
1•ycombiredd•1h ago•0 comments

Trump 25% tariff on European allies until Denmark sells Greenland to US

https://www.theguardian.com/us-news/2026/jan/17/trump-tariff-european-countries-greenland
7•KnuthIsGod•1h ago•2 comments

SFTP Still Delivers the Goods

https://folio.co/blog/sftp-still-delivers-the-goods
1•whatrocks•1h ago•0 comments