frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Launch HN: Sentrial (YC W26) – Catch AI Agent Failures Before Your Users Do

https://www.sentrial.com/
9•anayrshukla•1h ago
Hey HN! We're Neel and Anay, and we’re building Sentrial (https://sentrial.com). It’s production monitoring for AI products. We automatically detect failure patterns: loops, hallucinations, tool misuse, and user frustrations the moment they happen. When issues surface, Sentrial diagnoses the root cause by analyzing conversation patterns, model outputs, and tool interactions, then recommends specific fixes.

Here's a demo if you're interested: https://www.youtube.com/watch?v=cc4DWrJF7hk. When agents fail, choose wrong tools, or blow cost budgets, there's no way to know why - usually just logs and guesswork. As agents move from demos to production with real SLAs and real users, this is not sustainable.

Neel and I lived this, building agents at SenseHQ and Accenture where we found that debugging agents was often harder than actually building them. Agents are untrustworthy in prod because there’s no good infrastructure to verify what they’re actually doing.

In practice this looks like: - A support agent that began misclassifying refund requests as product questions, which meant customers never reached the refund flow. - A document drafting agent that would occasionally hallucinate missing sections when parsing long specs, producing confident but incorrect outputs. There’s no stack trace or 500 error and you only figure this out when a customer is angry.

We both realized teams were flying blind in production, and that agent native monitoring was going to be foundational infrastructure for every serious AI product. We started Sentrial as a verification layer designed to take care of this.

How it works: You wrap your client with our SDK in only a couple of lines. From there, we detect drift for you: - Wrong tool invocations - Misunderstood intents - Hallucinations - Quality regressions over time. You see it on our platform before a customer files a ticket.

There’s a quick mcp set up, just give claude code: claude mcp add --transport http Sentrial https://www.sentrial.com/docs/mcp

We have a free tier (14 days, no credit card required). We’d love any feedback from anyone running agents whether they be for personal use or within a professional setting.

We’ll be around in the comments!

Comments

rajit•1h ago
How do you identify "wrong tool" invocations (how is the "wrong tool" defined)?
anayrshukla•1h ago
Good question. We don’t define “wrong tool” in some universal way, because that really depends on the workflow.

What we do in practice is let the team mark a few tool calls as right or wrong in context, then use that to learn the pattern for that agent. From there, we can flag similar cases automatically by looking at the convo state, the tool chosen, the arguments, and what happened next.

So we’re learning what “correct” looks like for your workflow and then catching repeats of the same kind of mistake.

BoorishBears•1h ago
I know your homepage isn't your business, but I'm bet Claude could fix the janky horizontal overflow on mobile in a prompt. Makes for a very distracting read
anayrshukla•1h ago
Will fix ASAP.
claudeomusic•1h ago
Agreed - fix fast. No way to take a tool seriously about taking care of production that has such a blatant production issue

Temporal: A nine-year journey to fix time in JavaScript

https://bloomberg.github.io/js-blog/post/temporal/
182•robpalmer•2h ago•77 comments

Entities enabling scientific fraud at scale are large, resilient, growing (2025)

https://doi.org/10.1073/pnas.2420092122
178•peyton•4h ago•84 comments

Making WebAssembly a first-class language on the Web

https://hacks.mozilla.org/2026/02/making-webassembly-a-first-class-language-on-the-web/
119•mikece•13h ago•44 comments

BitNet: 100B Param 1-Bit model for local CPUs

https://github.com/microsoft/BitNet
215•redm•5h ago•112 comments

Where Some See Strings, She Sees a Space-Time Made of Fractals

https://www.quantamagazine.org/where-some-see-strings-she-sees-a-space-time-made-of-fractals-2026...
51•tzury•2h ago•3 comments

Elevated errors on login with Claude Code

https://status.claude.com/incidents/jm3b4jjy2jrt
49•zurfer•2h ago•34 comments

Show HN: Klaus – OpenClaw on a VM, batteries included

https://klausai.com/
36•robthompson2018•2h ago•21 comments

Lego's 0.002mm specification and its implications for manufacturing (2025)

https://www.thewave.engineer/articles.html/productivity/legos-0002mm-specification-and-its-implic...
275•scrlk•4h ago•217 comments

Faster asin() was hiding in plain sight

https://16bpp.net/blog/post/faster-asin-was-hiding-in-plain-sight/
119•def-pri-pub•3h ago•60 comments

Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos

https://www.prismvideos.com
15•aliu327•1h ago•7 comments

Fungal Electronics

https://arxiv.org/abs/2111.11231
11•byt3h3ad•1h ago•2 comments

Wiz joins Google

https://www.wiz.io/blog/google-closes-deal-to-acquire-wiz
86•aldarisbm•3h ago•61 comments

Launch HN: Sentrial (YC W26) – Catch AI Agent Failures Before Your Users Do

https://www.sentrial.com/
9•anayrshukla•1h ago•5 comments

AI Agent Hacks McKinsey

https://codewall.ai/blog/how-we-hacked-mckinseys-ai-platform
223•mycroft_4221•8h ago•88 comments

Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
32•theredsix•3h ago•14 comments

Show HN: Vanilla JavaScript refinery simulator built to explain job to my kids

https://fuelingcuriosity.com/game.html
23•fuelingcurious•1h ago•13 comments

The MacBook Neo

https://daringfireball.net/2026/03/the_macbook_neo
46•etothet•6h ago•179 comments

Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG

https://aether.saphal.me/dashboard/default
19•saphalpdyl•4h ago•1 comments

5,200 holes carved into a Peruvian mountain left by an ancient economy

https://newatlas.com/environment/5-200-holes-peruvian-mountain/
4•defrost•1d ago•0 comments

Searching for the Agentic IDE

https://twitter.com/karpathy/status/2031616709560610993
6•bigwheels•2h ago•4 comments

Building a TB-303 from Scratch

https://loopmaster.xyz/tutorials/tb303-from-scratch
181•stagas•3d ago•71 comments

Swiss e-voting pilot can't count 2,048 ballots after decryption failure

https://www.theregister.com/2026/03/11/swiss_evote_usb_snafu/
39•jjgreen•5h ago•88 comments

Zig – Type Resolution Redesign and Language Changes

https://ziglang.org/devlog/2026/#2026-03-10
364•Retro_Dev•16h ago•201 comments

PeppyOS: A simpler alternative to ROS 2 (now with containers support)

https://peppy.bot/
56•Ekami•3d ago•20 comments

Why the global elite gave up on spelling and grammar

https://www.wsj.com/lifestyle/jeffrey-epstein-files-bad-grammar-spelling-trump-ellison-dorsey-gat...
42•matthieu_bl•2h ago•110 comments

Visualizing Ukkonen's Suffix Tree Algorithm

https://www.abahgat.com/blog/visualizing-ukkonens-algorithm/
12•gsky•1d ago•2 comments

Show HN: I built a tool that watches webpages and exposes changes as RSS

https://sitespy.app
3•vkuprin•1h ago•0 comments

Writing my own text editor, and daily-driving it

https://blog.jsbarretto.com/post/text-editor
188•todsacerdoti•15h ago•101 comments

Cloudflare crawl endpoint

https://developers.cloudflare.com/changelog/post/2026-03-10-br-crawl-endpoint/
435•jeffpalmer•19h ago•166 comments

Yann LeCun raises $1B to build AI that understands the physical world

https://www.wired.com/story/yann-lecun-raises-dollar1-billion-to-build-ai-that-understands-the-ph...
574•helloplanets•1d ago•467 comments