frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How are you monitoring AI agents in production?

4•jairooh•17h ago
With the recent incidents (DataTalks database wipe by Claude Code, Replit agent deleting data during code freeze), it's clear that running AI agents in production without observability is risky.

Common failure modes I've seen: no visibility into what the agent did step-by-step, surprise LLM bills from untracked token usage, risky outputs going undetected, and no audit trail for post-mortems.

I've been building AgentShield (https://useagentshield.com) — an observability SDK for AI agents. It does execution tracing, risk detection on outputs, cost tracking per agent/model, and human-in-the-loop approval for high-risk actions. Plugs into LangChain, CrewAI, and OpenAI Agents SDK with a 2-line integration.

Curious what others are using. Rolling your own monitoring? LangSmith? Langfuse? Or just hoping for the best?

Comments

Horos•17h ago
ACID & Idempotent. dataplane / controlplane. dryruns et runbook automations.

llm does not act on production. he build scripts, and you take the greatest care of theses scripts.

Clone you customer data and run evertything blank.

Just uses the llm tool as dangerous tool: considere that it will fail each time it's able to.

even will all theses llm specific habitus, you still get a x100 productivity.

because each of theses advise can ben implemented by llms, for llms, by many way. it's almost free. just plan it.

verdverm•17h ago
OTEL & LGTM, the same stack I use for monitoring everything, on a technical level.

Some of the things you mention are more often addressed by guardrails. Some of the others (quality) require some evaluation for that measure, but results can go into the same monitory stack.

devonkelley•14h ago
Most observability tools in this space are dashcams. They show you what happened after you already got robbed.

The gap isn't monitoring. It's what happens automatically when degradation gets detected. Right now the answer for every team I've talked to is "page a human." That human reads logs, guesses, deploys a fix. The system already shifted while they were debugging.

Ask HN: What Are You Working On? (March 2026)

222•david927•14h ago•774 comments

Ask HN: Are showlang and thelang HN endpoints not being maintained?

2•freakynit•32m ago•0 comments

Ask HN: How to be alone?

577•sillysaurusx•1d ago•433 comments

Ask HN: Please restrict new accounts from posting

637•Oras•19h ago•460 comments

Ask HN: Most beautiful personal blog UI you have ever seen?

99•ms7892•15h ago•44 comments

Ask HN: How are you adapting your career in this AI era?

4•sarthaksaxena•1h ago•3 comments

Tell HN: I'm 60 years old. Claude Code has re-ignited a passion

1050•shannoncc•2d ago•954 comments

All tmux sessions as a single terminal

2•lygten•9h ago•1 comments

Ask HN: What is your oldest living presence on the World Wide Web?

2•dhruv3006•4h ago•0 comments

Ask HN: Would you use a job board where every listing is verified?

55•BelVisgarra•1d ago•95 comments

Ask HN: Are we going to see more job postings asking for only agentic coding?

4•ronbenton•13h ago•6 comments

OpenAI might end up on the right side of history

8•shoman3003•19h ago•3 comments

Ask HN: Can I repurpose a Bluetooth voice remote as input device for a PC?

2•albert_e•1d ago•1 comments

Ask HN: How are you handling persistent memory across local Ollama sessions

4•null-phnix•1d ago•0 comments

Ask HN: How are you monitoring AI agents in production?

4•jairooh•17h ago•3 comments

I replaced my freelance SaaS stack with 5 single-file HTML tools

4•AnnSri•23h ago•1 comments

Ask HN: Why Is Phil Wang / Lucidrains Off GitHub?

3•vessenes•23h ago•3 comments

Ask HN: Anyone else feel this community has changed recently?

52•kypro•2d ago•29 comments

Whisker – Self hosted e-commerce cart, pure PHP, zero dependencies

6•eLohith•1d ago•3 comments

Ask HN: Can we talk about AI Astroturfing?

46•overgard•1d ago•39 comments

PhD interrupted by personal safety issues, now publication record is thin

4•qthrwaway•1d ago•2 comments

Add llms.txt and fix robots.txt for AI agent discoverability

2•nishiohiroshi•1d ago•2 comments

Tell HN: The proposed KIDS Act (HR 7757) effectively mandates biometric browsing

18•fokdelafons•2d ago•0 comments

Ask HN: Last time you wrote code?

5•blinkbat•1d ago•13 comments

Ask HN: Do You Enjoy Your Career in Tech Nowadays?

29•karakoram•3d ago•30 comments

How do teams prevent duplicate LLM API calls and token waste?

3•cachelogic•1d ago•1 comments

What Will Happen to Android?

3•MrLey•1d ago•3 comments

If AI is so good, why don't we have an infinite supply of 10x engineers?

10•YounesDz•1d ago•10 comments

Best Monitoring and Observability Platform?

2•kebforlifer1•1d ago•1 comments

Ask HN: Has anyone noticed the fear-driven prompt suggestions that GPT5.3 makes?

14•cedarscarlett•4d ago•7 comments