frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you prevent AI agents from going rogue in production?

3•techbuilder4242•9h ago
Hi all!

There seems to be an ongoing trend (and my gut feeling) of companies moving from chatbots to AI agents that can actually execute actions—calling APIs, modifying databases, making purchases, etc.

I'm curious: if you're running these in production, how are you handling the security layer beyond prompt injection defenses?

Questions:

- What stops your agent from executing unintended actions (deleting records, unauthorized transactions)?

- Have you actually encountered a situation where an agent went rogue, and you lost money or data?

- Are current tools (IAM policies, approval workflows, monitoring) enough, or is there a gap?

Trying to figure out if this is a real problem worth solving or if existing approaches are working fine.

Ask HN: Quantum Computation, Computers and Programming

11•rramadass•12h ago•9 comments

ADHD. How do you manage the constant stream of thoughts and ideas?

15•chriswright1664•34m ago•10 comments

Ask HN: Discrepancy between Lichess and Stockfish

15•HNLurker2•4h ago•10 comments

Ask HN: 500 citation MSc CS, stuck in a low-trust region. How to move forward?

14•throwawaysafely•7h ago•12 comments

Ask HN: Looking for Windows contributors for meeting-detection engine

7•Ayobamiu•7h ago•0 comments

Ask HN: Who remembers AWS Spot's auction era before the 2017 pricing change?

2•aleroawani•4h ago•0 comments

Tell HN: DigitalOcean's managed services broke each other after update

75•neilfrndes•23h ago•46 comments

Ask HN: Vxlan over WireGuard or WireGuard over Vxlan?

33•mlhpdx•4h ago•52 comments

Ask HN: What are you working on? (January 2026)

253•david927•2d ago•833 comments

Tell HN: The insane price hike of internal SSDs

3•malshe•3h ago•5 comments

Tell HN: Intel could blow up the Console Wars if it had the guts

2•noumenon1111•3h ago•5 comments

Tell HN: The Google Tenor GIF API has been shut down

13•dfajgljsldkjag•7h ago•9 comments

Ask HN: Does anyone else think that humanoid robots is a bubble?

5•NewUser76312•3h ago•8 comments

Ask HN: Are you underutilizing your insurance too?

2•nemath•4h ago•4 comments

Ask HN: Learning Discoverability

2•learnwithmattc•9h ago•0 comments

Is "AI vibe coding" making prototyping worse inside real companies?

11•arapkuliev•7h ago•1 comments

Ask HN: What made you move back to HTML-to-PDF in production?

5•gokulsiva•8h ago•4 comments

Ask HN: Iran's 120h internet shutdown, phones back. How to stay resilient?

47•us321•6h ago•54 comments

Eleva.js – A 2.3KB JavaScript framework with signals and no virtual DOM

2•TarekRaafat•6h ago•0 comments

Unpopular Opinion: Bootstrap is a better front-end framework than Tailwind

21•pyeri•18h ago•25 comments

Experiment: Using NotebookLM as a cynical code reviewer (via custom prompts)

2•practicalaifg•7h ago•0 comments

Gh Account Permabanned – Help?

9•nicomeemes•8h ago•8 comments

Ask HN: Story about a CEO going off on a user who left feedback?

3•VladVladikoff•9h ago•2 comments

Ask HN: How do you prevent AI agents from going rogue in production?

3•techbuilder4242•9h ago•0 comments

Ask HN: Salesforce, SAP, or ServiceNow: Which Is Most Ripe for Disruption?

6•Saurabh_Kumar_•9h ago•1 comments

My casual chat with AI about cancer led to an internal prototype named"Onco-Bus"

2•sony707•9h ago•0 comments

Ask HN: Where is all the protest music?

6•swiper_lux•13h ago•14 comments

Ask HN: Job seekers, what's working / not working?

17•Jabbs•1d ago•17 comments

Ask HN: Is Reddit Down(-Ish)?

3•theanonymousone•7h ago•7 comments

I am a fan of XianXia's novels from China

2•tain1223•6h ago•0 comments