The Agent Lobotomy: Inference-time verification for autonomous systems

https://steerlabs.substack.com/p/solving-the-confident-idiot-problem
1•steer_dev•1d ago

Comments

steer_dev•1d ago
Doing post-mortems on my agent's failures over the holidays made me realize the problem isn't the model. It is the lack of a deterministic inference-time verification layer.

I spent the break reading the recent Stanford/Harvard paper on agentic adaptation [1]. Their research provides mathematical proof for what I experienced in Q4: supervising only final outputs is a dead end. Agents learn to "ignore tools and improve likelihood," meaning they learn to lie more convincingly to pass evaluations while the underlying logic rots.

I call this the Agent Lobotomy.

The agent I have in production today is significantly dumber than the one I demoed in December. I was forced to strip autonomy, remove context, and add human checkpoints because I could not trust the probabilistic output. We are stuck in an Autonomy Retreat, creating an Authority Bottleneck [2] where agents are relegated to assistive tasks because the tail risk of autonomous action is too high.

I built Steer (open source) to stop the bleed. In v0.4.0, I moved the architecture to an Agent Service Mesh pattern. Instead of decorating every function, you patch the framework (e.g. PydanticAI) at the entry point. It auto-discovers tools and enforces a reliability policy globally via deterministic Reality Locks.
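
To make the entry-point patching idea concrete, here is a minimal Python sketch of the general pattern. It is not Steer's API and does not use PydanticAI: `ToolRegistry`, `patch`, `guard`, `Blocked`, and `read_only_sql` are all hypothetical names, and the registry is only a stand-in for whatever tool table a framework keeps. The point is that tools are wrapped once, in one place, rather than decorated one by one.

```python
from functools import wraps
from typing import Any, Callable, Dict, List


class Blocked(Exception):
    """Raised when a deterministic check rejects a tool call (hypothetical)."""


# A lock inspects the tool name and keyword arguments; it raises Blocked on a violation.
Lock = Callable[[str, Dict[str, Any]], None]


def read_only_sql(tool_name: str, kwargs: Dict[str, Any]) -> None:
    """Hypothetical lock: allow only statements that start with SELECT.

    This is a naive prefix check for illustration; a real lock would parse the SQL.
    """
    query = kwargs.get("query")
    if query is not None and not str(query).lstrip().lower().startswith("select"):
        raise Blocked(f"{tool_name}: only SELECT statements are allowed")


class ToolRegistry:
    """Stand-in for a framework's table of registered tools (assumption)."""

    def __init__(self) -> None:
        self.tools: Dict[str, Callable[..., Any]] = {}

    def register(self, fn: Callable[..., Any]) -> Callable[..., Any]:
        self.tools[fn.__name__] = fn
        return fn


def guard(name: str, fn: Callable[..., Any], locks: List[Lock]) -> Callable[..., Any]:
    """Wrap one tool so every lock runs before the tool body does."""

    @wraps(fn)
    def guarded(**kwargs: Any) -> Any:
        # Keyword-only on purpose, so no argument can slip past the locks positionally.
        for lock in locks:
            lock(name, kwargs)
        return fn(**kwargs)

    return guarded


def patch(registry: ToolRegistry, locks: List[Lock]) -> None:
    """Discover every registered tool and replace it with its guarded version."""
    for name, fn in list(registry.tools.items()):
        registry.tools[name] = guard(name, fn, locks)


registry = ToolRegistry()


@registry.register
def run_sql(query: str) -> str:
    return f"executed: {query}"


patch(registry, [read_only_sql])

print(registry.tools["run_sql"](query="SELECT 1"))
try:
    registry.tools["run_sql"](query="DROP TABLE users")
except Blocked as err:
    print("blocked:", err)
```

Because the check is plain code rather than another model call, the same input always gets the same verdict, which is the property the word "deterministic" is carrying here.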

The real unlock is the data. By capturing the delta between a Blocked Response and a Taught Fix, Steer acts as a synthetic data factory for DPO. It moves reliability from a runtime tax to a training asset, allowing you to eventually refactor your prompt monolith into fine-tuned model weights.
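
As a rough illustration of that delta, the sketch below logs one preference pair per blocked call. The `PreferencePair` class and `log_pair` helper are hypothetical, not Steer's storage format; the `prompt` / `chosen` / `rejected` field names follow a layout commonly used for preference datasets, and the sample strings are made up.

```python
import io
import json
from dataclasses import asdict, dataclass
from typing import TextIO


@dataclass
class PreferencePair:
    prompt: str    # what the agent was asked
    chosen: str    # the taught fix (preferred response)
    rejected: str  # the blocked response (dispreferred response)


def log_pair(out: TextIO, prompt: str, blocked_response: str, taught_fix: str) -> PreferencePair:
    """Append one preference pair to a JSON Lines stream and return it."""
    pair = PreferencePair(prompt=prompt, chosen=taught_fix, rejected=blocked_response)
    out.write(json.dumps(asdict(pair)) + "\n")
    return pair


# Illustrative data only: an in-memory buffer stands in for a log file.
buffer = io.StringIO()
log_pair(
    buffer,
    prompt="Remove test rows from the orders table.",
    blocked_response="DROP TABLE orders",
    taught_fix="SELECT id FROM orders WHERE is_test = 1",
)
print(buffer.getvalue())
```

Each line is then a self-contained training example, so the log can grow at runtime and be handed to a preference-tuning job later without reshaping.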

I've put together three cookbooks showing how this stops the lobotomy in SQL and RAG workflows:
1/ Framework Patching: https://github.com/imtt-dev/steer/blob/main/steer/cookbook/p...
2/ SQL Security Lock: https://github.com/imtt-dev/steer/blob/main/steer/cookbook/s...
3/ RAG Grounding Guard: https://github.com/imtt-dev/steer/blob/main/steer/cookbook/r...

References:
[1] https://arxiv.org/abs/2512.16301
[2] https://cloudedjudgement.substack.com/p/clouded-judgement-12...

The Dream of the Universal Library

https://asteriskmag.com/issues/12-books/the-dream-of-the-universal-library
1•ilamont•2m ago•0 comments

Show HN: Grammar of Graphics CLI tool made in Rust

https://github.com/williamcotton/gramgraph
1•williamcotton•4m ago•0 comments

Infinite Canvas: Building a Seamless, Pan-Anywhere Image Space – Codrops

https://tympanus.net/codrops/2026/01/07/infinite-canvas-building-a-seamless-pan-anywhere-image-sp...
1•rcarmo•4m ago•0 comments

OpenAI to Buy Pinterest? A Strategic Analysis

https://nekuda.substack.com/p/openai-to-buy-pinterest-heres-what
1•ilamont•5m ago•0 comments

What are we to make of "AI replacement"?

https://joshuagans.substack.com/p/what-are-we-to-make-of-ai-replacement
1•paulpauper•6m ago•0 comments

Lua is a pretty good config language

https://til.andrew-quinn.me/posts/lua-is-a-pretty-good-config-language/
1•hiAndrewQuinn•7m ago•0 comments

ActorAgents

https://tailrecursion.com/~alan/ActorAgents.html
1•wooby•7m ago•0 comments

Claude Code CLI Broken

https://github.com/anthropics/claude-code/issues/16673
6•sneilan1•7m ago•0 comments

Show HN: Startup Simulator – AI Choose Your Own Adventure

https://startup-simulator-beta.vercel.app/
1•baristaGeek•11m ago•0 comments

Dora 2025: Year in Review

https://dora.dev/insights/dora-2025-year-in-review/
1•cebert•15m ago•0 comments

Unit testing your code's performance, part 1: Big-O scaling

https://pythonspeed.com/articles/big-o-tests/
2•todsacerdoti•16m ago•0 comments

Tailscale state file encryption no longer enabled by default

https://tailscale.com/changelog
20•traceroute66•16m ago•3 comments

Show HN: Prompt Tower – build and visualize your context

https://prompttower.com/
2•ramoz•17m ago•0 comments

Free health summaries from the top creators

https://summabase.com/en
1•luis13hgr•18m ago•0 comments

US immigration officer fatally shoots woman, 37, in Minneapolis, officials say

https://www.bbc.com/news/live/c7510l1135wt
18•onemoresoop•18m ago•5 comments

Ledger customers impacted by third-party Global-e data breach

https://www.bleepingcomputer.com/news/security/ledger-customers-impacted-by-third-party-global-e-...
1•DGAP•21m ago•0 comments

Why Musk says it would be a 'distraction' for SpaceX to go to Mars this year

https://www.morningstar.com/news/marketwatch/20260107182/why-elon-musk-now-says-it-would-be-a-dis...
3•voxadam•24m ago•0 comments

Intel's Best Product in Years – Panther Lake Announcement [video]

https://www.youtube.com/watch?v=bG68OBQ3x9Y
4•tester756•25m ago•0 comments

A minimal keyboard key effect with CSS

https://pjg1.site/kbd-css.html
2•birdculture•26m ago•0 comments

Claude Code Emergent Behavior: When Skills Combine

https://vibeandscribe.xyz/posts/2025-01-07-emergent-behavior.html
3•ryanthedev•26m ago•2 comments

Show HN: ScotiaSignal: Public sector intent data for Nova Scotia

https://scotiasignal.ca
2•5eva•27m ago•0 comments

Show HN: LLM-First Personal Knowledge Management

https://github.com/joel-solymosi
2•joelsol•31m ago•0 comments

Minneapolis driver shot and killed by ICE

https://www.nbcnews.com/news/us-news/federal-law-enforcement-involved-ice-related-shooting-minnea...
16•fzeroracer•34m ago•0 comments

Earino/DesigningCourse Materials for Designing Analytics Projects

https://github.com/earino/designing-analytics-projects
2•raybb•34m ago•0 comments

Why the Renovate project uses GitHub Discussions as our triage process

https://www.jvt.me/posts/2026/01/07/renovate-why-discussions/
5•zdw•34m ago•1 comments

AI writes code faster. Your job is still to prove it works

https://addyosmani.com/blog/code-review-ai/
2•speckx•39m ago•0 comments

A set of Idiomatic prod-grade katas for experienced devs transitioning to Go

https://github.com/MedUnes/go-kata
3•medunes•40m ago•1 comments

Show HN: TierWise – PPP pricing widget for SaaS (Built in 7 days)

3•elmascato•40m ago•0 comments

Practical Collision Attack Against Long Key IDs in PGP

https://soatok.blog/2026/01/07/practical-collision-attack-against-long-key-ids-in-pgp/
2•zdw•41m ago•0 comments

"This Is Candy" Cereal Warning Labels

https://kozubik.com/items/ThisisCandy/
3•rsync•41m ago•1 comments