I noticed my AI agent getting dumber the bigger my project got.
I started with Claude Code. First few days were magic. Then around week two — the agent started hallucinating functions that didn't exist. It got confused about what I was asking. More and more bugs. Every new feature harder than the last. I was spending more time fixing the agent's output than writing code myself.
I kept blaming the AI. But it wasn't the AI losing capability. It was my codebase losing structure.
Here's what was actually happening: same function names with different purposes scattered across files. Unrelated code dumped in the same folder. Dependencies tangling into spaghetti. When the agent searched my project, twenty conflicting results came back — and it picked the wrong one. Every session made the mess worse. Every mess made the next session harder. Eventually even the agent struggled to implement new features in its own codebase.
I looked at tools like Spec Kit — plan architecture first, then let AI implement. But that's not how I work. I prototype fast, chat with the agent, share ideas, follow inspiration. That creative flow is what makes AI agents powerful — and it's exactly what produces messy structure. AI agents can't focus on the big picture and small details at the same time.
What I built
A real-time feedback sensor. It watches the codebase — every file, every dependency — visualized as a live interactive treemap that updates as the agent writes code. 14 quality dimensions graded A through F.
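To make the A-through-F idea concrete, here's a rough sketch of how a single dimension's score might map to a letter grade. The thresholds and the 0.0–1.0 score scale are my illustration, not the tool's actual internals:

```rust
// Hypothetical sketch: map a per-dimension quality score (0.0 = worst,
// 1.0 = best) to a letter grade. Thresholds are illustrative only.
fn grade(score: f64) -> char {
    match score {
        s if s >= 0.9 => 'A',
        s if s >= 0.8 => 'B',
        s if s >= 0.7 => 'C',
        s if s >= 0.6 => 'D',
        _ => 'F', // traditional grading skips 'E'
    }
}
```

Under these made-up thresholds, a file that is 25% dead code (live-code ratio 0.75) would land at C; the real tool evidently weights things more harshly.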
For the demo I gave Claude Code 15 detailed step-by-step instructions to build a FastAPI app. Explicit module boundaries, explicit file separation. Five minutes later: Grade D. Cohesion F. 25% dead code. Even with careful instructions.
It also runs as an MCP server — the AI agent queries the grades mid-session, sees what degraded, and self-corrects. The feedback loop closes.
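The "sees what degraded" step amounts to diffing two grade snapshots. A minimal sketch of that comparison, with dimension names and grade values invented for illustration (not the tool's API):

```rust
use std::collections::HashMap;

// Hypothetical self-correction check: compare the current grade snapshot
// against the previous one and list every dimension that got worse.
// Grades compare by char code, so 'F' > 'D' > 'C' > 'B' > 'A' —
// a larger char means a worse grade.
fn degraded(prev: &HashMap<&str, char>, curr: &HashMap<&str, char>) -> Vec<String> {
    let mut out = Vec::new();
    for (dim, &g_prev) in prev {
        if let Some(&g_curr) = curr.get(dim) {
            if g_curr > g_prev {
                out.push(format!("{dim}: {g_prev} -> {g_curr}"));
            }
        }
    }
    out.sort();
    out
}
```

An agent receiving `["cohesion: B -> F"]` back mid-session has a concrete target to fix before piling on the next feature.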
Pure Rust, single binary, 23 languages via tree-sitter, MIT licensed. Happy to answer any questions.