frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: A policy gate that runs before your AI coding agent's tool calls

https://sigmashake.com
1•cavalrytactics•1h ago
As a Security Engineer with over 10+ years in industry, I kept running into the same problem with coding agents: Instructions are not guarantees.

I put guidance in `CLAUDE.md`, `AGENTS.md`, memory files, MCP descriptions, and tool documentation. I explicitly told the agent things like:

* Use the code graph for architecture questions instead of grepping the repository. * Do not use deprecated APIs or Unsafe code. * Prefer specific tools for specific tasks.

The agent would still ignore those instructions surprisingly often. It would grep the entire repo, use deprecated APIs, or choose a slower tool even when a better one was available.

That made me realize prompts and rules solve different problems.

A prompt is a probabilistic influence on model behavior. A rule is an enforcement mechanism.

So I built SSG (SigmaShake Governance), which sits between the agent and its tools. Instead of asking the model to remember a policy, SSG evaluates every tool call before it executes.

For example, this rule redirects architecture-related repository searches away from recursive grep and toward a code graph:

```text rule route-codebase-grep-to-graph { enable true priority 80 severity warning CATEGORY tool-routing FORCE search IF tool EQUALS "Grep" MESSAGE "Architecture, relationship, and dependency questions are routed to the code-graph tool." SUBSTITUTE "graphify query \"<what you were searching for>\"" } ```

When the agent attempts a grep for an architecture question, the call is redirected. If it attempts to write deprecated code, the write can be blocked before the content reaches disk and the replacement API can be suggested.

A few design choices:

* Rules are plain text and git-versioned. * Enforcement runs locally. * The same rules work across Claude Code, Codex, Cursor, Gemini, and MCP-based agents. * Bypasses are allowed, but recorded. * The goal is not to sandbox a hostile model; it's to prevent routine agent mistakes and shortcuts.

I found that many existing controls operate either too early or too late:

* Prompt files influence behavior but don't enforce it. * Tool allowlists are often all-or-nothing. * Pre-commit hooks catch problems after files have already been written. * Harness-specific permissions don't travel with the repository. (what if your team does not use the same harness?)

If you do not live in the terminal, SigmaShake Desktop is the same governance dashboard with no CLI required: a free direct download for macOS, Windows, and Linux. The Mac App Store and Microsoft Store also carry it as a paid, sandboxed build that auto-updates and skips the Gatekeeper and SmartScreen prompts, if you would rather pay once for the managed install (or just want to support the project): https://apps.apple.com/us/app/sigmashake-desktop/id676990115... https://apps.microsoft.com/detail/9N2CHV3STGS4

I've been building this in public for a few months. https://twitch.tv/sigmashake https://youtube.com/@sigmashakeinc

What instructions do your agents consistently ignore?

AI and brain-computer interface allow speechless ALS patient to work full-time

https://www.theregister.com/science/2026/06/16/ai-and-brain-computer-interface-allow-speechless-a...
1•geekinchief•30s ago•1 comments

State of Tech Hiring 2026: What 106,000 Job Listings Reveal

https://www.jobscroller.net/state-of-tech-hiring-2026
1•couentine•36s ago•0 comments

The Anthropic Fable saga proves: we have opened the AI Pandora's box. What now?

https://www.theguardian.com/commentisfree/2026/jun/16/anthropic-fable-ai
1•kuerbel•59s ago•0 comments

Show HN: Azure DevOps TUI Management Style

https://github.com/elpulgo/azdo
1•elpulgo•1m ago•0 comments

I took the le chaton fat AI model meme a step further by making a model

https://github.com/princezuda/lechatonfat
1•zuda•1m ago•1 comments

Polymarket Traders Clash over $345M Iran Peace Market (2026)

https://www.bloomberg.com/news/articles/2026-06-15/polymarket-traders-clash-over-345-million-iran...
1•kratom_sandwich•3m ago•0 comments

Y2Social – a Y2K-era social network I've been building solo

https://www.y2social.com
1•Jason463•3m ago•0 comments

Bug in FIFA World Cup internal system gave anyone ability to modify TV stream

https://techcrunch.com/2026/06/16/bug-in-fifa-world-cup-internal-system-gave-anyone-ability-to-mo...
1•cdrnsf•4m ago•0 comments

Future Self

https://futureyou.media.mit.edu/
2•onemoresoop•5m ago•0 comments

Show HN: Almost all of MonsterWriter's back end is open source

https://www.monsterwriter.com/building-linkedrecords.html
1•WolfOliver•5m ago•0 comments

Apple is about to make Hide My Email useless

https://arseniyshestakov.com/2026/06/16/apple-is-about-to-make-hide-my-email-useless/
3•SXX•8m ago•0 comments

Ask HN: What's your multi-agent orchestration setup, and success rate with it?

1•mandeepj•9m ago•0 comments

A Compendium of Canonical Charts

https://chris-parmer.com/compendium-of-canonical-charts/
1•chriddyp•15m ago•0 comments

Users cry foul after AMD stripped memory crypto from its consumer CPUs

https://arstechnica.com/security/2026/06/users-cry-foul-after-amd-stripped-memory-crypto-from-its...
7•Cider9986•15m ago•0 comments

Ask HN: How do you make LLM generated text believable?

1•StahlGuo•16m ago•1 comments

Ask HN: Those making $500/month on side projects in 2026 – Show and tell

2•fazkan•16m ago•1 comments

Why Anthropic candidates fail culture after clearing coding and system design

https://www.hack2hire.com/blog/what-anthropic-actually-tests-and-what-gets-candidates-rejected-2026
2•hack2hire•16m ago•1 comments

Run Gemma on the edge with the Coral Board [video]

https://www.youtube.com/watch?v=o2rUT2GloV0
1•simonpure•17m ago•0 comments

US manufacturer of military TV walls sold to China, US wants it back

https://www.thewirechina.com/2026/06/14/behind-the-screens/
3•ilamont•17m ago•0 comments

German broadcaster removes TV intro after Elon Musk takes legal action

https://www.bbc.com/news/articles/c0jyzp9z9deo
1•only_in_america•20m ago•0 comments

Will AI End the Open Internet? [Wading Through AI – Episode 6] [video]

https://www.youtube.com/watch?v=gR2T1uxHG7o
1•justin66•20m ago•0 comments

A 1969 camera operators' strike created Upstairs Downstairs multiverse

https://ironicsans.ghost.io/the-color-strike/
2•ohjeez•21m ago•0 comments

Researchers use tiny radio backpacks to track elusive gecko species

https://www.rnz.co.nz/news/environment/598179/researchers-use-tiny-radio-backpacks-to-track-elusi...
1•speckx•22m ago•0 comments

Moving 100TB of backups from MinIO to self-hosted SeaweedFS on Hetzner

https://we-manage.de/en/blog/seaweedfs-on-hetzner-self-hosted-s3-storage
2•shakalandy•22m ago•0 comments

Show HN: Cpt-city, Xkcd colour schemes

https://phillips.shef.ac.uk/pub/cpt-city/xkcd
1•jjgreen•22m ago•0 comments

Outlet Video – Watch videos, 100% free, always

https://outletvideo.pages.dev/
1•telui•23m ago•0 comments

Ask HN: Is there a Vim/Emacs for video editing?

1•pedrodelfino•25m ago•0 comments

Data Processing Is Becoming a GPU Workload

https://www.anyscale.com/blog/data-processing-becoming-gpu-workload
1•robertnishihara•28m ago•0 comments

AI's Silent Leap: From Code to Cognition

https://ameyalambat.com/blog/ai-coding-fatigue
3•ameyalambat128•29m ago•0 comments

Migrating from Claude to DeepSeek without breaking everything

https://blog.firetiger.com/migrating-from-claude-to-deepseek-without-breaking-everything/
3•matsur•29m ago•0 comments