Basically the solution lets you experiment freely with your agent within safe boundaries.
It's deterministic on purpose (it doesn't include any AI layer), which means the solution follows clear, predefined rules to maximize safety, security, and predictability.
The rules are heavily tested against prompt injection attempts and other security cases (explained in detail in the docs).
Everything is local and lives on your computer, including the docs site.
It gives you a control panel to monitor and control the boundaries. When a boundary is about to be crossed, you receive an approval request that shows you what your OpenClaw agent was trying to do.
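To give a rough idea of what "deterministic rules plus approval requests" means, here is a minimal sketch in Python. It is not Agent Ruler's actual code: the `Action` type, the `evaluate` function, the action names, and the workspace path are all assumptions made up for illustration. The point is only that the decision is a pure function of the requested action, so the same request always gets the same answer.

```python
from dataclasses import dataclass
from enum import Enum
from pathlib import PurePosixPath


class Decision(Enum):
    ALLOW = "allow"
    NEEDS_APPROVAL = "needs_approval"


@dataclass(frozen=True)
class Action:
    kind: str    # hypothetical action name, e.g. "write_file"
    target: str  # absolute path the agent wants to touch


# Hypothetical boundary: the agent may only touch files inside this directory.
WORKSPACE = PurePosixPath("/home/user/agent-workspace")


def evaluate(action: Action) -> Decision:
    """Decide from the action alone; no model or randomness is involved."""
    path = PurePosixPath(action.target)
    # Pure paths do not collapse "..", so treat any traversal as a boundary crossing.
    if ".." in path.parts:
        return Decision.NEEDS_APPROVAL
    # is_relative_to compares whole path components (Python 3.9+).
    if path.is_relative_to(WORKSPACE):
        return Decision.ALLOW
    return Decision.NEEDS_APPROVAL
```

In this sketch, anything that returns `NEEDS_APPROVAL` would be held and shown to you for an approve/deny decision instead of being executed.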
It also (currently) supports Tailscale: connect your Tailscale IP address and you can receive everything on your phone, chat normally, and approve or deny requests. It also lets you access the control panel via your Tailscale IP address (a private one is recommended) from anywhere. Currently, only the Telegram channel is supported.
For now it only supports Linux and the OpenCode, Claude Code, and OpenClaw runners.
Everything you need to get started is explained in the README, which also includes quick demo/showcase images so you can see how it looks.
I'd be happy to hear your feedback, especially from testing it against prompt injections to see how it handles them. Don't hesitate to open a ticket on GitHub for any issue you find; I'll do my best to fix it.
Link here: https://github.com/steadeepanda/agent-ruler/
Thank you for reading. I'll be happy to discuss it.
jaylew1997•38m ago