We look at the recent incident in which Meta's AI Alignment director was unable to stop her OpenClaw agent from deleting her emails. The root cause is reliance on in-band signalling; we show how revocable agent identities can be used to implement an out-of-band kill switch for agents.
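
To make the idea concrete, here is a minimal sketch of an out-of-band kill switch, not the implementation from the post. It assumes the resource (a mailbox here) checks the agent's identity against a registry on every call, so revoking the identity stops the agent without sending it any message. All names (IdentityRegistry, Mailbox, RevokedError, run_demo, "agent-1") are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass, field


class RevokedError(PermissionError):
    """Raised when a revoked agent identity attempts an action."""


@dataclass
class IdentityRegistry:
    """Hypothetical control plane that tracks revoked agent identities."""
    _revoked: set = field(default_factory=set)

    def revoke(self, agent_id: str) -> None:
        # The operator calls this directly; the agent is never asked to stop.
        self._revoked.add(agent_id)

    def is_active(self, agent_id: str) -> bool:
        return agent_id not in self._revoked


@dataclass
class Mailbox:
    """Hypothetical resource server that checks identity on every call."""
    registry: IdentityRegistry
    emails: list

    def delete(self, agent_id: str, index: int) -> str:
        # Enforcement lives here, outside the agent's prompt or context.
        if not self.registry.is_active(agent_id):
            raise RevokedError(f"identity {agent_id!r} has been revoked")
        return self.emails.pop(index)


def run_demo() -> tuple:
    registry = IdentityRegistry()
    mailbox = Mailbox(registry, ["invoice", "newsletter", "contract"])

    # While the identity is active, the agent's delete succeeds.
    deleted = [mailbox.delete("agent-1", 0)]

    # Out-of-band kill switch: revoke the identity at the registry.
    registry.revoke("agent-1")

    # The next delete is refused by the mailbox, whatever the agent intends.
    try:
        mailbox.delete("agent-1", 0)
        blocked = False
    except RevokedError:
        blocked = True
    return deleted, mailbox.emails, blocked
```

The design point is that the check runs in the resource server rather than in the agent's instruction stream, so it does not depend on the agent reading or obeying a stop message. A real deployment would presumably use signed, short-lived credentials and a revocation or introspection endpoint instead of an in-memory set.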
jalbrethsen•1h ago