I do not believe it when an admin ally says the government is acting in good faith or that there is no bitterness or history with DoD (not DoW, another indicator he's full of it).
The administration has made it official policy to fire apolitical competent federal workers. Don't tell me they have a good heart and just want things to be done safely.
Dario, buy a few million of Trumpcoin and you'll be fine.
This is what makes me feel Sacks is speaking the truth here, despite my generally not trusting him (due to MAGA sycophancy) or this administration in general. Given Anthropic and Dario in particular are so alarmist about safety, even a small jailbreak should cause them to pull back and fix it first, right? Didn’t they say Mythos is very dangerous in the wrong hands? How can you take any chances if that’s the case? It’s just not consistent to minimize things - and I feel they probably didn’t want to admit to the world that their own safeguards aren’t good enough either, because that would harm their business if they had to pull back their models and stick to their safety views. But if they didn’t pull these models, they would be admitting that safety is theater for regulatory capture and that it doesn’t really matter.
By minimizing the jailbreak, they’re trying to have it both ways. And that feels dishonest. But also any mature executive would know how this would look. Even if they were correct about it, Anthropic should know how all of this looks to the outside world. The fact that Dario doesn’t shows they shouldn’t be the ones in charge of this capability. Both OpenAI and Anthropic need brand new professional leadership.
As for Amazon - they aren’t going to raise alarms randomly. AWS is very trustworthy and well run. I believe their security researchers genuinely were worried about the impact of the general public effectively having Mythos access through jailbreaks on Fable. Plus they might be the largest owner of Anthropic equity outside of Anthropic itself.
Mythos being drummed up as a cyber weapon is not the same as it being a weapon -- and treating it as such is just dumb.
Yes, it may find vulnerabilities, but "find" is the keyword -- whatever it finds was already there.
On the other hand, Anthropic really needs to up its game here. I am sympathetic to their position, but their actions betray a lack of understanding of the seriousness of the situation. If we're to believe that only they can align a super intelligence, they should start by aligning the Trump administration into a productive relationship.
And then the US Chamber of Commerce protected itself from liability as well by taking export control law seriously, after Anthropic's CEO categorized Mythos without the cyber query filter as a potential cyber weapon.
The problem here is that there's no real technical way to protect against the queries, only ways to make them more expensive to create, as the models are getting smarter.
Also, since the main jailbreak "technique" is splitting the task into subtasks, and the main moat of Mythos is that it can solve more complex tasks, I wouldn't categorize this jailbreak as serious.
satvikpendem•13h ago