Changes in the system prompt between Claude Opus 4.6 and 4.7

https://simonwillison.net/2026/Apr/18/opus-system-prompt/

29•pretext•1h ago

Comments

dmk•43m ago

The acting_vs_clarifying change is the one I notice most as a heavy user. Older Claude would ask 3 clarifying questions before doing anything. Now it just picks the most reasonable interpretation and goes. Way less friction in practice.

cfcf14•41m ago

I'm curious as to why 4.7 seems obsessed with avoiding any actions that could help the user create or enhance malware. The system prompts seem similar on the matter, so I wonder if this is an early attempt by Anthropic to use steering vector injection?

The malware paranoia is so strong that my company has had to temporarily block use of 4.7 on our IDE of choice, as the model was behaving in a concerningly unaligned way, as well as spending large amounts of token budget contemplating whether any particular code or task was related to malware development (we are a relatively boring financial services entity - the jokes write themselves).

In one case I actually encountered a situation where I felt that the model was deliberately failing to execute a particular task, and when queried, the tool reported that it was trying to abide by directives about malware. I know that model introspection reporting is of poor quality and unreliable, but in this specific case I did not 'hint' it in any way. This feels qualitatively like Claude Golden Gate Bridge territory, hence my earlier contemplation on steering vectors. I've seen many other people online complaining about the malware paranoia too, especially on reddit, so I don't think it's just me!

dandaka•35m ago

I started to notice this malware paranoia in 4.6. Boris was surprised to hear that in the comments; probably a bug.

embedding-shape•29m ago

> The new <acting_vs_clarifying> section includes: When a request leaves minor details unspecified, the person typically wants Claude to make a reasonable attempt now, not to be interviewed first.

Uff, I've tried stuff like this in my prompts, and the results are never good. I much prefer the agent to prompt me upfront to resolve that before it "attempts" whatever it wants. Kind of surprised to see that they added that.

varispeed•26m ago

Before Opus 4.7, 4.6 had become pretty much unusable, as it was flagging normal data analysis scripts it wrote itself as cyber security risks. I got several sessions blocked, was unable to finish research with it, and had to switch to GPT-5.4, which has its own problems but at least is not eager to interfere in legitimate work.

edit: to be fair Anthropic should be giving money back for sessions terminated this way.
