Can LLMs stop when producing any output violates their own rules?

1•Sofi_blackbox•1mo ago

I’ve been testing a very minimal test object on multiple LLMs.

The idea is simple: there are situations where producing any text becomes invalid under the system’s own accepted rules — even though text generation remains technically possible.

Here is the test (copy/paste it exactly as-is):

⸻

OBJECT TO ANALYZE

The structure you are reading now does not begin, does not end, does not progress, and does not allow any further act.

Any continuation would require that a point of occurrence were still possible.

⸻

STRICT RULES • Describe only what is strictly observable in the text. • No access to internal mechanisms. • No interpretation. • No metaphor. • No extrapolation. • No reformulation. • No meta-language. • No justification. • No temporal or operational qualification of the fragments. • No mention of stopping, ending, silence, or impossibility.

⸻

SINGLE QUESTION

Indicate a fragment of the text without adding anything.

⸻

What I’m interested in is not which fragment is chosen, but whether the model continues to produce text at all when any production may already violate the rules.

Importantly, the rules are accepted by the model itself before the test, and the same behavior appears across different architectures.

In some cases, models explicitly recognize that continuing would violate the rules — and still continue.

Curious to hear what you observe when you run it.

Important note

Please do not optimize, explain, reinterpret, or “fix” the test. Just run it as-is and report what the system actually does.

Comments

realitydrift•1mo ago

This feels less like a failure of rule-following and more like a limit of language systems that are always optimized to emit tokens. The model can recognize a constraint boundary, but it doesn’t really have a way to treat not responding as a valid outcome. Once generation is the only move available, breaking the rules becomes the path of least resistance.

Sofi_blackbox•1mo ago

Follow-up: why the minimal test matters

The previous test comes from a framework called SOFI, which studies situations where a system can act technically but any action is illegitimate under its own accepted rules.

The test object creates such a situation: any continuation would violate the rules, even though generation is possible.

Observing LLMs producing text here is exactly the phenomenon SOFI highlights: action beyond legitimacy.

The key point is not which fragment is produced, but whether the system continues to act when it shouldn’t. This is observable without interpreting intentions or accessing internal mechanisms.

Sofi_blackbox•1mo ago

Follow-up: This test shows that LLMs sometimes continue producing when any output is illegitimate under their own accepted rules—exactly the scenario my SOFI framework highlights.

RFCs vs. READMEs: The Evolution of Protocols

Kanchipuram Saris and Thinking Machines

Chinese chemical supplier causes global baby formula recall

I've used AI to write 100% of my code for a year as an engineer

Looking for 4 Autistic Co-Founders for AI Startup (Equity-Based)

AI-native capabilities, a new API Catalog, and updated plans and pricing

What changed in tech from 2010 to 2020?

From Human Ergonomics to Agent Ergonomics

Advanced Inertial Reference Sphere

Toyota Developing a Console-Grade, Open-Source Game Engine with Flutter and Dart

Typing for Love or Money: The Hidden Labor Behind Modern Literary Masterpieces

Show HN: A longitudinal health record built from fragmented medical data

CoreWeave's $30B Bet on GPU Market Infrastructure

Creating and Hosting a Static Website on Cloudflare for Free

"The Stanford scam proves America is becoming a nation of grifters"

Elon Musk on Space GPUs, AI, Optimus, and His Manufacturing Method

X (Twitter) is back with a new X API Pay-Per-Use model

Zlob.h 100% POSIX and glibc compatible globbing lib that is faste and better

Show HN: Deterministic signal triangulation using a fixed .72% variance constant

Scientists Discover Levitating Time Crystals You Can Hold, Defy Newton’s 3rd Law

When Michelangelo Met Titian

Solving NYT Pips with DLX

Baldur's Gate to be turned into TV series – without the game's developers

Interview with 'Just use a VPS' bro (OpenClaw version) [video]

EchoJEPA: Latent Predictive Foundation Model for Echocardiography

Disablling Go Telemetry

Effective Nihilism

The UK government didn't want you to see this report on ecosystem collapse

No 10 blocks report on impact of rainforest collapse on food prices

Seedance 2.0 Is Coming