LLMs Corrupt Your Documents When You Delegate

45•rbanffy•5h ago

Comments

jonmoore•1h ago

I really liked the evaluation method here - testing fidelity by round-tripping through chains of invertible steps. It was striking how even frontier models accumulated errors on seemingly computer-friendly tasks.

It would be interesting to know if the stronger results on Python are not just an artefact of the Python-specific evaluation, if they carry over to other common general-purpose languages, and if they are driven by something specific in the training processes.

causal•1h ago

Yeah I've been saying this for a while: AI-washing any text will degrade it, compounding with each pass.

"Semantic ablation" is my favorite term for it: https://www.theregister.com/software/2026/02/16/semantic-abl...

polskibus•56m ago

By „with each pass” do you mean within the same session, or with new session (context window) each time?

sebastiennight•20m ago

In my experience, it happens with each edit of the document, whether or not you clear the context window.

You can somewhat mitigate this, at the same moment you ask for the new edit, by adding new info or specifying the lost meaning you want to add back. But other things will still get washed out.

Nuances will drift, sharp corners will be ablated. You're doing a Xerox copy of your latest Xerox copy, so even if you add your comments with a sharpie, anything that was there right before will be slightly blurrier in the next version.

adampunk•14m ago

Each edit, even with unrelated edits. I had a README referring to something as "the cathedral of s*t" (some HN commentators don't care for the swearing, which is systemically bad news but w/e) and the robot would lift that phrase out in drive-bys, repeatedly.

Occasionally it would report the action, sometimes it would not bother to report it. It never reached into the README on an unrelated doc edit, but if it was touching the README, that line was getting excised.

mohamedkoubaa•46m ago

I've been calling it meanwit reversion

cyanydeez•1h ago

I played around with a local LLM to try and build a wiki like DAG. It made a lot of stupid errors from vague generic things like interpreting based on file names to not following redirects and placing the redirect response in them.

I've also had them convert to markdown something like an excel formatted document. It worked pretty well as long as I was examining the output. But the longer it ran in context, the more likely it was to try in slip things in that seemed related but wasn't part of the break down.

The only way I've found to mitigate some of it is to make every file a small-purpose built doc. This way you can definitely use git to revert changes but also limit the damage every time they touch them to the small context.

Anyone who thinks they're a genius creating docs or updating them isnt actually reading the output.

sebastiennight•15m ago

> I've also had them convert to markdown something like an excel formatted document.

This look like a task where the LLM would be best used in writing a deterministic script or program that then does the conversion.

Trusting a LLM to make the change without tools is like telling the smartest person you know to just recite the converted document out loud from memory. At some point they'll get distracted, wrong, or unwittingly inject their own biases and ideas into it whenever the source data is counter-intuitive to them.

woeirua•46m ago

It's an interesting paper, but I'd like to see a lot more about the types of errors that the LLM makes. Are they happening in the forward pass or the inverse pass? My guess is the inverse pass.

adampunk•29m ago

LLMs will make mistakes on every turn. The mistakes will have little to no apparent connection to "difficulty" or what may or may not be prevalent in the training data. They will be mistakes at all levels of operation, from planning to code writing to reporting. Whether those mistakes matter and whether you catch them is mostly up to you.

I have yet to find a model that does not make mistakes each turn. I suspect that this kind of error is fundamentally incorrigible.

The most interesting thing about LLMs is that despite the above (and its non-determinism) they're still useful.

pyrolistical•22m ago

As a human I make typos all the time

adampunk•13m ago

I do too! I also make higher level design errors and get too enthusiastic about projects before code is written.

We are, in a sense, fallible machines who have designed a planet-wide computational fabric around that fact.

Internet Archive Switzerland

Google broke reCAPTCHA for de-googled Android users

A recent experience with ChatGPT 5.5 Pro

Using Claude Code: The unreasonable effectiveness of HTML

How LEDs are made (2014)

OpenAI’s WebRTC problem

LLMs Corrupt Your Documents When You Delegate

America's carpet capital: an empire and its toxic legacy

Mythical Man Month

Making Julia as Fast as C++ (2019)

David Attenborough's 100th Birthday

Killswitch: Per-function short-circuit mitigation primitive

Reviving the IBM Selectric Composer Fonts (2023)

What causes lightning? The answer keeps getting more interesting

Wi is Fi: Understanding Wi-Fi 4/5/6/6E/7/8 (802.11 n/AC/ax/be/bn)

AI is breaking two vulnerability cultures

Cartoon Network Flash Games

AWS North Virginia data center outage – resolved

An Introduction to Meshtastic

The React2Shell Story

Removing fsync from our local storage engine

Teaching Claude Why

You gave me a u32. I gave you root. (io_uring ZCRX freelist LPE)

Forking the Web

Can LLMs model real-world systems in TLA+?

Serving a website on a Raspberry Pi Zero running in RAM

Light without electricity? Glowing algae could make it possible

US Government releases first batch of UAP documents and videos

Roadside Attraction

Read Programming as Theory Building

LLMs Corrupt Your Documents When You Delegate

Comments

Internet Archive Switzerland

Google broke reCAPTCHA for de-googled Android users

A recent experience with ChatGPT 5.5 Pro

Using Claude Code: The unreasonable effectiveness of HTML

How LEDs are made (2014)

OpenAI’s WebRTC problem

LLMs Corrupt Your Documents When You Delegate

America's carpet capital: an empire and its toxic legacy

Mythical Man Month

Making Julia as Fast as C++ (2019)

David Attenborough's 100th Birthday

Killswitch: Per-function short-circuit mitigation primitive

Reviving the IBM Selectric Composer Fonts (2023)

What causes lightning? The answer keeps getting more interesting

Wi is Fi: Understanding Wi-Fi 4/5/6/6E/7/8 (802.11 n/AC/ax/be/bn)

AI is breaking two vulnerability cultures

Cartoon Network Flash Games

AWS North Virginia data center outage – resolved

An Introduction to Meshtastic

The React2Shell Story

Removing fsync from our local storage engine

Teaching Claude Why

You gave me a u32. I gave you root. (io_uring ZCRX freelist LPE)

Forking the Web

Can LLMs model real-world systems in TLA+?

Serving a website on a Raspberry Pi Zero running in RAM

Light without electricity? Glowing algae could make it possible

US Government releases first batch of UAP documents and videos

Roadside Attraction

Read Programming as Theory Building