New ChatGPT Models Seem to Leave Watermarks on Text

https://www.rumidocs.com/newsroom/new-chatgpt-models-seem-to-leave-watermarks-on-text

28•croes•9mo ago

Comments

selcuka•9mo ago

Or you can simply do it using this follow-up prompt. No external tools are needed. Worked for me:

    Remove all non-visible whitespace characters such as 0x202F from the text.

Alternatively, if you have access to the original prompt, just append this:

    Your response should not contain any non-visible whitespace characters such as 0x202F or 0x0A (newlines are allowed).

greyface-•9mo ago

I remember when $EMPLOYER was caught sending all-employee emails with individualized unicode homoglyph watermarks, to try to identify leaks.

madars•9mo ago

For those unaware of the reference, a famous example was Tesla. HN discussion: https://news.ycombinator.com/item?id=33621562 (Tesla has used space characters in internal emails to identify leaks), see also a general discussion of related techniques: https://en.wikipedia.org/wiki/Canary_trap

gilgoomesh•9mo ago

These don't appear to be intended as watermarks. They're merely a valid use of non-breaking space for tightly coupled elements like "2.5 billion" and "Title I".

Sure, a human author would almost never do that, but they could. I could imagine a Markdown syntax that did that – it could be done similar to how `code` is marked up in most blogs.

neilv•9mo ago

All the examples of non-breaking spaces that they showed were arguably places where someone nicely typesetting might well do the same thing. For example, in "FY 2025", or "$8.7 billion". (I've even done this a lot myself in the past.)

I wouldn't call this a watermark, but more a sign of likely copy&paste, if students' word processors weren't currently doing that.

A "watermark" that invisibly identifies the text origin using Unicode tricks sounds possible.

And maybe you could do some things with statistical patterns.

Or you could, as some have done in the past, is to stego the identifying information in a way that's hard to spot but can't be denied later (e.g., the first letter of each word clearly spells out "john smith is a cheater who copied this from chatgpt").

photonthug•9mo ago

> And maybe you could do some things with statistical patterns.

Fascinating, and now that you mention it, this does seem kinda inevitable. Naturally the same people that think IP/copyright for everyone else is fake, irrelevant, or old fashioned will be desperate to be able to conclusively prove to investors and shareholders that someone else's work is built on theirs via model distillation, and suddenly IP is important again.

What are the known cases or examples of stego? This sounds interesting if it's at the level of model training. Anyway I guess you can get pretty far with stuff like this just with simple system prompts, encouraging shibboleths along the lines of "Always phrase your response so that it has exactly 14 copies of the letter J".

jeisc•9mo ago

Software engineers should know that source code can be encoded in a string of white spaces and then ran through a compiler function to produce undetectable functionality

throwaway290•9mo ago

Which is also done https://www.pillar.security/blog/new-vulnerability-in-github...

andyfeliciotti•9mo ago

For anyone looking to strip white space from text I added a new option to do so on my tool. https://invisiblecharacterviewer.com/

code-less•9mo ago

Yes you can remove it through: https://gptwatermark.com

I went back to Linux and it was a mistake

Octrafic – open-source AI-assisted API testing from the CLI

US Accuses China of Secret Nuclear Testing

Peacock. A New Programming Language

A postcard arrived: 'If you're reading this I'm dead, and I really liked you'

What to know about the software selloff

Show HN: Syntux – generative UI for websites, not agents

Microsoft appointed a quality czar. He has no direct reports and no budget

AI overlay that reads anything on your screen (invisible to screen capture)

Show HN: Seafloor, be up and running with OpenClaw in 20 seconds

Tesla turbine-inspired structure generates electricity using compressed air

State Department deleting 17 years of tweets (2009-2025); preservation needed

Learning to code, or building side projects with AI help, this one's for you

Effulgence RPG Engine [video]

Five disciplines discovered the same math independently – none of them knew

We Scanned an AI Assistant for Security Issues: 12,465 Vulnerabilities

Amazon no longer defend cloud customers against video patent infringement claims

Show HN: Medinilla – an OCPP compliant .NET back end (partially done)

How Does AI Distribute the Pie? Large Language Models and the Ultimatum Game

Resistance Infrastructure

Fire-juggling unicyclist caught performing on crossing

Restoring a lost 1981 Unix roguelike (protoHack) and preserving Hack 1.0.3

GPS and Time Dilation – Special and General Relativity

Show HN: Witnessd – Prove human authorship via hardware-bound jitter seals

Show HN: I built a clawdbot that texts like your crush

Scientists reverse Alzheimer's in mice and restore memory (2025)

Compiling Prolog to Forth [pdf]

Show HN: Cymatica – an experimental, meditative audiovisual app

GitBlack: Tracing America's Foundation

Horizon-LM: A RAM-Centric Architecture for LLM Training