frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

SynthID – A tool to watermark and identify content generated through AI

https://deepmind.google/science/synthid/
34•jonbaer•4h ago

Comments

egeozcan•2h ago
I guess this is the start of a new arms race on making generated content pass these checks undetected and detecting them anyway.
dragonwriter•1h ago
Its not really an arms race; any gen AI system that doesn't explicitly incorporate a watermarking tool like this won't be detectable by tools that read the watermarks.

There is a kind of arms race that has existed for a while for non-watermarked content, except that the detection tools are pretty much Magic 8-ball level of reliability, so there's not a lot of effort on the counter-detection side.

peterkelly•2h ago
Create the problem, sell the solution.
9dev•1h ago
You can never be sure something has been generated by a model embedding one of these anyway, so it’s pretty moot.
montag•1h ago
"The watermarks are embedded across Google’s generative AI consumer products, and are imperceptible to humans."

I'd love to see the data behind this claim, especially on the audio side.

donperignon•1h ago
Nah that’s a solved problem if you work on the frequency domain. Same for image. Text is the hard rock here.
donperignon•1h ago
I am not sure that text watermarking will be accurate, I foresee plenty of false positives.
pelasaco•1h ago
looks like the same as anti-virus companies in the 80s? Write virus, Write anti-virus and profit!
teiferer•1h ago
Could anybody explain how this isn't easily circumvented by using a competitor's model?

Also, if everything in the future has some touch of AI inside, for example cameras using AI to slightly improve the perceived picture quality, then "made with AI" won't be a categorization that anybody lifts an eyebrow about.

dragonwriter•1h ago
> Could anybody explain how this isn't easily circumvented by using a competitor's model?

Almost all the big hosted AI providers are publicly working on watermarking for at least media (text is more of a mixed bag); ultimately, its probably a regulatory play—the big providers expect that the combination of legitimate concerns and their own active fearmongering, combined with them demonstrating watermarking, will result in mandates for commercial AI generation services to include watermarking. This may even be part of the regulatory play to restrict availability and non-research use of open models.

verisimi•47m ago
If you see the mark, you'd know at least that you aren't dealing with a purely mechanic rendering of whatever-it-is.
progval•26m ago
By lobbying regulators to force your competitors to add watermarks too.
doawoo•1h ago
the beginning of walled garden “AI” tools has been interesting to follow
chii•58m ago
i find the premise to be an invalid one personally - why is the property that a works from an AI model must be identified/identifiable?
HighGoldstein•36m ago
Video evidence of you committing a crime, for example, should be identifiable as AI-generated.
chii•32m ago
how do we currently deal with tampered video evidence today, before the advent of ai generated videos? Why cant same methods be used for an ai generated video?
Oras•39m ago
OpenAI has been doing something similar for generated images using C2PA [0]

It is easy to alter by just saving to a different format or basic cropping.

I would love to see how SynthID is fixing this issue.

https://help.openai.com/en/articles/8912793-c2pa-in-chatgpt-...

JimDabell•36m ago
> Large language models generate text one word (token) at a time. Each word is assigned a probability score, based on how likely it is to be generated next. So for a sentence like “My favourite tropical fruits are mango and…”, the word “bananas” would have a higher probability score than the word “airplanes”.

> SynthID adjusts these probability scores to generate a watermark. It's not noticeable to the human eye, and doesn’t affect the quality of the output.

I think they need to be clearer about the constraints involved here. If I ask What is the capital of France? Just the answer, no extra information.” then there’s no room to vary the probability without harming the quality of the output. So clearly there is a lower bound beyond which this becomes ineffective. And presumably the longer the text, the more resilient it is to alterations. So what are the constraints?

I also think that this is self-interest dressed up as altruism. There’s always going to be generative AI that doesn’t include watermarks, so a watermarking scheme cannot tell you if something is genuine. It is, however, useful for determining that something came from a specific provider, which could be valuable to Google in all sorts of ways.

HighGoldstein•34m ago
I wonder if, conversely, authentic media can be falsely watermarked as AI-generated.
notpushkin•17m ago
For photos, I think the answer is yes. For texts, the wording will be changed when you watermark them, so I guess that’s a no.
R_Spaghetti•12m ago
It only works across Google shit.

I Don't Believe in MCPs

https://old.reddit.com/r/AI_Agents/comments/1n3w188/the_tool_bloat_problem_why_i_dont_believe_in_...
2•hgaddipa001•3m ago•1 comments

A16-FuseBypass: Debug Logic Enabled on Production Apple Silicon

https://github.com/JGoyd/A16-FuseBypass
2•Bogdanp•6m ago•0 comments

I built 59 open-source Claude Code subagents to supercharge software development

https://github.com/vizra-ai/claude-code-agents
1•aaronlumsden•6m ago•1 comments

A practical guide to debugging GitHub Actions

https://depot.dev/blog/guide-to-debugging-github-actions
1•kylegalbraith•8m ago•0 comments

Inapparent virus infections differentially affect honey bee flight

https://www.science.org/doi/10.1126/sciadv.adw8382
1•PaulHoule•10m ago•0 comments

An LLM-Proof Approach to Reinventing Captcha Systems

https://old.reddit.com/r/LocalLLaMA/comments/1gkeo6u/an_llmproof_approach_to_reinventing_captcha/
2•debdut•10m ago•1 comments

Gall's Law

https://blog.prototypr.io/galls-law-93c8ef8b651e
2•matthewsinclair•13m ago•0 comments

Efficient Deep Learning Book

https://efficientdlbook.com/
2•Maro•17m ago•0 comments

The Dirty Secret of Coding Agents

https://medium.com/@vsh1818/the-dirty-secret-of-coding-agents-5777332e3052
1•vladsh•19m ago•1 comments

What made the Amiga "Genlock-able"?

https://retrocomputing.stackexchange.com/questions/22320/what-made-the-amiga-genlock-able
1•doener•21m ago•0 comments

Situational Awareness: The Decade Ahead (2024)

https://situational-awareness.ai/
1•mpweiher•29m ago•0 comments

USBODE: Optical Drive Emulator for DOS and Newer PCs

https://www.retrorgb.com/usbode-an-ode-for-dos-and-newer-pcs.html
1•transpute•31m ago•0 comments

Historical Housing Prices Project

https://www.philadelphiafed.org/surveys-and-data/regional-economic-analysis/historical-housing-pr...
2•luu•32m ago•0 comments

KeyBee Android Keyboard

https://www.keybeekeyboard.com/
1•kqr•34m ago•0 comments

Looking for Affordable Alternatives to USB ISO Emulators Like iODD

https://yomotherboard.com/question/looking-for-affordable-alternatives-to-usb-iso-emulators-like-...
1•transpute•37m ago•0 comments

Rust ints to Rust enums with less instructions

https://sailor.li/ints-to-enums
3•Bogdanp•42m ago•0 comments

Some clarifications and thoughts around "ChatGPT psychosis"

https://drtompollak.substack.com/p/some-clarifications-and-thoughts
4•FromTheArchives•43m ago•1 comments

Milan's expat 'explosion' brings new buzz to Italy's financial centre

https://www.ft.com/content/f33a01dc-f873-4c62-886f-f69562fb2e46
7•simonebrunozzi•46m ago•1 comments

Show HN: Keeptalking

https://github.com/vadim0x60/keeptalking
3•vadimdotme•49m ago•0 comments

I got people to pay me $50K in 3 days with NFTs (2021)

https://paulstamatiou.com/how-i-made-50k-in-3-days-with-nfts
6•slacktivism123•49m ago•1 comments

U3 (Software)

https://en.wikipedia.org/wiki/U3_(software)
1•transpute•50m ago•0 comments

Weird but True: I^I Is a Real Number

https://medium.com/science-spectrum/weird-but-true-i-i-is-a-real-number-588d443043d2
2•pykello•52m ago•0 comments

USGS Streamgage Import

https://waysidemapping.org/projects/usgs-import/
1•robin_reala•58m ago•0 comments

Culture Has No Name for This Cursed Vibe. It's Everywhere

https://news.artnet.com/art-world/marshmallow-horror-2509289
5•Michelangelo11•1h ago•0 comments

10-20x Faster LLVM -O0 Back-End – Code Generation

https://discourse.llvm.org/t/tpde-llvm-10-20x-faster-llvm-o0-back-end/86664
1•mpweiher•1h ago•0 comments

Show HN: Design inspirations on an infinite wall of UI Components

https://shuffle.dev/inspirations
2•kemyd•1h ago•2 comments

An Interview with Julio Barba

https://halide.cx/blog/julio-barba-interview/
2•Bogdanp•1h ago•0 comments

A Deep Dive into the Wonderful World of SVG Displacement Filtering (2021)

https://www.smashingmagazine.com/2021/09/deep-dive-wonderful-world-svg-displacement-filtering/
1•bawolff•1h ago•0 comments

HumanLayer

https://github.com/humanlayer/humanlayer
1•pykello•1h ago•0 comments

The First Karaoke Machine

https://spectrum.ieee.org/karaoke-machine-ieee-milestone
3•jnord•1h ago•0 comments