Google can train search AI with web content even with opt-out

https://www.bloomberg.com/news/articles/2025-05-03/google-can-train-search-ai-with-web-content-even-after-opt-out

31•gotmedium•9mo ago

Comments

linusg789•9mo ago

https://www.msn.com/en-us/money/other/google-can-train-searc...

riedel•9mo ago

caseyy•9mo ago

I wonder if society (and by extension, our laws) will ever again make a meaningful effort to penalize liars, manipulators, and thieves. I worry the answer is no.

kordlessagain•9mo ago

Assholes will rationalize any way they can, and a lot of the population is "set up" to hear these excuses and evaluate them. So, for a small percentage of assholes, they will have such good excuses nobody holds them accountable.

Funny how calling out well-dressed manipulation bothers some people more than the manipulation itself. Almost like some folks need the illusion to stay intact.

eftychis•9mo ago

You hit the nail in the head with your last sentence. It is a psychological defense mechanism.

People don't want to be associated with fraud and would do any mind tricks to explain things away, while knowing the illusion is there.

SilasX•9mo ago

Yes, that's an important thing to worry about. I'm just not sure that "learning from a website's content how to create other intellectual works without explicit permission from the owner to do so" counts as lying, manipulating, or stealing.

caseyy•9mo ago

Please don't straw-man. The first two paragraphs of the article explain what is happening. There is explicit refusal.

SilasX•9mo ago

Disagreeing with me doesn't mean my criticism is attacking a strawman. That's not what the term means. The websites are, in fact, permitting you to view them, while insisting you not learn anything from the content.

That's not fundamentally different from when employers "explicitly refuse" you learning from your job with them to use at the next one. Sure, they certainly want that, but the law doesn't recognize it as a valid constraint (except for e.g. trade secrets and proprietary knowledge).

caseyy•9mo ago

My argument was that explicitly agreeing not to collect someone's data for AI training, then collecting data for AI training, is lying. You argued that collecting data without explicit agreement is, actually, not lying. Arguing with an easy claim no one made is the definition of a straw-man response.

Look, just have courtesy for others and don't argue in bad faith, the snark included. This community came up with the HN guidelines, let's try to follow them more. That's all I wanted to say. All the best.

kordlessagain•9mo ago

And, just because things are moving so fast, agentic frameworks crawl in real time while helping the user. It's not just about training models, which everyone gets stuck on talking about. I think the agentic framework crawls will probably get worse by a lot.

hulitu•9mo ago

> Google Can Train Search AI with Web Content Even with Opt-Out

Opt out for Google, Facebook and Microsoft is Opt in.

Circumstantial Complexity, LLMs and Large Scale Architecture

Tech Bro Saga: big tech critique essay series

Show HN: A calculus course with an AI tutor watching the lectures with you

Show HN: 83K lines of C++ – cryptocurrency written from scratch, not a fork

Show HN: SAA – A minimal shell-as-chat agent using only Bash

Mario Tchou

Does Anyone Even Know What's Happening in Zim?

The last Morse code maritime radio station in North America [video]

Show HN: Hacker Newspaper – Yet another HN front end optimized for mobile

OpenClaw Is Changing My Life

Everything you need to know about lasers in one photo

SCOTUS to decide if 1988 video tape privacy law applies to internet uses

Epstein files reveal deeper ties to scientists than previously known

Red teamers arrested conducting a penetration test

Show HN: Open-source AI powered Kubernetes IDE

Show HN: Lucid – Use LLM hallucination to generate verified software specs

AI Doesn't Write Every Framework Equally Well

Aisbf – an intelligent routing proxy for OpenAI compatible clients

Let's handle 1M requests per second

OpenClaw Partners with VirusTotal for Skill Security

Goal: Ship 1M Lines of Code Daily

Show HN: Codex-mem, 90% fewer tokens for Codex

FastLangML: FastLangML:Context‑aware lang detector for short conversational text

LineageOS 23.2

Crypto Deposit Frauds

Substack makes money from hosting Nazi newsletters

Framing an LLM as a safety researcher changes its language, not its judgement

Are there anyone interested about a creator economy startup

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

2003: What is Google's Ultimate Goal? [video]

Circumstantial Complexity, LLMs and Large Scale Architecture

Tech Bro Saga: big tech critique essay series

Show HN: A calculus course with an AI tutor watching the lectures with you

Show HN: 83K lines of C++ – cryptocurrency written from scratch, not a fork

Show HN: SAA – A minimal shell-as-chat agent using only Bash

Mario Tchou

Does Anyone Even Know What's Happening in Zim?

The last Morse code maritime radio station in North America [video]

Show HN: Hacker Newspaper – Yet another HN front end optimized for mobile

OpenClaw Is Changing My Life

Everything you need to know about lasers in one photo

SCOTUS to decide if 1988 video tape privacy law applies to internet uses

Epstein files reveal deeper ties to scientists than previously known

Red teamers arrested conducting a penetration test

Show HN: Open-source AI powered Kubernetes IDE

Show HN: Lucid – Use LLM hallucination to generate verified software specs

AI Doesn't Write Every Framework Equally Well

Aisbf – an intelligent routing proxy for OpenAI compatible clients

Let's handle 1M requests per second

OpenClaw Partners with VirusTotal for Skill Security

Goal: Ship 1M Lines of Code Daily

Show HN: Codex-mem, 90% fewer tokens for Codex

FastLangML: FastLangML:Context‑aware lang detector for short conversational text

LineageOS 23.2

Crypto Deposit Frauds

Substack makes money from hosting Nazi newsletters

Framing an LLM as a safety researcher changes its language, not its judgement

Are there anyone interested about a creator economy startup

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

2003: What is Google's Ultimate Goal? [video]

Google can train search AI with web content even with opt-out

Comments