frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Google can train search AI with web content even with opt-out

https://www.bloomberg.com/news/articles/2025-05-03/google-can-train-search-ai-with-web-content-even-after-opt-out
31•gotmedium•9mo ago

Comments

linusg789•9mo ago
https://www.msn.com/en-us/money/other/google-can-train-searc...
riedel•9mo ago
https://archive.is/1l8SS
caseyy•9mo ago
I wonder if society (and by extension, our laws) will ever again make a meaningful effort to penalize liars, manipulators, and thieves. I worry the answer is no.
kordlessagain•9mo ago
Assholes will rationalize any way they can, and a lot of the population is "set up" to hear these excuses and evaluate them. So, for a small percentage of assholes, they will have such good excuses nobody holds them accountable.

Funny how calling out well-dressed manipulation bothers some people more than the manipulation itself. Almost like some folks need the illusion to stay intact.

eftychis•9mo ago
You hit the nail in the head with your last sentence. It is a psychological defense mechanism.

People don't want to be associated with fraud and would do any mind tricks to explain things away, while knowing the illusion is there.

SilasX•9mo ago
Yes, that's an important thing to worry about. I'm just not sure that "learning from a website's content how to create other intellectual works without explicit permission from the owner to do so" counts as lying, manipulating, or stealing.
caseyy•9mo ago
Please don't straw-man. The first two paragraphs of the article explain what is happening. There is explicit refusal.
SilasX•9mo ago
Disagreeing with me doesn't mean my criticism is attacking a strawman. That's not what the term means. The websites are, in fact, permitting you to view them, while insisting you not learn anything from the content.

That's not fundamentally different from when employers "explicitly refuse" you learning from your job with them to use at the next one. Sure, they certainly want that, but the law doesn't recognize it as a valid constraint (except for e.g. trade secrets and proprietary knowledge).

caseyy•9mo ago
My argument was that explicitly agreeing not to collect someone's data for AI training, then collecting data for AI training, is lying. You argued that collecting data without explicit agreement is, actually, not lying. Arguing with an easy claim no one made is the definition of a straw-man response.

Look, just have courtesy for others and don't argue in bad faith, the snark included. This community came up with the HN guidelines, let's try to follow them more. That's all I wanted to say. All the best.

kordlessagain•9mo ago
And, just because things are moving so fast, agentic frameworks crawl in real time while helping the user. It's not just about training models, which everyone gets stuck on talking about. I think the agentic framework crawls will probably get worse by a lot.
hulitu•9mo ago
> Google Can Train Search AI with Web Content Even with Opt-Out

Opt out for Google, Facebook and Microsoft is Opt in.

Portuguese icon (FROM A CAN) makes a simple meal (Canned Fish Files) [video]

https://www.youtube.com/watch?v=e9FUdOfp8ME
1•zeristor•47s ago•0 comments

Brookhaven Lab's RHIC Concludes 25-Year Run with Final Collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
1•gnufx•2m ago•0 comments

Transcribe your aunts post cards with Gemini 3 Pro

https://leserli.ch/ocr/
1•nielstron•6m ago•0 comments

.72% Variance Lance

1•mav5431•7m ago•0 comments

ReKindle – web-based operating system designed specifically for E-ink devices

https://rekindle.ink
1•JSLegendDev•9m ago•0 comments

Encrypt It

https://encryptitalready.org/
1•u1hcw9nx•9m ago•1 comments

NextMatch – 5-minute video speed dating to reduce ghosting

https://nextmatchdating.netlify.app/
1•Halinani8•10m ago•1 comments

Personalizing esketamine treatment in TRD and TRBD

https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1736114
1•PaulHoule•11m ago•0 comments

SpaceKit.xyz – a browser‑native VM for decentralized compute

https://spacekit.xyz
1•astorrivera•12m ago•1 comments

NotebookLM: The AI that only learns from you

https://byandrev.dev/en/blog/what-is-notebooklm
1•byandrev•12m ago•1 comments

Show HN: An open-source starter kit for developing with Postgres and ClickHouse

https://github.com/ClickHouse/postgres-clickhouse-stack
1•saisrirampur•13m ago•0 comments

Game Boy Advance d-pad capacitor measurements

https://gekkio.fi/blog/2026/game-boy-advance-d-pad-capacitor-measurements/
1•todsacerdoti•13m ago•0 comments

South Korean crypto firm accidentally sends $44B in bitcoins to users

https://www.reuters.com/world/asia-pacific/crypto-firm-accidentally-sends-44-billion-bitcoins-use...
2•layer8•14m ago•0 comments

Apache Poison Fountain

https://gist.github.com/jwakely/a511a5cab5eb36d088ecd1659fcee1d5
1•atomic128•16m ago•2 comments

Web.whatsapp.com appears to be having issues syncing and sending messages

http://web.whatsapp.com
1•sabujp•16m ago•2 comments

Google in Your Terminal

https://gogcli.sh/
1•johlo•18m ago•0 comments

Shannon: Claude Code for Pen Testing: #1 on Github today

https://github.com/KeygraphHQ/shannon
1•hendler•18m ago•0 comments

Anthropic: Latest Claude model finds more than 500 vulnerabilities

https://www.scworld.com/news/anthropic-latest-claude-model-finds-more-than-500-vulnerabilities
2•Bender•22m ago•0 comments

Brooklyn cemetery plans human composting option, stirring interest and debate

https://www.cbsnews.com/newyork/news/brooklyn-green-wood-cemetery-human-composting/
1•geox•23m ago•0 comments

Why the 'Strivers' Are Right

https://greyenlightenment.com/2026/02/03/the-strivers-were-right-all-along/
1•paulpauper•24m ago•0 comments

Brain Dumps as a Literary Form

https://davegriffith.substack.com/p/brain-dumps-as-a-literary-form
1•gmays•24m ago•0 comments

Agentic Coding and the Problem of Oracles

https://epkconsulting.substack.com/p/agentic-coding-and-the-problem-of
1•qingsworkshop•25m ago•0 comments

Malicious packages for dYdX cryptocurrency exchange empties user wallets

https://arstechnica.com/security/2026/02/malicious-packages-for-dydx-cryptocurrency-exchange-empt...
1•Bender•25m ago•0 comments

Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"

https://github.com/pheonix-delta/axiom-voice-agent
1•shubham-coder•26m ago•0 comments

Penisgate erupts at Olympics; scandal exposes risks of bulking your bulge

https://arstechnica.com/health/2026/02/penisgate-erupts-at-olympics-scandal-exposes-risks-of-bulk...
4•Bender•26m ago•0 comments

Arcan Explained: A browser for different webs

https://arcan-fe.com/2026/01/26/arcan-explained-a-browser-for-different-webs/
1•fanf2•28m ago•0 comments

What did we learn from the AI Village in 2025?

https://theaidigest.org/village/blog/what-we-learned-2025
1•mrkO99•28m ago•0 comments

An open replacement for the IBM 3174 Establishment Controller

https://github.com/lowobservable/oec
1•bri3d•31m ago•0 comments

The P in PGP isn't for pain: encrypting emails in the browser

https://ckardaris.github.io/blog/2026/02/07/encrypted-email.html
2•ckardaris•33m ago•0 comments

Show HN: Mirror Parliament where users vote on top of politicians and draft laws

https://github.com/fokdelafons/lustra
1•fokdelafons•33m ago•1 comments