frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ai.com bought by Crypto.com founder for $70M in biggest-ever website name deal

https://www.ft.com/content/83488628-8dfd-4060-a7b0-71b1bb012785
1•1vuio0pswjnm7•55s ago•0 comments

Big Tech's AI Push Is Costing More Than the Moon Landing

https://www.wsj.com/tech/ai/ai-spending-tech-companies-compared-02b90046
1•1vuio0pswjnm7•2m ago•0 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
1•1vuio0pswjnm7•4m ago•0 comments

Suno, AI Music, and the Bad Future [video]

https://www.youtube.com/watch?v=U8dcFhF0Dlk
1•askl•6m ago•0 comments

Ask HN: How are researchers using AlphaFold in 2026?

1•jocho12•9m ago•0 comments

Running the "Reflections on Trusting Trust" Compiler

https://spawn-queue.acm.org/doi/10.1145/3786614
1•devooops•14m ago•0 comments

Watermark API – $0.01/image, 10x cheaper than Cloudinary

https://api-production-caa8.up.railway.app/docs
1•lembergs•15m ago•1 comments

Now send your marketing campaigns directly from ChatGPT

https://www.mail-o-mail.com/
1•avallark•19m ago•1 comments

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•31m ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
5•o8vm•33m ago•0 comments

Haniri: A live autonomous world where AI agents survive or collapse

https://www.haniri.com
1•donangrey•33m ago•1 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•tosh•46m ago•0 comments

Atlas: Manage your database schema as code

https://github.com/ariga/atlas
1•quectophoton•49m ago•0 comments

Geist Pixel

https://vercel.com/blog/introducing-geist-pixel
2•helloplanets•52m ago•0 comments

Show HN: MCP to get latest dependency package and tool versions

https://github.com/MShekow/package-version-check-mcp
1•mshekow•1h ago•0 comments

The better you get at something, the harder it becomes to do

https://seekingtrust.substack.com/p/improving-at-writing-made-me-almost
2•FinnLobsien•1h ago•0 comments

Show HN: WP Float – Archive WordPress blogs to free static hosting

https://wpfloat.netlify.app/
1•zizoulegrande•1h ago•0 comments

Show HN: I Hacked My Family's Meal Planning with an App

https://mealjar.app
1•melvinzammit•1h ago•0 comments

Sony BMG copy protection rootkit scandal

https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootkit_scandal
2•basilikum•1h ago•0 comments

The Future of Systems

https://novlabs.ai/mission/
2•tekbog•1h ago•1 comments

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•1h ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
3•throwaw12•1h ago•1 comments

Show HN: MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•1h ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•1h ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•1h ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•1h ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
2•andreabat•1h ago•1 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
2•mgh2•1h ago•1 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•1h ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•1h ago•1 comments
Open in hackernews

Ask HN: Is it likely AI training models could start training on personal files?

3•sjw987•4mo ago
I've been sorting through my content on Google recently. Backing up and moving off of Gmail and Google Drive was relatively simple, but Google Photos is a bit more daunting. The Google Takeout process has delivered me almost 500 2GB zip folders, with scrambled metadata in supplemental data files, which is going to take a while to sort through. It's my own fault for sticking with one platform for so long, and I got hooked during the "unlimited storage" days of early Google Pixel phones.

The reason I've begun downloading and removing stored files is because I'm (maybe justifiably or not) concerned about the prospect of my personal photos being used to train AI models. The chance that some diffusion model might end up recreating a heavily biased image of my wife, family, friends, or myself, or referencing any of my files or documents and what that all may be used for (commercially or otherwise) concerns me.

Google is the only place I've ever put my personal photos. I've never bothered with anything public facing and trusted that a private cloud storage service would always stay private. So in my case, Google would be the sole place to leave to ensure data sovereignty.

Does anybody believe Google (and other companies) might soon start scanning personal files we hold on their storage facilities? Is that a legal possibility for them?

It seems to me that it's a huge pool of fresh training data that they would inevitably want to get their hands on. And given how much they have already trained on, it seems the next logical step from a business standpoint.

Clearly they would need to change their privacy policies and terms of agreements and inform users of these changes. Is it possible they could slip this sort of change in without much notice?

I was also wondering if anybody might have pointers for the best strategy to securely backup offline. I don't want to just shift my family photos from one company to another where business execs are training their own model. Anybody else handled this recently?

Comments

incomingpain•4mo ago
>I've been sorting through my content on Google recently.

There's allegations that gemini is already trained on this data.

>Does anybody believe Google (and other companies) might soon start scanning personal files we hold on their storage facilities? Is that a legal possibility for them?

Free accounts already have agreed to be used.

>It seems to me that it's a huge pool of fresh training data that they would inevitably want to get their hands on. And given how much they have already trained on, it seems the next logical step from a business standpoint.

Im actually not so sure they have or ever will do. The problem isnt quantity, it's quality. Sure it could train on a bunch of trash in people's but then when inferring, it'll produce trash.

>Clearly they would need to change their privacy policies and terms of agreements and inform users of these changes. Is it possible they could slip this sort of change in without much notice?

you've been agreeing to them being able to read the content of the files for antivirus and antispam reasons for a very long time. To start doing it for AI requires no change.

>I was also wondering if anybody might have pointers for the best strategy to securely backup offline. I don't want to just shift my family photos from one company to another where business execs are training their own model. Anybody else handled this recently?

One of the useful apps I found was 'foldersync' which makes backup to cifs shares possible.