frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

US publishers tell Common Crawl to stop scraping and delete archive

https://pressgazette.co.uk/media_law/common-crawl-ai-news-publishers-scraping-cease-and-desist-letter/
12•thm•1h ago

Comments

toomuchtodo•1h ago
Crawling will go underground à la Anna’s Archive.
khelavastr•35m ago
This is shady. Copyrighters absolutely not get to control use of their copyrighted material when people mentally, sonically, or physically reproduce it for personal use.

It's absurd to say "you can't record this book to a friend or robot".

Nobody seems to actually reproduce the copyrighted materials.

High-dimensional eigendecompositions which underpin AI similarity are some of the most literally derivative materials of texts that you can imagine.

Stagnant•18m ago
Hard to see any practical benefit to go after common crawl. The situation of freely accessible crawled data is bad enough as it is with archive.org and CC being pretty much the only available sources. We need more initiatives like them, not less. The scary thing is how the anti-AI sentiment is being used to lock things down further.

Using Optical Aberrations to Distinguish Real Astronomical Transients

https://arxiv.org/abs/2606.08319
1•solarist•59s ago•0 comments

Ronin

https://100r.co/site/ronin.html
1•tosh•1m ago•0 comments

Watch These Judges Rip into Lawyers for Citing Cases That Don't Exist

https://www.404media.co/new-york-court-ai-citations-landberg-case/
1•b-man•2m ago•0 comments

Built to benefit everyone: our plan

https://openai.com/index/built-to-benefit-everyone-our-plan/
1•mstevens•2m ago•0 comments

Scott and Mark Learn to Vibe Check with Steve Sanderson [video]

https://www.youtube.com/watch?v=zh6fMtL_cSM
1•joshka•3m ago•0 comments

Flat Datacenter Networks at Scale

https://perspectives.mvdirona.com/2026/06/flat-datacenter-networks-at-scale/
1•zdw•3m ago•0 comments

Position paper: Agents should train on their histories, not just retrieve them

https://zenodo.org/records/20583812
1•iamevandrake•3m ago•0 comments

Solar Energy Saves Europeans $135M a Day

https://cleantechnica.com/2026/06/08/solar-energy-saves-europeans-135-million-a-day/
2•vrganj•3m ago•0 comments

Show HN: Open-source plugin that builds single-file HTML decks for coding agents

https://github.com/FluidForm-ai/fluiddocs-deck-builder
1•naggarwal29•4m ago•0 comments

Pentagon Says Alibaba, Baidu, BYD, and Unitree Support China's Military

https://techcrunch.com/2026/06/08/pentagon-says-alibaba-baidu-byd-and-unitree-support-chinas-mili...
1•netfortius•4m ago•1 comments

Show HN: Dochost – turn AI output into a shareable link

https://dochost.io
1•sailorpro•4m ago•0 comments

The Math of Fitting In

https://omnia.sas.upenn.edu/story/math-fitting-in-language-acquisition-social-norms-yang
1•wjb3•5m ago•0 comments

Efficient Training on Multiple Consumer GPUs with RoundPipe

https://arxiv.org/abs/2604.27085
1•PaulHoule•6m ago•0 comments

Govt websites, security, and the dreaded f12

https://github.com/Evillare/EMCCA---potential-berach-in-authentication-and-security/tree/main
1•Evillare•6m ago•0 comments

Noyb launches class action over CRIF's scoring system in Austria

https://noyb.eu/en/secret-scoring-join-crif-class-action-now
1•buzer•6m ago•0 comments

Even light drinking raises risk of cancer, heart disease, and early death

https://www.eurekalert.org/news-releases/1131274
1•stringfood•9m ago•0 comments

They Spent Years on a Math Problem. Then They Were Scooped by A.I

https://www.nytimes.com/2026/06/08/science/ai-scoop-young-mathematicians.html
1•digital55•9m ago•0 comments

macOS 27 Beta Breaks the Ability to Boot Asahi Linux

https://www.phoronix.com/news/macOS-27-Beta-Breaks-Asahi
2•josephcsible•11m ago•0 comments

The Conductor Rewrite: What They Changed to Make It Fast

https://performance.dev/the-conductor-rewrite
1•Charlieholtz•11m ago•0 comments

Can LLMs Beat Classical Hyperparameter Optimization Algorithms?

https://arxiv.org/abs/2603.24647
2•galsapir•12m ago•0 comments

A New Symbolism for the Propositional Calculus (1954) [pdf]

http://www.nsl.com/k/parry/parry.pdf
1•tosh•13m ago•0 comments

Quantum Weak Measurement: Validating Cheng's Cosmological Model

https://medium.com/@f9121212/the-convergence-of-quantum-weak-measurement-and-metaphysical-conject...
2•ortrich•13m ago•0 comments

Show HN: A terminal writing environment with Git, E2EE sync and temporal search

1•sys-ronin•13m ago•1 comments

Show HN: Cate – open-source canvas IDE for agentic coding workflows

https://cate.cero-ai.com
2•Imbiss•14m ago•0 comments

China Preps $295B Plan to Fund Nationwide AI Buildout

https://www.bloomberg.com/news/articles/2026-06-09/china-prepares-295-billion-plan-to-fund-nation...
3•1una•15m ago•1 comments

German Tourists Get Stuck in Bighorns After Following Google Maps

https://cowboystatedaily.com/2026/06/08/german-tourists-get-stuck-in-bighorns-after-following-goo...
1•Bender•15m ago•0 comments

Deep Neural Networks for YouTube Recommendations (2016) [pdf]

https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45530.pdf
1•Olshansky•17m ago•0 comments

Alberta pitches cheap NatGas for data center boom, at odds with CA's green aims

https://www.reuters.com/business/energy/alberta-pitches-cheap-natural-gas-data-center-boom-odds-w...
1•alephnerd•19m ago•0 comments

macOS 27 requires Apple Silicon, as Apple draws down the Intel Mac era

https://arstechnica.com/gadgets/2026/06/macos-27-requires-apple-silicon-as-apple-draws-down-the-i...
3•Brajeshwar•19m ago•0 comments

Show HN: I built 10 ML algos from scratch because fit() predict() are not enough

https://github.com/ml-from-scratch-book/code
2•akmoleksandr•19m ago•1 comments