frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

RTX 5090 and M4 MacBook Air: Can It Game?

https://scottjg.com/posts/2026-05-05-egpu-mac-gaming/
151•allenleee•1h ago•38 comments

Computer Hobby Movement in Canada

https://museum.eecs.yorku.ca/exhibits/show/hobby_canada/hobby_canada
120•rbanffy•4h ago•33 comments

MIT: 20% drop in incoming graduate students

https://president.mit.edu/writing-speeches/video-transcript-message-president-kornbluth-about-fun...
360•dmayo•2h ago•353 comments

Claude AI recovers an 11 yrs old BTC wallet holding 400k USD

https://www.tomshardware.com/tech-industry/cryptocurrency/bitcoin-trader-recovers-usd400-000-usin...
205•cednore•2h ago•89 comments

Fossils show millipede and centipede ancestors evolved legs underwater

https://phys.org/news/2026-05-ancient-sea-fossils-millipede-centipede.html
15•gmays•2d ago•2 comments

Claude for Small Business

https://www.anthropic.com/news/claude-for-small-business
466•neilfrndes•13h ago•422 comments

Terranox AI (YC W26) Is Hiring a Founding AI/ML Engineer and Summer AI/ML Intern

https://www.workatastartup.com/companies/terranox-ai
1•jadecheclair•20m ago

On The Conflation of Money and Things

https://lithub.com/is-it-even-real-on-the-conflation-of-money-and-things/
20•bookofjoe•1h ago•3 comments

Show HN: Running the second public ODoH relay

https://numa.rs/blog/posts/odoh-anonymous-dns-without-an-account.html
98•rdme•6h ago•32 comments

60fps Video on a CGA? – The GlyphBlaster

https://martypc.blogspot.com/2026/05/60fps-video-on-cga-glyphblaster.html
30•tambourine_man•4d ago•4 comments

Cuba says it has run out of fuel, blames U.S. embargo

https://www.upi.com/Top_News/World-News/2026/05/14/Cuba-says-oil-reserves-totally-drained/9311778...
61•thm•1h ago•48 comments

Linux gaming is faster because Windows APIs are becoming Linux kernel features

https://www.xda-developers.com/linux-gaming-is-getting-faster-because-windows-apis-are-becoming-l...
918•haunter•3d ago•566 comments

EditLens: Quantifying the extent of AI editing in text (2025)

https://arxiv.org/abs/2510.03154
6•horseradish•21h ago•0 comments

The Tree House: A voyage to the source of a backyard dream

https://www.laphamsquarterly.org/roundtable/tree-house
51•Caiero•2d ago•5 comments

Myths about /dev/urandom (2014)

https://www.2uo.de/myths-about-urandom/
64•signa11•5h ago•33 comments

Scorched Earth 2000 – Web

http://www.scorch2000.com/web/
347•meshko•16h ago•139 comments

USDA Projects Smallest US Wheat Harvest Since 1972 Due to Plains Drought

https://www.agweb.com/news/usda-projects-smallest-us-wheat-harvest-1972-due-plains-drought
181•littlexsparkee•4h ago•123 comments

Sam Altman's Business Dealings Under GOP Scrutiny Ahead of OpenAI's IPO

https://www.wsj.com/tech/ai/sam-altmans-business-dealings-under-gop-scrutiny-ahead-of-openais-ipo...
133•1vuio0pswjnm7•4h ago•95 comments

Leaving the Physical World

https://www.eff.org/pages/leaving-physical-world
136•andsoitis•4d ago•58 comments

Saying Goodbye to one line of APL

https://homewithinnowhere.com/posts/2026-05-10-one-line.html#fnref1
63•tosh•3d ago•19 comments

Anthropic forms $200M partnership with the Gates Foundation

https://www.anthropic.com/news/gates-foundation-partnership
69•surprisetalk•2h ago•47 comments

A Claude Code and Codex Skill for Deliberate Skill Development

https://github.com/DrCatHicks/learning-opportunities
175•cdrnsf•14h ago•37 comments

Setting up a free *.city.state.us locality domain (2025)

https://fredchan.org/blog/locality-domains-guide/
601•speckx•1d ago•205 comments

Pipes, Forks, and Zombies

https://cs61.seas.harvard.edu/wiki/2017/Shell3/
30•tosh•6h ago•4 comments

MacBook Neo Deep Dive: Benchmarks, Wafer Economics, and the 8GB Gamble

https://www.jdhodges.com/blog/macbook-neo-benchmarks-analysis/
304•tosh•22h ago•365 comments

A History of IDEs at Google

https://laurent.le-brun.eu/blog/a-history-of-ides-at-google
440•laurentlb•5d ago•286 comments

The Emacsification of Software

https://sockpuppet.org/blog/2026/05/12/emacsification/
378•rdslw•1d ago•236 comments

Swift bricks to be installed on all new buildings in Scotland

https://www.theguardian.com/environment/2026/jan/28/swift-bricks-to-be-installed-in-all-new-build...
105•bookofjoe•4d ago•56 comments

Chess puzzle I found in my dad's old book

https://ardoedo.it/kempelen/
207•Eswo•3d ago•69 comments

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

https://github.com/cactus-compute/needle
712•HenryNdubuaku•1d ago•206 comments
Open in hackernews

New agents.txt file found on DreamHost

https://journal.kvibber.com/2026/05/agents-txt-on-dreamhost/
9•speckx•1h ago

Comments

kennywinker•1h ago
Do any agents respect agents.txt?

Is there a way to opt my websites out of ai data collection?

embedding-shape•1h ago
Add HTTP Basic Auth in front of your website, then share the credentials with people who are allowed to view your website. Make sure you don't hand our credentials to employees of OpenAI, Anthropic, xAI or Microsoft.
wolttam•1h ago
Any measure you put in place can/will be ignored by the actors who never planned to respect your wishes in the first place.

That's just how the web works, though.

cortesoft•47m ago
This is true for measures that require the actor to respect your wishes, but doesn't apply to measures that force them to.
wolttam•3m ago
This is just like security; the most secure system is the one that nobody can use.

I think the proof-of-work approach that anubis[0] takes is pretty interesting.

I love the idea of having to do a small amount of work for the author of the content in order to get access to their content. It would be interesting to a scheme where the proof-of-work that clients do in systems like anubis actually had a way to directly benefit the author.

[0]: https://github.com/TecharoHQ/anubis

ghostlyy•1h ago
partial answer: the major labs (Anthropic, OpenAI) do respect robots.txt for their named crawlers, so blocking ClaudeBot/GPTBot in robots.txt works for those specific bots. What you can't easily opt out of is the indirect ingestion via Common Crawl, scraped datasets, and unnamed crawlers. agents.txt doesn't change that picture. The Allow-Training vs Allow-RAG split in the default is the useful part of the file. They're different operations with different costs to the site owner. Training is a one-time bulk ingest. RAG is a runtime fetch per query. A site owner might reasonably allow one and not the other.
ninjin•36m ago
I can report that Facebook does not respect robots.txt. Heck, I even mailed domain@fb.com with the specific IP ranges and log samples three times over a month and they of did not even respond. Keeps on wasting my CPU cycles to this day by crawling massive development forks (I hope they choke on the data...):

    $ (cat /var/www/logs/access.log; zcat /var/www/logs/access.log*.gz) | grep 2a03:2880: | wc -l
    626396
About three hits per second for months now.
drcongo•28m ago
I block their entire ASN when they do that.
dylan604•13m ago
Can you serve them a specific file that would make it expensive on their end?
ninjin•10m ago
If I had the time and energy, I would make some sort of simple code language model and generate infinite junk and feed that to them in the hope that it ruins their future training runs. But, I lack the former and some of the latter. Alternatively, maybe I would actually read one of those "backdoor papers" and try to inject something like that.
sschueller•41m ago
Well Claude still thinks it shouldn't read AGENTS.md [1] so they probably also don't care about agents.txt on a web server...

[1] https://github.com/anthropics/claude-code/issues/6235

crtasm•36m ago
Part of the Managed VPS Hosting package, I guess