Ask HN: What measures are you taking to stop AI crawlers?

6•kjok•5h ago
Curious to know what steps people here are taking to protect their sites, products, and APIs. What have you tried that actually works in practice?

Comments

JohnFen•5h ago
I spent a lot of time trying to find a good solution to this problem and failed, so what I ended up doing was to give up and remove my sites from the public web entirely.

I'm eager for a good solution that will allow me to put them back, but I'm doubtful that's going to happen. In any case, I'm extremely interested in other people's replies here. Maybe there's a solution that I haven't been able to find!

bediger4000•5h ago
I have a lot of them in robots.txt with Disallow: /, of course. A few get a 404 on every request whatsoever, mainly Meta's AI crawler and Bytespider, via Apache httpd mod_rewrite.
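
For anyone not running Apache, the same "404 on every request" idea can be sketched in application code. This is only a rough illustration, not the mod_rewrite rule described above: the function name `status_for` and the two User-Agent tokens in the pattern are assumptions chosen for the example, so check your own access logs for the strings the crawlers actually send.

```python
import re

# Assumed User-Agent tokens for the crawlers named above; verify them
# against your own access logs before relying on this list.
BLOCKED_AGENT_PATTERN = re.compile(r"bytespider|meta-externalagent", re.IGNORECASE)


def status_for(user_agent: str) -> int:
    """Return 404 for blocklisted crawlers and 200 for everyone else."""
    # A missing User-Agent header is treated as an empty string.
    if BLOCKED_AGENT_PATTERN.search(user_agent or ""):
        return 404
    return 200
```

A request handler or middleware would call `status_for(request_user_agent)` before doing any real work, which keeps the cost of serving these bots close to zero. It only helps against crawlers that identify themselves honestly.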

johng•5h ago
Some of our sites have been getting absolutely hammered by the AI bots, so much so that they are taking the sites down, even with Cloudflare protection and caching. The only thing we've been able to do so far is tell Cloudflare to block all AI bots and modify robots.txt. Even then, we've had to manually identify IP addresses and bots that ignore all of the above and block them specifically or at the ASN level.

Cloudflare makes doing this kind of stuff easy but I would hate to have to do this manually on a webserver. And I don't like the idea of how much of the internet already relies on Cloudflare.
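
If you do end up doing the IP-level part by hand, a minimal sketch of the check might look like the following. The names `BLOCKED_NETWORKS` and `is_blocked` are hypothetical, and the ranges are reserved documentation prefixes used as placeholders, not real crawler addresses. Blocking "at the ASN level" would mean filling the list with the prefixes that ASN announces, which this sketch does not look up.

```python
import ipaddress

# Placeholder CIDR ranges (reserved documentation prefixes), standing in
# for ranges you identified yourself from logs or an ASN prefix list.
BLOCKED_NETWORKS = [
    ipaddress.ip_network(cidr)
    for cidr in ("192.0.2.0/24", "198.51.100.0/24", "2001:db8::/32")
]


def is_blocked(client_ip: str) -> bool:
    """Return True if client_ip falls inside any blocked network."""
    addr = ipaddress.ip_address(client_ip)
    # Membership tests across IPv4/IPv6 versions simply evaluate to False.
    return any(addr in network for network in BLOCKED_NETWORKS)
```

For a long list of prefixes a linear scan like this gets slow, so a firewall rule set or the CDN's own blocking is probably the better home for it. The sketch is just to show what the check amounts to.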

ATechGuy•2h ago
Just saw this https://x.com/ycombinator/status/1960779353589211577

They say "... can scrape any website—not even Cloudflare can detect it."
