frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Messing with Scraper Bots

https://herman.bearblog.dev/messing-with-bots/
30•HermanMartinus•2h ago

Comments

ArcHound•34m ago
Neat! Most of the offensive scrapers I met try and exploit WordPress sites (hence the focus on PHP). They don't want to see php files, but their outputs.

What you have here is quite close to a honeypot, sadly I don't see an easy way to counter-abuse such bots. If the attack is not following their script, they move on.

NoiseBert69•28m ago
Hm.. why not using dumbed down small, self-hopsted LLM networks to feet the big scrapers with bullshit?

I'd sacrifice two CPU cores for this just to make their life awful.

jcynix•20m ago
If you control your own Apache server and just want to shortcut to "go away" instead of feeding scrapers, the RewriteEngine is your friend, for example:

      RewriteEngine On

      # Block requests that reference .php anywhere (path, query, or encoded)
      RewriteCond %{REQUEST_URI} (\.php|%2ephp|%2e%70%68%70) [NC,OR]
      RewriteCond %{QUERY_STRING} \.php [NC,OR]
      RewriteCond %{THE_REQUEST} \.php [NC]
      RewriteRule .* - [F,L]
Notes: there's no PHP on my servers, so if someone asks for it, they are one of the "bad boys" IMHO. Your mileage may differ.
localhostinger•15m ago
Interesting! It's nice to see people are experimenting with these, and I wonder if this kind of junk data generators will become its own product. Or maybe at least a feature/integration in existing software. I could see it going there.
s0meON3•6m ago
What about using zip bombs?

https://idiallo.com/blog/zipbomb-protection

Spec-Driven Development: The Waterfall Strikes Back

https://marmelab.com/blog/2025/11/12/spec-driven-development-waterfall-strikes-back.html
79•vinhnx•2h ago•57 comments

The Internet Is Cool. Thank You, TCP

https://cefboud.com/posts/tcp-deep-dive-internals/
44•signa11•3h ago•24 comments

AI World Clocks

https://clocks.brianmoore.com/
1019•waxpancake•15h ago•311 comments

Messing with Scraper Bots

https://herman.bearblog.dev/messing-with-bots/
30•HermanMartinus•2h ago•5 comments

So, you want to design your own language? (2017)

https://cs.lmu.edu/~ray/notes/languagedesignnotes/
69•veqq•4h ago•50 comments

AMD GPUs Go Brrr

https://hazyresearch.stanford.edu/blog/2025-11-09-amd-brr
125•vinhnx•7h ago•19 comments

How My Speed Date Got Stolen Onstage at a Live Comedy Dating Show

https://psychotechnology.substack.com/p/how-my-speed-date-got-stolen-onstage
11•eatitraw•1h ago•3 comments

Löb and Möb: Loops in Haskell

https://github.com/quchen/articles/blob/master/loeb-moeb.md
34•fanf2•1w ago•6 comments

Unofficial Microsoft Teams client for Linux

https://github.com/IsmaelMartinez/teams-for-linux
131•basemi•1w ago•98 comments

A new Google model is nearly perfect on automated handwriting recognition

https://generativehistory.substack.com/p/has-google-quietly-solved-two-of
328•scrlk•3d ago•186 comments

SSL Configuration Generator

https://ssl-config.mozilla.org/
163•smartmic•11h ago•47 comments

HipKittens: Fast and furious AMD kernels

https://hazyresearch.stanford.edu/blog/2025-11-09-hk
173•dataminer•1d ago•58 comments

'No One Lives Forever' turns 25 and you still can't buy it legitimately

https://www.techdirt.com/2025/11/13/no-one-lives-forever-turns-25-you-still-cant-buy-it-legitimat...
254•speckx•17h ago•124 comments

Ohm Editor

https://ohmjs.org/editor/
21•andsoitis•4h ago•3 comments

All praise to the lunch ladies

https://bittersoutherner.com/issue-no-12/all-praise-to-the-lunch-ladies
187•gmays•14h ago•107 comments

Random Font – a typographic experiment exploring randomness [pdf]

https://www.ilcovile.it/scritti/COVILE_834_Reprint_Random_Font.pdf
13•misone•1w ago•5 comments

No Leak, No Problem – Bypassing ASLR with a ROP Chain to Gain RCE

https://modzero.com/en/blog/no-leak-no-problem/
77•todsacerdoti•10h ago•4 comments

Structured outputs on the Claude Developer Platform

https://www.claude.com/blog/structured-outputs-on-the-claude-developer-platform
140•adocomplete•14h ago•59 comments

Lawmakers want to ban VPNs and have no idea what they're doing

https://www.eff.org/deeplinks/2025/11/lawmakers-want-ban-vpns-and-they-have-no-idea-what-theyre-d...
218•gslin•1d ago•108 comments

A race condition in Aurora RDS

https://hightouch.com/blog/uncovering-a-race-condition-in-aurora-rds
220•theanomaly•15h ago•69 comments

Blending SQL and Python with Sqlorm

https://hyperflask.dev/blog/2025/11/11/blending-sql-and-python-with-sqlorm/
9•emixam•4d ago•4 comments

Continuous Architecture: A decade of designing for change

https://continuousarchitecture.com/2025/11/04/a-decade-of-ca/
4•gHeadphone•1w ago•0 comments

Async Mutexes

https://matklad.github.io/2025/11/04/on-async-mutexes.html
38•ingve•1w ago•13 comments

Ucs-Detect

https://ucs-detect.readthedocs.io/intro.html
13•djoldman•1w ago•3 comments

Winamp clone in Swift for macOS

https://github.com/mgreenwood1001/winamp
226•hyperbole•21h ago•142 comments

Meet Reservoir – The World's Smartest Water Heater

https://www.reservoirhome.com/
22•flakespancakes•6h ago•28 comments

GEN-0 / Embodied Foundation Models That Scale with Physical Interaction

https://generalistai.com/blog/nov-04-2025-GEN-0
57•jackdoe•1w ago•5 comments

Hiring the Joker

https://quarter--mile.com/hiring-the-joker
26•surprisetalk•1w ago•19 comments

Show HN: Tiny Diffusion – A character-level text diffusion model from scratch

https://github.com/nathan-barry/tiny-diffusion
132•nathan-barry•4d ago•16 comments

Manganese is Lyme disease's double-edge sword

https://news.northwestern.edu/stories/2025/11/manganese-is-lyme-diseases-double-edge-sword
141•gmays•17h ago•78 comments