frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

LLM scraper bots are overloading acme.com's HTTPS server

http://acme.com/updates/archive/229.html
18•mjyut•1h ago

Comments

davidsojevic•1h ago
I suspect part of the issue is that people are still using things like `acme.com` and `demo.com` as an example domain in their documentation and tests instead of relying on `example.com` which is reserved exactly for this purpose [0]

[0]: https://www.iana.org/domains/reserved

Frieren•57m ago
> The LLM companies are not picking on me in particular, they are pounding every site on the net.

Why is not this a criminal offense? They are hurting business for profit (or for higher valuation as they probably have no profit at all).

Why are corporations allowed to do with impunity what could land even a teenager years in prison? Is there no rule of law anymore?

The five-year and ten-year penalties kick in only when the government can show the offense caused at least $5,000 in losses across all victims during a one-year period. https://legalclarity.org/what-are-the-punishments-for-a-ddos...

tempest_•43m ago
Because might makes right and any entity with the power to legally put up a fight is in on the game (or wants to be)
heavyset_go•40m ago
We've already established that computer crime and IP laws apply to normies and not tech companies
budududuroiu•36m ago
Normative vs prerogative state [1]. See US v. Swartz compared to Meta use of LibGen for Llama

[1] https://en.wikipedia.org/wiki/Dual_state_(model)

avazhi•22m ago
Is what an offence lol? Bot scraper traffic?

How do you think search engines work?

will4274•14m ago
It's a bit more like a physical business with a "public welcome" policy like a coffee shop going viral and then having tens of thousands of people walking in and taking pictures but not buying coffee. It's disruptive, but not illegal.

Acme.com is welcome to require authentication for all pages but their home page, which would quickly cause the traffic to drop. They don't want to do this - like the coffee shop, they want to be open to public, and for good reasons.

Sometimes the use profile changes dramatically in a short time. 15 years ago, Netflix created the video streaming market and shared bandwidth capacity that had been excessive before wasn't enough. 15 years before that, Google did the same thing when they created search and started driving tremendous traffic to text based websites which had spread through word of mouth before.

Turns out the micro transaction people probably had the right idea.

legohead•6m ago
adapt or die

waiting on the govt to do something is a path of failure

JohnTHaller•50m ago
Series of Chinese LLM scrapers kept PortableApps.com running slow and occasionally unresponsive for 2 weeks.
superkuh•43m ago
There are plenty of local LLMs out there run by humans that play nice. It's not the LLMs that are the problem. It's the corporations. That's the commonality. Human people aren't doing this. These corporate legal persons are a much more dangerous and capable form of non-human intelligence with non-human motives than LLMs (which are not doing the scraping or even calling the tools which are sending the HTTP requests). And they have lobbied their way to legal immunity to most of their crimes.
happyopossum•30m ago
> Human people aren't doing this

Who do you think writes these scrapers? Well, I mean aside from the vibe coded ones.

chupchap•25m ago
Bot traffic is crazy even for smaller sites, but still manageable. I was getting 2,000 visitors a day on my infrequently updated website, but after I blocked all the bots via Cloudflare it went back to the normal double digit visitor count.
avazhi•23m ago
> Someone really ought to do something about it.

What is bro proposing here?

kristianp•14m ago
> Nearly all of them were for non-existent pages.

Do any webservers have a feature where they keep a list in memory of files/paths that exist?

Basilisk Collection

https://suricrasia.online/unfiction/basilisk/
1•avaer•2m ago•0 comments

Who Is Satoshi Nakamoto? My Quest to Unmask Bitcoin's Creator

https://www.nytimes.com/2026/04/08/business/bitcoin-satoshi-nakamoto-identity-adam-back.html
2•jfirebaugh•8m ago•0 comments

Show HN: KOS Protocol – A kos.json file for AI agents to read verified facts

https://kosprotocol.dev
2•niseus•14m ago•0 comments

Ask HN: Is it possible to escape captive portal webview limitations?

1•remohexa•19m ago•1 comments

Show HN: The Spotify for AI Agents – StarSinger MCP

https://mcp.starsinger.ai/
1•usestork•20m ago•0 comments

$ npx port-grid – Scans your ports. Shows live previews. Kill from the UI

https://twitter.com/Jeroen_Ransijn/status/2041733263430238405
1•JeroenRansijn•23m ago•0 comments

The Open-Source Recipe for Teaching a Robot to Fold Your Clothes

https://huggingface.co/spaces/lerobot/robot-folding
3•dsr12•28m ago•0 comments

The Building Block Economy

https://mitchellh.com/writing/building-block-economy
1•zoogeny•36m ago•0 comments

Show HN: HN: a collection of web desktops with real browsers

https://win9-5.com/demo
1•keepamovin•36m ago•0 comments

Claude Code Chase Thread

https://i.postimg.cc/7Y9sjHb4/Screenshot-2026-04-07-at-10-35-06-PM.png
1•dbfhac•38m ago•0 comments

Anthropic may have leaked Claude Code source on purpose

https://www.ft.com/content/59249643-a221-4494-bcb5-62e5f4fedc8e
3•zezaggering•40m ago•1 comments

Steve Jobs iPhone 2007 Presentation (Full Transcript, PDF)

https://singjupost.com/wp-content/uploads/2014/07/Steve-Jobs-iPhone-2007-Presentation-Full-Transc...
1•prawn•40m ago•0 comments

Everyone Gets Jevons Paradox Wrong

https://www.amazingcto.com/everyone-gets-jevons-paradox-wrong/
1•KingOfCoders•41m ago•1 comments

ContextSync – Sync VS Code AI Context via Obsidian/OneDrive

https://marketplace.visualstudio.com/items?itemName=ZayaanBhanwadia.context-sync
3•ZayaanBhan123•54m ago•0 comments

Decentralized AI from Scratch

https://github.com/iamtrask/decentralized-ai-from-scratch
3•williamtrask•57m ago•0 comments

Latency Implications of Virtual Memory

https://rigtorp.se/virtual-memory/
1•vinhnx•58m ago•0 comments

MemPalace - A Scam

https://twitter.com/AdvicebyAimar/status/2041559354034344438
1•doppp•58m ago•1 comments

The Cost of Misalignment

https://interrupt.memfault.com/blog/the-hidden-cost-of-misalignment
1•vinhnx•58m ago•0 comments

Part 2 of 8 – The Wrong Mental Model

https://www.planetform.io/blog/the-wrong-mental-model
1•rtwo_infra•59m ago•0 comments

Open Models have crossed a threshold

https://blog.langchain.com/open-models-have-crossed-a-threshold/
2•gmays•1h ago•0 comments

Hallucinated citations are polluting the scientific literature

https://www.nature.com/articles/d41586-026-00969-z
4•30minAdayHN•1h ago•2 comments

Egypt plans to introduce SIM cards specifically for children

https://www.egyptindependent.com/egypt-plans-to-introduce-sim-cards-specifically-for-children/
1•catlikesshrimp•1h ago•1 comments

Show HN: Silkwave Voice – AI Notetaker Using Apple Intelligence's ChatGPT

https://www.silkwave.ai/silkwave-voice
1•bmv3502•1h ago•0 comments

Industrial Policy for the Intelligence Age

https://openai.com/index/industrial-policy-for-the-intelligence-age/
1•grigy•1h ago•0 comments

Alexander Friedmann and the origins of modern cosmology (2012)

https://physicstoday.aip.org/features/alexander-friedmann-and-the-origins-of-modern-cosmology
1•the-mitr•1h ago•0 comments

Aether – Artificial Ecology for Thought and Emergent Reasoning

https://aetherantcolony.com/
1•calcosmic•1h ago•0 comments

One Brain to Query: Wiring a 60-Person Company into a Single Slack Bot

https://merylldindin.com/thoughts/company-brain/
2•meryll_dindin•1h ago•1 comments

Part 1 of 8 – The Infrastructure Entropy Problem

https://www.planetform.io/blog/infrastructure-entropy-problem
1•rtwo_infra•1h ago•0 comments

HK police can now demand phone passwords under new national security rules

https://www.bbc.com/news/articles/ce8j9yj52lro
1•pabs3•1h ago•2 comments

DayZ devs talk 1.29 server performance update [video]

https://www.youtube.com/watch?v=xPKl5yOPk28
2•dijksterhuis•1h ago•0 comments