This site can’t provide a secure connection annas-archive.li sent an invalid response. ERR_SSL_PROTOCOL_ERROR
Hmmm… can't reach this page
Check if there is a typo in annas-archive.li.
DNS_PROBE_FINISHED_NXDOMAIN
>In December 2024, the UK Publishers Association won an order from the High Court of Justice requiring major ISPs to block Anna's Archive and other copyright-infringing sites, extending a list of sites blocked since 2015 under section 97A of the Copyright, Designs and Patents Act
I wonder if it's blocked simply by DNS manipulation and therefore only people using the ISP DNS have issues.
> Error code: PR_CONNECT_RESET_ERROR
If I try the http version, I get redirected to https://bloqueadaseccionsegunda.cultura.gob.es/ (which also fails with PR_CONNECT_RESET_ERROR).
If it wasn't enough that half the internet gets unusable whenever there is football on TV (which is fucking stupid), now we're also getting rid of free (text!) information it seems.
You’re welcomed with this message:
Diese Webseite ist aus urheberrechtlichen Gründen nicht verfügbar. Zu den Hintergründen informieren Sie sich bitte hier.
And the works that previously had lead to Project Gutenberg being unavailable from Germany IP addresses will go into public domain in 2027.
[1]: https://www.youtube.com/watch?v=Uxmu25mUZgg [2]: https://cuiiliste.de/
Still a WIP, but it should be working well on Linux, Android and macOS. Give it a go if you want to support Anna's Archive.
And don't use imgur, that's blocked here too.
They first removed the direct links, and now all the references to them.
Now that's a reward signal!
That's what I get on this address:
Diese Webseite ist aus urheberrechtlichen Gründen nicht verfügbar. Zu den Hintergründen informieren Sie sich bitte hier.
Basically blocked for copyright reasons. And the 'hier' leads here:
I have less rights to access the information than LLMs have.
reconnecting•1h ago
We analyzed this on different websites/platforms, and except for random crawlers, no one from the big LLM companies actually requests them, so it's useless.
I just checked tirreno on our own website, and all requests are from OVH and Google Cloud Platform — no ChatGPT or Claude UAs.
cardanome•1h ago
GaggiX•59m ago
reconnecting•52m ago
What I've seen from ASNs is that visits are coming from GOOGLE-CLOUD-PLATFORM (not from Google itself), and OVH. Based on UA, users are: WebPageTest, BuiltWith, and zero LLMs based on both ASN and UA.
1. https://github.com/tirrenotechnologies/tirreno
GaggiX•46m ago
reconnecting•41m ago
GaggiX•20m ago
reconnecting•11m ago
whazor•54m ago
reconnecting•43m ago
We had made a docs website generator (1) that works with HTML (2) FRAMESET and tried to parse it with Claude.
Result: Claude doesn't see the content that comes from FRAMESET pages, as it doesn't parse FRAMEs. So I assume what they're using is more or less a parser based on whole-page rendering and not on source reading (including comments).
Perhaps, this is an option to avoid LLM crawlers: use FRAMEs!
1. https://github.com/tirrenotechnologies/hellodocs
2. https://www.tirreno.com/hellodocs/