frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Google Removed 749M Anna's Archive URLs from Its Search Results

https://torrentfreak.com/google-removed-749-million-annas-archive-urls-from-its-search-results/
56•gslin•1h ago

Comments

ggm•1h ago
I'm not sure I've ever relied on google to tell me what a site like this had, when the site itself is fully indexed, as this one is. Freetext search over the metastate of title, author, format, date (when available) -seems to work.
n1xis10t•1h ago
They don’t have full text search of document contents though do they? I know Google wouldn’t have this for AA pages either, just curious
ggm•1h ago
Good point. So there is definitely a social utility in search over text which google does have, for the trove it scanned, hands and cats-pawprints and all.
n1xis10t•1h ago
I’m pretty sure Google indexing pages from Anna’s archive would only get metadata, because AA doesn’t have the full text of the books on those pages. I think to get the full text you have to download the torrents, and I don’t think Google was doing that.
ggm•55m ago
No, thats more meta's trick. and they were "only doing it for the articles" not the pictures. I think. I dunno..
toomuchtodo•1h ago
Are they in ChatGPT and other LLM providers? No need for Google.
CamperBob2•41m ago
You could say that, yes.
aunty_helen•1h ago
Google does search now? I mean, it's great to see but I'm not sure how this is going to challenge the convenience of my chosen brand of chatbot being able to find the same info without being scammed by 100 seo optimised junk sites.
JKCalhoun•1h ago
Not sure. I understand they used to do search though.

(Love the username, BTW.)

n1xis10t•1h ago
Yeah they’re pretty terrible now. Reminds me, this is an interesting article about search engines getting worse and failing, but the author didn’t get into the spam aspect iirc: https://archive.org/details/search-timeline
n1xis10t•1h ago
I have heard that chatbots aren’t affected by spam as much as Google when you ask them to search, is that true?
add-sub-mul-div•1h ago
1. Your chatbot doesn't have its own internet scale search index.

2. You're being given information that may or may not be coming in part from junk sites. All you've done is give up the agency to look at sources and decide for yourself which ones are legitimate.

n1xis10t•1h ago
As for point one, is that true? I thought ChatGPT and Perplexity had their own indexes.
agluszak•1h ago
Anna's archive has already fulfilled G's needs (training Gemini) so now it's time to pretend it never existed ;)
someperson•48m ago
Feels weird to say but I have found using Yandex of all places an excellent search engine for content that get taken down by DMCA requests.

Eg if you want to watch a movie that's not on Netflix using a web stream the search results are far better.

Feels like Google circa 2005.

negativelambda•20m ago
I just tested, indeed very good results!
chneu•11m ago
I've been playing around with a variety of search engines such as Kagi, Startpage, Ecosia, DDG.

All of them are better than google in finding relevant results. Lol

Google is way too "personalized".

qiqitori•6m ago
You can turn off personalization. (Operating under the assumption that most people search for facts, I personally don't see why one would ever want personalized results.)
drnick1•16m ago
Go thing that Google hasn't been a part of my life for a while now. I use DuckDuck for search.
storus•13m ago
Google's march to irrelevance continues with full steam.

Python concurrency: gevent had it right

https://harshal.sheth.io/2025/09/12/python-async.html
1•hsheth2•3m ago•0 comments

Quinnypig/Yeet

https://github.com/quinnypig/yeet
2•rootforce•9m ago•0 comments

Ask HN: Lawyers of HN, how do you deal with AI slop?

3•gardnr•10m ago•0 comments

AI researchers 'embodied' an LLM into a robot – and it channeled Robin Williams

https://techcrunch.com/2025/11/01/ai-researchers-embodied-an-llm-into-a-robot-and-it-started-chan...
1•gnabgib•11m ago•0 comments

Ford Foundation's New Leader Vows to Protect Elections and the Rule of Law

https://www.nytimes.com/2025/11/03/us/politics/ford-foundation-heather-gerken-trump.html
2•whack•12m ago•0 comments

Unpaid Domestika and CGMA Instructors Protest Online, Students Join over Billing

https://www.classcentral.com/report/domestika-unpaid-instructors/
1•raybb•13m ago•0 comments

11X Faster ScyllaDB Backup

https://www.scylladb.com/2025/11/04/11x-faster-scylladb-backup/
1•tanelpoder•13m ago•0 comments

Thoughts by a non-economist on AI and economics

https://www.lesswrong.com/posts/QQAWu7D6TceHwqhjm/thoughts-by-a-non-economist-on-ai-and-economics
1•gwintrob•19m ago•0 comments

Petri Dish Neural Cellular Automata

https://pub.sakana.ai/pdnca/
1•hardmaru•20m ago•0 comments

Lazy Backup (2006)

http://www.aaronsw.com/weblog/lazybackup
3•varun_ch•29m ago•1 comments

Free Learning in Today's Society: Some Personal Experiences and Reflections

https://www.lesswrong.com/posts/pESH2aYfu4B9rhNEm/free-learning-in-today-s-society-some-personal-...
2•gmays•36m ago•0 comments

Show HN: Send USDC via Email

https://btwnfriends.com/
3•Must_be_Ash•38m ago•0 comments

The Physics of News, Rumors, and Opinions

https://arxiv.org/abs/2510.15053
2•Anon84•39m ago•0 comments

Experiences with AI-Generated Pornography

https://link.springer.com/article/10.1007/s10508-025-03227-x
3•tokai•42m ago•0 comments

Datadog Instance Explorer

https://instances.datadoghq.com/
2•scapecast•48m ago•0 comments

You Freeze in Meetings (Even When You Know You Stuff)

https://www.youtube.com/watch?v=BOOB4nlhTZ4
3•polymath88•49m ago•1 comments

Ups Cargo Plane Crashes in Kentucky

https://www.wsj.com/business/logistics/ups-cargo-plane-crashes-in-kentucky-1a199671
2•CSMastermind•52m ago•1 comments

Enabling Trillion-Parameter Models on AWS EFA

https://research.perplexity.ai/articles/enabling-trillion-parameter-models-on-aws-efa
2•tanelpoder•53m ago•0 comments

FDA described as a "clown show" amid latest scandal; top drug regulator is out

https://arstechnica.com/health/2025/11/fda-described-as-a-clown-show-amid-latest-scandal-top-drug...
33•duxup•1h ago•5 comments

UBS chair warns of 'looming systemic risk' from private credit ratings

https://www.ft.com/content/73ee8c6d-3c04-425e-9d2c-ecbf2f376a4f
3•moose_man•1h ago•0 comments

Why Crypto Can't Build Anything Long-Term

https://x.com/therosieum/article/1984987750647333350
4•salkahfi•1h ago•1 comments

How Much AI Spending Is Too Much? Investors Are Starting to Wonder

https://www.wsj.com/finance/stocks/how-much-ai-spending-is-too-much-investors-are-starting-to-won...
9•moose_man•1h ago•0 comments

Famous Method of Valuing Stocks Is Pointing Toward Some Rough Years Ahead

https://www.wsj.com/finance/investing/this-famous-method-of-valuing-stocks-is-pointing-toward-som...
3•moose_man•1h ago•1 comments

For a Literary Saint, Margaret Atwood Can Sure Hold a Grudge

https://www.nytimes.com/2025/11/01/books/review/margaret-atwood-book-of-lives-memoir.html
3•binning•1h ago•0 comments

Cleaning an orange iPhone 17 Pro with hydrogen peroxide turns it pink

https://www.pcmag.com/news/has-your-orange-iphone-17-turned-pink-turns-out-youre-to-blame
3•zdw•1h ago•0 comments

Trump reverses course to renominate billionaire Musk ally to lead NASA

https://www.theguardian.com/science/2025/nov/04/trump-jared-isaacman-nasa
8•foobarbecue•1h ago•1 comments

What's the deal with the popcorn button? [video]

https://www.youtube.com/watch?v=Limpr1L8Pss
1•Sir_Twist•1h ago•0 comments

Problems regulating emotions during pregnancy linked with perinatal depression

https://theconversation.com/problems-regulating-emotions-during-pregnancy-linked-with-perinatal-d...
3•binning•1h ago•0 comments

GenAI for Computing Careers: A Sunny Take

https://cacm.acm.org/blogcacm/genai-for-computing-careers-a-sunny-take/
2•tjr•1h ago•0 comments

Taliban ban books written by women from Afghan universities

https://www.bbc.co.uk/news/articles/c0kn7yyzrjgo
4•binning•1h ago•0 comments