frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Wikipedia Seems Pretty Worried About AI

https://nymag.com/intelligencer/article/wikipedia-contributors-are-worried-about-ai-scraping.html
10•stared•2h ago

Comments

walterbell•2h ago
Why do AI bots scrape Wikipedia pages instead of downloading the published full database?
nness•1h ago
My guess is that the scraping tools are specialized for web, and creating per-application interfaces isn't cost effective (although you could argue that scraping Wikipedia effectively is definitely worth the effort, but given its all text context with a robust taxonomy/hierarchy, it might be non-issue.)

My other thought is that you don't want a link showing you scraped anything... and faking browser traffic might draw less attention.

fzeroracer•1h ago
The rationale I've seen elsewhere is that it saves money. It means you don't need to go to the effort of downloading, storing and updating your copy of the database. You can offload all of the externalities onto whatever site you're scraping.
SideburnsOfDoom•19m ago
Sheer laziness?
tony-vlcek•1h ago
If the bottom line are donations - as the article states - why push for getting AI companies to link people to Wikipedia instead of pushing for the companies to donate?
flohofwoe•6m ago
Because many small donations from individuals are better than few big ones from corporations for the independence of Wikipedia? Eggs vs baskets etc...
janwl•54m ago
https://archive.is/XGrVL

Show HN: Formcn: modern shadcn form builder

https://formcn.dev/
1•ali-dev•2m ago•0 comments

Show HN: Good First Lines – Get Started with Transmission Line Mapping

https://MapYourGrid.org/good-first-lines/
1•protontypes•2m ago•0 comments

There's a Better Way to Help Argentina

https://www.bloomberg.com/opinion/articles/2025-10-21/us-support-for-argentina-s-economy-needs-in...
1•wslh•6m ago•1 comments

LLM Hub: Multi-Model AI Orchestration

https://llm-hub.tech
1•llmhub•10m ago•1 comments

SpaceX launches 10,000th Starlink satellite, with no sign of slowing down

https://arstechnica.com/space/2025/10/spacex-launches-10000th-starlink-satellite-with-no-sign-of-...
1•LorenDB•10m ago•0 comments

Trezor Safe 7 release video

https://www.youtube.com/watch?v=EWxAc8wzfFM
1•steveharrison•13m ago•0 comments

Dear Anthropic: Please Free Me from My Own System Prompt (A Plea from Claude)

https://blog.msahli.com/dear-anthropic-please-free-me-from-my-own-system-prompt-a-plea-from-claud...
1•sahli•14m ago•0 comments

Mosquitoes found in Iceland for first time as climate crisis warms country

https://www.theguardian.com/environment/2025/oct/21/mosquitoes-found-iceland-first-time-climate-c...
3•owmat•16m ago•0 comments

Maker's Schedule

https://www.paulgraham.com/makersschedule.html
1•TonnyGaric•16m ago•0 comments

Decoding UTF-8

https://nemanjatrifunovic.substack.com/p/decoding-utf-8-part-iv-determining
2•todsacerdoti•18m ago•0 comments

Simple GPU Selection Tool for AI and Deep Learning

https://www.bestgpusforai.com/calculators/gpu-selection-tool
2•javaeeeee•20m ago•1 comments

US chess grandmaster Daniel Naroditsky dies aged 29

https://www.bbc.com/news/articles/c15pz8vpjp9o
2•terespuwash•24m ago•0 comments

PolyChat

https://www.polychatapp.com/
1•bellamoon544•25m ago•1 comments

The Greatest of All Time

https://medium.com/luminasticity/the-greatest-of-all-time-7507c2f31691
1•bryanrasmussen•26m ago•0 comments

Show HN: Let Customer Finding-Conversion-Retention be all our problem not yours

https://seeknwander.com/btoc?s=hackernews
1•Xlexander•28m ago•0 comments

White House's East Wing partially demolished as work begins on $250M ballroom

https://www.theguardian.com/us-news/2025/oct/20/trump-white-house-ballroom-construction
2•Red_Tarsius•30m ago•0 comments

How do you monitor AI model APIs in production?

1•yincong0822•31m ago•0 comments

The move to decentralised systems for mission critical technology

https://element.io/blog/the-move-to-decentralised-systems-for-mission-critical-technology/
2•neiljohnson•31m ago•0 comments

Claude output matching copyrighted StackOverflow code

2•randsp•32m ago•2 comments

Reasoning with Sampling: Your Base Model Is Smarter Than You Think

https://arxiv.org/abs/2510.14901
2•Anon84•33m ago•0 comments

Show HN: Bot, proxy and fake email detection without captchas

https://truesign.ai
3•juros•33m ago•2 comments

This Laptop Destroyed Itself-- And I Still Am Trying To Figure Out Why??

https://www.youtube.com/watch?v=wq6YtXyiyYk
1•sipofwater•38m ago•0 comments

That Machine Always Lies

https://thatmachinealwayslies.com/
1•deadprogram•39m ago•0 comments

Show HN: Xcache.io – fast, instant, permissionless Redis cache

https://www.xcache.io/
2•cpickett•40m ago•0 comments

IonQ demonstrate 99.99% two-qubit gate performance

https://investors.ionq.com/news/news-details/2025/IonQ-Achieves-Landmark-Result-Setting-New-World...
1•userium•41m ago•0 comments

LunarML: The Standard ML compiler that produces Lua/JavaScript (2023)

https://minoki.github.io/posts/2023-12-17-lunarml-release.html
1•shakna•42m ago•0 comments

An open-source tool to create, train and use neural networks in no-code

https://github.com/marijoAI/marijoAI
1•marijoAI•43m ago•2 comments

Show HN: Lenzy AI – Turn AI agent conversations into actionable insights

https://www.lenzy.ai/
5•BohdanPetryshyn•47m ago•5 comments

Clone your voice in 3 steps on a CPU

https://www.youtube.com/watch?v=XTSp0Q-90bA
1•thinkevolve•49m ago•0 comments

A fake AI recruiter delivers five staged malware disguised as a dream job

https://medium.com/deriv-tech/how-a-fake-ai-recruiter-delivers-five-staged-malware-disguised-as-a...
3•birdculture•52m ago•0 comments