frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

You can now play Grand Theft Auto Vice City in the browser

https://dos.zone/grand-theft-auto-vice-city/
103•Alifatisk•1h ago•28 comments

TP-Link Tapo C200: Hardcoded Keys, Buffer Overflows and Privacy

https://www.evilsocket.net/2025/12/18/TP-Link-Tapo-C200-Hardcoded-Keys-Buffer-Overflows-and-Priva...
123•sibellavia•2h ago•24 comments

Garage – An S3 object store so reliable you can run it outside datacenters

https://garagehq.deuxfleurs.fr/
301•ibobev•4h ago•60 comments

Mistral OCR 3

https://mistral.ai/news/mistral-ocr-3
63•pember•1d ago•2 comments

GotaTun -- Mullvad's WireGuard Implementation in Rust

https://mullvad.net/en/blog/announcing-gotatun-the-future-of-wireguard-at-mullvad-vpn
457•km•9h ago•99 comments

Amazon will allow ePub and PDF downloads for DRM-free eBooks

https://www.kdpcommunity.com/s/article/New-eBook-Download-Options-for-Readers-Coming-in-2026?lang...
436•captn3m0•10h ago•237 comments

Vm.overcommit_memory=2 is always the right setting for servers

https://ariadne.space/2025/12/16/vmovercommitmemory-is-always-the-right.html
23•signa11•2d ago•21 comments

Show HN: Stickerbox, a kid-safe, AI-powered voice to sticker printer

https://stickerbox.com/
10•spydertennis•49m ago•7 comments

The FreeBSD Foundation's Laptop Support and Usability Project

https://github.com/FreeBSDFoundation/proj-laptop
102•mikece•5h ago•40 comments

Believe the Checkbook

https://robertgreiner.com/believe-the-checkbook/
72•rg81•4h ago•30 comments

Where Is GPT in the Chomsky Hierarchy?

https://fi-le.net/chomsky/
38•fi-le•4d ago•30 comments

Performance Hints – Jeff Dean and Sanjay Ghemawat

https://abseil.io/fast/hints.html
13•alphabetting•1h ago•0 comments

Reverse Engineering US Airline's PNR System and Accessing All Reservations

https://alexschapiro.com/security/vulnerability/2025/11/20/avelo-airline-reservation-api-vulnerab...
51•bearsyankees•2h ago•23 comments

8-bit Boléro

https://linusakesson.net/music/bolero/index.php
14•Aissen•8h ago•2 comments

Graphite Is Joining Cursor

https://cursor.com/blog/graphite
90•fosterfriends•4h ago•130 comments

Detailed balance in large language model-driven agents

https://arxiv.org/abs/2512.10047
5•Anon84•3d ago•0 comments

Rust's Block Pattern

https://notgull.net/block-pattern/
28•zdw•15h ago•5 comments

Lite^3, a JSON-compatible zero-copy serialization format

https://github.com/fastserial/lite3
83•cryptonector•6d ago•27 comments

NOAA deploys new generation of AI-driven global weather models

https://www.noaa.gov/news-release/noaa-deploys-new-generation-of-ai-driven-global-weather-models
14•hnburnsy•1d ago•0 comments

Show HN: TinyPDF – 3kb pdf library (70x smaller than jsPDF)

https://github.com/Lulzx/tinypdf
19•lulzx•1d ago•3 comments

Show HN: MCPShark Viewer (VS Code/Cursor extension)- view MCP traffic in-editor

19•mywork-dev•2d ago•0 comments

Show HN: I Made Loom for Mobile

https://demoscope.app
37•admtal•3h ago•27 comments

Prepare for That Stupid World

https://ploum.net/2025-12-19-prepare-for-that-world.html
120•speckx•3h ago•69 comments

We pwned X, Vercel, Cursor, and Discord through a supply-chain attack

https://gist.github.com/hackermondev/5e2cdc32849405fff6b46957747a2d28
1077•hackermondev•1d ago•394 comments

Building a Transparent Keyserver

https://words.filippo.io/keyserver-tlog/
42•noident•5h ago•14 comments

Wall Street Ruined the Roomba and Then Blamed Lina Khan

https://www.thebignewsletter.com/p/how-wall-street-ruined-the-roomba
84•connor11528•1h ago•49 comments

Show HN: Stepped Actions – distributed workflow orchestration for Rails

https://github.com/envirobly/stepped
71•klevo•5d ago•10 comments

1.5 TB of VRAM on Mac Studio – RDMA over Thunderbolt 5

https://www.jeffgeerling.com/blog/2025/15-tb-vram-on-mac-studio-rdma-over-thunderbolt-5
565•rbanffy•22h ago•208 comments

Prompt caching for cheaper LLM tokens

https://ngrok.com/blog/prompt-caching/
242•samwho•3d ago•57 comments

History LLMs: Models trained exclusively on pre-1913 texts

https://github.com/DGoettlich/history-llms
706•iamwil•21h ago•346 comments
Open in hackernews

Why we're taking legal action against SerpApi's unlawful scraping

https://blog.google/technology/safety-security/serpapi-lawsuit/
34•xnx•2h ago

Comments

Nextgrid•2h ago
> SerpApi deceptively takes content that Google licenses from others

They have a different definition of "licensing" than most people I guess. Aren't site operators complaining about Google using this "licensed" content in AI overviews... not to mention the scraping for AI model training.

The pot is calling the kettle black.

skybrian•1h ago
As far as I know, Google respects robots.txt and doesn't obfuscate their crawlers, so you can easily block them if you want. It seems like an important distinction?
immibis•1h ago
There's no law that says you have to do that. It used to be a sensible thing to do, in the early internet. In the current internet, obeying robots.txt is a self-handicap and you shouldn't do it.

DDoS remains illegal regardless of robots.txt.

Nextgrid•1h ago
Google can afford to respect robots.txt because it has a monopoly on search and nobody would consider actually blocking them in said robots.txt anyway.

SerpApi doesn't have that privilege.

bitpush•58m ago
but SerpApi is not scraping websites, it is sending malicoius requests to google.com.
Nextgrid•41m ago
SerpApi is scraping Google. The "maliciousness" if the requests is a matter of perspective. Of course Google considers it malicious; that doesn't necessarily make it true.
xnx•15m ago
Google has respected robots.txt from the start.
throw-12-16•1h ago
robots.txt is not a legally binding document, nobody needs to actually respect it
jppope•1h ago
I'm not sure of the legality but I definitely appreciate their product. This lawsuit seems odd because google themselves scrape content for their indexes. From what I see SerpApi is really just providing a machine interface that Google themselves refuses to provide users and visibility into SERPs which is also something that users should have available to them.

I'm probably just being naive though...

bluGill•1h ago
Google publishes how to control their bot - with robots.txt. They then obey those instructions. Google also takes some effort to not use all your bandwidth. Google isn't perfect, but they are at least making a "good faith" effort to be nice and this does count in court. Overall most will agree that in general what google does to allow people to find their website is worth the things that google is doing.

You can of course argue a lot of edge cases if you really want. For the most part I want to say "it isn't worth the argument". In some cases I will take your side if I really have to think about it, but in general the system google has been using mostly works and is mostly an acceptable compromise.

hackerbeat•1h ago
What's nice about scraping all the content for their own good while killing off websites left and right? Google needs to be sued also.

Along with all the other AI companies out there, the've committed the biggest theft in human history.

pawelduda•1h ago
But their robots are enabled by default. So it is a form of unsolicited scraping. If I spam millions of email addresses without asking for permission but provide a link to opt-out form, am I the good guy?
kacesensitive•1h ago
SerpApi wouldn't even be a thing if Google offered an equivalent API...
AuthError•1h ago
why does google need to offer it?
bitpush•1h ago
Why would Google offer an API? This is similar to saying when Apple sues an employee stealing IP "Nobody would steal the IP if they gave it away for free". The question is - why?
sovietmudkipz•1h ago
What’s the difference between scraping and malicious scraping? Does google engage in scraping or malicious scraping? Do the AI companies engage in scraping or malicious scraping?
throw-12-16•1h ago
The size of your legal team.
bakugo•1h ago
Malicious scraping is when people other than them do it. When they scrape the internet to train their AI, it's "lawful" because they said so.
jchw•1h ago
Note that I am not defending the merits of Google's lawsuit, but they did describe in this very post what they believe distinguishes their scraping versus SerpApi.

> Stealthy scrapers like SerpApi override those directives and give sites no choice at all. SerpApi uses shady back doors — like cloaking themselves, bombarding websites with massive networks of bots and giving their crawlers fake and constantly changing names — circumventing our security measures to take websites’ content wholesale. [...] SerpApi deceptively takes content that Google licenses from others (like images that appear in Knowledge Panels, real-time data in Search features and much more), and then resells it for a fee. In doing so, it willfully disregards the rights and directives of websites and providers whose content appears in Search.

To me this seems... interesting, for sure. I think that Google already set a bad precedent by pulling content from the web directly into its results, and an even worse one by paying websites with user-generated content for said content (while those sites didn't pay the users that actually made the user-generated content, as an additional bitchslap.)

But it seems like at the very least Google is suggesting that SerpApi is effectively trying to "steal" the work Google did, rather than do the same work themselves. Though I wonder if this is really Google pulling up the ladder behind them a bit, given how privileged of a position they are in with regards to web scraping.

It's a tough case. I think that something does need to ultimately be done about "malicious" web scraping that ignores robots.txt, but traditionally that sort of thing did not violate any laws, and I feel somewhat skeptical that it will be found to violate the law today. I mean, didn't LinkedIn try this same thing?

moralestapia•1h ago
>bombarding websites with massive networks of bots

Like GoogleBot?

And yeah, robots.txt is not enforced by any law.

I think this is just about dragging SerpApi through a lengthy legal procedure and fees.

jefftk•1h ago
Whether you obey robots.txt (Google does, SerpApi doesn't) seems like an important distinction.
xnx•1h ago
Permission
GuinansEyebrows•1h ago
yoink

* that's the sound of a ladder being yanked up

throw-12-16•1h ago
Google can eat a bag of dicks.

Their entire ai model was scraped.

AstroBen•1h ago
> Defendant SerpApi, LLC (“SerpApi”) offers services that “scrape” this copyrighted content and more from Google, using deceptive means to automatically access and take it for free at an astonishing scale and then offering it to various customers for a fee. In doing so, SerpApi acquires for itself the valuable product of Google’s labors and investment in the content, and denies Google’s partners compensation for their works

this has to be satire. Is Google not the #1 entity guilty of exactly this?

jefftk•1h ago
No, Google doesn't use deceptive means. They identify their crawler as GoogleBot, and obey robots.txt.
Nextgrid•1h ago
Google doesn't have to do that now after already having established its own monopoly... just like SerpApi wouldn't have to act deceptively if they had a monopoly on search.
jppope•1h ago
What about for their LLM products? We know that OpenAi does not respect the robots.txt file
xnx•16m ago
Google uses the same crawler and robots.txt file for training data.
AstroBen•1h ago
Because they've forced everyone to allow them. They're the internet traffic mafia. Block them and you disappear from the internet

They abuse this power to scrape your work, summarize it and cut you out as much as possible. Pure value extraction of others' work without equal return. Now intensified with AI

But yeah, you're right. They're not deceptive

bitpush•11m ago
> Because they've forced everyone to allow them.

nobody is forcing anyone. This is the same argument that people said about google search. Nobody is forcing anyone to use google search, google chrome, or even allow googlebot for scraping.

Thousands of poeple have switched over to chatgpt, brave/firefox ..

Your argument sounds like "I dont like Apple's practices, and I'm forced to buy iPhones. No buddy, if you dont like Apple, dont buy their products"

ekjhgkejhgk•1h ago
Disgusting behavior by google. Scraping is google's whole business.

And then pretending that they're fighting for other people's copyright is just the cherry on top of the pile of hypocrisy.

SilverElfin•1h ago
Google scrapes so what even is this? Beyond that I think it is unreasonable and monopolistic that Google can use all this data (like YouTube) to bolster their AI products but no one else can. It just means the megacorp will keep being megacorp and smaller players are doomed to have to work much harder and get very lucky. It’s not fair competition. So I view scraping Google as necessary for our society.
bitpush•1h ago
From the filing

> SerpApi’s answer to SearchGuard is to mask the hundreds of millions of automated queries it is sending to Google each day to make them appear as if they are coming from human users. SerpApi’s founder recently described the process as “creating fake browsers using a multitude of IP addresses that Google sees as normal users.”

observationist•1h ago
This is why I stopped using google wherever possible - they pushed the frontier of useful fair use and copyright precedents and established that things on the public internet displayed to the public without a login mechanism are fair game for scraping. The US supreme court ruled that you have to incorporate authentication and not simply serve your content to the public internet if you want to restrict usage.

Then they bend over backwards and do the "but not like that!" crap with their legal team and swing their wealth and influence around to screw over other companies and people, and a vast majority of it just vanishes, gets memory holed, with NDAs and out of court settlements, so you never get to see the full scope of harm they inflict unless you're watching like a hawk and catch the headlines before they get disappeared.

Google needs to be broken up and we need to legislate the dismantling of the current adtech regime, with a privacy and sovereignty respecting digital bill of rights that puts the interests of individual citizens above that of giant corporate blobs and the mass surveillance data industry.

ChrisArchitect•27m ago
Related:

Reddit Accuses 'Data Scraper' Companies of Stealing Its Information

https://news.ycombinator.com/item?id=45695433

Our Response to Reddit, Inc. vs. SerpApi, LLC: Defending the First Amendment

https://news.ycombinator.com/item?id=45739889