frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I prompted ChatGPT, Claude, Perplexity, and Gemini and watched my Nginx logs

https://surfacedby.com/blog/nginx-logs-ai-traffic-vs-referral-traffic
84•startages•1h ago

Comments

realaccfromPL•1h ago
Looks like a very fun exercise, I will try it out as well, thanks for the idea!
dawolf-•1h ago
So for the user-agent "ChatGPT-User" I can return my prompt injection text. Got it.
hajimuz•1h ago
I’m curious about the header of their requests. Something like any one of them is using text/markdown accept header?
startages•37m ago
Added $http_accept and re-ran. None of them use text/markdown. Results:

ChatGPT-User/1.0 text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,/;q=0.8,application/signed-exchange;v=b3;q=0.9 Claude-User/1.0 / Perplexity-User/1.0 (empty, no Accept header) PerplexityBot/1.0 (empty, no Accept header) ChatGPT sends a Chrome-style Accept string. Claude sends a wildcard. Perplexity sends nothing at all. Gemini didn't fetch in my test.

Also worth noting: Claude-User hit /robots.txt before the page.

dalton_zk•1h ago
You're not burning money?
shermantanktop•1h ago
This article is absolutely jammed with AI tells. Not this, but that. Here's why X matters. This matters more than that.

The content is interesting, but it's delivered in an article that smells like slop.

nryoo•1h ago
So the state of AI in 2026: ChatGPT DDoS-lite, Claude the polite one that actually reads the rules, Perplexity maybe shows up, and Google was already in your house.
lambda•50m ago
Gah, the writing on this is so painful to read, it feels like this was most likely written by an LLM.

The writing style is so unclear, it's hard to figure out one of the key points: it mentions that Gemini doesn't use a distinct user-agent for its grounding. It doesn't mention whether it actually hit the endpoint during the test, though it kind of implies that with "Silence from Google is not evidence of no fetch." Uh, if there are no requests coming in live, that means no fetch, it's using a cache of your site.

It makes a difference whether it fetches a page live, or whether it's using a cached copy from a previous crawl; that tells you something about how up-to-date answers are going to be from people asking questions about your website from Gemini. But I guess the LLM writing this article just wanted to make things sound punchy an impressive, not actually communicate useful information.

Anyhow, LLM marketing spam from an LLM marketing spam company. Bleh.

anygivnthursday•35m ago
I had to quit after a couple of paragraphs, I cant read such AI slop anymore :(
startages•32m ago
I did use AI to organize my ideas but I didn't think it was that bad, I'll modify and make it easier to read.

Anyway, in my test I saw zero requests from any Google UA after multiple Gemini and AI mode prompts that should have triggered grounding, so the working interpretation is that Gemini served from its own index/cache rather than doing a live provider-side fetch. The original phrasing was fuzzier than it should have been.

realo•13m ago
Sometimes when we point the moon to people they prefer to discuss at length about the finger.

Don't worry.

cruffle_duffle•40m ago
I wish debates about “ai scraping my site” had more nuance.

There are multiple ways these tools access your site and only one of them is “using it for training”. Others are webfetch from chat sessions, “deep research” agents, etc. And those will have different traffic patterns. They aren’t crawlers, they are clumsy, ham handed AI agents doing their humans bidding.

Both can give a site the hug of death. Both can be badly coded. But there is much different intent behind the two and I feel it is important to acknowledge the difference.

ctime•39m ago
Does smack of AI ness

The IPs listed in the output are from reserved ranges as well, like they were intentionally obfuscated (but this was not shared with the reader).

It’s the kind of obfuscation that AI would do (using esoteric bogon ranges as well)

https://ipinfo.io/ips/203.0.113.0/24

We Accepted Surveillance as Default

https://vivianvoss.net/blog/why-we-accepted-surveillance
39•speckx•36m ago•10 comments

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

https://qwen.ai/blog?id=qwen3.6-max-preview
228•mfiguiere•3h ago•131 comments

Atlassian Enables Default Data Collection to Train AI

https://letsdatascience.com/news/atlassian-enables-default-data-collection-to-train-ai-f71343d8
283•kevcampb•4h ago•66 comments

Deezer says 44% of songs uploaded to its platform daily are AI-generated

https://techcrunch.com/2026/04/20/deezer-says-44-of-songs-uploaded-to-its-platform-daily-are-ai-g...
87•FiddlerClamp•1h ago•77 comments

GitHub's Fake Star Economy

https://awesomeagents.ai/news/github-fake-stars-investigation/
502•Liriel•8h ago•274 comments

Bloom (YC P26) Is Hiring

https://www.ycombinator.com/companies/trybloom/jobs
1•RayFitzgerald•10m ago

ggsql: A Grammar of Graphics for SQL

https://opensource.posit.co/blog/2026-04-20_ggsql_alpha_release/
182•thomasp85•4h ago•46 comments

10 years ago, someone wrote a test for servo that included an expiry in 2026

https://mastodon.social/@jdm_/116429380667467307
94•luu•21h ago•59 comments

All phones sold in the EU to have replaceable batteries from 2027

https://www.theolivepress.es/spain-news/2026/04/20/eu-to-force-replaceable-batteries-in-phones-an...
534•ramonga•3h ago•390 comments

I prompted ChatGPT, Claude, Perplexity, and Gemini and watched my Nginx logs

https://surfacedby.com/blog/nginx-logs-ai-traffic-vs-referral-traffic
88•startages•1h ago•14 comments

Sauna effect on heart rate

https://tryterra.co/research/sauna-effect-on-heart-rate
239•kyriakosel•3h ago•136 comments

M 7.4 earthquake – 100 km ENE of Miyako, Japan

https://earthquake.usgs.gov/earthquakes/eventpage/us6000sri7/
183•Someone•7h ago•78 comments

Chernobyl's last wedding: The couple who married as a nuclear disaster unfolded

https://www.bbc.com/news/articles/c0q92lx8q75o
19•1659447091•1d ago•3 comments

WebUSB Extension for Firefox

https://github.com/ArcaneNibble/awawausb
96•tuananh•5h ago•81 comments

Palantir Wants to Reinstate the Draft

https://reason.com/2026/04/20/this-big-tech-firm-wants-to-reinstate-the-draft/
99•tcp_handshaker•51m ago•57 comments

Kimi K2.6: Advancing Open-Source Coding

https://www.kimi.com/blog/kimi-k2-6
171•meetpateltech•1h ago•71 comments

Larry Tesler: A Personal History of Modeless Text Editing and Cut/Copy-Paste (2012)

https://dl.acm.org/doi/epdf/10.1145/2212877.2212896
6•aragonite•3d ago•1 comments

OpenClaw isn't fooling me. I remember MS-DOS

https://www.flyingpenguin.com/build-an-openclaw-free-secure-always-on-local-ai-agent/
189•feigewalnuss•9h ago•226 comments

I'm never buying another Kindle, and neither should you

https://www.androidauthority.com/amazon-kindle-2026-3657863/
70•mikhael•1h ago•53 comments

Ask HN: How to solve the cold start problem for a two-sided marketplace?

83•alegd•3h ago•83 comments

Focused microwaves allow 3D printers to fuse circuits onto almost anything

https://newatlas.com/electronics/meta-nfc-focused-microwaves-circuits/
112•breve•2d ago•21 comments

NSA is using Anthropic's Mythos despite blacklist

https://www.axios.com/2026/04/19/nsa-anthropic-mythos-pentagon
344•Palmik•7h ago•254 comments

What if database branching was easy?

https://xata.io/blog/what-if-database-branching-was-easy
56•tee-es-gee•2d ago•35 comments

Up to 8M Bees Are Living in an Underground Network Beneath This Cemetery

https://www.discovermagazine.com/up-to-8-million-bees-are-living-in-an-underground-network-beneat...
140•janandonly•2d ago•22 comments

IPC medley: message-queue peeking, io_uring, and bus1

https://lwn.net/Articles/1065490/
26•signa11•3d ago•0 comments

SDF Public Access Unix System

https://sdf.org/?ssh
149•neehao•1d ago•74 comments

Show HN: Alien – Self-hosting with remote management (written in Rust)

16•alongub•1h ago•3 comments

I Made the "Next-Level" Camera and I love it

https://thelibre.news/i-made-the-next-level-camera-and-i-love-it/
185•ndr•3d ago•64 comments

Epicycles All the Way Down (2025)

https://www.strangeloopcanon.com/p/epicycles-all-the-way-down
31•surprisetalk•4d ago•13 comments

Claude Token Counter, now with model comparisons

https://simonwillison.net/2026/Apr/20/claude-token-counts/
187•twapi•16h ago•73 comments