frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: We're tracking AI bot visits daily across our network

2•legitcoders•3mo ago
Hi HN,

Since launching LLMS Central (https://llmscentral.com) a few months ago, we're now tracking hundreds of AI bot visits daily across our network. The data is fascinating.

### What We're Seeing

*Daily Bot Traffic (Across Our Network):* - 300-500+ AI bot visits per day - GPTBot (ChatGPT) dominates at ~60% of traffic - Claude, Perplexity, and Google's AI bots make up most of the rest - Peak crawling hours: 2-4 AM UTC (training runs?)

*Real Patterns Emerging:* - Technical documentation gets 5x more AI bot traffic than average content - Blog posts with code examples are crawled 3x more frequently - Sites with llms.txt files see 40% more organized crawling - Most sites have zero visibility into AI bot activity

*Surprising Findings:* 1. AI bots are WAY more active than most people realize 2. They're not just training - they're actively crawling for real-time answers 3. Different bots have different content preferences (Claude likes long-form, Perplexity loves news) 4. Traditional analytics completely miss this traffic

### Technical Details

*Stack:* - Next.js 15 (App Router) - Firebase Firestore for analytics - 2KB tracking script (async, zero perf impact) - Real-time user-agent detection + IP verification

*Bot Detection:* - User-agent parsing (GPTBot, Claude-Web, etc.) - IP range verification (OpenAI, Anthropic, Google) - Behavioral analysis (crawl patterns) - 99%+ accuracy

*Privacy:* - No PII collected - GDPR compliant - Users control data retention - Open source tracking script (coming soon)

### Why I Built This

I noticed my technical blog posts were getting cited by ChatGPT, but Google Analytics showed nothing. Turns out AI bots don't show up in traditional analytics because they're not "users" - they're crawlers.

After manually parsing server logs for weeks, I realized: 1. This should be automated 2. There should be a standard for AI bot permissions (like robots.txt) 3. Sites need visibility into which AI systems are using their content

So I built LLMS Central - both a tracking platform AND a centralized repository for llms.txt files (the proposed standard for AI bot permissions).

### Features

1. *Real-time bot tracking* - See which AI crawlers visit your site 2. *Page-level analytics* - Know which pages AI bots prefer 3. *AEO scoring* - Measure Answer Engine Optimization (like SEO, but for AI) 4. *Multi-engine preview* - See how ChatGPT vs Claude would cite your content 5. *llms.txt generator* - Like robots.txt, but for AI (proposed standard)

### Try It

*Preview tool (no signup):* https://llmscentral.com/aeo-preview

*Full tracking (free tier):* https://llmscentral.com/dashboard

### The Data Keeps Growing

What started as a personal project is now tracking hundreds of domains. Every day we see: - New AI bots appearing (just detected Meta's AI crawler last week) - Crawling patterns evolving (bots are getting smarter about what they crawl) - Sites realizing they have zero visibility into AI usage of their content

The most common reaction: "I had no idea ChatGPT was crawling my site this much."

### Questions

1. Should there be a standard for AI bot permissions (like robots.txt)? We're pushing llms.txt, but curious about alternatives. 2. How should sites monetize AI training data? Or should they? 3. Is "Answer Engine Optimization" (AEO) the future of SEO? 4. What data would YOU want to see about AI bot traffic?

Would love HN's feedback on the technical approach, privacy considerations, and what data would be most valuable to track.

Los Alamos Primer

https://blog.szczepan.org/blog/los-alamos-primer/
1•alkyon•39s ago•0 comments

NewASM Virtual Machine

https://github.com/bracesoftware/newasm
1•DEntisT_•2m ago•0 comments

Terminal-Bench 2.0 Leaderboard

https://www.tbench.ai/leaderboard/terminal-bench/2.0
1•tosh•3m ago•0 comments

I vibe coded a BBS bank with a real working ledger

https://mini-ledger.exe.xyz/
1•simonvc•3m ago•1 comments

The Path to Mojo 1.0

https://www.modular.com/blog/the-path-to-mojo-1-0
1•tosh•6m ago•0 comments

Show HN: I'm 75, building an OSS Virtual Protest Protocol for digital activism

https://github.com/voice-of-japan/Virtual-Protest-Protocol/blob/main/README.md
4•sakanakana00•9m ago•0 comments

Show HN: I built Divvy to split restaurant bills from a photo

https://divvyai.app/
3•pieterdy•11m ago•0 comments

Hot Reloading in Rust? Subsecond and Dioxus to the Rescue

https://codethoughts.io/posts/2026-02-07-rust-hot-reloading/
3•Tehnix•12m ago•1 comments

Skim – vibe review your PRs

https://github.com/Haizzz/skim
2•haizzz•13m ago•1 comments

Show HN: Open-source AI assistant for interview reasoning

https://github.com/evinjohnn/natively-cluely-ai-assistant
4•Nive11•14m ago•5 comments

Tech Edge: A Living Playbook for America's Technology Long Game

https://csis-website-prod.s3.amazonaws.com/s3fs-public/2026-01/260120_EST_Tech_Edge_0.pdf?Version...
2•hunglee2•17m ago•0 comments

Golden Cross vs. Death Cross: Crypto Trading Guide

https://chartscout.io/golden-cross-vs-death-cross-crypto-trading-guide
2•chartscout•20m ago•0 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
3•AlexeyBrin•23m ago•0 comments

What the longevity experts don't tell you

https://machielreyneke.com/blog/longevity-lessons/
2•machielrey•24m ago•1 comments

Monzo wrongly denied refunds to fraud and scam victims

https://www.theguardian.com/money/2026/feb/07/monzo-natwest-hsbc-refunds-fraud-scam-fos-ombudsman
3•tablets•29m ago•1 comments

They were drawn to Korea with dreams of K-pop stardom – but then let down

https://www.bbc.com/news/articles/cvgnq9rwyqno
2•breve•31m ago•0 comments

Show HN: AI-Powered Merchant Intelligence

https://nodee.co
1•jjkirsch•34m ago•0 comments

Bash parallel tasks and error handling

https://github.com/themattrix/bash-concurrent
2•pastage•34m ago•0 comments

Let's compile Quake like it's 1997

https://fabiensanglard.net/compile_like_1997/index.html
2•billiob•34m ago•0 comments

Reverse Engineering Medium.com's Editor: How Copy, Paste, and Images Work

https://app.writtte.com/read/gP0H6W5
2•birdculture•40m ago•0 comments

Go 1.22, SQLite, and Next.js: The "Boring" Back End

https://mohammedeabdelaziz.github.io/articles/go-next-pt-2
1•mohammede•46m ago•0 comments

Laibach the Whistleblowers [video]

https://www.youtube.com/watch?v=c6Mx2mxpaCY
1•KnuthIsGod•47m ago•1 comments

Slop News - The Front Page right now but it's only Slop

https://slop-news.pages.dev/slop-news
1•keepamovin•51m ago•1 comments

Economists vs. Technologists on AI

https://ideasindevelopment.substack.com/p/economists-vs-technologists-on-ai
1•econlmics•54m ago•0 comments

Life at the Edge

https://asadk.com/p/edge
4•tosh•59m ago•0 comments

RISC-V Vector Primer

https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md
4•oxxoxoxooo•1h ago•1 comments

Show HN: Invoxo – Invoicing with automatic EU VAT for cross-border services

2•InvoxoEU•1h ago•0 comments

A Tale of Two Standards, POSIX and Win32 (2005)

https://www.samba.org/samba/news/articles/low_point/tale_two_stds_os2.html
4•goranmoomin•1h ago•0 comments

Ask HN: Is the Downfall of SaaS Started?

4•throwaw12•1h ago•0 comments

Flirt: The Native Backend

https://blog.buenzli.dev/flirt-native-backend/
3•senekor•1h ago•0 comments