Show HN: AI-First Web – SEO for AI Assistants

1•kure256•2h ago

More and more users don’t browse — they ask AI assistants. But most websites are still built only for human eyes, not for models that need structure, semantics, and metadata to understand content reliably.

AI-First Web explores what “SEO for AI” looks like: how to structure a website so that AI assistants can actually read it, interpret it, and cite it correctly. A well-structured, semantic, machine-readable site increases the chance that your page becomes the cited source for an AI-generated answer.

The project is very early and still evolving — I’d love feedback from people experimenting with how LLMs parse HTML, JSON-LD, and web structure in the real world.

Docs: https://ai-first-guides.github.io/first.ai/

GitHub: https://github.com/ai-first-guides/first.ai

Comments

kure256•2h ago

A small additional note for context:

I’m not arguing that “LLMs will replace browsing” in some absolute way — but it is observable that for many users, the entry point for information is shifting from search → assistant. When you actually inspect how models consume real websites today, the results are pretty uneven:

pages with clean HTML and predictable structure get parsed reliably

JSON-LD is used surprisingly often (but only if it’s correct and minimal)

heavy client-side rendering breaks extraction more than people expect

semantic markup still beats any “AI-enabled” tool by a mile

models hallucinate less when the source has clear hierarchy and meaning

This project isn’t trying to reinvent SEO — it’s more like exploring the minimum structural guarantees that make an LLM treat a page as a trustworthy, cite-able source instead of ignoring it or misreading it.

If anyone here has done experiments with:

how GPT, Claude, Gemini, Llama, etc. read arbitrary web pages

failure cases in parsing / hallucination caused by layout

the effect of metadata vs full-text signal

or even prompt strategies for web ingestion

…I’d genuinely love to compare notes.

Pangram – AI Detection that works

Intel's next-gen Granite Rapids-WS server CPU lineup leaked

'Buy Now, Pay Later' is expanding fast, and that should worry everyone

Ask HN: What's the Least Amount of Process a Small Team Can Get Away With?

WebAssembly Limitations

Interesting websites I found on the internet

Wheels of life dashboard – from my deprecated journaling-via-email project

White nationalist talking points and racial pseudoscience: welcome to Grokipedia

From Napster to Spotify: How Music Piracy Shaped the Subscription Economy

Against Exponential Backoff

Google is collecting troves of data from downgraded Nest thermostats

Show HN: Blue Divide – Nurikabe puzzles with procgen islands for Mac and iPad

Live most polluted major city ranking

Show HN: Learn Docker in your terminal with exercises

Dave's Garage PDP-11 BBS Menu System

I Built a Chrome Extension Loved by over 9k Parents

Tabstack.ai by Mozilla

AI is guzzling energy for slop content

Polarized Subtyping [pdf]

US startup Substrate announces chipmaking tool that it says will rival ASML

All Your Coworkers Are Probabilistic Too

Show HN: An AI nutrition coach you can text on iMessage/WhatsApp

Two years later: again at the web summit

Is Looking Ahead the Same as Guessing?

AI Is Making Big Tech Weaker

Show HN: I made a simple online interval timer for HIIT

Largest deposit holds 99.999% of all gold on Earth

Jeff Bezos to co-lead AI startup in first operational role since Amazon

Operation Chrome Dome

GCC 16 considering changing default to C++20