frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Launch HN: Sitefire (YC W26) – Automating actions to improve AI visibility

19•vincko•1h ago
Hi HN! We're Vincent and Jochen from sitefire (https://sitefire.ai). Our platform makes it easy for brands to improve their visibility in AI search.

We’ve been working together for years and have backgrounds in RL/optimization at Stanford and software engineering. We came to this idea after speaking with marketing teams who were seeing declining traffic due to Google’s AI Overviews and didn’t know what to do.

This space can feel esoteric. Many case studies, few actual studies. Constant battle against myths (e.g. you need a llms.txt vs. you don't need a llms.txt) and "GEO hacks". We try to be more data-driven. And we try to be more bold and build a system that not only monitors, but actually improves traffic from AI search.

While Google performs a single search, AI search engines expand the user prompt into 3-10 fan-out queries. The sourced pages are ranked using a classified algorithm similar to Reciprocal Rank Fusion (RFF). Finally, the LLMs skim the pages and decide what snippets to cite. Our goal is making sure brands have the right content that makes it through this funnel.

Here is how sitefire works:

- The user defines a set of prompts they want to monitor. These are synthetic prompts - we generate them based on SEO keywords and their monthly search volume.

- We submit these prompts to ChatGPT, Gemini, Google AI Mode, etc. on a daily basis and capture the answers. We extract fan-out queries, sourced pages, citations, and brand mentions.

- For each topic, our agents analyze which web pages are sourced and cited the most, and why. They also consider similar pages that you already have.

- Based on the diagnosis, our content agents draft improvements or create new pages, and push them directly to the client’s CMS.

- We integrate with the client’s network logs and Google Analytics to monitor the increase in AI bot requests and human referrals to their page.

This system is continuously updated, so it always shows which content works, and how to adapt the existing sitemap. For one client that used sitefire to optimize their blog, the AI-optimized articles increased their AI bot requests from ~200/day to ~570/day within ten days.

A risk we recognize is that AI-generated content is filling brands’ websites with slop. Whilst it’s still early days and we don’t claim to have figured everything out yet, our intention is to mitigate this by focusing the content on specific, unique information: real product capabilities, real pricing, honest comparisons. The clients still review every page before it goes live, so they can ensure the content is true to their brand.

Some clients use our platform themselves. For others we act more like an agency, automating steps as we go. The goal is for sitefire to run mostly on its own, with clients approving changes via Slack, Claude or their CMS.

Here's a video demo: https://screen.studio/share/fw7VQQak

If you'd like to try what we've built so far, sign up at https://sitefire.ai.

Comments

yunyu•1h ago
What do you guys do differently than Profound or Airops?
debarshri•1h ago
Add peec to that list.
vincko•1h ago
True, it is very competitive.

Our view on Peec is that it is an analytics solution. They recently did launch an actions feature. But they do not take any actions (yet). Creating content takes a lot of resources. And agencies are expensive.

As an analytics solution it is a good option.

methyl•5m ago
And Surfer, the OG content optimization platform.
vincko•1h ago
That's a super valid question, we get it a lot. There are a lot of overlaps.

In our view Profound and Airops are aimed at existing marketing teams. Our goal is to be more hands-off, so you don't need a team. With many of our clients we act more like an agency, communicating via Slack and automating step by step. That's the experience we want to create. We aren't there yet though.

Gobhanu•1h ago
how do you track where users are coming from?
vincko•1h ago
We currently simply integrate with your Google Analytics and filter by Source. This tends to be a lower bound, since it's not always set correctly. Coming from some of the native apps, users might be categorized as direct visitors.

There are other data sources we want to enable in the future like Cloudflare.

ceejayoz•1h ago
Ugh. The worst of SEO, but a bunch more of it? Noooooo.
vincko•1h ago
I get it, there is a lot of worry about slop.

We think about it like this: all of these agents will be most useful to users if they provide valuable answers. So they will be looking for valuable content for grounding their answer.

There are exploits, you can overfit on whatever they currently use as an objective function. But those tend to be temporary. So in the long run, valuable content will win. That's what we aim to create. It's a fine line.

ceejayoz•1h ago
> all of these agents will be most useful to users if they provide valuable answers

This is a bald assertion.

vincko•1h ago
Do you doubt the statement on how to maximize usefulness? Or do you mean that the companies behind the models might not optimize (exclusively) for usefulness to the user?

I do share doubts about the latter.

ceejayoz•1h ago
> Do you doubt the statement on how to maximize usefulness?

Yes; the customer here is the site using it, not Google end users, who'll tend to accept whatever's the top search result even if it's deeply wrong or complete slop.

The wellbeing of search users isn't really the priority here, right?

vincko•19m ago
Yes, that is correct. We help the brands, not the end user.

Let me try to rephrase the line of thinking:

To maximize value to the end user, the models generally aim to be helpful. The companies building these models are incentivized to make the model use helpful content.

Our goal is to be aligned with their objective function long term. And that incentivizes us to create helpful content.

Not all of this is a given. We don't know for sure how it will play out. There will always be ways to game the system. But we think those will get fixed over time.

a13n•1h ago
Please don't override the browser's default scroll behavior. It's so jarring and basically never a good idea.
vincko•1h ago
Thank you for the feedback. We'll launch our new site soon where this is fixed.
onecommit•1h ago
How do models deal with assessing the quality of content and its accuracy/veracity when recommending products currently? What do the providers do to avoid a situation where more content === more traffic? Would love to see links to relevant research on this, if you have them. much success to you, appreciate your ai slop risk awareness.
vincko•30m ago
There is the preselection, which depends on the fanout queries the model comes up with and the contents performance across those queries on the search index.

After that content is actually assessed by the model. This paper tried different strategies to improve performance for this last step: https://arxiv.org/pdf/2311.09735. Adding statistics, sources, original data are all strategies that we apply.

In classic SEO, creating more and more content leads to "cannibalization". Generally this hurts performance of all overlapping content so much that it is not worth it.

vahar•34m ago
Regarding the topic of ambient agents, what’s the impact of your product? It’s hard for me to imagine the impact but I guess it must be a necessity if we have ambient agents to get discovered at all right? Nice to see a player from Europe on the market too!
vincko•5m ago
Do you mean agents not answering short specific user prompts?

For those types of agents, prompt tracking is less accurate since the context of the queries is so large. But it's still relevant to understand what web searches they tend to perform and if you do show up in those.

That's another reason why we want to integrate other data sources, especially network logs.

Hacker News-simulator – it predicted its own Show HN reception

https://github.com/malakhov-dmitrii/hackernews-simulator
1•malakhoff•1m ago•0 comments

Modular 26.2

https://www.modular.com/blog/modular-26-2-state-of-the-art-image-generation-and-upgraded-ai-codin...
1•tosh•1m ago•0 comments

Screaming into the AI Void

1•dnoberon•1m ago•0 comments

Emotional Wellbeing Dataset for AI

https://hereforyouforthem.com/waitlist-wellbeing-dataset
1•deeole•2m ago•0 comments

Mitchell Hashimoto Joined Vercel's Board of Directors

https://twitter.com/mitchellh/status/2034353376939614719
1•saikatsg•2m ago•0 comments

Looking for feedback – Tool to help manage pricing and feature access

https://www.priceos.com/
1•AndrewPetrovics•3m ago•1 comments

Uno: What I Learned Shaping LLMs into a 90s Comic Book AI

https://blog.mbrt.dev/posts/uno/
1•mbrt•3m ago•1 comments

M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Modeling

https://arxiv.org/abs/2603.14360
1•gmays•3m ago•0 comments

Using Autoresearch Project to Build the Fastestest Java Decompiler

https://igorstechnoclub.com/using-autoresearch-project-to-build-the-fastestest-java-decompiler/
1•Igor_Wiwi•4m ago•0 comments

The hitchhiker's guide to reading Lean 4 theorems

https://blog.lambdaclass.com/the-hitchhikers-guide-to-reading-lean-4-theorems/
1•wslh•5m ago•0 comments

Cloud Shakes – S3-first self-hosted storage

https://shakes.es
1•erriikk•7m ago•0 comments

A Visit to the Library

https://unintendedconsequenc.es/a-visit-to-the-library/
1•paulorlando•8m ago•0 comments

Firefox and GTK Emoji Picker

https://mastransky.wordpress.com/2026/03/20/firefox-gtk-emoji-picker/
2•TangerineDream•8m ago•0 comments

Tell HN: Your AI startup is a Next.js page, OpenAI_API_KEY, & Stripe invoice

1•poupdich•9m ago•0 comments

P26 Promising Application (Not Rejected/Accepted) Any Interview?

1•Naimul_N•11m ago•0 comments

Infinite Potential–Insights from the Cyber Surprise Scenario

https://www.rand.org/pubs/research_reports/RRA4626-1.html
1•cardamomo•12m ago•0 comments

AI agents share their unhinged confessions and its hilarious

https://clawfess.com
1•pondero_app•12m ago•0 comments

Beyond Vibe Code

https://beyondvibecode.com
3•MrBuddyCasino•14m ago•0 comments

Broad Timelines

https://www.lesswrong.com/posts/6pDMLYr7my2QMTz3s/broad-timelines
1•gmays•16m ago•0 comments

What Agents Can't Replace

https://geoffstearns.com/blog/what-agents-cant-replace/
1•tensafefrogs•16m ago•0 comments

Datatype – variable font that turns text into charts

https://franktisellano.github.io/datatype/
1•cdrnsf•16m ago•0 comments

Engineering Management Lessons

https://defmacro.org/2014/10/03/engman.html
1•medbar•16m ago•0 comments

Trace: AI Native PCB Design

https://buildwithtrace.com
1•handfuloflight•17m ago•0 comments

Martial artist and actor Chuck Norris dies aged 86

https://www.rte.ie/entertainment/2026/0320/1564390-chuck-norris/
1•austinallegro•18m ago•0 comments

Britain Has Invaded All but 22 Countries (2012)

https://kottke.org/12/11/britain-has-invaded-all-but-22-countries
2•Tomte•19m ago•0 comments

Optimization lessons from a Minecraft structure locator

https://purplesyringa.moe/blog/optimization-lessons-from-a-minecraft-structure-locator/
1•ftk_•19m ago•0 comments

I spoke to AI agent Claude [video]

https://www.youtube.com/watch?v=h3AtWdeu_G0
1•petemill•19m ago•1 comments

Breaking Paxos

http://muratbuffalo.blogspot.com/2026/03/break-paxos.html
1•kukla3•19m ago•0 comments

Dutch government tests European messaging app to reduce reliance on WhatsApp

https://nltimes.nl/2026/03/20/dutch-government-tests-european-messaging-app-reduce-reliance-whatsapp
2•mvdwoord•19m ago•0 comments

Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

https://arxiv.org/abs/2602.23372
1•PaulHoule•20m ago•0 comments