However, there are a couple of problems with it. It's very expensive and very heavy: because it captures every DOM change, it slows down the experience for the end user. For those reasons you're forced into heavy sampling, for example recording only 10% of sessions. On top of that, you have to spend a lot of time watching session replays, and sometimes you can't find the session you're interested in because it wasn't included in the recorded sample.
This was basically the trigger for the idea, and I started building my own tool. The goal was to make it as simple and lightweight as possible, and to focus on clarity rather than adding a lot of bloat.
Also worth mentioning: I've noticed that FullStory and other tools recently started adding AI session summary features, which means there is clearly a need. However, they can't really do visitor-level summaries across sessions or generate broader insights, because the heavy script overhead forces you to sample only a small percentage of traffic. Technically you could record all sessions, but it would be extremely expensive and would slow down the experience for everyone. The difference with my approach is that, because the tracker is lightweight, you can record all sessions and still generate visitor summaries and insights across the full dataset, at a much lower cost.
So the idea is simple: capture signals from user browsing, such as pageviews, clicks, scrolling, and metadata from the request headers. Group all these events into a session. Then schedule a job every X minutes that sends the events to an LLM and gets back a summary of the session. Then move a level up and generate a visitor profile summary across all of that visitor's sessions. Basically it's like a living memory layer for your customers. Finally, use the session summaries and visitor profile summaries as input and let the LLM identify patterns and generate insights: things you might not know about how your visitors behave, things you can act on.
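To make that concrete, here is a rough sketch of what the scheduled session-summary step could look like. The event shape, function names, and the `llm` helper are illustrative placeholders, not the actual implementation:

```ts
// Sketch only: placeholder types and names, not the real code.
interface TrackedEvent {
  visitorId: string;
  sessionId: string;
  type: "pageview" | "click" | "scroll";
  url: string;
  ts: number;
  payload?: Record<string, unknown>; // e.g. a DOM snapshot for pageviews
}

// Stand-in for whatever LLM completion API is actually used.
declare function llm(prompt: string): Promise<string>;

// Scheduled every X minutes: turn a session's collected events into a summary.
async function summarizeSession(events: TrackedEvent[]): Promise<string> {
  const timeline = events
    .sort((a, b) => a.ts - b.ts)
    .map(e => `${new Date(e.ts).toISOString()} ${e.type} ${e.url}`)
    .join("\n");
  return llm(
    "Summarize what this visitor did and what they seemed to be looking for:\n\n" +
    timeline
  );
}
```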
The first decision was what to use for recording customer behavior. Initially I tried rrweb, but I noticed that it also slows things down a lot. So I ended up building my own tracker that captures just enough signals to generate a summary while staying super lightweight. For example, on each pageview I record a snapshot of the DOM to give the LLM context about what the visitor is browsing, but unlike FullStory I don't record every DOM change.
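As an illustration, a stripped-down tracker along these lines could look roughly like this. The collection endpoint, thresholds, and snapshot size are made up for the example:

```ts
// Minimal sketch of a lightweight tracker; not the actual script.
const ENDPOINT = "https://example.com/collect"; // hypothetical collection URL
const queue: object[] = [];

function track(type: string, payload: object = {}) {
  queue.push({ type, url: location.href, ts: Date.now(), ...payload });
}

// One trimmed DOM snapshot per pageview, instead of recording every mutation.
track("pageview", { dom: document.documentElement.outerHTML.slice(0, 50_000) });

document.addEventListener("click", (e) => {
  const el = e.target as HTMLElement;
  track("click", { tag: el.tagName, text: el.innerText?.slice(0, 80) });
});

let lastDepth = 0;
document.addEventListener("scroll", () => {
  const depth = Math.round(
    ((window.scrollY + window.innerHeight) /
      document.documentElement.scrollHeight) * 100
  );
  if (depth >= lastDepth + 10) { // only report roughly every extra 10% of depth
    lastDepth = depth;
    track("scroll", { depth });
  }
}, { passive: true });

// Flush the queue when the page is hidden; sendBeacon survives unload.
document.addEventListener("visibilitychange", () => {
  if (document.visibilityState === "hidden" && queue.length) {
    navigator.sendBeacon(ENDPOINT, JSON.stringify(queue.splice(0)));
  }
});
```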
The next question was how to generate visitor summaries and insights: should I send all the raw events, or just the session and visitor summaries? I ended up sending only the summaries, since I was happy with the output and it keeps things much more cost-effective in terms of LLM input tokens.
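Sketching that roll-up with placeholder names: visitor profiles are built from session summaries, and insights from visitor profiles, so raw events never reach the LLM more than once.

```ts
// Illustrative only; `llm` stands in for the actual completion API.
declare function llm(prompt: string): Promise<string>;

// Visitor profile from that visitor's session summaries.
async function summarizeVisitor(sessionSummaries: string[]): Promise<string> {
  return llm(
    "Build a short profile of this visitor from their session summaries:\n\n" +
    sessionSummaries.join("\n---\n")
  );
}

// Insights from visitor profiles: input grows with the number of visitors,
// not with raw event volume, which keeps token usage (and cost) bounded.
async function generateInsights(visitorProfiles: string[]): Promise<string> {
  return llm(
    "Identify recurring patterns and actionable insights in these visitor profiles:\n\n" +
    visitorProfiles.join("\n---\n")
  );
}
```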
So far I've tested the tool myself by injecting the script with a Chrome extension and browsing around various websites, and the summaries make a lot of sense. Now I want to test it in the wild with data from real visitors, so I'm looking for people who are happy to embed it on their website and see how valuable the summaries and insights are. Of course I'll collect feedback and improve the tool.
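If you want to try something similar yourself, a content script that injects a tracker tag into whatever page you're browsing looks roughly like this (the script URL is a placeholder):

```ts
// Content-script sketch: inject the tracker into the current page.
const tag = document.createElement("script");
tag.src = "https://example.com/tracker.js"; // hypothetical tracker URL
tag.async = true;
document.head.appendChild(tag);
```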
On the landing page you can click on Explore Live Demo and you will get an idea of how the tool works.
Thanks.