We had traces, evals, and Langfuse dashboards. Everything looked fine, but we kept finding failures we should have caught earlier.
The pattern kept repeating:
- ship an improvement
- it works for a while
- hit an edge case that breaks it
- don't notice until we've lost good candidates
That's when we realized the problem wasn't just our recruitment pipeline: almost every AI product has blind spots that evals miss.
So we built Verse, a tool that surfaces issues directly from real AI interactions, whether that's candidates talking to your recruitment pipeline, users interacting with your agent, or any other AI system making decisions.
Instead of relying solely on evals, we cluster conversations, identify the key ones to review, and flag the ones that show failure patterns. We use OpenTelemetry for trace ingestion, so it's compatible with Langfuse, LangSmith, Braintrust, and other AI observability tools, and you can add it right alongside your existing setup.
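To make "alongside your existing setup" concrete: because ingestion is plain OTLP, you can attach a second span processor to the tracer provider you already have and fan the same traces out to both backends. Here's a minimal sketch in Python; the endpoints and header names (including the Verse URL) are placeholders for illustration, not real config.

```python
# Minimal sketch: export the same spans to an existing backend and to Verse.
# Requires: opentelemetry-sdk, opentelemetry-exporter-otlp-proto-http
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter

provider = TracerProvider()

# Your existing observability backend (Langfuse, LangSmith, Braintrust, ...).
# Placeholder endpoint/credentials; keep whatever your setup already uses.
existing_exporter = OTLPSpanExporter(
    endpoint="https://your-existing-backend.example/v1/traces",
    headers={"authorization": "Bearer EXISTING_BACKEND_KEY"},
)
provider.add_span_processor(BatchSpanProcessor(existing_exporter))

# Second processor pointing at Verse (hypothetical endpoint for illustration).
verse_exporter = OTLPSpanExporter(
    endpoint="https://ingest.verse.example/v1/traces",
    headers={"authorization": "Bearer VERSE_API_KEY"},
)
provider.add_span_processor(BatchSpanProcessor(verse_exporter))

trace.set_tracer_provider(provider)

# Instrument as usual; every span now reaches both backends.
tracer = trace.get_tracer("recruitment-pipeline")
with tracer.start_as_current_span("screening-conversation") as span:
    span.set_attribute("candidate.stage", "phone_screen")
```

Nothing about your existing instrumentation changes; the second `BatchSpanProcessor` just duplicates the export path.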
I'm posting this because I'm curious whether other teams are hitting the same wall. If you want, I'm happy to audit your AI implementation for free and show you where things commonly break, even if you never use Verse.
Happy to answer any technical questions.