frontpage.

We published the first public benchmark for insurance AI agents on HuggingFace.

What it contains: - 510 real insurance scenarios - 10 categories across 9 insurance lines - Train/val/test splits (357/76/77) - 4 routing decisions per scenario: AI handles, AI with verification, human handoff, hybrid collaboration - 3 evaluation metrics: intent accuracy, routing accuracy, action completeness

Why it matters: Insurance is precision work. A wrong routing decision costs money and trust. Most AI benchmarks miss this. They don't test what matters in production.

This data came from a real voice AI system. Years of customer calls. Actual insurance decisions. The scenarios are messy. They're real.

Open source: Apache 2.0 license. Ready to use.

Implementation: https://github.com/pavelsukhachev/hybrid-orchestrator Paper: TechRxiv (IEEE) - "The Hybrid Orchestrator: A Framework for Coordinating Human-AI Teams"

SMTLIB as a Compiler IR I

OpenClaw Is the New Linux

Show HN: A AI-powered, open-source geostrategy game

Show HN: LocaFlow – Fast AI Translation Tool for Xcode Strings, XML and JSON

Work at tiny corp: "Bounties pay you while judging that fit."

Don't use escaping closures in SwiftUI

How TikTok 2.0 Became a Weapon for ICE

Health Advice from A.I. Chatbots Is Frequently Wrong, Study Shows

Cleatus, Fox Sports's football robot (2019)

Show HN: AetherLang – A DSL for building AI workflows with visual debugging

'Goldilocks' Effect for Online Teens? Moderate Social Media Users Fare Better

Show HN: AICO – Manage AI collaborators like managing employees

Baby headcams reveal how babies encounter faces during development

Equality Saturation Meets ML: The Next Step for Smarter Optimizing Compilers [video]

Btrfs Brings Experimental Remap-Tree Feature and More in Linux 7.0

ClawWatcher – Cost and token monitoring for OpenClaw agents

The Lost Dog That Made Constant Surveillance Feel Like a Favor

Why demand for code is infinite: How AI creates more developer jobs

A decades-old video game has helped me defeat the doomscroll

Show HN: Find automation ideas and creators by sharing your business problem

Designing and Using Combinators: The Essence of Functional Programming

Show HN: Axiom – Open-source AI research agent that runs locally (C#, Ollama)

Rust implementation of Mistral's Voxtral Mini 4B Realtime runs in your browser

The first time I visited Meta's HQ, it didn't quite register as a real place

Show HN: Quickpick UI – type-to-filter picker for React and vanilla JavaScript

Joe Rogan Experience #2335 – Dr. Mary Talley Bowden (2025) [video]

The $70M domain that couldn't survive a Super Bowl ad

Rise of the Cowboy Coder

Ask HN: What CI do you use instead of GitHub Actions?

Pure C, CPU-only inference with Mistral Voxtral Realtime 4B speech to text model

Show HN: Insurance AI Benchmark – 510 scenarios from production