I've been diving into the gap between benchmark ASR performance and real-world speech. Models like Whisper and Deepgram report impressive >95% accuracy under ideal conditions. But in the wild — accents, noisy environments, emotional speech, code-switching, overlapping speakers — accuracy drops sharply, often to the mid-80s or worse.
This matters because the next wave of AI won't be chatbots; it will be hands-free, real-time systems in contexts like:
- care work (voice logs)
- crisis communication
- home healthcare
- security rounds
- field operations
- "I need help" micro-interactions
In these high-stakes contexts, 85% accuracy means critical information gets lost.
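To make that concrete, here's a back-of-envelope sketch (illustrative numbers only, not drawn from any specific benchmark) of what a given word accuracy implies for a single short message:

```python
def expected_errors(word_accuracy: float, message_words: int) -> float:
    """Average number of wrong words in a message of the given length,
    assuming errors are spread uniformly (a simplification)."""
    return (1.0 - word_accuracy) * message_words

# A 40-word handoff note transcribed at 85% word accuracy:
# roughly 6 words come out wrong, on average.
print(expected_errors(0.85, 40))
```

If even one of those ~6 words is a name, a dose, or a negation ("no pain" vs "pain"), the transcript is actively misleading rather than merely imperfect.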
What seems missing today:
- Fine-tuning pipelines for noisy, accented speech
- Reinforcement learning loops (user corrections → model improvements)
- Fast per-speaker adaptation
- Better handling of disfluencies ("uh," "um," repairs)
- Scaling-law insights applied to ASR models
- Evaluation metrics that reflect real environments instead of curated datasets
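On the evaluation point: the standard starting place is word error rate (WER), which counts substitutions, insertions, and deletions via a word-level edit distance rather than a single headline "accuracy" number. A minimal self-contained sketch (real pipelines typically use a library such as jiwer, plus text normalization, and would slice WER by noise condition, accent, and speaker rather than report one aggregate):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate = (substitutions + insertions + deletions) / reference words,
    computed with a word-level Levenshtein dynamic program."""
    ref, hyp = reference.lower().split(), hypothesis.lower().split()
    # d[i][j] = edit distance between ref[:i] and hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# One substitution + one deletion against a 5-word reference → WER 0.4
print(wer("the patient needs insulin now", "the patient leads insulin"))
```

Note that WER treats every word equally; part of what I mean by "metrics that reflect real environments" is weighting errors on critical content words differently from disfluencies.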
What I'm trying to understand:
- What prevents ASR from reaching reliable >99% accuracy in real-world conditions?
- Is the bottleneck the model architecture, data quality, or something else?
Would love to hear from anyone who has:
- Worked on Whisper fine-tuning
- Tackled multilingual or accented ASR
- Shipped speech systems in noisy environments
- Developed conversational (not dictation) ASR models
- Built correction-feedback training loops
- Deployed ASR in safety-critical or field environments
What worked? What failed? What surprised you?