
Hi HN,

I built EchoEntry (https://echoentry.ai) – a speech-to-text API optimized specifically for digits.

The problem: Generic STT APIs struggle with numbers. "One oh five" becomes "105" sometimes, "15" other times. For healthcare apps, warehouse systems, or IVR, this inconsistency breaks workflows.

My solution: I fine-tuned Whisper-small on spoken numbers from 1 to 999 across 5 English accents. It gets 95% accuracy on 1-3 digit numbers.
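For anyone comparing this against their own STT setup, here is a minimal sketch of one way to score digit transcription. It assumes exact-match accuracy, where a prediction only counts if the whole number string matches. The post does not say how the 95% figure was measured, so the metric, the function name `exact_match_accuracy`, and the sample data are illustrative assumptions.

```python
def exact_match_accuracy(predictions: list[str], references: list[str]) -> float:
    """Fraction of utterances whose transcribed number equals the reference exactly.

    Assumption: exact-match scoring; the post does not specify its metric.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("need at least one utterance to score")
    # Compare as strings so that "105" vs "15" counts as a plain miss.
    correct = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return correct / len(references)


# Made-up data: the second utterance ("one oh five") was transcribed as "15".
preds = ["105", "15", "42", "7"]
refs = ["105", "105", "42", "7"]
print(exact_match_accuracy(preds, refs))
```

Exact match is stricter than word error rate for this use case: a single dropped digit makes the whole entry wrong, which is usually what matters for data entry.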

Tech stack:
- Custom Whisper model (1.7GB)
- FastAPI backend
- Deployed on 8GB Linode
- FFmpeg for audio processing

Try it now (two commands, no signup):

# Download test audio
curl -O https://echoentry.ai/test_audio.wav

# Test the API
curl -X POST https://api.echoentry.ai/v1/transcribe \
  -H "X-Api-Key: demo_key_12345" \
  -F "file=@test_audio.wav;type=audio/wav"
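If you would rather call it from Python, here is a rough stdlib-only sketch that mirrors the curl request. The endpoint, header name, demo key, and form field come from the command above. The helper names `build_transcribe_request` and `transcribe` are hypothetical, and since the response schema is not described in this post, the sketch returns the raw response body.

```python
import urllib.request
import uuid

API_URL = "https://api.echoentry.ai/v1/transcribe"  # endpoint from the curl example
DEMO_KEY = "demo_key_12345"                          # demo key from the curl example


def build_transcribe_request(audio: bytes, api_key: str = DEMO_KEY,
                             filename: str = "test_audio.wav") -> urllib.request.Request:
    """Build a multipart/form-data POST equivalent to the curl -F upload."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: audio/wav\r\n\r\n"
    ).encode("ascii")
    tail = f"\r\n--{boundary}--\r\n".encode("ascii")
    return urllib.request.Request(
        API_URL,
        data=head + audio + tail,
        headers={
            "X-Api-Key": api_key,
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def transcribe(path: str) -> str:
    """Upload a WAV file and return the raw response body.

    Assumption: the response format is not specified here, so no parsing is attempted.
    """
    with open(path, "rb") as f:
        request = build_transcribe_request(f.read())
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

Calling `transcribe("test_audio.wav")` after downloading the sample file should behave like the second curl command.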

Currently free beta (1,000 calls/month per key). Looking for feedback on:
1. What accuracy threshold makes this production-ready for you?
2. Are there other number-heavy use cases I'm missing?
3. Would you pay for this vs. using generic STT?

Docs: https://echoentry.ai/docs.html

Happy to answer technical questions about the fine-tuning process or deployment!
