frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Why is Apple's voice transcription hilariously bad?

6•keepamovin•6h ago
Why is Apple’s voice transcription so hilariously bad?

Even 2–3 years ago, OpenAI’s Whisper models delivered better, near-instant voice transcription offline — and the model was only about ~500 MB. With that context, it’s hard to understand how Apple’s transcription, which runs online on powerful servers, performs so poorly today.

Here are real examples from using the iOS native app just now:

- “BigQuery update” → “bakery update”

- “GitHub” → “get her”

- “CI build” → “CI bill”

- “GitHub support” → “get her support”

These aren’t obscure terms — they’re extremely common words in software, spoken clearly in casual contexts. The accuracy gap feels especially stark compared to what was already possible years ago, even fully offline.

Is this primarily a model-quality issue, a streaming/segmentation problem, aggressive post-processing, or something architectural in Apple’s speech stack? What are the real technical limitations, and why hasn’t it improved despite modern hardware and cloud processing?

Comments

bryanrasmussen•6h ago
>they’re extremely common words in software, spoken clearly in casual contexts

extremely common phrases in software are extremely uncommon phrases for most of the world.

bryanrasmussen•5h ago
so there should probably be some sort of jargon-user probability setting that would be evaluated by your phrase usage.

first off there must be some phrases that are more commonly used in development than otherwise that it gets correct, a large number of those indicates high chance of being software jargon user. Furthermore all these other phrases are not in themselves common non-software usage, thus if you are using a lot of phrases that might be, with lower probability but still relatively high probability, software jargon this could be set.

Now we also get to personal behavior tracking, you are on dev sites a lot chance of using software jargon instead of non-software jargon goes up.

Do you use computer for development, chances go up. Of course lots of reasons why they would not track this to keep people from being pissed but still, possible way to improve from tracking.

finally allowing people to create profile - which I don't know if they do because I don't use.

Of course this kind of software dev jargon workflow would also help other identifiable subgroups with specific jargon sets, like lawyers, or accountants, etc. etc.

All these things of o

keepamovin•4h ago
Yeah, all of these are good ideas. And I think they should also utilize the obviously available to them abundant context of any message that you’re sending.
keepamovin•4h ago
OK, fair point. My examples were taken from my immediate previous transcript however this is not a intermittent issue. This is consistent. Terrible hilarious performance.

That’s sad. I tried to prove it terrible in this comment by using transcript here, hoping to show you some examples, but the transcript is essentially accurate. Besides, the sad said humming above and the humming homonym.

Security breaks during partial failures – design notes from distributed systems

5•sandhyavinjam•2h ago•1 comments

Ask HN: Building a tool to ensure things get done on time

3•Vishal19111999•1h ago•0 comments

Ask HN: When do we expose "Humans as Tools" so LLM agents can call us on demand?

31•vedmakk•10h ago•21 comments

Tell HN: Happy New Year

430•schappim•1d ago•200 comments

Ask HN: Which AI productivity tools are you using in 2026?

3•Vishal19111999•1h ago•0 comments

Ask HN: Why is Apple's voice transcription hilariously bad?

6•keepamovin•6h ago•4 comments

Ask HN: How did you learn to code?

23•chistev•20h ago•71 comments

Ask HN: How Are You Handling Auth in 2026?

5•joshcsimmons•11h ago•13 comments

Ask HN: What did you read in 2025?

334•kwar13•6d ago•443 comments

I built a public skill registry and MCP server so Codex can install new skills

2•iluxu•13h ago•0 comments

Ask HN: Loneliness at 19, how to cope?

60•yresting•4d ago•105 comments

Ask HN: What is the best microVMs for AI agents?

8•zfoong•1d ago•7 comments

Semantica – Open-source semantic layer and GraphRAG framework

7•kaifahmad1•21h ago•0 comments

Ask HN: Any example of successful vibe-coded product?

78•sirnicolaz•2d ago•117 comments

Ask HN: Does reading HN make you happy?

47•yakattak•2d ago•37 comments

Tell HN: Happy New Year!

4•realberkeaslan•1d ago•2 comments

Ask HN: How to do a Personal Cybersecurity audit

24•preciousoo•3d ago•12 comments

Tell HN: Stripe Dashboard Is Slow

2•_RPM•14h ago•2 comments

Ask HN: How long before the first civilian cargo flights are AI piloted?

2•givemeethekeys•1d ago•13 comments

Happy New Year HN!

11•thunderbong•1d ago•4 comments

Ask HN: How did you make yourself more marketable?

11•ronbenton•2d ago•13 comments

A curated directory of open-source AI projects

12•doanbactam•2d ago•2 comments

Ask HN: How to go back to listening to MP3s?

9•muratsu•5d ago•24 comments

TP-Link only works with a permanent internet connection

8•roscas•3d ago•7 comments

Ask HN: How are you sandboxing coding agents?

44•m-hodges•5d ago•31 comments

Tell HN: I am afraid AI will take my job at some point

24•funnyfoobar•6d ago•39 comments

Ask HN: How do you manage kids' accounts?

12•xfax•3d ago•7 comments

Ask HN: How do you get visibility if you're suuuuper bad at marketing?

13•ClipNoteBook•4d ago•22 comments

Ask HN: What do you use to manage your coding projects?

5•SunshineTheCat•2d ago•11 comments

Users decide which online platforms to trust in 2025

5•taka-dev•2d ago•3 comments