frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

HN: Shoute – Yes, another dictation app. Why the last 5% is the whole product

https://getshoute.com/deepdive
4•docstryder•1h ago

Comments

docstryder•1h ago
I've been working on Shoute, a speak-to-text app for Mac and Windows that's built around one idea: the full loop has to feel instant

I do know this isn’t a new category. A lot of people here already have some version of this: whisper.cpp behind a hotkey, macOS dictation, SuperWhisper, Wispr Flow, or some other hand-rolled version.

I built one anyway because I kept bouncing off dictation tools in my actual workday.

My problem was not “can an app transcribe my voice?” Most of them can, and impressively well. The problem was the full loop: press shortcut -> speak -> release -> cleaned up text appears where I was already typing - and that this happens consistently, quickly, day after day.

If that loop has enough delay, I lose the thread. If the output is too raw, I am back to editing. If the app needs screenshots to understand context, I start feeling uneasy about using it everywhere. You want to be confident that it always will work - or else you lose trust in it.

So the version I wanted was pretty narrow: - it should feel super quick for short everyday dictation

- the output should be cleaned up before insertion

- it should work across ALL the usual apps

- it should never lose data

- it should support both local and cloud modes (personally for flying but privacy too for specific things)

- it should use only minimal context

Shoute solves all of that really well and is lightweight (native code) and fluid to use day to day. It has a generous free tier (2000 words/week - should be enough for most casual use), one time purchase for both local and cloud, and cloud with subscription ($6.99/mo) for when folks who need the latest cloud models. Not a fan of subscriptions too but hard to have ongoing support for the latest cloud models without it.

Learned some really cool things building this:

The interesting eng lesson for me has been that voice UX is so much more latency-sensitive than normal app UX - the major part of the work on this was on making it consistently low latency end to end.

On latency - the model is only one part of the delay. Shoute runs three backends for different modes and fallback (ElevenLabs streaming, Groq Whisper, and WhisperKit for on-device) and each has different latency profiles. For short recordings (~15s is my avg - Shoute can do really long but not the primary use case for hour long recordings), the annoying delays often come from everything around the model: audio finalization, connection warmup, WebSocket setup, token fetching, fallback paths, local model cold starts, and finally pasting into the active app. Getting all this right consistently took significant time and eng effort despite Claude helping with all of it - taste and architectural direction is still absolutely essential in 2026, especially with desktop and system apps.

Native development is still hard - things like WebSockets are fundamentally web technologies and their native libraries have a lot of hard edges and inconsistencies that only show up when you use something 100 times a day - took some engineering to get around this. Native does make the UX fast but it almost made me wish I had chosen Electron for something with this much network management, but speed and resource efficiency is worth going native for.

Okay, this already feels long - please try it, let me know how it feels, glad to hear feedback and feature requests. Thank you! Here is the link: https://getshoute.com/deepdive

1024bits•53m ago
Congrats on launch!

I'm a little bit confused by this. You say it supports 100+ languages, but on the landing page some languages are colored in and the rest are greyed out, and the total number doesn't seem to amount to 100+.

Also, presumably the local model doesn't cost you anything per token. So why isn't that one the free tier, with the cloud model being in the paid plan? Wouldn't that help you get a lot more users cost-efficiently?

Lastly, your landing page has a lot of "AI hallmarks". This may or may not be a bad thing, but at least on here I imagine many people are fatigued from this pattern.

I'm all for apps that don't use Electron. What did you use for this?

Diffflow: Rust and Svelte Web Monitor

https://diffflow.com/
1•7rin0•52s ago•0 comments

"Being Poor," Ten Years on (2015)

https://whatever.scalzi.com/2015/09/03/being-poor-ten-years-on/
1•chistev•1m ago•1 comments

The Vibe Coding Hangover

https://checkmarx.com/blog/the-vibe-coding-hangover/
1•speckx•2m ago•0 comments

Four tech waves. Six companies. Here's what I'm building next

https://www.tamccann.com/four-tech-waves-six-companies-heres-what-im-building-next/
1•mahirsaid•2m ago•0 comments

Data readiness for agentic AI in financial services

https://www.technologyreview.com/2026/05/14/1137034/data-readiness-for-agentic-ai-in-financial-se...
1•joozio•3m ago•0 comments

Princeton scraps honor code and will supervise exams for first time in 133 years

https://www.independent.co.uk/news/world/americas/princeton-proctor-exams-ai-b2976111.html
1•madihaa•3m ago•0 comments

Next-gen pumps use film-based technology

https://www.foodprocessing.com.au/content/processing/article/next-gen-pumps-use-film-based-techno...
1•PaulHoule•3m ago•0 comments

Agents Can Reason. They Still Can't Search

https://dipkumar.dev/posts/agents/agent-search-problem/
4•askhn1234_12•4m ago•0 comments

Stop using user passwords for OpenStack automation

https://thobias.org/2026/05/10/openstack_app_credentials.html
1•cavanche•5m ago•0 comments

Steve Jobs Years at NeXT Shaped His Success as Apple CEO

https://spectrum.ieee.org/steve-jobs-next-computer
1•oldnetguy•6m ago•0 comments

US audit regulator weighs deep staff cuts to unit overseeing accounting firms

https://www.ft.com/content/f5c56c66-2896-4ef1-9a6a-49c59e76f23e
1•petethomas•6m ago•0 comments

Solar drone with jumbo jet wingspan broke a flight record–then it crashed

https://arstechnica.com/gadgets/2026/05/solar-drone-with-jumbo-jet-wingspan-broke-a-flight-record...
1•Brajeshwar•6m ago•0 comments

Apple-OpenAI Relationship Frays, Setting Up Possible Legal Fight

https://www.bloomberg.com/news/articles/2026-05-14/openai-apple-partnership-frays-setting-up-poss...
3•helsinkiandrew•7m ago•0 comments

Why Your AI Can Write a Novel but Still Struggles to Count to Fifty LLMHall

https://beeble.com/en/blog/why-your-ai-can-write-a-novel-but-still-struggles-to-count-to-fifty
1•odysseyk•9m ago•0 comments

Bay Area tech giant Cisco to cut jobs after record revenue

https://www.sfgate.com/bayarea/article/cisco-layoffs-bay-area-22257875.php
1•mikhael•9m ago•0 comments

Canada court quashes bid by Alberta separatists for independence referendum

https://www.theguardian.com/world/2026/may/14/alberta-separation-referendum-independence-petition...
1•Geekette•9m ago•0 comments

GitHub Copilot App

https://github.blog/changelog/2026-05-14-github-copilot-app-is-now-available-in-technical-preview/
1•hmokiguess•10m ago•0 comments

Show HN: nichy – a visualizer for Rust type memory layouts

https://niche.rs/
1•iridis•11m ago•0 comments

Silent Jungles

https://suziepetryk.com/blog/jungles.html
1•tancik•11m ago•0 comments

Erlang/OTP 29.0 Release

https://www.erlang.org/news/188
1•birdculture•11m ago•0 comments

Show HN: Sun – a Prolog generator for the 2026 solar eclipse

https://github.com/bergholt/triptych
1•kasperbergholt•11m ago•0 comments

Tic Tac Throne – a 3×3 grid game with checkers-class complexity

https://tic-tac-throw.vercel.app/
1•firesofmay•12m ago•0 comments

Nginx Rift Heap-based Buffer Overflow

https://depthfirst.com/nginx-rift
1•planb•13m ago•0 comments

The Sad Wives of AI

https://www.wired.com/story/meet-the-sad-wives-of-ai/
1•bryanrasmussen•15m ago•0 comments

Perseverance Snaps a Selfie on Mars

https://nautil.us/perseverance-snaps-a-selfie-on-mars-1280734
2•Brajeshwar•15m ago•0 comments

Three AWS VPS Runs Looked Identical – One Still Failed Under Load

https://webbynode.com/articles/three-aws-vps-runs-looked-identical-one-still-failed-under-load
1•gsgreen•17m ago•0 comments

How I Sandbox My AI Agents

https://blog.fidelramos.net/software/how-i-sandbox-ai-agents
2•fidelramos•18m ago•0 comments

Apple has won a prestigious award for iOS 26's Liquid Glass design

https://9to5mac.com/2026/05/14/apple-has-won-a-prestigious-award-for-ios-26s-liquid-glass-design/
1•danorama•19m ago•1 comments

Vibecoding – A vibecoding tool for HR who still don't get what vibecoding is

1•zhenruyan•20m ago•1 comments

Like Ollama, but for your own cloud [Apache 2.0]

https://github.com/superlinked/sie
1•supo•20m ago•1 comments