Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX)

https://edgewhisper.com
2•raphaelmansuy•2h ago
I built a macOS voice dictation app where zero bytes of audio ever leave your machine.

EdgeWhisper runs Voxtral Mini 4B Realtime (Mistral AI, Apache 2.0) locally on Apple Silicon via the MLX framework. Hold a key, speak, release — text appears at your cursor in whatever app has focus.

Architecture:
- Native Swift (SwiftUI + AppKit). No Electron.
- Voxtral 4B inference via MLX on the Neural Engine. ~3GB model, runs in ~2GB RAM on M1+.
- Dual text injection: AXUIElement (preserves undo stack) with NSPasteboard+CGEvent fallback.
- 6-stage post-processing pipeline: filler removal → dictionary → snippets → punctuation → capitalization → formatting.
- Sliding window KV cache for unlimited streaming without latency degradation.
- Configurable transcription delay (240ms–2.4s). Sweet spot at 480ms.
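To make the 6-stage post-processing order concrete, here is a minimal Python sketch of such a pipeline. It is an illustration only, not EdgeWhisper's Swift implementation: every function name, the filler list, and the sample dictionary and snippet entries are assumptions. "like" is left out of the filler list here because removing it blindly would damage sentences such as "I like this".

```python
import re
from typing import Callable, Dict, List

# Hypothetical filler list (the post names "um", "uh", and "like").
FILLERS = ("um", "uh")


def remove_fillers(text: str) -> str:
    # Drop each filler word plus an optional trailing comma and whitespace.
    pattern = r"\b(?:" + "|".join(FILLERS) + r")\b,?\s*"
    return re.sub(pattern, "", text, flags=re.IGNORECASE)


def replace_terms(text: str, terms: Dict[str, str]) -> str:
    # Shared helper for the dictionary and snippet stages:
    # whole-phrase, case-insensitive replacement.
    for spoken, written in terms.items():
        pattern = r"\b" + re.escape(spoken) + r"\b"
        text = re.sub(pattern, lambda m, w=written: w, text, flags=re.IGNORECASE)
    return text


def add_terminal_punctuation(text: str) -> str:
    # Append a period when the utterance has no closing punctuation.
    text = text.rstrip()
    if text and text[-1] not in ".!?":
        text += "."
    return text


def capitalize_sentences(text: str) -> str:
    # Uppercase the first letter of the text and of each new sentence.
    return re.sub(
        r"(^|[.!?]\s+)([a-z])",
        lambda m: m.group(1) + m.group(2).upper(),
        text,
    )


def normalize_whitespace(text: str) -> str:
    # Stand-in for the "formatting" stage: collapse runs of whitespace.
    return re.sub(r"\s+", " ", text).strip()


def build_pipeline(
    dictionary: Dict[str, str], snippets: Dict[str, str]
) -> List[Callable[[str], str]]:
    # Stage order follows the post: filler removal, dictionary, snippets,
    # punctuation, capitalization, formatting.
    return [
        remove_fillers,
        lambda t: replace_terms(t, dictionary),
        lambda t: replace_terms(t, snippets),
        add_terminal_punctuation,
        capitalize_sentences,
        normalize_whitespace,
    ]


def post_process(text: str, stages: List[Callable[[str], str]]) -> str:
    for stage in stages:
        text = stage(text)
    return text


stages = build_pipeline({"vs code": "VS Code"}, {"brb": "be right back"})
print(post_process("um so uh open vs code", stages))  # So open VS Code.
```

Keeping each stage a plain text-to-text function means the order can be changed or a stage skipped without touching the others.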

What it does well:
- Works in 20+ terminals/IDEs (VS Code, Xcode, iTerm2, Warp, JetBrains). Most dictation tools break in terminals; we detect them and switch injection strategy.
- Removes filler words automatically ("um", "uh", "like").
- 13 languages with auto-detection.
- Personal dictionary + snippet expansion with variable support ({{date}}, {{clipboard}}).
- Works fully offline after model download. No accounts, no telemetry, no analytics.
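For readers curious how snippet variables such as {{date}} and {{clipboard}} might be expanded, here is a small Python sketch. It is a guess at the general technique, not the app's code: `expand_variables`, the resolver table, and the stubbed clipboard value are all hypothetical (a real macOS app would read the pasteboard instead).

```python
import re
from datetime import date
from typing import Callable, Dict

# Matches {{name}} with optional spaces inside the braces.
VARIABLE = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def expand_variables(template: str, resolvers: Dict[str, Callable[[], str]]) -> str:
    def substitute(match: "re.Match[str]") -> str:
        resolver = resolvers.get(match.group(1))
        # Unknown variables are left untouched so a typo stays visible.
        return resolver() if resolver else match.group(0)

    return VARIABLE.sub(substitute, template)


# Hypothetical resolvers; the clipboard is stubbed with a fixed string.
resolvers = {
    "date": lambda: date.today().isoformat(),
    "clipboard": lambda: "text copied earlier",
}

print(expand_variables("Meeting notes ({{date}}): {{clipboard}}", resolvers))
```

Resolvers are callables rather than fixed strings so values like the date are computed at expansion time, not when the snippet is defined.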

What it doesn't do (yet):
- No file/meeting transcription (coming)
- No translation (coming)
- No Linux/Windows (macOS only, Apple Silicon required)

Pricing: Free tier (5 min/day, no account needed). Pro at $7.99/mo or $79.99/yr.

I'd love feedback on:
1. Would local LLM post-processing (e.g., Phi-4-mini via MLX) for grammar/tone be worth the extra ~1GB RAM?
2. For developers using voice→code workflows: what context would you want passed to your editor?
3. Anyone else building on Voxtral Realtime? Curious about your experience with the causal audio encoder.

Comments

Dumbledumb•1h ago
I have been using Parakeet TDT v3 with just 0.6B params and it's insanely fast (feels instant, even on an M1 Air). The accuracy is all I could ask for, so I don't see the benefit of a much larger 4B model.

Not knocking your app, but asking because it seems very focused on one model, while others allow the user to pick according to their needs.

District denies enrollment to child based on license plate reader data

https://www.theregister.com/2026/03/12/district_denies_enrollment_to_child/
1•goplayoutside•9s ago•0 comments

Aether Engine: Coupled multiphysics for photonic ICs under extreme environments

https://github.com/venticedlatte/aether-engine/blob/main/README.md
1•ventiproject•28s ago•0 comments

MiniMax M2.5 is trained by Claude Opus 4.6?

1•Orellius•51s ago•0 comments

Meta planning layoffs as AI costs mount

https://www.reuters.com/business/world-at-work/meta-planning-sweeping-layoffs-ai-costs-mount-2026...
3•Aboutplants•4m ago•1 comment

Adobe's Statement Regarding the Department of Justice Settlement

https://news.adobe.com/news/2026/03/adobe-statement
1•coolandsmartrr•4m ago•0 comments

Institutional AI vs. Individual AI

https://www.a16z.news/p/institutional-ai-vs-individual-ai
1•gmays•12m ago•0 comments

Volkswagen's first tailored EV rolls out as it retakes the top spot in China

https://electrek.co/2026/03/13/volkswagens-first-custom-ev-rolls-out-after-taking-top-spot-in-china/
1•breve•16m ago•0 comments

AutoContext: closed-loop system for improving agent behavior over repeated runs

https://github.com/greyhaven-ai/autocontext
1•frozenseven•22m ago•0 comments

Autoresearch Home

https://ensue-network.ai/autoresearch
1•vinhnx•23m ago•0 comments

BYD's 5 min fast charging, 500 mile range luxury EV is headed overseas

https://electrek.co/2026/03/13/byd-ev-with-5-min-charging-500-miles-range-heads-overseas/
2•breve•23m ago•0 comments

Show HN: A Claude Skill that teaches Rails conventions for LLM calls

https://github.com/rubyonai/rails-llm-integration
1•nagstler•27m ago•0 comments

(Media over QUIC) on a Boat

https://moq.dev/blog/on-a-boat/
1•mmcclure•28m ago•0 comments

Monty Python Got It Wrong About Medieval Disease

https://www.sciencedaily.com/releases/2026/03/260313002645.htm
2•bookmtn•28m ago•0 comments

Mega-OS – 38-agent operating system that runs inside Claude Code

https://github.com/sly-the-fox/mega-os-public
1•slythefox•30m ago•1 comment

$2B nonprofit grants traced to find who's behind age verification bills

https://old.reddit.com/r/linux/comments/1rshc1f/i_traced_2_billion_in_nonprofit_grants_and_45/
3•spaghetdefects•32m ago•0 comments

Elon Musk's Ketamine Use Can't Be Probed in OpenAI Fraud Trial

https://www.bloomberg.com/news/articles/2026-03-13/elon-musk-s-ketamine-use-can-t-be-probed-in-op...
1•caaqil•33m ago•0 comments

Show HN: SupplementDEX – The Evidence-Based Supplement Database

https://supplementdex.com/
1•richarlidad•33m ago•0 comments

Show HN: I built an interactive 3D three-body problem simulator in the browser

https://structuredlabs.github.io/threebodyproblem/
1•amrutha_•34m ago•1 comment

The Egg (2009)

https://www.galactanet.com/oneoff/theegg_mod.html
1•basilikum•35m ago•0 comments

What happens when an autonomous robotaxi gets into an accident?

https://twitter.com/seventensuited/status/2032134435924295805
1•paulnpace•36m ago•0 comments

The Collapse of the Incentive to Make

https://www.carlos-menezes.com/posts/collapse-of-the-incentive-to-make
1•carlos-menezes•37m ago•0 comments

Spotify Silently Updates Itself (and How to Stop It)

https://duckass.bearblog.dev/how-spotify-silently-updates-itself-and-how-to-stop-it/
1•lschueller•39m ago•1 comment

Let the Code Do the Talking

https://sunilpai.dev/posts/after-wimp/
1•aratahikaru5•41m ago•0 comments

RAM: WTF? (2025)

https://gamersnexus.net/news/ram-wtf
1•pabs3•42m ago•0 comments

Sniffer dogs can detect wildlife trafficking via shipping container air samples

https://phys.org/news/2026-02-sniffer-dogs-wildlife-trafficking-shipping.html
1•PaulHoule•43m ago•0 comments

Instagram to discontinue end-to-end encryption for DMs

https://www.androidpolice.com/instagram-is-getting-rid-of-end-to-end-encryption-for-dms/
2•zugi•44m ago•0 comments

Gallo-Roman dodecahedron: twelve faces, zero answers?

https://nunc.ch/en/gallo-roman-dodecahedron-twelve-faces-zero-answers/
1•az09mugen•46m ago•0 comments

What Does Extreme Wealth Do to the Brain?

https://nymag.com/intelligencer/article/what-does-extreme-wealth-do-to-the-brain.html
1•pseudolus•47m ago•1 comment

Microplastics that accumulate in the body may 'clog up' immune cells

https://www.livescience.com/health/microplastics-that-accumulate-in-the-body-may-clog-up-immune-c...
1•jhncls•48m ago•0 comments

Our Experience with I-Ready

https://moultano.wordpress.com/2026/03/12/our-experience-with-i-ready/
6•barry-cotter•48m ago•1 comment