frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX)

https://edgewhisper.com
2•raphaelmansuy•2h ago
I built a macOS voice dictation app where zero bytes of audio ever leave your machine.

EdgeWhisper runs Voxtral Mini 4B Realtime (Mistral AI, Apache 2.0) locally on Apple Silicon via the MLX framework. Hold a key, speak, release — text appears at your cursor in whatever app has focus.

Architecture: - Native Swift (SwiftUI + AppKit). No Electron. - Voxtral 4B inference via MLX on the Neural Engine. ~3GB model, runs in ~2GB RAM on M1+. - Dual text injection: AXUIElement (preserves undo stack) with NSPasteboard+CGEvent fallback. - 6-stage post-processing pipeline: filler removal → dictionary → snippets → punctuation → capitalization → formatting. - Sliding window KV cache for unlimited streaming without latency degradation. - Configurable transcription delay (240ms–2.4s). Sweet spot at 480ms.

What it does well: - Works in 20+ terminals/IDEs (VS Code, Xcode, iTerm2, Warp, JetBrains). Most dictation tools break in terminals — we detect them and switch injection strategy. - Removes filler words automatically ("um", "uh", "like"). - 13 languages with auto-detection. - Personal dictionary + snippet expansion with variable support ({{date}}, {{clipboard}}). - Works fully offline after model download. No accounts, no telemetry, no analytics.

What it doesn't do (yet): - No file/meeting transcription (coming) - No translation (coming) - No Linux/Windows (macOS only, Apple Silicon required)

Pricing: Free tier (5 min/day, no account needed). Pro at $7.99/mo or $79.99/yr.

I'd love feedback on: 1. Would local LLM post-processing (e.g., Phi-4-mini via MLX) for grammar/tone be worth the extra ~1GB RAM? 2. For developers using voice→code workflows: what context would you want passed to your editor? 3. Anyone else building on Voxtral Realtime? Curious about your experience with the causal audio encoder.

Comments

Dumbledumb•1h ago
I have been using parakeet TDT v3 with just 0.6B params and its insanely fast (feels instant, even on M1 Air). The accuracy is all I could ask for - I dont see the benefit of a much larger 4B model?

Not knocking your app, but asking before your app seems very focused on one model, while others allow the user to pick according to their needs.

Show HN: Channel Surfer – Watch YouTube like it’s cable TV

https://channelsurfer.tv
400•kilroy123•2d ago•136 comments

Show HN: Context Gateway – Compress agent context before it hits the LLM

https://github.com/Compresr-ai/Context-Gateway
61•ivzak•7h ago•44 comments

Show HN: I wrote my first neural network

https://github.com/stupid-genius/Perceptron
3•allenng•56m ago•0 comments

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025)

https://88mph.fm/
98•matteocantiello•3d ago•44 comments

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX)

https://edgewhisper.com
2•raphaelmansuy•2h ago•1 comments

Show HN: Svglib a SVG parser and renderer for Windows

https://github.com/bibhas2/svglib
9•leopoldj•3d ago•1 comments

Show HN: Compressor.app – Compress almost any file format

https://compressor.app
3•matylla•3h ago•0 comments

Show HN: Hardened OpenClaw on AWS with Terraform

https://github.com/infrahouse/terraform-aws-openclaw
2•aleks2•3h ago•0 comments

Show HN: Better HN – Realtime Comment Updates and Cleaner Look

https://github.com/xpl/better-hn
4•xpl•3h ago•0 comments

Show HN: Axe – A 12MB binary that replaces your AI framework

https://github.com/jrswab/axe
211•jrswab•1d ago•118 comments

Show HN: A single CLI to manage llama.cpp/vLLM/Ollama models

https://github.com/av/harbor/releases/tag/v0.4.4
2•everlier•4h ago•1 comments

Show HN: Loop your agents like a dandy little b*tch

https://github.com/geekforbrains/loopsie
5•geekforbrains•4h ago•0 comments

Show HN: ShellSelf – A Developer Portfolio That Feels Like Home

https://www.shellself.com/
6•truetaurus•4h ago•1 comments

Show HN: Anthrology – Time-Traveling Radio

https://anthrology.site/
4•airstrike•5h ago•5 comments

Show HN: Mutate – free inline text replacement for Mac

https://github.com/robert-v/Mutate-public
2•rob3rth•5h ago•1 comments

Show HN: OneCLI – Vault for AI Agents in Rust

https://github.com/onecli/onecli
155•guyb3•1d ago•49 comments

Show HN: DJX – Convention over Configuration for Django (Rails-Inspired CLI)

4•RedsonNgwira•5h ago•2 comments

Show HN: An addendum to the Agile Manifesto for the AI era

https://github.com/brackishman/Agile-Manifesto-AI-Addendum
7•brackishman•4h ago•12 comments

Show HN: Tiny macOS app that adds a facecam bubble to screen recordings

https://github.com/backnotprop/CamBubble
5•ramoz•6h ago•0 comments

Show HN: Rudel – Claude Code Session Analytics

https://github.com/obsessiondb/rudel
140•keks0r•1d ago•83 comments

Show HN: Understudy – Teach a desktop agent by demonstrating a task once

https://github.com/understudy-ai/understudy
112•bayes-song•1d ago•41 comments

Show HN: Mjmx – render mjml using JSX

https://mjmx.dev/
3•skwee357•6h ago•0 comments

Show HN: AgentLog – a lightweight event bus for AI agents using JSONL logs

https://github.com/sumant1122/agentlog
4•paperplaneflyr•6h ago•0 comments

Show HN: Execute local LLM prompts in remote SSH shell sessions

3•smudgy3746•6h ago•2 comments

Show HN: s@: decentralized social networking over static sites

http://satproto.org/
409•remywang•2d ago•218 comments

Show HN: AI milestone verification for construction using AWS

https://builder.aws.com
2•eugenelotsu•7h ago•0 comments

Show HN: RepoCrunch – CLI to analyze GitHub repos

https://github.com/kimwwk/repocrunch
2•chillkim•8h ago•2 comments

Show HN: OpenClaw docs in Japanese, now open source

https://openclawdoc.org
3•mixfox•8h ago•0 comments

Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
153•theredsix•2d ago•52 comments

Show HN: OpenClaw-class agents on ESP32 (and the IDE that makes it possible)

https://pycoclaw.com/
27•pycoclaw•1d ago•2 comments