frontpage.

Show HN: FreeFlow – Open-Source Wispr Flow

https://github.com/build-trust/freeflow

2•_mrinalwadhwa_•1h ago

Hi HN!

Voice is fast-becoming my primary interface to computers and AI. I built FreeFlow because I wanted a Wispr Flow-like experience for our entire team, but customizable and private.

Press a hotkey, dictate naturally, polished text appears in any app. Ramble, use filler words, correct yourself mid-sentence. FreeFlow turns messy speech into clean writing and injects it wherever your cursor is: your messaging app, your editor, your coding agent, the terminal, email, anything.

Demo (sound on): https://github.com/build-trust/freeflow#demo-sound-on-

It's really fast. The injection feels instantaneous. In my benchmarks two thirds of dictations finish in under 0.6 seconds. To get that speed, the app streams audio to your private server over a persistent WebSocket while you speak, and a realtime speech-to-text model transcribes incrementally, so by the time you release the key the transcript is mostly done. Two independent WebSocket connections race each other, and if both fail, an HTTP batch fallback catches it. The transcript goes through a post-processing step that removes filler words and fixes grammar. About 40% of dictations are clean enough to skip this step entirely. When post-processing is needed, a fast model handles it in about 0.4 seconds.

It's designed to be taken apart and reassembled. You can swap the speech model, rewrite the prompts, add new languages, or fork the entire experience to fit how your team works. I'm hoping people will morph it into other products.

The FreeFlow service is open source. You can self-host it, but running a low-latency streaming dictation service for a team is real infrastructure work: persistent WebSocket connections, streaming routes to speech models, failover, rate limits. At a company with fifty or five hundred people, keeping that reliable is a job in itself. FreeFlow uses Autonomy to make this easy. On first launch, the macOS app deploys the service to a private server. Two minutes, no infrastructure knowledge needed. You can then invite your team. One server handles everyone, no per-seat fees. It sustains thousands of simultaneous streaming connections. In a stress test, 50 people dictating at the same time got sub-second latency with zero failures.

  brew install build-trust/freeflow/freeflow

It's macOS only for now, but I plan to build for other operating systems. The two most useful contributions right now are mic compatibility data (every mic behaves differently) and prompts that improve polish quality for a specific language.

Try it, tell me how it works with your mic and your apps. What's fast, what's slow, what's broken.

GitHub: https://github.com/build-trust/freeflow

Why does a Stochastic Parrot make sense at all?

Capyra – open-source agent runtime for SAP B1 and WhatsApp

The environmental cost of datacentres is rising. Is it time to quit AI?

A Couple of Git Nits

Are we ready for film distribution via USB drives?

I Take My Laptop to the Gym So Claude Doesn't Have Downtime

Show HN: X07, compiled language where agents write correct code on the first try

The 3-Day Starter Plan for Raspberry Pi Beginners

Contiguitas: The Pursuit of Physical Memory Contiguity in Datacenters

Wanted: Europe's Missing Cloud Provider

Free tool to compare SASE vendors side-by-side

Revealed: The worst mega-leaks of methane driving global heating

Death of a Strawman: The Epistemology of a Language Model

Ask HN: With Promptfoo acquired by OpenAI, what are MCP devs using for testing?

Show HN: Specifica – an open format for writing software specs as Markdown

Show HN: I'm trying to help aspiring Data Analysts

UK security adviser attended US-Iran talks and judged deal was within reach

The Great Developer Schism: Process vs. Product [video]

Show HN: MCP Isn't Dead. You're Just Using It Wrong

CBM-BASIC: Commodore BASIC–style interpreter written in C

A collaborative pixel mural where each 16×16 tile is owned and editable

X11 user daemon to automatically run commands triggered by user specified events

Nvidia Built the A.I. Era. Now It Has to Defend It

Show HN: MUP – Interactive UI inside LLM chat, so anyone can use agentic AI

Samsung to Discontinue Galaxy Z TriFold After Just Three Months

VEO – Open-source content-adaptive video encoding optimizer in Go

Trapped Inside a Self-Driving Car During an Anti-Robot Attack

Java 26 Released

The first open-source agentic AI physicist

Quickly Get Your Local and Public IP Address from the Command Line