frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open-source customizable AI voice dictation built on Pipecat

https://github.com/kstonekuan/tambourine-voice
8•kstonekuan•7h ago
Tambourine is an open source, fully customizable voice dictation system that lets you control STT/ASR, LLM formatting, and prompts for inserting clean text into any app.

I have been building this on the side for a few weeks. What motivated it was wanting a customizable version of Wispr Flow where I could fully control the models, formatting, and behavior of the system, rather than relying on a black box.

Tambourine is built directly on top of Pipecat and relies on its modular voice agent framework. The back end is a local Python server that uses Pipecat to stitch together STT and LLM models into a single pipeline. This modularity is what makes it easy to swap providers, experiment with different setups, and maintain fine-grained control over the voice AI.

I shared an early version with friends and recently presented it at my local Claude Code meetup. The response was overwhelmingly positive, and I was encouraged to share it more widely.

The desktop app is built with Tauri. The front end is written in TypeScript, while the Tauri layer uses Rust to handle low level system integration. This enables the registration of global hotkeys, management of audio devices, and reliable text input at the cursor on both Windows and macOS.

At a high level, Tambourine gives you a universal voice interface across your OS. You press a global hotkey, speak, and formatted text is typed directly at your cursor. It works across emails, documents, chat apps, code editors, and terminals.

Under the hood, audio is streamed from the TypeScript front end to the Python server via WebRTC. The server runs real-time transcription with a configurable STT provider, then passes the transcript through an LLM that removes filler words, adds punctuation, and applies custom formatting rules and a personal dictionary. STT and LLM providers, as well as prompts, can be switched without restarting the app.

The project is still under active development. I am working through edge cases and refining the UX, and there will likely be breaking changes, but most core functionality already works well and has become part of my daily workflow.

I would really appreciate feedback, especially from anyone interested in the future of voice as an interface.

Comments

lrvick•2h ago
Is there a way to do this with a local LLM, without any internet access needed?
grayhatter•2h ago
I don't think I'd call anything that only works with a proprietary Internet hosted LLM (one you need an account to use) open-source.

This is less voice dictation software, and much more a shim to [popular LLM provider]

Show HN: Cargo-rail: graph-aware monorepo tooling for Rust; 11 deps

https://github.com/loadingalias/cargo-rail
54•LoadingALIAS•4d ago•3 comments

Show HN: Open-source customizable AI voice dictation built on Pipecat

https://github.com/kstonekuan/tambourine-voice
8•kstonekuan•7h ago•2 comments

Show HN: Cardly – Manage, track, and get alerts for your gift card balances

https://cardlyai.app/
3•Pastaza•2h ago•1 comments

Show HN: A meditation timer without guidance, music, or growth mechanics

https://www.centertimer.com
2•tannerc•3h ago•0 comments

Show HN: Hacker News Christmas Colors Browser Extension

https://github.com/FreedomBen/hacker-news-christmas-colors-browser-ext
4•freedomben•3h ago•0 comments

Show HN: Chat with a Random AI

https://randomai.vercel.app/
2•borisandcrispin•3h ago•0 comments

Show HN: User.mom – Everything you need to reach Product-Market-Fit

https://user.mom
5•Wronnay•5h ago•3 comments

Show HN: Depup – a dependency upgrade advisor for Python projects

https://github.com/saran-damm/depup
3•saran-damm•6h ago•0 comments

Show HN: Llmwalk – explore the answer-space of open LLMs

https://github.com/samwho/llmwalk
2•samwho•6h ago•0 comments

Show HN: PhotoToVideoAI – AI photo to video generator

https://phototovideoai.org
3•qzcanoe•8h ago•1 comments

Show HN: 999 Penguins

https://999penguins.com
8•learntocode222•9h ago•1 comments

Show HN: I made a spreadsheet where formulas also update backwards

https://victorpoughon.github.io/bidicalc/
245•fouronnes3•3d ago•113 comments

Show HN: I built time to read all the things I want to

https://read-fast.replit.app/
4•thomoliverz•10h ago•2 comments

Show HN: Tiny VM sandbox in C with apps in Rust, C and Zig

https://github.com/ringtailsoftware/uvm32
190•trj•2d ago•12 comments

Show HN: LinkedQL – Live Queries over Postgres, MySQL, MariaDB

https://github.com/linked-db/linked-ql
30•phrasecode•6d ago•21 comments

Show HN: Listened to your feedback, Critical CSS Generator

https://kigo.studio/tools/critical-css-generator
10•stevenpotts•6d ago•4 comments

Show HN: Tripwire: A new anti evil maid defense

https://github.com/fr33-sh/Tripwire
81•DoctorFreeman•3d ago•48 comments

Show HN: Local Privacy Firewall-blocks PII and secrets before ChatGPT sees them

https://github.com/privacyshield-ai/privacy-firewall
111•arnabkarsarkar•5d ago•54 comments

Show HN: Gemini Pro 3 imagines the HN front page 10 years from now

https://dosaygo-studio.github.io/hn-front-page-2035/news
3329•keepamovin•5d ago•962 comments

Show HN: KV and wide-column database with CDN-scale replication

https://github.com/ankur-anand/unisondb
6•ankuranand•16h ago•2 comments

Show HN: I built this after leaking my AWS keys on StackOverflow

https://codeanswr.com
2•saradev2025•16h ago•2 comments

Show HN: I built a one-click coin flip with no ads or tracking

https://wheelpage.com/coin-flip/
3•nalife520•16h ago•3 comments

Show HN: Tic Tac Flip – A new strategic game based on Tic Tac Toe

https://tic-tac-flip.web.app/
3•breezefox•17h ago•2 comments

Show HN: Sim – Apache-2.0 n8n alternative

https://github.com/simstudioai/sim
238•waleedlatif1•3d ago•60 comments

Show HN: Autofix Bot – Hybrid static analysis and AI code review agent

35•sanketsaurav•3d ago•13 comments

Show HN: Wirebrowser – A JavaScript debugger with breakpoint-driven heap search

https://github.com/fcavallarin/wirebrowser
70•fcavallarin•4d ago•15 comments

Show HN: A 2-row, 16-key keyboard designed for smartphones

https://k-keyboard.com/Why-QWERTY-mini
83•QWERTYmini•4d ago•68 comments

Show HN: Automated license plate reader coverage in the USA

https://alpranalysis.com
239•sodality2•4d ago•146 comments

Show HN: I'm building an open-source Amazon

https://openship.org
49•theturtletalks•2d ago•28 comments

Show HN: I built a system for active note-taking in regular meetings like 1-1s

https://withdocket.com
176•davnicwil•6d ago•132 comments