Show HN: Dicta.to – Local voice dictation for Mac with on-device AI

https://dicta.to/
2•alamparelli•1h ago
I built a macOS dictation app where everything runs on-device: transcription, auto-correct, and translation. No audio or text leaves your machine.

It ships with 4 transcription engines you can swap between: WhisperKit (99 languages), NVIDIA Parakeet TDT 0.6B (25 European languages, fastest of the bunch), Qwen3-ASR 0.6B (30 languages), and Apple Speech on macOS 26+. They all run through CoreML/Metal. Whisper is the most versatile, Parakeet wins on raw latency for European languages, Qwen3 does better with CJK. I went with a protocol-based architecture so you pick the engine that fits your use case instead of me pretending one model rules them all.
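A minimal sketch of what a protocol-based engine layer could look like. This is not Dicta's actual code: `TranscriptionEngine`, `StubEngine`, and `Dictation` are hypothetical names, and the stub stands in for a real CoreML-backed model.

```swift
import Foundation

// Hypothetical protocol: every engine exposes the same entry point,
// so the rest of the app never needs to know which model is running.
protocol TranscriptionEngine: Sendable {
    var name: String { get }
    func transcribe(samples: [Float], language: String) async throws -> String
}

// Stand-in engine for illustration. A real one would run a CoreML model here.
struct StubEngine: TranscriptionEngine {
    let name: String

    func transcribe(samples: [Float], language: String) async throws -> String {
        return "[\(name)/\(language)] \(samples.count) samples"
    }
}

// Holds whichever engine the user picked and lets them swap it at runtime.
struct Dictation {
    private(set) var engine: any TranscriptionEngine

    init(engine: any TranscriptionEngine) {
        self.engine = engine
    }

    mutating func use(_ newEngine: any TranscriptionEngine) {
        engine = newEngine
    }

    func run(samples: [Float], language: String) async throws -> String {
        return try await engine.transcribe(samples: samples, language: language)
    }
}
```

Swapping engines is then a single `use(_:)` call, and adding a fifth engine means adding one conforming type without touching the callers.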

After transcription, there's an optional post-processing pipeline using Apple Intelligence (FoundationModels framework, macOS 26+, also fully on-device): auto-correct with filler word removal, tone rewriting, translation. The annoying part was the FoundationModels cold start: the first inference after idle takes 2-3 s, which kills the experience. I worked around it by firing a throwaway mini-inference (`session.respond(to: "ok")`) in parallel while audio is still being transcribed, so the model is already warm when the text arrives. Hacky, but it shaved off the perceived latency.
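The warm-up trick can be sketched with Swift structured concurrency. This is an illustration under assumptions, not the app's code: `TextModel` is a hypothetical stand-in for the FoundationModels session (only the `respond(to:)` call shape comes from the post), and the correction prompt is made up.

```swift
import Foundation

// Hypothetical stand-in for the on-device language model session.
protocol TextModel: Sendable {
    func respond(to prompt: String) async throws -> String
}

// Throwaway inference: the result is ignored and errors are swallowed,
// since the only goal is to get the model loaded.
func warm(_ model: any TextModel) async {
    _ = try? await model.respond(to: "ok")
}

func dictate(
    transcribe: @Sendable () async throws -> String,
    model: any TextModel
) async throws -> String {
    // Starts running immediately as a child task.
    async let warmUp: Void = warm(model)

    // Transcription proceeds while the warm-up is in flight.
    let raw = try await transcribe()

    // Make sure the warm-up has finished before the real request.
    await warmUp

    // Hypothetical prompt, for illustration only.
    return try await model.respond(to: "Fix typos and remove filler words: \(raw)")
}
```

If transcription takes longer than the cold start, the `await warmUp` line returns immediately and the user never sees the 2-3 s penalty.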

Getting transcribed text into any arbitrary macOS app was honestly the hardest part. I use clipboard save/restore: read all NSPasteboard types (not just strings, also images, RTF, whatever the user had copied), write the transcribed text, simulate Cmd+V via CGEvent posted to `cghidEventTap`, then restore the original clipboard. Electron apps are slower to process paste events, so I detect them by checking if `Contents/Frameworks/Electron Framework.framework` exists in the app bundle and add extra delay. This whole approach requires Accessibility permissions, which means no sandbox, which means no App Store. I'm fine with that trade-off.
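For readers who want to see the moving parts, here is a rough sketch of the clipboard save/restore flow described above. Function names and delay values are assumptions chosen for illustration, and posting the event only works once the process has been granted Accessibility permission.

```swift
import AppKit

typealias ClipboardSnapshot = [[NSPasteboard.PasteboardType: Data]]

// Save every type of every item, so images, RTF, etc. survive the round trip.
func snapshot(_ pb: NSPasteboard) -> ClipboardSnapshot {
    return (pb.pasteboardItems ?? []).map { item in
        var saved: [NSPasteboard.PasteboardType: Data] = [:]
        for type in item.types {
            if let data = item.data(forType: type) { saved[type] = data }
        }
        return saved
    }
}

// Put the saved items back on the pasteboard.
func restore(_ saved: ClipboardSnapshot, to pb: NSPasteboard) {
    pb.clearContents()
    let items = saved.map { entry -> NSPasteboardItem in
        let item = NSPasteboardItem()
        for (type, data) in entry { _ = item.setData(data, forType: type) }
        return item
    }
    if !items.isEmpty { _ = pb.writeObjects(items) }
}

// Electron apps ship this framework inside their bundle.
func isElectronApp(bundleURL: URL) -> Bool {
    let framework = bundleURL
        .appendingPathComponent("Contents/Frameworks/Electron Framework.framework")
    return FileManager.default.fileExists(atPath: framework.path)
}

// Simulate Cmd+V: key down, then key up, both with the Command flag set.
func postCommandV() {
    let source = CGEventSource(stateID: .combinedSessionState)
    let vKey: CGKeyCode = 9 // kVK_ANSI_V
    for keyDown in [true, false] {
        let event = CGEvent(keyboardEventSource: source, virtualKey: vKey, keyDown: keyDown)
        event?.flags = .maskCommand
        event?.post(tap: .cghidEventTap)
    }
}

func inject(_ text: String) async throws {
    let pb = NSPasteboard.general
    let saved = snapshot(pb)

    pb.clearContents()
    _ = pb.setString(text, forType: .string)
    postCommandV()

    // Give the target app time to read the pasteboard before restoring it.
    // Both delays are guesses, not measured values.
    let frontApp = NSWorkspace.shared.frontmostApplication?.bundleURL
    let isSlow = frontApp.map(isElectronApp(bundleURL:)) ?? false
    try await Task.sleep(nanoseconds: isSlow ? 300_000_000 : 100_000_000)

    restore(saved, to: pb)
}
```

The fragile part is the fixed delay: restore too early and the target app pastes the old clipboard, restore too late and the user's own copy/paste can race with it.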

Built this solo in about 6 weeks. One-time purchase, no subscription.

I'm genuinely unsure about the multi-engine approach. Is letting users choose between Whisper/Parakeet/Qwen3 useful, or would most people prefer I just auto-select based on their language? Also curious if anyone has a cleaner approach to text injection on macOS. The clipboard hack works everywhere but it feels fragile and I don't love it.
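One middle ground between the two options is an automatic default that the user can still override. A small sketch, assuming the language preferences stated in the post; the language sets below are illustrative subsets, not the engines' real support lists, and all names are hypothetical.

```swift
import Foundation

enum Engine: String {
    case whisper, parakeet, qwen3
}

// Illustrative subsets only. The real lists would come from each model's docs.
let cjkLanguages: Set<String> = ["zh", "ja", "ko"]
let europeanSample: Set<String> = ["en", "fr", "de", "es", "it"]

// Pick a default from the base language code ("ja-JP" and "ja_JP" both map to "ja").
func defaultEngine(forLanguage code: String) -> Engine {
    let base = code.lowercased()
        .split(whereSeparator: { $0 == "-" || $0 == "_" })
        .first
        .map(String.init) ?? ""
    if cjkLanguages.contains(base) { return .qwen3 }
    if europeanSample.contains(base) { return .parakeet }
    return .whisper // broadest language coverage, so it is the fallback
}

// An explicit user choice always wins over the automatic default.
func engine(forLanguage code: String, userChoice: Engine?) -> Engine {
    return userChoice ?? defaultEngine(forLanguage: code)
}
```

With this shape, most users never open the engine picker, while anyone who cares about the latency or accuracy trade-off can still pin a specific engine.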

Waymo Opens 4 New Cities to Public Riders (Now at 10 Total)

https://techcrunch.com/2026/02/24/waymo-robotaxis-are-now-operating-in-10-us-cities/
1•NullHypothesist•1m ago•0 comments

Teens Use and View AI

https://www.pewresearch.org/internet/2026/02/24/how-teens-use-and-view-ai/
1•swolpers•2m ago•0 comments

Lamborghini cancels electric Lanzador as supercar buyers reject EVs

https://arstechnica.com/cars/2026/02/lamborghini-drops-ev-plan-in-favor-of-future-plug-in-hybrids/
1•voxadam•3m ago•0 comments

Show HN: Tacit – The missing Layer 3 of the AI agent stack (open source)

https://github.com/tacitprotocol/tacit
1•ms170888•3m ago•0 comments

Dental group offers to fix Olympic Jack Hughes' smile for free

https://fox56.com/news/local/nepa-dental-group-offers-to-fix-jack-hughes-smile-after-toothless-gr...
1•DivingForGold•4m ago•1 comments

AI's Math Tricks Don't Work for Scientific Computing

https://spectrum.ieee.org/number-formats-ai-scientific-computing
1•rjmunro•5m ago•0 comments

Show HN: TTSLab – Text-to-speech that runs in the browser via WebGPU

https://ttslab.dev
1•MbBrainz•5m ago•0 comments

AIProx: An open registry and manifest standard for autonomous agent discovery

1•LightProx•5m ago•0 comments

Anthropic Links AI Agent with Tools for Investment Banking, HR

https://www.bloomberg.com/news/articles/2026-02-24/anthropic-links-ai-agent-with-tools-for-invest...
1•swolpers•8m ago•0 comments

OpenAI safety reps called to Ottawa after Tumbler Ridge, B.C., mass shooting

https://www.cbc.ca/news/politics/open-ai-summoned-ottawa-tumbler-ridge-9.7103281
3•ChrisArchitect•10m ago•1 comments

files

1•zerosizedweasle•10m ago•0 comments

Show HN: A minimal coding agent in Elixir (Erlang/OTP)

https://github.com/matteing/opal
1•sergiomattei•10m ago•0 comments

Near-Instantly Aborting the Worst Pain Imaginable with Psychedelics

https://psychotechnology.substack.com/p/near-instantly-aborting-the-worst
1•surprisetalk•11m ago•0 comments

Change your default date format to the least ambiguous

https://practicalbetterments.com/change-your-default-date-format-to-the-least-ambiguous/
1•surprisetalk•11m ago•1 comments

Georgist land taxes balance community benefit and the efficiency of markets (2024)

https://devon.postach.io/post/georgist-land-taxes-balance-community-benefit-the-efficiency-of-mar...
1•surprisetalk•11m ago•0 comments

Pecking Order and Flight Leadership (2019)

https://srconstantin.wordpress.com/2019/04/29/pecking-order-and-flight-leadership/
1•surprisetalk•11m ago•0 comments

Apple's Multibillion-Dollar Push to Make Chips in the U.S. [video]

https://www.youtube.com/watch?v=ktFlaBhpMu8
1•tambourine_man•12m ago•0 comments

A catecholamine-independent pathway controlling adaptive adipocyte lipolysis

https://www.nature.com/articles/s42255-025-01424-5
1•PaulHoule•13m ago•0 comments

Show HN: Search through half a million works of art using natural language

https://artexplorer.ai/
1•stefanvdw1•13m ago•0 comments

Show HN: Rappelo – A small tool for solopreneurs to capture leads faster

https://rappelo.com
1•AlexandruEneDev•13m ago•1 comments

EWM: The Emacs Wayland Manager

https://codeberg.org/ezemtsov/ewm
2•dargscisyhp•13m ago•0 comments

MapReduce: Simplified Data Processing on Large Clusters (2004) [pdf]

https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf
1•vinhnx•14m ago•0 comments

Show HN: OpenPDB – Generate AI agents with real personalities

https://github.com/gitsual/openpdb
1•gitsual•15m ago•0 comments

Paxos made simple (2001) [pdf]

https://lamport.azurewebsites.net/pubs/paxos-simple.pdf
1•vinhnx•17m ago•0 comments

OpenAI calls in the consultants for its enterprise push

https://techcrunch.com/2026/02/23/openai-calls-in-the-consultants-for-its-enterprise-push/
1•signa11•17m ago•0 comments

Distributed Systems for Fun and Profit

https://book.mixu.net/distsys/single-page.html
2•vinhnx•17m ago•0 comments

Lime's billing model is encouraging cyclists to run red lights

https://tk.gg/posts/lime-bikes-should-stop-charging-when-you-stop
1•rustyhancock•18m ago•0 comments

The Coming War on General Computation

https://en.wikisource.org/wiki/The_Coming_War_on_General_Computation
2•bondarchuk•18m ago•0 comments

AI Removed Every Bottleneck Except One: Cognitive Load

https://medium.com/@a.mandyev/ai-removed-every-bottleneck-except-one-3f25b509f26e
2•andrey_m•19m ago•1 comments

The $10T Fight: Modeling a US-China War over Taiwan

http://www.bloomberg.com/news/articles/2026-02-10/the-10-trillion-fight-modeling-a-us-china-war-o...
1•nkurz•20m ago•1 comments