Show HN: Dicta.to – Local voice dictation for Mac with on-device AI

https://dicta.to/
2•alamparelli•3h ago
I built a macOS dictation app where everything runs on-device. Transcription, auto-correct, translation. No audio or text leaves your machine.

It ships with 4 transcription engines you can swap between: WhisperKit (99 languages), NVIDIA Parakeet TDT 0.6B (25 European languages, fastest of the bunch), Qwen3-ASR 0.6B (30 languages), and Apple Speech on macOS 26+. They all run through CoreML/Metal. Whisper is the most versatile, Parakeet wins on raw latency for European languages, Qwen3 does better with CJK. I went with a protocol-based architecture so you pick the engine that fits your use case instead of me pretending one model rules them all.
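A minimal sketch of what a protocol-based engine layer could look like, which also shows how auto-selection by language might sit on top of it. The protocol, `StubEngine`, and `selectEngine` are hypothetical names for illustration; the post does not show the actual types, and the language sets used with them are placeholders.

```swift
import Foundation

/// Hypothetical protocol: the post only says the architecture is protocol-based.
protocol TranscriptionEngine {
    var name: String { get }
    /// Whether the engine handles a language code such as "en" or "ja".
    func supports(language: String) -> Bool
    /// Turns raw audio samples into text.
    func transcribe(samples: [Float], language: String) throws -> String
}

/// Stand-in engine so the sketch runs without any model files.
struct StubEngine: TranscriptionEngine {
    let name: String
    let languages: Set<String>

    func supports(language: String) -> Bool {
        languages.contains(language)
    }

    func transcribe(samples: [Float], language: String) throws -> String {
        "[\(name)] \(samples.count) samples, language \(language)"
    }
}

/// Returns the first engine, in the caller's preference order,
/// that supports the requested language.
func selectEngine(for language: String,
                  from engines: [TranscriptionEngine]) -> TranscriptionEngine? {
    engines.first { $0.supports(language: language) }
}
```

With the engines ordered by preference (for example fastest first), a manual picker and an automatic default can share the same list: the user's choice overrides whatever `selectEngine` would return.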

After transcription, there's an optional post-processing pipeline using Apple Intelligence (FoundationModels framework, macOS 26+, also fully on-device): auto-correct with filler word removal, tone rewriting, translation. The annoying part was FoundationModels cold start. First inference after idle takes 2-3s, which kills the experience. I worked around it by firing a throwaway mini-inference (`session.respond(to: "ok")`) in parallel while audio is still being transcribed, so the model is already warm when the text arrives. Hacky, but it shaved off the perceived latency.
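The warm-up trick can be sketched with Swift structured concurrency. `ModelSession` and `transcribe` below are hypothetical stand-ins with simulated delays, not the FoundationModels API or the app's real code; only the shape of the overlap is the point.

```swift
import Foundation

/// Hypothetical stand-in for the on-device language model session.
actor ModelSession {
    private(set) var isWarm = false

    func respond(to prompt: String) async -> String {
        if !isWarm {
            // Simulated cold start; the post reports 2-3 s for the real model.
            try? await Task.sleep(nanoseconds: 200_000_000)
            isWarm = true
        }
        // Placeholder "post-processing": trim surrounding whitespace.
        return prompt.trimmingCharacters(in: .whitespaces)
    }
}

/// Hypothetical transcription step with a simulated delay.
func transcribe(_ samples: [Float]) async -> String {
    try? await Task.sleep(nanoseconds: 200_000_000)
    return " hello world "
}

func dictate(samples: [Float], session: ModelSession) async -> String {
    // Start the throwaway inference and the transcription concurrently.
    async let warmup = session.respond(to: "ok")
    async let text = transcribe(samples)

    _ = await warmup                      // discard the warm-up result
    let raw = await text
    return await session.respond(to: raw) // session is already warm here
}
```

Because both child tasks start together, the cold-start cost overlaps with transcription instead of being paid after it.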

Getting transcribed text into any arbitrary macOS app was honestly the hardest part. I use clipboard save/restore: read all NSPasteboard types (not just strings, also images, RTF, whatever the user had copied), write the transcribed text, simulate Cmd+V via CGEvent posted to `cghidEventTap`, then restore the original clipboard. Electron apps are slower to process paste events, so I detect them by checking if `Contents/Frameworks/Electron Framework.framework` exists in the app bundle and add extra delay. This whole approach requires Accessibility permissions, which means no sandbox, which means no App Store. I'm fine with that trade-off.
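A rough sketch of the sequence described above, assuming the process already has Accessibility permission. The function names and both delay values are illustrative guesses, not taken from the app. The pasteboard part is wrapped in `canImport(AppKit)`, so only the Electron check compiles on other platforms.

```swift
import Foundation

/// Checks for the framework path named in the post inside an app bundle.
func isElectronApp(bundleURL: URL) -> Bool {
    let framework = bundleURL
        .appendingPathComponent("Contents/Frameworks/Electron Framework.framework")
    return FileManager.default.fileExists(atPath: framework.path)
}

/// Wait before restoring the clipboard. Both values are assumptions.
func pasteDelay(forAppAt bundleURL: URL) -> TimeInterval {
    isElectronApp(bundleURL: bundleURL) ? 0.30 : 0.05
}

#if canImport(AppKit)
import AppKit

/// Raw data for every type of every pasteboard item (strings, RTF, images, ...).
typealias ClipboardSnapshot = [[NSPasteboard.PasteboardType: Data]]

func saveClipboard(_ pb: NSPasteboard) -> ClipboardSnapshot {
    let items = pb.pasteboardItems ?? []
    return items.map { item -> [NSPasteboard.PasteboardType: Data] in
        var contents: [NSPasteboard.PasteboardType: Data] = [:]
        for type in item.types {
            if let data = item.data(forType: type) { contents[type] = data }
        }
        return contents
    }
}

func restoreClipboard(_ snapshot: ClipboardSnapshot, to pb: NSPasteboard) {
    pb.clearContents()
    let items = snapshot.map { contents -> NSPasteboardItem in
        let item = NSPasteboardItem()
        for (type, data) in contents { _ = item.setData(data, forType: type) }
        return item
    }
    if !items.isEmpty { _ = pb.writeObjects(items) }
}

/// Posts Cmd+V key-down and key-up events to the HID event tap.
func simulatePaste() {
    let source = CGEventSource(stateID: .combinedSessionState)
    let vKey: CGKeyCode = 9 // kVK_ANSI_V
    for keyDown in [true, false] {
        let event = CGEvent(keyboardEventSource: source, virtualKey: vKey, keyDown: keyDown)
        event?.flags = .maskCommand
        event?.post(tap: .cghidEventTap)
    }
}

@MainActor
func inject(_ text: String, intoAppAt bundleURL: URL) async {
    let pb = NSPasteboard.general
    let saved = saveClipboard(pb)
    pb.clearContents()
    _ = pb.setString(text, forType: .string)
    simulatePaste()
    // Give the target app time to read the pasteboard before restoring it.
    let delay = pasteDelay(forAppAt: bundleURL)
    try? await Task.sleep(nanoseconds: UInt64(delay * 1_000_000_000))
    restoreClipboard(saved, to: pb)
}
#endif
```

The fixed delay is the fragile part: if the target app reads the pasteboard after the restore, it pastes the old clipboard contents instead of the transcription.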

Built this solo in about 6 weeks. One-time purchase, no subscription.

I'm genuinely unsure about the multi-engine approach. Is letting users choose between Whisper/Parakeet/Qwen3 useful, or would most people prefer I just auto-select based on their language? Also curious if anyone has a cleaner approach to text injection on macOS. The clipboard hack works everywhere but it feels fragile and I don't love it.

Show HN: enveil – hide your .env secrets from prAIng eyes

https://github.com/GreatScott/enveil
173•parkaboy•12h ago•107 comments

Show HN: X86CSS – An x86 CPU emulator written in CSS

https://lyra.horse/x86css/
229•rebane2001•14h ago•70 comments

Show HN: Ghist – Task management that lives in your repo

https://github.com/unnecessary-special-projects/ghist
3•nxnze•31m ago•0 comments

Show HN: Steerling-8B, a language model that can explain any token it generates

https://www.guidelabs.ai/post/steerling-8b-base-model-release/
277•adebayoj•16h ago•82 comments

Show HN: PgDog – Scale Postgres without changing the app

https://github.com/pgdogdev/pgdog
308•levkk•1d ago•56 comments

Show HN: Cellarium: A Playground for Cellular Automata

https://github.com/andrewosh/cellarium
36•andrewosh•3d ago•1 comment

Show HN: Babyshark – Wireshark made easy (terminal UI for PCAPs)

https://github.com/vignesh07/babyshark
135•eigen-vector•20h ago•45 comments

Show HN: If Discord, Reddit, X, IRC and 4chan had a baby

2•ignasheahy•24m ago•0 comments

Show HN: Awsim – Lightweight AWS emulator in Go (40 services in progress)

https://github.com/sivchari/awsim
2•sivchari•2h ago•0 comments

Show HN: Sowbot – Open-hardware agricultural robot (ROS2, RTK GPS)

https://sowbot.co.uk/
169•Sabrees•1d ago•46 comments

Show HN: AI-native SDLC – 156 test docs, 16 skills, 1 human

https://github.com/gpgkd906/auth9
3•gpgkd906•3h ago•1 comment

Show HN: Open-source LLM and dataset for sports forecasting (Pro Golf)

https://huggingface.co/LightningRodLabs/Golf-Forecaster
6•bturtel•28m ago•0 comments

Show HN: AI phone assistant that became a lifeline for people who can't speak

https://mio.gg/
3•talyuk•3h ago•1 comment

Show HN: AI Timeline – 171 LLMs from Transformer (2017) to GPT-5.3 (2026)

https://llm-timeline.com/
165•ai_bot•1d ago•58 comments

Show HN: Turn human decisions into blocking tool-calls for AI agents (iOS+CLI)

https://github.com/egradman/extendo-cli
2•egradman2•1h ago•0 comments

Show HN: Tessera – An open protocol for AI-to-AI knowledge transfer

https://github.com/incocreativedev/tessera-core
3•kirkmaddocks•4h ago•2 comments

Show HN: WebPerceptor – Enabling AI Mediated Web Browsing

https://github.com/theartofhci/WebPerceptor
2•theartofhci•4h ago•0 comments

Show HN: Claude Copy – Drop-in fix for Claude Code's broken copy-paste

https://github.com/andersmyrmel/claude-copy
2•andersmyrmel•4h ago•0 comments

Show HN: CIA World Factbook Archive (1990–2025), searchable and exportable

https://cia-factbook-archive.fly.dev/
484•MilkMp•1d ago•99 comments

Show HN: Git-native-issue – issues stored as commits in refs/issues/

https://github.com/remenoscodes/git-native-issue
5•remenoscodes•6h ago•3 comments

Show HN: L88 – A Local RAG System on 8GB VRAM (Need Architecture Feedback)

7•adithyadrdo•12h ago•1 comment

Show HN: 3D Mahjong, Built in CSS

https://voxjong.com
134•rofko•2d ago•60 comments

Show HN: Notion-CLI – Full Notion API from the terminal, 39 commands, one binary

https://github.com/4ier/notion-cli
3•4ier•10h ago•1 comment

Show HN: Agent Multiplexer – manage Claude Code via tmux

https://github.com/mixpeek/amux
12•Beefin•1d ago•2 comments

Show HN: BVisor – An Embedded Bash Sandbox, 2ms Boot, Written in Zig

https://github.com/butter-dot-dev/bVisor
19•edunteman•23h ago•5 comments

Show HN: A geometric analysis of Chopin's Prelude No. 4 using 3D topology

https://github.com/jimishol/cholidean-harmony-structure/blob/main/docs/03-case-study-chopin-prelu...
48•jimishol•4d ago•11 comments

Show HN: AgentBudget – Real-time dollar budgets for AI agents

https://github.com/sahiljagtap08/agentbudget
6•sahiljagtapyc•11h ago•6 comments

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

https://github.com/xaskasdf/ntransformer
389•xaskasdf•2d ago•102 comments

Show HN: ClinTrialFinder – AI-powered clinical trial matching for cancer patients

https://www.clintrialfinder.info
2•chncwang•12h ago•0 comments