Show HN: Hitoku Draft – context aware local macOS assistant

https://github.com/Saladino93/hitokudraft
3•lostathome•13h ago
I am working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. No cloud models, nothing leaves your machine. You press a hotkey and you talk.

It's context-aware; it reads your screen, documents, and active app to understand what you're working on. You can ask about PDFs, reply to emails, create calendar events, use web search, all by voice.

It supports Gemma 4 and Qwen 3.5 for text generation, plus multiple STT backends (Parakeet, Whisper, Qwen3-ASR).

Examples:

- Gemma4 in action, https://www.youtube.com/watch?v=OgfI-3YjEVU

- query a pdf document, https://www.youtube.com/watch?v=ggaDhut7FnU

- reply to email, https://www.youtube.com/watch?v=QFnHXMBp1gA

- and the usual voice dictation (with optional polishing)

I currently use it a lot with Claude Code, Obsidian and Apple Notes, or just to read papers.

Code: https://github.com/Saladino93/hitokudraft/tree/litert

Binary download: https://hitoku.me/draft/ (free with code HITOKUHN2026)

I am looking for feedback. My goal is to do AI research that interfaces with clients, and I thought this would be a nice little experiment to iterate and fail quickly.

P.S. (if anyone has tips about this)

The current Gemma4 implementation (with small models) has some problems:

- It hallucinates easily on long contexts, so I had to reset it often. I tuned some parameters, but I still need to find a sweet spot.

- Gemma4 with LiteRT is currently fast compared to the MLX implementation of Qwen3.5 (about 3x faster on my machine when dealing with images), but that speed comes at the price of memory spikes. I believe this is because LiteRT's WebGPU backend can allocate significantly more GPU memory than the model weights alone need (I saw 38 GB of memory in use for the ~4 GB E4B model!). I guess we need to wait for Google to fix this.

- App size: because there is no official Swift package from Google yet, I have to bundle some files (LiteRT dylibs) that add ~98 MB to the previous MLX-only version (the total app goes from ~50 MB to ~150 MB).

If any of this bothers you: use Qwen 3.5 instead (pure MLX), or wait for the upstream fixes from Google :)

Otherwise, for the mid-term I plan to switch to a potentially slower, but safer, MLX version for Gemma4 (hopefully on the weekend).
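
On the long-context resets mentioned in the P.S.: one common alternative to a full reset is to drop only the oldest turns once a rough token budget is exceeded. The sketch below is purely illustrative and is not how Hitoku Draft works. The `RollingContext` class, its `max_tokens` parameter, and the word-count token estimate are all assumptions made for the example; a real app would count tokens with the model's own tokenizer.

```python
from collections import deque


class RollingContext:
    """Keep recent conversation turns under a rough token budget.

    Hypothetical helper for illustration; not part of Hitoku Draft.
    """

    def __init__(self, max_tokens: int = 2048):
        self.max_tokens = max_tokens
        self.turns = deque()  # (text, estimated_tokens) pairs, oldest first
        self.total = 0

    @staticmethod
    def estimate_tokens(text: str) -> int:
        # Crude assumption: one token per whitespace-separated word.
        return len(text.split())

    def add(self, text: str) -> None:
        cost = self.estimate_tokens(text)
        self.turns.append((text, cost))
        self.total += cost
        # Evict the oldest turns until the budget is met,
        # but always keep the most recent turn.
        while self.total > self.max_tokens and len(self.turns) > 1:
            _, dropped = self.turns.popleft()
            self.total -= dropped

    def prompt(self) -> str:
        # Join the surviving turns into the text sent to the model.
        return "\n".join(text for text, _ in self.turns)


ctx = RollingContext(max_tokens=8)
ctx.add("summarize this pdf please")  # estimated at 4 tokens
ctx.add("now draft a short reply")    # estimated at 5 tokens, over budget
print(ctx.prompt())                   # only the newest turn remains
```

Whether trimming like this actually reduces hallucinations would need testing against the model; it only bounds how much history is fed back in.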

Show HN: Ithihāsas – a character explorer for Hindu epics, built in a few hours

https://www.ithihasas.in
137•cvrajeesh•10h ago•33 comments

Show HN: Deflect One – command line dashboard for managing Linux servers via SSH

https://github.com/Frytskyy/deflect-one
2•whitemanv•1h ago•1 comments

Show HN: Continual Learning with .md

https://github.com/SunAndClouds/ReadMe
23•wenhan_zhou•9h ago•16 comments

Show HN: Prmana – OIDC SSH Login for Linux with DPoP (Rust, Apache 2.0)

https://github.com/prodnull/prmana
3•cbchhaya•3h ago•0 comments

Show HN: boringBar – a taskbar-style dock replacement for macOS

https://boringbar.app/
503•a-ve•1d ago•287 comments

Show HN: Excalicharts – Charting Library for Excalidraw

https://github.com/tombedor/excalicharts
4•jjfoooo4•5h ago•0 comments

Show HN: Mcptube – Karpathy's LLM Wiki idea applied to YouTube videos

https://github.com/0xchamin/mcptube
7•0xchamin•13h ago•0 comments

Show HN: Lythonic – Compose Python functions into data-flow pipelines

https://github.com/walnutgeek/lythonic
4•walnutgeek•9h ago•0 comments

Show HN: I built a social media management tool in 3 weeks with Claude and Codex

https://github.com/brightbeanxyz/brightbean-studio
177•JanSchu•20h ago•120 comments

Show HN: OQP – A verification protocol for AI agents

https://github.com/OranproAi/open-qa-protocol
5•Aamir21•7h ago•1 comments

Show HN: Oberon System 3 runs natively on Raspberry Pi 3 (with ready SD card)

https://github.com/rochus-keller/OberonSystem3Native/releases
237•Rochus•1d ago•93 comments

Show HN: Claudraband – Claude Code for the Power User

https://github.com/halfwhey/claudraband
117•halfwhey•1d ago•42 comments

Show HN: Equirect – a Rust VR video player

https://github.com/greggman/equirect
11•greggman65•21h ago•1 comments

Show HN: Farchive – SQLite-backed history-preserving compressed archive

https://github.com/eliask/farchive
4•ekns•13h ago•0 comments

Show HN: Deconflict – Open-source WiFi planner with physics-based walls

https://deconflict.pages.dev
3•s_e__a___n•9h ago•1 comments

Show HN: Pardonned.com – A searchable database of US Pardons

496•vidluther•2d ago•272 comments

Show HN: I benchmarked Gemma 4 E2B – the 2B model beat the 12B on multi-turn

https://aiexplr.com/post/gemma-4-e2b-benchmark
6•mailharishin•10h ago•0 comments

Show HN: GDL – I built an AI-powered invention engine

https://groundeddiscoverylabs.com/
2•Whyachi•10h ago•0 comments

Show HN: Lint-AI by RooAGI, a Rust CLI for AI Doc Retrieval

https://github.com/RooAGI/Lint-AI
3•rooagi•10h ago•0 comments

Show HN: Encrypted, nothing stored, nothing repeated face-gated asset sharing

https://veylt.net/
4•bekkerza•11h ago•0 comments

Show HN: pg_grpc – Call gRPC services directly from PostgreSQL

https://github.com/CSenshi/pg_grpc
7•csenshi•12h ago•1 comments

Show HN: Asthi – Damn good asset tracker

https://www.asthi.app/
3•suvamsh•13h ago•0 comments

Show HN: Bloomberg Terminal for LLM ops – free and open source

7•amans9712•13h ago•1 comments

Show HN: Aeolus – a library for unified access to air quality sensor networks

https://github.com/southlondonscientific/aeolus
3•ruaraidh•13h ago•0 comments

Show HN: Dbg – One CLI debugger for every language (AI-agent ready)

https://redknightlois.github.io/dbg/
4•redknight666•13h ago•0 comments

Show HN: FluidCAD – Parametric CAD with JavaScript

https://fluidcad.io/
155•maouida•3d ago•37 comments

Show HN: 15 yrs of Django in prod: patterns I keep using (agent skills)

https://github.com/dvf/opinionated-django
6•vanflymen•14h ago•1 comments

Show HN: Rekal – Long-term memory for LLMs in a single SQLite file

https://github.com/janbjorge/rekal
7•jeeybee•1d ago•8 comments

Show HN: Crafto – AI carousel post generator for LinkedIn and Instagram

https://crafto.ai
4•xz_tie•15h ago•1 comments