- Foundation Models (macOS 26) - a ~3B-parameter on-device LLM with a full API: streaming, structured output, tool use. No API key, no cloud call, no per-token cost.
- NLContextualEmbedding (Natural Language framework, macOS 14+) - a BERT-style 512-dimension text embedder. Essentially the capability OpenAI and Cohere sell as a service, sitting in Apple's SDKs since iOS 17.
- SFSpeechRecognizer / SpeechAnalyzer - on-device speech-to-text, including live dictation, with solid accuracy on Apple Silicon.
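If you haven't touched NLContextualEmbedding, here is roughly what turning a text chunk into a single 512-dim vector looks like - a rough sketch, not cyberWriter's actual code; the `embedChunk` helper is mine, the NLContextualEmbedding calls are the framework's:

```swift
import NaturalLanguage

// Roughly how a chunk becomes a 512-dim vector: embed tokens, then mean-pool.
func embedChunk(_ text: String) throws -> [Double]? {
    guard let embedding = NLContextualEmbedding(language: .english) else { return nil }

    // Model assets download on demand the first time; in a real app you'd call
    // embedding.requestAssets(...) and wait before continuing.
    guard embedding.hasAvailableAssets else { return nil }

    try embedding.load()
    let result = try embedding.embeddingResult(for: text, language: .english)

    var pooled = [Double](repeating: 0, count: embedding.dimension)
    var tokenCount = 0
    result.enumerateTokenVectors(in: text.startIndex..<text.endIndex) { vector, _ in
        for (i, value) in vector.enumerated() { pooled[i] += value }
        tokenCount += 1
        return true  // keep enumerating
    }
    guard tokenCount > 0 else { return nil }
    return pooled.map { $0 / Double(tokenCount) }  // mean pooling
}
```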
I built cyberWriter, a Markdown editor, on top of all three, partly as a test and showcase of what the stack can do. I integrated local and cloud AI first; when Apple shipped the foundation model it stacked on easily, and now users with no local-model or API knowledge can turn on AI with a click or two. The real motivation, though, is that most Markdown editors lean on plugins that run with full system access, and I work with health data, so that isn't an option.
Vault chat / semantic search. The app indexes your Markdown folder via NLContextualEmbedding (around 50 seconds for 1000 chunks on an M1). The search bar gets a "Related Ideas" section that matches by meaning - typing "orbital mechanics" surfaces notes about rockets and launch windows even when those exact words never appear. Ask the AI a question and it retrieves the top 5 chunks as context. Plain RAG, but the embedder, retrieval, chat model, and search all run locally.
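Retrieval itself is nothing fancy at this scale - a cosine-similarity scan over the stored vectors is plenty. A simplified sketch; the `IndexedChunk` type is illustrative, not the app's real index record:

```swift
// Hypothetical chunk record; the real index stores more metadata (offsets, hashes, etc.).
struct IndexedChunk {
    let file: String
    let text: String
    let vector: [Double]
}

func cosineSimilarity(_ a: [Double], _ b: [Double]) -> Double {
    var dot = 0.0, normA = 0.0, normB = 0.0
    for i in 0..<min(a.count, b.count) {
        dot += a[i] * b[i]
        normA += a[i] * a[i]
        normB += b[i] * b[i]
    }
    let denom = normA.squareRoot() * normB.squareRoot()
    return denom > 0 ? dot / denom : 0
}

// Top-k chunks for a query vector; k = 5 for chat context.
func topChunks(for query: [Double], in index: [IndexedChunk], k: Int = 5) -> [IndexedChunk] {
    index
        .map { (chunk: $0, score: cosineSimilarity(query, $0.vector)) }
        .sorted { $0.score > $1.score }
        .prefix(k)
        .map { $0.chunk }
}
```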
AI Workspace. Command+Shift+A opens a chat panel; Command+J triggers inline quick actions (rewrite, summarize, change tone, fix grammar, continue). Apple Intelligence is the default; Claude, OpenAI, Ollama, and LM Studio all work if you prefer. The same context layer - document selection, attached files, retrieved vault chunks - feeds every provider through the same system-message path. Because the vault context is file- and filename-aware, the assistant can create backlinks to the referenced note when it writes or edits a doc for you.
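"Same system-message path" concretely means the context block is assembled once and handed to whichever provider is active. A simplified version of that assembly; the names here (`WorkspaceContext`, `systemMessage`) are illustrative, not the app's actual types:

```swift
// Hypothetical context container; the real one carries more metadata per item.
struct WorkspaceContext {
    var selection: String?
    var attachedFiles: [(name: String, contents: String)] = []
    var vaultChunks: [(file: String, text: String)] = []
}

// One system message for every provider: Apple Intelligence, Claude, OpenAI, Ollama, LM Studio.
func systemMessage(for context: WorkspaceContext) -> String {
    var parts: [String] = ["You are a writing assistant inside a Markdown editor."]
    if let selection = context.selection {
        parts.append("Current selection:\n\(selection)")
    }
    for file in context.attachedFiles {
        parts.append("Attached file \(file.name):\n\(file.contents)")
    }
    for chunk in context.vaultChunks {
        // Filenames are included so the model can emit [[backlinks]] to the source note.
        parts.append("From vault note \(chunk.file):\n\(chunk.text)")
    }
    return parts.joined(separator: "\n\n")
}
```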
Voice notes and dictation. Record a voice note directly into your doc, transcribe it with SpeechAnalyzer, or just dictate into the editor while you think. Audio never leaves the Mac.
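The SpeechAnalyzer path is the newer API; the SFSpeechRecognizer version of transcribing a recorded note looks roughly like this (sketch with error handling trimmed; `transcribeVoiceNote` is an illustrative helper, the Speech framework calls are real):

```swift
import Speech

// Transcribe a recorded voice note from a file URL, entirely on-device.
// Assumes speech recognition authorization has already been granted.
func transcribeVoiceNote(at url: URL, completion: @escaping (String?) -> Void) {
    guard let recognizer = SFSpeechRecognizer(locale: Locale(identifier: "en-US")),
          recognizer.supportsOnDeviceRecognition else {
        completion(nil)
        return
    }

    let request = SFSpeechURLRecognitionRequest(url: url)
    request.requiresOnDeviceRecognition = true  // audio never ships to Apple's servers

    recognizer.recognitionTask(with: request) { result, error in
        guard let result, error == nil else { return }
        if result.isFinal {
            completion(result.bestTranscription.formattedString)
        }
    }
}
```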
The privacy story is straightforward because the primitives are already private. Vectors live in a `.vault.embeddings.json` file next to your vault, never sent anywhere. If you use Apple Intelligence, even the retrieved text stays on-device. For cloud models there is a clear toggle and an inline warning before any filenames or snippets leave the machine.
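The index file is plain Codable JSON sitting next to the vault, so it's inspectable and deletable like any other file. A hypothetical shape - the real schema differs in detail:

```swift
import Foundation

// Hypothetical on-disk shape of .vault.embeddings.json; illustrative only.
struct VaultEmbeddingsFile: Codable {
    struct Entry: Codable {
        let file: String        // relative path within the vault
        let chunkIndex: Int
        let text: String
        let vector: [Double]    // 512-dim NLContextualEmbedding output
    }
    let modelIdentifier: String
    let entries: [Entry]
}

func saveIndex(_ index: VaultEmbeddingsFile, to url: URL) throws {
    let data = try JSONEncoder().encode(index)
    try data.write(to: url, options: .atomic)  // stays on disk, never uploaded
}
```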
Honest limitations:
- 512-dimension embeddings are solid mid-tier. A larger cloud embedder (OpenAI- or Cohere-class) will catch subtler relationships this one misses.
- 256-token chunks can split long paragraphs mid-argument.
- Foundation Models caps its context window at roughly 6K characters, so vault context is budgeted to 3K, with truncation markers on whatever gets cut (see the sketch after this list).
- Multilingual support is English-only right now. NLContextualEmbedding has Latin, Cyrillic, and CJK model variants; wiring the language detector across chunks is Phase 2.
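The budgeting is simple character accounting. A simplified version of the truncation logic - a hypothetical helper, not the app's exact code:

```swift
// Fit retrieved chunks into a fixed character budget, marking anything that gets cut.
func budgetedVaultContext(_ chunks: [String], budget: Int = 3_000) -> String {
    var remaining = budget
    var parts: [String] = []
    for chunk in chunks {
        if chunk.count <= remaining {
            parts.append(chunk)
            remaining -= chunk.count
        } else if remaining > 0 {
            parts.append(String(chunk.prefix(remaining)) + "\n[...truncated...]")
            remaining = 0
        } else {
            parts.append("[...chunk omitted: context budget reached...]")
        }
    }
    return parts.joined(separator: "\n\n")
}
```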
The developer experience for these APIs is genuinely good. Foundation Models streams cleanly, and NLContextualEmbedding downloads its assets on demand and gives you mean-poolable token vectors in a handful of lines. Curious what others here are building on this stack - it feels like low-hanging fruit that has been sitting there for a while.
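"Streams cleanly" means the whole chat loop is just an async sequence. Roughly, as of the current FoundationModels surface - the exact element type of the stream may differ by SDK version, so treat this as a sketch:

```swift
import FoundationModels

// Stream a response from the on-device model; each element is the partial output so far.
@available(macOS 26.0, *)
func streamReply(to prompt: String, system: String) async throws {
    let session = LanguageModelSession(instructions: system)
    let stream = session.streamResponse(to: prompt)
    for try await partial in stream {
        // Update the chat panel with the latest snapshot of the response.
        print(partial)
    }
}
```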