Show HN: Sentient OS – On-device intelligence layer for your entire digital life

https://sentient-os.ai
2•TechExpert2910•1h ago
Hi HN :D I'm 20 and I spent a year building something that shouldn't be possible: a custom on-device vision LLM that processes your entire digital life overnight on a phone.

We all have thousands of buried screenshots, notes, files, bookmarks, saved posts, etc. that we'll never find again. The only way to make AI understand all of it is to upload everything to the cloud -- a privacy nightmare, and way too expensive at scale. And it shouldn't be possible on-device either: small models are too dumb, and phones are too slow for thousands of LLM inference runs.

So I spent a year deeply optimizing every layer of the on-device inference stack to make it possible anyway.

Sentient OS runs a custom multimodal vision LLM on your phone and laptop while they charge overnight. It understands your entire digital life -- every screenshot, note, file, email, bookmark, plus integrations for external services -- with nothing ever leaving your device.

This gives you three things that weren't possible before:

-> Talk to your entire digital life in natural language: "what was that wine I liked?" / "who did I wanna meet next week?" [on-device RAG]. And with MCP, your existing LLM (ChatGPT, Claude, etc.) can talk to your digital life too -- so it actually understands you.

-> Proactive reminders surfaced from your own data: "that tax return in your Downloads is due next week" / "tickets for that concert you screenshotted open tomorrow"

-> Knowledge graphs of your entire digital life: tap any node to find what you buried!

Here's what I had to build to make this possible:

Inference speed:

- KV cache reuse: the system prompt + few-shot examples are identical across all 3,000 analysis calls. I run inference on that prefix once, cache the KV state, and reuse it for every image. Prefill drops to just processing the image itself.

- Thermal-aware scheduling: I throttle the moment iOS reports thermal state > fair. I have all night, so I trade speed for not cooking the device.

- iOS jetsam awareness: iOS kills apps above a specific RAM threshold. I profiled that threshold across different iPhones and push right up to the edge.
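To make the first two bullets concrete, here is a minimal sketch of an overnight batch loop that pays for the shared prefix once and pauses while the device runs hot. It is not the author's code: `ToyModel`, `PrefixCache`, `run_batch`, and the `cooldown_s` value are hypothetical names and numbers chosen for illustration, and the toy model only counts tokens instead of producing real KV tensors. The `ThermalState` enum mirrors the ordering of the four iOS thermal states (nominal, fair, serious, critical).

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Callable, List, Optional, Sequence, Tuple


class ThermalState(IntEnum):
    # Same ordering as the iOS thermal states: nominal < fair < serious < critical.
    NOMINAL = 0
    FAIR = 1
    SERIOUS = 2
    CRITICAL = 3


@dataclass
class PrefixCache:
    tokens: Tuple[str, ...]  # tokens already processed
    state: int               # stand-in for the cached KV tensors


class ToyModel:
    """Hypothetical stand-in for an LLM: it only counts prefilled tokens."""

    def __init__(self) -> None:
        self.prefilled_tokens = 0

    def prefill(self, tokens: Sequence[str],
                past: Optional[PrefixCache] = None) -> PrefixCache:
        # Only the tokens passed in are "computed"; anything in `past` is reused.
        self.prefilled_tokens += len(tokens)
        prior = past.tokens if past is not None else ()
        combined = prior + tuple(tokens)
        return PrefixCache(tokens=combined, state=len(combined))


def run_batch(model: ToyModel,
              prefix_tokens: Sequence[str],
              items: Sequence[Sequence[str]],
              read_thermal: Callable[[], ThermalState],
              sleep: Callable[[float], None],
              cooldown_s: float = 30.0) -> List[int]:
    # Process the system prompt + few-shot examples once for the whole batch.
    prefix = model.prefill(prefix_tokens)
    results: List[int] = []
    for item_tokens in items:
        # Wait while the device reports anything hotter than FAIR.
        while read_thermal() > ThermalState.FAIR:
            sleep(cooldown_s)
        # Each item only pays for its own tokens on top of the cached prefix.
        results.append(model.prefill(item_tokens, past=prefix).state)
    return results
```

With a 100-token prefix and three 10-token items, the toy model prefills 130 tokens in total, versus 330 if the prefix were recomputed per item. In a real app, `read_thermal` would wrap the platform's thermal state API and `sleep` would be `time.sleep` or a scheduler yield.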

Model quality at small size:

- Vision transplant: a 2B Qwen model has terrible vision. I transplanted Qwen 3.5 9B's multimodal projector onto the 2B base. Same architecture family makes this possible.

- Selective quantization on MLX: MLX doesn't support k-quant style mixed precision. I built it manually: less quantization on first/last layers and high-activation layers, more aggressive on the rest.
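As a rough illustration of the selective quantization idea, the sketch below builds a per-layer bit-width plan: the first and last layers and the layers with the highest activation scores keep more bits, and everything else is quantized harder. This is an assumption-laden sketch, not the author's recipe: `plan_bits`, `average_bits`, the 8-bit/4-bit split, the 25% "sensitive" fraction, and the idea of a single activation score per layer are all hypothetical. It deliberately stops at producing a plan and does not call any MLX API.

```python
from typing import Dict, List


def plan_bits(activation_scores: List[float],
              high_bits: int = 8,
              low_bits: int = 4,
              edge_layers: int = 1,
              sensitive_fraction: float = 0.25) -> Dict[int, int]:
    """Map each layer index to a bit width (all thresholds are assumptions)."""
    n = len(activation_scores)
    plan = {i: low_bits for i in range(n)}

    # Keep the first and last `edge_layers` layers at higher precision.
    for i in range(min(edge_layers, n)):
        plan[i] = high_bits
        plan[n - 1 - i] = high_bits

    # Among the remaining layers, protect the ones with the largest activations.
    interior = [i for i in range(n) if plan[i] == low_bits]
    k = int(len(interior) * sensitive_fraction)
    by_score = sorted(interior, key=lambda i: activation_scores[i], reverse=True)
    for i in by_score[:k]:
        plan[i] = high_bits
    return plan


def average_bits(plan: Dict[int, int]) -> float:
    # Mean bits per layer, assuming all layers are the same size.
    return sum(plan.values()) / len(plan)


if __name__ == "__main__":
    # Made-up scores for a 10-layer model; layers 2 and 6 stand out.
    scores = [0.1, 0.2, 9.0, 0.3, 0.2, 0.1, 5.0, 0.1, 0.2, 0.1]
    plan = plan_bits(scores)
    print(plan, average_bits(plan))
```

A plan like this would then be handed to whatever per-layer quantization hook the framework exposes; how the activation scores are measured (for example, on a small calibration set) is left open here.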

The alpha processes ~3,000 screenshots entirely on-device on a 6-year-old iPhone. Coming to Mac and iPhone!

Previously I researched Apple's neural accelerators: https://www.reddit.com/r/LocalLLaMA/comments/1ohrn20/

And I love OSS! I built https://github.com/theJayTea/WritingTools (2K+ stars, ~30 press features). I'm considering making Sentient OS OSS under AGPL (so no one else can profit off of my work haha).

I think this is one of the coolest consumer use cases for on-device LLMs. I'd love to hear what you all think, and I'm happy to answer any questions (I love geeking out about the deep work that's gone into optimizing models and inference!) :D

Comments

GoldenGate67•1h ago
Interesting, but how are you going to monetise this? I wouldn’t trust a completely free service to analyse all my data because then I’m the product.
TechExpert2910•1h ago
Hey! Really great question. My plan is to charge a subscription of ~$2 a month if you'd like to analyze more than the last 6 months’ worth of your data (I'll also offer a one-time lifetime license option!).

And I can let you use it on this 6-month window for free because it costs me nothing per user! Your own device does all the processing :D

Show HN: Virshle – Make ordinary virtual machines

https://virshle.crocuda.com/
1•jean_dupont•7m ago•0 comments

Why North Korea Will Never Give Up Its Nuclear Weapons

https://spectator.clingendael.org/en/publication/why-north-korea-will-never-give-its-nuclear-weapons
1•ck45•8m ago•1 comment

The Advance of Heliotherapy – Nature

https://www.nature.com/articles/109663a0?twclid=24ras4wi5qjw2lcvz2mgnnna5z
2•bilsbie•8m ago•0 comments

Emergent swarm bio structure – mark 36

https://rogmash.neocities.org/
1•rogmash•9m ago•0 comments

I Do Not Recommend Bitwarden

https://xn--gckvb8fzb.com/i-do-not-recommend-bitwarden/
3•maxloh•9m ago•0 comments

English councils to trial Google AI tool to speed up planning decisions

https://www.ft.com/content/91ce4475-d325-4d65-babb-4214996bc0f6
1•mmarian•14m ago•1 comment

VS Code inserting 'Co-Authored-by Copilot' into commits regardless of usage

https://github.com/microsoft/vscode/pull/310226
3•indrora•14m ago•0 comments

Museum Speelklok

https://www.museumspeelklok.nl/en/
2•whtspc64•20m ago•0 comments

Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

https://github.com/facebookresearch/tuna-2
1•neehao•21m ago•0 comments

Azthengar Build 2026.05.01

https://azthengar.itch.io/azthengar-build-20260501
1•RobotHouse•22m ago•0 comments

Show HN: Fabrica – A minimal terminal-based coding agent built in Rust

https://github.com/Endi1/fabrica
1•PestoDiRucola•22m ago•0 comments

Inside the Courtroom at the OpenAI Trial

https://www.nytimes.com/2026/04/30/insider/times-inside-openai-musk-trial.html
1•bookofjoe•23m ago•1 comment

When Dawkins Met Claude

https://unherd.com/2026/04/is-ai-the-next-phase-of-evolution/
3•Anon84•25m ago•0 comments

Where to buy a non-Apple, non-Google smartphone

https://www.theregister.com/2026/05/01/buy_a_foss_fondleslab/
1•u1hcw9nx•27m ago•0 comments

Meta buys robotics startup to bolster its humanoid AI ambitions

https://techcrunch.com/2026/05/01/meta-buys-robotics-startup-to-bolster-its-humanoid-ai-ambitions/
4•Brajeshwar•27m ago•0 comments

One photographer's passion project of capturing local newsrooms

https://apnews.com/article/newspapers-newsrooms-photographing-media-f0d0939e04bb66f8d340f6f43df5bf5e
1•thm•28m ago•0 comments

Anthropic potential $900B+ valuation round could happen within 2 weeks

https://techcrunch.com/2026/04/30/anthropic-potential-900b-valuation-round-could-happen-within-tw...
4•wslh•28m ago•0 comments

Ubuntu is adding Generative AI features, and other Linux distros might follow

https://www.neowin.net/news/ubuntu-is-going-all-in-on-generative-ai-and-other-linux-distros-might...
2•bundie•30m ago•0 comments

Man takes robot to airport, gets ticket for it, it gets on plane

https://twitter.com/scottbudman/status/2050321131467653197
1•fragmede•32m ago•0 comments

IPv8 will not work [video]

https://www.youtube.com/watch?v=Uyr8homutII
1•drewr•35m ago•0 comments

Menace on the Streets

https://macleans.ca/longforms/menace-on-the-streets/
1•herbertl•37m ago•0 comments

You'll Subscribe to Your Router, Too – The First Router Bribes Are Here [video]

https://www.youtube.com/watch?v=qjesRkTFZqY
1•01-_-•38m ago•0 comments

Ember 6.12

https://blog.emberjs.com/ember-released-6-12/
2•satvikpendem•40m ago•0 comments

Vibe code with me this Linux system on a browser tab

https://linuxontab.com/
3•kilian-ai•41m ago•1 comment

New Oscars rules: No AI actors, human-written scripts only

https://www.dw.com/en/new-oscars-rules-exclude-ai-performers-require-scripts-written-by-human/a-7...
2•qainsights•42m ago•0 comments

Jonathan Swift's Last Joke

https://www.newyorker.com/culture/the-weekend-essay/jonathan-swifts-last-joke
1•samizdis•44m ago•0 comments

South Africa withdraws AI policy due to fake AI-generated sources

https://www.reuters.com/world/africa/south-africa-withdraws-ai-policy-due-fake-ai-generated-sourc...
4•gnabgib•49m ago•0 comments

The Man Who Built NVIDIA [audio]

https://www.econtalk.org/the-man-who-built-nvidia-with-stephen-witt/
1•mooreds•50m ago•0 comments

Hilariously Useless: Mahalo's Guide to Playing the Xylophone (2011)

https://www.businessinsider.com/check-out-mahalos-hilariously-useless-guide-to-playing-the-xyloph...
1•mooreds•50m ago•1 comment

A terminal Markdown viewer built using Charm libraries

https://github.com/inkcheck/ink
1•geordee•51m ago•0 comments