A year ago, I shared a simple demo of running LLMs locally in a Chrome extension. Today, I’m excited to share TinyWhale, a monorepo that lets you run Qwen 3.5 0.8B entirely on-device, both as a web application and a Chrome extension. I plan to support mobile and desktop apps in the same repo.