It's great seeing the on-device AI community light up around this release. Off Grid brings it to Android: it runs on phones with 6GB of RAM in the $200-300 range at ~8 tok/sec on the 2B model. Fully offline.
Text generation, vision AI, image gen, voice transcription, tool calling, document analysis — all on-device, nothing uploaded, ever. Works in airplane mode.
780+ GitHub stars. ~2,000 downloads across Android and iOS. Early days.
GitHub: https://github.com/alichherawalla/off-grid-mobile-ai
Play Store: https://play.google.com/store/apps/details?id=ai.offgridmobi...
App Store: https://apps.apple.com/us/app/off-grid-local-ai/id6759299882
ali_chherawalla•2h ago
I've documented everything here: https://github.com/alichherawalla/off-grid-mobile-ai/blob/ma...
llama.cpp compiled as a native Android library via the NDK, linked into React Native through a custom JSI bridge. GGUF models loaded straight into memory. On Snapdragon devices we use QNN (Qualcomm Neural Network) for hardware acceleration. OpenCL GPU fallback on everything else. CPU-only as a last resort.
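The QNN → OpenCL → CPU fallback order described above can be sketched as a small selection function. This is a sketch only; `DeviceInfo` and `pickBackend` are hypothetical names for illustration, not the app's actual API.

```typescript
type Backend = "qnn" | "opencl" | "cpu";

// Hypothetical device descriptor; the real detection would query the SoC
// vendor and GPU driver capabilities through native code.
interface DeviceInfo {
  socVendor: string;   // e.g. "qualcomm", "mediatek"
  hasOpenCL: boolean;  // GPU driver exposes a usable OpenCL runtime
}

function pickBackend(d: DeviceInfo): Backend {
  if (d.socVendor === "qualcomm") return "qnn"; // NPU acceleration on Snapdragon
  if (d.hasOpenCL) return "opencl";             // GPU fallback everywhere else
  return "cpu";                                 // last resort
}
```

The point of keeping this as an ordered cascade is that inference always has somewhere to land: every device can at least run the CPU path, and faster backends are opportunistic upgrades.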
Image gen is Stable Diffusion running on the NPU where available. Vision uses SmolVLM and Qwen3-VL. Voice is on-device Whisper.
The model browser filters by your device's RAM so you never download something your phone can't run. The whole thing is MIT licensed. Happy to answer anything about the architecture.
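The RAM gate amounts to: estimate a model's runtime footprint from its file size plus inference overhead, and hide anything over a budget derived from device RAM. A minimal sketch under assumed numbers; the function names, the 25% overhead factor, and the 60% usable-RAM factor are illustrative, not the app's actual values.

```typescript
interface ModelEntry {
  name: string;
  fileSizeGB: number; // GGUF file size on disk
}

// Assumption: runtime footprint ≈ file size + ~25% for KV cache and
// activations, and only ~60% of device RAM is safely usable by one app.
function runnableModels(models: ModelEntry[], deviceRamGB: number): ModelEntry[] {
  const budgetGB = deviceRamGB * 0.6;
  return models.filter((m) => m.fileSizeGB * 1.25 <= budgetGB);
}
```

On a 6GB phone this budget comes out to ~3.6GB, so a ~2GB quantized 2B model passes while a ~4GB 7B model is filtered out before the user ever downloads it.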