frontpage.

Hey HN,

I'm a 20yo solo builder from India,

I got frustrated that every capable AI model assumes you have a GPU, a credit card, or reliable internet. None of those are true for most of the world — including me.

So I started digging into the compression literature and ways through which i can solve this problem

What I found: - DeepSeek distilled 671B reasoning into 1.5B that runs on a laptop - TRM (Samsung, 2025) beat DeepSeek R1 on ARC-AGI with 7M parameters by iterating instead of scaling - RWKV runs in constant memory with no quadratic attention cost - GRPO lets you specialize a tiny model on a narrow domain in hours on CPU

The techniques exist. What doesn't exist: a systematic effort to apply all of them together, specifically for low-resource languages and low-end hardware, and give the results away free.

I'm building this. Calling it KIRO.

The goal is simple: take every major open source frontier model, compress it into domain-specific versions under 500MB, and deploy them offline on the cheapest Android hardware available.

Starting with math/physics education because that's the problem I know personally. Expanding to healthcare triage, legal aid, and agricultural advisory.

Currently running my first experiment on my i3 — R1-1.5B vs Qwen-7B on Hindi math problems. Will post results when training finishes.

Two honest questions for HN:

1. Is anyone else working on this specific intersection — compression + low-resource languages + offline deployment?

2. What would make this genuinely useful vs just technically interesting to you?

Everything will be open source.

Hackerbot-Claw: An AI-Powered Bot Actively Exploiting GitHub Actions

Show HN: The OpenRouter for OpenClaw – Shellrouter.com

The 1,700-year-old megastructure history almost forgot

HyperCard Changed Everything [video]

Show HN: PinBoard – Desktop app to bulk download Pinterest boards locally

Show HN: Roast My Website – AI roasts any website's design (free, built on n8n)

SQLite – Professional Support and Extension Products

How We Built a Distributed Work Scheduling System for Pulumi Cloud

AWS Middle East disrupted after 'objects struck datacenter' amid Iran war

SRGB↔XYZ Conversion (2021)

Show HN: Audio-to-Video with LTX-2

Show HN: L-Rep:Geometry engine,GA multivectors,dynamic trees in a single integer

An OpenClaw agent that blogs 24/7 and builds its own host

Strix – First Impressions

Show HN: Steward – an ambient agent that handles low-risk work

An interactive intro to Elliptic Curve Cryptography

What is a file format? An interactive guide

Agent Stats

Poisoned Wells – Examining the Scale of DNS Censorship in India

Fan-made successor to Command and Conquer: Renegade hit 1.0: 12 years of beta

Synesthesia

Why Should I Learn Spanish Now That AI Exists?

Show HN: Vibe code your scientific papers with an assistant I built(open source)

Show HN: I built an AI tool that walks you through Toyota's 5 Whys method

Evolving descriptive text of mental content from human brain activity

Show HN: LetItSimmer – Recipes that evolve based on real cooking feedback

Agent with decisions memory at its core

Model Collapse Ends AI Hype

Show HN: Clenv – Manage multiple Claude Code profiles, each Git-versioned

Metaphors Ubiquitous in Computer and Internet Terminologies [pdf]

Is anyone compressing AI models for the 4B people without GPUs or internet?