These models are only so useful in a multi-turn conversation, but it's still interesting to see what you can pack into a <250 MB model.
I tried ONNX versions earlier, but running language models through them came with too many quirks, and the tokens per second (TPS) weren't impressive. Inspired by svenflow/webgpu-gemma, I put Codex and Claude to the task of writing WGSL to run inference on GGUF versions of these models.
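To give a feel for what those kernels involve, here's a minimal sketch (not this project's actual shader) of a WGSL matrix-vector product over Q8_0-quantized weights, embedded the usual way as a string in TypeScript. It assumes a hypothetical loader that has already repacked each Q8_0 block (an f16 scale plus 32 int8 weights) into a flat f32 scale array and int8s packed four to a u32; the dimensions and names here are made up.

```ts
// Hedged sketch: one possible WGSL matvec kernel over Q8_0 weights.
// Assumes the GGUF loader repacked each Q8_0 block (f16 scale + 32 int8
// weights) into a plain f32 `scales` array plus int8s packed 4-per-u32.
const matvecQ8 = /* wgsl */ `
  @group(0) @binding(0) var<storage, read> scales : array<f32>;   // one per 32-weight block
  @group(0) @binding(1) var<storage, read> quants : array<u32>;   // 4 x int8 per element
  @group(0) @binding(2) var<storage, read> x      : array<f32>;   // input activations
  @group(0) @binding(3) var<storage, read_write> y : array<f32>;  // output

  const N_ROWS : u32 = 2048u;  // hypothetical layer dimensions
  const N_COLS : u32 = 2048u;

  @compute @workgroup_size(64)
  fn main(@builtin(global_invocation_id) gid : vec3<u32>) {
    let row = gid.x;
    if (row >= N_ROWS) { return; }
    var acc = 0.0;
    // Walk the row four weights at a time (one packed u32 per step).
    for (var col = 0u; col < N_COLS; col += 4u) {
      let idx = row * N_COLS + col;
      let scale = scales[idx / 32u];
      let packed = bitcast<i32>(quants[idx / 4u]);
      for (var k = 0u; k < 4u; k++) {
        // extractBits sign-extends the 8-bit weight.
        let w = f32(extractBits(packed, 8u * k, 8u));
        acc += scale * w * x[col + k];
      }
    }
    y[row] = acc;
  }
`;

// Dispatched as one thread per output row, e.g.:
//   pass.dispatchWorkgroups(Math.ceil(nRows / 64));
```

One thread per output row keeps the sketch readable; real kernels tile, vectorize, and fuse far more aggressively than this.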
Once you've loaded this website and a model, both should keep loading offline until your browser evicts the model from its cache.
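One standard way to get that behavior (not necessarily exactly what this site does) is the browser's Cache API: download the GGUF once, store the response, and serve it from the cache on later visits. Making the page shell itself load offline additionally takes a service worker. A minimal sketch of the pattern, with a hypothetical model URL and cache name:

```ts
// Hedged sketch: caching a model file with the browser Cache API so
// later visits work offline. URL and cache name are hypothetical.
const MODEL_URL = "https://example.com/models/tiny.gguf";

async function loadModelBytes(): Promise<ArrayBuffer> {
  const cache = await caches.open("model-cache-v1");

  // Serve from the cache when we have it (including fully offline).
  const hit = await cache.match(MODEL_URL);
  if (hit) return hit.arrayBuffer();

  // First visit: download once and store a copy for next time.
  // The browser may still evict this cache under storage pressure.
  const res = await fetch(MODEL_URL);
  await cache.put(MODEL_URL, res.clone());
  return res.arrayBuffer();
}
```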