When you open the site, you'll hear it immediately — the landing page auto-generates speech from three different sentences right in your browser, no setup required.
You can then try any model yourself: type text, hit generate, hear it instantly. Models download once and get cached locally.
The most experimental feature is a fully in-browser Voice Agent. It chains speech-to-text → LLM → text-to-speech, all running locally on your GPU via WebGPU, so once the models are cached you can have a spoken conversation with an AI without a single network request.
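The chaining itself is just function composition over the three stages. Here is a minimal sketch of one agent turn with stubbed stages; in the real app each stage would wrap a local model (e.g. a Whisper pipeline for STT, a small chat model, Kokoro for TTS), and the names below are illustrative, not the project's actual API.

```javascript
// One turn of the agent loop: speech -> text -> text -> speech.
// The three stages are plain async functions so the wiring is clear;
// swap in real in-browser model calls for each stub.
async function runAgentTurn(audioIn, { stt, llm, tts }) {
  const transcript = await stt(audioIn); // speech-to-text
  const reply = await llm(transcript);   // text-to-text
  const audioOut = await tts(reply);     // text-to-speech
  return { transcript, reply, audioOut };
}

// Stub stages standing in for the local models:
const stt = async (audio) => `heard ${audio.length} samples`;
const llm = async (text) => `echo: ${text}`;
const tts = async (text) => new Float32Array(text.length); // fake waveform

(async () => {
  const turn = await runAgentTurn(new Float32Array(16000), { stt, llm, tts });
  console.log(turn.reply); // "echo: heard 16000 samples"
})();
```

Because every stage is awaited locally, latency is dominated by model inference rather than the network, which is what makes the fully offline conversation possible.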
Currently supported models:

- TTS: Kokoro 82M, SpeechT5, Piper (VITS)
- STT: Whisper Tiny, Whisper Base
Other features:

- Side-by-side model comparison
- Speed benchmarking on your hardware
- Streaming generation for supported models
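Streaming generation maps naturally onto an async generator: audio chunks are yielded as they are produced, so playback can begin before the full utterance is synthesized. A minimal sketch with faked synthesis (the chunking and function names here are illustrative, not the project's actual API):

```javascript
// Yield audio chunks incrementally instead of returning one big buffer.
// A real model would synthesize audio for each text piece; here the
// "audio" is a fake Float32Array sized from the text.
async function* streamSpeech(text, chunkChars = 8) {
  for (let i = 0; i < text.length; i += chunkChars) {
    const piece = text.slice(i, i + chunkChars);
    yield new Float32Array(piece.length * 100); // fake audio chunk
  }
}

// Consumer: handle each chunk as it arrives, e.g. enqueue it into an
// AudioContext for immediate playback.
(async () => {
  let totalSamples = 0;
  for await (const chunk of streamSpeech("hello streaming world")) {
    totalSamples += chunk.length;
  }
  console.log(totalSamples); // 2100
})();
```

The upside is perceived latency: the listener hears the first chunk after a fraction of the total synthesis time.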
Source: https://github.com/MbBrainz/ttslab (MIT)
Feedback I'd especially like:

1. How does performance feel on your hardware?
2. What models should I add next?
3. Did the Voice Agent work for you? That's the most experimental part.
Built on top of ONNX Runtime Web (https://onnxruntime.ai) and Transformers.js — huge thanks to those communities for making in-browser ML inference possible.