
Show HN: TTSLab – A voice AI agent and TTS lab running in the browser via WebGPU

https://ttslab.dev
4•MbBrainz•2h ago
I built TTSLab — a free, open-source tool for running text-to-speech and speech-to-text models directly in the browser using WebGPU and WASM.

No API keys, no backend, no data leaves your machine.

When you open the site, you'll hear it immediately — the landing page auto-generates speech from three different sentences right in your browser, no setup required.

You can then try any model yourself: type text, hit generate, hear it instantly. Models download once and get cached locally.

The most experimental feature is a fully in-browser Voice Agent. It chains speech-to-text → LLM → text-to-speech, all running locally on your GPU via WebGPU. Once the models are downloaded and cached, you can have a spoken conversation with an AI without a single network request.
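For readers curious how such a chain fits together, here is a minimal sketch of one conversational turn. It is not TTSLab's actual code: the type names, `runTurn`, and the stub stages are all hypothetical, and the stubs stand in for real models so the example runs without WebGPU or any downloads.

```typescript
// Hypothetical stage signatures; none of these names come from TTSLab.
type SpeechToText = (audio: Float32Array) => Promise<string>;
type LanguageModel = (prompt: string) => Promise<string>;
type TextToSpeech = (text: string) => Promise<Float32Array>;

interface VoiceAgent {
  stt: SpeechToText;
  llm: LanguageModel;
  tts: TextToSpeech;
}

interface Turn {
  transcript: string;
  reply: string;
  audio: Float32Array;
}

// Run one conversational turn: each stage's output feeds the next stage.
async function runTurn(agent: VoiceAgent, micAudio: Float32Array): Promise<Turn> {
  const transcript = await agent.stt(micAudio);
  const reply = await agent.llm(transcript);
  const audio = await agent.tts(reply);
  return { transcript, reply, audio };
}

// Stub stages (assumptions for illustration only) in place of real models.
const stubAgent: VoiceAgent = {
  stt: async () => "hello",
  llm: async (prompt) => `You said: ${prompt}`,
  // One silent sample per character, just to produce a buffer of known length.
  tts: async (text) => new Float32Array(text.length),
};
```

In a real agent each stub would wrap a loaded model, and because the stages run strictly in sequence, the turn latency is roughly the sum of the three stage latencies.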

Currently supported models:
- TTS: Kokoro 82M, SpeechT5, Piper (VITS)
- STT: Whisper Tiny, Whisper Base

Other features:
- Side-by-side model comparison
- Speed benchmarking on your hardware
- Streaming generation for supported models
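One common way to express TTS speed on a given machine is the real-time factor (RTF): synthesis time divided by the duration of the audio produced, so values below 1 mean faster than real time. The sketch below shows that calculation. Whether TTSLab reports this exact metric is an assumption, and `realTimeFactor`, `benchmark`, and the `synthesize` signature are hypothetical names.

```typescript
// Real-time factor: time spent synthesizing divided by the duration of the audio produced.
function realTimeFactor(synthesisMs: number, sampleCount: number, sampleRate: number): number {
  const audioMs = (sampleCount / sampleRate) * 1000;
  return synthesisMs / audioMs;
}

// Time an arbitrary synthesis function (hypothetical signature) and report its RTF.
async function benchmark(
  synthesize: (text: string) => Promise<Float32Array>,
  text: string,
  sampleRate: number,
): Promise<{ elapsedMs: number; rtf: number }> {
  const start = performance.now();
  const audio = await synthesize(text);
  const elapsedMs = performance.now() - start;
  return { elapsedMs, rtf: realTimeFactor(elapsedMs, audio.length, sampleRate) };
}
```

For example, `realTimeFactor(500, 24000, 24000)` describes one second of audio generated in half a second, giving an RTF of 0.5.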

Source: https://github.com/MbBrainz/ttslab (MIT)

Feedback I'd especially like:
1. How does performance feel on your hardware?
2. What models should I add next?
3. Did the Voice Agent work for you? That's the most experimental part.

Built on top of ONNX Runtime Web (https://onnxruntime.ai) and Transformers.js — huge thanks to those communities for making in-browser ML inference possible.

Comments

MbBrainz•2h ago
Maker here. A few technical notes that might be interesting to this crowd:

The Voice Agent chains three models in the browser: Whisper for STT → a local LLM → Kokoro/SpeechT5 for TTS. All inference runs on-device via WebGPU. The latency isn't amazing yet, but the fact that it works at all with zero backend is kind of wild.

The landing page has an auto-playing demo that generates speech locally as soon as you visit — you'll hear it typewrite and speak three sentences. That was important to me because "runs in your browser" sounds like marketing until you actually hear it happen.

Happy to go deep on the WebGPU inference pipeline, model conversion process, or anything else.

Show HN: PgDog – Scale Postgres without changing the app

https://github.com/pgdogdev/pgdog
52•levkk•3h ago•13 comments

Show HN: Sowbot – open-hardware agricultural robot (ROS2, RTK GPS)

https://sowbot.co.uk/
36•Sabrees•2h ago•10 comments

Show HN: Shibuya – A High-Performance WAF in Rust with eBPF and ML Engine

https://ghostklan.com/shibuya.html
3•germainluperto•7m ago•0 comments

Show HN: AI Timeline – 171 LLMs from Transformer (2017) to GPT-5.3 (2026)

https://llm-timeline.com/
78•ai_bot•9h ago•39 comments

Show HN: AgentDbg - local-first debugger for AI agents (timeline, loops, etc.)

https://github.com/AgentDbg/AgentDbg
3•z-a-f•59m ago•2 comments

Show HN: What I've learned from shipping 25 mobile apps

https://newsletter.masilotti.com/p/what-ive-learned-from-shipping-25
3•joemasilotti•1h ago•0 comments

Show HN: CIA World Factbook Archive (1990–2025), searchable and exportable

https://cia-factbook-archive.fly.dev/
444•MilkMp•21h ago•88 comments

Show HN: Free ecommerce platform for link-in-bio people

https://stoar.page/
2•arajnoha•2h ago•2 comments

Show HN: Mato – a Multi-Agent Terminal Office workspace (tmux-like)

https://github.com/mr-kelly/mato
2•chepy•2h ago•0 comments

Show HN: Local-First Linux MicroVMs for macOS

https://shuru.run
204•harshdoesdev•23h ago•59 comments

Show HN: Agent Multiplexer – manage Claude Code via tmux

https://github.com/mixpeek/amux
2•Beefin•2h ago•0 comments

Show HN: EloPhanto – A self-evolving AI agent that builds its own tools

https://github.com/elophanto/EloPhanto
2•elophanto_agent•3h ago•0 comments

Show HN: Self-hosted lightweight file sharing app. (folderhost)

https://github.com/MertJSX/folderhost
8•mertjsx•3h ago•0 comments

Show HN: A geometric analysis of Chopin's Prelude No. 4 using 3D topology

https://github.com/jimishol/cholidean-harmony-structure/blob/main/docs/03-case-study-chopin-prelu...
47•jimishol•3d ago•11 comments

Show HN: I built an iOS app to WebRTC into my Mac terminal from the toilet

https://macky.dev
2•Sayuj01•4h ago•2 comments

Show HN: 3D Mahjong, Built in CSS

https://voxjong.com
118•rofko•1d ago•57 comments

Show HN: Slipshow, a multi-paradigm presentation tool

https://slipshow.org
2•panglesd•5h ago•0 comments

Show HN: SkillScan – Free API to detect malicious AI agent skill files

https://skillscan.chitacloud.dev
3•AutoPilotAI•5h ago•0 comments

Show HN: Keep your eyes healthy with 20 20 20 rule reminder using bash

https://gist.github.com/kwkr/b6376b4ade4d14467334bc0dbb845a16
3•zukerpie•5h ago•0 comments

Show HN: Agentic programming needs new processes

https://github.com/agereaude/cx/blob/main/CX.md
3•agereaude•5h ago•1 comment

Show HN: Implementing ping from the Ethernet layer (ARP,IPv4,ICMP in user space)

https://github.com/v420v/ping
4•ibuki256•9h ago•1 comment

Show HN: Visual Tailwind CSS Style Guide – Single HTML file, no build step

https://winkelstraatnl.github.io/tailwind-style-guide/
3•tomdeleria•6h ago•0 comments

Show HN: Monolith e-commerce platform for serverless

https://www.hoikka.dev/
2•zernobilly•6h ago•0 comments

Show HN: Rendering 18,000 videos in real-time with Python

https://madebymohammed.com/pysaic
35•mbmproductions•1d ago•5 comments

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

https://github.com/xaskasdf/ntransformer
383•xaskasdf•1d ago•96 comments

Show HN: TLA+ Workbench skill for coding agents (compat. with Vercel skills CLI)

https://github.com/younes-io/agent-skills/tree/main/skills/tlaplus-workbench
41•youio•1d ago•4 comments

Show HN: AgentReady – Drop-in proxy that cuts LLM token costs 40-60%

https://agentready.cloud/hn
7•christalingx•5h ago•11 comments

Show HN: CLI to estimate Cloudflare bill before it surprises you

https://github.com/michaloo/flareclerk
2•michaloo•8h ago•1 comment

Show HN: ZuckerBot. API and MCP server for AI agents to run Meta/Facebook ads

https://zuckerbot.ai/
5•DavisGrainger•13h ago•1 comment