I wanted to share a small project I've been working on to solve a personal pain point: TinyTTS.
We all love our massive 70B+ LLMs, but when building local voice assistants, running a heavy TTS framework alongside them often eats up way too much precious VRAM and compute. I wanted something absurdly small and fast that "just works" locally.
TL;DR Specs:
Size: ~9 Million parameters
Disk footprint: ~20 MB checkpoint (G.pth)
Speed (CPU): ~0.45s to generate 3.7s of audio (~8x faster than real-time)
Speed (GPU - RTX 4060): ~0.056s (~67x faster than real-time)
Peak VRAM: ~126 MB
License: Apache 2.0 (Open Weights)
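For reference, the speed multipliers above fall out of a one-line calculation: seconds of audio produced divided by seconds spent generating. Using the rounded figures from the spec list (the author's exact timings may differ slightly):

```python
# Real-time speedup = audio duration / generation time.
# Numbers taken from the TL;DR specs above (rounded).
audio_s = 3.7       # seconds of generated audio

cpu_gen_s = 0.45    # CPU generation time
gpu_gen_s = 0.056   # GPU (RTX 4060) generation time

cpu_speedup = audio_s / cpu_gen_s   # ≈ 8.2x faster than real-time
gpu_speedup = audio_s / gpu_gen_s   # ≈ 66x with these rounded inputs

print(f"CPU: {cpu_speedup:.1f}x, GPU: {gpu_speedup:.1f}x")
```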
Why TinyTTS? It is designed specifically for edge devices, CPU-only setups, or situations where your GPU is entirely occupied by your LLM. It's fully self-contained, meaning you don't need to run a complex pipeline of multiple models just to get audio out.
How to use it? It's completely plug-and-play with a simple Python API. Even better, on your first run it automatically downloads the tiny 20 MB checkpoint from Hugging Face into your local cache.
pip install git+https://github.com/tronghieuit/tiny-tts.git
Python API:
from tiny_tts import TinyTTS
# Auto-detects device (CPU/CUDA) and downloads the 20MB checkpoint
tts = TinyTTS()
tts.speak("The weather is nice today, and I feel very relaxed.", output_path="output.wav")
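Once `tts.speak(...)` has written `output.wav`, you can sanity-check the result with Python's stdlib `wave` module. A minimal sketch, which synthesizes a placeholder 3.7s sine-wave WAV so it runs without TinyTTS installed (the 16 kHz sample rate is an assumption; check the model card for the real one):

```python
import math
import struct
import wave

SAMPLE_RATE = 16000  # assumed sample rate, not confirmed by the project
DURATION_S = 3.7

# Placeholder: write a mono 16-bit PCM sine tone standing in for output.wav.
with wave.open("output.wav", "wb") as wf:
    wf.setnchannels(1)
    wf.setsampwidth(2)  # 16-bit samples
    wf.setframerate(SAMPLE_RATE)
    n_frames = int(SAMPLE_RATE * DURATION_S)
    wf.writeframes(b"".join(
        struct.pack("<h", int(8000 * math.sin(2 * math.pi * 220 * t / SAMPLE_RATE)))
        for t in range(n_frames)
    ))

# Sanity check: read the file back and compute its duration.
with wave.open("output.wav", "rb") as wf:
    duration = wf.getnframes() / wf.getframerate()

print(f"{duration:.2f}s of audio")  # → 3.70s of audio
```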
CLI:
tiny-tts --text "Local AI is the future" --device cpu
Links:
GitHub: https://github.com/tronghieuit/tiny-tts
Gradio Web Demo: available on HF Spaces
Hugging Face Model: backtracking/tiny-tts
What's next? I plan to clean up and publish the training code soon so the community can fine-tune it easily. I am also looking into adding ultra-lightweight zero-shot voice cloning.
Would love to hear your feedback or see if anyone manages to run this on a literal potato! Let me know what you think.
If you find this project helpful, please give it a star on GitHub.