I’m the creator of VoGen [https://vogen.app].
I’ve always been fascinated by how far AI voice technology has come, but I found that most existing tools capable of expressing emotion are gated behind expensive subscriptions. I built VoGen to explore how we can make AI voices more "human" and accessible.
What it does:
Voice Cloning: You can clone a voice using a 3-60 second sample. It works best with a clean, single-speaker recording.
Emotional TTS: Instead of a flat tone, you can choose from emotions such as Happy, Angry, and Sad.
Bilingual Support: It currently supports both English and Mandarin Chinese.
Privacy-First Tools: I also added a browser-based audio speed changer that processes files locally—no audio data ever leaves your machine for that specific tool.
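For the curious, here is a simplified sketch of how a fully client-side speed changer can work with the Web Audio API. It's illustrative rather than the exact production code, and the changeSpeed name is just a placeholder:

    // Simplified sketch of a client-side speed changer using the Web Audio API.
    // The file is decoded and re-rendered entirely in the browser; nothing is
    // uploaded. Note: changing playbackRate also shifts pitch (tape-style);
    // pitch-preserving stretch needs extra DSP on top of this.
    async function changeSpeed(file: File, rate: number): Promise<AudioBuffer> {
      const ctx = new AudioContext();
      const original = await ctx.decodeAudioData(await file.arrayBuffer());

      // Output length scales inversely with the playback rate.
      const offline = new OfflineAudioContext(
        original.numberOfChannels,
        Math.ceil(original.length / rate),
        original.sampleRate
      );

      const source = offline.createBufferSource();
      source.buffer = original;
      source.playbackRate.value = rate; // e.g. 1.5 = 50% faster
      source.connect(offline.destination);
      source.start();

      return offline.startRendering(); // rendered AudioBuffer stays on-device
    }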
The Tech Stack: The frontend is built with React.js, and it's deployed on Vercel. For the voice engine, I'm using a customized pipeline that focuses on low-latency inference while maintaining high fidelity.
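To give a sense of why low-latency inference matters end to end, here is a rough, generic illustration of how a browser client can start playing audio while it is still being generated, so fast time-to-first-chunk on the server turns into fast perceived response in the UI. The URL and the audio/mpeg container below are placeholders for illustration, not VoGen's actual API:

    // Rough illustration: begin playback as soon as the first audio chunks
    // arrive instead of waiting for the whole file. The endpoint URL and
    // "audio/mpeg" MIME type are placeholders, not VoGen's real API.
    async function playStreaming(url: string): Promise<void> {
      const audio = new Audio();
      const mediaSource = new MediaSource();
      audio.src = URL.createObjectURL(mediaSource);
      void audio.play(); // starts once enough data is buffered

      await new Promise<void>((resolve) =>
        mediaSource.addEventListener("sourceopen", () => resolve(), { once: true })
      );
      const buffer = mediaSource.addSourceBuffer("audio/mpeg");

      const reader = (await fetch(url)).body!.getReader();
      for (;;) {
        const { done, value } = await reader.read();
        if (done) break;
        buffer.appendBuffer(value);
        // Wait for the source buffer to finish ingesting before appending more.
        await new Promise((r) => buffer.addEventListener("updateend", r, { once: true }));
      }
      mediaSource.endOfStream();
    }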
Why is it free? Right now, VoGen is in its early stages (MVP). I want to see how people use it and what kind of voice quality the community expects before even thinking about monetization.
Privacy Note: I know how sensitive voice data is. Your uploaded cloning samples are not used to train the base models.
I’d love to get some feedback from the HN community. Whether it’s about the latency, the naturalness of the emotions, or the UI/UX—I’m all ears.
What features would make this more useful for your workflow?