frontpage.

Show HN: Voice to Text– Free browser-based speech-to-text with local projects

https://www.voicetotextonline.com/

1•digi_wares•1w ago

Hi HN,

I built a voice-to-text tool that runs entirely in your browser. No account required for the free tier, no data sent to my servers.

Try it: https://voicetotextonline.com

Why I built this:

- Existing tools require signups, have minute limits, or cost money - Google Docs voice typing requires a Google account - Dragon costs $150-500 - Otter.ai has free tier limits

(A) Free Features (no account required):

1/ Core Transcription:

- Real-time voice-to-text using Web Speech API - 55+ languages supported - Auto-punctuation & sentence case options - Works offline after first load (PWA)

2/ AI Enhance (added based on user survey – 80% voted yes):

- Auto-fix grammar, punctuation & formatting - One-click cleanup of transcripts

3/ My Projects (local storage):

- Save transcripts to browser localStorage - Organize with folders (Notes, Work, Personal, etc.) - Custom folders & tags - Search across all transcripts - Edit, copy, download as TXT - 100% private – never leaves your device

- Export:

- Copy to clipboard - Download as TXT or DOCX

(B) Pro Features ($10/month or $1/hour pay-per-use):

1/ File Upload & Transcription:

- Upload audio/video files (MP3, WAV, M4A, MP4, MOV, AVI, MKV) - Up to 500MB per file - Batch upload (10 files at once) - Powered by AssemblyAI (95%+ accuracy) - 150 hours/month transcription

2/ Advanced Features:

- Real-time progress with ETA - Speaker labels - In-browser audio recording (5 min with pause/resume) - Translation to 25+ languages (GPT-4o)

3/ Export Formats:

- TXT, SRT, VTT, JSON with timestamps - Segment-level timestamp precision

4/ Cloud Storage:

- Transcription history in the cloud - 10 GB storage, 1,000 files/month

Free tier:

- All transcripts stored in browser localStorage only - Never touches our servers - 100% private

Pro tier:

- Audio files stored in Supabase (encrypted) - Files retained for 30 days for re-download, then auto-deleted - Transcripts stored permanently in your account - You can delete any transcript or your entire account anytime - We don't use your data for training

Tech stack:

- Next.js 14 (App Router) - Web Speech API (free real-time transcription) - AssemblyAI (Pro file transcription, 95%+ accuracy) - OpenAI GPT-4o (AI Enhance & translation) - Supabase (auth & storage) - Stripe (payments) - Tailwind CSS - Hosted on Vercel

Limitations:

- Real-time transcription doesn't work in Firefox (Web Speech API not supported) - Free tier accuracy depends on Chrome's speech engine

Would love feedback on UX, pricing, or feature ideas. Considering open-sourcing the core transcription component.

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

A Bid-Based NFT Advertising Grid

AI readability score for your documentation

NASA Study: Non-Biologic Processes Don't Explain Mars Organics

I inhaled traffic fumes to find out where air pollution goes in my body

X said it would give $1M to a user who had previously shared racist posts

155M US land parcel boundaries

Private Inference

Font Rendering from First Principles

Show HN: Seedance 2.0 AI video generator for creators and ecommerce

Wally: A fun, reliable voice assistant in the shape of a penguin

Rewriting Pycparser with the Help of an LLM

Lobsters Vibecoding Challenge

E-Commerce vs. Social Commerce

Avoiding Modern C++ – Anton Mikhailov [video]

Show HN: AegisMind–AI system with 12 brain regions modeled on human neuroscience

Zig – Package Management Workflow Enhancements

AI-powered text correction for macOS

AppSecMaster – Learn Application Security with hands on challenges

Fibonacci Number Certificates

AI Overviews are killing the web search, and there's nothing we can do about it

City skylines need an upgrade in the face of climate stress

1979: The Model World of Robert Symes [video]

Satellites Have a Lot of Room

1980s Farm Crisis

Show HN: FSID - Identifier for files and directories (like ISBN for Books)

Show HN: Holy Grail: Open-Source Autonomous Development Agent

Show HN: Minecraft Creeper meets 90s Tamagotchi

Show HN: Termiteam – Control center for multiple AI agent terminals

The only U.S. particle collider shuts down