Show HN: Voice to Text – Free browser-based speech-to-text with local projects

https://www.voicetotextonline.com/
1•digi_wares•1h ago
Hi HN,

I built a voice-to-text tool that runs entirely in your browser. No account required for the free tier, no data sent to my servers.

Try it: https://voicetotextonline.com

Why I built this:

- Existing tools require signups, have minute limits, or cost money
- Google Docs voice typing requires a Google account
- Dragon costs $150-500
- Otter.ai has free tier limits

(A) Free Features (no account required):

1/ Core Transcription:

- Real-time voice-to-text using Web Speech API
- 55+ languages supported
- Auto-punctuation & sentence case options
- Works offline after first load (PWA)
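As a rough illustration of the transcription and sentence-case items above, here is a minimal sketch and not the site's actual code. It assumes results arrive in the shape the Web Speech API uses (each result has an `isFinal` flag and a top alternative with a `transcript`). The local types and the helper names `collectFinalText` and `toSentenceCase` are hypothetical.

```typescript
// Minimal structural types mirroring the shape of Web Speech API results.
// Declared locally (an assumption) so the sketch compiles outside a browser.
interface AlternativeLike {
  transcript: string;
}
interface ResultLike {
  isFinal: boolean;
  0: AlternativeLike; // top-ranked alternative
}

// Join the top alternative of every finalized result, skipping interim ones.
function collectFinalText(results: ArrayLike<ResultLike>): string {
  const parts: string[] = [];
  for (let i = 0; i < results.length; i++) {
    const result = results[i];
    if (result.isFinal) parts.push(result[0].transcript.trim());
  }
  return parts.join(" ");
}

// Hypothetical "sentence case" option: capitalize the first letter of the
// text and the first letter after each ".", "!" or "?" followed by whitespace.
function toSentenceCase(text: string): string {
  return text.replace(
    /(^\s*|[.!?]\s+)([a-z])/g,
    (_match: string, lead: string, ch: string) => lead + ch.toUpperCase()
  );
}
```

In a browser, `collectFinalText(event.results)` could be called from the recognizer's `onresult` handler, with the sentence-case pass applied only when the user enables that option.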

2/ AI Enhance (added based on user survey – 80% voted yes):

- Auto-fix grammar, punctuation & formatting
- One-click cleanup of transcripts

3/ My Projects (local storage):

- Save transcripts to browser localStorage
- Organize with folders (Notes, Work, Personal, etc.)
- Custom folders & tags
- Search across all transcripts
- Edit, copy, download as TXT
- 100% private – never leaves your device
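The local-storage features above could be sketched roughly as follows. This is an illustration under assumptions, not the project's implementation: the `Transcript` shape, the `"transcripts"` key, and all function names are hypothetical. The store is written against a small subset of the browser `Storage` interface so that `window.localStorage` can be passed in directly.

```typescript
// Subset of the browser Storage interface; window.localStorage satisfies it.
interface StorageLike {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Hypothetical record shape for a saved transcript.
interface Transcript {
  id: string;
  folder: string;
  tags: string[];
  text: string;
}

const STORAGE_KEY = "transcripts"; // hypothetical key name

function loadTranscripts(storage: StorageLike): Transcript[] {
  const raw = storage.getItem(STORAGE_KEY);
  if (raw === null) return [];
  try {
    const parsed: unknown = JSON.parse(raw);
    return Array.isArray(parsed) ? (parsed as Transcript[]) : [];
  } catch {
    return []; // corrupted JSON: treat as empty rather than crash
  }
}

// Insert a transcript, replacing any existing entry with the same id.
function saveTranscript(storage: StorageLike, t: Transcript): void {
  const all = loadTranscripts(storage).filter((x) => x.id !== t.id);
  all.push(t);
  storage.setItem(STORAGE_KEY, JSON.stringify(all));
}

// Case-insensitive search over transcript text and tags.
function searchTranscripts(storage: StorageLike, query: string): Transcript[] {
  const q = query.toLowerCase();
  return loadTranscripts(storage).filter(
    (t) =>
      t.text.toLowerCase().includes(q) ||
      t.tags.some((tag) => tag.toLowerCase().includes(q))
  );
}

// In-memory stand-in so the sketch also runs outside a browser.
function createMemoryStorage(): StorageLike {
  const data = new Map<string, string>();
  return {
    getItem: (key) => data.get(key) ?? null,
    setItem: (key, value) => {
      data.set(key, value);
    },
  };
}
```

Keeping everything under one JSON key is the simplest layout; with many long transcripts, per-transcript keys would avoid rewriting the whole list on every save.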

4/ Export:

- Copy to clipboard
- Download as TXT or DOCX

(B) Pro Features ($10/month or $1/hour pay-per-use):

1/ File Upload & Transcription:

- Upload audio/video files (MP3, WAV, M4A, MP4, MOV, AVI, MKV)
- Up to 500MB per file
- Batch upload (10 files at once)
- Powered by AssemblyAI (95%+ accuracy)
- 150 hours/month transcription
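The upload limits above lend themselves to a client-side check before anything is sent. The sketch below is an assumption about how such a check might look, not the service's real validation: `UploadCandidate` and `validateBatch` are hypothetical names, and "500MB" is read as 500 MiB.

```typescript
// Limits taken from the feature list; interpreting "500MB" as 500 MiB is an assumption.
const ALLOWED_EXTENSIONS = ["mp3", "wav", "m4a", "mp4", "mov", "avi", "mkv"];
const MAX_FILE_BYTES = 500 * 1024 * 1024;
const MAX_BATCH_SIZE = 10;

// Hypothetical minimal description of a file picked for upload.
interface UploadCandidate {
  name: string;
  sizeBytes: number;
}

// Return a list of human-readable problems; an empty list means the batch is acceptable.
function validateBatch(files: UploadCandidate[]): string[] {
  const errors: string[] = [];
  if (files.length > MAX_BATCH_SIZE) {
    errors.push(`Too many files: ${files.length} (max ${MAX_BATCH_SIZE})`);
  }
  for (const file of files) {
    const dot = file.name.lastIndexOf(".");
    const ext = dot === -1 ? "" : file.name.slice(dot + 1).toLowerCase();
    if (!ALLOWED_EXTENSIONS.includes(ext)) {
      errors.push(`${file.name}: unsupported format`);
    }
    if (file.sizeBytes > MAX_FILE_BYTES) {
      errors.push(`${file.name}: exceeds 500MB`);
    }
  }
  return errors;
}
```

Checking only the extension is a convenience for the user; a server would still need to inspect the actual file contents.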

2/ Advanced Features:

- Real-time progress with ETA
- Speaker labels
- In-browser audio recording (5 min with pause/resume)
- Translation to 25+ languages (GPT-4o)

3/ Export Formats:

- TXT, SRT, VTT, JSON with timestamps
- Segment-level timestamp precision
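To make the SRT and VTT exports above concrete, here is a minimal sketch of turning timestamped segments into both formats. The `Segment` shape and function names are hypothetical; the timestamp layouts follow the two formats themselves (SRT uses a comma before the milliseconds, WebVTT uses a period and starts with a `WEBVTT` header).

```typescript
// Hypothetical segment shape: start/end in milliseconds plus the spoken text.
interface Segment {
  startMs: number;
  endMs: number;
  text: string;
}

// Format milliseconds as HH:MM:SS<sep>mmm ("," for SRT, "." for WebVTT).
function formatTimestamp(ms: number, separator: "," | "."): string {
  const pad = (n: number, width: number) => String(n).padStart(width, "0");
  const hours = Math.floor(ms / 3600000);
  const minutes = Math.floor((ms % 3600000) / 60000);
  const seconds = Math.floor((ms % 60000) / 1000);
  const millis = Math.floor(ms % 1000);
  return `${pad(hours, 2)}:${pad(minutes, 2)}:${pad(seconds, 2)}${separator}${pad(millis, 3)}`;
}

// SRT: numbered cues separated by a blank line.
function toSrt(segments: Segment[]): string {
  return segments
    .map(
      (s, i) =>
        `${i + 1}\n${formatTimestamp(s.startMs, ",")} --> ${formatTimestamp(s.endMs, ",")}\n${s.text}\n`
    )
    .join("\n");
}

// WebVTT: "WEBVTT" header, blank line, then cues separated by blank lines.
function toVtt(segments: Segment[]): string {
  const cues = segments.map(
    (s) => `${formatTimestamp(s.startMs, ".")} --> ${formatTimestamp(s.endMs, ".")}\n${s.text}\n`
  );
  return "WEBVTT\n\n" + cues.join("\n");
}
```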

4/ Cloud Storage:

- Transcription history in the cloud
- 10 GB storage, 1,000 files/month

(C) Data & Privacy:

Free tier:

- All transcripts stored in browser localStorage only
- Never touches our servers
- 100% private

Pro tier:

- Audio files stored in Supabase (encrypted)
- Files retained for 30 days for re-download, then auto-deleted
- Transcripts stored permanently in your account
- You can delete any transcript or your entire account anytime
- We don't use your data for training
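The 30-day retention rule for Pro audio files can be expressed as simple date arithmetic. This is only a sketch of the stated policy: how the service actually schedules deletion is not described in the post, and `deletionDate` and `isExpired` are hypothetical names.

```typescript
const RETENTION_DAYS = 30; // from the stated policy
const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Earliest moment at which an uploaded file becomes eligible for auto-deletion.
function deletionDate(uploadedAt: Date): Date {
  return new Date(uploadedAt.getTime() + RETENTION_DAYS * MS_PER_DAY);
}

// True once the retention window has fully elapsed.
function isExpired(uploadedAt: Date, now: Date): boolean {
  return now.getTime() >= deletionDate(uploadedAt).getTime();
}
```

A scheduled cleanup job could call `isExpired(file.uploadedAt, new Date())` for each stored file, where `file.uploadedAt` is again an assumed field.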

Tech stack:

- Next.js 14 (App Router)
- Web Speech API (free real-time transcription)
- AssemblyAI (Pro file transcription, 95%+ accuracy)
- OpenAI GPT-4o (AI Enhance & translation)
- Supabase (auth & storage)
- Stripe (payments)
- Tailwind CSS
- Hosted on Vercel

Limitations:

- Real-time transcription doesn't work in Firefox (the Web Speech API's speech recognition is not supported there)
- Free tier accuracy depends on Chrome's speech engine
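Given the browser limitation above, a page would typically feature-detect speech recognition before offering real-time transcription. A minimal sketch, assuming the constructor is exposed either as `SpeechRecognition` or under Chrome's prefixed name `webkitSpeechRecognition`; the helper name `getSpeechRecognitionCtor` is hypothetical, and the scope object is passed in so the function can be exercised outside a browser.

```typescript
type RecognitionCtor = new () => unknown;

// Look up the speech recognition constructor on a global-like object.
// Returns null when neither the standard nor the prefixed name is present.
function getSpeechRecognitionCtor(scope: Record<string, unknown>): RecognitionCtor | null {
  const candidate = scope["SpeechRecognition"] ?? scope["webkitSpeechRecognition"];
  return typeof candidate === "function" ? (candidate as RecognitionCtor) : null;
}
```

In a browser this might be called as `getSpeechRecognitionCtor(window as unknown as Record<string, unknown>)`, showing a "not supported in this browser" notice when it returns `null`.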

Would love feedback on UX, pricing, or feature ideas. Considering open-sourcing the core transcription component.

Target 1: Baseten

https://www.silares.com/targets/target-1-baseten
1•todsacerdoti•7m ago•0 comments

A multi-entry CFG design conundrum

https://bernsteinbear.com/blog/multiple-entry/
2•fbuilesv•12m ago•0 comments

A Massacre in Mashhad

https://www.newyorker.com/news/as-told-to/a-massacre-in-mashhad
3•Tomte•15m ago•1 comments

Disney to Pay $10M in FTC Children's Data Settlement

https://natlawreview.com/article/court-approves-order-requiring-disney-pay-10mm-settle-ftc-attorn...
2•petethomas•17m ago•0 comments

Show HN: Lumina – Open-source observability for LLM applications

https://github.com/use-lumina/Lumina
3•iggycodexs•27m ago•0 comments

Hacker turned WiFi airwaves into LED art with a Raspberry Pi

https://www.theregister.com/2026/01/23/raspberry_pi_wifi_wall_art/
3•ghendelf•32m ago•0 comments

What Is Clawdbot? (and Why People Are Losing Their Minds over It)

https://twitter.com/noahepstein_/status/2015073824799371370
3•taubek•37m ago•0 comments

Kitty Cards

https://lmno.lol/alvaro/introducing-kitty-cards
3•todsacerdoti•37m ago•0 comments

'Fundamental reset': Scott Bessent has a plan to free the nation's banks

https://www.politico.com/news/2026/01/24/scott-bessent-banks-00744468
1•scrubs•41m ago•1 comments

Show HN: Markdown Viewer with LaTeX Math Support and Export to PDF/Word/HTML

https://markdownviewer.cc
1•LuckyBuddy•44m ago•0 comments

Box64 Expands into RISC-V and LoongArch territory

https://boilingsteam.com/box64-expands-into-risc-v-and-loong-arch-territory/
1•ekianjo•44m ago•0 comments

Neveragain.tech

https://neveragain.tech/
3•m-hodges•45m ago•0 comments

Show HN: Run world-class focus groups in minutes

https://chatgpt.com/g/g-6835306f9c48819185d3a665c09cc5d2-focus-groups-like-a-pro
1•vassilbek•46m ago•0 comments

Part 1: IndiaAI mission does not need compute, it needs data

https://gpt3experiments.substack.com/p/part-1-indiaai-mission-does-not-need
1•nutanc•46m ago•0 comments

JVic – The web-based VIC 20 emulator built with libGDX

https://vic20.games/
1•duck•54m ago•0 comments

The most precise mechanical indicators ever made – The Mikrokator [video]

https://www.youtube.com/watch?v=_HIKmxHcxkg
1•pillars•1h ago•0 comments

Apt-bundle: brew bundle for apt

https://github.com/apt-bundle/apt-bundle
1•sadeshmukh•1h ago•0 comments

Show HN: Document your tRPC API without mapping to OpenAPI

1•liorcodev•1h ago•0 comments

Deaths and deportations of citizens by Trump administration

https://en.wikipedia.org/wiki/Deaths,_detentions_and_deportations_of_American_citizens_in_the_sec...
13•praptak•1h ago•0 comments

Jeffrey Way: I'm Done

https://www.youtube.com/watch?v=g_Bvo0tsD9s
3•doppp•1h ago•0 comments

Ask HN

2•mekod•1h ago•0 comments

Fast Joystick Read – Elon Musk on Usenet, 1994

https://twitter.com/UsenetGems/status/2004876626161721467
1•nomilk•1h ago•0 comments

World’s most powerful literary critic is on TikTok

https://www.newstatesman.com/culture/books/2026/01/the-worlds-most-powerful-literary-critic-is-on...
2•insistey•1h ago•0 comments

Non-Functional Requirements: The Secret Sauce Nobody Wants to Season

https://blog.hermesc.gr/non-functional-requirements-the-secret-sauce-nobody-wants-to-season/
1•puppion•1h ago•0 comments

Show HN: Structured data extraction using local quantized LLMs

https://github.com/nxank4/loclean
1•nxank4•1h ago•0 comments

Jurassic Park: SGI Computers (2010)

http://www.sgistuff.net/funstuff/hollywood/jpark.html
1•exvi•1h ago•0 comments

Earth's Rotation Limits IBIS Performance to 6.3 Stops

https://thecentercolumn.com/2020/01/17/earths-rotation-limits-ibis-performance-to-6-3-stops/
4•Geo_ge•1h ago•0 comments

Lawsuit Claims Meta Can See WhatsApp Chats in Breach of Privacy

https://www.bloomberg.com/news/articles/2026-01-25/lawsuit-claims-meta-can-see-whatsapp-chats-in-...
4•g-b-r•1h ago•0 comments

The Brain of the Greatest Solo Climber

https://nautil.us/the-strange-brain-of-the-worlds-greatest-solo-climber-236051/
1•blondie9x•1h ago•0 comments