frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: VoxConvo – "X but it's only voice messages"

https://voxconvo.com
6•siim•2h ago
Hi HN,

I saw this tweet: "Hear me out: X but it's only voice messages (with AI transcriptions)" - and couldn't stop thinking about it.

So I built VoxConvo.

Why this exists:

AI-generated content is drowning social media. ChatGPT replies, bot threads, AI slop everywhere.

When you hear someone's actual voice: their tone, hesitation, excitement - you know it's real. That authenticity is what we're losing.

So I built a simple platform where voice is the ONLY option.

The experience:

Every post is voice + transcript with word-level timestamps:

Read mode: Scan the transcript like normal text or listen mode: hit play and words highlight in real-time.

You get the emotion of voice with the scannability of text.

Key features:

- Voice shorts

- Real-time transcription

- Visual voice editing - click a word in transcript deletes that audio segment to remove filler words, mistakes, pauses

- Word-level timestamp sync

- No LLM content generation

Technical details:

Backend running on Mac Mini M1:

- TypeGraphQL + Apollo Server

- MongoDB + Atlas Search (community mongo + mongot)

- Redis pub/sub for GraphQL subscriptions

- Docker containerization for ready to scale

Transcription:

- VOSK real time gigaspeech model eats about 7GB RAM

- WebSocket streaming for real-time partial results

- Word-level timestamp extraction plus punctuation model

Storage:

- Audio files are stored to AWS S3

- Everything else is local

Why Mac Mini for MVP? Validation first, scaling later. Architecture is containerized and ready to migrate. But I'd rather prove demand on gigabit fiber than burn cloud budget.

Comments

cdrini•1h ago
Neat idea! Not sure if I'm willing to register just try it, though. Having the main feed public would be nice! Or even a sample feed.
1bpp•1h ago
How would this prevent someone from just plugging ElevenLabs into it? Or the inevitable more realistic voice models? Or just a prerecorded spam message? It's already nearly impossible to tell if some speech is human or not. I do like the idea of recovering the emotional information lost in speech -> text, but I don't think it'd help the LLM issue.
layman51•51m ago
Or also a genuine human voice reading a script that’s partially or almost entirely LLM written? I think there must be some video content creators who do that.
SrslyJosh•47m ago
Detecting "human speech" means shutting out people who cannot speak and rely on TTS for verbal communication.
cjflog•59m ago
Did you ever use AirChat?
esafak•53m ago
So you're going to reject recordings detected as computer generated, or human recorded from a computer-generated script?

I feel like you are making your users jump through hoops to do bot and slop detection, when you ought to be investing in technology to do the same. Here is a focusing question: would you still demand audio recordings if you had that technology?

Maybe you will court an interesting set of users when you do this? I just know I will not be one of them; ain't got time for that. Good luck.

zahlman•50m ago
> I saw this tweet: "Hear me out: X but it's only voice messages (with AI transcriptions)" - and couldn't stop thinking about it.

> Why this exists: AI-generated content is drowning social media.

> Real-time transcription

... So you want to filter out AI content by requiring users to produce audio (not really any harder for AI than text), and you add AI content afterward (the transcriptions) anyway?

I really think you should think this through more.

The "authenticity" problem is fundamentally about how users discover each other. You get flooded with AI slop because the algorithm is pushing it in front of you. And that algorithm is easily gamed, and all the existing competitors are financially incentivized to implement such an algorithm and not care about the slop.

Also, I looked at the page source and it gives a strong impression that you are using AI to code the project and also that your client fundamentally works by querying an LLM on the server. It really doesn't convey the attitude supposedly motivating the project.

Nice tech demo though, I guess.

jagged-chisel•20m ago
“Sign in with Google”

:grimace:

Sorry, but I have to pass.

oulipo2•14m ago
Idea is cool, but the STT is bad (at least with an accent), and the fact that you need to edit each word is too cumbersome

I (don't) have dementia [video]

https://www.youtube.com/watch?v=nvkQd_yZqdM
1•Fr0styMatt88•1m ago•0 comments

Her Research Could Improve Training for Service Dogs

https://www.nytimes.com/2025/11/06/science/lost-science-hecht-service-dogs.html
1•mikhael•2m ago•0 comments

The Path to a Superhuman AI Mathematician

https://cacm.acm.org/news/the-path-to-a-superhuman-ai-mathematician/
1•bikenaga•3m ago•0 comments

Mullvad: Shutting down our search proxy Leta

https://mullvad.net/en/blog/shutting-down-our-search-proxy-leta
1•holysoles•8m ago•1 comments

Mechanism Design Theory

1•mertbirlik•9m ago•0 comments

Simcube: Boost AI App Revenue with Conversational Product Placement

https://www.simcube.ai/
2•jwstanwick03•11m ago•0 comments

Defunct Pennsylvania oil and gas wells may leak methane and metals into water

https://phys.org/news/2025-11-defunct-pennsylvania-oil-gas-wells.html
3•bikenaga•18m ago•1 comments

OpenAI's $1T Infrastructure Spend for 2025-2035

https://tomtunguz.com/openai-hardware-spending-2025-2035/
2•walterbell•20m ago•3 comments

James Watson, who co-discovered the structure of DNA, has died at age 97

https://www.npr.org/2025/11/07/nx-s1-5144654/james-watson-dna-double-helix-dies
1•voxadam•21m ago•0 comments

Get Stronger by Greasing the Groove

https://www.artofmanliness.com/health-fitness/fitness/get-stronger-by-greasing-the-groove/
1•sbmthakur•22m ago•0 comments

Removing notifications for mentions in commit messages

https://github.blog/changelog/2025-11-07-removing-notifications-for-mentions-in-commit-messages/
1•super_linear•22m ago•0 comments

Immutable Software Deploys Using ZFS Jails on FreeBSD

https://conradresearch.com/articles/immutable-software-deploy-zfs-jails
2•vermaden•24m ago•0 comments

Wine Gaming in Containers with BastilleBSD Jails on FreeBSD

https://pertho.net/2025/11/07/wine-gaming-freebsd-jails/
2•vermaden•25m ago•0 comments

Windows "SUCKS": How I'd Fix it by a retired Microsoft Windows engineer [video]

https://www.youtube.com/watch?v=oTpA5jt1g60
1•pregnenolone•26m ago•0 comments

Samsung, Micron, SK Hynix dodge DRAM Price Fixing Lawsuit (2022)

https://www.tomshardware.com/news/samsung-micron-sk-hynix-dodge-dram-price-fixing-lawsuit
2•walterbell•29m ago•0 comments

Satellite images, maps and records reveal surge in China's missile production

https://www.cnn.com/2025/11/07/world/china-missile-production-expansion-revealed-satellite-images...
7•Teever•30m ago•0 comments

Average credit card processing fees and costs in 2025

https://www.fool.com/money/research/average-credit-card-processing-fees-costs-america/
1•hhs•30m ago•0 comments

The Boss Has a Message: Use AI or You're Fired

https://www.wsj.com/tech/ai/ai-work-use-performance-reviews-1e8975df
6•zerosizedweasle•31m ago•4 comments

Snapchat open-sources Valdi a cross-platform UI framework

https://github.com/Snapchat/Valdi
2•yehiaabdelm•31m ago•0 comments

Cycling Club: El Social Rides

https://www.elsr.co.uk/about
1•pppone•36m ago•0 comments

Thoughts by a non-economist on AI and economics

https://windowsontheory.org/2025/11/04/thoughts-by-a-non-economist-on-ai-and-economics/
1•gmays•37m ago•0 comments

The New Y Combinator

https://jaredheyman.medium.com/on-the-new-y-combinator-3c28e548896c
6•langitbiru•41m ago•0 comments

Mojo Miji – A Guide to Mojo Programming Language from a Pythonista's Perspective

https://mojo-lang.com/miji/
1•SerCe•43m ago•0 comments

Building a High-Performance Ticketing System with TigerBeetle

https://renerocks.ai/blog/2025-11-02--tigerfans/
3•jorangreef•45m ago•0 comments

Cerebras Code now supports GLM 4.6 at 1000 tokens/sec

https://www.cerebras.ai/code
1•nathabonfim59•45m ago•0 comments

Averting Collapse Is No Longer Profitable

https://thehonestsorcerer.substack.com/p/averting-collapse-is-no-longer-profitable
2•rolph•46m ago•0 comments

Micron chip factories in Upstate NY will be delayed by 2-3 years, company says

https://www.syracuse.com/micron/2025/11/micron-chip-factories-in-upstate-ny-delayed-by-two-to-thr...
3•hhs•47m ago•0 comments

Australia's social media age restrictions are already working

https://www.abc.net.au/religion/australia-social-media-age-restrictions-already-working/105986156
3•breve•48m ago•0 comments

Data scientists perform last rites for 'dearly departed datasets' in 2nd Trump

https://apnews.com/article/census-bureau-data-scientists-trump-doge-7558df32c4ff7d2152aa5d8b02c0ae57
2•smcin•48m ago•1 comments

New quantum hardware puts the mechanics in quantum mechanics

https://arstechnica.com/science/2025/11/new-quantum-computing-hardware-sorts-ions-for-computation/
6•rbanffy•50m ago•0 comments