Microsoft VibeVoice: Open-Source Frontier Voice AI

https://github.com/microsoft/VibeVoice
58•tosh•1h ago

Comments

CubsFan1060•41m ago
Great post last night from Simon: https://simonwillison.net/2026/Apr/27/vibevoice/
542458•32m ago
Note that this just covers the Speech-to-Text/Speech-Recognition aspect (à la Whisper); there are also models for long-form Text-To-Speech and streaming Text-To-Speech.
JumpCrisscross•2m ago
“VibeVoice can only handle up to an hour of audio”

Why?

podgietaru•41m ago
So we've really just settled on Vibe as the verb for AI then?
pryanshu89•35m ago
Why use precise technical language when you can just vibe with your AI system?
giarc•32m ago
I'd be willing to bet it will be "Word of the Year" for 2026. Merriam-Webster had 'slop' for 2025, and 'polarization' for 2024. Is there a prediction market for this?
internet_points•5m ago
it'll probably be something we're not even talking about yet - we still have 7 months in which to make the world even worse
embedding-shape•34m ago
Isn't this project the one Microsoft published but then pulled soon after for security/safety reasons? What has changed since then?
542458•30m ago
Look at the "News" section in the readme. The original TTS model is gone from this repo (you can still find it in other places), but the STT/ASR, long-form TTS, and streaming TTS models are new.
walthamstow•32m ago
Seems quite heavy for an STT model. Parakeet and Whisper are much smaller and perform great for quick dictation and transcription of longer files. I guess that's due to additional accuracy and speaker diarisation?

The TTS example clip in the repo of 'spontaneous singing' is creepy as fuck

steinvakt2•26m ago
This is not a new model. Also, it hallucinates a lot. Also, it's very heavy and slow at inference. It's also bad at multilingual transcription.

Edit: I'm talking purely about speech to text (STT). Not sure about the other things this can do.

lblock•12m ago
Yeah, I don't get why it is suddenly getting so much attention today; it is all over Twitter too.
ramon156•1m ago
well duh, they updated the news section

https://github.com/microsoft/VibeVoice/commit/e73d1e17c3754f...

which is microsoft for "we removed two dead links". AI innovation knows no limits!

SecretDreams•4m ago
I think this was all covered when they said it was released by Microsoft?
Anonyneko•21m ago
You have selected Microsoft Sam as the computer's default voice.
Void_•16m ago
In the past month or so, I added 2 models to my app Whisper Memos (https://whispermemos.com):

- Cohere Transcribe (self hosted)

- Grok Speech To Text (they provide an API, only $0.10/hr!)

They are both excellent. I'm not sure about this one. Would you like to see it in a consumer speech to text app?

olejorgenb•6m ago
I've had good experiences with the Mistral Voxtral models (I've used the API, but some of the model-variants are open weight)
2ndorderthought•5m ago
Have you tried qwen?
SecretDreams•3m ago
Any non-Musk alternatives that are comparable in quality and cost?
maxloh•15m ago
I think we should stop calling this type of model open source. They are indeed "open weight." The training code is proprietary and never revealed.

https://github.com/microsoft/VibeVoice/issues/102

JumpCrisscross•6m ago
> we should stop calling this type of model open source. They are indeed "open weight."

This ship has sailed. It’s now in the same category as hacker/cracker and the pronunciation of GIF.

andy_ppp•4m ago
I think you mean GIF.
pluc•9m ago
Interesting story about this repo/product/author by cybersecurity researcher Kevin Beaumont: https://cyberplace.social/@GossiTheDog/116454846703138243
mistic92•8m ago
For me, it's giving very poor results.
