frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Localsend: An open-source cross-platform alternative to AirDrop

https://github.com/localsend/localsend
237•bilsbie•2h ago•95 comments

Microsoft VibeVoice: Open-Source Frontier Voice AI

https://github.com/microsoft/VibeVoice
106•tosh•2h ago•60 comments

Show HN: Live Sun and Moon Dashboard with NASA Footage

https://www.lumara-space.app/
16•beeswaxpat•52m ago•1 comments

The World's Most Complex Machine

https://worksinprogress.co/issue/the-worlds-most-complex-machine/
189•mellosouls•3d ago•95 comments

Talkie: a 13B vintage language model from 1930

https://talkie-lm.com/introducing-talkie
486•jekude•16h ago•189 comments

OpenAI CEO's Identity Verification Company Announced Fake Bruno Mars Partnership

https://www.vice.com/en/article/openai-ceo-identity-verification-company-fake-bruno-mars-partners...
46•BoggleOhYeah•52m ago•19 comments

Microsoft and OpenAI end their exclusive and revenue-sharing deal

https://www.bloomberg.com/news/articles/2026-04-27/microsoft-to-stop-sharing-revenue-with-main-ai...
933•helsinkiandrew•1d ago•791 comments

UAE Leaves OPEC and OPEC+

https://www.reuters.com/markets/commodities/uae-says-it-quits-opec-opec-statement-2026-04-28/
69•TechTechTech•1h ago•1 comments

The predictable failure of the QDay Prize

https://algassert.com/post/2601
28•firefly284•1d ago•2 comments

Deep under Antarctic ice, a long-predicted cosmic whisper breaks through

https://phys.org/news/2026-04-deep-antarctic-ice-cosmic-strange.html
9•rbanffy•1d ago•1 comments

I Spent My Sabbatical Building a Power Meter for Sledgehammers

https://leblancfg.com/intensity-pad-founder-story.html
20•alin23•1d ago•8 comments

Is my blue your blue? (2024)

https://ismy.blue/
634•theogravity•17h ago•412 comments

WASM is not quite a stack machine

https://purplesyringa.moe/blog/wasm-is-not-quite-a-stack-machine/
99•signa11•9h ago•36 comments

Can You Find the Comet?

https://apod.nasa.gov/apod/ap260427.html
94•ColinWright•1d ago•50 comments

Period tracking app has been yapping about your flow to Meta

https://femtechdesigndesk.substack.com/p/your-period-tracking-app-has-been
111•campuscodi•2h ago•94 comments

GTFOBins

https://gtfobins.org/
294•StefanBatory•7h ago•72 comments

UAE to leave OPEC in blow to oil cartel

https://www.ft.com/content/8c354f2d-3e66-47f1-aad4-9b4aa30e386d
101•bazzmt•1h ago•89 comments

Tiled Words 6 Month Update

https://paulmakeswebsites.com/writing/six-months-of-tiled-words/
40•paulhebert•1d ago•10 comments

Mo RAM, Mo Problems (2025)

https://fabiensanglard.net/curse/
173•blfr•2d ago•30 comments

Pgrx: Build Postgres Extensions with Rust

https://github.com/pgcentralfoundation/pgrx
136•luu•3d ago•13 comments

GitHub Copilot code review will start consuming GitHub Actions minutes

https://github.blog/changelog/2026-04-27-github-copilot-code-review-will-start-consuming-github-a...
54•whtsky•5h ago•60 comments

4TB of voice samples just stolen from 40k AI contractors at Mercor

https://app.oravys.com/blog/mercor-breach-2026
573•Oravys•1d ago•216 comments

In Kannauj, perfumers have been making monsoon-infused mitti attar for centuries

https://www.atlasobscura.com/articles/smell-of-rain-kannauj-perfume-mitti-attar-india
30•bcaulfield•1d ago•6 comments

Men who stare at walls

https://www.alexselimov.com/posts/men_who_stare_at_walls/
654•aselimov3•1d ago•299 comments

An Update on GitHub Availability

https://github.blog/news-insights/company-news/an-update-on-github-availability/
187•salkahfi•4h ago•159 comments

Meetings are forcing functions

https://www.mooreds.com/wordpress/archives/3734
153•zdw•2d ago•91 comments

High Performance Git

https://gitperf.com/
190•gnabgib•13h ago•62 comments

Three men are facing charges in Toronto SMS Blaster arrests

https://www.tps.ca/media-centre/stories/unprecedented-sms-blaster-arrests/
185•gnabgib•17h ago•99 comments

Easyduino: Open Source PCB Devboards for KiCad

https://github.com/Hanqaqa/Easyduino
232•Hanqaqa•20h ago•40 comments

Networking changes coming in macOS 27

https://eclecticlight.co/2026/04/23/networking-changes-coming-in-macos-27/
244•pvtmert•22h ago•217 comments
Open in hackernews

Microsoft VibeVoice: Open-Source Frontier Voice AI

https://github.com/microsoft/VibeVoice
106•tosh•2h ago

Comments

CubsFan1060•1h ago
Great post last night from Simon: https://simonwillison.net/2026/Apr/27/vibevoice/
542458•1h ago
Note that this just covers the Speech-to-Text/Speech-Recognition aspect (a-la whisper), there's also models for long-form Text-To-Speech and steaming Text-To-Speech.
JumpCrisscross•54m ago
“VibeVoice can only handle up to an hour of audio”

Why?

podgietaru•1h ago
So we've really just settled on Vibe as the verb for AI then?
pryanshu89•1h ago
Why use precise technical language when you can just vibe with your AI system?
giarc•1h ago
I'd be willing to bet it will be "Word of the Year" for 2026. Merriam-Webster had 'slop' for 2025, and 'polarization' for 2024. Is there a prediction market for this?
internet_points•57m ago
it'll probably be something we're not even talking about yet - we still have 7 months in which to make the world even worse
embedding-shape•1h ago
Isn't this project the one Microsoft published but then soon after pulled it for security/safety reasons? What has changed since then?
542458•1h ago
Look at the "News" section in the readme - The original TTS model is gone from this repo (you can still find it other places), but the SST/ASR, long form TTS, and streaming TTS models are newer.
infecto•31m ago
It’s confusing (at least for me) because the project covers a number of things including what you are mentioning.
Barbing•5m ago
[off topic]

When explanations get posted directly in HN comments, I imagine someone somewhere in the world is able to learn in spite of their Internet restrictions/firewalls

People will also post their own interpretations in response to comments, and quickly find out they missed something.

… But if you try to automate it, like include a summary under every HN post, you encourage laziness too much and are pre-chewing too heavily. Some balance here.

[on topic]

(OK I’m done making excuses, time to read the article… thanks for the encouragement!)

I thought this was not explained in the readme directly but in fact I missed it. I wasn’t going to read Microsoft entire changelog! But it was substantive, thanks to sibling commenter:

“2025-09-05: VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have removed the VibeVoice-TTS code from this repository.”

walthamstow•1h ago
Seems quite heavy for a STT model, Parakeet and Whisper are much smaller and perform great for quick dictation and transcription of longer files. I guess that's due to additional accuracy and speaker diarisation?

The TTS example clip in the repo of 'spontaneous singing' is creepy as fuck

steinvakt2•1h ago
This is not a new model. Also, it hallucinates a lot. Also, it's very heavy and slow in inference. It's also bad in multilingual.

Edit: I'm talking purely about speech to text (STT). Not sure about the other things this can do.

lblock•1h ago
Yeah, I don't get why it is suddenly getting so much attention today, it is all over twitter too
ramon156•53m ago
well duh, they updated the news section

https://github.com/microsoft/VibeVoice/commit/e73d1e17c3754f...

which is microsoft for "we removed two dead links". AI innovation knows no limits!

SecretDreams•56m ago
I think this was all covered when they said it was released by Microsoft?
NobleLie•10m ago
The nuance is lost on LLM agentic dominant partakers.
Anonyneko•1h ago
You have selected Microsoft Sam as the computer's default voice.
accrual•39m ago
My friends and I had fun in the computer lab with Microsoft Sam, inputting long strings of characters to create funny sound effects. Sususususususu.
Void_•1h ago
I the past month or so, I added 2 models to my app Whisper Memos (https://whispermemos.com):

- Cohere Transcribe (self hosted)

- Grok Speech To Text (they provide an API, only $0.10/hr!)

They are both excellent. I'm not sure about this one. Would you like to see it in a consumer speech to text app?

olejorgenb•58m ago
I've had good experiences with the Mistral Voxtral models (I've used the API, but some of the model-variants are open weight)
2ndorderthought•57m ago
Have you tried qwen?
SecretDreams•55m ago
Any non-Musk alternatives that are comparable in quality and cost?
Void_•49m ago
Our default is still OpenAI Whisper. Grok is just a choice for users who might prefer it.
jayphen•17m ago
Voxtral competes on price ($0.003/min) and quality. Speechmatics has best in class accuracy but is a bit more expensive ($0.004/min)
Barbing•20m ago
Does Cohere work with longer transcripts? Do you have to do some magic to merge recordings over 35 seconds long?
maxloh•1h ago
I think we should stop calling this type of models open source. They are indeed "open weight." The training code is proprietary and never revealed.

https://github.com/microsoft/VibeVoice/issues/102

JumpCrisscross•58m ago
> we should stop calling this type of model open source. They are indeed "open weight”

This ship has sailed. It’s now in the same category as hacker/cracker and the pronunciation of GIF.

andy_ppp•56m ago
I think you mean GIF.
giancarlostoro•45m ago
It's the same as GIS, you wouldn't say jizz now would you?
notabotiswear•37m ago
I take it that you haven’t met the Arcgees people…
DoctorOW•30m ago
I absolutely do, every single time it comes up.
kevin_thibedeau•23m ago
The developer of the format declared the pronunciation 30+ years ago. It has always been jif.
Geezus_42•4m ago
Yeah, but society overruled them.
pardon_me•16m ago
How do you pronounce giraffe?
parineum•12m ago
How do you pronounce gift?
dijksterhuis•11m ago
i am absolutely going to from now on
WarmWash•30m ago
And "hallucination" which should have been "delusion".

Way early on (spring 2023) people tried to stop it, but no luck.

giancarlostoro•45m ago
I mean, you have "AI" which means just about anything in marketing speak, "Agentic" is kind of becoming similar, hopefully they don't goof that one too badly, would be nice to know what you are trying to sell me. Used to be "Cloud" meant storage not just hosting (I guess it still does).

Then there's "Smart" in front of Car, Phone, TV, and so on... Meaning different things.

I do think "Open Weight" should be more commonly used. There's definitely communities that spring up that build the training infrastructure and inference infrastructure around open models on the other hand.

notabotiswear•34m ago
Openwashing is the new greenwashing, which, coincidently, seems to have gone out of fashion a few hundred datacentres ago.
dist-epoch•26m ago
it was replaced with abundancewashing
Geezus_42•2m ago
What is "abundancewashing"?
jcmfernandes•19m ago
Indeed. We now live in a world where freeware is named open source. We are very sorry, Stallman.
MarsIronPI•2m ago
[delayed]
pluc•1h ago
Interesting story about this repo/product/author by cybersecurity researcher Kevin Beaumont: https://cyberplace.social/@GossiTheDog/116454846703138243
mistic92•1h ago
For me its giving me very poor results
JumpCrisscross•52m ago
What’s the current state of the art, for each of training locally and in the cloud, for learning my voice?
chrsw•42m ago
Local? No idea. Cloud? Eleven Labs, probably. But it's described as "cloning" not "training". Not sure what the distinction is or why it matters if the end result is you can to generate any TTS that sounds like you. There might very well be an important one, I just don't know it.
yreg•15m ago
Locally maybe https://voicebox.sh/

Elevenlabs in the cloud.

khimaros•7m ago
open weights i would say S2: https://github.com/rodrigomatta/s2.cpp
aqme28•51m ago
Interesting to see "vibe" enshrined by the likes of Microsoft as an AI product word.
accrual•41m ago
Especially when "vibe coded" can have a negative connotation meaning quickly put together without understanding.
Barbing•17m ago
I’m just surprised they put the name of the e-waste slop company in their product
altmanaltman•28m ago
Which makes it even more weird they get offended when people use Mircoslop. They are the ones leaning into the marketing
BlastBash192•50m ago
Maybe Microsoft’s real strength was never making the best model, it was knowing you don’t need to, as long as you own the platform everyone builds on.
ryukoposting•31m ago
Holy moly, a Microsoft AI product that isn't named Copilot!
DoctorOW•29m ago
Missed opportunity to call it Vopilot
frangonf•23m ago
I took a look into local options for ASR and diarization some months ago, I missed that VibeVoice now has this feature.

My conclusions back then (which only came from a shallow research on the topic and 0 real experience mind you) was that Whisper + Pyannote was the "stable" approach.

Have the VibeVoice, Voxtral, Qwen or the Nemo solutions caught up in segmentation and speaker recognition?

khimaros•9m ago
looks like this offers ASR support in GGUF https://github.com/CrispStrobe/CrispASR -- haven't tested
chaosprint•6m ago
Microsoft Store App Vibing.exe Accused of Harvesting Screens, Audio, and Clipboard Data:

https://cyberpress.org/microsoft-store-app-vibing-exe-accuse...

ChrisArchitect•2m ago
Previously:

Sept 2025 https://news.ycombinator.com/item?id=45114245