frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Microsoft releases VibeVoice, generates 90-minute, 4-speaker audio

https://microsoft.github.io/VibeVoice/
1•watsonmusic•2h ago

Comments

watsonmusic•2h ago
VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and natural turn-taking. A core innovation of VibeVoice is its use of continuous speech tokenizers (Acoustic and Semantic) operating at an ultra-low frame rate of 7.5 Hz. These tokenizers efficiently preserve audio fidelity while significantly boosting computational efficiency for processing long sequences. VibeVoice employs a next-token diffusion framework, leveraging a Large Language Model (LLM) to understand textual context and dialogue flow, and a diffusion head to generate high-fidelity acoustic details. The model can synthesize speech up to 90 minutes long with up to 4 distinct speakers, surpassing the typical 1-2 speaker limits of many prior models.
watsonmusic•2h ago
https://huggingface.co/microsoft/VibeVoice-1.5B
watsonmusic•2h ago
https://github.com/microsoft/VibeVoice

LibreOffice 25.8 in Windows 7 x64 ESU environment

https://trackerninja.codeberg.page/post/complete-guide-on-how-to-run-libre-office-version-25-8-in...
1•spacedrone808•1m ago•0 comments

Trump Media, Crypto.com to launch crypto treasury firm

https://www.reuters.com/legal/government/trump-media-cryptocom-launch-crypto-treasury-firm-via-sp...
1•geox•2m ago•0 comments

Security Flaws in the WebMonetization Site

https://shkspr.mobi/blog/2025/08/security-flaws-in-the-webmonetization-site/
1•edent•2m ago•0 comments

The Coding Agent Metagame

https://calv.info/coding-agent-metagame
1•caust1c•2m ago•0 comments

Windows 7 x64 Extended Support Page

https://trackerninja.codeberg.page/post/windows-7-power-256-threads-192-gb-ram
1•spacedrone808•3m ago•0 comments

Show HN: A component-first approach to internationalization

https://github.com/aymericzip/intlayer
1•MarineCG40•3m ago•0 comments

Going Viral with Product-Led "Visual Wow" Moments

https://iamcharliegraham.substack.com/p/part-3-going-viral-with-product-led
1•tylerg•4m ago•0 comments

Targeted Wearout Attacks in Microprocessor Cores

https://arxiv.org/abs/2508.16868
2•bikenaga•5m ago•0 comments

When people do you wrong

https://www.jasonfeifer.com/when-people-do-you-wrong/
1•keepamovin•6m ago•0 comments

Scientists Create Molecule That Stores Energy Like Plants Do

https://thedebrief.org/artificial-photosynthesis-breakthrough-scientists-create-molecule-that-sto...
2•LAsteNERD•6m ago•0 comments

My mail notifier avoids interrupting me (2010)

https://utcc.utoronto.ca/~cks/space/blog/tech/AvoidNotifierInterrupts
2•todsacerdoti•7m ago•0 comments

Framework Laptop 16 update brings Nvidia GeForce to the modular gaming laptop

https://arstechnica.com/gadgets/2025/08/framework-laptop-16-update-brings-nvidia-geforce-to-the-m...
1•rntn•8m ago•0 comments

Show HN: Bringing Back Spectrum Visualizers in Stereo Systems

https://www.hackster.io/sylwekkominek/fully-configurable-open-source-audio-spectrum-analyzer-0e2b8b
1•sylwekkominek•8m ago•0 comments

Show HN: A zoomable, searchable archive of BYTE magazine

https://byte.tsundoku.io
1•chromy•9m ago•0 comments

Show HN: Legal Eyes – Turn casual text into legalese with one click

https://www.legal-eyes.ai/
1•jamsey•11m ago•0 comments

You need a kitchen slide rule

https://entropicthoughts.com/kitchen-slide-rule
2•kqr•11m ago•1 comments

FastAPI Cloud

https://fastapicloud.com/
1•shahargl•11m ago•0 comments

I reverse-engineered a bug in my PPO agent that gave it a 9x performance boost

https://theprincipledagent.com/2025/08/26/forensic-rl-investigating-a-surprisingly-successful-bug...
1•wmaxlees•11m ago•1 comments

Compress10MB – Online Video Compressor

https://compress10mb.com/
1•hordekle•12m ago•0 comments

How to run LLMs on PC at home using Llama.cpp

https://www.theregister.com/2025/08/24/llama_cpp_hands_on/
1•ibobev•13m ago•0 comments

What's Next for Kotlin Multiplatform and Compose Multiplatform

https://blog.jetbrains.com/kotlin/2025/08/kmp-roadmap-aug-2025/
1•iBelieve•14m ago•0 comments

Yes, AI is affecting employment. Here's the data

https://www.adpresearch.com/yes-ai-is-affecting-employment-heres-the-data/
2•toomuchtodo•15m ago•1 comments

GPT5 is the best coding LLM because other LLMs admit it?

1•adinhitlore•16m ago•0 comments

Grok 2.5 has not been open-sourced

https://www.zdnet.com/article/no-grok-2-5-has-not-been-open-sourced-heres-how-you-can-tell/
1•CrankyBear•17m ago•0 comments

Wan-S2V: Audio-Driven Cinematic Video Generation

https://humanaigc.github.io/wan-s2v-webpage/
1•diggan•17m ago•0 comments

Learning Deep Representations of Data Distributions

https://ma-lab-berkeley.github.io/deep-representation-learning-book/
1•seanlane•18m ago•0 comments

Silicon Valley is pouring millions into pro-AI PACs to sway midterms

https://techcrunch.com/2025/08/25/silicon-valley-is-pouring-millions-into-pro-ai-pacs-to-sway-mid...
2•sailfast•18m ago•0 comments

CDC scaled back a surveillance program for foodborne illnesses

https://www.nbcnews.com/health/health-news/cdc-quietly-scaled-back-surveillance-program-foodborne...
2•pseudolus•18m ago•0 comments

Multiplayer Word Game in the Browser

https://royale.circuitsgame.com/
1•samanbb•19m ago•1 comments

Show HN: I Built a Privacy First Clipboard History Manager for macOS

1•ajmayafi•19m ago•0 comments