frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Boost Your Voice AI Agents with Open-Source Ten VAD

https://github.com/TEN-framework/ten-vad
8•Jingyi0321•13h ago
Voice Activity Detection (VAD) is a crucial component for Voice AI, enabling more natural and efficient interactions. TEN VAD is an open-source solution designed to supercharge your Voice AI Agents with lightning-fast, human-like conversations! TEN VAD offers some key advantages:

ONNX Support: Deploy on virtually any platform or hardware architecture! This means greater flexibility and easier integration into your existing systems. Superior Detection Accuracy: Experience noticeable improvements in voice detection, leading to fewer errors and more reliable performance. Smaller & Faster: Enjoy a 32% reduction in Real-Time Factor (RTF) and an 86% size reduction compared to Silero VAD! This translates to lower resource consumption and faster processing.

Get the code: https://github.com/ten-framework/ten-vad

Comments

JuneWW•13h ago
This looks really promising! VAD is such a critical piece of the puzzle for voice AI. Definitely going to check this out. Thanks for sharing!
fm100•13h ago
What are the differences between TEN VAD and WebRTC VAD?
Jingyi0321•11h ago
In general, WebRTC VAD uses pitch information for VAD. Note that pitch only appears in voiced speech, but not in unvoiced speech. With this characteristic, WebRTC VAD may fails in detecting the start of a word, losing the unvoiced start, which will then result in e.g. increased WER in ASR system. On the other hand, noise whose spectrum is similar to voiced speech, e.g. music, may be extracted a non-zero pitch by WebRTC VAD pitch detection system.

Our model incorporates fbank and the pitch information together, and can analyse the input pattern deeply, therefore has better performance than WebRTC VAD.

rambo11•13h ago
Thanks for sharing this awesome VAD model. A high-performance and low latency VAD is very helpfull in Conversation AI agents.
strassenbahn•13h ago
It seems the performance is much better than the existing VAD SOTA Silero VAD and the size is much smaller. Good to see this new SOTA VAD model!

Final report on Alaska Airlines Flight 1282 in-flight exit door plug separation

https://www.ntsb.gov:443/investigations/Pages/DCA24MA063.aspx
1•starkparker•52s ago•0 comments

Google unveils MedGemma, an open-source AI model suite for medical applications

https://the-decoder.com/google-unveils-medgemma-an-open-source-ai-model-suite-for-medical-applications/
1•alwillis•1m ago•0 comments

Apple's Bluetooth trust persists after cryptographic failure (iOS 18.5)

https://substack.com/home/post/p-168022064
1•FluGameAce007•3m ago•1 comments

Show HN: I built a laurel wreath generator

https://laurelwreathgenerator.com
1•seuyu_bin•3m ago•0 comments

Generating Zero-Knowledge Proofs in Sublinear Space

https://www.researchgate.net/publication/393569456_Zero-Knowledge_Proofs_in_Sublinear_Space
1•logannyeMD•3m ago•0 comments

Android's Canary Channel Is a New Way to Test Upcoming Updates

https://www.howtogeek.com/android-has-a-new-canary-channel/
1•Bluestein•4m ago•0 comments

How to scale RL to 10^26 FLOPs

https://blog.jxmo.io/p/how-to-scale-rl-to-1026-flops
1•jxmorris12•5m ago•0 comments

Meta Poached Apple's Pang with Pay Package over $200M

https://www.bloomberg.com/news/articles/2025-07-09/meta-poached-apple-s-pang-with-pay-package-over-200-million
2•SG-•5m ago•1 comments

The Quest to Reinvent Anesthesia

https://medicalxpress.com/news/2025-06-quest-reinvent-anesthesia.html
1•PaulHoule•7m ago•0 comments

Context engineering with DSPy (13min video)

https://www.youtube.com/watch?v=1I9PoXzvWcs
2•jeffchuber•9m ago•0 comments

The Many Faces of Themeable Design Systems

https://bradfrost.com/blog/post/the-many-faces-of-themeable-design-systems/
1•brianzelip•9m ago•1 comments

Infiltrating a Soviet Particle Accelerator

https://www.youtube.com/watch?v=L5QHeoVbug4
1•lazysheepherd•9m ago•0 comments

Why We're Moving Beyond "Misinformation" and "Disinformation"

https://www.newsguardrealitycheck.com/p/commentary-why-were-moving-beyond
1•emschwartz•13m ago•0 comments

Poll: Where do you store your personal code?

2•akulbe•13m ago•3 comments

Bitwarden MCP Server

https://github.com/bitwarden/mcp-server
1•gnabgib•16m ago•0 comments

Chris Foss: The Joy of Starships (2011)

https://web.archive.org/web/20110927010203/http://www.newscientist.com/blogs/culturelab/2011/09/chris-foss-the-joy-of.html
2•Michelangelo11•17m ago•0 comments

Grok 4: Breaking Down XAI's Leap

https://hjortur.substack.com/p/grok-4-breaking-down-xais-leap
1•hjortureh•18m ago•0 comments

Video game industry agrees to AI restrictions in new labor contract

https://qz.com/video-game-industry-ai-restrictions
1•Bluestein•18m ago•0 comments

Could Psilocybin Be the Magical Ingredient for a Longer Life?

https://studyfinds.org/scientists-discover-psilocybin-may-be-the-magical-ingredient-for-a-longer-life/
1•domofutu•20m ago•1 comments

Microdefinitions

https://mailchi.mp/018fe21e4f18/gems-10335584?e=99dc217640
1•dougb5•20m ago•0 comments

Comet Browser Is Just 2 Chrome Extensions?

2•thecautiousdan•21m ago•1 comments

Show HN: Ownyourpixel.fun – Buy Pixels and Resell Them Like Digital Real Estate

https://www.ownyourpixel.fun/
2•mandarwagh•23m ago•0 comments

What Happened to All the Human Bird Flu Cases?

https://undark.org/2025/07/10/opinion-bird-flu-emergency-end/
1•EA-3167•23m ago•0 comments

Britain is cheap, and should learn to love it

https://www.economist.com/leaders/2025/07/10/britain-is-cheap-and-should-learn-to-love-it
1•jxmorris12•27m ago•0 comments

Why, Why, Why, Eliza?

https://www.learningfromexamples.com/p/why-why-why-eliza
11•silt•34m ago•0 comments

Detroit and Baltimore Built Local State Capacity to Bring Crime to New Lows

https://www.governance.fyi/p/two-failed-cities-detroit-and-baltimore
2•guardianbob•35m ago•1 comments

The 800k Hours Career Guide

https://www.boristhebrave.com/2025/07/10/the-800000-hours-career-guide/
3•ibobev•38m ago•0 comments

Icechunk 1.0: Production-Grade Cloud-Native Array Storage Is Here

https://earthmover.io/blog/icechunk-1-0-production-grade-cloud-native-array-storage-is-here
1•rhodysurf•39m ago•0 comments

A Game About Typing the Alphabet

https://www.agameabouttypingthealphabet.com
1•braska•40m ago•1 comments

'Intelligent' copper tariffs will 'wake people up', says mining billionaire

https://www.ft.com/content/bcd9b72d-4ebb-4d07-b20e-3a2e2c014f41
1•petethomas•43m ago•0 comments