frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Omnilingual ASR: Advancing automatic speech recognition for 1600 languages

https://ai.meta.com/blog/omnilingual-asr-advancing-automatic-speech-recognition/?_fb_noscript=1
47•jean-•4h ago
HF Demo: https://huggingface.co/spaces/facebook/omniasr-transcription...

GitHub: https://github.com/facebookresearch/omnilingual-asr

Comments

meetpateltech•4h ago
HF Demo: https://huggingface.co/spaces/facebook/omniasr-transcription...

GitHub: https://github.com/facebookresearch/omnilingual-asr

dang•2h ago
Thanks! I've added those links to the toptext as well.
tschellenbach•2h ago
any insights on latency?
samat•1h ago
How hard is it to make TTS out of this? A few independent journalists from Belarus asked for TTS in their language, but I am no expert, was thinking about re-using Mozilla's work. What's the easiest way to get working TTS for a language?
kulahan•1h ago
From TFA, it says that it’s extremely easy to add new languages with just a few examples. I didn’t see specifics on how “few” it really is, though.
nl•20m ago
This is ASR not TTS though.
woodson•13m ago
You can use the OmniASR SSL models instead of their older MMS models to create TTS models: https://github.com/ylacombe/finetune-hf-vits
stuffoverflow•1h ago
This seems like a massive improvement for openly available local ASR. Even the 300M model outperforms whisper-large-v3 according to the paper's benchmarks.
lostmsu•4m ago
[delayed]
AIorNot•23m ago
the global language explorer is fascinating -great work guys

https://aidemos.atmeta.com/omnilingualasr/language-globe

- we are getting closer to BabelFish.. at least for the Earth!

cadamsdotcom•23m ago
Only a few gb of weights will recognize speech in 1600+ languages.

Freely downloadable and usable by anyone for almost anything.

We truly live in the future.

Fei Fei Li: Spatial Intelligence is AI’s Next Frontier

https://drfeifei.substack.com/p/from-words-to-worlds-spatial-intelligence
58•mkirchner•1h ago•32 comments

Unexpected things that are people

https://bengoldhaber.substack.com/p/unexpected-things-that-are-people
367•lindowe•6h ago•186 comments

Writing your own BEAM

https://martin.janiczek.cz/2025/11/09/writing-your-own-beam.html
81•cbzbc•1d ago•11 comments

The lazy Git UI you didn't know you need

https://www.bwplotka.dev/2025/lazygit/
157•linhns•4h ago•55 comments

TTS Still Sucks

https://duarteocarmo.com/blog/tts-still-sucks
21•speckx•1h ago•28 comments

High-performance 2D graphics rendering on the CPU using sparse strips [pdf]

https://github.com/LaurenzV/master-thesis/blob/main/main.pdf
10•PaulHoule•40m ago•0 comments

Zeroing in on Zero-Point Motion Inside a Crystal

https://physics.aps.org/articles/v18/178
15•lc0_stein•1h ago•0 comments

Using Generative AI in Content Production

https://partnerhelp.netflixstudios.com/hc/en-us/articles/43393929218323-Using-Generative-AI-in-Co...
48•CaRDiaK•3h ago•21 comments

Error ABI

https://matklad.github.io/2025/11/09/error-ABI.html
48•todsacerdoti•20h ago•9 comments

Memory Safety for Skeptics

https://queue.acm.org/detail.cfm?id=3773095
42•steveklabnik•4h ago•27 comments

Registered OAuth Parameters

https://www.iana.org/assignments/oauth-parameters/oauth-parameters.xhtml#parameters
22•mooreds•6d ago•3 comments

Linux in a Pixel Shader – A RISC-V Emulator for VRChat

https://blog.pimaker.at/texts/rvc1/
12•rbanffy•55m ago•3 comments

Omnilingual ASR: Advancing automatic speech recognition for 1600 languages

https://ai.meta.com/blog/omnilingual-asr-advancing-automatic-speech-recognition/?_fb_noscript=1
48•jean-•4h ago•10 comments

Unix v4 Tape Found

https://discuss.systems/@ricci/115504720054699983
53•greatquux•4d ago•4 comments

Head in the Zed Cloud

https://maxdeviant.com/posts/2025/head-in-the-zed-cloud/
43•todsacerdoti•8h ago•8 comments

Benchmarking leading AI agents against Google reCAPTCHA v2

https://research.roundtable.ai/captcha-benchmarking/
80•mdahardy•6h ago•60 comments

Building a high-performance ticketing system with TigerBeetle

https://renerocks.ai/blog/2025-11-02--tigerfans/
56•jorangreef•2d ago•8 comments

Launch HN: Hypercubic (YC F25) – AI for COBOL and Mainframes

63•sai18•6h ago•42 comments

Dependent Types and How to Get Rid of Them

https://chadnauseam.com/coding/pltd/are-dependent-types-actually-erased
8•pie_flavor•1w ago•0 comments

Synesthesia helps me find four-leaf clovers (2023)

https://matthewjamestaylor.com/synesthesia-four-leaf-clovers
53•iansteyn•1w ago•36 comments

3D Heterogeneous Integration Powers New DARPA Fab

https://spectrum.ieee.org/3d-heterogeneous-integration
3•rbanffy•40m ago•0 comments

Canadian military will rely on public servants to boost its ranks by 300k

https://ottawacitizen.com/public-service/defence-watch/canadian-military-public-servants
62•Teever•5h ago•138 comments

Redmond, WA, turns off Flock Safety cameras after ICE arrests

https://www.seattletimes.com/seattle-news/law-justice/redmond-turns-off-flock-safety-cameras-afte...
197•dredmorbius•4h ago•191 comments

Pose Animator – An open source tool to bring SVG characters to life (2020)

https://blog.tensorflow.org/2020/05/pose-animator-open-source-tool-to-bring-svg-characters-to-lif...
126•jerlendds•6d ago•13 comments

Interesting SPI Routing with iCE40 FPGAs

https://danielmangum.com/posts/spi-routing-ice40-fpga/
86•hasheddan•9h ago•6 comments

LLMs are steroids for your Dunning-Kruger

https://bytesauna.com/post/dunning-kruger
272•gridentio•7h ago•222 comments

Asus Ascent GX10

https://www.asus.com/networking-iot-servers/desktop-ai-supercomputer/ultra-small-ai-supercomputer...
178•jimexp69•6h ago•166 comments

Cybersecurity breach at Congressional Budget Office remains a live threat

https://www.politico.com/live-updates/2025/11/10/congress/cbo-still-under-threat-00644930
12•mooreds•42m ago•0 comments

How cops can get your private online data

https://www.eff.org/deeplinks/2025/06/how-cops-can-get-your-private-online-data
231•jamesgill•6h ago•51 comments

Sysgpu – Experimental descendant of WebGPU written in Zig

https://github.com/hexops-graveyard/mach-sysgpu
3•coffeeaddict1•1h ago•0 comments