Fastmail Donates USD 10k to the Perl and Raku Foundation

https://www.perl.com/article/fastmail-donates-usd-10-000-to-the-perl-and-raku-foundation/
43•oalders•25m ago•5 comments

Voxtral Transcribe 2

https://mistral.ai/news/voxtral-transcribe-2
199•meetpateltech•2h ago•46 comments

Attention at Constant Cost per Token via Symmetry-Aware Taylor Approximation

https://arxiv.org/abs/2602.00294
87•fheinsen•3h ago•43 comments

A sane but bull case on Clawdbot / OpenClaw

https://brandon.wang/2026/clawdbot
141•brdd•1d ago•238 comments

Tractor

https://incoherency.co.uk/blog/stories/tractor.html
46•surprisetalk•20h ago•14 comments

Study: emotional support from social media found to reduce anxiety

https://news.uark.edu/articles/80669/emotional-support-from-social-media-found-to-reduce-anxiety
4•giuliomagnifico•19m ago•0 comments

Converge (YC S23) Is Hiring Product Engineers (NYC, In-Person)

https://www.runconverge.com/careers/product-engineer
1•thomashlvt•34m ago

Procedures for Repair of Potholes in Asphalt-Surfaced Pavements

https://highways.dot.gov/media/7941
24•treebrained•3d ago•20 comments

A case study in PDF forensics: The Epstein PDFs

https://pdfa.org/a-case-study-in-pdf-forensics-the-epstein-pdfs/
131•DuffJohnson•2h ago•54 comments

Data centers in space makes no sense

https://civai.org/blog/space-data-centers
944•ajyoon•21h ago•1072 comments

Guinea worm on track to be 2nd eradicated human disease; only 10 cases in 2025

https://arstechnica.com/health/2026/02/guinea-worm-on-track-to-be-2nd-eradicated-human-disease-on...
117•bookofjoe•3h ago•47 comments

Coding Agent VMs on NixOS with Microvm.nix

https://michael.stapelberg.ch/posts/2026-02-01-coding-agent-microvm-nix/
34•secure•3d ago•13 comments

Lessons learned shipping 500 units of my first hardware product

https://www.simonberens.com/p/lessons-learned-shipping-500-units
747•sberens•2d ago•357 comments

Old Insurance Maps – Georeferencing Sanborn Fire Insurance Maps on Modern Maps

https://oldinsurancemaps.net/
49•lapetitejort•1w ago•12 comments

FBI couldn't get into WaPo reporter's iPhone because Lockdown Mode enabled

https://www.404media.co/fbi-couldnt-get-into-wapo-reporters-iphone-because-it-had-lockdown-mode-e...
366•robin_reala•3h ago•293 comments

Show HN: Ghidra MCP Server – 110 tools for AI-assisted reverse engineering

https://github.com/bethington/ghidra-mcp
213•xerzes•10h ago•56 comments

The Voxel Is a Cutting-Edge Theater Experiment

https://bmoreart.com/2024/09/the-voxel-is-a-cutting-edge-theater-experiment.html
8•simonw•5d ago•2 comments

Brazilian Micro-SaaS Map

https://saas-map.ssr.trapiche.cloud/
73•acfilho•3d ago•3 comments

Show HN: Craftplan – I built my wife a production management tool for her bakery

https://github.com/puemos/craftplan
485•deofoo•3d ago•146 comments

I miss thinking hard

https://www.jernesto.com/articles/thinking_hard
1073•jernestomg•13h ago•593 comments

Microsoft's Pivotal AI Product Is Running into Big Problems

https://www.wsj.com/tech/ai/microsofts-pivotal-ai-product-is-running-into-big-problems-ce235b28
33•fortran77•1h ago•22 comments

New York’s budget bill would require “blocking technology” on all 3D printers

https://blog.adafruit.com/2026/02/03/new-york-wants-to-ctrlaltdelete-your-3d-printer/
604•ptorrone•1d ago•704 comments

Launching the Rural Guaranteed Minimum Income Initiative

https://blog.codinghorror.com/launching-the-rural-guaranteed-minimum-income-initiative/
9•d4ft•34m ago•6 comments

French streamer unbanked by Qonto after criticizing Palantir and Peter Thiel

https://twitter.com/Ced_haurus/status/2018716889191498172
9•hocuspocus•26m ago•1 comments

Deno Sandbox

https://deno.com/blog/introducing-deno-sandbox
505•johnspurlock•1d ago•153 comments

Thatcher Effect – Optical Illusion and Explanation

https://optical.toys/thatcher-effect/
40•robin_reala•3h ago•13 comments

High-Altitude Adventure with a DIY Pico Balloon

https://spectrum.ieee.org/explore-stratosphere-diy-pico-balloon
90•jnord•3d ago•45 comments

Agent Skills

https://agentskills.io/home
506•mooreds•1d ago•244 comments

Broken Proofs and Broken Provers

https://lawrencecpaulson.github.io/2026/01/15/Broken_proofs.html
50•RebelPotato•8h ago•9 comments

Claude Is a Space to Think

https://www.anthropic.com/news/claude-is-a-space-to-think
46•meetpateltech•5h ago•12 comments

Voxtral Transcribe 2

https://mistral.ai/news/voxtral-transcribe-2
199•meetpateltech•2h ago

Comments

observationist•1h ago
Native diarization, this looks exciting. edit: or not, no diarization in real-time.

https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-26...

~9GB model.

coder543•1h ago
The diarization is on Voxtral Mini Transcribe V2, not Voxtral Mini 4B.
observationist•1h ago
Ahh, yeah, and it's explicitly not working for realtime streams. Good catch!
sbrother•41m ago
Do you have experience with that model for diarization? Does it feel accurate, and what's its realtime factor on a typical GPU? Diarization has been the biggest thorn in my side for a long time..
coder543•13m ago
> Do you have experience with that model

No, I just heard about it this morning.

serf•1h ago
things I hate:

"Click me to try now!" banners that lead to a warning screen that says "Oh, only paying members, whoops!"

So, you don't mean 'try this out', you mean 'buy this product'.

Let's not act like it's a free sampler.

I can't comment on the model: I'm not giving them money.

ReadEvalPost•1h ago
You can try it on HF: https://huggingface.co/spaces/mistralai/Voxtral-Mini-Realtim...
boobsbr•1h ago
I'm impressed.
mdrzn•1h ago
There's no comparison to Whisper Large v3 or other Whisper models..

Is it better? Worse? Why do they only compare to gpt4o mini transcribe?

GaggiX•1h ago
Gpt4o mini transcribe is better and actually realtime. Whisper is trained to encode the entire audio (or at least 30s chunks) and then decode it.
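To make the chunking point concrete, here is a minimal sketch of what fixed-window processing implies. It assumes 16 kHz audio and a 30-second window with zero-padding on the last chunk; the helper names `split_into_chunks` and `first_output_delay` are hypothetical and not part of any library, and the delay estimate ignores compute time entirely.

```python
from typing import List, Sequence

SAMPLE_RATE = 16_000   # Hz; assumed input rate for this sketch
CHUNK_SECONDS = 30     # fixed window length mentioned in the comment above


def split_into_chunks(samples: Sequence[float],
                      sample_rate: int = SAMPLE_RATE,
                      chunk_seconds: int = CHUNK_SECONDS) -> List[List[float]]:
    """Split audio into fixed-size windows, zero-padding the final one."""
    size = sample_rate * chunk_seconds
    chunks = []
    for start in range(0, len(samples), size):
        chunk = list(samples[start:start + size])
        chunk.extend([0.0] * (size - len(chunk)))  # pad a short final window
        chunks.append(chunk)
    return chunks


def first_output_delay(audio_seconds: float,
                       chunk_seconds: int = CHUNK_SECONDS) -> float:
    """Earliest time (in seconds) a chunked model can begin decoding.

    It has to wait for a full window, or for the audio to end if that
    happens sooner. Compute time is ignored.
    """
    return min(audio_seconds, chunk_seconds)
```

For a 45-second recording this yields two windows, and no text can appear before the 30-second mark, which is the gap a streaming model is meant to close.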
emmettm•1h ago
The linked article claims the average word error rate for Voxtral mini v2 is lower than GPT-4o mini transcribe
GaggiX•1h ago
Gpt4o mini transcribe is better than whisper, the context is the parent comment.
mdrzn•1h ago
So "gpt4o mini transcribe" is not just whisper v3 under the hood? Btw it's $0.006 / minute

For Whisper API online (with v3 large) I've found "$0.00125 per compute second", which is the absolute cheapest I've ever found.

GaggiX•1h ago
>So it's not just whisper v3 under the hood?

Why should it be Whisper v3? They even released an open model: https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-26...

tekacs•1h ago
WER is slightly misleading, but Whisper Large v3 WER is classically around 10%, I think, and 12% with Turbo.

The thing that makes it particularly misleading is that models that do transcription to lowercase and then use inverse text normalization to restore structure and grammar end up making a very different class of mistakes than Whisper, which goes directly to final form text including punctuation and quotes and tone.

But nonetheless, they're claiming such a lower error rate than Whisper that it's almost not in the same bucket.
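A small sketch of why normalization matters when comparing WER figures. This is the textbook definition (word-level edit distance divided by reference length), not the scoring pipeline used by any vendor; the `normalize` helper is a deliberately crude stand-in for real text normalization.

```python
import re
from typing import List


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance between reference and hypothesis."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: edits needed divided by reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


def normalize(text: str) -> str:
    """Crude normalization: lowercase, drop punctuation except apostrophes."""
    return re.sub(r"[^\w\s']", "", text.lower())


reference = "Hello, world. It's fine."
hypothesis = "hello world it's fine"

raw_score = wer(reference, hypothesis)
normalized_score = wer(normalize(reference), normalize(hypothesis))
```

Scored raw, every word in this example counts as an error because of casing and punctuation; scored after normalization, the same transcript has no errors. Two published WER numbers are only comparable if both were scored under the same normalization.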

tekacs•1h ago
On the topic of things being misleading, GPT-4o transcriber is a very _different_ transcriber to Whisper. I would say not better or worse, despite characterizations as such. So it is a little difficult to compare on just the numbers.

There's a reason that quite a lot of good transcribers still use V2, not V3.

satvikpendem•54m ago
Different how?
dmix•1h ago
> At approximately 4% word error rate on FLEURS and $0.003/min

Amazon's transcription service is $0.024 per minute, a pretty big difference: https://aws.amazon.com/transcribe/pricing/

mdrzn•1h ago
Is it 0.003 per minute of audio uploaded, or "compute minute"?

For example, fal.ai has a Whisper API endpoint priced at "$0.00125 per compute second", which (at 10-25x realtime) is MUCH cheaper than all the competitors.
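The two pricing units can be put on the same footing with a little arithmetic. The sketch below takes the prices exactly as quoted in this thread (they are not verified against any vendor's price list) and treats the realtime factor as the only unknown; the function and constant names are made up for illustration.

```python
from typing import Dict, Iterable

# Per-audio-minute prices as quoted in this thread (unverified).
QUOTED_PER_AUDIO_MINUTE: Dict[str, float] = {
    "Voxtral": 0.003,
    "GPT-4o mini transcribe": 0.006,
    "Amazon Transcribe": 0.024,
}

# Per-compute-second price as quoted in this thread (unverified).
QUOTED_PER_COMPUTE_SECOND = 0.00125


def cost_per_audio_minute(price_per_compute_second: float,
                          realtime_factor: float) -> float:
    """Convert compute-second pricing into a price per minute of audio.

    A realtime factor of 10 means 60 s of audio takes 6 s of compute.
    """
    if realtime_factor <= 0:
        raise ValueError("realtime_factor must be positive")
    compute_seconds = 60.0 / realtime_factor
    return price_per_compute_second * compute_seconds


def comparison(factors: Iterable[float] = (10, 25)) -> Dict[str, float]:
    """Return all options expressed as dollars per audio minute."""
    rows = dict(QUOTED_PER_AUDIO_MINUTE)
    for factor in factors:
        label = f"compute-second pricing at {factor}x realtime"
        rows[label] = cost_per_audio_minute(QUOTED_PER_COMPUTE_SECOND, factor)
    return rows
```

With the quoted figures, compute-second pricing works out to roughly $0.0075 per audio minute at 10x realtime and roughly $0.003 at 25x, so how it compares to per-minute pricing depends heavily on the realtime factor actually achieved.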

Oras•1h ago
I think the point is having it for real-time; this is for conversations rather than transcribing audio files.
Archelaos•1h ago
As a rule of thumb for software that I use regularly, I find it very useful to consider the costs over a 10-year period in order to compare it with software that I purchase once, with a lifetime license, to install at home. That comes to $1,798.80 for the Pro version.

What estimates do others use?
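The 10-year rule of thumb is easy to parameterize. In this sketch the only figure taken from the comment is the $1,798.80 total; the monthly price is back-derived from it, and the one-time purchase price in the break-even example is purely hypothetical.

```python
import math


def total_cost(monthly_price: float, years: int) -> float:
    """Total subscription spend over the given number of years."""
    return round(monthly_price * 12 * years, 2)


def implied_monthly_price(total: float, years: int) -> float:
    """Monthly price implied by a total spend over the given years."""
    return round(total / (12 * years), 2)


def break_even_months(one_time_price: float, monthly_price: float) -> int:
    """First month in which cumulative subscription cost reaches
    the one-time purchase price."""
    return math.ceil(one_time_price / monthly_price)


# Figure from the comment above: 10 years at the Pro tier.
monthly = implied_monthly_price(1798.80, years=10)

# Hypothetical one-time license price, chosen only for illustration.
months = break_even_months(one_time_price=300.0, monthly_price=monthly)
```

The same three functions work for any horizon, so a 5-year or 3-year estimate is a one-argument change.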

antirez•1h ago
Italian is, I believe, the most phonetically advanced human language. It strikes the right compromise among information density, understandability, and the ability to speak much faster to compensate for the redundancy. It's as if it had error correction built in. Note that it not only has the lowest error rate, it is also underrepresented in most datasets.
Archelaos•59m ago
This is largely due to the fact that modern Italian is a systematised language that emerged from a literary movement (whose most prominent representative is Alessandro Manzoni) to establish a uniform language for the Italian people. At the time of Italian unification in 1861, only about 2.5% of the population could speak this language.
gbalduzzi•44m ago
The language itself was not invented for the purpose: it was the language spoken in Florence, then adopted by the literary movement and then selected as the national language.

It seems like the best tradeoff between information density and understandability actually comes from the deep Latin roots of the language.

gbalduzzi•50m ago
I was honestly surprised to find it in first place, because I assumed English would be first given the simpler grammar and the huge dataset available.

I agree with your belief, other languages have either lower density (e.g. German) or lower understandability (e.g. English)

riffraff•37m ago
English has a ton of homophones, way more sounds that differ only slightly (long/short vowels), and major pronunciation differences across the major "official" variants (think Australia/US/Canada/UK).

Italian has one official Italian (two, if you count IT_ch, but the difference is minor), doesn't pay much attention to stress and vowel length, and only has a few "confusable" sounds (gl/l, gn/n, double consonants, stuff you get wrong in primary school). Italian dialects would be a disaster tho :)

NewsaHackO•46m ago
The only knowledge I have about how difficult Italian is comes from Inglourious Basterds.
mmooss•9m ago
At least some relatively well-known research finds that all languages have similar information density in terms of bits/second (~39 bits/second based on a quick search). Languages do it with different amounts of phonetic sound / syllables / words per bit and per second, but the bps comes out the same.

I don't know how widely accepted that conclusion is, what exceptions there may be, etc.

simonw•1h ago
This demo is really impressive: https://huggingface.co/spaces/mistralai/Voxtral-Mini-Realtim...

Don't be confused if it says "no microphone", the moment you click the record button it will request browser permission and then start working.

I spoke fast and dropped in some jargon and it got it all right - I said this and it transcribed it exactly right, WebAssembly spelling included:

> Can you tell me about RSS and Atom and the role of CSP headers in browser security, especially if you're using WebAssembly?

Oras•1h ago
Thank you for the link! Their playground in Mistral does not have a microphone. It just uploads files, which does not demonstrate the speed and accuracy, but the link you shared does.

I tried speaking in 2 languages at once, and it picked it up correctly. Truly impressive for real-time.

tekacs•53m ago
Having built with and tried every voice model over the last three years, real time and non-real time... this is off the charts compared to anything I've seen before.

And open weight too! So grateful for this.

daemonologist•47m ago
404 on https://mistralai-voxtral-mini-realtime.hf.space/gradio_api/... for me (which shows up in the UI as a little red error in the top right).
jaggederest•32m ago
It can transcribe Eminem's Rap God fast sequence, really, really impressive.
rafram•3m ago
That's almost certainly in the training data, to be fair.
pyprism•17m ago
Wow, that’s weird. I tried Bengali, but the text was transcribed into Hindi! I know there are some similar words in these languages, but I used pure Bengali that is not similar to Hindi.
adarsh2321•9m ago
That demo is indeed impressive! It's fascinating to see advancements in speech-to-text technology like Voxtral Transcribe 2. Do you think tools like this could revolutionize the way we interact with browsers and web content in the future?
satvikpendem•56m ago
Looks like this model doesn't do realtime diarization. What model should I use if I want that? So far I've only seen paid models do diarization well. I heard about Nvidia NeMo but haven't tried it and don't even know where to try it out.
aavci•55m ago
What's the cheapest device specs that this could realistically run on?
kamranjon•25m ago
I haven't quite figured out if the open weights they released on Hugging Face amount to being able to run the (realtime) model locally. I hope so though! For the larger model with diarization I don't think they open sourced anything.
pietz•48m ago
Do we know if this is better than Nvidia Parakeet V3? That has been my go-to model locally and it's hard to imagine there's something even better.
tylergetsay•34m ago
I've been using Parakeet V3 locally and, totally anecdotally, this feels more accurate but slightly slower.
boringg•42m ago
Pseudo related -- am I the only one uncomfortable using my voice with AI, out of concern that once it is in the training data it is forever reproducible? As a non-public person it seems like a risk vector (albeit a small one).
dumpstate•4m ago
I'm on voxtral-mini-latest and that's why I started seeing 500s today lol