frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Removing 'um' from a recording is harder than it sounds

https://doug.sh/posts/erm-a-local-cli-that-strips-ums-uhs-and-erms-from-speech/
19•dougcalobrisi•2h ago

Comments

dougcalobrisi•2h ago
This post is mostly about how surprisingly hard it is to cut filler words out of speech cleanly. Apparently, stripping ums isn't a find and replace type thing, because Whisper's timestamps are off by up to a few hundred ms and cutting on them chops syllables or leaves stutters. So, I built a tool, erm, that starts from Whisper's guess, finds where each word actually starts and stops in the audio, and snaps the cuts to silence so there's no click, with ffmpeg doing the splicing.

https://github.com/dougcalobrisi/erm

rindalir•1h ago
This is fascinating! I'm going to try this on a certain clip from Jurassic Park.
sciencesama•36m ago
there is a aah counter in toast master !! this is the software that helps !!
cadamsdotcom•27m ago
What an awesome tool and idea. I’d be keen to see if it can integrate with video editing tools.

Ideally it would slice the video in the timeline without actually removing anything, so you can scrub through your video and try with and without each disfluency (thank you - awesome word) & decide case by case which to keep!

cryptoz•27m ago
Really cool stuff and definitely going to try it; I’m also finding it wild that Google put effort into adding ums and erms into their text to speech model a while back. AI puts it in, AI helps take it out.
sublinear•23m ago
Disfluencies are not necessarily "filler". They can convey mood or hesitation. Cutting them can change the meaning.

A trivial example is "umm... well... (sigh) okay" versus just "okay". Not okay!

heroprotagonist•9m ago
Not to promote something, but Wispr Flow does that for me automatically if I trigger a setting for it..

While it's a commercial product with a subscription, I spent a long time on the free tier not even hitting their limits until I started using it so extensively that I wanted to pay for it.

And I've used Whisper in the past, mostly for tinkering. I tried it for a couple of use cases but haven't touched the base project in a while. But I do regularly use Faster-Whisper-XXL, an open source project based on Whisper, for subtitle generation.

Though, for subtitle generation, I decided to support the project and mainly use the non-public build of Faster-Whisper-XXL Pro built for donators to the open source project.

The extra features smooth out the subtitle editing process very substantially. Toss in "--roformer_overlap 0.125 --roformer_vram 16 --best_of 15 --ff_vocal_extract mb-roformer --vad_method pyannote_v3" to the cli parameters (and sometimes --realign) and you have much less work to do in SubtitleEdit or Tero Subtitler afterwards to clean it up.

Nobody ever gets credit for fixing problems that never happened (2001) [pdf]

https://web.mit.edu/nelsonr/www/Repenning=Sterman_CMR_su01_.pdf
137•sam_bristow•2h ago•45 comments

Claude Fable is relentlessly proactive

https://simonwillison.net/2026/Jun/11/fable-is-relentlessly-proactive/
119•lumpa•1h ago•78 comments

Show HN: Homebrew 6.0.0

https://brew.sh/2026/06/11/homebrew-6.0.0/
1024•mikemcquaid•13h ago•241 comments

Show HN: FablePool – pool money behind a prompt, and Fable builds it in public

https://fablepool.com
277•matthewbarras•5h ago•159 comments

If you are asking for human attention, demonstrate human effort

https://tombedor.dev/human-attention-and-human-effort/
319•jjfoooo4•4h ago•87 comments

A greyscale iPhone setup that works in everyday life

https://www.fabianhemmert.com/opinions/a-greyscale-iphone-setup-that-works-in-everyday-life
53•hemmert•20h ago•32 comments

MiMo Code is now released and open-source

https://mimo.xiaomi.com/mimocode
433•apeters•12h ago•250 comments

Anthropic apologizes for invisible Claude Fable guardrails

https://www.theverge.com/ai-artificial-intelligence/948280/anthropic-claude-fable-invisible-disti...
331•rarisma•15h ago•329 comments

Petition to Withdraw Canada's Bill C-22

https://www.ourcommons.ca/petitions/en/Petition/Sign/e-7416
378•hmokiguess•11h ago•131 comments

A jacket that harvests drinking water from the air

https://news.utexas.edu/2026/06/11/this-jacket-pulls-drinking-water-from-thin-air/
50•ilreb•4h ago•30 comments

Software is made between commits

https://zed.dev/blog/introducing-deltadb
215•jeremy_k•10h ago•163 comments

Ear Training Practice

https://tonedear.com/
170•mattbit•3d ago•88 comments

macOS 27 Beta breaks the ability to boot Asahi Linux

https://www.phoronix.com/news/macOS-27-Beta-Breaks-Asahi
255•josephcsible•2d ago•110 comments

The RCE that AMD wouldn't fix

https://mrbruh.com/amd2/
233•MrBruh•11h ago•100 comments

Tailwind and slop apps

https://briandouglas.ie/llm-tailwind-template/
51•coneonthefloor•5h ago•30 comments

Emacs appearances in pop culture

https://ianyepan.github.io/posts/emacs-in-pop-culture/
273•ggcr•1d ago•76 comments

Claude Fable 5: mid-tier results on coding tasks

https://www.endorlabs.com/learn/claude-fable-5-mythos-grade-hype
250•bugvader•11h ago•114 comments

Lines of code got a better publicist

https://curlewis.co.nz/posts/lines-of-code-got-a-better-publicist/
366•RyeCombinator•14h ago•249 comments

Developer gets Half-Life running at 30 FPS on a Nokia N95

https://www.tomshardware.com/video-games/handheld-gaming/developer-gets-half-life-running-at-30-f...
227•ljf•3d ago•75 comments

Faking keyword arguments to functions in C++

https://nibblestew.blogspot.com/2026/06/faking-keyword-arguments-to-functions.html
12•ibobev•2d ago•0 comments

Making a vintage LLM from scratch

https://crlf.link/log/entries/260525-1/
29•croqaz•18h ago•4 comments

Show HN: Boo – Screen-style terminal multiplexer built on libghostty

https://github.com/coder/boo
54•kylecarbs•6h ago•20 comments

Reading for pleasure is sharply down among schoolkids, report shows

https://www.nbcnews.com/data-graphics/kids-reading-less-lower-levels-department-education-study-r...
90•freejoe76•1d ago•99 comments

How a new DSL may survive in the era of LLMs

https://www.williamcotton.com/articles/how-a-new-dsl-survives-in-the-era-of-llms
18•williamcotton•12h ago•4 comments

MTG Bench: Testing how well LLMs can play Magic

https://mtgautodeck.com/articles/mtg-bench/
32•CallumFerg•11h ago•19 comments

Apple didn't revolutionize power supplies; new transistors did (2012)

https://www.righto.com/2012/02/apple-didnt-revolutionize-power.html
95•geerlingguy•9h ago•8 comments

FPS.cob: A first person shooter in COBOL

https://github.com/icitry/FPS.cob
107•MBCook•11h ago•63 comments

Waymo Premier

https://waymo.com/blog/2026/06/waymo-premier/
161•boulos•10h ago•409 comments

Babel-USB: USB drive with every file

https://github.com/p2r3/babel-usb
31•LorenDB•1d ago•12 comments

Open Reproduction of DeepSeek-R1

https://github.com/huggingface/open-r1
205•yogthos•13h ago•17 comments