frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•3m ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
1•throwaw12•5m ago•0 comments

MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•5m ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•5m ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•7m ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•11m ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
1•andreabat•13m ago•0 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
1•mgh2•19m ago•0 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•21m ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•26m ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•28m ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
1•lifeisstillgood•28m ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
1•bundie•31m ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•32m ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
2•birdculture•34m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•36m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
2•ramenbytes•38m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•39m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•42m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•43m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
3•cinusek•43m ago•1 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•45m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

2•prateekdalal•49m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•54m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•54m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•57m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
2•ryan_j_naughton•57m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
2•ravenical•58m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
2•ValdikSS•59m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•1h ago•1 comments
Open in hackernews

Show HN: A CLI tool to transcribe and clean YouTube videos with Whisper and LLMs

https://github.com/itsmevictor/youtube-to-text
4•itsmevictor•7mo ago
Hi HN,

I built a simple command-line tool that quickly transcribes YouTube videos into clean, readable text. It uses OpenAI's Whisper for transcription and leverages the LLM of your choice to intelligently clean up the transcripts, removing filler words, correcting grammar, and improving readability.

Some highlights include:

- Automatically downloads audio directly from YouTube.

- Supports multiple output formats (TXT, SRT, VTT).

- LLM-driven transcript cleaning tailored for presentations, conversations, or lectures.

- Easy setup and straightforward CLI usage.

My main motivation to build this is that I read faster than I listen, and it is not rare that I'm interested in only a short segment of a (long) video, so it's easier to just cmd-F and jump in to that section in the transcript.

Feedback welcome!

Comments

Leftium•7mo ago
> My main motivation to build this is that I read faster than I listen

Yes! However occasionally I find it useful to refer to the original video (especially when I want to share a video at a certain timestamp.) Searchable transcripts are a great way to navigate a video if they have links that jump to the relevant timestamp in the video.

So I designed a special file format and web app based on oTranscribe + Markdown:

- https://raw.githubusercontent.com/Leftium/oTranscribe/refs/h...

- https://otranscribe.netlify.app/?vsl=definedefine

I made a tool to convert YouTube SBV/TTML files; it should be possible to add support for one of your output formats: https://github.com/Leftium/otrgen

---

There was a similar show HN[1] that opened my eyes to OpenAI Whisper, however your python script provides a better starting point than a bash script. I'll probably reference both projects when I make my own projects (including a beat-aware YouTube player that needs the audio data for beat-detection analysis.)

[1]: https://hw.leftium.com/#/item/41473379

itsmevictor•7mo ago
Yes, you're right, that's a good idea! I just checked the oTranscribe Netlify app and I think it's pretty cool.

However, I agree that it could be improved by having cleaner (transcribed) text. You should be able to integrate my approach pretty easily since srt and vtt output formats maintain the time stamps.

Let me know if there's something I can do to make your life easier. Otherwise, naturally, feel free to fork my repo etc. :-)