frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Cpdown – Copy any webpage/YouTube subtitle as clean Markdown(LLM-ready)

https://github.com/ysm-dev/cpdown
10•ysm0622•8h ago
TL;DR: I built a Chrome extension that copies webpage content or YouTube subtitles as clean, clutter-free Markdown with one click (or a shortcut). It even shows the token count, making it super handy for LLM prompts! Hi, HN!

I often copy information from the web into my notes (Obsidian) or to feed context to LLMs for summaries/translations. So I built cpdown, a browser extension for this.

cpdown lets you convert any webpage/youtube subtitle into clean Markdown and copy it to your clipboard with a single click (or a keyboard shortcut).

Here are the key features:

* Intelligent Content Extraction: Uses Mozilla's Readability or Defuddle (by the Obsidian devs!) to strip away ads, sidebars, and other noise, focusing only on the main article content. * Clean Markdown Conversion: The extracted HTML is then neatly converted to Markdown using the Turndown library. * Token Count for LLMs: It calculates and displays the token count of the copied text using tiktoken. This is super handy for knowing if you're within limits before pasting into LLMs like ChatGPT or Claude. * YouTube Transcript Copying: On YouTube video pages, it can copy the full transcript in Markdown format, automatically adding the video title as an H1 header. * Customizable Options: You can choose your preferred content extractor (Readability/Defuddle), opt to wrap copied content in a code block, and more.

cpdown is completely free and open-source, built with WXT, React, and TypeScript.

You can grab it from the Chrome Web Store or check out the source code on GitHub. I'd love to hear your feedback, feature requests, or any suggestions!

* Chrome Web Store: https://chromewebstore.google.com/detail/cpdown/knnaflplggjd... * GitHub: https://github.com/ysm-dev/cpdown

Thanks for checking it out!

Comments

voiper1•6h ago
Chrome web store link got truncated, this is the link from github: https://chromewebstore.google.com/detail/cpdown/knnaflplggjd...

Will give it a try, thanks!

Helmut10001•1h ago
Yes, such a feature is often needed these days. I use Typora with copy & paste, which also works really well.

Workout.cool – Open-source fitness coaching platform

https://github.com/Snouzy/workout-cool
138•surgomat•2h ago•42 comments

Homomorphically Encrypting CRDTs

https://jakelazaroff.com/words/homomorphically-encrypted-crdts/
68•jakelazaroff•2h ago•13 comments

Terpstra Keyboard

http://terpstrakeyboard.com/web-app/keys.htm
128•xeonmc•4h ago•38 comments

Is There a Half-Life for the Success Rates of AI Agents?

https://www.tobyord.com/writing/half-life
76•EvgeniyZh•4h ago•39 comments

MiniMax-M1 open-weight, large-scale hybrid-attention reasoning model

https://github.com/MiniMax-AI/MiniMax-M1
231•danboarder•8h ago•52 comments

"poline" is an enigmatic color palette generator using polar coordinates

https://meodai.github.io/poline/
20•zdw•3d ago•2 comments

Scrappy - make little apps for you and your friends

https://pontus.granstrom.me/scrappy/
314•8organicbits•9h ago•105 comments

Introduction to the A* Algorithm

https://www.redblobgames.com/pathfinding/a-star/introduction.html
91•auraham•1d ago•46 comments

I counted all of the yurts in Mongolia using machine learning

https://monroeclinton.com/counting-all-yurts-in-mongolia/
138•furkansahin•7h ago•38 comments

Honda conducts successful launch and landing of experimental reusable rocket

https://global.honda/en/topics/2025/c_2025-06-17ceng.html
1177•LorenDB•1d ago•374 comments

The Grug Brained Developer (2022)

https://grugbrain.dev/
897•smartmic•18h ago•414 comments

Jiga (YC W21) Is Hiring Software Engs to Make Like of Mech Engs Easier

https://www.workatastartup.com/companies/jiga
1•grmmph•3h ago

Reasoning by Superposition: A Perspective on Chain of Continuous Thought

https://arxiv.org/abs/2505.12514
11•danielmorozoff•2h ago•0 comments

Show HN: Lstr – A modern, interactive tree command written in Rust

https://github.com/bgreenwell/lstr
174•w108bmg•12h ago•53 comments

Building Effective AI Agents

https://www.anthropic.com/engineering/building-effective-agents
455•Anon84•21h ago•79 comments

3D-printed device splits white noise into an acoustic rainbow without power

https://phys.org/news/2025-06-3d-device-white-noise-acoustic.html
197•rbanffy•2d ago•48 comments

Munich from a Hamburger's Perspective

https://mertbulan.com/2025/06/14/munich-from-a-hamburgers-perspective/
29•toomuchtodo•2d ago•11 comments

A Straightforward Explanation of the Good Regulator Theorem

https://www.lesswrong.com/posts/JQefBJDHG6Wgffw6T/a-straightforward-explanation-of-the-good-regulator-theorem
34•surprisetalk•4d ago•3 comments

What Google Translate can tell us about vibecoding

https://ingrids.space/posts/what-google-translate-can-tell-us-about-vibecoding/
233•todsacerdoti•19h ago•136 comments

OpenSERDES – Open Hardware Serializer/Deserializer (SerDes) in Verilog

https://github.com/SparcLab/OpenSERDES
59•peter_d_sherman•11h ago•7 comments

Now might be the best time to learn software development

https://substack.com/home/post/p-165655726
281•nathanfig•1d ago•239 comments

Preparation of a neutral nitrogen allotrope hexanitrogen C2h-N6

https://www.nature.com/articles/s41586-025-09032-9
23•bilsbie•2d ago•16 comments

Resurrecting a dead torrent tracker and finding 3M peers

https://kianbradley.com/2025/06/15/resurrecting-a-dead-tracker.html
589•k-ian•21h ago•182 comments

Proofs Without Words

https://artofproblemsolving.com/wiki/index.php/Proofs_without_words
83•squircle•4d ago•18 comments

Making 2.5 Flash and 2.5 Pro GA, and introducing Gemini 2.5 Flash-Lite

https://blog.google/products/gemini/gemini-2-5-model-family-expands/
346•meetpateltech•23h ago•200 comments

Why JPEGs still rule the web (2024)

https://spectrum.ieee.org/jpeg-image-format-history
198•purpleko•1d ago•356 comments

Grokking NAT and packet mangling in Linux

https://vivekn.dev/blog/grokking-nat-and-packet-mangling-in-linux
21•viveknathani_•9h ago•12 comments

LLMs pose an interesting problem for DSL designers

https://kirancodes.me/posts/log-lang-design-llms.html
194•gopiandcode•19h ago•124 comments

Timescale Is Now TigerData

https://www.tigerdata.com/blog/timescale-becomes-tigerdata
149•pbowyer•1d ago•106 comments

Bzip2 crate switches from C to 100% Rust

https://trifectatech.org/blog/bzip2-crate-switches-from-c-to-rust/
305•Bogdanp•19h ago•153 comments