Show HN: Kage – Shadow any website to a single binary for offline viewing

https://github.com/tamnd/kage
86•tamnd•1h ago•27 comments

Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

https://github.com/nex-agi/Nex-N2/issues/4
162•unrvl22•3h ago•89 comments

Firewood Splitting Simulator

https://screen.toys/firewood/
413•memalign•4d ago•139 comments

Ask HN: What are you working on? (June 2026)

62•david927•3h ago•229 comments

The Birth and Death of JavaScript (2014)

https://www.destroyallsoftware.com/talks/the-birth-and-death-of-javascript
177•subset•6h ago•100 comments

Caddy compatibility for zeroserve: 3x throughput and 70% lower latency

https://su3.io/posts/zeroserve-caddy-compat
98•losfair•5h ago•29 comments

Perlisisms

https://www.cs.yale.edu/homes/perlis-alan/quotes.html
59•tosh•4h ago•26 comments

Formal Methods and the Future of Programming

https://blog.janestreet.com/formal-methods-at-jane-street-index/?from_theconsensus=1
97•eatonphil•6h ago•28 comments

USB Power Delivery: Plugging into the Benefits

https://www.aptiv.com/en/insights/article/usb-power-delivery-plugging-into-the-benefits
14•mooreds•3d ago•12 comments

Not mine, but it's a website where you can use a segment display

https://aresluna.org/segmented-type/
11•unexpectedVCR•3d ago•0 comments

Lisp's Influence on Ruby

https://blog.tacoda.dev/lisps-influence-on-ruby-6a54f1a7740e
175•tacoda•3d ago•25 comments

No, everyone is not using AI for everything

https://gabrielweinberg.com/p/people-are-consuming-ai-like-they
303•yegg•4h ago•311 comments

FarOutCompany

https://faroutcompany.com/
69•bookofjoe•5h ago•9 comments

The only scalable delete in Postgres is DROP TABLE

https://planetscale.com/blog/the-only-scalable-delete
77•hollylawly•3d ago•35 comments

Show HN: Dual YOLOv8n UAV Detection on RK3588S at 42 FPS Using NPU

https://github.com/alebal123bal/khadas_yolov8n_multithread
41•alebal123bal•4h ago•7 comments

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

128•iliashad•3h ago•30 comments

Global density and biomass of arbuscular mycorrhizal fungal networks

https://www.science.org/doi/10.1126/science.adu4373
31•zdw•1d ago•2 comments

Linux 7.1

https://lore.kernel.org/lkml/CAHk-=wi4BF4bMhZNZ1tqs+FFV4OuZRe3ZqdWB+LxRLmRweUzQw@mail.gmail.com/T/#u
113•berlianta•3h ago•18 comments

Show HN: 3D print Z reinforcement via injected loops

https://mgunlogson.github.io/magma/
29•mgunlogson•5d ago•9 comments

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks

https://twitter.com/zenmagnets/status/2065796012820848699
120•lucasfcosta•4h ago•30 comments

Rome Fell and Nobody Noticed

https://friedkielbasa.substack.com/p/rome-fell-and-nobody-noticed
30•fkozlowski•43m ago•1 comments

Quivers: A year of linear algebra by drawing arrows

https://lisyarus.github.io/blog/posts/quivers-a-year-of-linear-algebra-by-drawing-arrows.html
20•ibobev•4d ago•3 comments

How to Earn a Billion Dollars

https://paulgraham.com/earn.html
287•kingstoned•7h ago•799 comments

How did Atari apply side art to Arcade Cabinets?

https://arcadeblogger.com/2026/06/14/how-did-atari-apply-side-art-to-arcade-cabinets/
55•msephton•6h ago•15 comments

Free SQL→ER diagram tool, runs in the browser, nothing uploaded

https://sqltoerdiagram.com/
316•robhati•15h ago•63 comments

Honda Civics and the Evil Valet

https://juniperspring.org/posts/honda-evil-valet/
369•librick•18h ago•89 comments

A 'cold blob' in the Atlantic could be a sign of AMOC shutdown

https://www.cnn.com/2026/06/12/climate/cold-blob-atlantic-amoc-ocean-circulation
136•tambourine_man•4h ago•177 comments

Swiss voters reject proposal to cap population at ten million

https://www.swissinfo.ch/eng/swiss-politics/swiss-voters-reject-proposal-to-cap-population-at-ten...
152•FabCH•2h ago•138 comments

KPMG pulls report on AI usage due to apparent hallucinations

https://techcrunch.com/2026/06/13/kpmg-pulls-report-on-ai-usage-due-to-apparent-hallucinations/
93•Brajeshwar•5h ago•13 comments

Dillo directory – Directory of useful sites that work reasonably well on Dillo

https://dir.dillo-browser.org/
24•HotGarbage•2h ago•0 comments

Show HN: Kage – Shadow any website to a single binary for offline viewing

https://github.com/tamnd/kage
86•tamnd•1h ago

Comments

maxloh•1h ago
I find SingleFile [0] to be a much more robust version of this.

It strips out all the JavaScript too, but also packs everything into a single HTML file that is easy to transfer. Binary assets (like web fonts and images) are packed as base64 strings.

They also offer a CLI powered by Puppeteer. [1]

[0]: https://github.com/gildas-lormeau/singlefile

[1]: https://github.com/gildas-lormeau/single-file-cli
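
To make the base64 packing described above concrete, here is a minimal sketch of turning a binary asset into a `data:` URI and substituting it into a page. It is not SingleFile's actual code; the helper names `to_data_uri` and `inline_image` are hypothetical, and a real tool would rewrite references through an HTML parser rather than plain string replacement.

```python
import base64
import mimetypes


def to_data_uri(data: bytes, filename: str) -> str:
    """Encode raw bytes as a data: URI, guessing the MIME type from the filename."""
    mime, _ = mimetypes.guess_type(filename)
    if mime is None:
        # Fall back to a generic binary type when the extension is unknown.
        mime = "application/octet-stream"
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def inline_image(html: str, src: str, data: bytes) -> str:
    """Replace every src="<src>" reference with an embedded data URI (hypothetical helper)."""
    return html.replace(f'src="{src}"', f'src="{to_data_uri(data, src)}"')
```

Base64 encodes every 3 bytes as 4 characters, so embedded assets grow by roughly a third; that is the price of getting one portable file.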

tamnd•1h ago
It seems this repo only saves one web page?

What I'm implementing here is mirroring a whole website, with all its subpages, so you can browse it all offline. For example, all essays from paulgraham.com.

tamnd•1h ago
And thanks for the link. Let me implement this single HTML feature, it looks nice to have!
HelloUsername•58m ago
What's the difference from File -> Save As in any web browser on a computer?
nmstoker•52m ago
That's for a single page, this handles the whole site. Also the browser Save As options often work poorly.
gregwebs•1h ago
This seems like it has the potential to create a lot of load on a site. Are there settings to limit how fast it clones, or to skip images and videos? Is there a way to fetch only a subset of a website?
tamnd•1h ago
Could you help create a new issue for that? I will do it later. It is already 1:00 AM my time, but I am happy that anyone is interested in it. : )
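
The settings asked about here (crawl speed, skipping media, fetching only part of a site) could look roughly like the sketch below. Nothing in it comes from Kage: `should_fetch`, `Throttle`, the extension list, and all parameter names are assumptions made for illustration.

```python
import time
from urllib.parse import urlparse

# Hypothetical list of extensions treated as heavy media.
MEDIA_EXTENSIONS = (".png", ".jpg", ".jpeg", ".gif", ".webp", ".mp4", ".webm")


def should_fetch(url: str, host: str, path_prefix: str = "/", skip_media: bool = True) -> bool:
    """Decide whether a discovered URL belongs to the subset being mirrored."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or parts.hostname != host:
        return False  # stay on the target site
    path = parts.path or "/"
    if not path.startswith(path_prefix):
        return False  # outside the requested subtree
    if skip_media and path.lower().endswith(MEDIA_EXTENSIONS):
        return False  # skip images and videos
    return True


class Throttle:
    """Enforce a minimum delay between consecutive requests."""

    def __init__(self, min_interval: float):
        self.min_interval = min_interval
        self._last = None

    def wait(self) -> float:
        """Sleep if the previous request was too recent; return the seconds slept."""
        slept = 0.0
        if self._last is not None:
            remaining = self.min_interval - (time.monotonic() - self._last)
            if remaining > 0:
                time.sleep(remaining)
                slept = remaining
        self._last = time.monotonic()
        return slept
```

A crawler loop would call `throttle.wait()` before each request and drop any link for which `should_fetch` returns `False`. Honoring the site's robots.txt would be a natural addition.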
sanqui•1h ago
Cool concept. I would like to see this combined with mitmproxy for archive grade fidelity. You could be saving exactly the data served and at the same time a representation by a modern (contemporary) browser, with all JS having run. This combination would be my perfect replacement for the WARC format.
tamnd•1h ago
I'm working on WARC too, using the format from Common Crawl!

By converting it to Markdown, we save a lot of space, but it is for a different purpose and a different project: https://github.com/tamnd/ccrawl-cli

sanqui•1h ago
That's neat! In my opinion, the WARC format is quite tricky and underspecified, especially since HTTP/2 introduced new semantics. It encodes too much in-band and requires rewriting of the server data. A mitmproxy capture is higher fidelity and supports capturing modern features such as WebSockets. I think if we could wrap Kage's crawler interactions with it and store its capture (the intercepted traffic), we could make a potentially nice new archival format.
tamnd•59m ago
I tried to follow well-known formats first, such as WARC and ZIM from Kiwix, so we could benefit from existing tooling support.

For my own custom data format, I have a lot of private code that I plan to release soon. It is optimized for compression, fast lookups, and more. I have been working on it for two years. This is part of a larger, ambitious umbrella project: I am building Google from scratch (all open source), something that anyone can host, including the crawler, indexer, storage, and serving layers. Stay tuned!

rahimnathwani•1h ago
So this is like using wget --mirror except that it works on pages that require javascript, right?
tamnd•1h ago
Yeah, it is. For example, openai.com is rendered with Next.js, so I will try to mirror it tomorrow.
wolttam•1h ago
One use I'd have for this is company wikis that you want to give folks easy offline access to (maybe the wiki has documentation that's useful at sites that don't have cellular coverage).

Cool!

It would be especially cool to have a version that didn't require the separate serving process - even though it's nifty you can package up a whole site as a single binary.

Maybe a single HTML entrypoint shim with a bit of javascript that could index into an archive (potentially embedded) of the site's content?
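
One way to picture the shim idea: a generator that embeds every page as JSON inside a single HTML file, plus a few lines of JavaScript that render the selected page in an iframe. This is only a sketch under stated assumptions. `build_single_file`, the `__PAGES__`/`__START__` placeholders, and the hash-based navigation are all hypothetical, and links inside the archived pages would still need rewriting to `#/path` form for navigation to work.

```python
import json

# Viewer template; __PAGES__ and __START__ are placeholders filled in below.
TEMPLATE = """<!doctype html>
<meta charset="utf-8">
<script id="archive" type="application/json">__PAGES__</script>
<iframe id="view" style="border:0;width:100%;height:100vh"></iframe>
<script>
const pages = JSON.parse(document.getElementById("archive").textContent);
function show(path) {
  document.getElementById("view").srcdoc = pages[path] || "<p>Not archived</p>";
}
window.addEventListener("hashchange", () => show(location.hash.slice(1)));
show(location.hash.slice(1) || __START__);
</script>
"""


def embed_json(value) -> str:
    """Serialize to JSON, escaping "</" so content cannot close the <script> element early."""
    return json.dumps(value).replace("</", "<\\/")


def build_single_file(pages: dict, start: str) -> str:
    """Return one HTML document holding every page, keyed by its path."""
    return TEMPLATE.replace("__PAGES__", embed_json(pages)).replace("__START__", embed_json(start))
```

Opening the resulting file directly from disk needs no serving process, at the cost of loading the whole archive into memory at once.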

tamnd•1h ago
Submitting this to Hacker News is the right place! Thanks for your idea. I will consider implementing that :)

Also, in my mind, I already have a script/program to convert HTML to Markdown, so it could actually store everything on disk as a folder of Markdown files, and then commit them to a Git repo.

grahamstanes17•1h ago
nice
dimiprasakis•53m ago
Neat project, I like the idea. One thing from a quick read: you launch Chrome with --no-sandbox. Is there a good reason for that? Security wise it's probably not a good idea. If there is no reason, I'd suggest leaving the sandbox on!

In any case, cool stuff :)

lolpython•48m ago
This is cool. I could see myself downloading the articles behind the first couple pages of hacker news with this, for viewing on a flight or long distance train ride with spotty internet
daviding•46m ago
Nice idea! fwiw, false positives and all, but the Windows 11 default Windows Security doesn't like it: `leakless.exe: Operation did not complete successfully because the file contains a virus or potentially unwanted software.`
delduca•32m ago
curl can do this
Igor_Wiwi•26m ago
This is quite a useful tool, especially for cases where internet access is limited (flights, for example). I implemented it as a separate feature in mdview.io: for example, you can export a document as an HTML file for offline usage, with all the presentation features like rich tables, Mermaid, etc. built in. Example: https://mdview.io/s/why-markdown-became-default-format-for-a... then try Export - Export HTML
telesilla•17m ago
I've been using httrack (https://www.httrack.com) to download wikis to read on flights, which isn't perfect but better than I'd found previously. I'll try this out, I'd be delighted to have good results. Thanks for the post.
sanqui•52m ago
I'm a fan of compatibility with established formats!

Sounds awesome. There is a lot of untapped potential with respect to efficiently archiving and indexing websites. I saw the impressive things Marginalia Search is doing in this area (the blog is great when it gets technical). There are also a lot of very complete archives of websites out there which are not being indexed at all, and I would love to make them available for researchers. In any case, I'm interested in your project!

Prime_Axiom•7m ago
Looking forward to the next project! I love these kinds of archiving tools.
Dhavidh•1h ago
Sounds interesting