frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
426•klaussilveira•5h ago•97 comments

Hello world does not compile

https://github.com/anthropics/claudes-c-compiler/issues/1
21•mfiguiere•42m ago•8 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...
775•xnx•11h ago•472 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
142•isitcontent•6h ago•15 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
135•dmpetrov•6h ago•57 comments

Dark Alley Mathematics

https://blog.szczepan.org/blog/three-points/
41•quibono•4d ago•3 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
246•vecti•8h ago•117 comments

A century of hair samples proves leaded gas ban worked

https://arstechnica.com/science/2026/02/a-century-of-hair-samples-proves-leaded-gas-ban-worked/
70•jnord•3d ago•4 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
180•eljojo•8h ago•124 comments

Microsoft open-sources LiteBox, a security-focused library OS

https://github.com/microsoft/litebox
314•aktau•12h ago•154 comments

How we made geo joins 400× faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes
12•matheusalmeida•1d ago•0 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
311•ostacke•12h ago•85 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/
397•todsacerdoti•13h ago•217 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
322•lstoll•12h ago•233 comments

PC Floppy Copy Protection: Vault Prolok

https://martypc.blogspot.com/2024/09/pc-floppy-copy-protection-vault-prolok.html
12•kmm•4d ago•0 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
48•phreda4•5h ago•8 comments

I spent 5 years in DevOps – Solutions engineering gave me what I was missing

https://infisical.com/blog/devops-to-solutions-engineering
109•vmatsiiako•11h ago•34 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
186•i5heu•8h ago•129 comments

Understanding Neural Network, Visually

https://visualrambling.space/neural-network/
236•surprisetalk•3d ago•31 comments

I now assume that all ads on Apple news are scams

https://kirkville.com/i-now-assume-that-all-ads-on-apple-news-are-scams/
976•cdrnsf•15h ago•415 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
144•limoce•3d ago•79 comments

Introducing the Developer Knowledge API and MCP Server

https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/
17•gfortaine•3h ago•2 comments

I'm going to cure my girlfriend's brain tumor

https://andrewjrod.substack.com/p/im-going-to-cure-my-girlfriends-brain
49•ray__•2h ago•11 comments

FORTH? Really!?

https://rescrv.net/w/2026/02/06/associative
41•rescrv•13h ago•17 comments

Evaluating and mitigating the growing risk of LLM-discovered 0-days

https://red.anthropic.com/2026/zero-days/
35•lebovic•1d ago•11 comments

Why I Joined OpenAI

https://www.brendangregg.com/blog/2026-02-07/why-i-joined-openai.html
52•SerCe•2h ago•42 comments

Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview
77•antves•1d ago•57 comments

The Oklahoma Architect Who Turned Kitsch into Art

https://www.bloomberg.com/news/features/2026-01-31/oklahoma-architect-bruce-goff-s-wild-home-desi...
18•MarlonPro•3d ago•4 comments

Claude Composer

https://www.josh.ing/blog/claude-composer
108•coloneltcb•2d ago•71 comments

Show HN: Slack CLI for Agents

https://github.com/stablyai/agent-slack
39•nwparker•1d ago•10 comments
Open in hackernews

Finding Dead Websites

https://www.marginalia.nu/log/a_122_dead_websites/
121•ingve•7mo ago

Comments

55555•7mo ago
It's a real edge case, but someone could conceivably let their own domain expire and then register it anew and restore their website. It will be impossible to tell this apart from an SEO buying and restoring a website to use for link juice.
AznHisoka•7mo ago
The DNS records would be completely revamped, or removed in that case.
marginalia_nu•7mo ago
Yeah there's no shortage of caveats in this space. One could conceivably compare the outgoing links (being a search engine and all and having historical crawl data to compare against), but my hunch the cost of distinguishing between these two cases is going to be way out of proportion when compared to the benefit.
atribecalledqst•7mo ago
Before I RTFA, I was wondering if this would be about trying to find a way to include Wayback Machine results in search. Searching the Wayback Machine is always such a nightmare, and wouldn't it be nice if your search turned up that long-dead 1997 web page that has the exact answer for what you're looking for...

(minor use case I had recently was I was trying to find old Japanese blogs for Tamagotchis, which I gather there were a ton of in the 90s but almost none survive today - imagine if I could get those instead of the 1,000,000 sites just trying to sell them to me)

Lammy•7mo ago
Kagi has this feature, “Blast From The Past” https://blog.kagi.com/kagi-features#:~:text=Interesting%20fi...
marginalia_nu•7mo ago
They're likely only serving previously accessible domains already in their index as wayback machine links, which is neat, but doesn't really solve the problem of indexing the wayback machine in a broader sense.

Would be a very nice feature to have indeed, though the data is a bit too inaccessible to index as far as I can tell (even though I've not given it any serious effort, so maybe it is?)

Lammy•7mo ago
I kinda consider that a feature and not a bug. If it were easier to find all the really deep stuff in the Wayback Machine, people would be trying to censor it all the time. I like being able to spear-fish my way into the deep shit by finding layers of URI references in other archived pages.
cosmicgadget•7mo ago
Agreed, it'd be neat to test links on the fly and substitute wayback links if they are dead and cached information if there is no snapshot.
guestbest•7mo ago
The Usenet was considered a better place to store data about a topic for a year than a website because the interactive format allowed other people to comment. There was only cgi guestbooks which pale in comparison to a newsreader. People didn’t think back then about keeping information around forever because books were still the primary way information was captured, retained and transferred
pabs3•7mo ago
Would be great if Marginalia would be feeding crawl data to the Wayback Machine too.
GJim•7mo ago
> 1,000,000 sites just trying to sell

Welcome to the modern internet.

l5870uoo9y•7mo ago
What a pleasant website theme for reading.
mlhpdx•7mo ago
I’m not sure what the authors point was with respect to ASN 16509. Are they saying parked domains don’t like being viewed by Amazon IPs or that moving to Amazon is a strong signal for being parked? The latter seems absurd. But is it?
marginalia_nu•7mo ago
It seems an especially strong signal along with the other signals, i.e. ok status + losing encryption.

The entire game is combining a bunch of weak indicators into a strong one.

koprocezar•7mo ago
That was interesting.
renegat0x0•7mo ago
Whoa, this is what I have been wondering for some time, for my crawler.

Crawler results depend on domain authority. If page owner, or page contents page change the ranking may, or should change.

However original author also could change contents, and page ranking should not be changed. So this is not easy to determine what to do with domain of it becomes inactive, or changes contents dramatically.

Currently I use only 30 day window to keep track of domains. After that period inactive domain is thrown out of the window.

However valuable domains, even if dead, reside longer. My UI provides easy link to wayback machine. So even for dead links I can browse them.

I noticed also that some domains, even if expired do serve contents, even if author left it alone. Page contents is served, but with a text that it expired.

JdeBP•7mo ago
As someone with a WWW site hit by Brexit where half the country voted to stop me having my domain name (and some other things) I read this with interest to consider how badly it would be caught out on the sort of false positive where a WWW site owner has to change ASes, change HTTP servers, set up redirects and meta information for the time left before eu. becomes unavailable, and even change DNS servers let alone a number of resource records. A lot of those seem to be things that will add up in this model. As would the fact that my prior domain name is today parked. In Canada!

Not the first sudden and unwelcome discontinuity, either.

Google came close to thinking that I was dead, and turned out when I recently checked to be still looking for me under eu., years after the fact.

And with a broader view, this sort of stuff happens to the world, and there are enough people in the same boat that it is worth thinking of false positives when major upheavals occur. They can range from ISPs just up and deciding to close up shop with zero notice (which also happened to me) to international geopolitical upheavals. Who knows! If Brexit happened, it is conceivable that one day, the island of Niue might eventually prevail and then decide overnight that non-Niue citizens may not own a nu. domain. (-:

I wonder how many times Marginalia would have declared me dead, by now. (-:

marginalia_nu•7mo ago
I think some degree of false positives is inevitable with this type of feature, but it can still provide use even if it's not perfect. Websites with flakey profiles that keep changing emit a signal of their own.
zinekeller•7mo ago
The TLS verifier workaround that you've constructed is reasonably sound (based on how TLS validation works in browsers), but the boring answer to workaround certificate problems is to do what Firefox actually does: skip AIA and only rely on known intermediates. You can import Mozilla's intermediate list as roots (https://wiki.mozilla.org/CA/Intermediate_Certificates, requires additional processing to convert to usable certs) to emulate this.

Chrome on the other hand... just read this article (https://blog.benjojo.co.uk/post/browsers-biggest-tls-mistake), it's very hard to emulate and is really, really bonkers.

marginalia_nu•7mo ago
Haha, so this rabbit hole keeps getting somehow even worse. I do not envy anyone attempting to implement browser-compatible TLS with actual security in mind.
tart-lemonade•7mo ago
> While the change detection currently only runs on a subset of about 2 million domains, the crawler is aware of approximately 36,000,000 domains in total, and about 1,500,000 of those are subdomains of tumblr.com.

So Tumblr makes up ~4% of all internet domains (or at least, all linked publicly-accessible domains)? Makes sense, but still wild to think about. I'd love to know what the top 10 domains (by sub-domain count) are.

marginalia_nu•7mo ago
Here's top 25 as far as marginalia's domain awareness goes

  +---------+------------------+
  | CNT     | DOMAIN_TOP       |
  +---------+------------------+
  | 2445090 | tumblr.com       |
  |  915418 | blogspot.com     |
  |  712132 | wordpress.com    |
  |  244369 | bandcamp.com     |
  |  236676 | uptodown.com     |
  |  115058 | github.io        |
  |   99616 | substack.com     |
  |   83419 | deviantart.com   |
  |   78243 | livejournal.com  |
  |   73866 | weebly.com       |
  |   71687 | wixsite.com      |
  |   68073 | informer.com     |
  |   67849 | medium.com       |
  |   65004 | knoji.com        |
  |   59438 | blogspot.co.uk   |
  |   59412 | list-manage.com  |
  |   41923 | itch.io          |
  |   41150 | mystrikingly.com |
  |   40743 | ibooked.dk       |
  |   39978 | fnxradio.com     |
  |   35671 | blogspot.ca      |
  |   34450 | site123.me       |
  |   33889 | ibooked.nl       |
  |   33649 | xuijs.com        |
  |   30391 | infospaceinc.com |
  +---------+------------------+