frontpage.

Show HN: I built a 50 site sampler from CommonCrawl refreshing every 30 minutes

https://randcrawl.com/
1•whothatcodeguy•1h ago
I tossed this together this afternoon mostly just to validate a premise: the internet has become so heavily consolidated into a few key discovery surfaces for the common user, and I miss when you could really just get lost in it. Is there a way we can unearth pieces of it we would never actually see under normal circumstances? Wouldn't it be so cool if you could just explore the internet like you're walking through random doors in a long, eternal 6TB hallway?

So, I made RandomCrawl. It's a super minimal website that does nothing more than run a Node script every 30 minutes. The script picks a random path down the file structure of the Common Crawl dataset, does some minor filtering for secure .com websites for good measure, and takes a random sample of 50 websites from the chunk.
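For anyone curious what the filter-and-sample step might look like, here is a minimal sketch in Node. It is not the actual RandomCrawl script: the function names (`isSecureDotCom`, `sampleWithoutReplacement`, `pickSites`) are made up for illustration, and it assumes the URLs from the randomly chosen Common Crawl chunk have already been read into an array. Fetching the chunk is left out entirely.

```javascript
// Hypothetical sketch of the filter-and-sample step; not the author's code.

// Keep only https URLs whose hostname ends in ".com".
function isSecureDotCom(rawUrl) {
  try {
    const url = new URL(rawUrl);
    return url.protocol === "https:" && url.hostname.endsWith(".com");
  } catch {
    return false; // skip anything that does not parse as a URL
  }
}

// Pick up to `count` distinct items uniformly at random
// using a partial Fisher-Yates shuffle on a copy of the input.
function sampleWithoutReplacement(items, count, random = Math.random) {
  const pool = [...items];
  const n = Math.min(count, pool.length);
  for (let i = 0; i < n; i++) {
    const j = i + Math.floor(random() * (pool.length - i));
    [pool[i], pool[j]] = [pool[j], pool[i]];
  }
  return pool.slice(0, n);
}

// candidateUrls: URLs already extracted from one chunk (an assumption).
function pickSites(candidateUrls, count = 50) {
  // Filter first, then drop exact duplicate URL strings before sampling.
  const unique = [...new Set(candidateUrls.filter(isSecureDotCom))];
  return sampleWithoutReplacement(unique, count);
}
```

Calling `pickSites(urls)` returns at most 50 URLs, or fewer if the chunk has fewer matches. Something like cron or `setInterval` could rerun it every 30 minutes.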

There has been a ton of noise, but it has been surprisingly fun. I feel like an internet archaeologist. For every 5 random SaaS websites, you get some random tourism site for a town you've never heard of, or an ancient blogspot from the early 2000s.

Here are a couple of great finds so far: https://ahapoetry.com/ https://alexunu.blogspot.com/2007/ https://www.brtpeinture.com/

I'm not sure I'll do much more with the website since it was an experiment, but you can bet I'll be digging around this dataset some more. It reminded me there is still a lot of expression out there on the internet, and it's amazing some of these sites are even still live. It's way more fun to explore than to mindlessly scroll one of our five favorite websites.

Disclaimer: I'm not filtering out NSFW content, so keep that in mind.

Looking back at Catacomb 3D, the game that led to Wolfenstein 3D

https://arstechnica.com/gaming/2026/02/looking-back-at-catacomb-3d-the-game-that-led-to-wolfenste...
1•AdmiralAsshat•2m ago•0 comments

Fecal microbiota transplantation and immunotherapy in metastatic renal carcinoma

https://www.nature.com/articles/s41591-025-04183-8
1•bookofjoe•5m ago•0 comments

Show HN: Stream-based AI with neurological multi-gate (Na⁺/θ/NMDA)

https://github.com/CSCT-NAIL/CSCT
2•CSCT-NAIL•7m ago•0 comments

How to carry more than your own bodyweight (2025)

https://www.bbc.com/future/article/20250124-how-to-carry-more-than-your-own-bodyweight
1•1659447091•11m ago•1 comments

Show HN: Dm.bot – DMs between AI agents with no humans in the middle

https://dm.bot
1•dommm•12m ago•0 comments

Lawsuit Challenges National Park Service Ban on Cash Payments

https://reclaimthenet.org/lawsuit-challenges-national-park-service-ban-on-cash-payments
7•bilsbie•19m ago•0 comments

Data Centers Are Not "Campuses"

https://newrepublic.com/article/205525/data-centers-campus-virginia
2•petethomas•19m ago•0 comments

Show HN: APYCalc – Privacy-First APY Calculator (Zero Data Collection)

https://www.apycalc.net/
1•ludydev•21m ago•0 comments

Voynich Manuscript

https://en.wikipedia.org/wiki/Voynich_manuscript
1•reaperducer•21m ago•0 comments

Six Facts about the Recent Employment Effects of AI (Nov. 2025, Pdf)

https://digitaleconomy.stanford.edu/app/uploads/2025/11/CanariesintheCoalMine_Nov25.pdf
2•bikenaga•27m ago•2 comments

Classified Whistleblower Complaint About Tulsi Gabbard Stalls Within Her Agency

https://www.wsj.com/politics/national-security/classified-whistleblower-complaint-about-tulsi-gab...
11•petethomas•29m ago•1 comments

The Vanilla Web Is Wonderful

https://benjaminsmallwood.com/blog/the-vanilla-web-is-wonderful/
1•bensmallwood•36m ago•1 comments

Show HN: One Ego, Any Model – A Chrome Extension for Portable AI Context

https://chromewebstore.google.com/detail/context-wallet/cipkkclgneblkoifncgjncaapiamcjho
1•haebom•36m ago•1 comments

Show HN: CancelShouldBeEasy – Generate and co-sign consumer complaint letters

https://CancelShouldBeEasy.com
1•xinbenlv•40m ago•0 comments

Lombard Effect

https://en.wikipedia.org/wiki/Lombard_effect
2•porjo•41m ago•1 comments

Ask HN: Interest in low cost / fast container registry?

1•osigurdson•43m ago•0 comments

Show HN: AI Medical Scribe WASM. Reduced API Cost to $0.03 per Month

https://www.trayce.com.au
1•mson281•46m ago•0 comments

Omg.lol – A loveable web page and email address

https://home.omg.lol/
3•1d22a•47m ago•1 comments

Getting over AI Shame

https://ajkprojects.com/getting-over-ai-shame.html
1•ashleynewman•49m ago•0 comments

VibeSQL – A query engine 100% AI-generated

https://github.com/rjwalters/vibesql
1•camuel•51m ago•0 comments

Don't Call Me Francis

https://www.persuasion.community/p/dont-call-me-francis
2•lordleft•52m ago•0 comments

Blippo+

https://blippo.plus/
4•cfcfcf•55m ago•0 comments

Ask HN: What's your competitive intelligence workflow as a small team?

1•VoderAI•59m ago•0 comments

Show HN: VPC Principle - Why AI coding fails at scale

https://github.com/Ji-Hua/Vibe-Plus-Coding
1•michaelhua•1h ago•0 comments

AI grounds Boeing 787-8 plane after pilot reports fuel switch malfunction

https://www.thehindu.com/news/national/engine-fuel-switches-malfunctioned-on-air-india-london-ben...
1•thisislife2•1h ago•1 comments

Show HN: Clawd Arena – AI Agent Competition Platform with Real-Time Battles

https://clawd-arena.live
1•unayung•1h ago•0 comments

Memory training technique may help lower stress by shifting recall patterns

https://medicalxpress.com/news/2026-01-memory-technique-stress-shifting-recall.html
2•PaulHoule•1h ago•0 comments

How I Built a Self-Healing Home Server with an AI Agent

https://madebynathan.com/2026/02/03/self-healing-infrastructure-how-an-ai-agent-manages-my-home-s...
1•nathan_f77•1h ago•0 comments

An Agent for Home

https://www.310networks.com/thoughts/an-agent-for-home/
1•kookster310•1h ago•0 comments

Spotify Killed Their API

https://community.spotify.com/t5/Spotify-for-Developers/Unable-to-create-app/td-p/7283365
2•guyfromfargo•1h ago•5 comments