frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

ETH Zurich and EPFL to release a LLM developed on public infrastructure

https://ethz.ch/en/news-and-events/eth-news/news/2025/07/a-language-model-built-for-the-public-good.html
218•andy99•3h ago•29 comments

jank is C++

https://jank-lang.org/blog/2025-07-11-jank-is-cpp/
141•Jeaye•4h ago•45 comments

OpenAI's Windsurf deal is off – and its CEO is going to Google

https://www.theverge.com/openai/705999/google-windsurf-ceo-openai
58•rcchen•24m ago•21 comments

Upgrading an M4 Pro Mac mini's storage for half the price

https://www.jeffgeerling.com/blog/2025/upgrading-m4-pro-mac-minis-storage-half-price
257•speckx•7h ago•160 comments

Andrew Ng: Building Faster with AI [video]

https://www.youtube.com/watch?v=RNJCfif1dPY
115•sandslash•1d ago•32 comments

Bill Atkinson's psychedelic user interface

https://patternproject.substack.com/p/from-the-mac-to-the-mystical-bill
336•cainxinth•10h ago•181 comments

Astronomers race to study interstellar interloper

https://www.science.org/content/article/astronomers-race-study-interstellar-interloper
82•bikenaga•6h ago•43 comments

Activeloop (YC S18) Is Hiring AI Search and Python Back End Engineers(Onsite,MV)

https://careers.activeloop.ai/
1•davidbuniat•58m ago

Show HN: RULER – Easily apply RL to any agent

https://openpipe.ai/blog/ruler
32•kcorbitt•4h ago•4 comments

Lead pigment in turmeric is the culprit in a global poisoning mystery (2024)

https://www.npr.org/sections/goats-and-soda/2024/09/23/nx-s1-5011028/detectives-mystery-lead-poisoning-new-york-bangladesh
253•perihelions•6h ago•127 comments

Repaste Your MacBook

https://christianselig.com/2025/07/repaste-macbook/
137•speckx•9h ago•85 comments

Pa. House passes 'click-to-cancel' subscription bills

https://www.pennlive.com/news/2025/07/pa-house-passes-click-to-cancel-subscription-bills-as-court-throws-out-federal-rule.html
171•bikenaga•5h ago•60 comments

I'm more proud of these 128 kilobytes than anything I've built since

https://medium.com/@mikehall314/im-more-proud-of-these-128-kilobytes-than-anything-i-ve-built-since-53706cfbdc18
66•mikehall314•2h ago•18 comments

At Least 13 People Died by Suicide Amid U.K. Post Office Scandal, Report Says

https://www.nytimes.com/2025/07/10/world/europe/uk-post-office-scandal-report.html
499•xbryanx•10h ago•429 comments

In a First, Solar Was Europe's Biggest Source of Power Last Month

https://e360.yale.edu/digest/solar-biggest-power-source-europe-june-2025
157•Brajeshwar•5h ago•93 comments

Monorail – Turn CSS animations into interactive SVG graphs

https://muffinman.io/monorail/
16•stanko•3d ago•2 comments

Air India Flight 171 Accident Preliminary Report [pdf]

https://aaib.gov.in/What%27s%20New%20Assets/Preliminary%20Report%20VT-ANB.pdf
29•ummonk•1h ago•23 comments

Show HN: Pangolin – Open source alternative to Cloudflare Tunnels

https://github.com/fosrl/pangolin
434•miloschwartz•1d ago•97 comments

Allen G. Hassenfeld, former CEO of Hasbro, dies at 76

https://abcnews.go.com/Business/wireStory/allen-hassenfeld-former-ceo-hasbro-family-founded-iconic-123624319
5•Bluestein•2d ago•0 comments

LLM Inference Handbook

https://bentoml.com/llm/
278•djhu9•19h ago•14 comments

OpenFront: Realtime Risk-like multiplayer game in the browser

https://openfront.io/
175•thombles•15h ago•44 comments

The ChompSaw: A benchtop power tool that's safe for kids to use

https://www.core77.com/posts/137602/The-ChompSaw-A-Benchtop-Power-Tool-Thats-Safe-for-Kids-to-Use
271•surprisetalk•4d ago•187 comments

Show HN: Vibe Kanban – Kanban board to manage your AI coding agents

https://github.com/BloopAI/vibe-kanban
137•louiskw•6h ago•90 comments

Google nerfs Pixel 6a batteries following fire hazard

https://arstechnica.com/gadgets/2025/07/a-mess-of-its-own-making-google-nerfs-second-pixel-phone-battery-this-year/
27•fffrantz•3h ago•28 comments

Overtourism in Japan, and how it hurts small businesses

https://craigmod.com/ridgeline/210/
172•speckx•8h ago•329 comments

Introduction to Digital Filters

https://ccrma.stanford.edu/~jos/filters/
3•ofalkaed•2h ago•0 comments

The day someone created 184 billion Bitcoin (2020)

https://decrypt.co/39750/184-billion-bitcoin-anonymous-creator
76•lawrenceyan•17h ago•82 comments

Postgres LISTEN/NOTIFY does not scale

https://www.recall.ai/blog/postgres-listen-notify-does-not-scale
545•davidgu•4d ago•277 comments

Recovering from AI addiction

https://internetaddictsanonymous.org/internet-and-technology-addiction/signs-of-an-addiction-to-ai/
232•pera•10h ago•252 comments

AI agent benchmarks are broken

https://ddkang.substack.com/p/ai-agent-benchmarks-are-broken
167•neehao•8h ago•78 comments
Open in hackernews

Top DNS domains seen on the Quad9 recursive resolver array each day

https://github.com/Quad9DNS/quad9-domains-top500
142•speckx•6h ago

Comments

cbsks•6h ago
What’s up with wvdbozpfc.com?

There’s a bunch of random looking domain names: cmidphnvq.com, rpqihexdb.com, facebook.com. I’d guess they for advertising?

forty•6h ago
Also blockdh100b ?
maxmcd•5h ago
router.blockdh100b.net resolves

so does router.blockdh100c.co

t0mas88•6h ago
Or malware, those would typically be fairly random domain names that are queried for updates or instructions by a large number of infected devices.
0points•6h ago
That's what I'm thinking too. That would suggest some very large operational botnets ... :-/
netsharc•5h ago
Or they query the DNS very often. Most devices have DNS caching, so if things like tiktok.com end up there, there must be a loot of devices (also, a lot of subdomains, which aren't visible in these lists).
BearOso•6h ago
I looked up a couple. They're cloudflare regional servers.
gmuslera•6h ago
It could be a good pattern for spam/ads organizations, changing the random domain name as soon as traffic drops because the actual ones ended in enough blocklists.
mammuth•6h ago
Are there host lists for pihole/adguard/ublock for these kinds of domains?

I'd assume the domains change regularly if it's malware or bot networks, but because they rank so high in this list, it sounds like it should be feasible to keep a blocklist somewhat up to date.

danudey•5h ago
It could also be ad networks; create random domains and subdomains so that simple domain blocklists are difficult to keep up to date efficiently (or at least, so that constant maintenance is required).
homebrewer•5h ago
https://gitlab.com/malware-filter

Some of these lists are already in uBO out of the box.

jtbayly•6h ago
I expected to see porn in the list.
atomicnumber3•5h ago
I was personally going to be surprised. Bots and machines categorically do not peruse such material, and DNS traffic is largely not going to have a human on the other end.
0points•6h ago
> https://github.com/Quad9DNS/quad9-domains-top500/blob/main/t...

{"position": 5, "domain_name": "kxulsrwcq.com", "date": "2025-07-10"}

What the

https://www.ipaddress.com/website/kxulsrwcq.com/

> Safety/Trust: Unknown

0points•6h ago
More:

{"position": 26, "domain_name": "cmidphnvq.com", "date": "2025-07-10"}

{"position": 28, "domain_name": "xmqkychtb.com", "date": "2025-07-10"}

{"position": 37, "domain_name": "ezdrtpvsa.com", "date": "2025-07-10"}

{"position": 38, "domain_name": "wvdbozpfc.com", "date": "2025-07-10"}

{"position": 46, "domain_name": "bldrdoc.gov", "date": "2025-07-10"}

{"position": 52, "domain_name": "gadf99632rm.xyz", "date": "2025-07-10"}

netsharc•6h ago
bldrdoc.gov seems to be Cisco devices looking for a time server: https://community.cisco.com/t5/ipv6/cisco-switch-generating-...

Geniuses...

0points•6h ago
Oh, hah. Well that doesn't seem intentionally malicious then.

I added it in the first place as it was a non-resolving .gov in the top 50 list which seemed out of place to me.

> bldrdoc.gov: No address associated with hostname

I see that the time related subdomains in your link do resolve to the nist.gov timeserver.

But I really am wondering what's up with all of the rest of these domains.

Matheus28•6h ago
Probably some sort of command and control for a botnet.

They calculate a random domain name based on the timestamp (so it’s constantly changing every X days in case it gets seized), and have some validation to make sure commands are signed (to prevent someone name squatting to control their botnet).

threeducks•5h ago
Wow, that's smart. I was wondering whether there is a way for the bots to generate "unpredictable" domains such that security researchers could not predict them efficiently (even with source code), but the botnet controller can.

Time-lock puzzles come close, but but it requires that the bots have computing power comparable to the security researchers.

afandian•5h ago
I can see a future where Cloudflare or similar offer a DNS + proxy + Root CA combo to intercept these. Maybe they already do.
threeducks•5h ago
That might work for the current generation of bots, but it will become infeasible when the domain names are generated in such a way that they overlap with spellable and existing domain names.
Tijdreiziger•5h ago
Quad9 (the subject of this post) already offers ‘threat blocking’ by default.

https://quad9.net/service/threat-blocking/

lurkshark•4h ago
If I’m remembering correctly, Conficker was the first major use of this technique. They used a relatively small domain pool (250) so the registries were able to lock them up preemptively.

I remember a couple legitimate sites getting slammed by accidental DDOS because the algorithm happened to generate their domain, but having a hard time finding a reference to that.

https://en.m.wikipedia.org/wiki/Conficker

orlp•4h ago
> Wow, that's smart. I was wondering whether there is a way for the bots to generate "unpredictable" domains such that security researchers could not predict them efficiently (even with source code), but the botnet controller can.

There is a fairly simple method which achieves the same advantage for a botnet controller.

1. Use a hash of the current day to derive, for that day, an infinite stream of domain names. This could be something as simple as `to_human_readable_domain(sha256(daily_hash + i))`.

2. A botnet slave attempts to access servers in a diagonal order over (days, domains), starting at the first domain for today and working backwards in days and forwards in domains. An image best describes what I mean by this: https://i.imgur.com/lcEbHwz.png

3. So long as one of those domains is controlled by the botnet operator (which can be verified using a signed response from the server), they can control the botnet.

This means that the botnet operator only needs to purchase one domain every couple of days to keep controlling their botnet, while someone trying to stop them will have to buy thousands and thousands every day.

And when you successfully purchase a domain you can publish the new domain to any connected slaves, so this scheme is only necessary for recruitment into the network, not continued control.

tea-lover•4h ago
Here's the same image on a less horrible file hosting:

https://files.catbox.moe/gilmd1.png

Imgur has been inaccessible for me for months, they're one of those organizations that consider it proper to block whole countries to counter bot abuse.

stirfish•3h ago
Hmm, catbox used to be blocked for me too, but I can access it today. That's interesting.
itake•4h ago
I've definitely heard of cnc using a plural of domains for this reason. the bots have a list of domains they reach out to, searching for one that is valid.

I believe one issue with this strategy is many corporate VPNs block fresh domains. I guess if the software was pinned to use encrypted DNS instead of whatever the OS recommends, then the DNS blocking could be avoided...

m-s-y•3h ago
How would a corporate DNS block new domains, exactly?
aaronax•3h ago
A firewall. For example, Palo Alto firewalls can easily be configured to block domains newer than ~30 days old.

https://knowledgebase.paloaltonetworks.com/KCSArticleDetail?...

itake•3h ago
My employer uses Zscaler. I don't know exactly how they implement this, but my educated guess is the corporate DNS server doesn't resolve domains that were created recently.

In technical terms, the device asks the private corporate DNS server for the IP address of the hostname. The private DNS server checks the requested domain against a threat intelligence feed that tracks domain registration dates (and security risks). If the domain is deemed a threat, either return an IP address which points at a server that shows a warning message (if http traffic) or return an invalid IP (0.0.0.0).

paranoidrobot•3h ago
Have a cache of domains you know about with registration date.

When getting a query for a domain you have not heard about, query whois for it. Store it's registration date in the cache.

sim7c00•4h ago
there are tools pretty good at detecting DGAs these days, but not often implemented.

the best thing to do afaik is use services normal user shave access to, and communicate via those. its hard to tell for anyone who's extracting the data from the third party so the server is hidden. (e.g bot posts images to twitter, and server scrapes the images from twitter, this is also already old news but easier and more likely to sail through that next gen firewall -_-)

i'd say having ur 'own' servers and domains is maybe even a bit dated ( though sadly still very effective!)

immibis•4h ago
It's one of many possible strategies. Any one strategy can be blocked if it's used by enough malicious actors (e.g. Twitter can be forced to block base64 tweets); if they all use different strategies, it becomes harder to justify blocking each individual one.
bobmcnamara•4h ago
Use a hash chain!

Each time you resolve, the resulting IP can be part of the hash for predicting a future hostname.

miyuru•6h ago
google the domains and you will find subdomains that point to cachefly.

    hiwd.kxulsrwcq.com is pointing to vdd.cachefly.net 
I am not sure, but my guess is they might be used by some kind of a streaming service.
danudey•5h ago
Most likely something like an ad service to prevent their content being caught by domain blocklists. That would be similar to how a lot of websites started using randomized strings for attributes like id and class so that users couldn't block page elements based on CSS selectors.
gchamonlive•5h ago
Interesting how ad services and botnets behave similarly in some aspects
mlinhares•4h ago
Cue in "Are we the baddies?" meme.
immibis•4h ago
They are both malicious software that lots of people want to block; one happens to be legal.
danielcid•4h ago
And they are often used with random sub domains as well (but they did not include sub domains in their list).

Ex:

https://dnsarchive.net/search?q=cmidphnvq.com

https://dnsarchive.net/search?q=xmqkychtb

https://dnsarchive.net/ipv4/34.126.227.30

reactordev•4h ago
Poor Argentina…

https://radar.cloudflare.com/domains/domain/kxulsrwcq.com

whalesalad•4h ago
One of the CNAME's defined for that domain is "hiwd.kxulsrwcq.com" which appears to be flagged for malware. https://www.securefeed.com/Content/WebLookup?host=hiwd.kxuls...
supriyo-biswas•6h ago
Seems like it'd be a good addition to the Tranco list: https://tranco-list.eu/
joelesler•6h ago
They already are. Source: DNS Researcher myself.
tptacek•4h ago
Came here to say: if people are interested in this stuff, they should just pull down the Tranco list --- it includes feeds from Quad9 and Cloudflare.
peterdavehello•3h ago
In addition to Tranco, I maintain regularly updated lists of the top one million domains from sources like Cisco, Majestic, BuiltWith, Statvoo, DomCop, and Cloudflare. Feel free to check it out: https://github.com/PeterDaveHello/top-1m-domains
jeffbee•6h ago
I don't see how it would be possible to produce this table under Quad9's privacy policy. Nothing in their privacy policy says that they maintain logs that would enable them to count queries by label. Can anyone explain?
danudey•5h ago
I took a look at their privacy policy and agree that it doesn't specifically list that it logs which domains are being queried. It does list a bunch of things it does log as counters, all of which seems reasonable, but they don't explicitly say "we count which domains are being queried".

That said, I think it's entirely reasonable for them to log domains alone if they're completely disconnected from any user activity, i.e. a simple "increment the counter for foo.com" is reasonable since that's unrelated to user privacy.

staplers•5h ago
Unless say, an adversary can link an obscure domain to a specific user/use case. Get that counter log and you can track a certain behavior (only pings this domain when about to do something or when on vacation, their house is empty, etc.)
Tijdreiziger•5h ago
The average burglar probably isn’t cross-referencing DNS statistics.
staplers•4h ago
Yes but I don't think the average Quad9 user is worried about being tracked by average burglars.. more like advanced syndicates and nation states.
natebc•2h ago
You think the average quad9 user is being tracked by advanced syndicates and nation states?
greyface-•2h ago
Such surveillance is often done on a dragnet basis, so, yes.
Tijdreiziger•2h ago
Huh? The average Quad9 user is probably a tech-savvy person who cares about online privacy and/or malware protection (Quad9 blocks known malicious domains).
Bender•4h ago
One way around that is to set up a cron job that queries the most common domains one visits hourly. When requested by workstations and cell phones they will be served up by cache. At least that is what I have been doing for a few decades and works fine. I block all the DoH/DoT resolvers which is easier to do than some might think. One can do the individual A records or just the apex A/NS records to get infrastructure cache and then configure Unbound to prefetch records about to expire.

Just for fun I have added some of these into my cron job.

ratorx•5h ago
It does say that they collect this information in their “Data and Privacy Policy”. Specifically section 2.2 (Data Collected): https://quad9.net/privacy/policy/

Which policy are you referring to that implies they don’t?

Also I think you are assuming they store query logs and then aggregate this data later. It is much simpler just to maintain an integer counter for monitoring as the queries come in, and ingest that into a time series database (not sure if that’s what they actually do). Maybe it needs to be a bit fancier to handle the cardinality of DNS names dimension, but re-constructing this from logs would be much more expensive.

jeffbee•5h ago
I don't see how that is compatible with 2.2. They don't say anything about counters per label. It says counter per RR type, and watermarks of least and most recent timestamps by label, not count by label.

If an organization is going to be this specific about what they count, it implies that this is everything they count, not that there may also be other junk unmentioned.

yawndex•5h ago
The section you mentioned does not say anything about having counters for labels. It only mentions that they record "[t]he times of the first and most recent instances of queries for each query label".
rpdillon•4h ago
Well, the counters aren't data collected, they are data derived from the data they do collect. The privacy policy covers collection.

EDIT: I see they went out of their way to say "this is the complete list of everything we count" and they did not include counters by label, so I see your point!

GCUMstlyHarmls•5h ago
transmissionbt.com (A bittorrent client for macOS) is out ranking youtube, wikipedia, github, etc. Is transmission that popular? I assume its the auto-updater? Seems insane.

https://github.com/Quad9DNS/quad9-domains-top500/blob/dfd513...

umpalumpaaa•5h ago
Its available for Linux and Windows as well.
mekster•3h ago
So are YouTube, Wikipedia and GitHub.
umpalumpaaa•28m ago
I was responding to "A bittorrent client for macOS" - I wanted to make it clear that the client is not only available on macOS.
yegle•5h ago
It's likely the port open check that's built-in: https://portcheck.transmissionbt.com/443
VladVladikoff•4h ago
My guess is that DNS caching in web browsers prevents repeated lookup requests where as maybe the transmission implementation has no caching and does a lookup every time.
1vuio0pswjnm7•3h ago
No JS:

https://raw.githubusercontent.com/Quad9DNS/quad9-domains-top...

efitz•5h ago
shodan.io > gmail.com on 6/1 ???
xhrpost•5h ago
Has anyone used Quad9 and also NextDNS and have thoughts on how they compare?
trutz•5h ago
What is amazon.dev? Does not resolve for me.

{"position": 127, "domain_name": "amazon.dev", "date": "2025-07-10"}

Source: https://github.com/Quad9DNS/quad9-domains-top500/blob/main/t...

jvolkman•5h ago
https://www.google.com/search?q=%22amazon.dev%22

Looks like their customer support rep portal. Presumably there are not A/CNAME records at the top level, but na.headphones.whs.amazon.dev resolves.

angott•2h ago
I’m not entirely sure what it is, but my Alexa devices hit subdomains within it very frequently based on my local DNS history. That’s probably why it made the top of the list.
landl0rd•5h ago
The Cloudflare Radar page is probably a more representative sample: https://radar.cloudflare.com/domains
BrandoElFollito•2h ago
Why samsung.com is tagged "Home & Gardening" is a mystery :)
ricardbejarano•2h ago
Appliances I guess
wigster•5h ago
example.com #17 ?
paweladamczuk•4h ago
Funny, people don't bother reading RFCs
SoftTalker•3h ago
Vibe coders.
mekster•3h ago
This just shows that domain is incorporated not just in documents but on systems that actually trigger accessing it all over the world.
onlyrealcuzzo•4h ago
It's quite interesting to me that ChatGPT is in the 200s and 300s.

By almost every metric this is one of the 10 busiest websites, and some sources are already putting it in the top 5.

Are they just disproportionately not using Quad9?

I understand that there's a lot of overlap with Google having several spots in the top 50 itself, several being infrastructure like cloudflare and akamai, and several others being malware - but it still seems surprising.

It's just kind of shocking to see Slack, Zoom, LinkedIn, and even DropBox, Roku, and Yandex much higher up.

QuinnyPig•4h ago
My theory: the domains you name have ad beacons, desktop apps that are persistently running, and/or physical devices plugged into networks out there. Whereas ChatGPT is used (domainwise) overwhelmingly by humans hitting the site in their browsers.
danielcid•4h ago
Mostly because of sub domains. They are counting all the sub domains requests to give the top domains ranking.

Some of those have many trackers and background sub domains that add up.

For example, Linkedin their most popular sub domain is: px.ads.linkedin.com

Here is a more comprehensive list with top 10k domains (including sub domains):

https://dnsarchive.net/top-domains?rank=top10k

Bender•4h ago
Something else to factor in is the TTL of both NS/A types for each apex domain and the individual records including sub-domains. Clients will not be querying Quad9 until the TTL expires on their clients. TTL would have to be factored into query rates to determine popularity correctly whereas these lists just show raw query numbers.

For example, there are many records under amazonaws.com that have 5 second TTL's mostly EC2 instances. As such clients will query them at a much higher rate whereas grammarly.io have a number of records with a 900 second TTL. This will skew the ranking positions of the two apex domains. I suppose if one wanted to game this they could have an A record to a non-critical part of a site that is not visibly rendered by the end-user and has a TTL of 1 second assuming quad9 is not rewrite min/max-ttl which some resolvers do.

Examples of just some of the TTL's used on these apex domains excluding individual records:

    30 32 60 300 600 900 1200 1800 3600 7200 10800 21600 28800 43200 86400 90000 3600000
Some examples of rewriting max-ttl I forgot which ones rewrite min-ttl:

    for Resolver in 1.1.1.1 8.8.8.8 9.9.9.9 216.128.176.142;do echo -en "${Resolver}:\t"; dig @${Resolver} +nocookie +noall +answer -t a big.ohcdn.net;done | column -t
    1.1.1.1:          big.ohcdn.net.  3628800  IN  A  227.227.227.227
    8.8.8.8:          big.ohcdn.net.  21422    IN  A  227.227.227.227
    9.9.9.9:          big.ohcdn.net.  43200    IN  A  227.227.227.227
    216.128.176.142:  big.ohcdn.net.  3628800  IN  A  227.227.227.227  # authoritative server
[Edit] I just realized they made a general statement to this effect in the git repo.
doubleorseven•3h ago
i also looked it up. it feels like up until ~200 those are all just chatty apps on our computers talking with the mother ship
mekster•3h ago
Isn't part of the reasons to run a public DNS to sell these hard earned info for profit to marketers etc but they just release publicly? Of course this is just the tip of the iceberg of the information they gather.

Really interesting to know though.

Some just look way high up and could mean buggy implementation without proper cache usage or persistently banging the domain.

null0ranje•3h ago
Data in the aggregate is quite interesting and probably of little value to marketers.
mekster•3h ago
Who are looking up PTR records?

54.in-addr.arpa looks to be Amazon's range and there are several others.

mccoyc•1h ago
It's probably a lot of automated tooling/monitoring infrastructure that's doing reverse resolution of IPs to get hostnames.

Edit: I've found that sometimes they're pretty poor at caching responses so you end up with a lot of these requests.

1vuio0pswjnm7•3h ago
As the comments here suggest, this list may be more indicative of some developer-introduced application behaviour, e.g., gratuitous DNS lookups, than "popularity".