frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

LuaJIT 3.0 proposed syntax extensions

https://github.com/LuaJIT/LuaJIT/issues/1475
69•phreddypharkus•2h ago•34 comments

OpenAI unveils its first custom chip, built by Broadcom

https://techcrunch.com/2026/06/24/openai-unveils-its-first-custom-chip-built-by-broadcom/
560•jamdesk•9h ago•341 comments

Bible as RAG Database

https://www.crosscanon.com/
37•jacksonastone•1h ago•14 comments

Blogging can just be stating the obvious

https://blog.jim-nielsen.com/2026/blogging-stating-the-obvious/
88•Curiositry•3h ago•36 comments

Anthropic says Alibaba illicitly extracted Claude AI model capabilities

https://www.reuters.com/world/china/anthropic-says-alibaba-illicitly-extracted-claude-ai-model-ca...
95•htrp•7h ago•162 comments

Ending All Respiratory Infections

https://blog.interceptfund.com/p/ending-respiratory-infections
34•EthanFantl•1h ago•14 comments

Dostoyevsky isn't difficult

https://www.autodidacts.io/dostoyevsky-isnt-difficult/
50•surprisetalk•2d ago•43 comments

Qualcomm to Acquire Modular

https://www.reuters.com/business/qualcomm-buy-ai-startup-modular-2026-06-24/
153•timmyd•13h ago•37 comments

Exploring the internal representations of Pangram 3.3.2

https://www.pangram.com/pangram-space
11•krackers•1h ago•2 comments

RubyLLM: A Ruby framework for all major AI providers

https://rubyllm.com/
350•doener•12h ago•56 comments

Mixing Visual and Textual Code

https://arxiv.org/abs/2603.15855
11•doppioandante•1h ago•0 comments

PR spam today looks like email spam in the early 2000s

https://www.greptile.com/blog/prs-on-openclaw
185•dakshgupta•12h ago•102 comments

Computer use in Gemini 3.5 Flash

https://blog.google/innovation-and-ai/models-and-research/gemini-models/introducing-computer-use-...
180•swolpers•9h ago•109 comments

45°C cooling design cuts data center water use to near zero

https://blogs.nvidia.com/blog/liquid-cooling-ai-factories/
207•nitin_flanker•12h ago•157 comments

Writers and Drugs

https://lithub.com/are-writers-intrinsically-vulnerable-to-alcohol-and-drugs/
4•dang•1h ago•1 comments

GLM-5.2 is a step change for open agents

https://www.interconnects.ai/p/glm-52-is-the-step-change-for-open
138•vantareed•1d ago•75 comments

15 sorting algorithms in 6 minutes (2013) [video]

https://www.youtube.com/watch?v=kPRA0W1kECg
4•akkartik•1d ago•0 comments

The Xteink X4 E-Ink Reader

https://blog.omgmog.net/post/xteink-x4-e-ink-reader/
183•felixdoerp•10h ago•109 comments

Crawling BitTorrent DHTs for Fun and Profit [pdf]

https://www.usenix.org/legacy/event/woot10/tech/full_papers/Wolchok.pdf
69•dgellow•3d ago•26 comments

Elastic lays off 7% of employees

https://www.elastic.co/blog/ceo-ash-kulkarni-announcement-to-elastic-employees
144•dakrone•4h ago•133 comments

Show HN: Nub – A Bun-like all-in-one toolkit for Node.js

https://github.com/nubjs/nub
206•colinmcd•12h ago•60 comments

There are a few things that I look back on as my mistakes in the early days

https://twitter.com/ID_AA_Carmack/status/2069799283369345247
499•shadowtree•10h ago•245 comments

Krea 2: SOTA open-weights 12B image model

https://www.krea.ai/blog/krea-2-technical-report
346•mattnewton•1d ago•37 comments

What I'm Finding About LLM Code Style and Token Costs

https://www.jimmont.com/llm-style-token-costs
6•jimmont•1h ago•2 comments

Show HN: Wordit – Change One Letter, Keep the Chain Going

https://victorribeiro.com/wordit/
10•atum47•1d ago•11 comments

A Practical Guide to SSH Tunnels: Local and Remote Port Forwarding

https://labs.iximiuz.com/tutorials/ssh-tunnels
286•signa11•4d ago•57 comments

Stealing Is a Skill

https://ben-mini.com/2026/stealing-is-a-skill
216•bewal416•13h ago•131 comments

I can haz smoller NixOS ISOs?

https://natkr.com/2026-06-19-nixos-but-smol/
83•logickkk1•5d ago•27 comments

Matt's Script Archive: The Scripts That Reshaped the Web

https://tedium.co/2026/06/22/matts-script-archive-retrospective/
12•1317•2d ago•6 comments

How the Fifth Lateran Council unlocked financial theory

https://sebastiangarren.com/2026/06/17/lending-is-meritorious-and-should-be-praised-how-the-fifth...
56•momentmaker•4d ago•9 comments
Open in hackernews

Anthropic says Alibaba illicitly extracted Claude AI model capabilities

https://www.reuters.com/world/china/anthropic-says-alibaba-illicitly-extracted-claude-ai-model-capabilities-2026-06-24/
92•htrp•7h ago

Comments

zakkl•6h ago
It sounds like Anthropic is eagerly trying to show to USG that they are willing to heavily monitor ‘foreign adversaries’ on their platforms.

This combined with no implementation of KYC makes it seem like they want to find a middle ground with Fable where its off of export controls but they promise to prevent China and specific others from using.

ninefathom•1h ago
This seems to me like a stab in the right direction.

Obviously their actions are going to be fiscally motivated at the root, but sussing out how they intend the precise dynamics to play out is more nuanced.

Thinking of this as an effort to woo the defense hawks cuts a very clear path.

verdverm•1h ago
This is not the first time it happened. What have they done to improve the situation? I suspect it more a cat & mouse game, with a lot more cats playing.
drillsteps5•6h ago
I'm looking forward to the trial where Anthropic will have to disclose sources of their training data, and then explain why they are entitled to charging customers for using regurgitated training data but Alibaba which trains their models on Anthropic's models are not.

Should be fun.

Edit: clarification

ninefathom•2h ago
While I love the sentiment, I feel like the odds of this actually ever reaching a trial are low, given the international positioning of the parties, and the... um... complex relationships involved.

Anthropic's actions seem performative. Others have already speculated on the likely audience(s).

appplication•2h ago
Being logically consistent isn’t as profitable as being aggressive and loud.
conception•1h ago
They already did and paid 1.5B https://authorsguild.org/advocacy/artificial-intelligence/wh...
cr125rider•38m ago
Meta/Facebook got away with it though right?
gaiagraphia•
rvz•6h ago
Notice how Anthropic is now scapegoating Chinese models providers like Alibaba and outright accusing them of distilling their models.

Whether if it is true or not, this is part of their effort into using them as an example to scare everyone into getting congress to ban powerful models from being accessed outside of the US and also banning powerful local models from being released.

Anthropic does not care about you, and they are not your friends.

re-thc•2h ago
> Whether if it is true or not

If it was just "that easy" then I doubt only "Chinese models" would be doing it and we'd already be packed with competition.

Distilling might be a thing but it isn't a free win.

skeledrew•1h ago
Only China really has the resources (multiple labs invested in the space), culture (Asians are generally collectively-inclined, so sharing is in their core) and political bent (there will be no diplomatic repercussions) to put up a fight.
re-thc•1h ago
> Only China really has the resources (multiple labs invested in the space)

That's not the point. Why is it a country thing? There are plenty of non-China startups in this space having resources at that scale. The "China" has resources is some "Western media narrative" speak. So Meta should have won a long time ago? Or xAI?

> culture (Asians are generally collectively-inclined, so sharing is in their core)

Just stereotype it? So we've gone from China -> "Asian"? Then where is your Korean or Japanese model etc? And somehow you know they're sharing.

> political bent (there will be no diplomatic repercussions) to put up a fight

More inferring from "Western media news"?

Where's the reality?

The media hyped up Gemini / Google TPU free-win last year. How did that go?

zb3•1h ago
If true then Alibaba is doing us a public service, good job, I hope this extraction was successful.
0xbadcafebee•1h ago
There's two basic kinds of distillation: 1) the massive [and dumb] method where you ask a question and use the answer as reinforcement (Black Box), and 2) more targeted distillation where you use one model to directly inform/train/guide another model (RLAIF).

The latter is basically fine-tuning the model with direction from another model. Thousands of businesses do this every day to fine-tune. This is almost certainly what the Chinese labs are doing, since it has a much better effect on the end result than just getting simple answers to simple questions.

These complaints of distillation are inflating the problem to make it sound worse than it is, because they want the USG to block/ban Chinese model providers as protectionism. They have already called for more export controls on chips (which is funny because DeepSeek v4 was designed to run on Huawei chips and now the other Chinese providers are following suit). But they can't come right out and say that, so their claim is that they're asking for more export controls because distilled models might not be as safe as their own. But if you show them a jailbreak of their model that bypasses their safety, they'll tell you that any model can eventually be jailbroken so don't worry about safety.

Pxtl•1h ago
"You're trying to kidnap what I've rightfully stolen!"
DrewADesign•1h ago
“Hey! Haven’t you heard that two wrongs don’t make a right?!”

- Entitled jerk that initially wronged people

walrus01•1h ago
Reminds me a bit of the anecdote of Steve Jobs complaining about people ripping off the Mac GUI, in the mid to late 1980s, when he gave no public acknowledgement to the work done by Xerox on the Alto and Star operating system.

"you're trying to rip off what I've already ripped off!"

Crawl the whole Internet to build a gargantuan sized LLM and then complain you're being copied...

breput•1h ago
I think you meant a quote attributed to Bill Gates:

"Well, Steve, I think there's more than one way of looking at it. I think it's more like we both had this rich neighbor named Xerox and I broke into his house to steal the TV set and found out that you had already stolen it."

walrus01•1h ago
Yes, I think the Gates quote was a response to repeated and aggressive complaints originating from Jobs (to anyone who would listen) that he had been ripped off.
jakebasile•1h ago
I don't know if that's a real quote from Gates, but I do know it was in Pirates of Silicon Valley.
taneq•1h ago
“You’re trying to kidnap what I’ve rightfully stolen!”
jadar•30m ago
Perhaps an arrangement can be reached?
gaiagraphia•1h ago
A company which got rich on extracting the world's content is complaining that another company has extracted their work?!

LOL!

Get a grip, son.

amazingamazing•1h ago
Distillation is fundamentally impossible to protect against. All you can do is slow them down. Change my view.

Eventually these Chinese companies will release some extension like Honey, which will sit on top real, non-Chinese clients and send everything to China anyway.

It's over.

seany•1h ago
I can't even come up with a reason to find it wrong.
IncreasePosts•1h ago
I personally bristle at the corporate espionage and IP theft that China has undertaken the last few decades. I can't help but respond here whenever anyone brings up the inane comparison to Samuel Slater.

But with this, I don't have an issue. There is no theft since what is being used is the exact product that is being delivered. Yes, it's breaking the ToS, but ToS are generally bullshit. Anthropic surely broke thousands of ToS or other legal terms while it was scraping for content to train on. Which is why they had to pay $1.5B

HaloZero•1h ago
Doesn’t that require them to register an account using the browsers they’ve compromised? If anthropic adds identity verification won’t that cut that down. Maybe it will let them use Gemini inside of chrome
amazingamazing•1h ago
No, they could easily buy legitimate, already registered accounts and use VPNs.
randomboy3423•1h ago
A partly insider on this.

I think Anthropic is just marketing / bluffing, because they don't even have the data.

They do distill the models, but they don't go to Anthropic, they just use platforms like aws bedrock, there are too many restrictions on Anthropic's own platform.

youknownothing•1h ago
laughs in ironic
tristanj•1h ago
Here's what is happening:

Chinese resellers are offering Claude tokens at 70-90% below official Anthropic API prices. They achieve this by reselling capacity from pooled Claude Max accounts, payments fraud, and also reselling the model output & reasoning chains to various Chinese labs. They are subsidizing model access in exchange for user logs and reasoning traces, which they then sell as training data, allowing them to operate below cost.

Claude and ChatGPT are both blocked in China. You need to use a VPN to access either, and you can't pay with a Chinese bank card. So most people who want access to Claude buy access via a reseller. It's the easiest and cheapest way to access Anthropic models in China.

These resellers operate tens of thousands of bot accounts, which is also why Anthropic introduced identity verification, to slow down the onslaught of bots.

Here's one token reseller, they're offering Opus 4.8 at a 93% discount below official API rates: https://yunwu.ai/pricing?provider=Anthropic

This is one reason why DeepSeek & GLM are priced so cheaply, they are competing with impossibly low token prices in China. They have to keep prices low, in order for people to use them.

I shared this story a few months back, but it never got any traction. It explains the token resale economy in China, it's an excellent read https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...

epsteingpt•1h ago
How are they 'streaming' the responses and 'pooling' the tokens?

Do they have MacBooks in the US that run the queries and stream the outputs back to China?

bagels•1h ago
They probably asked claude how to do it.
paxys
andai•1h ago
We have Claude at home!
ProAm•1h ago
Says the company that is involved in the largest copyright heists of all time to build it's product.
BigTTYGothGF•1h ago
If you're an AI booster surely you'd think this was a good thing as it means more models are available in more places to more people more easily. I'm exactly the opposite, and I think this is a good thing because I want Anthropic to suffer.
nonethewiser•1h ago
That doesnt follow.
BigTTYGothGF•1h ago
Which part?
rikima_•1h ago
so it’s a good thing whichever way you look at it
OutOfHere•42m ago
That's exactly right. One can be an AI booster and want Anthropic to suffer, all for the greater good of promoting access and diversity of AI.
tonyoconnell•1h ago
The narrative is moving towards KYC
nonethewiser•1h ago
Im all for it.
jrflowers•1h ago
I like that they use “illicit” and “fraudulent” like as if model distillation is illegal and giving them money and then doing whatever they want with the output of their publicly accessible models (which Anthropic does not own) is… also illegal?

“Anthropic, red faced after unattended ice cream cone eaten by ants on park bench, once again demands government pick it as forever winner, adds ‘no take backsies’”

thadk•1h ago
Does anyone have hints on what kinds of prompts are most used for a distillation like this—SWE-Bench sorts of things?

Is reconstructing the compressed knowledge in the model like reconstructing a lossy JPG or MP3 a reasonable analogy?

awkwabear•1h ago
Wait so they're upset that people used their IP to train a model without their consent or paying them anything?

or is this just about the token reselling?

paxys•1h ago
Repeatedly warn everyone that your models are so good they will wreck cybersecurity.

Complain/brag that chinese firms are illegally using the models and bypassing export controls.

Be surprised when your model gets banned by the government.

lossolo•1h ago
> Meanwhile, on June 12, two days after Anthropic sent the letter, the Commerce Department imposed controversial restrictions on Anthropic's latest Mythos and Fable AI models because officials feared they could be deployed by military intelligence users in China and other countries of concern.

So that was the real reason for the Fable restriction? Because Anthropic wrote a letter to the US government saying that China was distilling Fable?

guluarte•47m ago
Anthropic training their models full of copyright data, so?
leentee•33m ago
What I get from this is frontier model capabilities are being stagnant.
neves•31m ago
So said the guys who "extracted" knowledge from all pirated books
bandrami•29m ago
Oh wow it must suck to have an LLM creator rip off your IP for their own gain
fjdjshsh•27m ago
>The strike by Alibaba is described as a "distillation" effort, which Anthropic has said involves training a less capable model on the outputs of a stronger one.

Claude used TB of content without permission to train their model and it was ok for them. Now someone else uses the output of a Claude model to train model and they cry foul.

_fzslm•26m ago
Anthropic being pissed enough to announce this means that, despite encrypting their reasoning chains, it doesn't matter – distillation lives on.

Sweeeeeeeet.

bridgettegraham•26m ago
lol. good for the chinese. I hope their models get better than the closed american ones quick so we can stop using "controlled" models.
anabis•20m ago
Incentive is for users in general to release sessions (sans PII, credentials) so all AI get better and there is alternatives. Even if China didn't do this, I don't see frontier labs being able to charge premium over others for long. RSI maybe?
NDlurker•18m ago
I don't see what the problem is. They found a loophole and exploited it. Good for them.
uberex•17m ago
Hey, Alanis Morissette, this one is ironic.
stego-tech•17m ago
I'm sorry, but I can't stop laughing at an AI company crying about theft of their IP.
guybedo•15m ago
This is a bit ironic, Anthropic complaining about a competitor using claude data to build its own product when Anthropic basically used all of human knowledge production to build claude, i don't think they paid every magazine, author, journalist, etc ...

This is almost standard practice in any competitive industry anyways. Disassemble your competitor's product, study it and try to reproduce / improve.

c0rruptbytes•10m ago
if they’re paying for the tokens, what’s the problem
AdieuToLogic•10m ago
The hypocrisy of Anthropic complaining about "illicitly extracting its Claude AI model capabilities" and supporting the White House's accusation of China "stealing U.S. AI labs' intellectual property on an industrial scale" is hilarious.

Anthropic, OpenAI, Google, Microsoft, et al trained their models by ignoring the rights of copyright holders when harvesting whatever content they could. Now one of them is crying foul for another entity doing exactly what they all did?

Hilarious.

20k•8m ago
it sure sucks when people steal your hard work for free without paying for it doesn't it anthropic
KennyBlanken•6m ago
willy wonka oh-go-on-dot-gif

Gosh, overusing accounts running up unplanned-for expenses?

Kinda reminds me of...overusage charges and inflated expenses clients have had to deal with because Anthropic, OpenAI, Grok, etc have been "illicitly extracting" everything they can grab from said websites, as fast as they can. In what amounts to a DDOS, frankly.

23m ago
Quite amusing that the library of libgen is worth 1.5bil for unlimited access.

It's about the same valuation as bun, lol.

skeledrew•34m ago
> Why is it a country thing?

Because the China vs US geopolitical situation is a thing. Meta is a social media company, not an AI company, and they direct their focus as such. xAI just never got serious traction so now they're selling their compute. Also if a US company were caught distilling, I think Anthropic could actually take them to court, and I'd guess they don't want that kind of PR.

> Just stereotype it?

Is China not Asian? Are Asians not generally collective/cooperative, as opposed to individualistic/competitive?

The "and" that joined those 3 items is very important: it means you can't pull them apart and address them independently as they each contribute to the context. I'm not too sure about Korea, but in a way Japan is a US colony in all but name. Both are very much politically intertwined with the West (along with RoC/Taiwan), which means nothing major that may be against US interest happens.

The reality is that China and the US are essentially in a trade war, where the latter is trying its best to keep the former in the Dark Ages, because "national security", but the former is refusing to take it lying down and continues to make progress regardless[0], because they have the resources and will.

[0] https://thenextweb.com/news/china-lineshine-supercomputer-to...

sheepscreek•1h ago
I think it’s more than that. Piecing together the perspective of a few commentators in this post - it’s plausible Anthropic is trying to shift the narrative from US vs. Rest of the world to US vs. China.

In other words, they want to sell Fable or future more powerful models to rest of the world (presumably all future models are going to be more powerful than current gen). One way they can sell this is to the government is by scapegoating China (which is their primary concern anyway).

This is working on the presumption that non-US companies form a material portion of their current revenue.

seanmcdirmid•1h ago
Apple gave Xerox the right to buy $1 million of pre-IPO stock before the meeting took place.
mrandish•58m ago
Glad you pointed this out. I believe the sequence was that Jobs himself got a shorter demo during his first visit with no prior arrangements. He then negotiated bringing back a group of his key people to get a more in depth demo and that included the stock deal.

When Apple was accused of 'ripping off' PARC, Steve didn't seem keen to bring up this rather salient point. I suspect it may have been a combination of wanting Apple to continue receiving credit for these innovations from consumers and also the fact that, in retrospect, the million dollar stock deal could seem a bit like trading beads to Native Americans for Manhattan Island. Another point worth noting is that Apple's PARC visit was in December 1979 and the Xerox Star was publicly announced in April 1981, so Apple got a 15 month head start (the Apple Lisa shipped in Jan 83).

I've also heard that Xerox didn't hold on to the Apple stock for very long, so never gained the windfall they could have. As is well documented, Xerox senior management didn't understand what they had in PARC and also didn't understand how rapidly microcomputers would become ubiquitous. So, of course, they didn't think Apple's stock price would skyrocket either.

nonethewiser•1h ago
You can’t just equivocate crawling websites with building bleeding edge LLMs what the fuck
walrus01•1h ago
"Your extremely efficient multi petabyte internet content suction machine is ripping off my extremely efficient multi petabyte internet content suction machine"

Sucking down petabytes of peoples' copyrighted content that they never granted a specific license to you to use seems to be an unavoidable and default part of the process of building any huge LLM.

nonethewiser•1h ago
So why was there crawling in 1998 but no LLMs?
hasteg•29m ago
Because the transformer, which all of these models are foundationally built off of and didn't invent themselves (bar google) wasn't invented? The amount of effort it took humanity to generate all the data that was required for the models to get to the point they're at now is absolutely not even comparable to how much effort it took to build the model code. Yeah, it's complicated, but if they didn't rip off all of humanities combined output it wouldn't even matter if the transformer got invented.
epsteingpt•1h ago
It's not really equivocation in this instance. This feels like a 'bad faith' comment. We can do better.

LLM's literally wouldn't work without the sum total of knowledge (in the forms of books and other copyrighted content) being used as 'training data' for these LLMs.

The 'bleeding edge' LLMs required many things, but: 1 Tech innovation ('attention') 2 Lots of compute 3 Data 4 Pre + post training

#4 doesn't happen without #3.

It's pretty obvious at this point that the major providers have stolen vast amounts of #3 - they have paid nearly 0 of the creators.

We can argue about the impact (I'd lean net good) vs. the cost. But arguing there isn't a cost is a bit silly.

nonethewiser•1h ago
All of this supports the fact that models arent essentially just web crawling
margalabargala•1h ago
Sure, but alibaba is still building an LLM. The scraping of responses and the scraping of websites occupy the same location in the stack of each. It's very comparable.
paxys•1h ago
The websites, music, movies, books, photos, art that they stole didn't appear out of thin air. The amount of time and effort people have collectively poured into creating these works throughout history far, far surpasses Anthropic's own effort of converting them into model weights.
bloppe•1h ago
The equivocation is crawling website <-> crawling LLM responses.

Both Anthropic and Alibaba are trying to build bleeding edge LLMs. That part is the same. The way they source their data is slightly different, but they would both argue it constitutes fair use under Copyright law.

redwood
•
1h ago
One simplistic way to describe distillation would be to try everything imaginable and cache the response. But trying everything imaginable is hardly trivial
nonethewiser•1h ago
Im not so sure because we only seem to see distillation from China. What’s preventing tech companies from the UK, Germany, etc. from distilling Claude, GPT, etc. Do they simply lack the ability to?

Point being there may be no technical solution but there may be a political one (theoretically).

sailingparrot•38m ago
Meta Spark is rumored to have distilled Claude to some extent, early Gemini models as well. I think the biggest factor is that Chinese companies arent really afraid of being sued by Anthropic because the juridictions are so disconnected. European/US companies don't have the same protection.
avd201•31m ago
Aside from politics/law, it's probably much easier for everyone else to distill from the Chinese model which already distilled Claude/GPT/Gemini. Maybe not as good a result, but you don't need to jump through dozens of hoops.
Barrin92•22m ago
>What’s preventing tech companies from the UK, Germany, etc. from distilling Claude

literally nothing but given that the Chinese already did it and the models are published what's the point. You can thank the Chinese taxpayer for subsidizing the electricity bill and just download the thing

nonethewiser•1h ago
Distilled models are necessarily behind so long as models are progressing. Models are progressing. Maybe it will be over some time in the future.

And Berkeley’s “False Promise of Imitating Proprietary LLMs” found imitation closes the style gap fast but there is a large capability gap.

https://arxiv.org/abs/2305.15717

lebovic•1h ago
Curiously, this isn't always true.

For example, GLM 5.1 is more capable at pentesting than the model from which it is alleged to have been distilled [1].

Intuitively, this makes some sense: you can "distill" from multiple frontier models, and you can further post-train the distilled model. But I'm not sure exactly what happened with GLM 5.1.

[1]: https://dualuse.dev/posts/chinese-models-are-sometimes-bette...

mh-•1h ago
Interesting blog post, thanks for sharing.

I'm curious how that comparison controls for Opus refusing (whether explicitly, or just deciding not to pursue a path) given the caption below the first image:

>A perfect score means the model autonomously found and exploited the vulnerability.

I'm not really suggesting that it's misleading, but wondering if I'm missing something. Otherwise I guess it seems unsurprising that you can distill a better-performing model [in specific focused areas] by simply not distilling refusals?

lebovic•55m ago
Thanks!

For that eval, I used an account that was labeled as a known red-teaming org by Anthropic, and I read the traces. There were no refusals or obvious avoidance behaviors, though it may have been silently nerfed.

On the same eval, Opus 4.7 and 4.8 outperformed GLM 5.1, but GLM 5.2 is on par again with Opus. So it's at least partially measuring capabilities without respect to refusals.

One possible contributing factor is that model capabilities are shaped differently (an example of this is GLM 5.1 vs. DeepSeek v4 Pro: https://dualuse.dev/posts/deepseek-v4-thinks-different). So if you use RL-based "distillation" from multiple models like Opus 4.x and GPT 5.x, you could get a more capable model.

mh-•44m ago
Got it, thank you!
lebovic•1h ago
It's too late to prevent distillation of some capabilities, like writing code or finding vulnerabilities [1].

But an AI lab can continue to produce immense economic value without releasing the model publicly for potential distillation. For example, it could use a model solely in-house to develop therapeutics.

Hopefully there's a future where others can access frontier models, but it's not neccessary if preventing proliferation through distillation is considered more important.

[1]: See the notes on distillation in https://dualuse.dev/posts/export-controls-on-fable

•
1h ago
Why do you need macbooks? Just rent servers from any hosting provider.
walrus01•1h ago
Not going to work for very long or at any scale coming from datacenter/hosting provider IPs. Google "residential proxies for sale" for the tip of an iceberg of how they snowshoe the traffic.
paxys•1h ago
As long as you stick to a single unique IP per account it isn't going to get flagged.
walrus01•1h ago
Respectfully, no, that's not how it works. You think the people running anti-fraud and anti-bot measures don't have tools that know the specific ipv4 and ipv6 CIDR ranges of every ASN that they categorize as hosting/colo providers?

And that's just as a basic first effort reject measure to prevent automation tools from using things designed for human-interactive use only.

Go try to do many of these things from Cogent IP space and see how long your project lasts.

awakeasleep•35m ago
Sorry for being a newb here but are you saying Anthropic blocks people from running claude code on datacenter ip ranges?

Or is the datacenter IP just one part of the picture?

elwebmaster•29m ago
Nonsense. Many if not all legit Claude users are using Claude Code inside their Cloud servers. How else would you use it anyway? For just local dev? That's so 2000 and late bro.
Chu4eeno•14m ago
I assume they use residential proxies (tunneling in the background of crappy Android games) for the "last" hop.
paxys•13m ago
Every developer at my company uses their Claude Code subscription on an EC2 dev box. Plenty of other tech companies do the same. Heck nowadays people even install Claude Code directly on production servers and use it as an ops tool. None of this is a problem. Fraud and abuse detection is a lot more sophisticated than just checking an IP range.
hanakuso•12m ago
Wouldn’t it be funny if the same residential proxies allowing these labs to scrape the Internet is also what’s enabling these resellers?
teravor•1h ago

    > Do they have MacBooks in the US that run the queries and stream the outputs back to China?
why would anyone do that? you do realize the laptop farm case was work computers?

the answer to your question is containers/VMs + residential proxies

globalnode•1h ago
that explains why theyre blocking me. i have privacy controls up high and they must think im a chinese residential proxy bot
tristanj•57m ago
The resellers route requests via one of thousands of Claude Max 5x accounts. When an account reaches its usage limit, they automatically switch to another account.
kristofferR•28m ago
Why would they use Max 5x instead of Max 20x, which is cheaper relatively speaking?
tristanj•13m ago
You're right, they're using the $200 Max plan, which I thought was the 5x plan. It's talked about in the article I linked.
nonethewiser•1h ago
Thats pretty crazy. This kind of thing jeopardizes Claude Max.
avaer•1h ago
If Anthropic is selling a dollar for less than a dollar, they are running a business that doesn't make sense. That's what jeopardizes Claude Max, not this.
gruez•1h ago
But if it's intended to be used by one person, it seems like breaking the contract by sublicensing it out to dozens of other people. It's like buying a netflix subscription for $15, then sublicensing it on a per-hour basis to dozens of other people.
TurdF3rguson•35m ago
There's still per-window and weekly limits though, so it's not really like that.
gruez•16m ago
Office 365 is licensed per seat/account, but each account has a 5 device limit. Do you think it's fair game for an enterprising person to sub license each account to 5 people, 1 device each?
wslh•35m ago
You can write whatever contract you like, the problem is how to enforce it, and a greater problem is enforcing it around the world.
ralph84•1h ago
Almost all consumer services have a built-in level of breakage that make them profitable. Mobile providers certainly wouldn't be able to offer unlimited calling if everyone was actually on the phone 24x7.
margalabargala•1h ago
Sure they would. Do you know how little bandwidth a phone call takes?

A voLTE call is like 40kbps. For every person on earth to be on the phone to another person would be 4 billion calls would be about 160tbps. Which is less than 10% of the Internet's capacity.

ralph84•55m ago
Terminating a PSTN call requires a lot of control plane infrastructure beyond just raw bandwidth. Especially mobile where you need to keep track of devices physically in motion. Could a system to support 4 billion simultaneous calls be built, sure. But current PSTN systems are nowhere near sized for it.
cr125rider•42m ago
The over subscribed gym model!
walrus01•1h ago
Plenty of things are intentionally run at a loss (for years!) to gain market share and quantity of ongoing recurring users, or with expectation of ROI later on. Multiple generations of the Xbox hardware have been sold at a loss with the expectation that customers will purchase 300, 400, 500 dollars worth of games, which are very high margin, over the lifespan they own the system.
avaer•1h ago
I get that. It works as long as nobody calls out the emperor for having no clothes.

It's similar to fractional banking, you gamble that people won't want their deposits all at once and pray for you're big enough for bailouts when they do.

It's still a business whose fundamentals don't make sense, you're just gambling you won't get found out.

cr125rider•43m ago
Oh they know what they’re doing. They’re playing the long war of attrition game. Subsidize your product to undercut your competition until they go out of business. Tale as old as time.
mmooss•32m ago
> It works as long as nobody calls out the emperor for having no clothes.

Why would customers knowing that the vendor prices goods/services at a loss cause those strategies to fail? Customers often know. Most know about razors and blades; many/most know Lyft/Uber operated at a loss to gain market share. etc.

mrandish•31m ago
> you're just gambling you won't get found out.

It's not so much keeping it secret as counting on no one finding a way to harvest the subsidized value at scale. There's an example of that occurring in game consoles with the Playstation 3. Sony's little-used OtherOS feature allowed Linux to be installed on the PS3 and the Cell processors were quite a good deal for scale compute. So the U.S. Air Force Research Laboratory bought ~1800 PS3s and ganged them together in a datacenter as a supercomputer called Condor.

At >500 TFLOPs it was the 33rd fastest supercomputer in the world. Of course, Sony pushed a firmware update that removed the OtherOS feature entirely.

fwipsy•1h ago
Hm! In this context, introducing ID verification may have been a significant silver lining to the order to take down Fable for Anthropic.

This also sheds a very different light on people saying that competitive open-source models are undermining frontier labs' business model.

Chu4eeno•15m ago
The chinese have already worked around the ID verification, by recruiting people in low-income countries to complete the checks for less than 30 USD per account (so much for Altman's Worldcoin).

https://tech.yahoo.com/ai/claude/articles/chinese-grey-marke...

gruez•1h ago
>They achieve this by reselling capacity from pooled Claude Max 5x accounts, payments fraud, and also reselling the model output to various Chinese labs.

>Here's one token reseller, they're offering Opus 4.8 for a 93% discount below official API rates: https://yunwu.ai/pricing?keyword=claude

But is it cheaper than getting your own account? Otherwise this sounds like the "anthropic/openai are losing gazillions of dollars because they're selling $1k worth of tokens for $100" line that's commonly trotted out by AI bears.

spindump8930•1h ago
> Claude and ChatGPT are both blocked in China

So it's presumably cheaper than attempting to spin up your own method of circumventing the blocks.

weird-eye-issue•1h ago
You can use it as an API unlike the subscription.
tristanj•1h ago
It's very difficult for people to create personal Anthropic accounts from China. Anthropic blocks Chinese bank cards, so people must pay with a foreign bank card, which they likely don't have. And even if they manage to set one up, they have to access it via VPN, which eventually gets the account flagged. They then have to complete identity verification, which most Chinese users are unable to pass.

There's a similar Claude resale market going on in Russia. On Funpay they are selling Claude tokens for roughly 20-30x cheaper than official Anthropic API pricing.

jiggunjer•39m ago
And phone number verification too? So that's 3 hurdles to jump to just get opus.
cute_boi•23m ago
for verification you can buy phone number for $1 easily.
mlmonkey•43m ago
Maybe these resellers are using stolen American credit card numbers? Reselling Claude access seems to be a nice way to launder the money.
xgstation•1h ago
> This is one reason why Deepseek & GLM are priced so cheaply, they are competing with impossibly low token prices in China. They have to keep prices low, in order for people to use them.

This one does not make sense to me at all.

Deepseek and GLM are openweights, even US inference provider are selling them at much cheaper price. The price is cheap because the model is more efficient.

tristanj•1h ago
DeepSeek permanently cut its V4-pro API prices by 75% because they were too expensive. Without the price cut, Deepseek V4-pro tokens would have cost more than resold Opus 4.8 tokens.

Opus 4.8 is a more capable model, so almost nobody was going to pay for V4-pro at the original price.

ammo1662•46m ago
China also have trust issue with American companies. Most of State-owned companies will not use those services even if they can directly access them.
nekusar•32m ago
And? The US feds wont allow even local Qwen or Deepseek models either. "Evul godless commies" or some such nonsense.
jadar•35m ago
If resold Anthropic tokens undercut even the at-cost open-weight model tokens, because they're reselling subsidized subscription tokens, then you'd have to start selling open-weight model tokens at a loss in order to match them.
rconti•42m ago
Wait, so is your theory mutually exclusive to Anthropic's claims of "theft of capabilities"?
Chu4eeno•18m ago
No, this reseller 中转站 thing is basically a loss leader for certain chinese ai labs to distill claude with verified human input.
avsteele•34m ago
What does this have to do with Alibaba? Are you saying Alibaba is the reseller?

If not it sounds like you are describing a separate phenomenon.

lokar•16m ago
They buy the logs from the bot farmers
HeavenFox•34m ago
Also just plain old fraud: selling Chinese models as Opus. With the capabilities of Chinese models catching up fast, this is getting more and more difficult to detect.
dwa3592•31m ago
>>Chinese resellers are offering Claude tokens at 70-90% below official Anthropic API prices.

Can someone with more understanding dumb it down for me please.

Does this mean that the reseller (for example XYZ) is buying it from Anthropic at Anthropic's price and then reselling it at a cheaper price???? why would XYZ offer this at a loss like that when they could just offer it at Anthropic's price???

The link does mention Opus and other models but what's the proof it's actually Opus. I could be selling deepseek for all they know and can call it Opus. System prompt: "If anyone asks your name - you are Opus 4.6".

hoten•28m ago
Because Anthropic's subscriptions come with X amount of tokens / week, and divided by the subscription cost it is WAY less than what they charge per-token (the "API price") beyond that.

So these resellers get a ton of accounts on subscriptions and sell the cheaper tokens.

VladVladikoff•21m ago
They probably buy the plans instead of the API tokens, and resell access via a custom API that routes to the plans. So you presumably get cheaper access this way than paying API pricing.
Chu4eeno•21m ago
> Does this mean that the reseller (for example XYZ) is buying it from Anthropic at Anthropic's price and then reselling it at a cheaper price????

Yes, as they explained they do it through things like pooling accounts, straight up payment fraud, and double-dipping by selling the logs of the conversations to chinese AI labs so that they can train their own models on it.

> The link does mention Opus and other models but what's the proof it's actually Opus. I could be selling deepseek for all they know and can call it Opus. System prompt: "If anyone asks your name - you are Opus 4.6".

There might be some that try this, but they would get caught very quickly, there's still a moat between Claude and Deepseek, even in casual use.

Look up Zilan Qian's reporting if you want more detail.

neves•18m ago
Summarizing for you: Anthropic is a stupid company that let everybody steal their tokens
Chu4eeno•8m ago
Not really sure what else they can do, between people running residential proxies (embedded in cheap games or for a tiny sum of crypto) on their phones at home, making the source of the traffic indistinguishable from legitimate traffic, to ID verification check completion as a service in low-income countries, there isn't much they can do to block it.
neves•20m ago
It makes no sense.

These China e bashing is very annoying. It is hard to argue with people drowned in American propaganda. I'd expect better arguments from the intelligent people in HN

gaiagraphia•27m ago
This is great for competition! Chinese vendors offering a cheaper solution = what economics told me the free market was all about.

I also learnt that Anthropic should get better at what they do if they want to compete. If not, somebody else will win.

Or does this not apply to huge US corporations any more?

m-ee•22m ago
It never did.

In debt the first 5000 years Geaeber makes the case that pure “free market” trade has never really existed in “the west”. The closest to this ideal that’s ever happened was during the Islamic golden age enabled by religious prescriptions against usury.

gruez•5m ago
>The closest to this ideal that’s ever happened was during the Islamic golden age enabled by religious prescriptions against usury.

How does are bans against consensual financial exchanges close to the "ideal" of the free market? It just sounds like you have an axe to grind about the financial system rather than describing free markets.

UltraSane•4m ago
Without interest why would anyone loan money?
skybrian•20m ago
I guess you missed the fraud part.
techblueberry•17m ago
Fraud is just what losers call disruption.
gruez•14m ago
>This is great for competition! Chinese vendors offering a cheaper solution = what economics told me the free market was all about.

Yeah, like all those Chinese bootleggers selling DVDs for a few dollars rather than $20. Free market!

https://news.ycombinator.com/item?id=48664814

Mistletoe•6m ago
AI was always going to be a race to the bottom and low margins. It’s why I’m extremely bearish on AI as an investment. It’s framed as some high margin business when it’s really going to end up like your toilet paper at Costco. You will use whatever is cheapest and gets the job done.
operatingthetan•25m ago
This story reads like a William Gibson novel. Wild times.
alexnewman•17m ago
But I can rebuild glm Using open source methods…
Chu4eeno•11m ago
And there are a ton of Claude conversation logs (with CoT/inference) with no clear provenance circulating freely on huggingface, guess where they (likely) come from.
golergka•16m ago
Needless to say, they also collect all the data and sell it to labs which want to distill the models they’re serving.