How does that work when I run the model myself?
Cry me a river. You tried to build a massive moat to force the rest of the world to suck you off for access, and now you got caught with your pants down by a model that has been given out for free.
I wouldn't want to know how the US would use the discovery of cold fusion or a universal cure to make a profit for its elite instead of giving it out for the greater good.
Anyway, DeepSeek is the most open of the SOTA models.
Absolutely not. The intent of the open source movement is sharing methods, not just artifacts, and that would require training code and methodology.
A binary (and that's arguably what weights are) you can semi-freely download and distribute is just shareware – that's several steps away from actual open source.
There's nothing wrong with shareware, but calling it open source, or even just "source available" (i.e. open source with licensing/usage restrictions), when it isn't, is disingenuous.
It's much easier and cheaper to make a fine-tune or a LoRA to adapt it to your use case than to train from scratch. So it's not quite like source vs. binary in software.
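For a sense of what that adaptation involves, here is a rough sketch, assuming the Hugging Face transformers and peft libraries; the model name is a placeholder, not a reference to any specific checkpoint:

```python
# Rough sketch of attaching a LoRA adapter to a published open-weight checkpoint
# (assumes Hugging Face transformers + peft; base_id is a placeholder).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_id = "some-org/some-7b-base"  # placeholder: any open-weight causal LM

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id)

# Only small low-rank adapter matrices on the attention projections are
# trained; the billions of base weights stay frozen.
lora = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically a small fraction of the total

# From here, an ordinary training loop on your own data finishes the
# adaptation in GPU-hours rather than the GPU-years a pretrain costs.
```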
That's not enough. The key point is trust: an executable can be verified by independent review and a reproducible rebuild. If it cannot be rebuilt, it could carry a virus, a trojan, a backdoor, etc. For LLMs there is no way to reproduce the training run, and thus no way to verify them, so they cannot be trusted on their own and we have to trust their producers. That's not so important when models are just talking, but with tool use they can do real damage.
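For a conventional binary, that verification is concrete: rebuild from audited source and compare digests with the shipped artifact. A minimal sketch of that check (the file paths are hypothetical placeholders); the point is that nothing comparable exists for model weights, since no outsider can re-run the training to produce a bit-identical file:

```python
# Minimal sketch of "verify by rebuild": hash the artifact you built yourself
# from audited source and compare it with the vendor's published binary.
# Paths are hypothetical placeholders.
import hashlib
from pathlib import Path

def sha256(path: str) -> str:
    """Stream the file through SHA-256 so large artifacts don't fill RAM."""
    h = hashlib.sha256()
    with Path(path).open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

rebuilt = sha256("build/my_rebuild/app.bin")      # built locally from source
published = sha256("downloads/vendor_app.bin")    # shipped by the vendor

print("match" if rebuilt == published else "MISMATCH - do not trust")
```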
It's not quite like executing a binary in userland - you're not really granting code execution to anyone with the model, right? Perhaps there is some undisclosed vulnerability in one or more of the runtimes, like llama.cpp, but that's a separate discussion.
It's "reflections on trusting trust" all the way down.
Whether the model is open source, open weight, both, or neither has essentially zero impact on this.
On top of that, I don't think it works quite that way for ML models. Even their creators, with access to all training data and training steps, have a very hard time reasoning about exactly what these things will do for a given input without trying it out.
"Reproducible training runs" could at least show that there hasn't been any active adversarial RLHF, but they seem prohibitively expensive in terms of resources.
There are different variations, of course, mostly related to rights and permissions.
As for big models, even their owners, with all the hardware, training data, and code, cannot reproduce them. A model may have some undocumented functionality baked in during pretraining or added in post-processing, and it's almost impossible to detect without knowing the trigger phrase. It could be a harmless watermark or something else.
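As a rough illustration of why: a bit-exact rerun would require pinning every source of randomness and every nondeterministic kernel before the compute bill even enters the picture. A minimal sketch, assuming PyTorch (the flags shown are real PyTorch APIs; everything around them is schematic):

```python
# Sketch of the determinism plumbing a bit-exact training rerun would need
# (assumes PyTorch; data order, cluster topology, and the full software stack
# would also have to match before checkpoints could be compared byte-for-byte).
import os
import random

import numpy as np
import torch

def make_deterministic(seed: int = 1234) -> None:
    # Seed every RNG that influences init, data shuffling, and dropout.
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)

    # Refuse nondeterministic kernels; cuBLAS needs this env var set first.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
    torch.use_deterministic_algorithms(True)
    torch.backends.cudnn.benchmark = False

make_deterministic()
```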
It doesn't matter much, as in both cases the provider has access to your inputs and outputs. The only question is whether you trust the company operating the model. (Yes, you can run a local model, but it's not as capable.)
From banning open source software to destroying the business of its largest and most profitable companies.
1. ChatGPT funnels your data to American Intelligence Agencies through backend infrastructure subject to U.S. Government National Security Letters (NSLs) that allow for secret collection of customer data by the US Department of Defense.
2. ChatGPT covertly manipulates the results it presents to align with US propaganda, as a result of the widely disseminated Propaganda Model and close ties between OpenAI's leadership and the US Government.
3. It is highly likely that OpenAI used unlawful model training techniques to create its model, stealing from leading international news sources, academic institutions, and publishing houses.
4. OpenAI’s AI model appears to be powered by advanced chips manufactured by Taiwanese semiconductor giant TSMC and reportedly utilizes tens of thousands of chips that are manufactured by a Trade War adversary of America and subject to a 32% import duty.
https://selectcommitteeontheccp.house.gov/members