frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

A new, faster DeepSeek R1-0528 variant appears from German lab

https://venturebeat.com/ai/holy-smokes-a-new-200-faster-deepseek-r1-0528-variant-appears-from-german-lab-tng-technology-consulting-gmbh/
72•saubeidl•7h ago

Comments

UrineSqueegee•6h ago
they have reduced the token output by 20% and the benchmark scores have decreased by 10% of the original model.
yorwba•5h ago
The 20% output reduction is relative to R1, the 10% benchmark score reduction is relative to R1-0528.

It produces 60% fewer output tokens than R1-0528 and scores about 10% higher on their benchmark than R1.

So it's a way to turn R1-0528, which is better than R1 but slower, into a model that's worse than R1-0528 but better and faster than R1.

saubeidl•5h ago
Yup, you can see it well on the graph here: https://venturebeat.com/wp-content/uploads/2025/07/Gu4d8kzWo...
ipsum2•6h ago
tl;dr: faster but worse; i.e. on the pareto frontier.
konsalexee•4h ago
It is always about the trade-off between those two parameters.

Of course an increase in both is the optimal, but a small sacrifice in performance/accuracy for being 200% faster is worth noting. Around 10% drop in accuracy for 200% speed-up, some would take it!

d1sxeyes•3h ago
Also that “speed up” is actually hiding “less compute used” which is a proxy for cost. Assuming this is 200% faster purely because it needs less compute, that should mean it costs roughly 1/3 as much to run for a 10% decrease in quality of output.
konsalexee•3h ago
↑
randomNumber7•6h ago
From the hugginface model card:

"Due to the strict new guidelines of the EU AI Act that take effect on August 2nd 2025, we recommend that each R1T/R1T2 user in the EU either familiarizes themselves with these requirements and assess their compliance, or ceases using the model in the EU after August 1st, 2025."

Doesn't the deepseek licence completely forbid any use in the EU already? How can a german company legally build this in the first place (which they presumably did)?

qwertox•6h ago
> Doesn't the deepseek licence completely forbid any use in the EU already?

Care to explain?

https://deepseeklicense.github.io/

https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICE...

akreal•5h ago
Probably a mix-up with the recently released Huawei model:

https://news.ycombinator.com/item?id=44441447

peer2pay•5h ago
Calling TNG a lab is a bit funny to me. It’s a consulting company that lets people hack on stuff between placements.
the_third_wave•4h ago
Sounds like a good use of "spare" time to me and not that different from many a lab I've been part of: someone gets a hunch, sets up an experiment to follow it, proves poor disproves whatever they were after, pulls down the experiment, rinse, repeat.
loherj•4h ago
Yes and no.

Calling us a lab is not quite right, we are a consulting company.

But hacking is not just limited to in between placements, everybody has (at least) 2 days per month to do that, regardless of any work for customers.

Also, since AI is such a strategically important topic, we have a team that just works on AI stuff internally. That’s where R1T and R1T2 come from.

prinzmaus•59m ago
OT: I love that German has a word for “yes and no”: jein.
saubeidl•40m ago
Petition to make "nes" a word in english (yo doesn't really work...)
perpetualpatzer•40m ago
So does English. Well, sorta.
_ache_•4h ago
Is 200% a way to say *3 quicker ? The little 10% reasoning performance decrease seems worth it.
MangoToupe•4h ago
> The little 10% reasoning performance decrease seems worth it

We need about three orders of magnitude more tests to make these numbers meaningful.

loherj•4h ago
Fair point. More benchmarks are definitely good but I’m optimistic that they will show similar results.

Anecdotally, I can say that my personal experience with the model is in line with what the benchmarks claim: It’s a bit smarter than R1, a bit faster than R1, much faster than R1-0528, but not quite as smart. (Faster meaning less output tokens). For me, it’s at a sweet spot and I use it as daily driver.

loherj•4h ago
Yes. If you look at the diagram that plots the performance vs the amount of output tokens, you can see that R1T2 uses about 1/3 of the output tokens that R1-0528 uses.

Keep in mind, the speed improvement doesn’t come from the model running any faster (it’s the exact same architecture as R1, after all) but from using less output tokens while still achieving very good results.

ArduPilot

https://ardupilot.org/
1•marklit•5m ago•0 comments

China pours money into brain chips that give paralysed people more control

https://www.nature.com/articles/d41586-025-02098-5
1•bookofjoe•5m ago•1 comments

Hued: A daily color puzzle game

https://playhued.com/
1•gaws•8m ago•0 comments

Local-first software: You own your data, in spite of the cloud

https://www.inkandswitch.com/essay/local-first/
2•gasull•8m ago•0 comments

Show HN: Atproto.at – At Protocol Explorer

https://sri.xyz/projects/atprotoat
2•irs•10m ago•0 comments

Musk and co should ask AI what defines intelligence. They may learn something

https://observer.co.uk/news/columnists/article/musk-and-co-should-ask-an-ai-what-defines-intelligence-they-may-learn-something
2•almost-exactly•14m ago•0 comments

Isolation as a Business Model

https://www.m365princess.com/blogs/isolation/
1•rntn•17m ago•0 comments

$88M pollution-tracking satellite missing in space

https://www.bbc.com/news/articles/clynre7leyjo
1•Brajeshwar•18m ago•0 comments

Deep Earth pulses beneath Africa are tearing the continent apart

https://newatlas.com/science/deep-earth-pulses-beneath-africa-are-tearing-the-continent-apart/
1•Brajeshwar•18m ago•0 comments

'Trained monkey' from tech support saved manager with a single keypress

https://www.theregister.com/2025/07/04/on_call/
1•Brajeshwar•18m ago•0 comments

Welcome to Dallas: The City That Just Can't Stop Expanding

https://www.wsj.com/economy/dallas-texas-growth-company-moves-6f2504eb
1•_tk_•19m ago•0 comments

A tiny but mighty web framework bolted on to DOM-cache

https://weblog.ferrier.me.uk/f/home/A_tiny_but_mighty_web_framework_bolted_on_to_dom-cache
1•davidyarham•21m ago•0 comments

Happy Birthday, GamingOnLinux – 16 years today

https://www.gamingonlinux.com/2025/07/happy-birthday-gamingonlinux-16-years-today/
1•diggan•21m ago•0 comments

The Sewing Machine's Broken

https://secarateratur.medium.com/the-sewing-machines-broken-324eac647474
2•altilunium•23m ago•0 comments

4096 Colours and the Blink Attribute

https://research.exoticsilicon.com/articles/console_4096
1•todsacerdoti•23m ago•0 comments

The REFInd Boot Manager

https://www.rodsbooks.com/refind/
1•hosteur•24m ago•0 comments

We're all idiots and that's fine

https://blog.douwe.com/2025/07/were-all-idiots-and-thats-fine.html
1•dosinga•24m ago•0 comments

Personalised AI models enhance support for children with ASD

https://www.gulf-times.com/article/706888/qatar/personalised-ai-models-enhance-support-for-children-with-asd
4•Bluestein•29m ago•0 comments

Generic Containers in C: Span

https://uecker.codeberg.page/2025-07-02.html
2•uecker•30m ago•0 comments

Academics on leaving US for 'scientific asylum' in France

https://www.theguardian.com/education/2025/jul/05/academics-leaving-us-scientific-asylum-france-trump
5•Bluestein•30m ago•1 comments

Discovery of Delta Wave (0211)

https://fr3action.com/
2•memv•30m ago•1 comments

Europe's first geostationary sounder satellite is launched

https://www.eumetsat.int/europes-first-geostationary-sounder-satellite-launched
36•diggan•32m ago•8 comments

Show HN: A word puzzle game I made for my friends

https://wordpivot.com
3•max0563•34m ago•0 comments

Reflections on 2 years of CPython's JIT Compiler: The good, the bad, the ugly

https://fidget-spinner.github.io/posts/jit-reflections.html
2•todsacerdoti•42m ago•0 comments

Russia shut down mobile internet 200 times in May and June against drone attacks

https://theins.ru/en/news/282568
1•amai•46m ago•0 comments

Make Worse Software, Slower

https://blog.redplanetlabs.com/2025/06/17/make-worse-software-slower/
1•yurivish•50m ago•0 comments

Before releasing a new AI model Sam Altman would be put into a Server room

https://twitter.com/the_yanco/status/1941388896387875282
8•delichon•50m ago•1 comments

How to Save a Dog

https://www.newyorker.com/news/the-weekend-essay/how-to-save-a-dog
1•fortran77•52m ago•0 comments

Hedge-fund quality equity research in minutes

https://valuationbot.ai
1•deadlyned•52m ago•0 comments

Some Thoughts on Techno-Fascism from Socialism 2025

https://organizingmythoughts.org/some-thoughts-on-techno-fascism-from-socialism-2025/
2•cratermoon•54m ago•0 comments