Open Reproduction of DeepSeek-R1

68•yogthos•2h ago

Comments

Tiberium•1h ago

Last update over a year ago, so I hope (2025) gets added to the title:

> [2025/05/26] (Step 1 completed!) We release Mixture-of-Thoughts--a curated reasoning dataset of 350k verified traces distilled from R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step. We also provide a recipe to train OpenR1-Distill-7B, which replicates the reasoning capabilities of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B and marks the completion of step 1 in the Open R1 project.

Doesn't look like they managed to actually reproduce R1, and only stopped on Step 1 out of their 3-step plan.

spmurrayzzz•1h ago

One of my favorite code comments of all time is still in the src:

"# TODO: implement a proper validator to compare against ground truth. For now we just check for exact string match on each line of stdout." [1]

This was one of my chief complaints about the entire R1 news cycle, it felt like no one actually read the technical report. They were being heralded for their openness, but they left out the most meaningful details that you'd need to reproduce their work.

[1] https://github.com/huggingface/open-r1/blob/1416fa0cf21595d2...

neutronicus•47m ago

Reminds me of my days in a computational physics PhD program.

madiator•1h ago

Check out OpenThoughts. It has a widely used dataset, a model that beats the deepseek's smaller reasoning models, and a paper that talks in detail about the data curation methodology.

https://www.open-thoughts.ai/

yogthos•37m ago

neat

christkv•1h ago

What is the estimated cost these days to train something like this to conclusion?

yieldcrv•1h ago

Too old now

aesthesia•46m ago

If you really want to see fully open training pipelines for modern LLMs, Olmo and to a lesser extent Nemotron are what you should look at.

https://github.com/allenai/OLMo

https://github.com/NVIDIA-NeMo/Nemotron

spijdar•8m ago

I'm not really familiar with either, but I'm more familiar with Olmo. My impression is Nemotron is newer -- why is it less applicable? Is it not totally open like Olmo?

MiMo Code Is Now Released and Open-Source

Lines of Code Got a Better Publicist

FPS.cob: A first person shooter in COBOL

MapComplete – Contibute to OpenStreetMaps

Nextcloud Hub 26 Spring: Built together, designed for the future

Pokémon Go Scans Trained the Navigation Tech for Military Drones

Open Reproduction of DeepSeek-R1

Workers are spending over 6 hours a week botsitting AI, fueling job frustration

Queues Don't Fix Overload (2014)

AI agent runs amok in Fedora and elsewhere

Why Thermodynamics Rules Future Orbital Data Centers

Web Browsers on Video Game Consoles

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

πFS

Anthropic requires 30 day data retention for Fable and Mythos

Build a Basic AI Agent from Scratch: Long Task Planning

Show HN: Open-source API Key server written in Go by Ory

US-Canada border library gets new Quebec-only entrance

Driving in America Is Headlight Hell

Supporting Exchange and beyond

Euro-Office: First version of the open-source web office is here

Linux latency measurements and compositor tuning

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

Why AI hasn't replaced software engineers, and won't

Starfish by Peter Watts (1999)

Reverse engineering the Creative Katana soundbar to control it from Linux

Sequoyah’s syllabary created a written language for the Cherokee

PgDog is funded and coming to a database near you

How JPL keeps the 13-year-old Curiosity rover doing science

L'Affaire Siloxane

Open Reproduction of DeepSeek-R1

Comments

MiMo Code Is Now Released and Open-Source

Lines of Code Got a Better Publicist

FPS.cob: A first person shooter in COBOL

MapComplete – Contibute to OpenStreetMaps

Nextcloud Hub 26 Spring: Built together, designed for the future

Pokémon Go Scans Trained the Navigation Tech for Military Drones

Open Reproduction of DeepSeek-R1

Workers are spending over 6 hours a week botsitting AI, fueling job frustration

Queues Don't Fix Overload (2014)

AI agent runs amok in Fedora and elsewhere

Why Thermodynamics Rules Future Orbital Data Centers

Web Browsers on Video Game Consoles

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

πFS

Anthropic requires 30 day data retention for Fable and Mythos

Build a Basic AI Agent from Scratch: Long Task Planning

Show HN: Open-source API Key server written in Go by Ory

US-Canada border library gets new Quebec-only entrance

Driving in America Is Headlight Hell

Supporting Exchange and beyond

Euro-Office: First version of the open-source web office is here

Linux latency measurements and compositor tuning

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

Why AI hasn't replaced software engineers, and won't

Starfish by Peter Watts (1999)

Reverse engineering the Creative Katana soundbar to control it from Linux

Sequoyah’s syllabary created a written language for the Cherokee

PgDog is funded and coming to a database near you

How JPL keeps the 13-year-old Curiosity rover doing science

L'Affaire Siloxane