Open Reproduction of DeepSeek-R1

https://github.com/huggingface/open-r1
68•yogthos•2h ago

Comments

Tiberium•1h ago
Last update over a year ago, so I hope (2025) gets added to the title:

> [2025/05/26] (Step 1 completed!) We release Mixture-of-Thoughts--a curated reasoning dataset of 350k verified traces distilled from R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step. We also provide a recipe to train OpenR1-Distill-7B, which replicates the reasoning capabilities of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B and marks the completion of step 1 in the Open R1 project.

It doesn't look like they actually managed to reproduce R1; they stopped after Step 1 of their 3-step plan.

spmurrayzzz•1h ago
One of my favorite code comments of all time is still in the src:

"# TODO: implement a proper validator to compare against ground truth. For now we just check for exact string match on each line of stdout." [1]

This was one of my chief complaints about the entire R1 news cycle: it felt like no one had actually read the technical report. DeepSeek was heralded for its openness, but the report left out the most meaningful details you'd need to reproduce the work.

[1] https://github.com/huggingface/open-r1/blob/1416fa0cf21595d2...
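To make the TODO quoted above concrete, here is a minimal sketch of the difference between an exact per-line stdout check and a slightly more tolerant validator. This is not the open-r1 code: the function names `exact_line_match` and `tolerant_match`, the whitespace handling, and the numeric tolerance are all assumptions made for illustration.

```python
import math


def exact_line_match(stdout: str, expected: str) -> bool:
    """Hypothetical baseline: pass only if each stdout line equals the expected line.

    Assumption: surrounding whitespace on each line is ignored.
    """
    got = stdout.strip().split("\n")
    want = expected.strip().split("\n")
    if len(got) != len(want):
        return False
    return all(g.strip() == w.strip() for g, w in zip(got, want))


def tolerant_match(stdout: str, expected: str, rel_tol: float = 1e-6) -> bool:
    """Hypothetical alternative: compare whitespace-separated tokens.

    Tokens that differ as strings are still accepted when both parse
    as floats and are equal within rel_tol.
    """
    got = stdout.split()
    want = expected.split()
    if len(got) != len(want):
        return False
    for g, w in zip(got, want):
        if g == w:
            continue
        try:
            if not math.isclose(float(g), float(w), rel_tol=rel_tol):
                return False
        except ValueError:
            # At least one token is not numeric, and the strings differ.
            return False
    return True
```

With these definitions, a program that prints `0.50` when the ground truth is `0.5` fails `exact_line_match` but passes `tolerant_match`, which is the kind of false negative an exact string comparison can produce when scoring generated code.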

neutronicus•47m ago
Reminds me of my days in a computational physics PhD program.

madiator•1h ago
Check out OpenThoughts. It has a widely used dataset, a model that beats DeepSeek's smaller reasoning models, and a paper that describes the data curation methodology in detail.

https://www.open-thoughts.ai/

yogthos•37m ago
neat

christkv•1h ago
What is the estimated cost these days to train something like this to conclusion?

yieldcrv•1h ago
Too old now

aesthesia•46m ago
If you really want to see fully open training pipelines for modern LLMs, Olmo and, to a lesser extent, Nemotron are what you should look at.

https://github.com/allenai/OLMo

https://github.com/NVIDIA-NeMo/Nemotron

spijdar•8m ago
I don't know either of them well, though I'm more familiar with Olmo. My impression is that Nemotron is newer, so why is it less applicable? Is it not totally open like Olmo?

MiMo Code Is Now Released and Open-Source

https://mimo.xiaomi.com/mimocode
107•apeters•1h ago•45 comments

Lines of Code Got a Better Publicist

https://curlewis.co.nz/posts/lines-of-code-got-a-better-publicist/
202•RyeCombinator•3h ago•136 comments

FPS.cob: A first person shooter in COBOL

https://github.com/icitry/FPS.cob
24•MBCook•41m ago•2 comments

MapComplete – Contribute to OpenStreetMap

https://mapcomplete.org/
82•GTP•1h ago•15 comments

Nextcloud Hub 26 Spring: Built together, designed for the future

https://nextcloud.com/blog/nextcloud-hub26-spring/
61•doener•1h ago•30 comments

Pokémon Go Scans Trained the Navigation Tech for Military Drones

https://dronexl.co/2026/06/09/pokemon-go-scans-niantic-vantor-military-drone-navigation/
560•vrganj•9h ago•256 comments

Workers are spending over 6 hours a week botsitting AI, fueling job frustration

https://www.businessinsider.com/botsitting-ai-hidden-human-labor-at-work-2026-6
172•ZeidJ•2h ago•125 comments

Queues Don't Fix Overload (2014)

https://ferd.ca/queues-don-t-fix-overload.html
13•locknitpicker•2d ago•4 comments

AI agent runs amok in Fedora and elsewhere

https://lwn.net/SubscriberLink/1077035/c7e7c14fbd60fae9/
520•tanelpoder•15h ago•230 comments

Why Thermodynamics Rules Future Orbital Data Centers

https://spectrum.ieee.org/orbital-data-centers-heat
30•rbanffy•2h ago•31 comments

Web Browsers on Video Game Consoles

https://vale.rocks/posts/game-console-browsers
123•robin_reala•7h ago•61 comments

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

https://techcrunch.com/2026/06/10/cybersecurity-researchers-arent-happy-about-the-guardrails-on-a...
549•speckx•23h ago•483 comments

πFS

https://github.com/philipl/pifs
897•helterskelter•21h ago•198 comments

Anthropic requires 30 day data retention for Fable and Mythos

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models
568•lebovic•1d ago•287 comments

Build a Basic AI Agent from Scratch: Long Task Planning

https://medium.com/@rogi23696/build-a-basic-ai-agent-from-scratch-long-task-planning-14e803f9bd6d
98•ruxudev•2d ago•41 comments

Show HN: Open-source API Key server written in Go by Ory

https://github.com/ory/talos/tree/master
11•leetvibecoder•45m ago•2 comments

US-Canada border library gets new Quebec-only entrance

https://www.bbc.com/news/videos/clyrvrde160o
92•NalNezumi•2h ago•73 comments

Driving in America Is Headlight Hell

https://www.theatlantic.com/technology/2026/06/car-headlights-too-bright-adaptive-beams/687488/
39•pavel_lishin•59m ago•19 comments

Supporting Exchange and beyond

https://brendan.abolivier.bzh/exchange-pt-2/
9•babolivier•2d ago•1 comment

Euro-Office: First version of the open-source web office is here

https://www.heise.de/en/news/Euro-Office-First-version-of-the-open-source-web-office-is-here-1132...
44•doener•1h ago•18 comments

Linux latency measurements and compositor tuning

https://farnoy.dev/posts/linux-latency
99•GalaxySnail•2d ago•30 comments

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

752•eries•1d ago•520 comments

Why AI hasn't replaced software engineers, and won't

https://www.normaltech.ai/p/why-ai-hasnt-replaced-software-engineers
190•trueduke•8h ago•235 comments

Starfish by Peter Watts (1999)

https://www.rifters.com/real/STARFISH.htm#prelude
117•zetalyrae•2d ago•45 comments

Reverse engineering the Creative Katana soundbar to control it from Linux

https://blog.nns.ee/2026/02/20/katana-v2x-re/
119•theanonymousone•4d ago•10 comments

Sequoyah’s syllabary created a written language for the Cherokee

https://www.smithsonianmag.com/innovation/man-created-written-language-cherokee-did-efficiently-e...
184•grahambargeron•17h ago•116 comments

PgDog is funded and coming to a database near you

https://pgdog.dev/blog/our-funding-announcement
522•levkk•1d ago•245 comments

How JPL keeps the 13-year-old Curiosity rover doing science

https://spectrum.ieee.org/curiosity-rover-jpl-mars-science
266•pseudolus•22h ago•82 comments

L'Affaire Siloxane

https://mceglowski.substack.com/p/laffaire-siloxane
273•idlewords•2d ago•48 comments