I trained a Language Model to schedule events with GRPO

https://huggingface.co/blog/anakin87/qwen-scheduler-grpo
1•anakin87•9h ago

Comments

anakin87•9h ago
I have been experimenting with GRPO lately, because I am fascinated by models that learn from prompts and rewards alone, without the example answers that Supervised Fine-Tuning requires.

After the DeepSeek boom, everyone is trying GRPO with GSM8K or the Countdown Game, but I wanted a different challenge. So I opted for teaching a model to create a schedule from a list of events and priorities.

Choosing an original problem forced me to think about the problem setting, generate data, choose the base model, design reward functions, and run multiple rounds of training, hoping that my model would learn something.
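To make "design reward functions" concrete, here is a minimal sketch of how a reward can feed into GRPO. It is not the code from the blog post or the repository: the `schedule_reward` rule, the event tuples, and the priority values are all hypothetical. The part that follows the usual GRPO formulation is the normalization step, where each sampled completion's reward is compared with the mean and standard deviation of its own group.

```python
from statistics import mean, pstdev

def schedule_reward(schedule, priorities):
    """Toy reward (hypothetical): priority-weighted minutes scheduled, 0 on overlap.

    `schedule` is a list of (name, start_minute, end_minute) tuples.
    """
    ordered = sorted(schedule, key=lambda event: event[1])
    # Any pair of consecutive events that overlap invalidates the whole schedule.
    for (_, _, end_a), (_, start_b, _) in zip(ordered, ordered[1:]):
        if start_b < end_a:
            return 0.0
    return float(sum(priorities[name] * (end - start) for name, start, end in ordered))

def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions sampled for the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# Three hypothetical completions for the same prompt (times in minutes since midnight).
priorities = {"standup": 2, "lunch": 1}
candidates = [
    [("standup", 540, 570), ("lunch", 720, 780)],  # both events, no overlap
    [("standup", 540, 570)],                       # valid, but drops lunch
    [("standup", 540, 600), ("lunch", 570, 630)],  # overlapping events
]
rewards = [schedule_reward(c, priorities) for c in candidates]
advantages = group_relative_advantages(rewards)
print(rewards)
print(advantages)
```

The complete schedule ends up with a positive advantage and the overlapping one with a negative advantage, so no reference answer is needed: the model is only told which of its own attempts were better than the group average.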

A fun and rewarding experience. :-)

I learned a lot that I want to share with you.

Blog post: https://huggingface.co/blog/anakin87/qwen-scheduler-grpo

Code: https://github.com/anakin87/qwen-scheduler-grpo

Hugging Face collection (dataset and model): https://huggingface.co/collections/anakin87/qwen-scheduler-g...

Adafruit hit with surprise $36K tariff bill: "pay in one week"

https://www.msn.com/en-us/money/companies/brooklyn-electronics-company-adafruit-hit-with-surprise-36k-tariff-bill-pay-in-one-week/ar-AA1EqmmM
1•clumsysmurf•6s ago•0 comments

The data survey disconnect and the dollar

https://kennethrogoff.substack.com/p/the-data-survey-disconnect-and-the
1•danboarder•58s ago•0 comments

Ask HN: Would this low-code back end idea be useful to you?

1•emrullahayaz•5m ago•0 comments

Newark Mayor Ras Baraka Arrested at ICE Detention Center in NJ

https://pix11.com/news/local-news/newark-mayor-ras-baraka-taken-into-custody-by-ice-in-new-jersey/
4•tastyface•8m ago•0 comments

How many gadgets have YOU owned on the eWaste Graveyard? [video]

https://www.youtube.com/watch?v=QrNFJOX7PBs
1•lg_rocket•15m ago•0 comments

Career Progression: How to Use 30, 60, and 90 Days Approach

https://diamantinoalmeida.com/career-progression-how-to-use-30-60-and-90-days-approach/
1•MitiaHiers•17m ago•0 comments

Overcoming Self-Doubt: A Practical Guide to Building Lasting Confidence

https://diamantinoalmeida.com/overcoming-self-doubt-a-practical-guide-to-building-lasting-confidence/
1•MitiaHiers•17m ago•0 comments

The Acid King

https://www.rollingstone.com/feature/acid-lsd-king-william-leonard-pickard-prison-pete-wilkinson-184390/
1•udit99•18m ago•0 comments

Japanese PhD Student Has Visa Revoked in the US Due to Alleged Criminal History

https://www.tokyoweekender.com/japan-life/news-and-opinion/japanese-phd-student-faces-us-deportation-over-minor-infractions/
2•miles•18m ago•0 comments

Israel's NSO Group ordered to pay nearly $170M to WhatsApp for hacking accounts

https://www.politico.com/news/2025/05/06/nso-group-pegasus-whatsapp-hack-170-million-damages-00332155
1•TMWNN•19m ago•1 comment

Revisiting Lower Bounds for Two-Step Consensus

https://arxiv.org/abs/2505.03627
1•otrack•23m ago•0 comments

Show HN: Bardmore – AI Speech Analysis and Feedback

https://bardmore.com/
1•ChristopherLaw_•24m ago•0 comments

How Being Watched Changes How You Think

https://www.scientificamerican.com/article/how-being-watched-changes-how-you-think/
1•SkyMarshal•25m ago•0 comments

Why everybody's drinking milk again

https://thehustle.co/originals/why-everybodys-drinking-milk-again
2•paulpauper•28m ago•0 comments

Perplexity Hacked Its Growth – Everything You Can Adopt from It

1•rishikeshranjan•29m ago•0 comments

Pope Leo XIV–why does this matter to the worlds of art and heritage?

https://www.theartnewspaper.com/2025/05/08/robert-francis-prevost-has-been-elected-pope-leo-xivwhy-does-this-matter-to-the-worlds-of-art-and-heritage
1•paulpauper•30m ago•0 comments

The post-screen future does not exist

https://www.chrbutler.com/shut-up-siri-post-screen-future
3•delaugust•30m ago•0 comments

NSF Unidata Pause in Most Operations

https://www.unidata.ucar.edu/blogs/news/entry/nsf-unidata-pause-in-most
2•trauco•32m ago•0 comments

Start, Fresh – Redesigning the Windows Start Menu for You

https://microsoft.design/articles/start-fresh-redesigning-windows-start-menu/
2•withinrafael•32m ago•1 comment

Joys and sorrows of designing a language [video]

https://www.youtube.com/watch?v=Zx5DcBt61bQ
2•todsacerdoti•35m ago•0 comments

Designing an architecture using dark matter and dark energy

https://microservices.io/post/microservices/2021/11/30/dark-matter-dark-energy.html
1•Alupis•38m ago•0 comments

CoreWeave seeks new $1.5B debt deal after downsized IPO

https://www.ft.com/content/453c47ae-997a-458d-9343-aa84370a2925
3•toomuchtodo•40m ago•2 comments

Show HN: Noti – Notes and reminders that live in your notification center

https://apps.apple.com/us/app/noti-simple-notifications/id6745527023
1•ajs808•40m ago•0 comments

Era of U.S. dollar may be winding down

https://news.harvard.edu/gazette/story/2025/05/era-of-u-s-dollar-may-be-winding-down/
78•gnabgib•42m ago•69 comments

Refactoring Agent for Bad Coders

https://github.com/bkidd1/wash-cli
2•BrinleeKidd•43m ago•0 comments

Extraterrestrial Tongues

https://aeon.co/essays/why-alien-languages-could-be-far-stranger-than-we-imagine
3•chrbutler•45m ago•0 comments

Added Quantum Field Theory to a Radiation Simulator for 22% Better Predictions

https://github.com/r0nlt/Space-Radiation-Tolerant/pull/26
1•r0nlt•48m ago•1 comment

CSS Snippets

https://adactio.com/journal/21896
3•chrbutler•49m ago•0 comments

Engineering Design Optimization Textbook

https://mdobook.github.io/
3•TheHideout•50m ago•0 comments

Career Is Only Meaningful

https://substack.com/home/post/p-162989745
3•MitiaHiers•52m ago•0 comments