Group Sequence Policy Optimization

https://arxiv.org/abs/2507.18071
2•kdavis•4h ago

Comments

kdavis•4h ago
This paper introduces Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant reinforcement learning algorithm for training large language models. Unlike previous algorithms that adopt token-level importance ratios, GSPO defines the importance ratio based on sequence likelihood and performs sequence-level clipping, rewarding, and optimization. We demonstrate that GSPO achieves superior training efficiency and performance compared to the GRPO algorithm, notably stabilizes Mixture-of-Experts (MoE) RL training, and has the potential for simplifying the design of RL infrastructure. These merits of GSPO have contributed to the remarkable improvements in the latest Qwen3 models.
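The contrast the abstract draws, one importance ratio per sequence instead of one per token, can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the authors' implementation: the length normalization of the ratio, the group-normalized advantage, the `clip_eps` value, and all function names are assumptions not stated in the abstract.

```python
import math
from statistics import mean, pstdev


def sequence_importance_ratio(new_logps, old_logps):
    """One ratio per sequence: exp of the mean per-token log-prob difference.

    Assumption: the sequence likelihood ratio is length-normalized
    (a geometric mean of the token ratios).
    """
    if not new_logps or len(new_logps) != len(old_logps):
        raise ValueError("log-prob lists must be non-empty and equal length")
    diff = sum(n - o for n, o in zip(new_logps, old_logps))
    return math.exp(diff / len(new_logps))


def group_advantages(rewards, eps=1e-8):
    """Assumed advantage: each reward normalized within its group of samples."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def clipped_sequence_objective(new_group, old_group, rewards, clip_eps=0.2):
    """Average clipped surrogate over a group, clipping whole sequences.

    clip_eps=0.2 is a placeholder value chosen for illustration only.
    """
    advantages = group_advantages(rewards)
    total = 0.0
    for new_lp, old_lp, adv in zip(new_group, old_group, advantages):
        ratio = sequence_importance_ratio(new_lp, old_lp)
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Pessimistic bound: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(rewards)
```

A token-level method would compute and clip a separate ratio for every token. Here a response is either kept or clipped as a whole, which is the sequence-level behavior the abstract describes.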

Proxy 4: The Next Leap in C++ Polymorphism

https://devblogs.microsoft.com/cppblog/announcing-proxy-4-the-next-leap-in-c-polymorphism/
2•janjones•1m ago•0 comments

I gave Claude Code a folder of tax documents and used it as a tax agent

https://martinalderson.com/posts/building-a-tax-agent-with-claude-code/
1•martinald•2m ago•1 comment

The Dangerous Legal Strategy Coming for Our Books

https://www.theatlantic.com/ideas/archive/2025/08/book-bans-public-schools/683921/
1•littlexsparkee•3m ago•1 comment

Privacy Washing Is a Dirty Business

https://www.privacyguides.org/articles/2025/08/20/privacy-washing-is-a-dirty-business/
2•samuel246•7m ago•0 comments

Can Peanut Allergies Be Cured?

https://www.scientificamerican.com/article/new-treatments-can-free-kids-from-the-deadly-threat-of-peanut-allergy/
1•stevenjgarner•7m ago•0 comments

Show HN: Llmswap v3.0 – CLI and SDK for OpenAI, Claude, Gemini, Watsonx

https://pypi.org/project/llmswap/
1•sreenathmenon•9m ago•0 comments

Show HN: Turn any study material into practice questions with one photo

https://www.lexielearn.com/en
2•e_patjas•9m ago•0 comments

Show HN: I built an app to track expense temptation

https://app.skipwise.org
1•0xshadow•10m ago•0 comments

FictusVNC – Fake VNC server to serve your images easily

https://github.com/ayebrian/fictusvnc
1•LorenDB•10m ago•0 comments

Economics of RL

https://www.mechanize.work/blog/cheap-rl-tasks-will-waste-compute/
2•Tamaybes•10m ago•0 comments

Google announces Tennessee as site for small modular nuclear reactor

https://www.reuters.com/sustainability/boards-policy-regulation/google-announces-tennessee-site-small-modular-nuclear-reactor-2025-08-18/
2•rbanffy•12m ago•0 comments

Researchers build first 'microwave brain' on a chip – Cornell Chronicle

https://news.cornell.edu/stories/2025/08/researchers-build-first-microwave-brain-chip
1•rbanffy•12m ago•0 comments

Made by Google '25 launch event [video]

https://www.youtube.com/watch?v=JXCXTQIIvM0
2•ChrisArchitect•13m ago•0 comments

Awesome-ricing, tools to help with ricing on Linux

https://github.com/fosslife/awesome-ricing
2•dxs•14m ago•0 comments

DOM-Based Extension Clickjacking

https://marektoth.com/blog/dom-based-extension-clickjacking/
1•_xgw•15m ago•0 comments

The Content Trap

https://9to5tofounder.substack.com/p/no-1-starting-from-scratch
1•dimitrit•16m ago•0 comments

Pixel 10 Phones

https://blog.google/products/pixel/google-pixel-10-pro-xl/
23•gotmedium•17m ago•12 comments

Misinformation Rises, Climate Fades; Global Risk Is Now a Popularity Contest

https://www.pewresearch.org/global/2025/08/19/international-opinion-on-global-threats/
5•bdev12345•18m ago•1 comment

NSA's Acting Director Tried to Save Top Scientist from Purge

https://www.nytimes.com/2025/08/20/us/politics/security-clearances-scientist-fired.html
7•_tk_•22m ago•1 comment

Families caring for older adults at home say aging in place may be worth it

https://www.cbsnews.com/newyork/news/aging-in-place-home-remodeling-cost-of-caregiving/
2•mooreds•23m ago•0 comments

Texas Energy Crunch to Worsen as Trump Policies Target Solar and Wind Power

https://www.bloomberg.com/news/articles/2025-08-19/texas-energy-crunch-to-worsen-as-trump-policies-target-solar-wind-power
6•voxadam•23m ago•1 comment

You no longer need a Clipper card to ride BART

https://www.kron4.com/news/bay-area/youll-no-longer-need-a-clipper-card-to-ride-bart-starting-this-week/
1•kaycebasques•24m ago•0 comments

Teaching: A Few Useful Analogies

https://jonathandinu.com/writing/on-teaching/
1•clearspandex•28m ago•0 comments

Show HN: LightSwitch: Multi-View Relighting with Material-Guided Diffusion

https://yehonathanlitman.github.io/light_switch/
4•indigoomega•30m ago•1 comment

Researchers discover what saves babies' lives. It's not medical, it's money

https://www.npr.org/sections/goats-and-soda/2025/08/18/g-s1-83197/infants-health-cash-aid-kenya
3•laurex•32m ago•0 comments

Scientists get a rare peek inside of an exploding star

https://apnews.com/article/supernova-explosion-dying-star-9924d1cbfb8d8e5d9548defe38d7105a
4•petethomas•32m ago•0 comments

An Update on Pytype

https://github.com/google/pytype
21•mxmlnkn•36m ago•3 comments

Is the A.I. Sell-Off the Start of Something Bigger?

https://www.nytimes.com/2025/08/20/business/dealbook/ai-dip-blip-palantir-nvidia.html
8•voxadam•37m ago•2 comments

How harmful is blue light for sleep?

https://www.nytimes.com/2025/08/17/well/health-effects-blue-light-screen-use.html
3•bookofjoe•40m ago•1 comment

US Health Secretary Ends Decades of Research into Environmental Causes of Autism

https://www.propublica.org/article/rfk-jr-autism-environment-research-funding
5•klipt•40m ago•1 comment