frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Polaris: A Post-training recipe for scaling RL on Advanced Reasoning models

https://hkunlp.github.io/blog/2025/Polaris/
4•limoce•8h ago

Comments

NitpickLawyer•7h ago
Really cool paper, lots of examples of what worked, lots of interesting ideas. Some things I got from a first read-through:

- sample selection while training - while removing 0/8 and 8/8 problems was done before, I think it's interesting that they're doing it during training as well (as the model learns to solve some problems, they shift from x/8 closer to 8/8, and in this paper they remove them dynamically). Cool idea.

- increasing temp after an "entropy decrease" in the model - As the model "learns" new patterns, the entropy of answers decreases (based on ngrams) so they dynamically increase temperature to encourage discovery of more diverse answers.

- rope gives you free gains.

- each model is different and what works at one scale doesn't necessarily work at other scales - I think this was "known", but cool to see it applies to RL as well.

Fart: Language Model That Predicts Taste of Molecules

https://idp.nature.com/authorize?response_type=cookie&client_id=grover&redirect_uri=https%3A%2F%2Fwww.nature.com%2Farticles%2Fs41538-025-00474-z
1•yz-exodao•29s ago•0 comments

Only the Biggest Neoclouds Will Survive

https://www.nextplatform.com/2025/07/08/only-the-biggest-neoclouds-will-survive/
1•rbanffy•37s ago•0 comments

Show HN: Combine Minesweeper and Nanogram Game

https://nano-quantum-game.netlify.app/
1•evrmgzm•1m ago•0 comments

PG&E warns of aggressive scammers in Bay Area with more than 2,500 reports

https://abc7news.com/post/pge-scam-utility-company-warns-aggressive-scammers-bay-area-more-2500-cases-year-heres-what-know/17025303/
1•randycupertino•3m ago•1 comments

HN is censoring news about X / Twitter

5•tslocum•5m ago•3 comments

Four Billion Years of Vibecoding

https://aboard.com/four-billion-years-of-vibecoding/
1•gbseventeen3331•5m ago•0 comments

How AI is changing software engineering at Shopify with Farhan Thawar

https://newsletter.pragmaticengineer.com/p/how-ai-is-changing-software-engineering
1•gbseventeen3331•5m ago•0 comments

No, Grok, No

https://thezvi.substack.com/p/no-grok-no
2•jsnider3•6m ago•0 comments

Run Pandas on cloud GPUs (without Docker or K8s)

https://developer.nvidia.com/blog/simplify-setup-and-boost-data-science-in-the-cloud-using-nvidia-cuda-x-and-coiled/
1•scj13•6m ago•1 comments

Show HN: Beagle Security – AI driven pentesting for web apps and APIs

https://beaglesecurity.com/
1•rejah•9m ago•0 comments

No more disks: the architecture behind stateless compute in ClickHouse Cloud

https://clickhouse.com/blog/clickhouse-cloud-stateless-compute
1•tschreiber•9m ago•0 comments

Publish Your Home-Assistant Instance Using Matter

https://github.com/t0bst4r/home-assistant-matter-hub
1•Bluestein•10m ago•0 comments

FlexOlmo: A paradigm for LLM training and data collaboration

https://allenai.org/blog/flexolmo
1•maxloh•10m ago•0 comments

AI Movies – From 1920s to Now

https://biglysales.com/most-riveting-ai-movies/
1•ishita159•10m ago•0 comments

Linda Yaccarino resigns as CEO of X (Twitter)

https://www.politico.com/news/2025/07/09/linda-yaccarino-x-ceo-resign-00443742
2•rurp•11m ago•0 comments

Are there design shops doing cheap mobile apps using AI?

1•gritlin•11m ago•0 comments

Wildfires are challenging air quality monitoring infrastructure

https://undark.org/2025/07/04/wildfires-aqi-infrastructure/
1•rntn•12m ago•0 comments

For startups safeguarding AI models, revenue remains modest

https://twitter.com/theinformation/status/1942641099782148426
1•andy99•13m ago•0 comments

California's fire agency made an AI chatbot. Don't ask about evacuation orders

https://themarkup.org/artificial-intelligence/2025/07/09/californias-fire-protection-agency-made-an-ai-chatbot-dont-ask-it-about-evacuation-orders
1•CharlesW•14m ago•0 comments

Sometimes you just bump into people

https://thelastleg.substack.com/p/sometimes-you-just-bump-into-people
1•greenie_beans•15m ago•0 comments

Linda Yaccarino is out as CEO of X

https://www.cnn.com/2025/07/09/tech/linda-yaccarino-steps-down-x-ceo
1•gniting•16m ago•0 comments

R&B your way thru the Hebrew Bible

https://klappn.com/
1•linenmerchant•17m ago•1 comments

YouGotMail – build digital co-workers in MS Outlook

https://github.com/WitoldKowalczyk/YouGotMail
1•witold_kow•17m ago•1 comments

Linda Yaccarino steps down as CEO of Elon Musk's X

https://techcrunch.com/2025/07/09/linda-yaccarino-steps-down-as-ceo-of-elon-musks-x/
2•impish9208•19m ago•0 comments

Perplexity Launches Comet for Pro Subscribers

https://techcrunch.com/2025/07/09/perplexity-launches-comet-an-ai-powered-web-browser/
2•gniting•20m ago•0 comments

How well optimised are sites for AI crawlers?

https://trakkr.ai/ai-reports
1•mektrik•20m ago•0 comments

Advancing Claude for Education

https://www.anthropic.com/news/advancing-claude-for-education
2•meetpateltech•21m ago•2 comments

Real AI agents solve bounded problems

https://venturebeat.com/ai/forget-the-hype-real-ai-agents-solve-bounded-problems-not-open-world-fantasies/
2•kristianc•23m ago•0 comments

What is the voice inside my head?

https://www.bbc.com/audio/play/w3ct5rhk
5•Bluestein•24m ago•2 comments

BitChat, New Offline Messaging App, Uses Bluetooth Mesh, No Internet

https://reclaimthenet.org/bitchat-uses-bluetooth-mesh-no-internet
1•anonymousiam•24m ago•1 comments