Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

https://arxiv.org/abs/2509.02522
22•getnormality•2h ago

Comments

getnormality•2h ago
I stumbled across this AI paper just now. It sounds intimidatingly technical, but if you read the abstract and look at Figures 1 and 2 and Equation 6, I think it's got some neat and accessible conceptual ideas.

Supervised learning is a much more mature technology than reinforcement learning, so it seems like a good thing to leverage that.

yorwba•1h ago
I think you meant to link to

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR https://arxiv.org/abs/2509.02522

not

Winning Gold at IMO 2025 with a Model-Agnostic Verification-and-Refinement Pipeline https://arxiv.org/abs/2507.15855

dang•1h ago
We've changed the top link to that from https://arxiv.org/abs/2507.15855. Thanks!

getnormality•1h ago
Ack, thank you.

anfego•42m ago
Is this DPO?

I built a tool that lets you backtest trading strategies using plain English

1•satabdom27•1m ago•0 comments

Why George R.R. Martin Broke the Cardinal Rule of Hollywood

https://www.hollywoodreporter.com/movies/movie-features/george-r-r-martin-howard-waldrop-ugly-chi...
2•throwoutway•5m ago•0 comments

He Drops Trump Jr.'s Name in Pursuit of Billion-Dollar Deals

https://www.wsj.com/politics/policy/donald-trump-jr-friend-gentry-beach-03824825
2•doener•7m ago•0 comments

Hybrid unary-binary design for multiplier-less printed ML classifiers

https://arxiv.org/abs/2509.15316
1•PaulHoule•8m ago•0 comments

The two minute mile problem

https://hollisrobbinsanecdotal.substack.com/p/the-two-minute-mile-problem
1•HR01•8m ago•0 comments

HLS Visiting Professor Placed on Leave After Firing Pellet Gun Near Synagogue

https://www.thecrimson.com/article/2025/10/5/hls-visiting-prof-arrested/
1•YZF•9m ago•0 comments

Move Fast and Break Nothing

https://www.theatlantic.com/technology/2025/10/is-waymo-safe/684432/
1•andrewmutz•11m ago•0 comments

Google Chrome RCE (No Sandbox) via CanonicalEquality:EqualValueType()

https://ssd-disclosure.com/google-chrome-rce-no-sandbox-via-canonicalequalityequalvaluetype/
2•ogig•11m ago•0 comments

Ask HN: 10-Year Reddit Account Hacked Despite 2FA

3•guilamu•11m ago•1 comment

Short Science Fiction, by Isaac Asimov – Standard Ebooks

https://standardebooks.org/ebooks/isaac-asimov/short-science-fiction
2•WithinReason•11m ago•0 comments

Toybox: All-in-one Linux command line

https://github.com/landley/toybox
1•welovebunnies•13m ago•0 comments

Dimensional Analysis in Programming Languages (2018)

https://www.gmpreussner.com/research/dimensional-analysis-in-programming-languages
1•v9v•15m ago•1 comment

Show HN: OpenScreen. Open-source video assessment screening tool

https://github.com/dylnbk/open-screen
1•dylnbk•19m ago•0 comments

Kenneth Clark Civilisation 1969

https://www.youtube.com/playlist?list=PL4wbshl89IWTHc94BhZI-C-v-neF40boG
1•mosiuerbarso•22m ago•0 comments

A year of improving Node.js compatibility in Cloudflare Workers

https://blog.cloudflare.com/nodejs-workers-2025/
2•CharlesW•26m ago•0 comments

Florida student asks ChatGPT how to kill his friend, ends up in jail: deputies

https://www.wfla.com/news/florida/florida-student-asks-chatgpt-how-to-kill-his-friend-ends-up-in-...
2•trhway•28m ago•0 comments

Cariad: VW subsidiary largely discontinues its own software development

https://www.heise.de/en/news/Cariad-VW-subsidiary-largely-discontinues-its-own-software-developme...
1•esher•28m ago•0 comments

I Have a Wodehouse Problem. The Problem Is I Can't Stop Reading Him

https://thewalrus.ca/i-have-a-wodehouse-problem-the-problem-is-i-cant-stop-reading-him/
1•lermontov•30m ago•0 comments

GBoard Dial Version

https://www.youtube.com/watch?v=BgdWyD0cBx4
2•skogstokig•33m ago•0 comments

Gliding behind existing aircraft, Aerocart cargo gliders

https://www.aerolane.com/
2•fcpguru•33m ago•0 comments

How I finally got Firebase to verify my Squarespace domain

https://www.lokmanefe.com/writings/firebase-squarespace-custom-domain
1•lokicik•36m ago•0 comments

Engineering Viruses to Fight Bacteria

https://www.popularmechanics.com/science/a68825778/custom-viruses-fight-e-coli/
1•bookofjoe•37m ago•1 comment

Surgeon Returns to War Zone to Help One of Congo's Thousands of Rape Victims

https://www.thetimes.com/world/africa/article/sophie-duchess-congo-rape-victims-fj7s0vm3q
1•mhb•39m ago•0 comments

A recent phishing attack on GitHub

https://digitalseams.com/blog/a-recent-phishing-attack-on-github
1•bobbiechen•40m ago•1 comment

How we built a cloud GPU notebook that boots in seconds

https://modal.com/blog/notebooks-internals
1•birdculture•40m ago•0 comments

Without Deeds, Without Names

https://www.laphamsquarterly.org/celebrity/without-deeds-without-names
8•toomuchtodo•44m ago•0 comments

Deaths, disappearances, forced recruitment: Horrors of relentless war in Sudan

https://www.theguardian.com/world/2025/oct/02/refugees-recall-horrors-of-sudan-civil-war
2•mhb•44m ago•0 comments

Laser Sintering 3D-Prints Silver Electronics in Space

https://bioengineer.org/laser-sintering-3d-prints-silver-electronics-in-space/
1•westurner•44m ago•0 comments

An algorithm that turns images into Obama

https://github.com/Spu7Nix/obamify
2•hsuduebc2•45m ago•1 comment

Microsoft Accidentally removes extensions from VSCode Marketplace

https://github.com/microsoft/vscode/issues/269737
2•Alupis•47m ago•0 comments