PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play

https://vmax.ai/team/populora-co-evolving-llm-populations-for-reasoning-self-play
17•AMavorParker•47m ago

Comments

AMavorParker•47m ago
We introduce PopuLoRA, a population-based asymmetric self-play framework for reinforcement learning with verifiable rewards (RLVR) post-training of LLMs. Teachers and students are specialised LoRA adapters on a shared frozen base: teachers propose problems, matched students solve them under a programmatic verifier, and cross-evaluation between sub-populations replaces the self-calibration that limits single-agent self-play. A family of LoRA weight-space evolution operators (mutations and crossovers that produce same-rank population members in seconds) serves as the replacement step of a population-based training loop at 7B scale. We instantiate PopuLoRA on top of Absolute Zero Reasoner and compare it against a per-adapter compute-matched single-agent baseline. Where the single agent self-calibrates to generating easy problems it can reliably solve, the population enters a co-evolutionary arms race: teachers produce increasingly complex problems, student solve rates oscillate, and problem-space coverage keeps expanding throughout training. Despite lower training-time reward, the population mean outperforms the baseline on three code benchmarks (HumanEval+, MBPP+, LiveCodeBench) and seven math benchmarks (AIME 24/25, AMC 23, MATH-500, Minerva, GSM8K, OlympiadBench), and even the weakest member of the population beats the baseline on aggregate.
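To make the "weight-space evolution operators" concrete, here is a minimal sketch of what a same-rank mutation and crossover on LoRA factors could look like. This is not the paper's implementation: the function names (`make_adapter`, `mutate`, `crossover`, `delta_w`), the Gaussian-noise mutation, and the per-rank-component swap are all assumptions chosen for illustration. Each adapter is a pair of factors `A` (rank x d_in) and `B` (d_out x rank) whose product `B @ A` is the update applied to the frozen base weight.

```python
import random

def make_adapter(rng, d_out, d_in, rank, scale=0.02):
    """Hypothetical adapter: A is rank x d_in, B is d_out x rank."""
    A = [[rng.gauss(0.0, scale) for _ in range(d_in)] for _ in range(rank)]
    B = [[rng.gauss(0.0, scale) for _ in range(rank)] for _ in range(d_out)]
    return {"A": A, "B": B}

def mutate(adapter, rng, sigma=0.01):
    """Assumed mutation: add Gaussian noise to every entry.
    Shapes, and therefore the rank, are unchanged."""
    return {
        name: [[w + rng.gauss(0.0, sigma) for w in row] for row in mat]
        for name, mat in adapter.items()
    }

def crossover(p1, p2, rng, mask=None):
    """Assumed crossover: for each rank index i, copy row i of A and
    column i of B together from one parent, so the child keeps the
    parents' rank and each rank-one component stays intact."""
    rank = len(p1["A"])
    if mask is None:
        mask = [rng.random() < 0.5 for _ in range(rank)]
    pick = [p1 if from_p1 else p2 for from_p1 in mask]
    A = [list(pick[i]["A"][i]) for i in range(rank)]
    B = [[pick[i]["B"][r][i] for i in range(rank)]
         for r in range(len(p1["B"]))]
    return {"A": A, "B": B}

def delta_w(adapter):
    """Dense weight update B @ A (d_out x d_in)."""
    A, B = adapter["A"], adapter["B"]
    return [
        [sum(B[r][k] * A[k][c] for k in range(len(A)))
         for c in range(len(A[0]))]
        for r in range(len(B))
    ]

rng = random.Random(0)
parent_1 = make_adapter(rng, d_out=5, d_in=4, rank=3)
parent_2 = make_adapter(rng, d_out=5, d_in=4, rank=3)
child = mutate(crossover(parent_1, parent_2, rng), rng)
```

Both operators only touch the small factor matrices, which is consistent with the abstract's claim that new population members can be produced in seconds, though the actual operators in the paper may differ.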

An OpenAI model has disproved a central conjecture in discrete geometry

https://openai.com/index/model-disproves-discrete-geometry-conjecture/
461•tedsanders•2h ago•297 comments

GitHub confirms breach of 3,800 repos via malicious VSCode extension

https://www.bleepingcomputer.com/news/security/github-confirms-breach-of-3-800-repos-via-maliciou...
298•Timofeibu•8h ago•95 comments

Google Declaring War on the Web

https://tante.cc/2026/05/20/on-google-declaring-war-on-the-web/
34•cdrnsf•25m ago•4 comments

Flipper One Tech Specs

https://docs.flipper.net/one/general/tech-specs
143•gregsadetsky•3h ago•54 comments

Not alive, but not dead: disembodied human brains used for drug testing

https://www.science.org/content/article/not-alive-not-dead-disembodied-human-brains-used-drug-tes...
69•Timofeibu•2h ago•45 comments

How fast is N tokens per second really?

https://mikeveerman.github.io/tokenspeed/
229•hexagr•2d ago•56 comments

Starship's Twelfth Flight Test

https://www.spacex.com/launches/starship-flight-12
9•pantalaimon•17m ago•0 comments

Qwen3.7-Max: The Agent Frontier

https://qwen.ai/blog?id=qwen3.7
559•kevinsimper•11h ago•216 comments

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play

https://vmax.ai/team/populora-co-evolving-llm-populations-for-reasoning-self-play
17•AMavorParker•47m ago•1 comment

Why is Inkwell stuck in review

https://www.manton.org/2026/05/19/why-is-inkwell-stuck-in.html
69•speckx•4h ago•23 comments

Qian Xuesen: The missile genius America lost and China gained (2025)

https://www.usni.org/magazines/naval-history/2025/december/missile-genius-america-lost-and-china-...
63•thnaks•4h ago•41 comments

Saying Goodbye to Asm.js

https://spidermonkey.dev/blog/2026/05/20/saying-goodbye-to-asmjs.html
280•eqrion•9h ago•121 comments

SBCL: the ultimate assembly code breadboard (2014)

https://pvk.ca/Blog/2014/03/15/sbcl-the-ultimate-assembly-code-breadboard/
109•yacin•6h ago•6 comments

Map of Metal

https://mapofmetal.com/
372•robin_reala•11h ago•131 comments

Sharla Boehm, the programmer whose code underpins the Internet

https://www.scientificamerican.com/article/the-programmer-whose-code-underpins-the-internet/
72•dxs•2d ago•22 comments

Google's AI is being manipulated. The search giant is quietly fighting back

https://www.bbc.com/future/article/20260519-google-tackles-attempts-to-hack-its-ai-results
228•tigerlily•11h ago•164 comments

Incident Report: May 19, 2026 – GCP Account Suspension

https://blog.railway.com/p/incident-report-may-19-2026-gcp-account-outage
344•0xedb•13h ago•208 comments

Tracking Starbucks' 'widely recyclable' cups: none ended up at recycling

https://www.beyondplastics.org/press-releases/starbucks-cups-recyclable-report
153•theanonymousone•3h ago•116 comments

LoRA and Weight Decay (2023)

https://irhum.github.io/blog/lorawd/
22•jxmorris12•1d ago•0 comments

Meta blocks human rights accounts from reaching audiences in Saudi Arabia, UAE

https://www.alqst.org/ar/posts/1190
865•giuliomagnifico•9h ago•367 comments

Formal Verification Gates for AI Coding Loops

https://reubenbrooks.dev/blog/structural-backpressure-beats-smarter-agents/
89•pyrex41•6h ago•20 comments

Étienne Ghys: The Shape of Letters: From Leonardo da Vinci to Donald Knuth

https://www.youtube.com/watch?v=1OIxzewWilc
44•tzury•2d ago•4 comments

Node.js 26.0.0 (Now with Temporal)

https://nodejs.org/en/blog/release/v26.0.0
94•aarestad•3h ago•28 comments

Apparently Google hates us now

https://twitter.com/pokemoncentral/status/2057123807404638250
379•zeitg3ist•5h ago•191 comments

Testing distributed systems with AI agents

https://github.com/shenli/distributed-system-testing
69•shenli3514•7h ago•10 comments

Handling the great code forge fragmentation

https://www.alexselimov.com/posts/forge_fragmentation/
36•mooreds•3d ago•19 comments

Show HN: Dari-docs – Optimize your docs using parallel coding agents

https://github.com/mupt-ai/dari-docs
7•byhong03•5h ago•1 comment

Japan is gripped by mass allergies. A 1950s project is to blame

https://www.bbc.com/future/article/20260515-the-1950s-blunder-which-causes-mass-hay-fever-in-japan
335•ranit•20h ago•152 comments

When Fast Fourier Transform Meets Transformer for Image Restoration (2024)

https://github.com/deng-ai-lab/SFHformer
74•teleforce•2d ago•7 comments

Tennessee man jailed 37 days for Trump meme wins settlement after lawsuit

https://www.fire.org/news/victory-tennessee-man-jailed-37-days-trump-meme-wins-835000-settlement-...
621•ceejayoz•7h ago•400 comments