GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

https://twitter.com/LakshyAAAgrawal/status/1949867947867984322
2•LakshyAAAgrawal•7h ago

Comments

LakshyAAAgrawal•7h ago
Large language models (LLMs) are increasingly adapted to downstream tasks via reinforcement learning (RL) methods like Group Relative Policy Optimization (GRPO), which often require thousands of rollouts to learn new tasks. We argue that the interpretable nature of language can often provide a much richer learning medium for LLMs, compared with policy gradients derived from sparse, scalar rewards. To test this, we introduce GEPA (Genetic-Pareto), a prompt optimizer that thoroughly incorporates natural language reflection to learn high-level rules from trial and error. Given any AI system containing one or more LLM prompts, GEPA samples system-level trajectories (e.g., reasoning, tool calls, and tool outputs) and reflects on them in natural language to diagnose problems, propose and test prompt updates, and combine complementary lessons from the Pareto frontier of its own attempts. As a result of GEPA's design, it can often turn even just a few rollouts into a large quality gain. Across four tasks, GEPA outperforms GRPO by 10% on average and by up to 20%, while using up to 35x fewer rollouts. GEPA also outperforms the leading prompt optimizer, MIPROv2, by over 10% across two LLMs, and demonstrates promising results as an inference-time search strategy for code optimization.
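
To make the loop described above a bit more concrete, here is a minimal sketch of reflection-driven prompt search with Pareto-based parent selection. It is not the authors' implementation: `pareto_front`, `evolve`, `toy_evaluate`, `toy_reflect`, and `KEYWORDS` are all hypothetical names, the "reflection" step is a stub standing in for an LLM call that reads trajectories, and the merge step that combines complementary lessons from the frontier is left out.

```python
import random
from typing import Callable, List, Sequence

def pareto_front(scores: Sequence[Sequence[float]]) -> List[int]:
    """Indices of candidates that no other candidate dominates.

    scores[i][j] is candidate i's score on task instance j (higher is better).
    """
    def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
        return (all(x >= y for x, y in zip(a, b))
                and any(x > y for x, y in zip(a, b)))

    return [i for i, s in enumerate(scores)
            if not any(dominates(t, s)
                       for k, t in enumerate(scores) if k != i)]

def evolve(seed_prompt: str,
           evaluate: Callable[[str], List[float]],
           reflect: Callable[[str, List[float]], str],
           budget: int,
           rng: random.Random) -> str:
    """Repeatedly pick a frontier prompt, rewrite it, keep it if it improves."""
    candidates = [seed_prompt]
    scores = [evaluate(seed_prompt)]
    for _ in range(budget):
        parent_idx = rng.choice(pareto_front(scores))
        # Assumption: in a real system this call would show an LLM the
        # rollouts and feedback and ask it to propose a revised prompt.
        child = reflect(candidates[parent_idx], scores[parent_idx])
        child_scores = evaluate(child)
        if sum(child_scores) > sum(scores[parent_idx]):
            candidates.append(child)
            scores.append(child_scores)
    best = max(range(len(candidates)), key=lambda i: sum(scores[i]))
    return candidates[best]

# Toy stand-ins (hypothetical): each "task instance" rewards one rule
# being present in the prompt, and "reflection" adds the first missing rule.
KEYWORDS = ["cite sources", "show steps", "be concise"]

def toy_evaluate(prompt: str) -> List[float]:
    return [1.0 if k in prompt else 0.0 for k in KEYWORDS]

def toy_reflect(prompt: str, per_instance: List[float]) -> str:
    for keyword, score in zip(KEYWORDS, per_instance):
        if score == 0.0:
            return f"{prompt} {keyword}."
    return prompt

if __name__ == "__main__":
    print(evolve("Answer the question.", toy_evaluate, toy_reflect,
                 budget=5, rng=random.Random(0)))
```

Selecting parents from the per-instance Pareto front, rather than always taking the single best-scoring prompt, is one way to keep candidates that are strong on different instances alive so their lessons can later be combined. The acceptance rule here (higher total score than the parent) is a simplification chosen for the sketch.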

MetaCPAN's Traffic Crisis: An Eventual Success Story

https://www.perl.com/article/metacpan-traffic-crisis/
1•oalders•1m ago•0 comments

The Math That Predicts Almost Anything [video]

https://www.youtube.com/watch?v=KZeIEiBrT_w
1•mgh2•8m ago•0 comments

WP Cron Pixie v1.5.0 released: Front end switched from Elm to Gleam

https://ianmjones.com/2025/06/wp-cron-pixie-v1-5-0-released-front-end-switched-from-elm-to-gleam/
1•todsacerdoti•17m ago•0 comments

Zig Profiling on Apple Silicon

https://blog.bugsiki.dev/posts/zig-profilers/
2•signa11•18m ago•0 comments

Cyberattack on Russian airline causes the cancellation of more than 100 flights

https://www.politico.com/news/2025/07/28/cyberattack-on-russian-airline-aeroflot-causes-the-cancellation-of-more-than-100-flights-00479963
2•rurp•19m ago•0 comments

Navy Set to Unplug Critical Hurricane Satellites This Week

https://michaelrlowry.substack.com/p/navy-set-to-unplug-critical-hurricane
2•garrettdreyfus•21m ago•1 comment

voyage-context-3: Contextual Retrieval Without the LLM

https://blog.voyageai.com/2025/07/23/voyage-context-3/
1•fzliu•23m ago•0 comments

Polish Train Maker Is Suing the Hackers Who Exposed Its Anti-Repair Tricks

https://www.ifixit.com/News/112008/polish-train-maker-is-suing-the-hackers-who-exposed-its-anti-repair-tricks
5•gnabgib•25m ago•1 comment

NASA worked around 48-year-old Voyager 1's corrupted storage 15B miles away

https://www.ecoportal.net/en/voyager-1-transmitted-messages-nasa/10719/
2•maxloh•25m ago•1 comment

FinTech Dystopia

https://fintechdystopia.com/
8•LasEspuelas•36m ago•1 comment

Compressed Sensing

https://en.wikipedia.org/wiki/Compressed_sensing
2•downboots•36m ago•0 comments

Ask HN: How are you sharing Claude Code Sub Agents?

2•bredren•43m ago•0 comments

Walmart salary data revealed: How much it pays designers, software engineers

https://www.businessinsider.com/walmart-salary-data-revealed-how-much-tech-workers-make-2025-7
4•cebert•44m ago•1 comment

Energy, decarb, geoengineering: interview with climate specialists (Ezra Klein)

https://www.youtube.com/watch?v=vuW4PdhqKmo
2•gsf_emergency_2•52m ago•0 comments

Can China be a defender of free trade

https://instituteofgeoeconomics.org/en/research/2025072203/
1•gsf_emergency_2•54m ago•0 comments

Chinese consumer complaints show widespread padding of car sales

https://www.reuters.com/business/autos-transportation/chinese-consumer-complaints-show-widespread-padding-car-sales-figures-2025-07-28
4•mhga•55m ago•0 comments

Evolutionary continuity in social dominance: Insights from primate tractography

https://www.jneurosci.org/content/early/2025/07/09/JNEUROSCI.1646-24.2025
1•PaulHoule•56m ago•0 comments

Cheyenne to host AI data center using more electricity than all Wyoming homes

https://apnews.com/article/ai-artificial-intelligence-data-center-electricity-wyoming-cheyenne-44da7974e2d942acd8bf003ebe2e855a
2•petethomas•59m ago•0 comments

How do I get a paid internship as a 16yo developer?

2•uint23•1h ago•6 comments

Project Lumen: Artificial Sunrise Simulation and Voice Assistant (2022)

https://www.youtube.com/watch?v=g5WBZYqh060
1•guiambros•1h ago•0 comments

'Quincaillerie' Is French for 'Hardware Store,' but It Means So Much More

https://www.nytimes.com/2025/07/17/travel/quincaillerie-french-hardware-store.html
2•bookofjoe•1h ago•1 comment

PlainApp: Android app to securely manage your phone from a web browser

https://github.com/ismartcoding/plain-app
1•thunderbong•1h ago•0 comments

CEOs Are Shrinking Their Workforces–and They Couldn't Be Prouder

https://www.wsj.com/lifestyle/careers/layoff-business-strategy-reduce-staff-11796d66
4•cebert•1h ago•0 comments

Cling – Instant fuzzy find any file on macOS

https://github.com/FuzzyIdeas/Cling
2•mickelsen•1h ago•0 comments

AI Is Wrecking a Fragile Job Market for College Graduates

https://www.wsj.com/lifestyle/careers/ai-entry-level-jobs-graduates-b224d624
22•alephnerd•1h ago•5 comments

Mighty Memoirs – Children memory app

https://mightymemoirs.com/
1•adamattic•1h ago•0 comments

CSS Hyphens, Words, Syllables, and Languages

https://blog.frankmtaylor.com/2025/07/17/css-hyphens-words-syllables-and-languages/
1•eustoria•1h ago•0 comments

Identity and Behaviour

https://ismaelcelis.com/posts/2025-07-identity-and-behaviour/
2•todsacerdoti•1h ago•0 comments

Lisuan 7G106 runs Chinese AAA titles at 4K over 70 FPS and matches RTX 4060

https://www.tomshardware.com/pc-components/gpus/china-advances-toward-tech-independence-with-new-homegrown-6nm-gaming-and-ai-gpus-lisuan-7g106-runs-chinese-aaa-titles-at-4k-over-70-fps-and-matches-rtx-4060-in-synthetic-benchmarks
3•rguiscard•1h ago•0 comments

CDC Ties 85 Cases of THC-Related Symptoms to Wisconsin Restaurant

https://www.nytimes.com/2025/07/28/well/wisconsin-restaurant-thc-poisoning.html
3•wslh•1h ago•1 comment