


GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

https://twitter.com/LakshyAAAgrawal/status/1949867947867984322

2•LakshyAAAgrawal•6mo ago

Comments

LakshyAAAgrawal•6mo ago

Large language models (LLMs) are increasingly adapted to downstream tasks via reinforcement learning (RL) methods like Group Relative Policy Optimization (GRPO), which often require thousands of rollouts to learn new tasks. We argue that the interpretable nature of language can often provide a much richer learning medium for LLMs, compared with policy gradients derived from sparse, scalar rewards. To test this, we introduce GEPA (Genetic-Pareto), a prompt optimizer that thoroughly incorporates natural language reflection to learn high-level rules from trial and error. Given any AI system containing one or more LLM prompts, GEPA samples system-level trajectories (e.g., reasoning, tool calls, and tool outputs) and reflects on them in natural language to diagnose problems, propose and test prompt updates, and combine complementary lessons from the Pareto frontier of its own attempts. As a result of GEPA's design, it can often turn even just a few rollouts into a large quality gain. Across four tasks, GEPA outperforms GRPO by 10% on average and by up to 20%, while using up to 35x fewer rollouts. GEPA also outperforms the leading prompt optimizer, MIPROv2, by over 10% across two LLMs, and demonstrates promising results as an inference-time search strategy for code optimization.
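To make the "Pareto frontier of its own attempts" idea concrete, here is a minimal sketch of how candidate prompts might be filtered down to the non-dominated ones, given a score per task instance. This is not GEPA's actual implementation: the function name `pareto_frontier`, the candidate names, and the scores are all hypothetical, and the selection rule in the paper may differ.

```python
from typing import Dict, List


def pareto_frontier(scores: Dict[str, List[float]]) -> List[str]:
    """Return the candidates that no other candidate dominates.

    Candidate A dominates candidate B when A scores at least as well as B
    on every task and strictly better on at least one.
    """

    def dominates(a: List[float], b: List[float]) -> bool:
        at_least_as_good = all(x >= y for x, y in zip(a, b))
        strictly_better = any(x > y for x, y in zip(a, b))
        return at_least_as_good and strictly_better

    return [
        name
        for name, own in scores.items()
        if not any(
            dominates(other, own)
            for other_name, other in scores.items()
            if other_name != name
        )
    ]


# Hypothetical per-task scores for three candidate prompts (illustrative only).
candidate_scores = {
    "prompt_a": [0.9, 0.2, 0.5],
    "prompt_b": [0.4, 0.8, 0.5],
    "prompt_c": [0.3, 0.2, 0.4],
}

# prompt_c is dominated by prompt_a, so only prompt_a and prompt_b remain.
frontier = pareto_frontier(candidate_scores)
print(frontier)
```

Keeping every non-dominated candidate, rather than only the one with the best average, preserves prompts that are strong on different tasks, which is what would let "complementary lessons" be combined later.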