The API is deliberately minimal. You provide what to optimize and how to measure it:
```
import gepa.optimize_anything as oa

def evaluate(candidate: str) -> tuple[float, dict]:
    result = run_my_system(candidate)
    return result.score, {"error": result.stderr, "runtime": f"{result.time_ms}ms"}

result = oa.optimize_anything(
    seed_candidate="<your artifact>",
    evaluator=evaluate,
)
```
The evaluator returns a score plus diagnostic feedback (we call it "Actionable Side Information" — stack traces, rendered images, profiler output, whatever helps diagnose failures). An LLM proposer reads this feedback during a reflection step and proposes targeted fixes, not blind mutations. Candidates are selected via a Pareto frontier across metrics/examples, so a candidate that's best at one thing survives even if its average is mediocre.
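For intuition about the Pareto selection step, here is a minimal, self-contained sketch of non-dominated selection over per-example scores. It is illustrative only, not the library's internal implementation:

```
# Illustrative only: keep every candidate that no other candidate dominates.
# "Dominates" = at least as good on every example and strictly better on one.

def pareto_frontier(candidates: dict[str, list[float]]) -> list[str]:
    def dominates(a: list[float], b: list[float]) -> bool:
        return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

    return [
        name
        for name, scores in candidates.items()
        if not any(
            dominates(other, scores)
            for other_name, other in candidates.items()
            if other_name != name
        )
    ]

# A candidate that is best on even one example survives, even if its mean is low.
print(pareto_frontier({
    "specialist": [1.0, 0.2, 0.1],  # best on example 1 only -> kept
    "generalist": [0.6, 0.6, 0.6],  # kept
    "weak":       [0.5, 0.5, 0.5],  # dominated by "generalist" -> pruned
}))  # -> ['specialist', 'generalist']
```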
Two ideas distinguish this from AlphaEvolve/OpenEvolve/ShinkaEvolve-style LLM evolution: (1) diagnostic feedback is a first-class API concept rather than a framework-specific mechanism, and (2) the API unifies three optimization modes — single-task search (solve one hard problem), multi-task search (solve related problems with cross-transfer), and generalization (build artifacts that transfer to unseen inputs). Prior frameworks only express mode 1.
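For a rough sense of how the three modes differ from the evaluator's point of view, here is an illustrative sketch. The helpers and signatures below are hypothetical, not the gepa API; `run_my_system(candidate, task)` stands in for your own harness:

```
# Hypothetical sketch, not the gepa API: what the evaluator sees in each mode.

def evaluate_on_task(candidate: str, task: str) -> tuple[float, dict]:
    # Same shape as the evaluator above, but parameterized by a task instance.
    result = run_my_system(candidate, task)
    return result.score, {"error": result.stderr}

def scores_per_task(candidate: str, tasks: list[str]) -> list[float]:
    # Mode 1 (single-task): tasks holds one fixed problem instance.
    # Mode 2 (multi-task): tasks holds related instances; per-task scores let an
    #   improvement found on one instance transfer to the others.
    # Mode 3 (generalization): optimize against a training set of tasks, then
    #   judge the final candidate only on held-out tasks it never saw.
    return [evaluate_on_task(candidate, t)[0] for t in tasks]
```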
We tested across 8 domains. Selected results:
- Coding agent skills: Learned repo-specific skills push Claude Code to near-perfect task completion and make it 47% faster
- Cloud scheduling: Discovered algorithms that cut costs 40%, topping the ADRS leaderboard over expert heuristics and other LLM-evolution frameworks
- Agent architecture: Evolved a 10-line stub into a 300+ line ARC-AGI agent, improving Gemini Flash from 32.5% → 89.5%
- Circle packing (n=26): Outperforms AlphaEvolve's published solution
- Blackbox optimization: Generated problem-specific solvers matching or exceeding Optuna across 56 EvalSet problems
- CUDA kernels: 87% match or beat baseline; multi-task mode outperforms dedicated single-task runs
```
pip install gepa
```
Blog with full results and runnable code for all 8 case studies: https://gepa-ai.github.io/gepa/blog/2026/02/18/introducing-o...
GitHub: https://github.com/gepa-ai/gepa