Show HN: OpenEvolve – open-source implementation of DeepMind's AlphaEvolve

8•codelion•8mo ago
I've built an open-source implementation of Google DeepMind's AlphaEvolve system called OpenEvolve. It's an evolutionary coding agent that uses LLMs to discover and optimize algorithms through iterative evolution.

Try it out: https://github.com/codelion/openevolve

What is this?

OpenEvolve evolves entire codebases (not just single functions) by leveraging an ensemble of LLMs combined with automated evaluation. It follows the evolutionary approach described in the AlphaEvolve paper but is fully open source and configurable.

I built this because I wanted to experiment with evolutionary code generation and see if I could replicate DeepMind's results. The original system improved scheduling efficiency in Google's data centers and found new mathematical algorithms, but no implementation was released.

How it works:

The system has four main components that work together in an evolutionary loop:

1. Program Database: Stores programs and their metrics in a MAP-Elites inspired structure

2. Prompt Sampler: Creates context-rich prompts with past solutions

3. LLM Ensemble: Generates code modifications using multiple models

4. Evaluator Pool: Tests programs and provides feedback metrics
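The loop formed by these four components can be sketched roughly as below. This is a toy illustration, not OpenEvolve's actual code: the "programs" are one-line strings of the form `x = <number>`, the "LLMs" are stand-in functions that perturb that number, and every name here (`ProgramDatabase`, `build_prompt`, `make_model`, `evolve`) is hypothetical.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class Program:
    code: str
    metrics: dict


class ProgramDatabase:
    """MAP-Elites-inspired store: keeps the best program per feature cell."""

    def __init__(self):
        self.cells = {}

    def add(self, program):
        # Feature descriptor (assumption): the integer bucket that x falls into.
        cell = math.floor(program.metrics["x"])
        incumbent = self.cells.get(cell)
        if incumbent is None or program.metrics["score"] > incumbent.metrics["score"]:
            self.cells[cell] = program

    def sample(self, rng):
        return rng.choice(list(self.cells.values()))

    def best(self):
        return max(self.cells.values(), key=lambda p: p.metrics["score"])


def evaluate(code):
    """Stand-in evaluator: the score peaks at x = 3."""
    x = float(code.split("=")[1])
    return {"x": x, "score": -(x - 3.0) ** 2}


def build_prompt(parent):
    """Stand-in prompt sampler: parent score as context, parent code last."""
    return f"# parent score: {parent.metrics['score']:.4f}\n{parent.code}"


def make_model(step):
    """Stand-in for an LLM: rewrites the parent by nudging x."""
    def model(prompt, rng):
        x = float(prompt.splitlines()[-1].split("=")[1])
        return f"x = {x + rng.gauss(0.0, step)}"
    return model


def evolve(iterations=200, seed=0):
    rng = random.Random(seed)
    db = ProgramDatabase()
    initial = "x = 0.0"
    db.add(Program(initial, evaluate(initial)))
    ensemble = [make_model(0.1), make_model(1.0)]  # "small" and "large" edits
    for _ in range(iterations):
        parent = db.sample(rng)                           # program database
        prompt = build_prompt(parent)                     # prompt sampler
        child_code = rng.choice(ensemble)(prompt, rng)    # LLM ensemble
        db.add(Program(child_code, evaluate(child_code))) # evaluator
    return db.best()
```

Calling `evolve()` returns the best program found. Because each cell only ever replaces its incumbent with a better-scoring program, the best score can never fall below that of the initial program.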

What you can do with it:

- Run existing examples to see evolution in action

- Define your own problems with custom evaluation functions

- Configure LLM backends (works with any OpenAI-compatible API)

- Use multiple LLMs in ensemble for better results

- Optimize algorithms with multiple objectives
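As an illustration of a custom evaluation function that reports several metrics, here is a minimal sketch for the circle packing problem. The function name, its signature, and the metric names are assumptions made for this example and are not taken from OpenEvolve's evaluator interface. The only figure borrowed from the post is the 2.635 target for n=26.

```python
import math


def evaluate_packing(circles, target=2.635, tol=1e-9):
    """Score a packing given as a list of (x, y, r) tuples in the unit square.

    Returns several metrics so an optimizer can weigh more than one objective.
    """
    invalid = {"valid": 0.0, "sum_radii": 0.0, "target_ratio": 0.0}

    # Every circle must have a positive radius and lie inside the unit square.
    for x, y, r in circles:
        if r <= 0 or x - r < -tol or x + r > 1 + tol or y - r < -tol or y + r > 1 + tol:
            return invalid

    # No two circles may overlap (touching is allowed within the tolerance).
    for i in range(len(circles)):
        xi, yi, ri = circles[i]
        for j in range(i + 1, len(circles)):
            xj, yj, rj = circles[j]
            if math.hypot(xi - xj, yi - yj) < ri + rj - tol:
                return invalid

    total = sum(r for _, _, r in circles)
    return {"valid": 1.0, "sum_radii": total, "target_ratio": total / target}
```

For example, four circles of radius 0.25 centred at the quarter points of the square are valid and give a radius sum of 1.0, while two circles sharing a centre are rejected.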

Two examples I've replicated from the AlphaEvolve paper:

- Circle Packing: Evolved from simple geometric patterns to sophisticated mathematical optimization, reaching 99.97% of DeepMind's reported results (2.634 vs 2.635 sum of radii for n=26).

- Function Minimization: Transformed a random search into a complete simulated annealing algorithm with cooling schedules and adaptive step sizes.
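For readers unfamiliar with the end point of the function minimization example, the following is a minimal sketch of simulated annealing with a geometric cooling schedule and an adaptive step size. It is not the program OpenEvolve evolved. The parameter names, default values, and the step-adaptation rule are assumptions chosen for illustration.

```python
import math
import random


def simulated_annealing(f, x0, bounds, iterations=5000, t0=1.0, cooling=0.999, seed=0):
    """Minimize a 1-D function f on [lo, hi]; returns (best_x, best_value)."""
    rng = random.Random(seed)
    lo, hi = bounds
    x, fx = x0, f(x0)
    best_x, best_fx = x, fx
    temp = t0
    step = 0.1 * (hi - lo)  # initial proposal width (assumption)

    for _ in range(iterations):
        # Propose a neighbour and clamp it to the search interval.
        cand = min(hi, max(lo, x + rng.gauss(0.0, step)))
        fc = f(cand)
        delta = fc - fx

        # Always accept improvements; accept worse moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            x, fx = cand, fc
            step = min(step * 1.05, hi - lo)  # widen after an accepted move
            if fx < best_fx:
                best_x, best_fx = x, fx
        else:
            step = max(step * 0.95, 1e-6)     # narrow after a rejection

        temp = max(temp * cooling, 1e-12)     # geometric cooling schedule

    return best_x, best_fx
```

A call such as `simulated_annealing(lambda x: (x - 2) ** 2 + 1, x0=-4.0, bounds=(-5.0, 5.0))` returns the best point seen, which by construction is never worse than the starting point.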

Technical insights:

- Low-latency LLMs are critical for rapid generation cycles

- Best results using Gemini-Flash-2.0-lite + Gemini-Flash-2.0 as the ensemble

- For the circle packing problem, Gemini-Flash-2.0 + Claude-Sonnet-3.7 performed best

- Cerebras AI's API provided the fastest inference speeds

- Two-phase approach (exploration then exploitation) worked best for complex problems
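One way to picture the two-phase idea is as a change in how parents are picked over the course of a run. The sketch below is only an assumption about how such a schedule could look: the switch point, the probabilities, and the function names are all hypothetical and do not describe how OpenEvolve configures its phases.

```python
import random

# Probability of picking the current best program as the parent (hypothetical values).
ELITE_PROBABILITY = {"explore": 0.2, "exploit": 0.8}


def phase_for(iteration, total_iterations, explore_fraction=0.5):
    """Return 'explore' for the first part of the run, 'exploit' afterwards."""
    if iteration < total_iterations * explore_fraction:
        return "explore"
    return "exploit"


def select_parent(population, phase, rng):
    """population is a list of (program_id, score) pairs.

    During exploration most parents are drawn uniformly at random,
    which keeps diversity high; during exploitation the best-scoring
    program is chosen most of the time.
    """
    if rng.random() < ELITE_PROBABILITY[phase]:
        return max(population, key=lambda p: p[1])
    return rng.choice(population)
```

With 100 iterations and the default split, iterations 0 to 49 run in the exploration phase and iterations 50 to 99 in the exploitation phase.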

Getting started (takes < 2 minutes)

# Clone and install
git clone https://github.com/codelion/openevolve.git
cd openevolve
pip install -e .

# Run the function minimization example
python openevolve-run.py \
  examples/function_minimization/initial_program.py \
  examples/function_minimization/evaluator.py \
  --config examples/function_minimization/config.yaml \
  --iterations 50

All you need is Python 3.9+ and an API key for an LLM service. Configuration is done through simple YAML files.

I'll be around to answer questions and discuss!

Comments

codelion•8mo ago
I actually managed to replicate the new SOTA for circle packing found in the AlphaEvolve paper: 2.635 for 26 circles in a unit square. It took about 800 iterations to find the best program, which itself uses an optimisation phase; running it led to the optimal packing in one of its runs.
helsinki•8mo ago
How many tokens did it take to generate the 800 versions of the code?
codelion•8mo ago
Checked my OpenRouter stats: it took ~3M tokens, but that involved quite a few runs of various experiments.
