frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: OpenEvolve – open-source implementation of DeepMind's AlphaEvolve

8•codelion•1y ago
I've built an open-source implementation of Google DeepMind's AlphaEvolve system called OpenEvolve. It's an evolutionary coding agent that uses LLMs to discover and optimize algorithms through iterative evolution.

Try it out: https://github.com/codelion/openevolve

What is this?

OpenEvolve evolves entire codebases (not just single functions) by leveraging an ensemble of LLMs combined with automated evaluation. It follows the evolutionary approach described in the AlphaEvolve paper but is fully open source and configurable.

I built this because I wanted to experiment with evolutionary code generation and see if I could replicate DeepMind's results. The original system successfully improved Google's data centers and found new mathematical algorithms, but no implementation was released.

How it works:

The system has four main components that work together in an evolutionary loop:

1. Program Database: Stores programs and their metrics in a MAP-Elites inspired structure

2. Prompt Sampler: Creates context-rich prompts with past solutions

3. LLM Ensemble: Generates code modifications using multiple models

4. Evaluator Pool: Tests programs and provides feedback metrics

What you can do with it:

- Run existing examples to see evolution in action

- Define your own problems with custom evaluation functions

- Configure LLM backends (works with any OpenAI-compatible API)

- Use multiple LLMs in ensemble for better results

- Optimize algorithms with multiple objectives

Two examples I've replicated from the AlphaEvolve paper:

- Circle Packing: Evolved from simple geometric patterns to sophisticated mathematical optimization, reaching 99.97% of DeepMind's reported results (2.634 vs 2.635 sum of radii for n=26).

- Function Minimization: Transformed a random search into a complete simulated annealing algorithm with cooling schedules and adaptive step sizes.

Technical insights:

- Low latency LLMs are critical for rapid generation cycles

- Best results using Gemini-Flash-2.0-lite + Gemini-Flash-2.0 as the ensemble

- For the circle packing problem, Gemini-Flash-2.0 + Claude-Sonnet-3.7 performed best

- Cerebras AI's API provided the fastest inference speeds

- Two-phase approach (exploration then exploitation) worked best for complex problems

Getting started (takes < 2 minutes)

# Clone and install

git clone https://github.com/codelion/openevolve.git

cd openevolve

pip install -e .

# Run the function minimization example

python openevolve-run.py

examples/function_minimization/initial_program.py \

  examples/function_minimization/evaluator.py \

  --config examples/function_minimization/config.yaml \

  --iterations 50
All you need is Python 3.9+ and an API key for an LLM service. Configuration is done through simple YAML files.

I'll be around to answer questions and discuss!

Comments

codelion•1y ago
I actually managed to replicate the new SOTA for circle packing in unit squares as found in the alphaevole paper - 2.635 for 26 circles in a unit square. Took about 800 iterations to find the best program which itself uses an optimisation phase and running it lead to the optimal packaging in one of its runs.
helsinki•1y ago
How many tokens did it take to generate the 800 versions of the code?
codelion•1y ago
Checked my openrouter stats, it took ~3M tokens but that involved quite a few runs of various experiments.

How to scan for vulnerabilities with GitHub Security Lab's AI-powered framework

https://github.blog/security/how-to-scan-for-vulnerabilities-with-github-security-labs-open-sourc...
1•EFLKumo•54s ago•0 comments

Out of Band, Not Out of Prompt: Intent Verification for Agentic Tool Calls

https://hyperautomation.substack.com/p/out-of-band-not-out-of-prompt-intent
1•hevalon•2m ago•0 comments

GPT Guesses Between 1 and 100

https://github.com/exmergo/research-chatgpt-guesses-between-1-and-100
1•adunk•2m ago•0 comments

God and LLMs

https://calnewport.com/on-god-and-llms/
1•ahamez•3m ago•0 comments

Researchers identify people through ordinary Wi-Fi routers with 99.5% accuracy

https://www.tomshardware.com/tech-industry/researchers-identify-people-through-ordinary-wi-fi-rou...
1•giuliomagnifico•5m ago•0 comments

How to Enter Side Doors

https://velvetnoise.substack.com/p/how-to-enter-side-doors
1•eigenBasis•5m ago•0 comments

It's like the Olympics – except steroids are allowed

https://www.bbc.com/news/articles/cedpz1zqp8po
1•ranit•7m ago•0 comments

Show HN: I built a tool that finds unused Prometheus metrics

https://github.com/dominikhei/cardamon
1•dhei123•7m ago•0 comments

What Are You Reading?

1•wompapumpum•8m ago•0 comments

Switching to Colemak

https://pta2002.com/blog/colemak/
1•xngbuilds•11m ago•0 comments

Advanced C++ Optimization Techniques for High-Performance Applications

https://medium.com/@martin00001313/advanced-c-optimization-techniques-for-high-performance-applic...
1•rramadass•12m ago•1 comments

Kiewit-built Key Bridge could have cost $9B

https://www.thebanner.com/economy/key-bridge-kiewit-9-billion-GK4BLGATPRHYXIEIZLG5PUNSKQ/
1•hnthrowaway0315•12m ago•0 comments

Riz Ahmed says UK spies tried to recruit him on three occasions

https://www.theguardian.com/culture/2026/may/24/riz-ahmed-says-uk-spies-tried-to-recruit-him-on-t...
1•bookofjoe•13m ago•0 comments

Show HN: Grizzlars – High Performance DataFrame to Compete with Polars

https://github.com/NavodPeiris/grizzlars
1•NavodPeiris•14m ago•0 comments

Seeing Around Corners Using Smartphone-Grade Lidar

https://spectrum.ieee.org/smartphone-grade-lidar
1•marc__1•15m ago•0 comments

We Shortened Development Feedback Loops from 30M to 30s

https://engineering.monday.com/how-we-shortened-development-feedback-loops-from-30m-to-30s/
1•aviramha•19m ago•1 comments

Does anyone else find Hacker News visually exhausting?

https://nodus-ai.app/hn-radar
1•m_m_carvalho•19m ago•3 comments

AI Model Idle Game: I made this for friends don't know how AI industry works

https://game.trae.academy/play
1•haebom•19m ago•1 comments

Thunderbolt vs. USB-C: what the connector hides

https://www.whatcable.uk/blog/thunderbolt-vs-usb-c
1•sleepingNomad•21m ago•0 comments

Paper Airplane Designs

https://www.foldnfly.com/
2•brianzelip•21m ago•1 comments

AI turning software building into cultural arbitrage

https://xcancel.com/levelsio/status/2058196816877797888
1•thoughtpeddler•22m ago•0 comments

'Wordle': One Year Later (2023)

https://www.gdcvault.com/play/1029425/-Wordle-One-Year
1•Michelangelo11•23m ago•0 comments

Why usage-based hosting bills creep up over time

https://hostim.dev/blog/usage-based-pricing-creep/
1•pv1337•23m ago•0 comments

Bun team is rewriting SIMD from Rust to C++

https://github.com/oven-sh/bun/pull/31351
2•impoppy•24m ago•2 comments

America's plutonium puzzle: from cold war relics to AI ambitions

https://nationalinterest.org/blog/energy-world/americas-plutonium-puzzle-from-cold-war-relics-to-...
2•leonidasrup•26m ago•0 comments

Who Buys Custom Silicon?

https://www.youtube.com/watch?v=nf-4YGZp998
1•johncole•30m ago•0 comments

Satlas: Real-time space situational awareness

https://satlas.app/
2•jonbaer•30m ago•0 comments

6502 Emulator Runs 1 Instruction/S (Written in Markdown, Running in an LLM)

https://dunkels.com/adam/llm-6502-emulator/
1•adunk•33m ago•1 comments

Show HN: Peakedin – archiving LinkedIn's most unhinged posts as satire

https://peakedin.capyfind.com/
1•lirena00•34m ago•0 comments

The Genius of Spencer Pratt's Campaign – Part 1

https://twitter.com/AmericanDebunk/status/2056555463466967457
1•bilsbie•36m ago•0 comments