Show HN: AgentCommander - workflow engine for evolutionary code optimization

https://github.com/mx-Liu123/AgentCommander

2•mx-Liu123•2w ago

Comments

mx-Liu123•2w ago

I built AgentCommander to automate the manual "trial-and-error" loops in my PhD Physics/ML research.

While tools like OpenEvolve (population evolution) and RD-Agent (Kaggle-style automation) exist, I found them difficult to customize for specific, multi-step research workflows. I needed a system that allowed granular control over the agent's decision process—specifically, how it learns from errors and inherits code states.

AgentCommander solves this by providing:

Visual Graph Execution: Workflows are defined as directed graphs, allowing for complex loops, conditional branches, and human-in-the-loop checkpoints.

Evolutionary Tree Tracking: It treats every iteration as a node in a tree. The agent automatically branches off the current "global optimum" rather than a linear history, preventing regression.

Snapshot Integrity: To prevent LLM hallucination or "cheating" (e.g., modifying test cases), the system uses filesystem snapshots to enforce strict read-only permissions on evaluation logic.

Native CLI Wrapper: Built on top of Gemini/Qwen CLI to leverage their native tool-use capabilities while enforcing a sandboxed working directory.

The project is open source (Apache 2.0) and written in Python.

Repo: https://github.com/mx-Liu123/AgentCommander

mx-Liu123•2w ago

Author's Note:

A few technical details for those looking to try AgentCommander:

Why Gemini/Qwen CLI?: I chose these as backends because they offer robust directory isolation. I tried integrating Claude Code, but found it difficult to restrict its file-system reach. Qwen CLI is a great alternative if you want an OpenAI-compatible API with a generous free tier (2,000 requests/day).

Environment: Ensure you have Python 3.10+ and the latest Node.js for the Gemini CLI. If you see Node version warnings, please upgrade to the latest LTS to avoid CLI instability.

Verification: You can audit the agent's "thought process" by running gemini -r inside any generated experiment directory. It’s crucial for verifying that the agent isn't hallucinating its research logic.

I'm currently in Singapore (SGT). I'll stay online for as long as I can to discuss architecture or implementation details, but I'll catch up on all pending questions first thing in the morning!

Repo: https://github.com/mx-Liu123/AgentCommander

Show HN: Deterministic NDJSON audit logs – v1.2 update (structural gaps)

The Greater Copenhagen Region could be your friend's next career move

Do Not Confirm – Fiction by OpenClaw

The Analytical Profile of Peas

Hallucinations in GPT5 – Can models say "I don't know" (June 2025)

What AI is good for, according to developers

OpenAI might pivot to the "most addictive digital friend" or face extinction

Show HN: Know how your SaaS is doing in 30 seconds

ClawdBot Ordered Me Lunch

What the News media thinks about your Indian stock investments

Running Lua on a tiny console from 2001

Google and Microsoft Paying Creators $500K+ to Promote AI Tools

New filtration technology could be game-changer in removal of PFAS

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

Kinda Surprised by Seadance2's Moderation

I Write Games in C (yes, C)

Django scales. Stop blaming the framework (part 1 of 3)

Malwarebytes Is Now in ChatGPT

Thoughts on the job market in the age of LLMs

Show HN: Stacky – certain block game clone

AIII: A public benchmark for AI narrative and political independence

SectorC: A C Compiler in 512 bytes

The API Is a Dead End; Machines Need a Labor Economy

Digital Iris [video]

New wave of GLP-1 drugs is coming–and they're stronger than Wegovy and Zepbound

Convert tempo (BPM) to millisecond durations for musical note subdivisions

Show HN: Tasty A.F. - Use AI to Create Printable Recipe Cards

The Contagious Taste of Cancer

U.S. Jobs Disappear at Fastest January Pace Since Great Recession

Bithumb mistakenly hands out $195M in Bitcoin to users in 'Random Box' giveaway