frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: AgentCommander - workflow engine for evolutionary code optimization

https://github.com/mx-Liu123/AgentCommander
1•mx-Liu123•1h ago

Comments

mx-Liu123•1h ago
I built AgentCommander to automate the manual "trial-and-error" loops in my PhD Physics/ML research.

While tools like OpenEvolve (population evolution) and RD-Agent (Kaggle-style automation) exist, I found them difficult to customize for specific, multi-step research workflows. I needed a system that allowed granular control over the agent's decision process—specifically, how it learns from errors and inherits code states.

AgentCommander solves this by providing:

Visual Graph Execution: Workflows are defined as directed graphs, allowing for complex loops, conditional branches, and human-in-the-loop checkpoints.

Evolutionary Tree Tracking: It treats every iteration as a node in a tree. The agent automatically branches off the current "global optimum" rather than a linear history, preventing regression.

Snapshot Integrity: To prevent LLM hallucination or "cheating" (e.g., modifying test cases), the system uses filesystem snapshots to enforce strict read-only permissions on evaluation logic.

Native CLI Wrapper: Built on top of Gemini/Qwen CLI to leverage their native tool-use capabilities while enforcing a sandboxed working directory.

The project is open source (Apache 2.0) and written in Python.

Repo: https://github.com/mx-Liu123/AgentCommander

mx-Liu123•1h ago
Author's Note:

A few technical details for those looking to try AgentCommander:

Why Gemini/Qwen CLI?: I chose these as backends because they offer robust directory isolation. I tried integrating Claude Code, but found it difficult to restrict its file-system reach. Qwen CLI is a great alternative if you want an OpenAI-compatible API with a generous free tier (2,000 requests/day).

Environment: Ensure you have Python 3.10+ and the latest Node.js for the Gemini CLI. If you see Node version warnings, please upgrade to the latest LTS to avoid CLI instability.

Verification: You can audit the agent's "thought process" by running gemini -r inside any generated experiment directory. It’s crucial for verifying that the agent isn't hallucinating its research logic.

I'm currently in Singapore (SGT). I'll stay online for as long as I can to discuss architecture or implementation details, but I'll catch up on all pending questions first thing in the morning!

Repo: https://github.com/mx-Liu123/AgentCommander

I Stopped Creating Package.json Scripts

https://benhouston3d.com/blog/stopped-creating-package-json-scripts
1•bhouston•42s ago•0 comments

Show HN: I built an AI video editor around scenes, not timelines

https://www.roanot.com/app/demo/de745846-87e2-4861-88f2-b91fa8f68a55
1•Vagantem•1m ago•1 comments

Scheme implementation as O'Reilly book via Claude Code

https://ezzeriesa.notion.site/Scheme-implementation-as-O-Reilly-book-via-Claude-Code-2ee1308b4204...
1•kurinikku•1m ago•0 comments

Show HN: Osprey API Tester – VS Code API Testing from NestJS Controllers/DTOs

https://github.com/jeremi-24/osprey-api-tester
1•jeremi-24•1m ago•0 comments

Dynamic Load Balancer in Intel Xeon Scalable Processor

https://danglingpointers.substack.com/p/dynamic-load-balancer-in-intel-xeon
1•blakepelton•3m ago•0 comments

Agentic Code Reviewer

https://github.com/richhaase/agentic-code-reviewer
1•richhhh•3m ago•1 comments

Show HN: D-engine – Embeddable Raft consensus for Rust

https://github.com/deventlab/d-engine
1•joshuachi•3m ago•0 comments

Show HN: cm – a TUI to monitor multiple Docker container logs side-by-side

https://github.com/rehrumesh/cm
2•rehrumesh•6m ago•0 comments

Tell HN: What's Happening in Minnesota

6•throwawayforice•7m ago•0 comments

Show HN: APIsec MCP Audit – Audit what your AI agents can access

https://github.com/apisec-inc/mcp-audit
1•rajaramr7•7m ago•0 comments

Show HN: Run4ever – a browser-based long-term running progression game

https://run4ever.win
1•marcosme•8m ago•0 comments

Brush.Q: An Articulated Ground Mobile Robot with Compliant Brush-Like Wheels

https://www.mdpi.com/2218-6581/15/1/3
1•PaulHoule•8m ago•0 comments

11-year streak of record global warming continues

https://news.un.org/en/story/2026/01/1166758
2•yusufaytas•9m ago•0 comments

Creating virtual block devices with ublk

https://jpospisil.com/posts/2026-01-13-creating-virtual-block-devices-with-ublk
1•jiripospisil•11m ago•0 comments

Ask HN: Would you trust a new browser security extension in 2025?

1•linklock•11m ago•0 comments

Postgres Serials Should Be Bigint (and How to Migrate)

https://www.crunchydata.com/blog/postgres-serials-should-be-bigint-and-how-to-migrate
1•enz•14m ago•0 comments

Who Owns Your Data?

https://werd.io/who-owns-your-data/
1•benwerd•14m ago•0 comments

Google's AI Overview Has Been Sending Me the Wrong Customers for 6 Months

https://glama.ai/blog/2026-01-20-bad-google-s-ai-overview
2•punkpeye•15m ago•0 comments

AI boom could falter without wider adoption, Microsoft chief Satya Nadella warns

https://www.irishtimes.com/business/2026/01/20/ai-boom-could-falter-without-wider-adoption-micros...
4•cdrnsf•16m ago•1 comments

Parsing Election Results PDFs Using LLMs

https://openelections.net/spsa2026/
1•m-hodges•16m ago•0 comments

Unconventional PostgreSQL Optimizations

https://hakibenita.com/postgresql-unconventional-optimizations
1•haki•17m ago•0 comments

Show HN: Shadow Report – Why your "black box" redactions aren't hiding anything

1•cd_mkdir•17m ago•1 comments

Show HN: Mother MCP – Manage your Agent Skills like a boss-Auto provision skills

https://github.com/dmgrok/mcp_mother_skills
1•DavidGraca•18m ago•0 comments

Minneapolis software engineers mistaken for ICE agents

https://www.foxnews.com/us/minneapolis-software-engineers-mistaken-ice-agents-eating-lunch-harass...
2•DivingForGold•19m ago•0 comments

The Last Algorithm

https://danielmiessler.com/blog/the-last-algorithm
1•zanjani•20m ago•0 comments

Show HN: 8-10x Faster Development with LLM Memory That Persists

https://www.buddhilw.com/posts-output/2026-01-20-hive-mcp/
1•BuddhiLW•21m ago•1 comments

Magnetic nanoparticles fight bone cancer and help healing

https://www.sciencedaily.com/releases/2026/01/260106224627.htm
1•mhb•21m ago•0 comments

Europe could 'weaponize' $10T of US assets over Greenland

https://www.bloomberg.com/news/articles/2026-01-19/-weaponizing-10-trillion-of-us-assets-is-tough...
4•saubeidl•23m ago•1 comments

An ultrathin coating for electronics looked like a miracle insulator

https://theconversation.com/an-ultrathin-coating-for-electronics-looked-like-a-miracle-insulator-...
2•PLenz•23m ago•0 comments

Open source's new mission: Rebuild Europe's tech stack

https://www.theregister.com/2026/01/19/open_sources_new_mission_rebuild/
1•Flundstrom2•24m ago•0 comments