Going Beyond AlphaEvolve in Agent Scientific Discovery

https://arxiv.org/abs/2512.13857
1•kyuksel•2h ago

Comments

kyuksel•2h ago
Google DeepMind’s AlphaEvolve made a key insight clear: agentic AI can act as a team of evolutionary scientists, proposing meaningful algorithm changes inside an evaluation loop. AlphaEvolve and similar methods also share a fundamental limitation. Each mutation overwrites the structure. Earlier variants become inert. Partial improvements cannot be recombined. Credit assignment is global and coarse. Over long horizons, evolution becomes fragile.

I introduce EvoLattice, which removes this limitation by changing the unit of evolution itself. Instead of evolving a single program, EvoLattice evolves an internal population encoded inside one structure. A program (or agent) is represented as a DAG where each node contains multiple persistent alternatives. Every valid path through the graph is executable. Evolution becomes additive, non-destructive, and combinatorial rather than overwrite-based.

We evaluate EvoLattice on NAS-Bench-Suite-Zero, under identical compute and evaluation settings. EvoLattice outperforms AlphaEvolve, achieves higher rank correlation, exhibits lower variance and faster stabilization, and improves monotonically without regression. We further validate generality on training-free optimizer update rule discovery, where EvoLattice autonomously discovers a nonlinear sign–curvature optimizer that significantly outperforms SGD, SignSGD, Lion, and tuned hybrids, using the same primitives and no training.
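To make the representation concrete, here is a minimal Python sketch of a DAG whose nodes each hold several persistent alternatives, with every combination of choices forming an executable path. It is only an illustration of the idea as described in this comment, not EvoLattice's actual code: the `Node` and `Lattice` classes, their method names, and the toy arithmetic operators are all hypothetical.

```python
from dataclasses import dataclass, field
from itertools import product
from typing import Callable, Dict, Iterator, List


@dataclass
class Node:
    # Names of upstream nodes whose outputs feed this node.
    parents: List[str]
    # Persistent alternatives: appended to, never overwritten.
    alternatives: List[Callable[..., float]] = field(default_factory=list)


class Lattice:
    """Toy lattice (hypothetical API): nodes are stored in topological order."""

    def __init__(self) -> None:
        self.nodes: Dict[str, Node] = {}

    def add_node(self, name: str, parents: List[str],
                 alternatives: List[Callable[..., float]]) -> None:
        # Requiring parents to exist first keeps the graph acyclic.
        missing = [p for p in parents if p not in self.nodes]
        if missing:
            raise ValueError(f"unknown parents: {missing}")
        self.nodes[name] = Node(list(parents), list(alternatives))

    def add_alternative(self, name: str, fn: Callable[..., float]) -> None:
        # An additive mutation: existing alternatives stay available.
        self.nodes[name].alternatives.append(fn)

    def paths(self) -> Iterator[Dict[str, int]]:
        # One path = one chosen alternative index per node.
        names = list(self.nodes)
        sizes = [range(len(self.nodes[n].alternatives)) for n in names]
        for choice in product(*sizes):
            yield dict(zip(names, choice))

    def run(self, path: Dict[str, int], x: float) -> float:
        # Evaluate nodes in order; root nodes receive the input x.
        values: Dict[str, float] = {}
        output = x
        for name, node in self.nodes.items():
            fn = node.alternatives[path[name]]
            args = [values[p] for p in node.parents] or [x]
            output = values[name] = fn(*args)
        return output  # output of the last node added


lat = Lattice()
lat.add_node("scale", [], [lambda x: 2 * x, lambda x: 3 * x])
lat.add_node("shift", ["scale"], [lambda s: s + 1])
lat.add_alternative("shift", lambda s: s - 1)  # mutation adds, nothing is lost

outputs = sorted(lat.run(p, 1.0) for p in lat.paths())
print(outputs)
```

Two nodes with two alternatives each give four executable paths, and adding an alternative grows that set multiplicatively while leaving earlier variants in place.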

Why does this matter?

Persistent internal diversity: AlphaEvolve preserves diversity across generations. EvoLattice preserves it inside the program. Strong components never disappear unless explicitly pruned.

Fine-grained credit assignment: Each micro-operator is evaluated across all contexts in which it appears, producing statistics (mean, variance, best-case). AlphaEvolve only sees a single scalar score per program.

Quality–Diversity (QD) without archives: EvoLattice naturally exhibits MAP-Elites-style dynamics (monotonic improvement of elites, a widening gap between best and average, bounded variance), all without external archives or novelty objectives.

Structural robustness: AlphaEvolve relies on the LLM to preserve graph correctness. EvoLattice applies deterministic self-repair after every mutation, removing structural fragility from the loop.
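The fine-grained credit assignment described above could be sketched roughly as follows: each (node, alternative) pair collects the scores of every path it took part in, and is then summarized by mean, variance, and best case. This is an assumption-laden illustration, not the method from the paper: the `alternative_stats` function, the path encoding, and the scores are invented for the example, and "best" assumes higher scores are better.

```python
from collections import defaultdict
from statistics import mean, pvariance
from typing import Dict, Iterable, List, Tuple

# Hypothetical encoding: a path is a tuple of (node_name, alternative_index).
AltKey = Tuple[str, int]
Path = Tuple[AltKey, ...]


def alternative_stats(
    scored_paths: Iterable[Tuple[Path, float]],
) -> Dict[AltKey, Dict[str, float]]:
    """Aggregate path scores per alternative across all paths containing it."""
    by_alt: Dict[AltKey, List[float]] = defaultdict(list)
    for path, score in scored_paths:
        for key in path:
            by_alt[key].append(score)
    return {
        key: {
            "mean": mean(scores),
            "variance": pvariance(scores),  # population variance
            "best": max(scores),            # assumes higher is better
        }
        for key, scores in by_alt.items()
    }


# Made-up scores for the four paths of a two-node, two-alternative lattice.
scored = [
    ((("scale", 0), ("shift", 0)), 0.5),
    ((("scale", 0), ("shift", 1)), 0.75),
    ((("scale", 1), ("shift", 0)), 0.25),
    ((("scale", 1), ("shift", 1)), 1.0),
]
stats = alternative_stats(scored)
print(stats[("shift", 1)])
```

With these numbers, the second `shift` alternative stands out because it scores well in both contexts where it appears, which a single per-program scalar would not reveal.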

AlphaEvolve shows how LLMs can mutate programs. EvoLattice shows what they should evolve: the internal computational fabric, not entire programs. This turns LLM-guided evolution from a fragile rewrite process into a stable, cumulative, QD-driven discovery system. The same framework applies to prompt and agentic workflow evolution. As agent systems grow deeper and more interconnected, overwrite-based evolution breaks down. EvoLattice’s internal population and self-repair make long-horizon agentic evolution feasible and interpretable.

Building Apps for ChatGPT with Apollo MCP Server and Apollo Client

https://www.apollographql.com/blog/building-apps-for-chatgpt-with-apollo-mcp-server-and-apollo-cl...
1•JTech2three•1m ago•1 comments

200 Years Ago: Abel's Resolution of the Quintic Question

https://www.ams.org/journals/notices/202601/noti3264/noti3264.html
1•bikenaga•7m ago•0 comments

Trump Is Doubling Down on His Disastrous A.I. Chip Policy

https://www.nytimes.com/2025/12/17/opinion/trump-ai-chips-nvidia-china.html
3•voxadam•9m ago•1 comments

Peter Higgs: I wouldn't be productive enough for today's academic system

https://www.theguardian.com/science/2013/dec/06/peter-higgs-boson-academic-system
1•firefax•9m ago•0 comments

Why Do We Still Pay for International Calls in 2025?

https://rodyne.com/?p=3293
1•boznz•11m ago•0 comments

Six billionaires who could move markets, policy in 2026

https://nairametrics.com/2025/12/18/six-billionaires-who-could-move-markets-policy-in-2026/
1•kckkmgboji•14m ago•0 comments

DNS as a Filesystem: A Practical Study in Applied Category Theory

https://loss.dev/?node=honk-protocol
1•graemefawcett•16m ago•2 comments

Spaceorbust – Terminal RPG where GitHub commits power space civilization

https://spaceorbust.com
2•zjkramer•20m ago•2 comments

Data Science Weekly – Issue 630

https://datascienceweekly.substack.com/p/data-science-weekly-issue-630
1•sebg•23m ago•0 comments

New AI Tool That Helps with Meta Ads

https://www.audience-plus.com
1•alexTs101•23m ago•1 comments

Trmnl – 2025 in Review

https://usetrmnl.com/blog/2025-in-review
1•MBCook•25m ago•0 comments

Show HN: Roblox Python tower defense game

https://github.com/jackdoe/roblox-python-tower-defense
1•jackdoe•26m ago•0 comments

Fee-based primary care is rapidly rising in US, hastening doctor shortages

https://medicalxpress.com/news/2025-12-fee-based-primary-rapidly-hastening.html
2•bikenaga•29m ago•1 comments

Chemical Hygiene

https://karpathy.bearblog.dev/chemical-hygiene/
2•zdw•35m ago•0 comments

North Korean hackers stole a record $2B of crypto in 2025, Chainalysis says

https://www.coindesk.com/business/2025/12/18/north-korean-hackers-stole-a-record-usd2b-of-crypto-...
4•hhs•35m ago•0 comments

How to Use AI as a Real Software Engineering Tool

https://chat.engineer/p/how-to-use-ai-as-a-real-software-engineering-tool
2•olh•36m ago•0 comments

Show HN: Patch PHPUnit to shard your Laravel test suite

https://github.com/boltci/shards
1•matt413•44m ago•0 comments

Wall Street Ruined the Roomba and Then Blamed Lina Khan

https://www.thebignewsletter.com/p/how-wall-street-ruined-the-roomba
3•danboarder•46m ago•0 comments

Show HN: Infexec – A utility for pinning commands to terminal panes

https://github.com/Software-Deployed/infexec
2•indigophone•47m ago•0 comments

A Testing Conundrum

https://nedbatchelder.com/blog/202512/a_testing_conundrum.html
1•todsacerdoti•48m ago•0 comments

Show HN: CLI tools to browse Claude Code and Codex CLI logs interactively

1•hy_wondercoms•48m ago•0 comments

Show HN: TiliaJS FRP JavaScript/TypeScript/ReScript State Management

https://tiliajs.com
1•indigophone•49m ago•0 comments

Exploring the Swift SDK for Android

https://swift.org/blog/exploring-the-swift-sdk-for-android/
1•frizlab•49m ago•0 comments

Cocktail Distributed Key Generation

https://github.com/C2SP/C2SP/blob/main/cocktail-dkg.md
1•choult•50m ago•0 comments

Prediction Market Investors – Where Do I Find Them?

7•h100ker•51m ago•8 comments

Understanding Encoder and Decoder LLMs

https://magazine.sebastianraschka.com/p/understanding-encoder-and-decoder
1•jeffjeffbear•52m ago•0 comments

Show HN: Squache – A self-hosted HTTPS caching proxy for web scraping

https://github.com/devrupt-io/squache
2•devrupt•54m ago•0 comments

LinkedIn's war against bot scrapers ramps up as AI gets smarter

https://news.bloomberglaw.com/artificial-intelligence/linkedins-war-against-bot-scrapers-ramps-up...
1•hhs•57m ago•0 comments

Once Again, Health Care Proves to Be a Bitter Political Pill for GOP

https://www.nytimes.com/2025/12/18/us/politics/health-care-gop.html
2•duxup•58m ago•5 comments

Show HN: Git repo visualization and interactive stars and commits history

https://git-history.com/
2•rohitghumare•58m ago•0 comments