frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Mathematical Exploration and Discovery at Scale

https://terrytao.wordpress.com/2025/11/05/mathematical-exploration-and-discovery-at-scale/
58•nabla9•2h ago

Comments

piker•54m ago
That was dense but seemed nuanced. Anyone care to summarize for those of us who lack the mathematics nomenclature and context?
qsort•30m ago
I'm not claiming to be an expert, but more or less what the article says is this:

- Context: Terence Tao is one of the best mathematician alive.

- Context: AlphaEvolve is an optimization tool from Google. It differs from traditional tools because the search is guided by an LLM, whose job is to mutate a program written in a normal programming language (they used Python). Hallucinations are not a problem because the LLM is only a part of the optimization loop. If the LLM fucks up, that branch is cut.

- They tested this over a set of 67 problems, including both solved and unsolved ones.

- They find that in many cases AlphaEvolve achieves similar results to what an expert human could do with a traditional optimization software package.

- The main advantages they find are: ability to work at scale, "robustness", i.e. no need to tune the algorithm to work on different problems, better interpretability of results.

- Unsurprisingly, well-known problems likely to be in the training set quickly converged to the best known solution.

- Similarly unsurprisingly, the system was good at "exploiting bugs" in the problem specification. Imagine an underspecified unit test that the system would maliciously comply to. They note that it takes significant human effort to construct an objective function that can't be exploited in this way.

- They find the system doesn't perform as well on some areas of mathematics like analytic number theory. They conjecture that this is because those problems are less amenable to an evolutionary approach.

- In one case they could use the tool to very slightly beat an existing bound.

- In another case they took inspiration from an inferior solution produced by the tool to construct a better (entirely human-generated) one.

It's not doing the job of a mathematician by any stretch of the imagination, but to my (amateur) eye it's very impressive. Google is cooking.

nsoonhui•18m ago
>> If the LLM fucks up, that branch is cut.

Can you explain more on this? How on earth are we supposed to know LLM is hallucinating?

khafra•15m ago
Math is a verifiable domain. Translate a proof into Lean and you can check it in a non-hallucination-vulnerable way.
tux3•9m ago
In this case AlphaEvolve doesn't write proofs, it uses the LLM to write Python code (or any language, really) that produces some numerical inputs to a problem.

They just try out the inputs on the problem they care about. If the code gives better results, they keep it around. They actually keep a few of the previous versions that worked well as inspiration for the LLM.

If the LLM is hallucinating nonsense, it will just produce broken code that gives horrible results, and that idea will be thrown away.

qsort•7m ago
We don't, but the point is that it's only one part of the entire system. If you have a (human-supplied) scoring function, then even completely random mutations can serve as a mechanism to optimize: you generate a bunch, keep the better ones according to the scoring function and repeat. That would be a very basic genetic algorithm.

The LLM serves to guide the search more "intelligently" so that mutations aren't actually random but can instead draw from what the LLM "knows".

stabbles•45m ago
Link to the problems: https://google-deepmind.github.io/alphaevolve_repository_of_...
iNic•19m ago
I didn't know the sofa problem had been resolved. Link for anyone else: https://arxiv.org/abs/2411.19826
muldvarp•9m ago
There seems to be zero reason for anyone to invest any time into learning anything besides trades anymore.

AI will be better than almost all mathematicians in a few years.

andrepd•8m ago
I'm genuinely sorry for anyone with such a crass worldview.

How to declutter, quiet down, and take the AI out of Windows 11 25H2

https://arstechnica.com/gadgets/2025/11/what-i-do-to-clean-up-a-clean-install-of-windows-11-23h2-...
1•oldnetguy•1m ago•0 comments

China delays Shenzhou-20 crew return after suspected space debris impact

https://spacenews.com/china-delays-shenzhou-20-crew-return-after-suspected-space-debris-impact/
1•perihelions•2m ago•0 comments

Playbook for Public Spending Galore

https://thomaslemstrom.substack.com/p/master-of-pivots
1•opportourist•2m ago•0 comments

Spectravideo Computers Get a Big Upgrade

https://hackaday.com/2025/11/05/spectravideo-computers-get-a-big-upgrade/
1•oldnetguy•3m ago•0 comments

DHH and Omarchy: Midlife Crisis

https://blogs.gnome.org/alatiera/2025/11/06/dhh-and-omarchy-midlife-crisis/
2•cheshire_cat•8m ago•0 comments

Women Fear Taking NYC Buses, Another Groping, Attack

https://bronxvoicenyc.blogspot.com/2025/11/bronx-news-mta-bus-assault-female-rider-groped.html
2•NYCNews•13m ago•1 comments

Wealth Taxes Will Barely Slow Inequality. So Why Do the Super-Rich Resist Them?

https://truthout.org/articles/wealth-taxes-will-barely-slow-inequality-so-why-do-the-super-rich-r...
1•robtherobber•14m ago•0 comments

A Note on Fil-C

https://graydon2.dreamwidth.org/320265.html
1•todsacerdoti•15m ago•0 comments

Chinese molten salt reactor achieves conversion of thorium-uranium fuel

https://www.world-nuclear-news.org/articles/chinese-msr-achieves-conversion-of-thorium-uranium-fuel
2•bilsbie•15m ago•0 comments

Wikiboard - A Visual Wikipedia Browser

https://www.wikiboard.org/
1•redbell•15m ago•0 comments

Reeves to hit drivers with pay-per-mile tax

https://www.telegraph.co.uk/politics/2025/11/05/reeves-to-hit-drivers-with-pay-per-mile-tax-in-bu...
1•alexellisuk•16m ago•0 comments

High-speed rail plan – European Commission

https://transport.ec.europa.eu/transport-modes/rail/high-speed-rail-plan_en
1•bpierre•16m ago•0 comments

New gel regrows tooth enamel

https://www.sciencedaily.com/releases/2025/11/251106003151.htm
3•gradus_ad•17m ago•0 comments

In Philadelphia, a young nonprofit buys a century-old magazine

https://www.niemanlab.org/2025/11/in-philadelphia-a-young-nonprofit-buys-a-century-old-magazine/
2•giuliomagnifico•20m ago•0 comments

Arithmetic Models: Better Than You Think

https://entropicthoughts.com/arithmetic-models-better-than-you-think
1•kqr•23m ago•0 comments

Gem.coop Update #1

https://gem.coop/updates/1/
1•todsacerdoti•23m ago•0 comments

China unveils power of thorium reactor for largest cargo ship

https://www.scmp.com/news/china/science/article/3331031/china-unveils-power-thorium-reactor-world...
1•bilsbie•23m ago•0 comments

Failurists - When Things go Arwy [pdf]

https://networkcultures.org/wp-content/uploads/2023/04/Failurists_INC2023_TOD47.pdf
1•jruohonen•25m ago•0 comments

Builders Are Offering Mortgage Rate Discounts. Home Buyers Aren't Biting

https://www.wsj.com/economy/housing/builders-are-offering-mortgage-rate-discounts-home-buyers-are...
1•mousacre•26m ago•1 comments

'Jenga Tower' US Economy Teeters as Middle Class Pulls Back Spending

https://www.bloomberg.com/news/articles/2025-11-06/us-economy-at-risk-of-weakening-with-growing-g...
2•mousacre•29m ago•0 comments

The elusive 'hidden people' of Iceland

https://www.bbc.com/travel/article/20181217-the-elusive-hidden-people-of-iceland
1•svoit•31m ago•0 comments

The Collapse of Reality in Napoli

https://medium.com/luminasticity/the-collapse-of-reality-in-napoli-a3d4f7487f93
2•bryanrasmussen•33m ago•1 comments

The Importance of an Adversary (2017)

https://www.ta-stl.com/blog/the-importance-of-an-adversary
2•wseqyrku•33m ago•0 comments

I860 Intel took a RISC: it did not end well [video]

https://www.youtube.com/watch?v=WTkFGZqVCM8
2•TMWNN•35m ago•0 comments

Show HN: WebPizza – AI/RAG pipeline running in the browser with WebGPU

https://github.com/stramanu/webpizza-ai-poc
1•stramanu•35m ago•0 comments

Show HN: Minimal Portfolio Tracker for Stocks, Crypto, Gold and Funds

https://play.google.com/store/apps/details?id=com.ahmetyildiz.portfoyapp&hl=en_US
1•ahmtyldz•40m ago•0 comments

Show HN: qqqa – a fast, stateless LLM-powered assistant for your shell

https://github.com/matisojka/qqqa
10•iagooar•41m ago•5 comments

Tyrannosaurus Redesign 2018 – Saurian

https://sauriangame.squarespace.com/blog/2018/9/20/tyrannosaurus-redesign-2018
1•maxloh•41m ago•0 comments

First artificial photosynthesis now produces infinite, clean energy

https://www.thetimes.com/business-money/technology/article/why-the-oc-star-ben-mckenzie-is-leadin...
1•kedmi•44m ago•0 comments

Hamas fighters are stuck in tunnels in Israeli-controlled Gaza

https://www.theaustralian.com.au/world/hundreds-of-hamas-fighters-are-stuck-in-tunnels-in-israeli...
1•asdefghyk•48m ago•1 comments