frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

AI just proved Erdos Problem #124

https://www.erdosproblems.com/forum/thread/124#post-1892
122•nl•22h ago
https://twitter.com/vladtenev/status/1994922827208663383

Comments

ares623•19h ago
[flagged]
bogdan•19h ago
Seriously, blame the investors.
wasmainiac•18h ago
Nah, everyone is to blame here in bubble.
dang•1h ago
"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

"Don't be curmudgeonly. Thoughtful criticism is fine, but please don't be rigidly or generically negative."

https://news.ycombinator.com/newsguidelines.html

menaerus•19h ago
This seems to be 2nd in row proof from the same author by using the AI models. First time it was the ChatGPT which wrote the formal Lean proof for Erdos Problem #340.

https://arxiv.org/html/2510.19804v1#Thmtheorem3

> In over a dozen papers, beginning in 1976 and spanning two decades, Paul Erdős repeatedly posed one of his “favourite” conjectures: every finite Sidon set can be extended to a finite perfect difference set. We establish that {1, 2, 4, 8, 13} is a counterexample to this conjecture.

NooneAtAll3•14m ago
that one was vibe-coded

Ai was given step-by-step already found proof, and asked "please rewrite in Lean"

---

here Ai did the proof itself

wasmainiac•18h ago
Ok… has this been verified? I see no publication or at least an announcement on Harmonics webpage. If this is a big deal, you think it would be a big deal, or is this just hype?
singularity2001•16h ago
verified by lean so 99.99% yes
cluckindan•15h ago
Lean verified a proof of a solution to a problem, but was it the same problem as Erdős problem #124?

https://www.erdosproblems.com/forum/thread/124#post-1899

wasmainiac•8h ago
> My summary is that Aristotle solved "a" version of this problem (indeed, with an olympiad-style proof), but not "the" version.

> I agree that the [BEGL96] problem is still open (for now!), and your plan to keep this problem open by changing the statement is reasonable. Alternatively, one could add another problem and link them. I have no preference. — BorisAlexeev

There we go, so there is hype to some degree.

aaomidi•8h ago
Is there some good literature to read about lean? First time I’m hearing about it and it seems pretty cool.
adt•18h ago
Related, independent, and verified:

GPT-5 solved Erdős problem #848 (combinatorial number theory):

https://cdn.openai.com/pdf/4a25f921-e4e0-479a-9b38-5367b47e8...

https://lifearchitect.ai/asi/

ares623•17h ago
FTL? Seriously?
demirbey05•16h ago
I read how GPT-5 contributed to proof. It is not fully solved by GPT-5 instead assisted. For more look here https://www.math.columbia.edu/~msawhney/Problem_848.pdf
esperent•18h ago
More interesting discussion than on Twitter here:

https://www.erdosproblems.com/forum/thread/124#post-1892

dang•50m ago
Ok, we've changed the link to that from https://twitter.com/vladtenev/status/1994922827208663383, but I've included the latter in the toptext. Thanks!
demirbey05•17h ago
This is response from mathematician: "This is quite something, congratulations to Boris and Aristotle!

On one hand, as the nice sketch provided below by tsaf confirms, the final proof is quite simple and elementary - indeed, if one was given this problem in a maths competition (so therefore expected a short simple solution existed) I'd guess that something like the below would be produced. On the other hand, if something like this worked, then surely the combined talents of Burr, Erdős, Graham, and Li would have spotted it.

Normally, this would make me suspicious of this short proof, in that there is overlooked subtlety. But (a) I can't see any and (b) the proof has been formalised in Lean, so clearly it just works!

Perhaps this shows what the real issue in the [BEGL96] conjecture is - namely the removal of 1 and the addition of the necessary gcd condition. (And perhaps at least some subset of the authors were aware of this argument for the easier version allowing 1, but this was overlooked later by Erdős in [Er97] and [Er97e], although if they were aware then one would hope they'd have included this in the paper as a remark.)

At the moment I'm minded to keep this as open, and add the gcd condition in the main statement, and note in the remarks that the easier (?) version allowing 1 and omitting the gcd condition, which was also asked independently by Erdős, has been solved."

The commentator is saying: "I can't believe this famous problem was solved so easily. I would have thought it was a fake proof, but the computer verified it. It turns out the solution works because it addresses a slightly different set of constraints (regarding the number 1) than what Erdős originally struggled with. (Generated by Gemini)

aaomidi•8h ago
I think that person is the owner of the website discussing it too.

He was cited https://the-decoder.com/leading-openai-researcher-announced-...

Where a while back OpenAI made a misleading claim about solving some of these problems.

dang•52m ago
> Generated by Gemini

Please don't post generated comments here. We want HN to be a place for humans writing in their own voice.

ComplexSystems•16h ago
Are you kidding? This is an incredible result. Stuff like this is the most important stuff happening in AI right now. Automated theorem proving? It's not too far to say the entire singular point of the technology was to get us to this.
ares623•16h ago
This is me being snarky and ignorant, but if it solved one problem and it is automated what’s stopping it from solving all the others? That’s what’s ultimately being sold by the tweet right.
aeve890•13h ago
>This is me being snarky and ignorant, but if it solved one problem and it is automated what’s stopping it from solving all the others?

Yeah that's the crux of the matter. How do AI did it? Using already existing math. If we need new math to prove Collatz, Goldbach or Riemman, LLMs are simply SOL. That's what's missing and hype boys always avoid to mention.

ComplexSystems•8h ago
Ultimately the main thing that will stop it from solving literally "all the others" are things like the impossibility of solving the halting problem, considerations like P ≠ NP, etc. But as we have just seen, despite these impossibility theorems, AI systems are still able to make substantive progress on solving important open real-world problems.
BanditDefender•15h ago
I think it is way too far to say that!

We've had automated theorem proving since the 60s. What we need is automated theorem discovery. Erdős discovered these theorems even if he wasn't really able to prove them. Euler and Gauss discovered a ton of stuff they couldn't prove. It is weird that nobody considers this to be intelligence. Instead intelligence is a little game AI plays with Lean.

AI researchers keep trying to reduce intelligence into something tiny and approachable, like automated theorem proving. It's easy: you write the theorem you want proven and hope you get a proof. It works or it doesn't. Nice and benchmarkable.

Automated axiom creation seems a lot harder. How is an LLM supposed to know that "between any two points there is a line" formalizes an important property of physical space? Or how to suggest an alternative to Turing machines / lambda calculus that expresses the same underlying idea?

kvemkon•13h ago
> What we need is automated theorem discovery.

I've been thinking mathematicians have fun doing math, making discoveries, crafting proofs.

Does Tour de France & Co. make no sense since small, lightweight and powerful e-bicycles appeared?

Using computer as a helper like bicycles is one thing, using LLMs seems more like e-bicycle and is something another.

ComplexSystems•8h ago
> We've had automated theorem proving since the 60s.

By that logic, we've had LLMs since the 60s!

> What we need is automated theorem discovery.

I don't see any reason you couldn't train a model to do this. You'd have to focus it on generating follow-up questions to ask after reading a corpus of literature, playing around with some toy examples in Python and making a conjecture out of it. This seems much easier than training it to actually complete an entire proof.

> Erdős discovered these theorems even if he wasn't really able to prove them. Euler and Gauss discovered a ton of stuff they couldn't prove. It is weird that nobody considers this to be intelligence.

Who says they don't? I wouldn't be surprised if HarmonicMath, DeepMind, etc have also thought about this kind of thing.

> Automated axiom creation seems a lot harder. How is an LLM supposed to know that "between any two points there is a line" formalizes an important property of physical space?

That's a good question! It would be interesting to see if this is an emergent property of multimodal LLMs trained specifically on this kind of thing. You would need mathematical reasoning, visual information and language encoded into some shared embedding space where similar things are mapped right next to each other geometrically.

AdieuToLogic•14m ago
>> We've had automated theorem proving since the 60s.

> By that logic, we've had LLMs since the 60s!

From a bit earlier[0], actually:

  Progressing to the 1950s and 60s

  We saw the development of the first language models.
Were those "large"? I'm sure at the time they were thought to be so.

0 - https://ai-researchstudies.com/history-of-large-language-mod...

dang•1h ago
(We detached this subthread from https://news.ycombinator.com/item?id=46094763.)
BanditDefender•16h ago
"Mathematicial superintelligence" is so obnoxious. Why exactly do they think it is called an Erdős problem when Erdős didn't find the solution? Because Erdős discovered the real mathematics: the conjecture!

These people treat math research as if it is a math homework assignment. There needs to be an honest discussion about what the LLM is doing here. When you bang your head against a math problem you blindly try a bunch of dumb ideas that don't actually work. It wastes a lot of paper. The LLM automates a lot of that.

It is actually pretty cool that modern AI can help speed this up and waste less paper. It is very similar to how classical AI (Symbolica) sped up math research and wasted less paper. But we need to have an honest discussion about how we are using the computer as a tool. Instead malicious idiots like Vlad Tenev are making confident predictions about mathematical superintelligence. So depressing.

consumer451•15h ago
For reference, this is from: https://harmonic.fun/

related article:

> Is Math the Path to Chatbots That Don’t Make Stuff Up?

https://www.nytimes.com/2024/09/23/technology/ai-chatbots-ch...

mmmmbbbhb•9h ago
Meanwhile I have to use ai just to understand the problem statement.

A Love Letter to FreeBSD

https://www.tara.sh/posts/2025/2025-11-25_freebsd_letter/
208•rbanffy•5h ago•119 comments

Advent of Sysadmin 2025

https://sadservers.com/advent
60•lazyant•2h ago•14 comments

Algorithms for Optimization [pdf]

https://algorithmsbook.com/optimization/files/optimization.pdf
113•Anon84•4h ago•7 comments

Writing a good Claude.md

https://www.humanlayer.dev/blog/writing-a-good-claude-md
359•objcts•9h ago•113 comments

Advent of Code 2025

https://adventofcode.com/2025/about
759•vismit2000•14h ago•261 comments

Windows drive letters are not limited to A-Z

https://www.ryanliptak.com/blog/windows-drive-letters-are-not-limited-to-a-z/
390•LorenDB•13h ago•194 comments

Bricklink suspends Marketplace operations in 35 countries

https://jaysbrickblog.com/news/bricklink-suspends-marketplace-operations-in-35-countries/
77•makeitdouble•4h ago•31 comments

Migrating Dillo from GitHub

https://dillo-browser.org/news/migration-from-github/
290•todsacerdoti•13h ago•168 comments

Seeing a Molecule's Quantum Shadow

https://physics.aps.org/articles/v18/s149
7•lc0_stein•6d ago•0 comments

LLVM-MOS – Clang LLVM fork targeting the 6502

https://llvm-mos.org/wiki/Welcome
115•jdmoreira•10h ago•41 comments

GitHub to Codeberg: my experience

https://eldred.fr/blog/forge-migration/
167•todsacerdoti•11h ago•68 comments

ESA Sentinel-1D delivers first high-resolution images

https://www.esa.int/Applications/Observing_the_Earth/Copernicus/Sentinel-1/Sentinel-1D_delivers_f...
86•giuliomagnifico•9h ago•26 comments

Program-of-Thought Prompting Outperforms Chain-of-Thought by 15% (2022)

https://arxiv.org/abs/2211.12588
84•mkagenius•8h ago•24 comments

Is America's jobs market nearing a cliff?

https://www.economist.com/finance-and-economics/2025/11/30/is-americas-jobs-market-nearing-a-cliff
63•harambae•2h ago•98 comments

CachyOS: Fast and Customizable Linux Distribution

https://cachyos.org/
267•doener•16h ago•242 comments

ETH-Zurich: Digital Design and Computer Architecture; 227-0003-10L, Spring, 2025

https://safari.ethz.ch/ddca/spring2025/doku.php?id=start
120•__rito__•9h ago•17 comments

How to run phones while being struck by suicide drones

https://nasa.cx/hn/posts/how-to-run-hundreds-of-phones-while-being-struck-by-suicide-drones/
45•nasaok•6h ago•13 comments

AI just proved Erdos Problem #124

https://www.erdosproblems.com/forum/thread/124#post-1892
122•nl•22h ago•31 comments

Show HN: Fixing Google Nano Banana Pixel Art with Rust

https://github.com/Hugo-Dz/spritefusion-pixel-snapper
136•HugoDz•4d ago•22 comments

Show HN: Real-time system that tracks how news spreads across 200k websites

https://yandori.io/news-flow/
233•antiochIst•5d ago•56 comments

Mike Gordon and hardware verification (2023)

https://lawrencecpaulson.github.io/2023/01/04/Hardware_Verification.html
8•sebg•6d ago•0 comments

Stereo Images of Giant Galaxies

https://www.bbc.com/future/article/20251121-sir-brian-mays-stereo-vision-of-galaxies
13•benbreen•5d ago•4 comments

The Thinking Game Film – Google DeepMind documentary

https://thinkinggamefilm.com
155•ChrisArchitect•11h ago•108 comments

“Boobs check” – Technique to verify if sites behind CDN are hosted in Iran

https://twitter.com/hkashfi/status/1995109785679573167
229•defly•6h ago•77 comments

In Re: 23andMe, Inc. Customer Data Security Breach Litigation

https://www.23andmedatasettlement.com/
51•toomuchtodo•1h ago•28 comments

Paul Hegarty's updated CS193p SwiftUI course released by Stanford

https://cs193p.stanford.edu/
154•yehiaabdelm•5d ago•35 comments

RetailReady (YC W24) Is Hiring Associate Product Manager

https://www.ycombinator.com/companies/retailready/jobs/KPKDu3D-associate-product-manager
1•sarah74•10h ago

There is No Quintic Formula [video]

https://www.youtube.com/watch?v=9HIy5dJE-zQ
61•DamnInteresting•9h ago•24 comments

Malware embedded into audio driver is silently recording from system mic

https://twitter.com/Officialwhyte22/status/1995024999934001602
34•CGMthrowaway•2h ago•9 comments

Langjam Gamejam: Build a programming language then make a game with it

https://langjamgamejam.com/
62•birdculture•11h ago•40 comments