frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

OpenAI researcher announced GPT-5 math breakthrough that never happened

https://the-decoder.com/leading-openai-researcher-announced-a-gpt-5-math-breakthrough-that-never-happened/
79•Topfi•1h ago

Comments

amelius•37m ago
> Summary (from the article)

* OpenAI researchers claimed or suggested that GPT-5 had solved unsolved math problems, but in reality, the model only found known results that were unfamiliar to the operator of erdosproblems.com.

* Mathematician Thomas Bloom and Deepmind CEO Demis Hassabis criticized the announcement as misleading, leading the researchers to retract or amend their original claims.

* According to mathematician Terence Tao, AI models like GPT-5 are currently most helpful for speeding up basic research tasks such as literature review, rather than independently solving complex mathematical problems.

jgalt212•35m ago
After the circular financing schemes involving hundreds of billions of dollars were uncovered, nothing I read about the AI business and its artificial hype machine surprises me anymore.
bbor•34m ago
This is just tit-for-tat clickbait. The researcher’s wording was a bit unclear for sure, but far from incorrect.
resoluteteeth•28m ago
I disagree. There is no way to interpret "GPT-5 just found solutions to 10 (!) previously unsolved Erdos problems" as saying something other than GPT-5 having solved them.

If it just found existing solutions then they obviously weren't "previously unsolved" so the tweet is wrong.

He clearly misunderstood the situation and jumped to the conclusion that GPT-5 had actually solved the problems because that's what he wanted to believe.

That said, the misunderstanding is understandable because the tweet he was responding to said they had been listed as "open", but solving unsolved erdos problems by itself would be such a big deal that he probably should have double checked it.

ayhanfuat•28m ago
How is “GPT-5 found solutions to 10 previously unsolved Erdös problems” far from incorrect?
andrewstuart•30m ago
Humans hallucinating about AI.
MattGaiser•2m ago
Humans "hallucinate" in the AI way constantly, which is why I don't see them as a barrier to LLMs replacing humans in many contexts. It really isn't unusual for a human to make stuff up or be unaware of stuff.
random9749832•2m ago
Best case: Hallucination

Worst case (more probable): Lying

mentalgear•27m ago
Another instance of openAI manipulating results to prolong their unsustainable circular hype bubble.

The inevitable collapse could be even more devastating than the 2008 financial crisis.

All while so vast resources are being wasted on non-verifiable gen AI slob, while real approaches (neuro-symbolic like DeepMind's AlphaFold) are mostly ignored financially because they don't generate the quick stock market increases that hype does.

the_duke•16m ago
People keep spouting this, but I don't see how the AI bubble bursting would be all that devastating.

2008 was a systemic breakdown rippling through the foundations of the financial system.

It would lead to a market crash (80% of gains this year were big tech/AI) and likely a full recession in the US, but nothing nearly as dramatic as a global systemic crisis.

In contrast to the dot com bubble, the huge AI spending is also concentrated on relatively few companies, many with deep pockets from other revenue sources (Google, Meta, Microsoft, Oracle), and the others are mostly private companies that won't have massive impact on the stock market.

A sudden stop in AI craze would be hard for hardware companies and a few big AI only startups , but the financial fallout would be much more contained than either dot com or 2008.

Topfi•7m ago
Isn't the dot com bubble a far better proxy? Notably, todays spending is both higher and more concentrated in a few companies that a large part of the population has exposure to (most dot com companies weren't publicly traded and far smaller vs MSFT, Alphabet, Meta, Oracle, NVDA making up most investment today) by way of pension funds, ETFs, etc.?
kif•26m ago
This honestly doesn’t surprise me. We have reached a point where it’s becoming clearer and clearer that AGI is nowhere to be seen, whereas advances in LLM ability to ‘reason’ have slowed down to (almost?) a halt.
dawnerd•8m ago
But if you ask an AI hype person they’ll say we’re almost there we just need a bit more gigawatts of compute!
vbezhenar•7m ago
In my book, chat-based AGI has been reached years ago, when I couldn't reliably distinguish computer from human.

Solving problems that humanity couldn't solve is super-AGI or something like that. It's not there indeed.

steveBK123•2m ago
Hence the pivot into ads, shop-in-chat and umm.. adult content.
Analemma_•10m ago
“AGI achieved internally”

Another case of culture flowing from the top I guess.

strangescript•6m ago
This entire thing has been pretty disingenuous on both sides of the fence. All the anti-AI (or anti OpenAI) people are doing victory laps, but what GPT-5 Pro did is still very valuable.

1) What good is your open problem set if really its a trivial "google search" away from being solved. Why are they not catching any blame here?

2) These answers still weren't perfectly laid out for the most part. GPT-5 was still doing some cognitive lifting to piece it together.

If a human would have done this by hand it would have made news and instead the narrative would have been inverted to ask serious questions about the validity of some these style problem sets and/or ask the question how many other solutions are out there that just need pieced together from pre-existing research.

But, you know, AI Bad.

nurettin•1m ago
AI great, but AI not creative, yet.
random9749832•3m ago
You are telling me a language model trained on Reddit can't solve novel problems? Shocking.

Teen sues to destroy the nudify app that left her in constant fear

https://arstechnica.com/tech-policy/2025/10/teen-haunted-by-fake-nudes-sues-to-kill-nudify-app-bl...
1•ndsipa_pomu•18s ago•0 comments

Pathmetrics

https://www.pathmetrics.io/
1•Hamid213•1m ago•0 comments

Feed me up, Scotty – custom RSS feed generation using CSS selectors

https://feed-me-up-scotty.vincenttunru.com/
1•diymaker•4m ago•0 comments

Cyborgs vs. rooms, two visions for the future of computing

https://interconnected.org/home/2025/10/13/dichotomy
1•surprisetalk•7m ago•0 comments

Geometric Kernel for Digital Manufacturing

https://quaoar.su/blog/page/maas-geometric-kernel
1•surprisetalk•7m ago•0 comments

Warning: Gmail client Show Original can omit lines of the original

1•chrisjj•8m ago•0 comments

A statistic is as a statistic does

https://surfingcomplexity.blog/2025/09/28/a-statistic-is-as-a-statistic-does/
1•mooreds•9m ago•0 comments

Warning: Gmail client can misrepresent bounce message content

1•chrisjj•9m ago•0 comments

I invited strangers to message me through a receipt printer

https://aschmelyun.com/blog/i-invited-strangers-to-message-me-through-a-receipt-printer/
1•chrisdemarco•10m ago•0 comments

Extrapolating Quantum Factoring

https://www.johndcook.com/blog/2025/09/28/extrapolating-quantum-factoring/
1•cbzbc•10m ago•0 comments

Prefer TF Module Composition over Inheritance

https://newsletter.masterpoint.io/p/prefer-module-composition-over-inheritance
1•mooreds•11m ago•0 comments

'How Life Works' by Philip Ball: Beware metaphorical systems thinking in biology

https://chillphysicsenjoyer.substack.com/p/how-life-works-by-philip-ball
1•crescit_eundo•12m ago•0 comments

I Tried to Toughen Up My Son. Things Didn't Go as Planned

https://www.nytimes.com/2025/10/19/magazine/national-parks-badlands-roosevelt-south-dakota.html
1•mooreds•12m ago•0 comments

The Planets, Personified

https://worldhistory.substack.com/p/the-planets-personified
1•crescit_eundo•13m ago•0 comments

Update on Plans for Privacy Sandbox Technologies (Google Chrome)

https://privacysandbox.com/news/update-on-plans-for-privacy-sandbox-technologies/
1•DavideNL•15m ago•1 comments

The MCP Server?

https://github.com/Dicklesworthstone/ultimate_mcp_server
2•gtsnexp•16m ago•0 comments

Deck Dialogues: How Alex Transformed His UConsole into a Botnet Simulator

https://uconsole.net/deck-dialogues-how-alex-transformed-his-uconsole-into-a-botnet-simulator/
1•tiniuclx•20m ago•0 comments

The macOS LC_COLLATE hunt: Or why does sort order differently on macOS and Linux

https://blog.zhimingwang.org/macos-lc_collate-hunt
1•g0xA52A2A•22m ago•0 comments

Single-Step Electrochemical Battery Recycling

https://advanced.onlinelibrary.wiley.com/doi/10.1002/adfm.202511009
1•PaulHoule•22m ago•0 comments

Jason Wei on 3 Key Ideas in AI in 2025 [video]

https://www.youtube.com/watch?v=b6Doq2fz81U
1•simonpure•26m ago•0 comments

Is MCP authentication that complicated?

https://blog.helix.ml/p/is-mcp-authentication-that-complicated
1•deforciant•29m ago•0 comments

Ask HN: Will Biotech Have Its "Arduino Moment"?

1•Gooblebrai•31m ago•0 comments

Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy

https://github.com/sql-hkr/syna
1•sql-hkr•38m ago•0 comments

Fixing AWS Architecture Diagrams: AI Document Processing

https://www.ilograph.com/blog/posts/fixing-aws-diagrams-ai-document-processing/
1•billyp-rva•40m ago•0 comments

The Beauty of Batteries

https://worksinprogress.co/issue/the-beauty-of-batteries/
2•HR01•43m ago•0 comments

A Tower on Billionaires' Row Is Full of Cracks. Who's to Blame?

https://www.nytimes.com/2025/10/19/nyregion/432-park-avenue-condo-tower.html
5•danso•47m ago•2 comments

Finland proposes €200B Russian asset transfer to Ukraine

https://euromaidanpress.com/2025/10/18/finland-proposes-e200-billion-russian-asset-transfer-to-uk...
4•saubeidl•54m ago•0 comments

Ask HN: Why put a HAProxy in front of Nginx?

2•rvitorper•55m ago•1 comments

Anthropic's Jack Clark is drawing White House ire

https://www.wsj.com/tech/ai/the-fight-over-whose-ai-monster-is-scariest-41a43193
4•johanam•1h ago•1 comments

Creating an Igcse Pseudocode Interpreter

https://ezntek.com/posts/creating-an-igcse-pseudocode-interpreter-pt-2-20250913t1349/
1•runxiyu•1h ago•0 comments