I recruited a team of smart undergraduates to construct a dataset of ChatGPT responses to every open Erdős problem and to verify the outputs.
They found:
- 3 problems with new proofs (though in 2 cases, historical partial results were found that could be extended to solve the same problem)
- 4 problems where 5.2 Pro or Deep Research found an exact solution in the prior literature that hadn't been documented
- 3 problems where 5.2 Pro or Deep Research were able to strengthen a prior result in the literature
- 3 problems where typos were identified in the problem statements
The most common failure case is that 5.2 Pro solves the problem as stated, but professional mathematicians understand there to be an implicit constraint. For example, the problem may say integers when only positive integers are intended.
Happy to answer any questions about the dataset!
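For concreteness, here's a rough sketch of how each entry could be represented. This is illustrative only, not the exact schema; the field names and `Outcome` labels below just mirror the categories listed above.

```python
from dataclasses import dataclass
from enum import Enum, auto

class Outcome(Enum):
    # Illustrative categories mirroring the findings above
    NEW_PROOF = auto()                   # new proof found by the model
    SOLVED_IN_PRIOR_LITERATURE = auto()  # exact solution located in the prior literature
    STRENGTHENED_PRIOR_RESULT = auto()   # prior result in the literature strengthened
    TYPO_IN_STATEMENT = auto()           # typo identified in the problem statement
    SOLVED_AS_STATED_ONLY = auto()       # solves the literal statement, misses an implicit constraint
    UNSOLVED = auto()

@dataclass
class Entry:
    problem_id: str      # identifier of the open Erdős problem
    model: str           # e.g. "5.2 Pro" or "Deep Research"
    response: str        # full model output
    outcome: Outcome     # classification after verification
    verifier_notes: str  # notes from the undergraduate who checked it
```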