
Show HN: GPT-Erdos – the results of GPT 5.2 Pro on the Erdos problems

https://www.ocf.berkeley.edu/~neel/erdos.html
1•nsomani•1h ago
Hi HN, it seemed like there was broad interest in the previous Erdos problem that GPT 5.2 Pro solved: https://news.ycombinator.com/item?id=46664631

I recruited a team of smart undergraduates to construct a dataset of ChatGPT responses to every open Erdos problem and verify the output.

They found:

- 3 problems with new proofs (though in 2 cases, historical partial results were found that could be extended to solve the same problem)

- 4 problems where 5.2 Pro or Deep Research found an exact solution in the prior literature that hadn't been documented

- 3 problems where 5.2 Pro or Deep Research was able to strengthen a prior result in the literature

- 3 problems where typos were identified in the problem statement

The most common failure case is that 5.2 Pro solves the problem as literally stated while missing a constraint that professional mathematicians understand to be implicit. For example, the problem may say "integers" when only positive integers are meant.

Happy to answer any questions about the dataset!

Vinod Khosla publicly disavows Keith Rabois' comments on ICE shooting

https://techcrunch.com/2026/01/26/vinod-khosla-publicly-disavows-keith-rabois-comments-on-ice-sho...
1•SilverElfin•1m ago•0 comments

Quack-Cluster: A Serverless Distributed SQL Query Engine with DuckDB and Ray

https://github.com/kristianaryanto/Quack-Cluster
1•tanelpoder•2m ago•0 comments

Palantir Defends Work with ICE to Staff Following Killing of Alex Pretti

https://www.wired.com/story/palantir-ice-dhs-alex-pretti-killing-workers-slack-minneapolis/
1•nickthegreek•2m ago•0 comments

Show HN: FSA Savings Calculator (see how much pre-tax saves you)

https://prewallet.lovable.app/fsa-calculator
1•nemath•4m ago•0 comments

One week and $608 later – Skyscraper's launch into the big Bluesky

https://blog.cameron.software/2026/01/one-week-and-608-later-skyscrapers-launch-into-the-big-blue...
1•CameronBanga•5m ago•0 comments

Show HN: Protogen – An Autopoietic Autonomous World Model

https://github.com/jzkool/Aetherius-sGiftsToHumanity/blob/main/Architectural%20Software/Protogen_...
1•hiddenarchitect•5m ago•0 comments

Covid's long shadow looms over a new generation of college students

https://www.sfgate.com/bayarea/article/covid-cohort-college-students-21309223.php
2•pseudolus•5m ago•0 comments

Spartakiada – Mass gymnastics event, held in Prague, Czech Republic

https://old.reddit.com/r/Damnthatsinteresting/comments/s5mjic/spartakiada_mass_gymnastics_event_h...
1•vinnyglennon•10m ago•0 comments

Swift-Sass – Embed Dart Sass Compiler in Swift with Custom Importers, Functions

https://github.com/johnfairh/swift-sass
1•TheWiggles•11m ago•0 comments

Shooting Stars

https://dumbideas.xyz/posts/shooting-stars/
2•omegastick•12m ago•0 comments

Wyoming Goes from Sub-Zero Temperatures to Hurricane Force Winds in 24 Hours

https://cowboystatedaily.com/2026/01/26/wyomings-gusty-monday-a-result-of-chinook-winds-rushing-i...
3•Bender•13m ago•0 comments

IDE-SHEPHERD: Your shield against threat actors lurking in your IDE

https://securitylabs.datadoghq.com/articles/ide-shepherd-release-article/
1•tanelpoder•13m ago•1 comment

Cody Woman One of America's First Female CIA Agents, Pioneered Covert Operations

https://cowboystatedaily.com/2026/01/26/cody-woman-pioneered-cia-covert-operations/
3•Bender•14m ago•0 comments

Multiple vulnerabilities in React Server Components (CVE-2026-23864)

https://www.cve.org/CVERecord?id=CVE-2026-23864
1•nthypes•15m ago•1 comment

OpenAI spills technical details about how its AI coding agent works

https://arstechnica.com/ai/2026/01/openai-spills-technical-details-about-how-its-ai-coding-agent-...
1•Bender•15m ago•0 comments

Building a Personal CTO Operating System with Claude Code

https://obie.medium.com/building-a-personal-cto-operating-system-with-claude-code-b3fb9c4933c7
1•sdoering•17m ago•0 comments

The Engineer who invented the Mars Rover Suspension in his garage [video]

https://www.youtube.com/watch?v=QKSPk_0N4Jc
6•UltraSane•18m ago•0 comments

Summarize – CLI and Chrome Side Panel for Fast Summaries

https://summarize.sh/
1•duck•19m ago•0 comments

ChronDB: Transforming a Clojure Database into a Polyglot Library with GraalVM N

https://avelino.run/chrondb-polyglot-ffi-clojure-graalvm-native-image/
1•todsacerdoti•20m ago•0 comments

Infamous Gang of 40 Leader Banned from Wikipedia

https://www.neutralpov.com/p/infamous-gang-of-40-leader-banned
5•shykes•20m ago•2 comments

K-Surfaces: Bézier-Splines Interpolating at Gaussian Curvature Extrema

https://dl.acm.org/doi/epdf/10.1145/3618383
3•E-Reverance•21m ago•0 comments

China hacked Downing Street phones for years

https://www.telegraph.co.uk/news/2026/01/26/china-hacked-downing-street-phones-for-years/
5•cwwc•22m ago•0 comments

In humble defense of the .zip TLD

https://luke.zip/posts/zip-defense/
1•birdculture•22m ago•1 comment

Show HN: Open-source tool for finding VPNs in your traffic

https://github.com/TLop503/ipcheq2
1•tlop•23m ago•0 comments

TikTok disallows DMs with the word "Epstein"

https://twitter.com/krassenstein/status/2015911471507530219
16•crishoj•25m ago•0 comments

Deep Dive: Why Easy Copy Is the Evolution of Clipboard Management

https://magasine.substack.com/p/deep-dive-por-que-o-easy-copy-e-a
1•magasineHN•27m ago•0 comments

Escape Tsunami for Brainrots – Unblocked Free Game, Roblox Game Guide, Codes

https://escapetsunamiforbrainrots.pro/
1•mumuchen•27m ago•0 comments

Ask HN: How do you do multi-agent workflows with web apps?

1•ativzzz•28m ago•0 comments

Open Code Review – Multi-agent code review (Local First but CI-ready)

https://github.com/spencermarx/open-code-review
1•mrxdev•32m ago•1 comment

Is the Internet Hijacking Our Ambition?

https://calnewport.com/is-the-internet-hijacking-our-ambition/
1•zdw•33m ago•0 comments