> Subsequent to this solve, we finished developing our general scaffold for testing models on FrontierMath: Open Problems. In this scaffold, several other models were able to solve the problem as well: Opus 4.6 (max), Gemini 3.1 Pro, and GPT-5.4 (xhigh).
Interesting. What's that “scaffold”? A sort of unit test framework for proofs?
inkysigma•9m ago
I think in this context, scaffolds are generally the harness that surrounds the actual model. For example, any tools, ways to lay out tasks, or auto-critiquing methods.
I think there's quite a bit of variance in model performance depending on the scaffold so comparisons are always a bit murky.
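To make "harness around the model" concrete, here's a minimal sketch of the loop such a scaffold typically runs: lay out the task, let the model attempt it, run a tool check, auto-critique, and retry. Everything here is hypothetical — `call_model`, `run_tool`, and `scaffold` are illustrative stand-ins, not any lab's actual implementation; a real scaffold would call an LLM API and real verification tools.

```python
def call_model(prompt: str) -> str:
    # Stub standing in for a real LLM API call.
    if "critique" in prompt:
        return "OK" if "proof" in prompt else "REVISE"
    return "proof of the claim"

def run_tool(name: str, arg: str) -> str:
    # Tool layer: e.g. a computer-algebra or proof-checker the model can invoke.
    tools = {"verify_identity": lambda s: "verified"}
    return tools[name](arg)

def scaffold(task: str, max_rounds: int = 3) -> str:
    """Harness around the model: task layout, tool use, auto-critique, retry."""
    attempt = ""
    for _ in range(max_rounds):
        # 1. Lay out the task (plus any previous attempt) for the model.
        attempt = call_model(f"Task: {task}\nPrevious attempt: {attempt}")
        # 2. Check the attempt with an external tool.
        check = run_tool("verify_identity", attempt)
        # 3. Auto-critique: a second model pass judges the attempt.
        verdict = call_model(f"critique: {attempt} ({check})")
        if verdict == "OK":
            return attempt
    return attempt
```

This also illustrates why cross-model comparisons get murky: the retry budget, the tools exposed, and the critique prompt are all scaffold choices that can swing scores independently of the underlying model.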
karmasimida•8m ago
There's no denying at this point that AI can produce something novel, and models will be doing more of this going forward.
osti•6m ago
Seems like the high-compute parallel-thinking models weren't even needed; both the normal 5.4 and Gemini 3.1 Pro solved it. Somehow Gemini 3 Deep Think couldn't solve it.
renewiltord•6m ago
Fantastic news! That means with the right support tooling, existing models are already capable of solving novel mathematics. There's probably a lot of good mathematics out there that we're going to make progress on.
6thbit•43m ago