OpenAI claiming gold medal standard at IMO 2025

https://github.com/aw31/openai-imo-2025-proofs
8•ocfnash•5h ago

Comments

ocfnash•5h ago
According to post 6/N of this series, they are claiming full marks for problems 1 through 5:

https://x.com/alexwei_/status/1946477742855532918

Davidzheng•4h ago
I posted about one of the twitter threads at https://news.ycombinator.com/item?id=44613840
Davidzheng•4h ago
The proofs superficially look super interesting, especially because they're not in the style of the usual LLM filler babble. It's almost the exact opposite: very efficient use of words, dropping the parts of grammar that don't matter. Reminds me of how people write down proofs in drafts, or how we communicate proofs to peers before writing final versions.
Davidzheng•4h ago
P1 has, in its setup section, basically a very precise summary of the proof, which it fills in later: "So main is: (a) for n>=4, any n-line cover must contain a side-line; inductively reduce to n=3. (b) Analyze n=3 exactly."

I suspect there's some (tree-based?) search + a separate process verifier + a large number of parallel generation sessions. I'm going just from hints in how structured/monotone the generated text is.

A lot of colons, like "So:", "Now:", "Need:", etc.

Davidzheng•4h ago
P2 is geometry. It looks coordinate bashed? Very interesting to see it writing "Good." and "Perfect." after some lines. Very human-like thinking process. It reads like a person talking through the proof orally.
Davidzheng•4h ago
P3: interesting that in the basics section it makes an easy observation but gives no proof sketch, unlike P1/P2 (P1 has a full sketch of the proof idea; P2 says we'll bash). This actually suggests the whole proof is generated one-shot (contrary to my previous comment). I guess it's not doing search in text space (like output a line, search for the next line, etc.). Of course there's probably some final process outputting the proof from parts, so the search could be obfuscated.

Come to think of it, informal proof generation probably can't easily use search? Probably it's doing parallel generation with some information sharing + a global verification process. No real evidence, except that the entire proof is very unstructured even though each line is written with a consistent style.

energy123•4h ago
This is incredible. We know these questions are not in the training data. How can you still say that LLMs aren't reasoning?

Where Did All Those Brave Free Speech Warriors Go?

https://www.techdirt.com/2025/05/19/where-did-all-those-brave-free-speech-warriors-go/
1•the_why_of_y•3m ago•0 comments

Interesting thoughts on the limits of AI in the context of software development

https://www.ufried.com/blog/ai_and_software_development_5/
1•BinaryIgor•4m ago•0 comments

A Look Back at WeChat's PhxSQL and the 'Fastest Majority'

https://www.supasaf.com/blog/general/phxsql
1•supasaf•5m ago•0 comments

New Russian law criminalizes online searches for controversial content

https://www.washingtonpost.com/world/2025/07/17/russia-internet-censorship/
1•voxleone•7m ago•0 comments

DunedinPACNI estimates the longitudinal Pace of Aging from a single brain image

https://www.nature.com/articles/s43587-025-00897-z
1•bookofjoe•9m ago•0 comments

Why Is ReactOS Development So Undervalued?

1•Waraqa•12m ago•1 comments

AI guzzled books without permission. Authors are fighting back

https://www.washingtonpost.com/technology/2025/07/19/ai-books-authors-congress-courts/
2•amirkabbara•14m ago•0 comments

Kimi K2 scored 59% on the aider polyglot coding benchmark

https://twitter.com/paulgauthier/status/1946165321611526229
1•tosh•15m ago•0 comments

Spectrally Tunable Lighting: How LEDs can emulate blackbody emitters

https://enody.lighting/journal/01-spectrally-tunable-lighting/
1•carterpeterson•15m ago•0 comments

'I was floored by the data': Psilocybin shows anti-aging properties

https://www.livescience.com/health/ageing/i-was-floored-by-the-data-psilocybin-shows-anti-aging-properties-in-early-study
2•Bluestein•20m ago•0 comments

Extending Iterated, Spatialized Prisoners Dilemma to Understand Multicellularity

https://lksshw.github.io/
1•ca98am79•23m ago•0 comments

Field Guide to the North American Weigh Station

https://hackaday.com/2025/06/26/field-guide-to-the-north-american-weigh-station/
1•toomuchtodo•24m ago•0 comments

uv 0.8

https://github.com/astral-sh/uv/releases/tag/0.8.0
2•tosh•24m ago•0 comments

Is automating your AI too hard? Let AI automate that too

https://github.com/czlonkowski/n8n-mcp
1•greggh•24m ago•2 comments

Origami Space Planes Could Solve a Major Problem in Orbit

https://gizmodo.com/origami-space-planes-could-solve-a-major-problem-in-orbit-2000629875
1•Bluestein•25m ago•0 comments

Scenarios for solar radiation modification need to include perceptions of risk

https://iopscience.iop.org/article/10.1088/2752-5295/addd42
1•PaulHoule•26m ago•0 comments

Google Backs 10 New Nuclear Reactors for AI, Built by AI. What Could Go Wrong?

https://www.pcmag.com/news/google-backs-10-new-nuclear-reactors-for-ai-will-it-work-this-time
1•Bluestein•27m ago•0 comments

Karen Hao – Empire of AI: Dreams and Nightmares in Sam Altman's OpenAI

https://www.youtube.com/watch?v=NtQCthF2vlY
1•belter•31m ago•1 comments

Death by AI

https://davebarry.substack.com/p/death-by-ai
2•ano-ther•32m ago•0 comments

Elon Musk's Starlink internet works great if hardly anyone uses it

https://www.washingtonpost.com/technology/2025/07/18/starlink-internet-satellite-speed-elon-musk/
2•reaperducer•33m ago•0 comments

Angel vs. Devil Accounting: Reviving a 500-Yr-Old Idea for Modern Mental Health

https://ledgeroflife.blog/angel-vs-devil-accounting-resurrecting-a-500-year-old-idea-for-modern-mental-health/
4•shadowvoxing•38m ago•0 comments

That how to calculate the hours that you worked and revenue amount

https://billr.us/
1•ulicaki8991•39m ago•0 comments

Groq's First Compound AI System

https://groq.com/blog/now-in-preview-groqs-first-compound-ai-system
1•tosh•40m ago•0 comments

Why you should choose HTMX for your next web-based side project (2024)

https://hamy.xyz/blog/2024-02_htmx-for-side-projects
2•kugurerdem•44m ago•2 comments

Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad

https://matharena.ai/imo/
37•hardmaru•44m ago•22 comments

The Epic Battle for AI Talent–With Exploding Offers, Secret Deals and Tears

https://www.wsj.com/tech/ai/meta-ai-recruiting-mark-zuckerberg-sam-altman-140d5861
1•pinewurst•45m ago•1 comments

Ask HN: Looking for UE5 Devs and Artists for My Open Concept – Solvaldr

1•hejhdiss•45m ago•0 comments

How a Florida Pension Fund manager produced 50 years of market-beating returns

https://www.barrons.com/articles/florida-pension-plan-50-years-of-market-beating-returns-b9f78df9
2•hhs•47m ago•0 comments

Kubernetes Observability with OpenTelemetry Helm Charts [A guide I wish I had

https://signoz.io/blog/kubernetes-observability-with-opentelemetry/
2•todsacerdoti•50m ago•1 comments

Japan Uses Drones to Light Up Exit Signs at Concerts and Events

https://thecsrjournal.in/japan-uses-drones-to-light-up-exit-signs-at-concerts-and-events/
2•ksec•51m ago•0 comments