frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Think of a Number

https://xenaproject.wordpress.com/2025/01/20/think-of-a-number/
30•IdealeZahlen•3d ago

Comments

AnotherGoodName•4h ago
A great example of this is to ask AI to ingest and restate with detailed annotations advanced maths papers. This should be simple but the AI fails at this.

A lot of maths is terse. It can take years to grok a very advanced topic. Eg. The ABC conjecture is supposed to be solved by https://en.wikipedia.org/wiki/Inter-universal_Teichm%C3%BCll... but that theory is tough even for the smartest minds so it's still considered up in the air if it's solved or not, not enough mathematicians grok it yet to have a consensus. It's not disproven as nonsense, the paper appears to make sense. It's just that it's a very advanced topic that takes years to understand.

So as someone wanting to understand such topics you may be tempted to have AI read the paper and give annotations and summaries. You might be tempted to have AI give some numeric examples of formulas.

Guess what happens? COMPLETE AND TOTAL FAILURE. The AI can't do it. Because the paper has no online examples where people have written numeric examples and given annotations there's nothing for the AI to go off. It gives numeric examples with mistakes that don't even match the statement it's meant to be giving an example of. Often it gives up with statements like, "At this point the numeric example fails to solve the solution but you can imagine if it did". You can ask it to try and try again but it just keeps failing. Even simple and well known papers generally don't work unless there's already a simple explanation someone's already posted online that it can regurgitate.

Which is pretty damning right? Reading a paper, giving numeric examples of what the paper states and giving some plain english summaries to the most dense portions should be what a language processing system does best. We're not even asking it to come up with original ideas here. We're asking it to summarise well known mathematical papers. The only time i've seen it have success is if someone's already done such an explanation on mathsoverflow.

jordigh•3h ago
> It's not disproven as nonsense, the paper appears to make sense

Not obviously utter nonsense, but a couple of mathematicians who have studied it have claimed to have found gaps and were unsatisfied with the resolution to those gaps that Mochizuki offered.

It's kind of like, well, LLM output. Has the right shape but upon scrutiny it seems to fall apart. Plausible-looking but probably nonsense.

BlackFingolfin•3h ago
A follow up post is at https://xenaproject.wordpress.com/2025/03/16/think-of-a-numb...
jenny91•2h ago
Mathematics is such a wide field and the questions asked here are ill defined.

If the comment is "the AI founder bros are hyping it up and it's not as good as they claim", I think we all agree that's true. LLMs are good, but exactly how good depends on many subjective points.

If the question is: "can we come up with questions that are easy for some tiny niche set of experts, but basically impossible for an LLM", I think the answer will always be "yes", especially if you can make "niche set of experts" more and more niche every time.

If the question is "will mathematicians be unemployed in a few years", obviously the answer is also "no".

If the question is "can LLMs be used to speed up mathematics research", the answer is "yes and no, depending on what you're doing".

prats226•2m ago
An issue would be as soon as you make questions public, even by letting hosted LLMs predict on them, they are tainted. You can't use them anymore. So would it be a one time test dataset?

Andrej Karpathy's YC AI SUS talk on the future of the industry

https://www.donnamagi.com/articles/karpathy-yc-talk
145•pudiklubi•3h ago•68 comments

The Unreasonable Effectiveness of Fuzzing for Porting Programs

https://rjp.io/blog/2025-06-17-unreasonable-effectiveness-of-fuzzing
100•Bogdanp•4h ago•12 comments

Show HN: Workout.cool – Open-source fitness coaching platform

https://github.com/Snouzy/workout-cool
426•surgomat•7h ago•146 comments

Show HN: I built a tensor library from scratch in C++/CUDA

https://github.com/nirw4nna/dsc
68•nirw4nna•5h ago•6 comments

Writing documentation for AI: best practices

https://docs.kapa.ai/improving/writing-best-practices
74•mooreds•4h ago•20 comments

Homomorphically Encrypting CRDTs

https://jakelazaroff.com/words/homomorphically-encrypted-crdts/
156•jakelazaroff•7h ago•49 comments

"poline" is an enigmatic color palette generator using polar coordinates

https://meodai.github.io/poline/
144•zdw•3d ago•34 comments

My iPhone 8 Refuses to Die: Now It's a Solar-Powered Vision OCR Server

https://terminalbytes.com/iphone-8-solar-powered-vision-ocr-server/
63•hemant6488•4h ago•21 comments

Yes I Will Read Ulysses Yes

https://www.theatlantic.com/magazine/archive/2025/07/zachary-leader-richard-ellmann-james-joyce-review/682907/
35•petethomas•2h ago•31 comments

Terpstra Keyboard

http://terpstrakeyboard.com/web-app/keys.htm
182•xeonmc•9h ago•64 comments

Introduction to the A* Algorithm

https://www.redblobgames.com/pathfinding/a-star/introduction.html
197•auraham•1d ago•71 comments

MiniMax-M1 open-weight, large-scale hybrid-attention reasoning model

https://github.com/MiniMax-AI/MiniMax-M1
290•danboarder•13h ago•67 comments

Attimet (YC F24) – Quant Trading Research Lab – Is Hiring Founding Engineer

https://www.ycombinator.com/companies/attimet/jobs/b1w9pjE-founding-engineer
1•kbanothu•3h ago

Framework Laptop 12 review

https://arstechnica.com/gadgets/2025/06/framework-laptop-12-review-im-excited-to-see-what-the-2nd-generation-looks-like/
153•moelf•5h ago•187 comments

Is There a Half-Life for the Success Rates of AI Agents?

https://www.tobyord.com/writing/half-life
156•EvgeniyZh•9h ago•85 comments

Scrappy - make little apps for you and your friends

https://pontus.granstrom.me/scrappy/
387•8organicbits•15h ago•125 comments

Revisiting Minsky's Society of Mind in 2025

https://suthakamal.substack.com/p/revisiting-minskys-society-of-mind
37•suthakamal•4h ago•10 comments

Locally hosting an internet-connected server

https://mjg59.dreamwidth.org/72095.html
120•pabs3•15h ago•119 comments

I counted all of the yurts in Mongolia using machine learning

https://monroeclinton.com/counting-all-yurts-in-mongolia/
191•furkansahin•12h ago•69 comments

Building agents using streaming SQL queries

https://www.morling.dev/blog/this-ai-agent-should-have-been-sql-query/
78•rmoff•4h ago•7 comments

After millions of years, why are carnivorous plants still so small?

https://www.smithsonianmag.com/articles/carnivorous-plants-have-been-trapping-animals-for-millions-of-years-so-why-have-they-never-grown-larger-180986708/
176•gmays•5d ago•75 comments

Spatializing 6k years of global urbanization from 3700 BC to AD 2000

https://www.nature.com/articles/sdata201634
19•talonx•3d ago•1 comments

Should we design for iffy internet?

https://bytes.zone/posts/should-we-design-for-iffy-internet/
44•surprisetalk•2d ago•22 comments

Spherical CNNs (2018)

https://arxiv.org/abs/1801.10130
8•rkp8000•2d ago•1 comments

The Grug Brained Developer (2022)

https://grugbrain.dev/
982•smartmic•1d ago•478 comments

Real-time action chunking with large models

https://www.pi.website/research/real_time_chunking
53•pr337h4m•1d ago•7 comments

A different take on S-expressions

https://gist.github.com/tearflake/569db7fdc8b363b7d320ebfeef8ab503
26•tearflake•3d ago•17 comments

Reasoning by Superposition: A Perspective on Chain of Continuous Thought

https://arxiv.org/abs/2505.12514
43•danielmorozoff•7h ago•1 comments

Show HN: Free local security checks for AI coding in VSCode, Cursor and Windsurf

21•jaimefjorge•7h ago•10 comments

Game Hacking – Valve Anti-Cheat (VAC)

https://codeneverdies.github.io/posts/gh-2/
61•LorenDB•3h ago•45 comments