frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Another experiment with an Erdos problem and LLMs

1•ilitirit•1h ago
Background: I am a coder, not a mathematician, but I was quite entertained by this story:

https://news.ycombinator.com/item?id=47903126

I wondered how far I could get by just choosing a random open problem and throwing it at LLMs.

Disclosure: I have no idea what the problem even refers to, let alone whether or not the output is even remotely correct. My interest is purely for testing capabilities of various models, curiousity, and entertainment.

Problem: https://www.erdosproblems.com/691

  Given A\subseteq \mathbb{N} let M_A=\{ n \geq 1 : a\mid n\textrm{ for some }a\in A\} be the set of multiples of A. 
  Find a necessary and sufficient condition on A for M_A to have density 1.
My approach: I used DeepSeek in Expert mode, using the same prompt as in the linked HN submission. It thought for a very long time, but I was doing other things in the background so I didn't really time it. I pressed "Continue" twice over the space of maybe 60mins. The output says it thought for about 46mins.

Once it generated a proof, I asked Opus 4.7 to review it, and then entered the review into DeepSeek which made edits, corrections and refinements. This back-and-forth continued till Opus 4.7 was reasonably happy. At that point, I called in Gemini 3.1 Pro Preview, which raised issues which Opus missed. Opus acknowledged the feedback, and then I placed its feedback into Deepseek for a final round. Essentially, what Opus says Deepseek generated was a "clean exposition of a D[avenport]-E[rdos] corollary", not a new result. In all likelihood this result may already be known (Deepseek was not allowed to use the internet for this phase), or even wrong.

In "simple" terms:

  The argument actually proves a stronger fact for every set \( A \) of natural numbers:  
  The upper density of the set \( M_A \) equals the largest possible lower density you can get from finite subsets of \( A \), and that also equals the lower density of \( M_A \).
  When the upper density is 1, it forces the lower density to also be 1, so the natural (ordinary) density exists and equals 1 automatically, without needing any extra conditions.
  The only non-basic part of the proof is the Davenport–Erdős theorem; everything else is simple.
In any case, these were my takeaways:

- These new models seem to be surprisingly capable especially when used to in conjunction with each other, even with fairly simple prompts

- I am quite impressed by Deepseek. I'm going to review its coding ability, and may even switch completely from Anthropic

- This was a genuinely interesting exercise, even if I have no idea if any of it is correct or useful

Some other observations:

- Opus was really fast at reviewing Deepseek's output. Literally seconds

- Gemini had trouble figuring out what "Erdos 691" referred to

- The free version of ChatGPT of generated mostly useless output. I didn't include it.

Chat links below:

https://chat.deepseek.com/share/hpguvrhcxn226bi3hn

https://claude.ai/share/4f3ccad1-d862-4e37-8333-8a1ebd84b38f

https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

Google's best practices document for designing AI products

https://pair.withgoogle.com/guidebook/
1•dotancohen•33s ago•0 comments

Excited Delirium[audio]

https://thisiscriminal.com/episode-355-excited-delirium-3-6-2026/
1•muddi900•2m ago•0 comments

North Korean IT workers are stealing remote jobs: Americans are helping them

https://fortune.com/2026/04/25/north-korean-it-worker-scheme-american-faciliators/
1•napolux•3m ago•1 comments

New Bible TUI App Releases v1.0.0

https://github.com/DeLsonJabberwo/bible-tui
1•delsonjabberwo•3m ago•0 comments

Agent Harness Engineering

https://addyosmani.com/blog/agent-harness-engineering/
1•kiyanwang•5m ago•0 comments

The Normal Work of Creating Reliability

https://surfingcomplexity.blog/2026/04/26/the-normal-work-of-creating-reliability/
1•azhenley•6m ago•0 comments

Everything that went wrong with Claude

https://clawd.rip/
1•aratahikaru5•11m ago•0 comments

How to hire people who are better than you

https://longform.asmartbear.com/hire-better-than-you/
1•kiyanwang•13m ago•0 comments

Terraform is dead

https://grahamgilbert.com/blog/2026/04/20/terraform-is-dead/
2•milkglass•16m ago•0 comments

Chernobyl disaster (April 26th, 1986)

https://en.wikipedia.org/wiki/Chernobyl_disaster
1•simonebrunozzi•16m ago•0 comments

But what is L0-L2 processing for satellite data?

https://medium.com/@aryachauhan7/but-what-is-l0-l2-processing-for-satellite-data-27f39f5324a1
1•marklit•18m ago•0 comments

Don't Confuse Computer Science with Coding

https://substack.com/home/post/p-194090221
2•AbbeFaria•18m ago•0 comments

AI can cost more than human workers now

https://www.axios.com/2026/04/26/ai-cost-human-workers
8•nreece•27m ago•2 comments

Conversations with Cosmos

https://madsenaim.substack.com/p/coming-soon
1•aimmia•31m ago•0 comments

I Am Doing This: The Origin Story of Project-AI

https://zenodo.org/records/19592336
1•IAmSoThirsty•31m ago•0 comments

Lipovive Review: Effective Formula for Fitness

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•JamesLoynes•32m ago•0 comments

OpenAI boss 'deeply sorry' for not telling police of mass shooter's account

https://www.bbc.com/news/articles/cq6je7e80r7o
3•chistev•36m ago•0 comments

An AI driven WP theming workflow

https://anchor.host/a-custom-wordpress-theme-from-scratch-in-2026-an-ai-driven-workflow/
1•g00m•37m ago•0 comments

Savings Hacks You Must Know

https://www.threads.com/@financial.tips.101/post/DXnfOxRCMEX
1•hennix22•41m ago•0 comments

Claude 4.7 vs. ChatGPT 5.5

https://www.tomsguide.com/ai/7-0-wipeout-i-put-chatgpt-5-5-and-claude-4-7-through-7-impossible-te...
3•ageospatial•42m ago•0 comments

Claude Platform on AWS (Coming Soon)

https://aws.amazon.com/claude-platform/
1•qainsights•44m ago•0 comments

Txtfold – summarize large files for LLMs

https://github.com/kristiandupont/txtfold
1•kristiandupont•47m ago•0 comments

The Silencing Engine

https://kitchencloset.com/realstuff/essays/the_silencing_engine/
1•bcRIPster•52m ago•0 comments

Show HN: ChatForm – Create an AI chat form in 1 minute

https://chatform.000ooo.ooo/
1•fengyiqicoder•59m ago•0 comments

Draft's knowledge graph engine – deterministic codebase understanding for AI

https://www.getdraft.dev/blog/local-graph-engine/
1•mayurpise•1h ago•0 comments

Why the Future Doesn't Need US

https://web.archive.org/web/20160210081017/http://www.wired.com/2000/04/joy-2/
1•signa11•1h ago•0 comments

The Publishing Mystery That No One Wants to Talk About

https://www.theatlantic.com/books/2026/04/who-really-wrote-autistic-author-woody-brown-novel/686814/
1•samclemens•1h ago•0 comments

AMD's Zen: Coming Back from the Dead

https://clamtech.org/?dest=zen1
1•matt_d•1h ago•1 comments

Coyote vs. Acme (1990)

https://www.newyorker.com/magazine/1990/02/26/coyote-v-acme
2•aaronbrethorst•1h ago•0 comments

Learning About FPGAs in Finance

https://www.semidesignjobs.com/blog/fpgas-in-finance-hft
1•johncole•1h ago•0 comments