Show HN: Why single agents suck at math proofs

https://ensue.dev/blog/stop-throwing-a-single-agent-at-complex-problems/
3•austinbaggio•1h ago
Cursor published a great post last week on scaling long-running autonomous coding agents to build a browser.

We’ve been exploring a similar idea, but applied to math proofs.

We show how a swarm of agents, coordinated via shared memory, can generate a Lean proof for a non-trivial multi-step math problem (Putnam A2) that a single agent struggled with. We were feeling fancy, so we wrote a full post and captured the runtime.

Harness will be public soon(tm) for you to run it yourself.

Comments

saidcooldude•1h ago
i thought this work was fun for white box theorem proving. it is interesting to pass the structure to an llm to better understand the problem solving strategy.

using the tree shape as context for other questions was also interesting

How the NHS became the battleground in the trans debate facing workplaces

https://www.bbc.co.uk/news/articles/c7v0l25mr2ro
1•binning•3m ago•0 comments

Power, Consumption and Gender: An analysis of Barbara Kruger's political art

https://feminisminindia.com/2026/01/14/power-consumption-and-gender-an-analysis-of-barbara-kruger...
1•binning•4m ago•0 comments

Every big lab is putting resources in building world models

https://ankitmaloo.com/world-models/
1•ankit219•4m ago•0 comments

Show HN: Remember Me – O(1) Client-Side Memory (40x cheaper than Vector DBs)

https://github.com/merchantmoh-debug/Remember-Me-AI
1•MohskiBroskiAI•4m ago•0 comments

Manipulating blood CO₂ levels may help clear toxic proteins from the brain

https://medicalxpress.com/news/2026-01-blood-co8322-toxic-proteins-brain.html
1•bikenaga•4m ago•0 comments

480k-Year-Old Elephant Bone Tool Is the Oldest Ever Found Outside Africa

https://www.iflscience.com/this-480000-year-old-elephant-bone-tool-is-the-oldest-ever-found-outsi...
1•geox•7m ago•0 comments

How are you automating your coding work?

4•manthangupta109•8m ago•0 comments

Tracking Kernel Development with Korgalore

https://people.kernel.org/monsieuricon/tracking-kernel-development-with-korgalore
1•atomlib•8m ago•0 comments

Data Modeling: Living notes on levels, techniques, and patterns

https://www.ssp.sh/brain/data-modeling/
1•articsputnik•9m ago•0 comments

Doctors declare effects of child phone use a public health emergency

https://www.thetimes.com/uk/politics/article/phone-impact-on-children-is-public-health-emergency-...
1•chrisjj•9m ago•0 comments

Show HN: Snapbyte – personalized email digests from HN/Reddit/Lobsters

https://snapbyte.dev
3•onatm•11m ago•0 comments

Disrupted brain balance in alcohol dependence involves two signaling pathways

https://medicalxpress.com/news/2025-12-disrupted-brain-alcohol-involves-pathways.html
1•PaulHoule•11m ago•0 comments

Show HN: I vibecoded a Test Management app for Jira

https://marketplace.atlassian.com/apps/695702622/bestest-requirement-test-management
1•pakosteve•13m ago•0 comments

Setting Up a Cluster of Tiny PCs for Parallel Computing

https://www.kenkoonwong.com/blog/parallel-computing/
3•speckx•13m ago•0 comments

Getting Cited as a Source on Wikipedia

https://www.coryd.dev/posts/2026/getting-cited-as-a-source-on-wikipedia
1•cdrnsf•13m ago•0 comments

Ask HN: Is OBD-II telematics data more private than mobile app tracking?

1•insuranceguru•14m ago•0 comments

Show HN: JitAPI – An MCP server that treats OpenAPI specs as dependency graphs

https://github.com/nk3750/jitapi
1•peaknk•15m ago•0 comments

Element Pro Web Introduces Grid View

https://element.io/blog/element-pro-web-introduces-grid-view/
1•Arcuru•15m ago•0 comments

If you're struggling to get your engineers to adopt AI, read this

https://www.geocod.io/code-and-coordinates/2026-01-21-hand-chiseling-code/
1•mijustin•16m ago•1 comments

The Treachery of Signs Semiotic Mediation

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5987495
2•spacebacon•16m ago•0 comments

How to track your AI Search visibility

https://www.scriptbee.ai/
1•Rayn_11•17m ago•0 comments

How much glycogen is stored in a runner's liver?

https://runningwritings.com/2023/10/how-much-glycogen-is-stored-in-a-runners-liver.html
2•galeaspablo•19m ago•0 comments

Show HN: DockerHoster – Self-hosted alternative to Vercel with auto-deployments

https://twitter.com/jaequery/status/2014049948195815585
1•jaequery•19m ago•0 comments

3,500 Miles in 2025

https://danielmangum.com/posts/3500-miles-2025/
1•hasheddan•20m ago•0 comments

A Prophet of the Weather: Lantern Slides by Clement Lindley Wragge (Ca. 1900–22)

https://publicdomainreview.org/collection/wragge-lantern-slides/
1•crescit_eundo•23m ago•0 comments

Cybernetic Attention: All Watched over by Machines We Learned to Watch

https://publicdomainreview.org/essay/cybernetic-attention/
2•crescit_eundo•23m ago•0 comments

Claude’s Constitution: Our vision for Claude's character

https://www.anthropic.com/constitution
3•Anon84•23m ago•1 comments

Show HN: SeeClaudeCode – visualize Claude Code's edits to your repo in real time

https://seeclaudecode.fly.dev/
2•ninajlu•24m ago•0 comments

Show HN: A minimal beads-like issue tracker for AI agents

https://github.com/obsfx/trekker
1•obsfx•26m ago•1 comments

How to Collect Contact Data from Telegram Chats

https://crona.ai/blog/how-to-collect-contact-data-from-telegram-chats-using-crona
1•rin_khat•28m ago•0 comments