frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

The Illusion of the Illusion of Thinking – A Comment on Shojaee et al. (2025)

https://arxiv.org/abs/2506.09250
7•gfortaine•6h ago

Comments

ForHackernews•15m ago
"5 Alternative Representations Restore Performance To test whether the failures reflect reasoning limitations or format constraints, we conducted preliminary testing of the same models on Tower of Hanoi N = 15 using a different representation: Prompt: "Solve Tower of Hanoi with 15 disks. Output a Lua function that prints the solution when called."

Results: Very high accuracy across tested models (Claude-3.7-Sonnet, Claude Opus 4, OpenAI o3, Google Gemini 2.5), completing in under 5,000 tokens.

The generated solutions correctly implement the recursive algorithm, demonstrating intact reasoning capabilities when freed from exhaustive enumeration requirement""

Is there's something I'm missing here?

This seems like it demonstrates the exact opposite of what the authors are claiming: Yes, your bot is an effective parrot that can output a correct Lua program that exists somewhere in the training data. No, your bot is not "thinking" and cannot effectively reason through the algorithm itself.

ForHackernews•8m ago
> Recent reports have claimed that most 7th graders are unable to independently derive the Pythagorean Theorem, however our analysis reveals that these apparent failures stem from experimental design choices rather than inherent student limitations.

When given access to Google and prompted to "tell me how to find the length of hypotenuse of a right triangle", a majority of middle-schoolers produced the correct Pythagorean Theorem, demonstrating intact reasoning capabilities when freed from the exhaustive comprehension requirement.

ForHackernews•4m ago
Wait is C. Opus just the anthropic bot? Did I waste my time reading AI nonsense?

Because Apple should have let you save Audio to the Camera Roll

https://apps.apple.com/gb/app/just-send-record/id6745911500
1•zahirbmirza•19s ago•0 comments

Decomposing iOS Builds and Tracking Size Changes over Time Locally

https://dotipa.app
1•elpakal•30s ago•0 comments

3D printable 6" f/5 compact travel telescope model

https://www.printables.com/model/1325533-smallest-telescope-kit-for-150750
1•chantepierre•2m ago•1 comments

How LLMs Know When to Stop Talking?

https://www.louisbouchard.ai/how-llms-know-when-to-stop/
1•Capstanlqc•3m ago•1 comments

Rust compiler performance survey 2025

https://blog.rust-lang.org/2025/06/16/rust-compiler-performance-survey-2025/
1•mikece•4m ago•0 comments

Slack (2017)

https://thezvi.wordpress.com/2017/09/30/slack/
1•eamag•5m ago•0 comments

EU plans ban on new Russian gas contracts using trade law

https://www.ft.com/content/8b005c13-2088-47cd-aa47-9163e36efa4a
3•toomuchtodo•6m ago•1 comments

End of Windows 10 is approaching, so it's time to consider Linux and LibreOffice

https://blog.documentfoundation.org/blog/2025/06/11/the-end-of-windows-10/
3•speckx•8m ago•0 comments

Meteus – One API to Post on Instagram, TikTok, Pinterest, and More

https://www.meteus.dev
1•meyusufdemirci•8m ago•1 comments

Overclocked: An Archive of Graphics Card Box Art

https://lockbooks.net/products/overclocked-an-archive-of-graphics-card-box-art
2•thenthenthen•10m ago•1 comments

Finding Private Information Through Resumes on Google Search

https://nelson.cloud/finding-private-information-through-resumes-on-google-search/
1•nelsonfigueroa•11m ago•0 comments

Mac Mini Service Program for No Power Issue

https://support.apple.com/mac-mini-2023-service-program-for-no-power-issue
1•doener•13m ago•0 comments

Revealing Political Bias in LLMs Through Structured Multi-Agent Debate

https://arxiv.org/abs/2506.11825
1•rntn•14m ago•0 comments

WhatsApp is getting ads using personal data from Instagram and Facebook

https://noyb.eu/en/whatsapp-getting-ads-using-personal-data-instagram-and-facebook
1•ThePhysicist•14m ago•0 comments

Merrypopins a Library for Nanoindentation

https://mnky9800n.substack.com/p/merrypopins-a-library-for-nanoindentation
1•mnky9800n•16m ago•0 comments

Ask HN: How do I market to consumers as a solo dev about to go to uni?

2•Alex-Programs•17m ago•1 comments

Crypto group Tron to go public after U.S. pauses probe into billionaire founder

https://www.ft.com/content/13a6cead-af71-4811-9b90-553f233ac45f
1•cempaka•20m ago•0 comments

Trump Organization enters phone market with $499 Trump Mobile device

https://www.reuters.com/world/us/trump-organization-unveils-self-branded-mobile-phone-network-2025-06-16/
6•jayknight•20m ago•1 comments

Apollo 11 Technical Crew Debriefing – Tape 3 [video]

https://www.youtube.com/watch?v=yTzEIIJm1-Y
1•pgreenwood•20m ago•0 comments

Engineers at our startup don't build features anymore

1•s4293918•21m ago•0 comments

Replace Your Gmail Password Now, Google Tells 2B Users

https://www.forbes.com/sites/daveywinder/2025/06/15/change-your-gmail-password-now-google-tells-2-billion-users/
2•thunderbong•22m ago•1 comments

Breaking Murphy's Law

http://www.breakingmurphyslaw.com/
1•bookofjoe•23m ago•0 comments

Ask HN: How do you handle an employee who complies but never delivers?

4•tropicalfruit•25m ago•3 comments

Gbadev.org

https://gbadev.org/
1•ibobev•27m ago•0 comments

How Storytelling Fixed My Broken User Experience

https://eonurk.com/2025/06/16/enhancing-user-experience-via-storytelling/
1•celltalk•27m ago•0 comments

My grandparents chose to die together, the end chapter of love spanning 70 years

https://www.theguardian.com/australia-news/2025/jun/08/when-they-chose-to-die-together-my-grandparents-wrote-the-final-chapter-of-a-love-story-spanning-70-years
2•NaOH•27m ago•1 comments

Tyme+ – The Everyday App

https://tyme.today
1•tymelabs•29m ago•2 comments

Summary of Heroku June 10 Outage

https://www.heroku.com/blog/summary-of-june-10-outage/
1•dakull•31m ago•0 comments

Use AI to Get Your Time Back

https://algarch.com/blog/use-ai-to-get-your-time-back
2•jdalton•34m ago•1 comments

Believing you only have one option is dangerous

https://www.clearerthinking.org/post/believing-you-only-have-one-option-is-dangerous
5•gmays•41m ago•0 comments