In June 2025, Apple published a highly controversial paper, "The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity," which claimed that Large Reasoning Models (LRMs) do very little reasoning (planning).
jamesblonde•10h ago
Anthropic's Lawsen fired back, condemning the experimental setup as flawed and the conclusions as overstated.
This paper provides evidence supporting Apple's take: "failures solving the Towers of Hanoi were not purely a result of output constraints, but also partly a result of cognition limitations: LRMs still stumble when complexity rises moderately (around 8 disks)".
"we also identified persistent failure modes that reveal limitations in long-horizon consistency and symbolic generalization. Our analysis suggests that these reasoning breakdowns stem not only from architectural constraints, but also from the inherently stochastic nature of these systems and the optimization methods they rely on."