frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•3mo ago

Comments

tocs3•3mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The Secret of Highly Efficient Teams

https://managerstories.co/the-secret-of-highly-efficient-teams/
1•damsos•38s ago•0 comments

The Magic of Showing Up

https://prisonculture.substack.com/p/the-magic-of-showing-up
1•speckx•3m ago•0 comments

Visitors travelling to Europe will face new digital checks

https://economictimes.indiatimes.com/nri/visit/starting-this-october-visitors-travelling-to-europ...
1•01-_-•5m ago•0 comments

Students Using AI for Controversial Viewpoints

https://www.insidehighered.com/opinion/views/2025/09/16/about-fires-free-speech-rankings-opinion
1•HR01•5m ago•0 comments

Hired Through GitHub: Part 1

https://zed.dev/blog/hired-through-github-part-1
1•meetpateltech•7m ago•0 comments

The Look of Success: AI-Measured Face Factors and Venture Financing

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5426917
2•Bostonian•8m ago•0 comments

Memoir of chess influencer and cult survivor Danny Rensch

https://www.newyorker.com/sports/sporting-scene/the-many-lives-of-danny-rensch
1•drw•11m ago•0 comments

FBI couldn't get my husband to decrypt his Tor node so he was jailed for 3 years

https://old.reddit.com/r/TOR/comments/1ni5drm/the_fbi_couldnt_get_my_husband_to_decrypt_his_tor/
20•heavyset_go•18m ago•2 comments

Robert Redford Has Died

https://www.nytimes.com/2025/09/16/movies/robert-redford-dead.html
7•uptown•18m ago•1 comments

Bear Blog Discovery Feed

https://bearblog.dev/discover/
1•fatfox•18m ago•0 comments

FAA seeks to fine Boeing $3.1M for safety violations, door plug blowout

https://www.npr.org/2025/09/13/nx-s1-5540728/boeing-faa-safety-fines-door-plug-blowout
1•thelastgallon•18m ago•1 comments

Safari 26.0 Release Notes

https://developer.apple.com/documentation/safari-release-notes/safari-26-release-notes
1•ksec•22m ago•0 comments

Fixing AWS Architecture Diagrams: AI Document Processing

https://www.ilograph.com/blog/posts/fixing-aws-diagrams-ai-document-processing/
2•billyp-rva•22m ago•1 comments

Growth of complex oxide crystals melting over 2200 °C using tungsten crucible

https://www.nature.com/articles/s41598-025-12535-0
1•PaulHoule•27m ago•0 comments

Stem cell transplant for stroke leads to brain cell growth in mice

https://medicalxpress.com/news/2025-09-stem-cell-transplant-brain-growth.html
1•geox•32m ago•0 comments

Ask HN: Reddit banned my account with no explanation, why?

3•monneyboi•34m ago•3 comments

Campaigners urge EU to mandate 15 years of OS updates

https://www.theregister.com/2025/09/16/campaigners_urge_eu_to_mandate/
3•rntn•35m ago•0 comments

Free Website Traffic Checker

https://trafficchecker.org/
1•modao526•37m ago•1 comments

Ask HN: Generalists, when do you say "I know enough" about any particular topic?

3•AbstractH24•37m ago•4 comments

Show HN: I'm Building an Alternative to Shopify

https://www.searchagora.com/build-store
3•astronautmonkey•37m ago•2 comments

AMD ROCm 7.0 Begins Rocking Out on GitHub

https://www.phoronix.com/news/AMD-ROCm-7.0-Rolling-Out
2•ohmyblock•41m ago•1 comments

You Want Technology with Warts

https://entropicthoughts.com/you-want-technology-with-warts
2•todsacerdoti•45m ago•0 comments

Wrkflw: Validate and run Microsoft GitHub Actions locally

https://github.com/bahdotsh/wrkflw
1•fanf2•46m ago•0 comments

Nvidia tipped to be TSMC's first 16A customer, ahead of Apple

https://www.tomshardware.com/tech-industry/semiconductors/nvidia-dethrones-apple-to-debut-tsmc-a16
1•gloxkiqcza•47m ago•0 comments

Ask HN: Any important paper in AI apart from attention is all you need?

1•miletus•49m ago•1 comments

Ask HN: Looking for blog post about conversations in Apple ads

1•TuringTux•51m ago•1 comments

Rolling Stone magazine lays off critic Alan Sepinwall, among others

https://bsky.app/profile/iancass.bsky.social/post/3lyw72kxjpc2f
2•OgsyedIE•54m ago•0 comments

Full M18 battery diagnostics revealed

https://www.youtube.com/watch?v=tHj0-Gzvbeo
1•ziptron•54m ago•0 comments

Show HN: Atlas – Open-source network discovery and visualization tool

https://github.com/karam-ajaj/atlas
1•vnerd•55m ago•0 comments

Punch-Cards to Prompts: Rebooting Reality

https://medium.com/@PromptingHumanity/punch-cards-to-prompts-rebooting-reality-3a7973ed4f01
2•merusame•58m ago•0 comments