The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps. Surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
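For anyone unfamiliar with the setup the summary describes, here is a rough sketch of what "a reasoning trace from A* search" could look like. This is only an illustration under my own assumptions: the grid format, the `("create", cell)` / `("close", cell)` event names, and both function names are hypothetical and not taken from the paper. The point is that the final plan can be checked for validity independently of the trace that preceded it.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. trace is the list of
    ("create", cell) and ("close", cell) events in the order they occur; it
    stands in for the intermediate tokens (hypothetical format).
    """
    def h(cell):
        # Manhattan distance, admissible on a unit-cost 4-connected grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start)]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell))
        if cell == goal:
            # Walk parent pointers back to the start to recover the plan.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: endpoints, no walls, unit steps."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for r, c in path:
        if not (0 <= r < len(grid) and 0 <= c < len(grid[0])):
            return False
        if grid[r][c] == "#":
            return False
    return all(abs(a[0] - b[0]) + abs(a[1] - b[1]) == 1
               for a, b in zip(path, path[1:]))
```

Calling `astar_with_trace(["...", ".#.", "..."], (0, 0), (2, 2))` returns a plan and a trace, and `plan_is_valid` looks only at the plan. That separation is what lets you ask the summary's question: a model's output can pass the plan check even when the trace it printed beforehand does not correspond to any real run of the search.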
