frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

I got tired of AI chatbots so we turned the OS into an AI agent

https://www.jeriko.ai/
1•Khaleel7337•4m ago•1 comments

Ask HN: Uploaded a post and it was [dead] within a minute

1•_ananos_•5m ago•0 comments

CrackArmor: Multiple Vulnerabilities in AppArmor

https://cdn2.qualys.com/advisory/2026/03/10/crack-armor.txt
1•mmsc•5m ago•0 comments

The evolution of Mac app window corners

https://lapcatsoftware.com/articles/2026/3/4.html
1•robenkleene•5m ago•0 comments

Consumer rights wiki becomes a browser extension

https://github.com/FULU-Foundation/CRW-Extension
1•NotLemikiy•5m ago•0 comments

Russia is carrying out a cyber campaign targeting Signal and WhatsApp accounts

https://www.aivd.nl/actueel/nieuws/2026/03/09/rusland-voert-cybercampagne-uit-tegen-signal--en-wh...
1•komape•7m ago•0 comments

How to make your own static site generator

https://gaultier.github.io/blog/how_to_make_your_own_static_site_generator.html
1•gingersnap•7m ago•0 comments

YouTube videos that have almost zero previous views

http://astronaut.io/
1•Zealotux•8m ago•1 comments

I traced $2B in grants and 45 states' lobbying behind age‑verification bills

https://old.reddit.com/r/linux/comments/1rshc1f/i_traced_2_billion_in_nonprofit_grants_and_45/
3•shaicoleman•14m ago•0 comments

The End of the Open Web

https://www.netmeister.org/blog/open-web.html
3•speckx•14m ago•0 comments

50 Years of Thinking Different

https://www.apple.com/50-years-of-thinking-different/
3•tilt•17m ago•1 comments

Show HN: Privacy Mask – prevent secrets leaking to AI agents

2•fullstackcrew•19m ago•0 comments

Show HN: fftool – A Terminal UI for FFmpeg – Shows Command Before It Runs

https://bensantora.com/posts/fftool-ffmpeg-tui-go/
3•taskset•21m ago•0 comments

Benchmarking Hosted Browser Providers: Speed, Stealth, Captcha, and Concurrency

https://techstackups.com/comparisons/hosted-browser-benchmarks/
2•ritzaco•22m ago•0 comments

How to Run a Pool of Autonomous Coding Agents on Your Jira Backlog

https://jaksa.me/blog/2026-03-01-pool-of-agents
2•jaksa•23m ago•0 comments

Advertising was always going to come for AI chatbots. The real question is how

https://reutersinstitute.politics.ox.ac.uk/news/advertising-was-always-going-come-ai-chatbots-rea...
2•jruohonen•25m ago•0 comments

Show HN: I forked Python's Requests to add HTTP/3, async, and multiplexing

https://github.com/jawah/niquests/tree/v3.18.2
4•mesahm•28m ago•2 comments

Beyond Agents.md: Harness Eng, Loop-Based Delivery, and Context-Aware Prompting

https://teamcadence.ai/blog/context-aware-prompting/
4•daveslutzkin•29m ago•0 comments

Updates on Analyst Platform for Data Analysts

https://anallyst.onrender.com
2•Sechele•29m ago•0 comments

AI Isn't People

https://www.todayintabs.com/p/a-i-isn-t-people
3•q-base•30m ago•0 comments

Show HN: I wrote a free trilogy about perception, presence, and leadership

https://marcus-corvin.github.io/thecalibratedview/
3•mr_octopus•32m ago•1 comments

How Japan Is Buying Back Its Semiconductor Industry [video]

https://www.youtube.com/watch?v=9t9D0gVfPX4
2•mgh2•34m ago•0 comments

Show HN: Payment Hunter – AI-powered invoice reminders for freelancers

2•paymenthunter01•35m ago•0 comments

What do coders do after AI?

https://www.anildash.com/2026/03/13/coders-after-ai/
2•speckx•37m ago•0 comments

Slate: Moving Beyond ReAct and RLM

https://randomlabs.ai/blog/slate
2•vinhnx•38m ago•0 comments

Safety Agents for Autonomous Systems

https://stackresearch.org/blog/control-ops/
2•dnmacon•39m ago•1 comments

Claude can generate custom diagrams, and charts directly in your conversation

https://support.claude.com/en/articles/13979539-custom-visuals-in-chat
2•simianwords•39m ago•0 comments

Claude now has Generative UI – interactive charts and diagrams

https://twitter.com/claudeai/status/2032124273587077133
2•simianwords•41m ago•1 comments

Show HN: Cigarette Rocket Booster – a rocket where the body itself is fuel

https://github.com/solenopsys/CRB
2•solenopsys•41m ago•0 comments

Show HN: JobStocks – track hiring changes at public companies vs. stock price

https://jobstocks.ai/
2•TalO•44m ago•0 comments