The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
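To make the "check both the final answers and the reasoning steps" idea concrete, here is a minimal sketch, not the paper's code. It assumes a toy grid maze, and the trace format (`create`/`close` tuples) and the function names `astar_with_trace`, `plan_is_valid`, and `trace_is_valid` are hypothetical choices made for illustration. It runs A* while logging a trace, then validates the final plan and the trace separately, so a correct plan can be paired with a broken trace.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid ('#' is a wall). Returns (path, trace).

    The trace is a list of (op, cell, g, h) tuples with op in
    {"create", "close"}; this format is an assumption for illustration.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: a wall-free, step-by-step path."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for r, c in path:
        if not (0 <= r < len(grid) and 0 <= c < len(grid[0])):
            return False
        if grid[r][c] == "#":
            return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(path, path[1:]))


def trace_is_valid(trace):
    """Check the intermediate steps only: every closed node must have
    been created earlier with the same cost."""
    created = set()
    for op, cell, g, _ in trace:
        if op == "create":
            created.add((cell, g))
        elif (cell, g) not in created:
            return False
    return True


grid = ["..#",
        "...",
        "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
# A corrupted trace that closes a node that was never created.
bad_trace = [("close", (5, 5), 3, 0)]
```

Because the two checks are independent, the pair `(path, bad_trace)` passes `plan_is_valid` but fails `trace_is_valid`, which is the kind of mismatch between answer and "reasoning" that the summary describes.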
