frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Analyzing GPT-5.5 and Opus 4.7 with ARC-AGI-3

https://arcprize.org/blog/arc-agi-3-gpt-5-5-opus-4-7-analysis
1•rzk•46s ago•0 comments

Specialised Intelligence Platform for the Business and Economics of AI

https://www.tokenando.ai
1•karimmatar•50s ago•0 comments

I built an app that decides what to cook based on what's in your fridge

https://play.google.com/store/apps/details?id=com.pikanini.app&hl=en_US
1•amtkm•1m ago•0 comments

Show HN: WattFactory – Browser Based Indoor Cycling

https://wattfactory.fit
1•robputt•3m ago•0 comments

What Happens When Europeans Find Out How Poor They Are?

https://www.wsj.com/opinion/what-happens-when-europeans-find-out-how-poor-they-are-270cff5d
1•harambae•3m ago•0 comments

iOrchestra.ai prompt to hardware mass production platform YC looks for [video]

https://www.youtube.com/shorts/0gCeJFyRYx4
1•andresparraarz•6m ago•1 comments

Midori, the first browser to offer a VPN with Mesh technology

https://astian.org/midori-en/performance-adblock-and-more-in-midori-11-7-1/
1•ponchale•12m ago•0 comments

Kloak: Kernel-space secret injection via eBPF on Kubernetes

https://a-cup-of.coffee/blog/kloak/
1•ankitg12•14m ago•0 comments

OSTree Overview – Ostreedev/Ostree

https://ostreedev.github.io/ostree/introduction/
1•ankitg12•20m ago•0 comments

Ask.com, home of search butler Jeeves closes as conversational search comes back

https://www.theregister.com/2026/05/04/askjeeves_closes/
1•beardyw•20m ago•0 comments

CARA 2.0 – "I Built an Better Robot Dog"

https://www.aaedmusa.com/projects/cara2
1•hakonjdjohnsen•21m ago•0 comments

Rootshell – The terminal, reimagined for Apple platforms

https://rootshell.com/
1•theshrike79•22m ago•0 comments

Allegedly Nintendo Stopped Selling to Amazon After Being Asked to Break the Law

https://kotaku.com/reggie-fils-aime-says-nintendo-stopped-selling-to-amazon-after-being-asked-to-...
2•croes•22m ago•0 comments

Call JavaScript functions across thread boundaries like they were local

https://www.npmjs.com/package/achi-rpc
1•stiles11•24m ago•0 comments

Golang Tutorial: Find and fix vulnerable dependencies with govulncheck

https://go.dev/doc/tutorial/govulncheck
1•l1am0•26m ago•0 comments

The Myth of RAM, part I (2014)

https://www.ilikebigbits.com/2014_04_21_myth_of_ram_1.html
1•downbad_•27m ago•1 comments

The Back-end for Front-end Pattern (BFF) (2015)

https://philcalcado.com/2015/09/18/the_back_end_for_front_end_pattern_bff.html
1•locknitpicker•30m ago•0 comments

AWS Lambda Cold Start: Java, Python, Go, Rust

https://medium.com/@yalovoy/four-million-lambda-invokes-across-python-java-rust-and-go-5b9218f64563
1•zero-ground-445•32m ago•0 comments

Ping Poetics (2009)

https://electronicbookreview.com/publications/ping-poetics/
1•jruohonen•34m ago•0 comments

Package and distribute Electron apps with "auto update" support

https://www.electron.build/index.html
2•ankitg12•43m ago•0 comments

Banger: One-command dev sandboxes on Firecracker microVMs

https://git.thaloco.com/thaloco/banger
1•thunderbong•44m ago•0 comments

Vampires, prisoners, and late-stage capitalism

https://gagliardoni.net/#20260504_vampire_capitalism
2•tomgag•51m ago•0 comments

AI Coding Models You Can Run Locally on Consumer Hardware

https://firethering.com/best-coding-models-consumer-hardware/
1•steveharing1•53m ago•1 comments

Microelectrode Techniques: The Plymouth Workshop Handbook

https://plymsea.ac.uk/id/eprint/7954/
2•teleforce•56m ago•0 comments

DSPy – Programming – not prompting – LMs

https://dspy.ai/
1•sakompella•56m ago•0 comments

Text Files as a User Interface

https://ratfactor.com/cards/text-files-as-ui
1•ingve•1h ago•0 comments

Pinecone Nexus: The Knowledge Engine for Agents

https://www.pinecone.io/blog/knowledge-infrastructure-for-agents/
1•berlianta•1h ago•0 comments

Designing Microkernel IPC

https://seiya.me/blog/microkernel-ipc-design
1•ingve•1h ago•1 comments

Evolving the Android and Chrome Vulnerability Reward Programs for the AI Era

https://bughunters.google.com/blog/evolving-the-android-chrome-vrps-for-the-ai-era
3•tjek•1h ago•0 comments

Clawback: Safer OpenClaw Upgrade Rehearsals

https://github.com/haishmg/Clawback
1•princeharry86•1h ago•0 comments