frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

A Paradigm Shift in Microbial Protein Manufacturing

https://www.mdpi.com/2075-1729/16/1/129
1•PaulHoule•9s ago•0 comments

Show HN: Sarab – Expose localhost to the internet using Cloudflare Tunnels

https://github.com/meedoomostafa/sarab
1•meedoomostafa•1m ago•0 comments

The Joy of Clothes

https://dynkarken.substack.com/p/the-joy-of-wearing-pretty-clothes
1•thinkingaboutit•4m ago•0 comments

The K-Shaped Future of Software Engineering

https://www.ian.so/writing/k-shaped-future-software-engineering
2•ian_dot_so•6m ago•0 comments

Body Censorship

https://www.youtube.com/watch?v=FcpRrytvYYo
1•barrister•7m ago•0 comments

AI diagram looks great and nobody will read it

https://jpcaparas.medium.com/your-ai-diagram-looks-great-and-nobody-will-read-it-f1e34fe9c8f1
1•birdculture•7m ago•0 comments

Nvidia helped DeepSeek hone AI models later used by China's military

https://www.reuters.com/world/china/nvidia-helped-deepseek-hone-ai-models-later-used-by-chinas-mi...
2•DustinEchoes•9m ago•0 comments

Major grid operator for Pa., East Coast predicts energy shortfall by mid-2027

https://www.spotlightpa.org/news/2026/01/pjm-grid-short-fall-power-plants-data-centers-environment/
1•bikenaga•12m ago•0 comments

A Mild Take on Coding Agents

https://meelo.substack.com/p/a-mild-take-on-coding-agents
2•milowata•13m ago•0 comments

Show HN: A constraints based IaC decision tool

https://whichiac.com
1•batemanchris•13m ago•1 comments

Common Lisp Extension for Zed

https://github.com/etyurkin/zed-cl
1•mike_ivanov•13m ago•0 comments

The K-Shaped Future of Software Engineering

https://twitter.com/ian_dot_so/status/2013316676637294890
1•mji•15m ago•0 comments

Claude Code hacks its way to success

https://www.theeggeadventure.com/2026/01/recovering-ssh-on-a-headless-raspberry-pi-through-a-priv...
2•brother_corp•19m ago•0 comments

Ask HN: How do you measure AI adoption within your teams? (Best Practices)

1•nemath•19m ago•0 comments

Show HN: LSP and Grammar for HTML and Mustache Templates

https://github.com/reteps/tree-sitter-htmlmustache
1•spicypete•20m ago•0 comments

Show HN: A novel pattern for handling in-flight requests in distributed caches

https://www.infoq.com/articles/durable-objects-handle-inflight-requests/
1•gkoos•21m ago•0 comments

Trump voters support military intervention in more countries

https://www.politico.com/news/2026/01/28/trump-is-threatening-strike-iran-his-supporters-wouldnt-...
6•doctor_radium•21m ago•4 comments

Reframing Agents: Why I Don't Think Agents Are the Future of AI Software

https://valtetu.framer.website/blog/reframing-agents
2•valtetu•23m ago•1 comments

SeqMem – New update. Specified details. V2

https://drive.google.com/file/d/1PgAdnsFrHaataORGucauLjLbn-jkLp42/view?usp=drive_link
1•goofgef•24m ago•0 comments

Haskell Hangman (Surveillance)

https://www.youtube.com/watch?v=bGHMET8knv4
1•barrister•25m ago•0 comments

Elon Musk's SpaceX, Tesla, and xAI in talks to merge, according to reports

https://techcrunch.com/2026/01/29/elon-musk-spacex-tesla-xai-merger-talks-ipo-reuters/
6•gfortaine•26m ago•2 comments

In car safety, why some companies merely meet a standard and others exceed it

https://anderson-review.ucla.edu/in-car-safety-why-some-companies-merely-meet-a-standard-and-othe...
1•hhs•26m ago•0 comments

I Hope This Email Finds You Before I Do

https://www.lastweekinaws.com/blog/i-hope-this-email-finds-you-before-i-do/
1•dabinat•28m ago•0 comments

Nvidia PersonaPlex: Natural Conversational AI with Any Role and Voice

https://research.nvidia.com/labs/adlr/personaplex/
2•smusamashah•28m ago•0 comments

A lock-free, high-performance IPC channel inspired by Firedancer's Tango

https://crates.io/crates/rust-tango
1•amineelqaraoui•29m ago•0 comments

Show HN: GitViral – Turn technical READMEs into social media threads

https://git-viral.vercel.app
1•Helios999•30m ago•1 comments

Transport for London Interchange Signs Standard [pdf]

https://content.tfl.gov.uk/tfl-interchange-signs-standard.pdf
2•susam•30m ago•0 comments

Original Oil Painting

https://twitter.com/i/status/1902923432997658918
1•barrister•31m ago•2 comments

RlmUI- the HTML/CSS User Interface Library

https://github.com/mikke89/RmlUi
1•kreco•31m ago•0 comments

Bluesky 2025 Transparency Report

https://bsky.social/about/blog/01-29-2026-transparency-report-2025
1•emschwartz•32m ago•0 comments