The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
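
For anyone who wants to picture the setup described in that summary, here is a rough sketch of how a "correct trace" and a "wrong trace" training example could be built from A* search. This is not the paper's code: the toy grid, the `create`/`close` token format, and the names `astar_with_trace` and `make_example` are all assumptions made up for illustration, and swapping in a trace from an unrelated problem is just one possible way to get incorrect reasoning steps.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (1 = wall) and log every step.

    Returns (path, trace). The trace token format is hypothetical.
    """
    def h(cell):
        # Manhattan distance: admissible on a unit-cost 4-connected grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [f"create {start[0]} {start[1]} g0 h{h(start)}"]
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    best_g = {start: 0}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(f"close {cell[0]} {cell[1]} g{g} h{h(cell)}")
        if cell == goal:
            # Walk parent pointers back to the start.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1 or (nr, nc) in closed:
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(f"create {nr} {nc} g{ng} h{h((nr, nc))}")
    return None, trace


def make_example(grid, start, goal, trace_source=None):
    """Build one training example: problem, intermediate tokens, answer.

    If trace_source (another (grid, start, goal) triple) is given, the
    intermediate tokens come from that unrelated problem, while the
    final answer still belongs to the real one.
    """
    path, trace = astar_with_trace(grid, start, goal)
    if trace_source is not None:
        _, trace = astar_with_trace(*trace_source)
    return {"problem": (grid, start, goal), "trace": trace, "answer": path}


grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]
other = ([[0, 0, 0]], (0, 0), (0, 2))  # an unrelated problem

clean = make_example(grid, (0, 0), (2, 0))
swapped = make_example(grid, (0, 0), (2, 0), trace_source=other)
print(clean["answer"])
print(swapped["trace"])
```

Both examples end in the same correct path; only the intermediate tokens differ. As the summary describes it, the surprising finding is that models trained on the second kind of example perform about as well as those trained on the first.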
