frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Godot is drowning in AI slop pull requests

https://bsky.app/profile/akien.bsky.social/post/3meyerixvhs2p
1•mbreese•2m ago•0 comments

The case for gatekeeping, or: why medieval guilds had it figured out

https://www.joanwestenberg.com/the-case-for-gatekeeping-or-why-medieval-guilds-had-it-figured-out/
1•todsacerdoti•2m ago•0 comments

Wordpress.com adds an AI Assistant that can edit, adjust styles, create images

https://techcrunch.com/2026/02/17/wordpress-com-adds-an-ai-assistant-that-can-edit-adjust-styles-...
1•taubek•3m ago•0 comments

I don't know anyone who uses Grokipedia, it will never work, says Jimmy Wales

https://www.moneycontrol.com/europe/?url=https://www.moneycontrol.com/news/business/india-ai-impa...
1•jmsflknr•4m ago•0 comments

Show HN: Potatoverse platform for apps in single binary, SQLite db

https://github.com/blue-monads/potatoverse
4•born-jre•5m ago•0 comments

Data Center

https://store.steampowered.com/app/4170200/Data_Center/
1•prawn•7m ago•0 comments

I hacked ChatGPT and Google's AI – and it only took 20 minutes

https://www.bbc.com/future/article/20260218-i-hacked-chatgpt-and-googles-ai-and-it-only-took-20-m...
3•ranit•9m ago•0 comments

Agent37

https://www.agent37.com
2•bellamoon544•13m ago•1 comments

Ask HN: How do you debug multi-step AI workflows when the output is wrong?

1•terryjiang2020•15m ago•0 comments

Show HN: I created iHateCSV.com for those who hate it when spreadsheets break

https://ihatecsv.com/
1•vinserello•16m ago•0 comments

Cuban-American Voters Who Supported Hardline Immigration Policies

1•poojagill•17m ago•1 comments

Data leak at Abu Dhabi finance summit exposes politicians and business leaders

https://www.ft.com/content/b86cefd5-90e7-410b-bf58-09b9fde307cb
3•JumpCrisscross•17m ago•1 comments

OpenClaw refactored in Go, runs on $10 hardware

https://picoclaw.net/
1•Nazzareno•17m ago•0 comments

Rubio's warm words to Orbán reinforce EU fears that US seeks disunity in Europe

https://www.theguardian.com/us-news/2026/feb/17/marco-rubio-viktor-orban-eu-disunity-analysis
1•robtherobber•19m ago•1 comments

VVTerm – Ghostty-powered SSH client for iOS, iPad, macOS

https://vvterm.com/
1•wiedymi•19m ago•1 comments

Stop prompting. Let the AI interview you to build specs

https://www.ideaforge.chat/
2•enha•19m ago•1 comments

Show HN: Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis

https://github.com/sadopc/unified-db-2
2•sadopc•23m ago•1 comments

Proxmox-GitOps: IaC Automation Framework for LXC: Local Development and Staging

1•gitopspm•23m ago•0 comments

Show HN: Shiro.computer static page, Unix/NPM shimmed enough to host Claude Code

https://shiro.computer/about
1•sagebird•24m ago•0 comments

Show HN: Add reverb and bass boost via WebAudio API to Spotify/YouTube

https://v0-screen-capture-audio.vercel.app/
1•jardy•24m ago•1 comments

Why my country's AI scene is built on sand

1•Rioverde•25m ago•2 comments

The resources I'm using to learn Maths, AI and Robotics

https://parsam.io/maths-ai-robotics/
3•pzrsa•26m ago•1 comments

A way to manage your versioning and changelogs with a focus on monorepos

https://github.com/changesets/changesets
2•lwhsiao•27m ago•0 comments

Show HN: How do you prioritize user feedback without going insane?

1•superproton•28m ago•0 comments

US lawyers start privacy class action accusing Lenovo of data transfers to China

https://www.theregister.com/2026/02/17/lenovo_privacy_lawsuit/
1•robtherobber•31m ago•0 comments

Decades in the Machine – Meaning and Purpose in Technology

https://www.classcentral.com/course/youtube-decades-in-the-machine-meaning-and-purpose-in-technol...
1•andsoitis•35m ago•0 comments

His Brownstone Is Worth $5.4M. Why Is His Tax Bill So Low?

https://www.nytimes.com/2024/04/02/nyregion/nyc-property-tax.html
2•JumpCrisscross•36m ago•1 comments

Making Flow – Interview with Director Gints Zilbalodis

https://www.blender.org/user-stories/making-flow-an-interview-with-director-gints-zilbalodis/
1•abdelhousni•36m ago•0 comments

Can we leverage AI/LLMs for self-learning?

https://techne98.com/blog/can-we-use-ai-for-self-learning/
1•fixedprog•37m ago•0 comments

What's cooking on Sourcehut? Q1 2026

https://sourcehut.org/blog/2026-02-18-whats-cooking-q1-2026/
1•todsacerdoti•37m ago•0 comments