frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: A project planning tool designed around drvleopers

https://getfrostbyte.dev/
2•thamiltonsmith•1m ago•0 comments

Psychopathic female criminals show unexpected patterns of emotional processing

https://www.psypost.org/psychopathic-female-criminals-exhibit-unexpected-patterns-of-emotional-pr...
1•binning•1m ago•0 comments

A tiny tool to extract data from any website

https://superdevpro.com/web-scraper
1•mddanishyusuf•6m ago•0 comments

Maternity care is failing women on an epic scale

https://millihill.substack.com/p/maternity-care-is-failing-women-on
2•binning•6m ago•0 comments

Genetic Data from over 20k U.S. Children Misused for 'Race Science'

https://www.nytimes.com/2026/01/24/us/children-genetics-race-science.html
1•perihelions•6m ago•0 comments

Asbestos found in children's play sand sold in UK

https://www.theguardian.com/business/2026/jan/24/childrens-play-sand-hobbycraft-asbestos-removed-...
2•binning•8m ago•0 comments

What does it feel like to be an agent?

https://liamconnell.github.io/blog/2026/01/23/what-does-it-feel-like-to-be-an-agent/
2•liamconnell•8m ago•0 comments

JVIC: New web-based Commodore VIC 20 emulator

https://vic20.games/#/basic/24k
1•lance_ewing•12m ago•1 comments

Show HN: ReTraced – Job scheduler that makes retries visible as data (v1.0)

https://github.com/Anshikakalpana/ReTraced
1•Anshikakalpana•15m ago•0 comments

Thinking about memory for AI coding agents

3•hoangnnguyen•16m ago•0 comments

Show HN: DayZen: Visual day planner for ADHD brains

https://apps.apple.com/us/app/dayzen-visual-time-planner/id6754326173
17•Kavolis_•16m ago•3 comments

A terminal-based coding agent

https://shittycodingagent.ai/
3•doppp•16m ago•0 comments

Show HN: I built A GUI for managing waypoints in large-scale robot navigation

https://github.com/Yutarop/waypoints_editor
1•ponta17•19m ago•0 comments

Monster High Characters

https://monster-high-characters.com
3•jokera•20m ago•0 comments

Why Read Novels?

https://dynomight.net/novels/
4•dynm•31m ago•0 comments

Forgotten Polygons: Multimodal Large Language Models Are Shape-Blind

https://arxiv.org/abs/2502.15969
2•chbint•32m ago•0 comments

Built a library of LLM prompts for RAG

https://agentset.ai/rag-prompts
1•midamurat•34m ago•0 comments

Terence Tao: A collection of optimization problems in mathematics

https://github.com/teorth/optimizationproblems
1•zaikunzhang•34m ago•0 comments

Show HN: ResourceAI – Local LLM inference optimized for consumer iGPUs

1•Fenix46•37m ago•0 comments

Errata for the Linux Programming Interface

https://www.man7.org/tlpi/errata/index.html
1•pgalkin•37m ago•1 comments

Leaving Twitter with Shamir's Secret Sharing Scheme

https://blog.divyendusingh.com/p/leaving-twitter-with-shamirs-secret
1•divyenduz•43m ago•0 comments

Tell HN: AI is all about the tools (for now)

1•keepamovin•43m ago•0 comments

Epstein Stalker

https://chromewebstore.google.com/detail/epstein-stalker-dataset-w/imdhklfboelonbegpgkfmgcibackgdde
2•xecaz•44m ago•0 comments

Why I Fail a Lot

https://thinkering.blog/why-i-fail-a-lot/
1•nasrovsky•46m ago•0 comments

Many Small Queries Are Efficient in SQLite

https://www.sqlite.org/np1queryprob.html
4•tosh•50m ago•0 comments

Objective-S

https://objective.st/
1•tosh•54m ago•0 comments

Unrequited Passion

https://cinemasojourns.com/2026/01/23/unrequited-passion/
1•jjgreen•54m ago•0 comments

The Economics of Abundant Intelligence

https://tuananh.net/2026/01/23/the-post-agentic-world-the-economics-of-abundant-intelligence/
2•tuananh•1h ago•0 comments

Agent Skills Support in Mastra

https://github.com/mastra-ai/mastra/pull/12252
1•alaeddine-13•1h ago•0 comments

Malicious PyPI Packages Spellcheckpy and Spellcheckerpy Deliver Python Rat

https://www.aikido.dev/blog/malicious-pypi-packages-spellcheckpy-and-spellcheckerpy-deliver-pytho...
1•birdculture•1h ago•0 comments