The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
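For readers unfamiliar with what a "reasoning path" from A* search might look like, here is a minimal sketch. It is not the paper's code: the grid, the `astar_with_trace` function name, and the `("expand", cell, g, h)` trace format are all assumptions made for illustration. It only shows that a solver like A* yields both a final answer (the path) and a checkable sequence of intermediate steps (the trace), which is the kind of pairing the summary above describes.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid with unit step costs.

    grid is a list of strings where '#' marks a wall.
    Returns (path, trace). The trace format is hypothetical: one
    ("expand", cell, g, h) tuple per expanded cell.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # heap entries are (f, g, cell)
    came_from = {start: None}
    best_g = {start: 0}
    trace = []

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if g > best_g[cell]:
            continue  # stale heap entry, a cheaper route was found later
        trace.append(("expand", cell, g, h(cell)))
        if cell == goal:
            # Walk parent links back to the start to rebuild the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = came_from[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if inside and grid[nr][nc] != "#":
                ng = g + 1
                if ng < best_g.get(nxt, float("inf")):
                    best_g[nxt] = ng
                    came_from[nxt] = cell
                    heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
    return None, trace  # goal unreachable


# Illustrative 3x3 maze (an assumption, not taken from the paper).
grid = ["..#",
        "...",
        "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
for step in trace:
    print(step)
print("path:", path)
```

With a solver like this, a model's final path and its emitted intermediate steps could each be checked separately against the solver's output, which is roughly the comparison the summary refers to.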

Another AI slop story: ChatGPT vs. Human

https://joshua.hu/ai-slop-story-nginx-leaking-dns-chatgpt
1•detaro•12s ago•0 comments

Show HN: Built an all-in-one feedback board for SaaS apps – Ship features

2•rahulbstomar•1m ago•1 comments

What if our ancestors didn't feel pain the way we do

https://www.theatlantic.com/magazine/2026/01/human-ancestors-emotion-history/684959/
1•HR01•2m ago•0 comments

Dependent Names with a Little Encouragement

https://consteval.ca/2025/09/27/dependent-names/
1•HeliumHydride•3m ago•0 comments

HTML as an Accessible Format for Papers

https://info.arxiv.org/about/accessible_HTML.html
3•el3ctron•7m ago•1 comments

AoCO 2025: Division

https://xania.org/202512/06-dividing-to-conquer
1•HeliumHydride•11m ago•0 comments

Carlo is no longer maintained

https://github.com/GoogleChromeLabs/carlo
2•keepamovin•12m ago•0 comments

Nemawashi: A Powerful, Forgotten Japanese Innovation and Change Management Tool

https://www.changebase.app/blog/nemawashi-japanese-change-management-tool
2•mooreds•13m ago•0 comments

A 2kb library that makes WebWorkers enjoyable

https://github.com/GoogleChromeLabs/comlink
2•keepamovin•13m ago•0 comments

Make images smaller using best-in-class codecs, right in the browser

https://github.com/GoogleChromeLabs/squoosh
1•keepamovin•14m ago•0 comments

Measuring AI impact like it's 1995

https://www.swarmia.com/blog/measuring-ai-impact-like-1995/
1•mooreds•15m ago•0 comments

Messy Crappy Art for the Win

https://pammoore.substack.com/p/messy-crappy-art-for-the-win
1•mooreds•17m ago•0 comments

The Traveling Salesperson Problem (Modernized)

https://github.com/norvig/pytudes/blob/main/ipynb/TSP.ipynb
1•vismit2000•17m ago•0 comments

Show HN: Tududi v0.87 – A self-hosted life OS: areas, projects, tasks, notes

https://tududi.com
1•cvicpp123•18m ago•0 comments

Drones to Diplomas: How Russia's Largest Private University Is Linked to a $25M

https://krebsonsecurity.com/2025/12/drones-to-diplomas-how-russias-largest-private-university-is-...
1•todsacerdoti•21m ago•0 comments

Web Performance Advent Calendar 2025 – 17th edition

https://calendar.perfplanet.com/2025/
1•vismit2000•25m ago•0 comments

Kidney Recipient Dies After Transplant from Organ Donor Who Had Rabies

https://www.nytimes.com/2025/12/06/health/rabies-death-skunk-kidney-transplant.html
5•quapster•26m ago•6 comments

Manage risk with drawdown, not hope

https://pyquantnews.substack.com/p/size-positions-by-drawdown-not-hope
1•strimp099•28m ago•0 comments

Decades-old study on common weed killer retracted

https://www.cbc.ca/news/health/glyphosate-retraction-9.7004363
11•geox•28m ago•2 comments

Ceremonial Bugle

https://ceremonialbugle.com/
1•mhb•35m ago•0 comments

Show HN: WebGPU back end for PyTorch sneak peek

https://github.com/jmaczan/torch-webgpu
3•yu3zhou4•39m ago•0 comments

What would someone like me do with a tiny modular synth? [pdf]

https://www.musicthing.co.uk/collateral/WhatWouldSomeoneLikeMeDoWithATinyModularSynth_book.pdf
1•mhb•40m ago•0 comments

Kernel Float: Unlocking Mixed-Precision GPU Programming

https://dl.acm.org/doi/pdf/10.1145/3779120
3•gpuhacker•41m ago•0 comments

'Godfather of AI' Geoffrey Hinton says Google is 'beginning to overtake' OpenAI

https://www.businessinsider.com/ai-godfather-geoffrey-hinton-google-overtaking-openai-2025-12
4•ashishgupta2209•42m ago•1 comments

Resolution Dynamics: Deriving the Fine Structure Constant from Shannon Capacity

https://zenodo.org/records/17821936
8•Jascon71•42m ago•1 comments

Show HN: Seq2Seq ML Learns the Inverse of Manual Multiplication (Gelosia Method)

https://gitlab.com/9o1d/gelosia
1•9o1d•44m ago•0 comments

A Class of Models with the Potential to Represent Fundamental Physics

https://www.wolframphysics.org/technical-introduction/
2•frizlab•44m ago•0 comments

Iran files charges against organizers after women runners flout hijab law

https://www.timesofisrael.com/iran-files-charges-against-marathon-organizers-after-women-runners-...
1•mhb•46m ago•0 comments

Why "all-in-one" productivity tools confuse new users

9•suffei771•47m ago•3 comments

Check out these YouTube Slides generated by VeedoAI

https://veedo.ai/yt-slides/s/0VKmpTYVWV4FXBtm4
2•shortpodo•47m ago•0 comments