frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Integrate Cashfree Payments in less than 7 minutes

https://tech.cashfree.com/building-cashfree-agent-skills-a-task-aware-knowledge-layer-for-ai-codi...
1•shritama_saha•1m ago•0 comments

Gitea Runner 1.0.0 is released

https://blog.gitea.com/release-of-runner-1.0.0/
2•jandeboevrie•3m ago•0 comments

Second Circuit Sidesteps "Server Test" in Embedded Video Copyright

https://natlawreview.com/article/second-circuit-sidesteps-server-test-embedded-video-copyright-ru...
1•petethomas•8m ago•0 comments

Some deaf children are hearing again because of a new gene therapy

https://www.vox.com/future-perfect/487590/gene-therapy-crispr-deafness-food-and-drug-administration
1•yanis_t•9m ago•0 comments

A game-changer for good health? Scientists believe 'we are when we eat'

https://www.theguardian.com/commentisfree/2026/may/05/game-changer-good-health-scientists-we-are-...
1•akbarnama•9m ago•0 comments

Can language models rebuild programs from scratch?

https://programbench.com
2•beau•12m ago•1 comments

American History X was a hit but ego blew my career, says director

https://www.thetimes.com/culture/film/article/tony-kaye-edward-norton-american-history-x-zbwcg7chq
1•petethomas•14m ago•0 comments

The guide to RL environments: building and scaling them in the LLM era

https://huggingface.co/spaces/AdithyaSK/rl-environments-guide
2•babelfish•21m ago•0 comments

Show HN: Cargo-affect – Plan affected Rust workspace tests from a Git diff

https://github.com/lightsofapollo/cargo-affect
1•lightsofapollo•22m ago•0 comments

Pennsylvania sues Character.AI over claims chatbot posed as doctor

https://www.npr.org/2026/05/05/nx-s1-5812861/characterai-chatbot-medical-advice-pennsylvania-lawsuit
2•geox•28m ago•0 comments

Math Behind "AI Will Replace Engineers" Is Embarrassingly Wrong

https://www.youtube.com/watch?v=ItSLny8on5I
2•amcorvi•30m ago•0 comments

ReMarkable Paper Pure with Marker

https://www.aucklanddutyfree.co.nz/remarkable-paper-pure-with-marker.html
1•hboon•31m ago•0 comments

Carbon pollution is making food less nutritious and risking health of billions

https://www.washingtonpost.com/climate-environment/interactive/2026/carbon-pollution-diluting-key...
2•JeanKage•32m ago•0 comments

Apple settles lawsuit admitting Apple Intelligence isn't here yet

https://www.reuters.com/legal/litigation/apple-settles-lawsuit-over-late-siri-ai-features-250-mil...
4•oofbey•36m ago•0 comments

Show HN: I Built a Retro Survival RPG in Vanilla JavaScript

2•jasonkester•36m ago•0 comments

Behavior-Oriented Concurrency for Python

https://microsoft.github.io/bocpy/
3•mpweiher•37m ago•0 comments

Attackers are cashing in on fresh 'CopyFail' Linux flaw

https://www.theregister.com/2026/05/05/cisa_sounds_the_alarm_on/
3•pjmlp•48m ago•0 comments

The Schneider float32 byte order isn't in the spec, and other Modbus surprises

https://gist.github.com/PhilYeh1212/227a35b12236f1f37dfea0405b12be2b
2•PhilYeh75•50m ago•0 comments

Generative UI Browser

https://github.com/manupareekk/generative-ui-browser
2•manupareek•53m ago•0 comments

You built the product. Now test if people want it

https://www.viral.ad/
3•cjdesignstudio•56m ago•0 comments

Knitting Bullshit

https://katedaviesdesigns.com/2026/04/29/knitting-bullshit/
3•ColinEberhardt•58m ago•0 comments

Why LLM APIs Shouldn't Ship UTF-8", "Stop Wasting Bandwidth on LLM Text APIs

https://github.com/wdunn001/codec
3•Zombwaffle•59m ago•1 comments

CLI2API: Turn Your Claude Subscription into an OpenAI-Compatible API

https://github.com/zhusq20/CLI2API
1•zsqzz•59m ago•0 comments

An ode to swapping computer warez in the 80s

https://suno.com/song/79267d82-c558-4981-a7e5-3b0a7939dea9
1•JPolka•1h ago•2 comments

Software Development Job Postings on Indeed in the United States

https://fred.stlouisfed.org/series/IHLIDXUSTPSOFTDEVE
1•stefap2•1h ago•0 comments

Pyramid of the Capitalist System (1911)

https://publicdomainreview.org/collection/pyramid-of-the-capitalist-system/
3•prismatic•1h ago•0 comments

Show HN: App that marks each hour of your day as yours or lost to your phone

https://apps.apple.com/us/app/oh-my-hours/id6760450002
1•yarsanich•1h ago•0 comments

OpenAI favors more flexible datacenter deals

https://www.tomshardware.com/tech-industry/artificial-intelligence/openai-has-effectively-abandon...
1•gmays•1h ago•0 comments

Detection of an atmosphere on a trans-Neptunian object beyond Pluto

https://www.nature.com/articles/s41550-026-02846-1
1•tobr•1h ago•0 comments

Verification Test – Please Ignore

1•zhoykn•1h ago•0 comments