The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
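To make the setup described in the summary a bit more concrete, here is a rough sketch of how an A* search can produce both a final plan and a trace of intermediate steps, and how the two can be checked separately. Everything specific here is an assumption for illustration: the tiny grid mazes, the `create`/`close` step labels, and the function names are hypothetical and not taken from the paper. Borrowing the trace from a different maze is just one simple way to get an "unrelated" trace.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid ('#' marks a wall).

    Returns (plan, trace). The trace is a list of
    (op, cell, cost_so_far, heuristic) tuples, standing in for the
    intermediate tokens a model would be trained on (assumed format).
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit step costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return plan[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if inside and grid[nr][nc] != "#":
                ng = g + 1
                if ng < best_g.get((nr, nc), float("inf")):
                    best_g[(nr, nc)] = ng
                    parent[(nr, nc)] = cell
                    heapq.heappush(open_heap, (ng + h((nr, nc)), ng, (nr, nc)))
                    trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return None, trace


def plan_is_valid(grid, start, goal, plan):
    """Check the final answer only: a wall-free path of unit steps."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    if any(grid[r][c] == "#" for r, c in plan):
        return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(plan, plan[1:]))


def trace_fits_problem(grid, trace):
    """Weak check of the intermediate steps only: every traced cell is a
    free cell of this grid, and every 'close' follows a matching 'create'."""
    created = set()
    for op, (r, c), g, _ in trace:
        inside = 0 <= r < len(grid) and 0 <= c < len(grid[0])
        if not inside or grid[r][c] == "#":
            return False
        if op == "create":
            created.add(((r, c), g))
        elif ((r, c), g) not in created:
            return False
    return True


# Two hypothetical 3x3 mazes sharing the same start and goal.
grid_a = ["..#",
          "...",
          "#.."]
grid_b = ["...",
          ".#.",
          "..."]
start, goal = (0, 0), (2, 2)

plan_a, trace_a = astar_with_trace(grid_a, start, goal)
_, trace_b = astar_with_trace(grid_b, start, goal)

# Pair the correct plan for maze A with a trace borrowed from maze B.
print("plan valid:", plan_is_valid(grid_a, start, goal, plan_a))
print("own trace fits:", trace_fits_problem(grid_a, trace_a))
print("borrowed trace fits:", trace_fits_problem(grid_a, trace_b))
```

The point of keeping `plan_is_valid` and `trace_fits_problem` apart is that the final answer and the intermediate steps can be scored independently, which is what allows a correct answer to sit next to a trace that does not describe the problem at all.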
