frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•5mo ago

Comments

tocs3•5mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Interesting SPI Routing with iCE40 FPGAs

https://danielmangum.com/posts/spi-routing-ice40-fpga/
1•hasheddan•27s ago•0 comments

Debunking "When Prophecy Fails"

https://onlinelibrary.wiley.com/doi/abs/10.1002/jhbs.70043
1•sipofwater•47s ago•0 comments

How to declutter, quiet down, and take the AI out of Windows 11 25H2

https://arstechnica.com/gadgets/2025/11/what-i-do-to-clean-up-a-clean-install-of-windows-11-23h2-...
1•breve•1m ago•0 comments

5 years from now, 95% of the tokens will be on tasks humans never did before

https://twitter.com/levie/status/1986620885592113218
1•bilsbie•1m ago•0 comments

Harmony – Generate 3D CAD models from text prompts

https://harmonycad.dev
1•HamzahSKhan•2m ago•1 comments

Ask HN: Ideas to Reduce Healthcare Cost?

2•mountaineer98•4m ago•0 comments

Go Board vs. Go Stone

https://gafferongames.com/post/go_stone_vs_go_board/
1•andsoitis•5m ago•0 comments

Show HN: Extending LLM SVG generation beyond pelicans and bicycles

https://gally.net/temp/20251107pelican-alternatives/index.html
1•tkgally•6m ago•0 comments

I don't test different designs at the same time

https://adamsilver.io/blog/why-i-dont-test-different-designs-at-the-same-time/
1•todsacerdoti•7m ago•0 comments

'You're just ready:' Parents say ChatGPT encouraged son to kill himself

https://www.cnn.com/2025/11/06/us/openai-chatgpt-uicide-lawsuit-invs-vis
2•nh43215rgb•7m ago•0 comments

Show HN: Trendlists Digest of AI, App and Tech Topics

https://trendscout.com/?r=index
1•Luuucas•8m ago•1 comments

Acquired: The Steve Ballmer Interview (Shortcast)

https://shortcast.me/8HbjUd37Gsrv5fOkkb3d
1•rokgregoric•8m ago•0 comments

Training Junior Engineers

https://natashajaffe.substack.com/p/training-junior-engineers
1•mooreds•12m ago•0 comments

Doing It Manually

https://blog.jim-nielsen.com/2025/doing-in-manually/
1•mooreds•13m ago•0 comments

Liturgical Arts Journal

https://www.liturgicalartsjournal.com/
1•danielam•13m ago•0 comments

Providers

https://adactio.com/journal/22235
1•mooreds•14m ago•0 comments

Ask HN: Is AI useful for self generating content?

1•mountaineer98•15m ago•0 comments

LongCat-Flash-Omni: Open-Source Omni-Modal AI Model

https://longcatflashomni.net
1•AI_kid1412•17m ago•0 comments

BindWeave: Subject-Consistent Video Generation Platform

https://bindweave.video
1•AI_kid1412•17m ago•0 comments

Amazon WorkSpaces Linux Flaw Lets Attackers Steal Tokens

https://www.jphfeeds.top/2025/11/amazon-workspaces-linux-flaw-lets.html
1•FIGYJ•20m ago•0 comments

SanDisk launches dongle-like Extreme Fit USB-C flash drive with up to 1 TB

https://www.notebookcheck.net/Sandisk-launches-dongle-like-Extreme-Fit-USB-C-flash-drive-with-up-...
2•teleforce•23m ago•0 comments

Build a ClojureScript native desktop app in 5 minutes [video]

https://www.youtube.com/watch?v=uEVo8rqJgyw
2•Borkdude•29m ago•0 comments

GPS reveals that football practices can be up to 40% more demanding than games

https://medicalxpress.com/news/2025-10-gps-technology-reveals-football-demanding.html
1•PaulHoule•31m ago•0 comments

Ready for Post-Quantum Cryptography TLS?

https://qcready.com
1•weddpros•31m ago•0 comments

A simple commitment, on how to dogfood your design practice

https://perte.io/blog/a-simple-commitment
3•perteraul•32m ago•0 comments

Cognitive warfare: the new battlefield exploiting our brains

https://www.polytechnique-insights.com/en/columns/geopolitics/cognitive-warfare-the-new-battlefie...
2•miltava•32m ago•0 comments

Show HN: I made a better DOM morphing algorithm

https://joel.drapper.me/p/morphlex/
3•joeldrapper•32m ago•0 comments

Garbage Collection Is a Hack

https://blog.adamant-lang.org/2018/garbage-collection-is-a-hack/
1•mattrighetti•32m ago•0 comments

Show HN: OSS implementation of Test Time Diffusion that runs on a 24gb GPU

https://github.com/eamag/MMU-RAG-competition
1•eamag•33m ago•0 comments

External Secrets Operator is now GA with version v1.0.0

https://github.com/external-secrets/external-secrets/releases/tag/v1.0.0
1•skarlso•33m ago•1 comments