The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
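
As a rough illustration of what the summary calls "clean, correct step-by-step reasoning paths", here is a minimal sketch of A* on a small grid that logs a trace alongside the final plan. The grid, the function name `astar_with_trace`, and the `create`/`close` event format are assumptions made for illustration; they are not taken from the paper's actual setup.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace records
    each 'create' (node pushed onto the open list) and 'close' (node
    expanded) event with its cost-so-far g and heuristic value h.
    This event format is a hypothetical one chosen for the example.
    """
    def h(p):
        # Manhattan distance to the goal (admissible on a unit-cost grid).
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, node = heapq.heappop(open_heap)
        if node in closed:
            continue  # stale heap entry, already expanded with a better cost
        closed.add(node)
        trace.append(("close", node, g, h(node)))
        if node == goal:
            # Walk parent pointers back to the start to recover the plan.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#" or nxt in closed:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = node
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable


grid = ["..#",
        "..#",
        "..."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
```

Under these assumptions, a "correct" training example would pair a grid with its own `trace` and `path`, while the "random" variant described in the summary would pair the grid with a trace generated from some unrelated problem. Because the trace is produced by a known procedure, both the final plan and each intermediate step can be checked mechanically.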
