
The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
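To make the "check both the final answers and the reasoning steps" idea concrete, here is a rough sketch, not the paper's code. It runs A* on a tiny grid, records a trace of intermediate steps, and validates the final path and the trace separately. The `create`/`close` trace format, the function names, and the deliberately loose trace check are all assumptions made for illustration.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of hypothetical
    ("create" | "close", node, g, h) records, an assumed format.
    """
    def h(n):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(n[0] - goal[0]) + abs(n[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    g_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, node = heapq.heappop(open_heap)
        if node in closed:
            continue  # stale heap entry
        closed.add(node)
        trace.append(("close", node, g, h(node)))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            in_bounds = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if in_bounds and grid[nr][nc] == 0:
                ng = g + 1
                if ng < g_cost.get(nxt, float("inf")):
                    g_cost[nxt] = ng
                    parent[nxt] = node
                    heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                    trace.append(("create", nxt, ng, h(nxt)))
    return None, trace


def path_is_valid(grid, path, start, goal):
    """Check the final answer only: endpoints match, steps are adjacent and free."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] != 0:
            return False
    return True


def trace_is_valid(trace):
    """Loose check on the steps only: a node must be created before it is closed."""
    created = set()
    for op, node, _g, _h in trace:
        if op == "create":
            created.add(node)
        elif node not in created:
            return False
    return True


grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]
path, trace = astar_with_trace(grid, (0, 0), (2, 0))
scrambled = trace[::-1]  # same records, order no longer corresponds to a real search
print(path_is_valid(grid, path, (0, 0), (2, 0)), trace_is_valid(trace), trace_is_valid(scrambled))
```

The point of keeping the two checks separate is that a correct path can sit next to a trace that fails validation, which is the kind of mismatch the summary describes.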
