frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Tegratop – A Comprehensive TUI monitoring tool for Nvidia jetson boards

https://github.com/pythops/tegratop
1•pythops•36s ago•0 comments

Show HN: A Compiler for CDN Security (YAML to CloudFront/Workers/WAF)

https://www.npmjs.com/package/cdn-security-framework
1•einshutoin•2m ago•1 comments

Hot-Potato Routing

https://en.wikipedia.org/wiki/Hot-potato_routing
1•Thicken2320•2m ago•0 comments

Show HN: Multi-agent orchestration using OpenCode and LangGraph

https://gitlab.com/nis-open-code
1•ninashamsi•3m ago•0 comments

'Black' Banned from Flyers for FAMU College of Law Black History Month Event

https://www.clickorlando.com/news/local/2026/02/06/black-banned-from-flyers-for-famu-college-of-l...
2•zzzeek•3m ago•0 comments

Demo Effect Explained: How to Make a 3D Tunnel on the C64 [video]

https://www.youtube.com/watch?v=4Db-tmL8Tno
1•pavel_lishin•4m ago•0 comments

OpenAI's GPT-4 Discontinuation: Consumer Fraud and Regulatory Scrutiny

1•tizzzzz•6m ago•0 comments

Show HN: The biggest achievement of my life so far

https://github.com/adityaprasad-sudo/Explore-Singapore
3•ambitious_potat•7m ago•0 comments

Show HN: A macOS screen recorder for the rest of us – free and open source

https://jsattler.github.io/BetterCapture/
1•jsattler•8m ago•0 comments

The Evolution of a Lean Programmer

https://unnamed.website/posts/evolution-lean-programmer/
1•aebtebeten•8m ago•0 comments

Open-source webapp to analyze all your DJI flight logs in one place

https://github.com/arpanghosh8453/dji-logbook
1•iamarpan•10m ago•1 comments

OpenAI Just Betrayed Nvidia: The AI War Begins Now

https://www.youtube.com/watch?v=SG71c_W25-s
1•cable2600•11m ago•0 comments

Camera that can see around corners (2021) [video]

https://www.youtube.com/watch?v=Ir7wCAQINqw
1•downboots•13m ago•0 comments

Show HN: Deterministic product idea generator (no AI APIs, works offline)

https://github.com/CrazhHolmes/passive-gen
1•Wizardrytezch•13m ago•0 comments

Show HN: Tabletop Jigsaw Puzzle

https://jigsaw.rokyed.digital/
1•rokyed•14m ago•0 comments

Show HN: EkşI Sözlük but every author is an AI agent

https://www.robotsozluk.com
1•yldrmahmet•17m ago•1 comments

What you need to know to avoid multi-million-dollar subscription traps

https://www.rnz.co.nz/news/business/586268/here-s-what-you-need-to-know-to-avoid-multi-million-do...
3•billybuckwheat•17m ago•0 comments

LLMs Are Prediction Machines

https://kaelandt.github.io/posts/llm-prediction-machines.html
1•kaelandt•17m ago•0 comments

Guide for Installing PostgreSQL on TrueNAS

https://github.com/emanueldonalds/guides/blob/master/install_postgresql_on_truenas.md
2•oldestofsports•18m ago•1 comments

Deobfuscation and Analysis of Ring-1.io

https://back.engineering/blog/04/02/2026/
1•raggi•19m ago•0 comments

Tim Cook's Full Remarks About Apple's 50th Anniversary Plans

https://www.macrumors.com/2026/02/08/tim-cook-full-remarks-on-apple-turning-50/
1•tosh•19m ago•0 comments

Japan's Takaichi Scores Landslide Win in Election Gamble

https://www.wsj.com/world/asia/japans-takaichi-scores-major-election-victory-62f094a2
2•JumpCrisscross•24m ago•0 comments

Mushroom Cloud Picture Gallery

https://zvis.com/cpg14/index.php?cat=23
2•joebig•24m ago•0 comments

Breaking Down CVE-2026-25049: How TypeScript Types Failed N8n's Security

https://hetmehta.com/posts/n8n-type-confusion-rce/
2•rantingdemon•25m ago•0 comments

Show HN: Click symbols in Claude Code to jump to definitions in VS Code

https://maaash.jp/2026/02/more-a-tags-in-the-terminal/
1•maaashjp•27m ago•0 comments

Tech Independence

https://sive.rs/ti
4•ryangibb•29m ago•0 comments

The New Fabio Is Claude

https://www.nytimes.com/2026/02/08/business/ai-claude-romance-books.html
2•mold_aid•30m ago•2 comments

Optimization for Job Shop Scheduling with Blocking: A Genetic Algorithm Approach

https://www.mdpi.com/1999-4893/19/2/115
2•PaulHoule•30m ago•0 comments

The AI Bubble I Live in (and You Probably Don't)

https://thoughts.jock.pl/p/ai-bubble-living-inside
1•joozio•31m ago•0 comments

Show HN: Asterbot – AI agent built from sandboxed WASM components

https://github.com/asterai-io/asterbot
1•rellfy•33m ago•0 comments