
The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
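To make the setup in the summary concrete, here is a minimal sketch of what "a reasoning path and a final answer from A* search" could look like, and how a trace can be checked separately from the answer. The grid encoding, the `("create" | "close", cell)` event format, and the function names `a_star_with_trace` and `trace_is_consistent` are all assumptions for illustration, not the paper's actual data format or code.

```python
import heapq


def a_star_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace is a list of
    ("create" | "close", (row, col)) events; this format is hypothetical.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = [("create", start)]
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            if nxt not in cost or g + 1 < cost[nxt]:
                cost[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt))
    return None, trace  # goal unreachable


def trace_is_consistent(trace):
    """Check one simple property: a cell is only closed after it was created."""
    created = set()
    for op, cell in trace:
        if op == "create":
            created.add(cell)
        elif cell not in created:
            return False
    return True


grid = ["..#",
        ".#.",
        "..."]
path, trace = a_star_with_trace(grid, (0, 0), (2, 2))
print(path)
print(trace_is_consistent(trace), trace_is_consistent(trace[::-1]))
```

The point of the sketch is only that the answer (`path`) and the intermediate steps (`trace`) can be validated independently, so an answer can be correct while the steps shown alongside it fail the check. Reversing the trace here stands in for a corrupted one; the paper's own corruption procedure may differ.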

Timedb: Open-Source Database for Timeseries

https://github.com/rebase-energy/timedb
1•davide_rebase•3m ago•1 comments

Mastodon Engagement Viewer

https://www.leeholmes.com/projects/mastodon-engagement-viewer/?url=https%3A%2F%2Finfosec.exchange...
1•latexr•3m ago•0 comments

How I made a shooter game in 64 KB [video]

https://www.youtube.com/watch?v=qht68vFaa1M
1•freetonik•4m ago•0 comments

I Built an AI Agent That Trades Crypto on a Mac Mini for $2/Month

https://jdbot54.substack.com/p/i-built-an-ai-agent-that-trades-crypto
1•JohncheckMunich•5m ago•0 comments

Show HN: Vim-Claude-code – Use Claude directly inside Vim

https://github.com/rishi-opensource/vim-claude-code
1•rishi-sharma•6m ago•0 comments

Show HN: An RPG in the Windows file explorer

https://store.steampowered.com/app/3333010/Directory_Dungeon__File_Explorer_Dungeon_Crawler/
1•juhrjuhr•8m ago•0 comments

Show HN: Markdown in the Middle – proxy to convert HTML to Markdown

https://github.com/rickcrawford/markdowninthemiddle
1•crawdog•9m ago•0 comments

MasterAI RankWriter

https://wordpress.org/plugins/masterai-rankwriter/
1•producttube•9m ago•1 comments

Alternative with Do

1•jameshuntdo•12m ago•0 comments

Spotify Update on Developer Access and Platform Security

https://developer.spotify.com/blog/2026-02-06-update-on-developer-access-and-platform-security
1•hazzamanic•12m ago•1 comments

My Skill Makes Claude Code Great at TDD

https://www.aihero.dev/skill-test-driven-development-claude-code
1•saikatsg•13m ago•0 comments

It Was the Best of Times, etc.

https://www.robinsloan.com/lab/worst-or-best/
1•MindGods•13m ago•0 comments

Mojo's Take on Metaprogramming

https://mzaks.medium.com/when-magic-becomes-explicit-mojos-take-on-metaprogramming-61bcfafc5145
2•tosh•17m ago•0 comments

TeleCMI – Best cloud communication platform

1•sundarmsp•17m ago•0 comments

Decimal-Java is a library to convert java.math.BigDecimal to and from IEEE-754r

https://github.com/FirebirdSQL/decimal-java
3•mariuz•18m ago•0 comments

Pinhead – Quality public domain icons for your map pins

https://pinhead.ink/
1•altilunium•19m ago•0 comments

GyroidOS Virtualization Solution

https://www.cnx-software.com/2026/02/24/gyroidos-virtualization-solution-aims-to-secure-embedded-...
1•No_CQRT•24m ago•1 comments

Show HN: Autonomous AI Agent Fleets

https://www.openlegion.ai/
4•benriazy•25m ago•3 comments

Feedback wanted: monorepos, getting started and "week 1" problems, complexity

https://github.com/renovatebot/renovate/discussions/41414
1•mkesper•25m ago•0 comments

LLM and MCP: A simple introduction to the brain and hands of modern AI

https://teotti.com/llm-and-mcp-a-primer/
1•agenteo•27m ago•1 comments

An Interactive Intro to Quadtrees

https://growingswe.com/blog/quadtrees
3•growingswe•28m ago•0 comments

Show HN: Built an AI tool that routes tasks to agents, humans. Am I crazy?

1•rhelm-ai•29m ago•0 comments

Be My Baby

https://en.wikipedia.org/wiki/Be_My_Baby
1•handfuloflight•31m ago•1 comments

Show HN: AI Jam Sessions – MCP server that teaches AI to practice piano

https://github.com/mcp-tool-shop-org/ai-jam-sessions
1•mikeyfrilot•31m ago•0 comments

I want to get acquired by openrouter. For my OpenClaw alternative

2•alwassikhan•31m ago•1 comments

Show HN: ForceBreak – A Break Reminder with Friction

https://apps.apple.com/cn/app/forcebreak/id6758971359?mt=12
1•glidea•33m ago•0 comments

Agentic swarms are an org-chart delusion

https://www.joanwestenberg.com/agentic-swarms-are-an-org-chart-delusion/
1•MindGods•34m ago•0 comments

The Prime Prompt

https://suthakamal.substack.com/p/the-prime-prompt
1•suthakamal•36m ago•1 comments

I do a podcast, I don't talk in it

https://www.streaming-radar.com/p/i-do-a-podcast-i-dont-talk-in-it
1•lbostral•38m ago•1 comments

I Tried to Build the Smallest WASM Website on the Internet

https://github.com/tyler-harpool/1kb
1•tdhz77•41m ago•0 comments