The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
