The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
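For readers who want something concrete, here is a minimal sketch of what "reasoning steps based on A* search" could look like: a small A* solver on a grid that logs every node it creates or closes, alongside the final path. The grid encoding, the function name `astar_with_trace`, and the `(operation, cell, g, h)` trace tuples are all assumptions made for illustration, not the paper's actual data format or training setup.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid  : list of equal-length strings, '#' marks a wall (hypothetical encoding)
    trace : list of (operation, cell, g, h) tuples, a hypothetical stand-in
            for the "intermediate tokens" a model could be trained on
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]  # heap entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk the parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue
            if grid[nr][nc] == "#":
                continue
            nxt, ng = (nr, nc), g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace  # goal unreachable


if __name__ == "__main__":
    path, trace = astar_with_trace(["..#", "..#", "..."], (0, 0), (2, 2))
    print("answer:", path)
    for step in trace:
        print("step:", step)
```

In this framing the `trace` list plays the role of the step-by-step "thoughts" and `path` plays the role of the final answer, so the two can be checked separately: a path can be valid and optimal even if the trace attached to it is corrupted or was taken from a different problem.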

Nvidia Introduces First PCs Designed for AI Agents

https://www.wsj.com/tech/ai/nvidia-introduces-first-pcs-designed-for-ai-agents-47445bcd
1•fortran77•1m ago•1 comments

PS1 Forge – Zsh/Bash, EzPrompt blocks, Light/Dark mode and local persistence

https://ps1-forge.vercel.app/
1•speckx•1m ago•0 comments

Linux Basics for Hackers

https://github.com/ahegazy0/linux-basics-for-hackers-notes
1•ibobev•1m ago•0 comments

Pinyin

https://en.wikipedia.org/wiki/Pinyin
1•tosh•2m ago•0 comments

How to add a passkey prompt in your application with FusionAuth

https://fusionauth.io/community/forum/topic/3098/wanted-to-add-a-passkey-prompt-in-my-application
1•mooreds•3m ago•0 comments

Stop Killing Games

https://jxself.org/stop-killing-games.shtml
2•amcclure•3m ago•0 comments

Surface Laptop Ultra

https://www.microsoft.com/en-us/surface/devices/surface-laptop-ultra
2•fumar•4m ago•0 comments

Two-player networked Tetris with a twist

https://github.com/bcantrill/BattleTris
1•mooreds•4m ago•0 comments

DeepSeek-V4-Flash (284B params) running on a Raspberry Pi 5 8GB

https://twitter.com/danveloper/status/2061435541199994890
2•m-hodges•7m ago•1 comments

Show HN: AI Agents Need Inspectable State. That's Why I Built LangMCP

https://medium.com/towards-artificial-intelligence/ai-agents-need-inspectable-state-thats-why-i-b...
1•muhammad-shafat•9m ago•0 comments

Announcing Zstandard in Rust

https://trifectatech.org/blog/announcing-zstandard-in-rust/
1•jmillikin•9m ago•0 comments

Show HN: Easy ChartFlow, Free 2D and 3D chart maker inside Chrome side panel

https://chromewebstore.google.com/detail/easy-chartflow/jfcbhlkbkacaeihjlidngmpeehgllpog
1•Shaxpartan•10m ago•1 comments

Daily pill daraxonrasib doubles survival time for pancreatic cancer patients

https://www.bbc.com/news/articles/cy82l435171o
1•olalonde•11m ago•0 comments

Bernie Sanders: The Public Should Own Half of the Big A.I. Companies

https://www.nytimes.com/2026/06/01/opinion/artificial-intelligence-bernie-sanders.html
1•timmg•11m ago•0 comments

Autonomous capabilities audit of a hotel voice AI assistant

https://ktoyame.substack.com/p/autonomous-security-audit-of-a-hotel
2•ktoyame•14m ago•0 comments

Memgraph on Arm

https://learn.arm.com/install-guides/memgraph-on-arm
2•taubek•15m ago•0 comments

Launch HN: Expanse (YC P26) – Unlock Wasted GPU Capacity

2•ismaeel_bashir•15m ago•0 comments

Show HN: Built a browser game inspired by Rust

https://github.com/jmtame/scrapland
1•jmtame•20m ago•0 comments

Generating OG Images in Elixir

https://jola.dev/posts/generating-og-images
3•shintoist•21m ago•0 comments

The Sandbox Shift – sandboxes are the new containers, for AI-written code

https://zozo123.github.io/sandboxes-why-how-when/
2•zozo123-IB•21m ago•0 comments

Satellite images suggest Iran's strikes more extensive than US acknowledged

https://www.bbc.com/news/articles/c2l2yl7r8r2o
3•tcp_handshaker•22m ago•0 comments

China approves invasive brain-computer chip

https://www.technologyreview.com/2026/06/01/1138133/china-world-first-brain-chip/
2•rippeltippel•24m ago•0 comments

ik_llama.cpp – llama.cpp fork with better CPU performance

https://github.com/ikawrakow/ik_llama.cpp
2•peter_d_sherman•24m ago•0 comments

Game Boy Port of Snake in Assembly

https://www.4rknova.com//blog/2026/02/01/gb-snake
1•ibobev•25m ago•0 comments

ZX Spectrum System Tour: Text Mode

https://bumbershootsoft.wordpress.com/2026/05/30/zx-spectrum-system-tour-text-mode/
2•ibobev•25m ago•0 comments

Monotonic Collections: middle ground between immutable and mutable (2025)

https://neilmadden.blog/2025/11/11/monotonic-collections-a-middle-ground-between-immutable-and-fu...
1•mooreds•26m ago•0 comments

Microsoft says it will not pursue security researchers after zero-day backlash

https://therecord.media/microsoft-says-it-will-not-pursue-security-researchers-disclosure
2•tcp_handshaker•26m ago•0 comments

Salesforce Signs Definitive Agreement to Acquire Contentful

https://www.salesforce.com/news/stories/salesforce-signs-definitive-agreement-to-acquire-contentf...
1•saos•26m ago•0 comments

Remembering Dotcom, Pondering LLMs

https://www.datagubbe.se/dhabi/
2•ibobev•26m ago•0 comments

Show HN: Zatron – Encrypted semantic search, 98% quality, 8x faster than FHE

https://github.com/zahraarmantech/ZATRON
1•zahraarman•27m ago•0 comments