The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
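For anyone who wants to picture the setup the summary describes, here is a rough sketch, not the paper's actual code. It runs A* on a tiny grid maze and records each search step as a trace, then builds one training example with the clean trace and one with a corrupted trace attached to the same correct answer. The grid, the `create`/`close` token format, the helper names, and the use of shuffling as a stand-in for "random and incorrect reasoning steps" are all assumptions made for illustration.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace), where trace is a list of strings logging
    each node the search creates or closes. Token format is hypothetical.
    """
    def h(cell):
        # Manhattan distance: admissible on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]
    parent = {start: None}
    cost = {start: 0}
    trace = []
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if g > cost[cell]:
            continue  # stale queue entry, a cheaper route was found later
        trace.append(f"close {cell[0]} {cell[1]} g={g}")
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if inside and grid[nr][nc] == 0:
                ng = g + 1
                if ng < cost.get((nr, nc), float("inf")):
                    cost[(nr, nc)] = ng
                    parent[(nr, nc)] = cell
                    trace.append(f"create {nr} {nc} g={ng}")
                    heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
    return None, trace


def make_example(prompt, trace, path):
    """Join prompt, intermediate tokens, and final answer into one string."""
    plan = " ".join(f"{r},{c}" for r, c in path)
    return f"{prompt} <trace> {' | '.join(trace)} </trace> <plan> {plan} </plan>"


# Hypothetical 3x3 maze with a wall across the middle row.
GRID = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
START, GOAL = (0, 0), (2, 0)

path, clean_trace = astar_with_trace(GRID, START, GOAL)

# Stand-in corruption: same tokens, scrambled order, so the "reasoning"
# no longer describes a valid search even though the answer is unchanged.
noisy_trace = clean_trace[:]
random.Random(0).shuffle(noisy_trace)

prompt = f"start {START} goal {GOAL}"
clean_example = make_example(prompt, clean_trace, path)
noisy_example = make_example(prompt, noisy_trace, path)
```

Both examples end in the same correct plan; only the intermediate tokens differ. The claim in the summary is that models trained on the second kind of data do about as well as those trained on the first.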

VibeOs: You're still arguing about which model is better?

https://github.com/kaansenol5/VibeOS/blob/main/README.md
1•bakigul•1m ago•0 comments

SkyPilot: One system to use and manage all AI compute (K8s, 20 clouds, Slurm)

https://github.com/skypilot-org/skypilot
1•covi•2m ago•0 comments

Meta is closing down three VR studios as part of its metaverse cuts

https://www.theverge.com/news/861420/meta-reality-labs-layoffs-vr-studios-twisted-pixel-sanzaru-a...
1•jsheard•5m ago•0 comments

Instagram AI Influencers Are Defaming Celebrities with Sex Scandals

https://www.404media.co/instagram-ai-influencers-are-defaming-celebrities-with-sex-scandals/
1•cdrnsf•9m ago•0 comments

X Is a Power Problem, Not a Platform Problem

https://connectedplaces.online/reports/a-power-problem-not-a-platform-problem/
2•cdrnsf•10m ago•0 comments

Who remembers AWS Spot's auction era before the 2017 pricing change?

1•aleroawani•11m ago•0 comments

Chrome 142 Mixed Content Local Network Access

https://developer.chrome.com/blog/local-network-access
1•goodburb•12m ago•0 comments

Show HN: Async bulkhead for Java with explicit overload semantics (v0.3.0)

https://github.com/janbalangue/async-bulkhead
1•janbalangue•13m ago•1 comments

Creating a Cistercian Numerals Generator

https://christianheilmann.com/2026/01/13/monky-business-creating-a-cistercian-numerals-generator/
1•ArmageddonIt•13m ago•0 comments

First AI Directed Reality TV Show

https://twitter.com/Cookiesarefunnn/status/1986463874435178651
1•Nadav--Shanun•14m ago•0 comments

Show HN: Serverless Compute Platform for AWS

https://github.com/acikelli/hyperp
1•oacikelli•15m ago•0 comments

Why Keeping Score Isn't Fun Anymore

https://www.nytimes.com/2026/01/13/books/review/why-keeping-score-isnt-fun-anymore.html
1•anarbadalov•16m ago•0 comments

Chaldean American Fact Sheet

https://www.sterlingheights.gov/DocumentCenter/View/484/Getting-to-Know-Your-Chaldean-American-Ne...
1•marysminefnuf•17m ago•0 comments

Just the Browser: Remove AI features and other annoyances from web browsers

https://justthebrowser.com/
1•oneeyedpigeon•17m ago•0 comments

Analysing Footage of Minneapolis ICE Shooting

https://www.bellingcat.com/news/2026/01/13/analysing-footage-of-minneapolis-ice-shooting/
2•tastyface•18m ago•0 comments

Iran makes high-tech additions to its age-old playbook for crushing protests

https://www.cnn.com/2026/01/13/middleeast/iran-high-tech-additions-playbook-crushing-protests-intl
2•acjohnson55•20m ago•1 comments

Why India's plan to make AI companies pay for training data should go global

https://restofworld.org/2026/india-ai-data-license-fee/
2•brandrick•23m ago•0 comments

Show HN: MemSky: Bluesky timeline viewer web app that saves where you left off

https://memalign.github.io/m/memsky/index.html
1•memalign•24m ago•0 comments

Heltun Removed from Works with Home Assistant

https://www.home-assistant.io/blog/2026/01/13/partner-update-heltun/
1•solarist•24m ago•0 comments

A Galaxy You Can Dig: When Human-Scale Intuition Breaks

https://medium.com/@jud.dagnall/a-galaxy-you-can-dig-when-human-scale-intuition-breaks-e00c4f834e7d
1•saulpw•30m ago•0 comments

Wearable ECG Applications Based on the AD823X Microchip and the Arduino Platform

https://www.mdpi.com/2673-4591/118/1/86
1•PaulHoule•32m ago•0 comments

Hybrid Sovereignties and Inuit Land Claims: Native Corporations [pdf]

https://isonomiaquarterly.com/wp-content/uploads/2025/11/zellen-pfwo.pdf
1•brandonlc•32m ago•1 comments

My Homelab Setup in 2026

https://nikola.kotur.org/my-homelab-setup-in-2026
2•kotnik•32m ago•1 comments

Free Crowdsourced Collection of Cars Owner Manuals

https://justgivemethedamnmanual.com/
2•armenarmen•35m ago•0 comments

OWS (Fake Compressed Archive)

http://justsolve.archiveteam.org/wiki/OWS_(fake_compressed_archive)
1•macote•35m ago•0 comments

Fake Cancer Doctor Insider Trading

https://www.bloomberg.com/opinion/newsletters/2026-01-13/fake-cancer-doctor-insider-trading
1•ioblomov•36m ago•1 comments

Topic2Manim Now Has a UI

https://github.com/mateolafalce/topic2manim
1•lafalce•38m ago•0 comments

SkyCompute – Satellite-Based Cloud Computing Architecture

https://www.dropbox.com/scl/fi/p2kkk9b3bnun9g10zejx8/SKYCOMPUTE.PROJECT.pdf?dl=0j&noscript=1&rlke...
1•pouyam19•40m ago•1 comments

Discontinuing the Teensy at Adafruit

https://blog.adafruit.com/2026/01/12/discontinuing-the-teensy-at-adafruit/
2•ta988•40m ago•0 comments

Apple Apps Will No Longer Receive All New Features Without a Subscription

https://www.macrumors.com/2026/01/13/apple-creator-studio-exclusive-app-features/
5•htk•41m ago•1 comments