frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

LifeWiki | The Wiki for Conway's Game of Life

https://conwaylife.com/wiki/
1•frozenseven•4m ago•0 comments

The Nintendo Virtual Boy Is Now Available for Preorder

https://www.cnet.com/deals/nintendo-virtual-boy-preorders-now-available/
1•not4uffin•7m ago•0 comments

By the Waters of Babylon (1937) by Stephen Vincent Benét [video]

https://www.youtube.com/watch?v=40C2Ua5FYdU
1•ShrugLife•11m ago•0 comments

Why is manufacturing productivity growth so low?

https://www.nber.org/papers/w34264
1•hhs•11m ago•0 comments

Oils 0.37.0 – Alpine Linux, YSH, and mycpp

https://oils.pub/blog/2025/12/release-0.37.0.html
1•birdculture•12m ago•0 comments

Silicon Valley was consistently 10 years ahead of its time

https://old.reddit.com/r/funny/comments/1pl2ui3/bro_how_was_the_show_silicon_valley_so/
2•doener•13m ago•0 comments

A Webapp to Search Emails as an Unikernel

https://blog.robur.coop/articles/2025-04-12-ptt-search-webapp.html
1•TheWiggles•14m ago•0 comments

OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI

https://simonwillison.net/2025/Dec/12/openai-skills/
2•simonw•18m ago•0 comments

Brain Crack (2006) [video]

https://www.youtube.com/watch?v=0sHCQWjTrJ8
2•RossBencina•21m ago•0 comments

50 years of proof assistants

https://lawrencecpaulson.github.io//2025/12/05/History_of_Proof_Assistants.html
3•baruchel•22m ago•0 comments

Special Dyslexia Fonts Are Based on Voodoo Pseudoscience

https://daringfireball.net/linked/2025/12/12/dyslexia-fonts-pseudoscience
1•erickhill•24m ago•0 comments

We Rebuilt Settings in Zed

https://zed.dev/blog/settings-ui
1•erhuve•27m ago•0 comments

US TikTok investors in limbo as deal set to be delayed again

https://www.bbc.com/news/articles/cp34442z25ko
1•1659447091•28m ago•0 comments

Compute in Space: a first principles interactive model

https://astrocompute.dev/
2•kvee•30m ago•0 comments

Industrialized Cybercrime Targets Trust in Public and Private Sectors

https://oilprice.com/Geopolitics/International/Industrialized-Cybercrime-Targets-Trust-in-Public-...
2•PaulHoule•31m ago•0 comments

Turning my reading list into podcasts

https://www.coryd.dev/posts/2025/turning-my-reading-list-into-podcasts
1•cdrnsf•32m ago•0 comments

Why RSS Matters

https://werd.io/why-rss-matters/
2•cdrnsf•32m ago•0 comments

Arduino UNO Q

https://www.arduino.cc/product-uno-q
2•swatson741•35m ago•0 comments

The Biggest Causes of Medical Device Recalls

https://spectrum.ieee.org/medical-device-recalls
2•sohkamyung•38m ago•0 comments

Show HN: Team-first Slack bot that turns bug reports into PRs using Claude

https://github.com/MattKilmer/claude-autofix-bot
1•madcash•38m ago•0 comments

Magit-insert-worktrees improves status buffers

https://huonw.github.io/blog/2025/12/magit-insert-worktrees/
2•dbaupp•38m ago•0 comments

Atmospheric CO₂ Monitoring Dashboard

https://climate.portaljs.com/co2-monitoring
1•CharlesW•40m ago•0 comments

Ukrainians sue US chip firms for powering Russian drones, missiles

https://arstechnica.com/tech-policy/2025/12/ukrainians-sue-us-chip-firms-for-powering-russian-dro...
10•voxadam•41m ago•0 comments

Show HN: Spacelink – link budget / comm system modeling Python library

https://github.com/cascade-space-co/spacelink
1•n6hpa•43m ago•0 comments

Rethinking Data Integrity: Why Domain-Driven Design Is Crucial

https://thenewstack.io/rethinking-data-integrity-why-domain-driven-design-is-crucial/
1•franckpachot•46m ago•0 comments

Post-Quantum Cryptography on CHERIoT

https://cheriot.org/pqc/2025/12/12/pqc-on-cheriot.html
3•todsacerdoti•51m ago•0 comments

Arkansas becoming first state to sever ties with PBS, effective July 1

https://www.ctvnews.ca/world/article/arkansas-becoming-1st-state-to-sever-ties-with-pbs-effective...
2•kotaKat•52m ago•0 comments

Trump signs order to block states from enforcing own AI rules

https://www.bbc.com/news/articles/crmddnge9yro
8•deliass•53m ago•1 comments

Defrag.exfat Is Inefficient and Dangerous

https://github.com/exfatprogs/exfatprogs/issues/318
3•dxdxdt•54m ago•0 comments

The Beauty of Dissonance

https://www.plough.com/en/topics/culture/music/the-beauty-of-dissonance
1•tintinnabula•58m ago•0 comments
Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."