frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Boundaries A talk by Gary Bernhardt from SCNA 2012

https://www.destroyallsoftware.com/talks/boundaries
1•throwaw12•3m ago•0 comments

Fedspeak

https://en.wikipedia.org/wiki/Fedspeak
1•lelandfe•3m ago•0 comments

I built a daily word game with six rarity tiers and no letter tiles

https://playoutlier.com
2•mszerencsy•4m ago•0 comments

Which Buffett? Warren or Jimmy. Can you tell them apart?

https://whichbuffett.github.io/quiz/
2•brightbeige•6m ago•1 comments

S4 – Squished S3

https://github.com/abyo-software/s4
1•tcp_handshaker•6m ago•0 comments

CSP: Bookmarklets should bypass pages' policies (2013 → Infeasible 2026)

https://issues.chromium.org/issues/40077444
1•myfonj•9m ago•0 comments

Illinois Gov. JB Pritzker to suspend tax breaks offered to data centers

https://www.nbcnews.com/politics/2028-election/illinois-gov-jb-pritzker-suspend-tax-breaks-offere...
2•tcp_handshaker•9m ago•0 comments

A Short Story About SQL's Biggest Rival (QUEL) (2020)

https://www.holistics.io/blog/quel-vs-sql/
1•peter_d_sherman•10m ago•1 comments

Commodore's Weirdest OS Is Back for 2026 [video]

https://www.youtube.com/watch?v=z_Lw-Zl7tCM
1•evo_9•11m ago•0 comments

dash: fast webserver for kdb (>100kq/sec)

https://github.com/geocar/dash
1•tosh•11m ago•0 comments

Show HN: Change multiple parts of an image at once with annotations tool[video]

https://www.youtube.com/watch?v=drmvrxDdNoc
1•julienreszka•14m ago•0 comments

Asus GB300 NVL72 Test Lab Tour

https://www.lttlabs.com/articles/2026/06/06/asus-test-server-tour
2•LabsLucas•15m ago•0 comments

Adopting the Parallel DWARF linker in dsymutil

https://jonasdevlieghere.com/post/dsymutil-parallel-linker/
1•JDevlieghere•15m ago•0 comments

ArchiveAlert: Exposing Malicious Scripts

https://www.youtube.com/channel/UCZZUvY13b_Ax5y8Fkrwc5ew
3•Cider9986•17m ago•0 comments

Qualcomm Linux

https://github.com/qualcomm-linux
2•tosh•19m ago•0 comments

Rustikon Agenda

https://www.rustikon.dev/agenda
2•mahirsaid•20m ago•0 comments

The Author Who Saw AI Coming, Somehow [video]

https://www.youtube.com/watch?v=xe4nUZqX9Ec
2•danboarder•24m ago•0 comments

Watch These Judges Rip into Lawyers for Citing Cases That Don't Exist

https://www.404media.co/new-york-court-ai-citations-landberg-case/
2•droidjj•25m ago•0 comments

A Citizen Science Organization Aims to Preserve the Places It Brings Touris

https://www.wired.com/story/preserving-the-places-tourists-go-to-study/
2•joozio•26m ago•0 comments

Rise in NEETs Is a 'Retirement Crisis in Slow Motion', Pension Experts Warn

https://www.ft.com/content/5f9ff5cf-a406-4824-9909-b7b6ba101fcd
3•karakoram•32m ago•1 comments

'More Harmful Than Helpful': Young People Sour on AI

https://www.ft.com/content/73fc962e-ce68-4521-9c5d-841a666eed10
7•karakoram•35m ago•1 comments

RariaSense – vanilla JavaScript library for interactive hover sounds

https://github.com/paulo807/rariasense
2•paulo949•37m ago•0 comments

How the Fight Against Ebola Tested a Culture's Traditions (2015)

https://www.nationalgeographic.com/adventure/article/150130-ebola-virus-outbreak-epidemic-sierra-...
2•recursivecaveat•37m ago•0 comments

How the Boomers Screwed Europe

https://www.economist.com/europe/2026/05/28/how-the-boomers-screwed-europe
3•karakoram•38m ago•2 comments

Heathkit: America's Biggest Loss [video]

https://www.youtube.com/watch?v=lUQOG1hp54g
3•ynac•38m ago•1 comments

I'm waiting for Claude to rm rf my computer

https://12gramsofcarbon.com/p/agentics-local-coding-agents-are
3•theahura•38m ago•1 comments

How Long Does It Take to Plan a Bridge?

https://www.construction-physics.com/p/how-long-does-it-take-to-plan-a-bridge
2•paulpauper•40m ago•0 comments

Did the Credibility Revolution Skip Public Management?

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6816199
2•paulpauper•40m ago•0 comments

The world is bigger than you can imagine

https://scottsumner.substack.com/p/the-world-is-bigger-than-you-can
2•paulpauper•40m ago•0 comments

Public Identifiers, UUIDs and a Tiny SEO Fix

https://perlhacks.com/2026/06/public-identifiers-uuids-and-a-tiny-seo-fix/
2•scottchiefbaker•42m ago•0 comments
Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."