frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Makefiles, Metalanguages, Matrioshka Automata

https://robot-wranglers.github.io/compose.mk/
1•robot-wrangler•59s ago•0 comments

A Battle with My Blood – Tatiana Schlossberg

https://www.newyorker.com/culture/the-weekend-essay/a-battle-with-my-blood
1•pseudolus•1m ago•1 comments

Jailbreaking LLMs via Game-Theory Scenarios

https://arxiv.org/abs/2511.16278
1•belter•4m ago•0 comments

Call Center Lion Air Medan

1•Niggah_Bash•6m ago•0 comments

Internet Protocol over Avian Carriers (1990)

https://www.rfc-editor.org/rfc/rfc1149
1•wmichelin•6m ago•1 comments

Show HN: Bindu – an auth, payment, and communication layer for AI agents

https://github.com/GetBindu/Bindu
1•ai_biden•8m ago•0 comments

73% of AI startups are just prompt engineering

https://pub.towardsai.net/i-reverse-engineered-200-ai-startups-73-are-lying-a8610acab0d3
2•kllrnohj•9m ago•0 comments

Metrik – Real-time LLM latency for voice agents and free API

https://metrik-dashboard.vercel.app/
1•mbouassa•9m ago•1 comments

Show HN: Jabcode Studio, high-density QR-like 2D barcodes for sharing files

https://jabcode.studio
1•jabber-feller•11m ago•1 comments

Show HN: AI Factor Model Stock Screener

https://sophistia.ai
1•valeagent•11m ago•0 comments

Bagaimana Cara Menghubungi AirAsia Indonesia

1•Niggah_Bash•11m ago•0 comments

Paris court blocks auction of earliest-known calculator

https://www.bbc.com/news/articles/c2kpkq90ygno
1•pseudolus•13m ago•1 comments

Mount Proton Drive on Linux using rclone and systemd

https://github.com/dadtronics/protondrive-linux
1•cf100clunk•14m ago•0 comments

Full-Time Work Is Increasing Among Married Moms

https://ifstudies.org/blog/full-time-work-is-increasing-among-married-moms
1•skx001•17m ago•0 comments

BOM's new boss asked to examine $96.5M bill for website redesign

https://www.abc.net.au/news/2025-11-23/bureau-of-meteorology-new-website-cost-blowout-to-96-milli...
1•NvrBeenToAus•20m ago•1 comments

We stopped roadmap work for a week and fixed 189 bugs

https://lalitm.com/fixits-are-good-for-the-soul/
1•lalitmaganti•20m ago•0 comments

Gov. People Announces 6 Critical Tech Areas for the War Department

https://www.war.gov/News/Releases/Release/Article/4333074/under-secretary-of-war-for-research-and...
2•donutloop•24m ago•0 comments

Show HN: Reduce time debugging AI slop in prod

https://github.com/dingus-technology/DINGUS
1•SleepyWalrus•25m ago•0 comments

Interviewing Andrea Borman [video]

https://www.youtube.com/watch?v=iLpIzfZ_2zI
1•spacebuffer•25m ago•0 comments

Gabe Newell: "We don't worry about Piracy" (2009) [video]

https://www.youtube.com/watch?v=Imf-QeQCexk
1•uyzstvqs•25m ago•0 comments

You can save money on LLM tokens as a developer with MCP / ChatGPT apps

https://www.mikeborozdin.com/post/how-mcp-and-chatgpt-apps-can-save-you-tokens
1•mikeborozdin•30m ago•0 comments

Kickstart.nvim: A minimal, single-file starting point for Neovim configuration

https://github.com/nvim-lua/kickstart.nvim
1•nathan-barry•30m ago•0 comments

Volvo ends relationship with Luminar, removes Lidar from vehicles

https://www.repairerdrivennews.com/2025/11/21/volvo-ends-relationship-with-luminar-removes-lidar-...
2•bookofjoe•34m ago•0 comments

How Does an Electron Microscope Work?

https://www.thermofisher.com/blog/materials/how-does-an-electron-microscope-work/
1•kamaraju•34m ago•0 comments

X begins rolling out 'About this account' location feature to users' profiles

https://techcrunch.com/2025/11/21/x-begins-rolling-out-the-about-this-account-feature-to-users-pr...
4•xqcgrek2•37m ago•0 comments

Evals drive the next chapter in AI for businesses

https://openai.com/index/evals-drive-next-chapter-of-ai/
1•gmays•38m ago•0 comments

Bytes before FLOPS: your algorithm is (mostly) fine, your data isn't

https://www.bitsdraumar.is/bytes-before-flops/
1•bofersen•39m ago•0 comments

Meta Looks to Power Trading to Support Its AI Energy Needs

https://www.bloomberg.com/news/articles/2025-11-21/meta-enters-power-trading-to-support-ai-data-c...
2•geox•42m ago•0 comments

Working Title (Insurance)

https://www.bitsaboutmoney.com/archive/working-title-insurance/
1•surprisetalk•43m ago•0 comments

USB in a nutshell: making sense of the USB standard

https://www.beyondlogic.org/usbnutshell/usb1.shtml
2•fanf2•44m ago•0 comments
Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."