frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Why aren't LLM's trained on their own Chain Of Thought?

3•simianwords•1h ago
I realise that without allowing reasoning tokens, a model performs very poorly. It can't perform simple arithmetic or simple logic and hallucinates a bit.

But by allowing it to think a bit and then answer, the result is much better and way more trustable.

This shows a clean RL environment.. or just a nice data-set. Where you prompt the model two times - one without allowing thinking and one with thinking. Penalise the result from non thinking if the result contradicts the answer obtained from thinking.

Comments

i7l•1h ago
For the same reason that anyone's reasoning process and answers to random exam questions are never used as textbooks: if the reasoning is not guaranteed to be right, why would you want to make that training material?
simianwords•1h ago
We can empirically figure out how often the reasoning model is correct. With a 95% empirical accuracy, it should still help the model directionally. No training data set needs to be 100% accurate. No?

Rig: Build modular LLM apps in Rust – 20 providers, one unified interface

https://github.com/0xPlaygrounds/rig
1•michidk•12s ago•0 comments

Ask HN: Will AI be the end of new programming languages?

1•otherayden•25s ago•0 comments

Women were never meant to give birth on their backs

https://www.bbc.com/future/article/20260401-women-were-never-meant-to-give-birth-on-their-backs
1•ilt•1m ago•0 comments

Show HN: Orcastrate – Sync GitHub Actions workflows across repos via templates

https://github.com/michidk/orcastrate
1•michidk•1m ago•0 comments

Mourning for dinosaurs, 65M years too late

https://www.cnn.com/2026/04/05/science/dinosaurs-tiktok-documentary-cec
1•mooreds•2m ago•0 comments

The AI Compute Race: Microsoft's Miss and Oracle's Opportunity (2025)

https://isolveproblems.substack.com/p/the-ai-compute-race-microsofts-miss
1•mooreds•3m ago•0 comments

Zipf's Law

https://en.wikipedia.org/wiki/Zipf%27s_law
1•mooreds•4m ago•0 comments

Loqi, a memory system that preserves context after LLM compaction

https://github.com/wf802222/loqi
1•nobris•6m ago•0 comments

The Forgotten Ones: Actron AM1608 16-Bit CPU. – The CPU Shack Museum

https://www.cpushack.com/2026/04/01/the-forgotten-ones-actron-am1608-16-bit-cpu/
1•rbanffy•8m ago•0 comments

Apple at 50: My journey to the Mac – anderegg.ca

https://anderegg.ca/2026/04/01/apple-at-50-my-journey-to-the-mac
1•rbanffy•11m ago•0 comments

Show HN: Genetic algorithm engine that evolves trading strategies

https://github.com/NeuZhou/finclaw
1•neuzhou•12m ago•0 comments

Musician says AI company is cloning her music, filing claims against her

https://twitter.com/i/status/2040577536136974444
1•lando2319•14m ago•0 comments

Emilia Britannia (public domain freedom mascot)

https://github.com/Joy-less/EmiliaBritannia
3•Joy-less•14m ago•0 comments

Show HN: RPLY - one Inbox for iMessage, WhatsApp, Slack, and Gmail on macOS

https://www.heynox.com
3•mcantillon•20m ago•3 comments

Track what top investors own (13F) and why they own it (10K AI Analysis)

https://superinvestorsbelike.com
3•oodelally•21m ago•0 comments

Framework? I sure hope it does

https://blog.valknight.xyz/framebroken.html
1•coinfused•27m ago•0 comments

Artemis II Tracker – Live Mission Control

https://artemis.cdnspace.ca/
1•rbanffy•27m ago•0 comments

Days Since OpenClaw CVE

https://days-since-openclaw-cve.com/
2•verandaguy•31m ago•0 comments

Show HN: MailMark – Cold email tool where you own your domain and mailboxes

1•debasishbarai•32m ago•1 comments

Show HN: Arbory – Native iOS dashboard and widgets for Plausible Analytics

https://arbory.io/
2•jorijn•38m ago•0 comments

An elegant Pomodoro timer for your terminal

https://github.com/kaushalvivek/pom
1•kaushalvivek•38m ago•0 comments

/Render – 3D Model Skill for Claude Code

https://github.com/mfranzon/render
4•mfranzon•41m ago•0 comments

Can We Measure Software Slop? An Experiment

https://pscanf.com/s/352/
1•RohanAdwankar•41m ago•0 comments

Chinese Chip Firms Hit Record High Revenue Driven by the AI Boom and U.S. Curbs

https://www.cnbc.com/2026/04/03/chinese-chip-firms-record-revenue-ai-boom-us-curbs.html
2•karakoram•43m ago•0 comments

Why Your Engineering Team Is Slow (It's the Codebase, Not the People)

https://piechowski.io/post/codebase-drag-audit/
3•BerislavLopac•43m ago•0 comments

Journalist detained for booing Trump at Kennedy Center Chicago performance

https://www.advocate.com/news/kennedy-center-journalist-detained-trump
5•wahnfrieden•43m ago•0 comments

Securing AI infrastructure to prevent backdoors and sabotage

https://www.the-substrate.net/p/securing-ai-infrastructure-to-prevent
2•erwald•46m ago•0 comments

Code Is Worthless

https://nathanielfishel.substack.com/p/your-code-is-worthless
1•birdculture•47m ago•0 comments

YAML is (not) my preferred configuration format

https://belkadan.com/blog/2026/03/YAML-Is-Not-My-Preferred-Configuration-Format/
3•frizlab•49m ago•0 comments

Eggplant

https://xn--gi8h42h.ws/
1•memalign•51m ago•1 comments