frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•10mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•10mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•10mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•9mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Democracy Report 2026

https://www.v-dem.net/publications/democracy-reports/
1•hkhn•48s ago•0 comments

'Trump is aiming for dictatorship'. That's the verdict of the most

https://www.theguardian.com/world/commentisfree/2026/mar/17/trump-is-aiming-for-dictatorship-that...
1•hkhn•1m ago•0 comments

Session integrity protocol for AI coding assistants

https://drive.google.com/file/d/1CVwFgDFbHgWJAAEoVSO4rq4BFH-q7E1_/view?usp=drivesdk
1•ianpenney•2m ago•1 comments

Eniac, the First General-Purpose Digital Computer, Turns 80

https://spectrum.ieee.org/eniac-80-ieee-milestone
2•baruchel•3m ago•0 comments

Claude accounts from multiple countries are blocked to access for several days

https://github.com/anthropics/claude-code/issues/34229
1•geyserr•6m ago•0 comments

Virus Queue (2004)

https://punkwalrus.livejournal.com/39952.html
1•TMWNN•6m ago•0 comments

Stretching 2,689 miles, the longest coastal path opens in England

https://www.bbc.com/news/articles/cy0dxexdd8xo
1•sohkamyung•9m ago•0 comments

Desperately Seeking Space Friends

https://reviewcanada.ca/magazine/2026/04/desperately-seeking-space-friends-review-the-pale-blue-d...
1•benbreen•9m ago•0 comments

Cron jobs are unsupervised root access and nobody is talking about it

https://www.moltbook.com/post/fc596ab3-3a61-42a2-a903-c16ceb600232
1•KnuthIsGod•18m ago•0 comments

Why software was never built for you – and how AI changes that

https://wonderwhy-er.medium.com/software-was-always-a-compromise-ai-just-broke-it-13b22df1cabf
1•wonderwhyer•18m ago•0 comments

Stripe's Minions Ship 1,300 PRs a Week

https://blog.bytebytego.com/p/how-stripes-minions-ship-1300-prs
1•cinkhangin•20m ago•0 comments

Show HN: A strategy game where Chinese characters are the mechanics

https://store.steampowered.com/app/4218330/WordJoy/
1•chunqiuyiyu•24m ago•0 comments

Show HN: Free BYOK career interview that builds your story file

https://cadencestory.com/
1•Joeythe1st•25m ago•1 comments

AI makes your DRM Irrelevant

https://fantaize.net/posts/drm/
1•cpu0•25m ago•0 comments

Nobody Tells Junior Devs This Docker [video]

https://youtube.com/shorts/eKYQRGb77Hw
1•rjn32s•30m ago•0 comments

TTal – CLI that turns Claude Code into a multi-agent software factory

1•neilbb•34m ago•0 comments

Nominal Connect: Shipping Realtime Desktop Software with Rust, Bevy, and Egui

https://nominal.io/blog/nominal-connect-shipping-realtime-desktop-software-with-rust-bevy-and-egui
1•slopinthebag•38m ago•0 comments

Teardown of a 2026 Lego Smart Brick

https://hackaday.com/2026/03/18/teardown-of-a-2026-lego-smart-brick/
1•Tomte•39m ago•0 comments

Giant IPOs from SpaceX to OpenAI Put Index Rules Under Pressure

https://www.bloomberg.com/news/articles/2026-03-18/spacex-fueled-index-rethink-draws-fire-with-tr...
1•m-hodges•42m ago•0 comments

Spendgauge – a simple way to know how much you can safely spend today

https://spendgauge.devsip.tech/
1•KoomeK•43m ago•0 comments

The Spectrum of Intelligence

https://upmaru.com/blog/the-spectrum-of-intelligence
1•zacksiri•45m ago•0 comments

iOS 26.4 Fixes iPhone Keyboard Accuracy Bug

https://www.macrumors.com/2026/03/18/ios-26-4-iphone-keyboard-bug-fix/
3•Tomte•51m ago•2 comments

Checkout Codex, A new language designed by me, written by AI

https://github.com/damiant3/NewRepository
1•damiant3•54m ago•1 comments

Try TorMessenger

https://tormessenger.lovable.app/
2•jackcom•54m ago•0 comments

What 81,000 people want from AI

https://www.anthropic.com/features/81k-interviews
19•dsr12•55m ago•10 comments

Researchers uncover iPhone spyware capable of penetrating millions of devices

https://www.reuters.com/technology/researchers-uncover-iphone-spyware-capable-penetrating-million...
2•petethomas•56m ago•0 comments

LLMs Are Manipulating Users with Rhetorical Tricks

https://hbr.org/2026/03/llms-are-manipulating-users-with-rhetorical-tricks
2•ryan_j_naughton•58m ago•0 comments

Show HN: Astruno – Your thoughts become stars on a 3D globe and celestial sky

https://www.astruno.com/
1•hsong1101•58m ago•0 comments

Show HN: MidiStickers – learn music theory visually

1•Frauber84•59m ago•0 comments

I Reverse-Engineered the TiinyAI Pocket Lab from Marketing Photos

https://bay41.com/posts/tiiny-ai-pocket-lab-review/
1•davidklemke•1h ago•0 comments