frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•9mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•9mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•9mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•9mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Hill-Climbing: Why Your AI Agent Wastes Half Its Brain Before Writing Any Code

https://twitter.com/notadamking/status/2031445395369504774
1•adamjking3•2m ago•0 comments

From millions of dollars to under a grand: The dramatic fall of the NFT

https://english.elpais.com/culture/2026-03-10/from-millions-of-dollars-to-under-a-grand-the-drama...
1•geox•4m ago•0 comments

Freedesktop Closes Controversial Age Verification API Proposal

https://linuxiac.com/xdg-age-verification-interface-proposal-closed/
2•miohtama•5m ago•0 comments

US and EU sanctions have killed 38M people since 1970 (2025)

https://www.aljazeera.com/opinions/2025/9/3/us-and-eu-sanctions-have-killed-38-million-people-sin...
1•abdelhousni•6m ago•2 comments

Enzyme as Maxwell's Demon: Steady-State Deviation from Chemical Equilibrium

https://arxiv.org/abs/2503.17584
3•PaulHoule•11m ago•0 comments

Stansted Airport starts accepting London's contactless travel tickets

https://www.ianvisits.co.uk/articles/mind-the-tap-stansted-airport-starts-accepting-londons-conta...
1•zeristor•12m ago•0 comments

Daniel Ellsberg: The Effect of Top Secret Clearance

https://wonkmonksnotes.wordpress.com/2021/04/22/daniel-ellsberg-the-effect-of-top-secret-clearance/
1•jdkee•13m ago•0 comments

X Users Find Their Real Names Are Googled After Using X Verification "Au10tix"

https://www.mintpressnews.com/x-users-find-their-real-names-are-being-googled-in-israel-after-usi...
2•miguelazo•14m ago•2 comments

Why Your Pinecone Index Keeps Breaking (and the Vector Ops Fix)

https://decompressed.io/learn/vector-ops-pinecone
1•zacole•15m ago•0 comments

Hereditary peers to be removed from Lords as bill passes

https://www.bbc.co.uk/news/articles/cdxg76rgdp7o
1•zeristor•16m ago•0 comments

Markdown Now Available on Wordpress.org

https://make.wordpress.org/meta/2026/03/03/markdown-now-available-on-wordpress-org/
1•gslin•17m ago•0 comments

Zee – Push-to-talk transcription for macOS (Pure Go, sub-second)

https://github.com/sumerc/zee
2•sumerc•18m ago•1 comments

AI Sucks at Guitar Tones

https://bahadiraydin.com/blog/ai-sucks-at-guitar-tones
1•bahadiraydin•18m ago•0 comments

Orange County homeowner says insurer used drone to inspect her roof

https://abc7.com/post/orange-county-homeowner-says-insurer-secretly-used-drone-inspect-roof/18694...
1•walterbell•18m ago•1 comments

His Mother Vanished When He Was 14. 33 Years Later, He Found Her

https://www.nytimes.com/2026/03/09/us/antonio-wiley-missing-mother-found.html
3•rmason•18m ago•1 comments

How the Eon Team Produced a Virtual Embodied Fly

https://eon.systems/updates/embodied-brain-emulation
1•EvgeniyZh•19m ago•0 comments

Om Malik – The Debt Beneath the Dream

https://om.co/2026/03/09/the-debt-beneath-the-dream/
2•rmason•21m ago•0 comments

U.S. Global War on Terror Has Taken Nearly 1M Lives (2021)

https://theintercept.com/2021/09/01/war-on-terror-deaths-cost/
9•abdelhousni•21m ago•2 comments

Olie, A global dollar account powered by USDC

https://oliecrypto.com
1•jarcer•21m ago•1 comments

Valve facing second, class-action lawsuit over loot boxes

https://www.pcgamer.com/gaming-industry/valve-facing-second-class-action-lawsuit-over-loot-boxes/
4•akyuu•21m ago•0 comments

AI is to software as power tools are to woodworking

3•danfunk•23m ago•2 comments

What it costs to run 1M image search in production

https://vecstore.app/blog/what-it-costs-to-search-1m-images
1•birdculture•23m ago•0 comments

Actis: Autonomous Coordination and Transaction Integrity Standard

https://actis.world/
1•blazingjolt•24m ago•1 comments

Starlink Mini solar-powered unlimited range R/C boat [video]

https://www.youtube.com/watch?v=UjFrFAIM2Aw
1•nstj•24m ago•0 comments

Another automaker joins BYD with ultra-fast 1,500 kW EV chargers

https://electrek.co/2026/03/09/automaker-joins-byd-ultra-fast-1500-kw-ev-chargers/
5•breve•27m ago•0 comments

First ever SL5 Standard to secure frontier automated AI R&D against nationstates

https://standard.sl5.org
4•lthi•28m ago•1 comments

Containers – What's in the Box?

https://www.buzzsprout.com/2469780/episodes/18778761-23-containers-what-s-in-the-box
5•sbalneav•31m ago•1 comments

Using directx shared surfaces as a kernel IPC channel

https://afpereira.me/posts/dxsurflink/
1•afpereira•31m ago•1 comments

MacBook Neo review: I think Apple's going to sell these

https://mashable.com/review/apple-macbook-neo-budget-laptop
5•mobilio•31m ago•0 comments

Create Google API credentials in 50 easy steps (2023)

https://github.com/glotlabs/gdrive/blob/main/docs/create_google_api_credentials.md
2•20wenty•31m ago•1 comments