frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

HTML: Composer and Perl HTML Templating

https://rawley.xyz/posts/html-composer.html
1•rawleyfowler•36s ago•0 comments

My Love-Hate Relationship with Page Builders

https://eliotdill.substack.com/p/my-love-hate-relationship-with-page
1•DillyDally125•44s ago•0 comments

New Intel Linux Driver Patches Enable HDR over DP MST Connections

https://lore.kernel.org/dri-devel/20260626175510.3899476-1-gildekel@google.com/
1•DemiGuru•2m ago•0 comments

Roblox parental controls are a dystopian security disaster

1•notsure357•2m ago•0 comments

QA/Testing at Startups

1•ovi_firstqa•4m ago•0 comments

The EU Wants to Grow Homegrown Tech. Its Courts Keep Making That Impossible

http://www.techdirt.com/2026/06/26/the-eu-wants-to-grow-homegrown-tech-its-courts-keep-making-tha...
1•beardyw•6m ago•0 comments

In Loving Memory of Om Malik – Hodinkee

https://www.hodinkee.com/articles/in-loving-memory-of-om-malik-friend-writer-venture-capitalist-a...
1•adamfuhrer•6m ago•0 comments

AI Models Directory (To Compare)

https://aimodels.directory/
1•entempsllc•7m ago•0 comments

Transformers Explained for Software Engineers

https://bharad.dev/blog/transformers-and-attention
1•bharadwajp•7m ago•0 comments

Europe's largest datacentre hub leaves town sweltering

https://www.theguardian.com/environment/2026/jun/26/slough-is-like-an-experiment-europes-largest-...
1•speckx•8m ago•0 comments

Designing a Personal Pebble Watchface

https://www.jonashietala.se/blog/2026/06/26/designing_a_personal_pebble_watchface/
1•lawn•11m ago•0 comments

Auto-Charge Tracker makes Steam Controller move toward its charging dock

https://videocardz.com/newz/modder-makes-steam-controller-move-itself-to-the-charging-puck
1•LorenDB•16m ago•0 comments

The Ontological Consequences of AGI Autonomy

https://secondexpulsion.substack.com/p/the-apple-the-serpent-and-the-outside
2•cosmosjang•16m ago•0 comments

Cory Doctorow on the Right – and Wrong – Way to Criticize AI

https://jacobin.com/2026/06/ai-bubble-layoffs-workers-copyright
1•thunderbong•19m ago•0 comments

I was laid off from Meta and now I work at a butcher shop [video]

https://www.youtube.com/watch?v=oLzuO4FmcqE
3•mmarian•21m ago•0 comments

If your blog doesn't have an RSS feed, then it's not a blog

https://martinschuhmann.com/rss
16•speckx•21m ago•4 comments

The New Blub Paradox, Or: Why TypeScript Is a Poor Choice for the AI Era

https://www.iankduncan.com/engineering/2026-06-26-the-new-blub-paradox/
1•gnabgib•22m ago•1 comments

High School Student Journalists Investigate Flock

https://scotscoop.com/smile-youre-on-flock-how-san-mateo-county-built-a-network-of-mass-surveilla...
1•coloneltcb•24m ago•0 comments

How PgBouncer Works

https://www.augusteo.com/blog/how-pgbouncer-works/
1•linggen•27m ago•0 comments

Show HN: Run multiple instances of Codex(GUI) each with their own auth

https://github.com/ccheney/codex-multi-account
1•ccheney•28m ago•0 comments

Trump threatens 100% tariff on any country that imposes digital services tax

https://www.reuters.com/world/us/trump-threatens-100-tariff-any-country-that-imposes-digital-serv...
2•onemoresoop•28m ago•1 comments

Fireship was bought by a major investing firm

https://old.reddit.com/r/webdev/comments/1m8a2yx/fireship_was_bought_by_a_major_investing_firm/
1•mmarian•29m ago•1 comments

GPT-5.6 Preview System Card

https://deploymentsafety.openai.com/gpt-5-6-preview
2•blazespin•30m ago•0 comments

The SaaS Adventure (2015)

https://techcrunch.com/2015/02/01/the-saas-travel-adventure/
1•mooreds•32m ago•0 comments

Gossamer: a Rust-flavoured language with real goroutines and pause-free memory

https://gossamer-lang.org/
2•mwheeler•32m ago•0 comments

Expect RAM prices to stay high with Micron locking in deals for 5 years

https://www.gamingonlinux.com/2026/06/expect-ram-prices-to-stay-high-with-micron-locking-in-deals...
1•speckx•34m ago•2 comments

Slisp: Simple Lisp compiler (Linux/amd64)

https://github.com/skx/slisp
12•stevekemp•34m ago•1 comments

SpaceX's Starfall Demo Mission

https://www.spacex.com/launches/starfalldemo
10•caseysoftware•35m ago•0 comments

Hot Lotto Fraud Scandal (2017)

https://en.wikipedia.org/wiki/Hot_Lotto_fraud_scandal
1•badcryptobitch•35m ago•0 comments

Show HN: ToolPalace – 25 free browser tools that work offline, no sign-up

https://toolpalace.online
2•sohilpathan•36m ago•0 comments