frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Why India wants German submarines

https://www.dw.com/en/why-india-wants-german-submarines-and-what-pakistan-and-china-have-to-do-wi...
1•rustoo•1m ago•0 comments

The first trillionaire is a killer

https://www.theverge.com/tech/949259/the-worlds-first-trillionaire-is-a-killer
1•okneil•1m ago•0 comments

Seasonal changes in human hair growth

https://pubmed.ncbi.nlm.nih.gov/2003996/
1•JumpCrisscross•1m ago•0 comments

Why greatness cannot be planned

https://yinuoli.org/ken-stanley-and-joel-lehman-why-greatness-cant-be-planned/
1•andsoitis•1m ago•0 comments

What Happens to an Economy When It's Too Hot to Work?

https://www.bloomberg.com/news/features/2026-06-12/india-s-extreme-heat-is-hurting-its-economy-an...
3•littlexsparkee•5m ago•0 comments

Running DOS on Behringers DDX3216 with a DIY x86-Bios from Scratch

https://chrisdevblog.com/2026/06/08/running-dos-on-behringers-ddx3216-using-a-diy-x86-bios/
2•rasz•8m ago•0 comments

Something is jamming GPS over Europe. Here's what we found

https://www.youtube.com/watch?v=tz23G_UXCGA
2•nradov•9m ago•0 comments

Show HN: Deterministic and offline duplicate-code detector

https://github.com/Rafaelpta/dupehound
3•rafaepta•10m ago•0 comments

TinyWind

https://tinywind.io
2•kqr•12m ago•0 comments

Memory-mapped files considered harmful (for databases) (2022)

https://quasar.ai/2022/01/24/memory-mapped-files-considered-harmful/
2•tosh•14m ago•0 comments

Rows Are Made for Sorting and That's Just What We'll Do (2023) [pdf]

https://duckdb.org/pdf/ICDE2023-kuiper-muehleisen-sorting.pdf
2•tosh•14m ago•0 comments

Google Gemini-SQL2 tops text-to-SQL benchmarks

https://the-decoder.com/google-researchs-gemini-sql2-tops-text-to-sql-benchmarks-by-a-wide-margin/
2•geox•17m ago•0 comments

AI forgoes toxic positivity for neurodivergents

https://medium.com/@mantaman555/the-daily-exhaustion-of-waiting-mode-why-standard-productivity-sy...
2•FDX2018•18m ago•0 comments

Show HN: Seer – Private Ollama Chat in the Browser, No Account Needed

https://manticthink.com/
2•Colewilliamz•18m ago•0 comments

Crime theory: Rehabilitation, or harsh punishment

https://agoralogica.com/debates/cee5881c-a333-4e79-b81d-17552904a568
2•Phaedruss•19m ago•0 comments

There is not 'sentient plasma', refuting the claims of David Grusch

2•dabadabad00•22m ago•0 comments

Guru AI Lab

https://guruailab.com/
2•POILCIAMILTON•24m ago•2 comments

Libraries Bet That Readers Haven't Arrived Yet

https://hari.computer/the-library-should-move-at-lindy-speed
2•markovblanket•25m ago•0 comments

How Much Stuff Do You Own?

https://www.inconspicuous.info/p/how-much-stuff-do-you-own
3•NaOH•26m ago•0 comments

CastChat-Chat de Zov

http://CastChat-Chatdezov.com
1•POILCIAMILTON•26m ago•1 comments

Free Open Source full on APP called treemap. This thing is a gem

https://github.com/Prithvi-Web/Treemap
1•DaGoat487•31m ago•0 comments

SpaceX Went Public – A Disaster Waiting to Happen [video]

https://www.youtube.com/watch?v=FPIGu0anfAE
1•_feynon•34m ago•0 comments

The Future of Work and AI

https://www.wsj.com/tech/ai/economists-weigh-in-on-the-future-of-work-and-ai-f59311e9
2•reconnecting•34m ago•0 comments

Ask HN: Did we witness the "Trinity moment" for AI?

3•vld_chk•38m ago•1 comments

Tell HN: iOS devs, get back lots of disk space: xcrun simctl delete unavailable

2•amichail•38m ago•0 comments

Tell HN: June 2026 shows largest gap between "Who is hiring?/wants to be hired?"

1•lukasm•40m ago•0 comments

Eywa: Local-first memory for AI agents, with a receipt for every fact

https://arxiv.org/abs/2605.30771
1•agentseal•43m ago•0 comments

An interview with an Apple emoji designer

https://shadycharacters.co.uk/2026/06/ollie-wagner/
1•nate•43m ago•0 comments

Fable situation update from David Sacks

https://twitter.com/DavidSacks/status/2065853007619588171
3•rohansood15•45m ago•6 comments

Do More with Less

https://biggestfish.substack.com/p/do-more-with-less
5•lilfrost•47m ago•0 comments