frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•12mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•12mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•12mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•12mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Revealed: Facebook AI accounts spreading fake political 'good news'

https://www.independent.co.uk/news/uk/politics/facebook-ai-post-nigel-farage-full-fact-b2977352.html
1•01-_-•34s ago•0 comments

On reading Iain M. Banks

https://matiasseidler.substack.com/p/on-reading-iain-m-banks
1•pseudolus•1m ago•0 comments

Observations on AI agent token consumption

https://willhackett.com/agent-token-consumption/
1•speckx•1m ago•0 comments

Show HN: RAGDebugger – observability – retrieval-augmented generation pipeline

https://www.ragdebugger.com/
1•vasan_natarajan•3m ago•0 comments

The Rage of the Billionaires Is Coming

https://www.thebignewsletter.com/p/monopoly-round-up-the-rage-of-the
1•aworks•4m ago•0 comments

Drake Equation Calculator

https://mendiak.github.io/drake.equation/
1•surprisetalk•5m ago•0 comments

The Next War Is Here. the West Isn't Ready. [audio]

https://www.latent.space/p/the-fourth-law
1•mooreds•5m ago•0 comments

Evil Recipe Blog (2025)

https://willmooney3.com/creation/evil-recipe-blog
1•mooreds•5m ago•0 comments

Captcha, Ergo Sum

https://badastronomy.beehiiv.com/p/captcha-ergo-sum
1•celias•6m ago•0 comments

Enemies of the Invisible Hand

https://www.thefreedomfrequency.org/p/letter-4-enemies-of-the-invisible
1•mooreds•7m ago•0 comments

EU plans to force companies to buy parts from non-Chinese suppliers

https://www.ft.com/content/57b4852c-f323-45b8-b8d0-5a6426dd781e
1•robtherobber•8m ago•0 comments

Homunctor – The Simplest Agent

https://github.com/abtinf/homunctor
1•abtinf•8m ago•1 comments

Capturing ideas with voice, local LLMs, and obsidian

https://aidenredmondd.substack.com/p/my-life-is-a-mess-2
1•clinicalred•9m ago•0 comments

Show HN: Dataset for AI training and fine tuning

https://neurvance.com/
1•Adam_SDDk•13m ago•0 comments

Steven Soderbergh defends using AI in a documentary about John Lennon

https://apnews.com/article/john-lennon-steven-soderbergh-ai-cannes-documentary-7794a4344ed455cae4...
1•smurda•13m ago•0 comments

AI Revenue May Jump 5x to $200B This Year as Spending Race Intensifies

https://www.fidelity.com/news/article/company-news
1•gluke_bywalker•13m ago•0 comments

New arsenal jersey with Deel (YC) on the sleeves

https://arsenaldirect.arsenal.com/Football-Shirts-and-Kit/Home/Arsenal-adidas-26-27-Authentic-Hom...
3•aaqaishtyaq•15m ago•0 comments

We engineered RAG to be 50% faster

https://elevenlabs.io/blog/engineering-rag
1•ChicknNuggt•15m ago•0 comments

Show HN: Uber burned its 2026 AI budget by April. Why? 73% redundant reads

https://argosbrain.com/blog/re-read-tax
1•CataDef•15m ago•0 comments

PostgreSQL ext makes LLM available as an index for similarity searches,inference

https://codeberg.org/gregburd/pg_infer
1•kermatt•15m ago•0 comments

What's My Size?: How to Read a Size Chart

https://truestyle.substack.com/p/whats-my-size-how-to-read-a-size
1•crescit_eundo•17m ago•0 comments

Mermaid Diagrams Are Unreadable in Real-World Technical Docs

https://clairetsao.substack.com/p/mermaid-diagrams-are-unreadable-in
1•missmoss•18m ago•0 comments

The Simplicity Trap: Why AI is making us "simple" in the wrong way (or not)

https://higashi.blog/2026/05/09/simplicity/
1•yuedongze•18m ago•0 comments

Trump cuts to weather data could make forecasts less reliable, warn experts

https://www.theguardian.com/us-news/2026/may/18/trump-cuts-ai-weather-prediction-forecasts
3•Gedxx•18m ago•0 comments

We Have Prison Gangs (2024)

https://asteriskmag.com/issues/08/why-we-have-prison-gangs
1•surprisetalk•19m ago•0 comments

'AI' Could Lead to a Rise in Research Slop

https://www.nominalnews.com/p/ai-research-slop-p-hacking
2•NomNew•20m ago•0 comments

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

https://github.com/2b2tplace/1m_release
2•exploraz•22m ago•0 comments

Bad News for the Average Pentester

https://www.atredis.com/blog/2026/5/15/bad-news-for-the-average-pentester
2•speckx•24m ago•0 comments

GoDaddy Agent Naming Service (ANS)

https://www.godaddy.com/ans/
3•hasheddan•24m ago•0 comments

Sharla Boehm, the programmer whose code underpins the Internet

https://www.scientificamerican.com/article/the-programmer-whose-code-underpins-the-internet/
2•dxs•24m ago•0 comments