frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Being a Dad

https://www.derekthompson.org/p/three-reasons-to-be-a-parent
1•Curiositry•5m ago•1 comments

AI Erodes a Legacy of Reading

https://molochinations.substack.com/p/ai-erodes-a-legacy-of-reading
2•domhudson•5m ago•0 comments

The Permission Slip

https://www.cringely.com/2026/05/28/the-permission-slip/
1•B1FF_PSUVM•15m ago•0 comments

Iterating Faster with TypeScript 7

https://code.visualstudio.com/blogs/2026/06/26/iterating-faster-with-ts-7
2•soheilpro•16m ago•0 comments

Show HN: Boot a tiny app-making OS inside an AI chat

https://hollingsworthd.github.io/RAu/
1•logn•16m ago•0 comments

The Impacts of Unauthorized Immigration on U.S. Labor and Housing Markets

https://www.dallasfed.org/research/papers/2026/wp2607
1•nradov•31m ago•1 comments

Enterprise AI customers pulling back from OpenAI and Anthropic as costs mount

https://qz.com/enterprise-ai-spending-openai-anthropic-roi-pullback-062626
2•toomuchtodo•35m ago•4 comments

Compiler Education Deserves a Revolution

https://thunderseethe.dev/posts/compiler-education-deserves-a-revoluation/
1•azhenley•36m ago•0 comments

I Co-Founded Wikipedia. Now I'm Banned for Life

https://www.thefp.com/p/larry-sanger-wikipedia-co-founder-banned
9•bko•36m ago•3 comments

Show HN: Tested – AI Tools Scored by a Panel of LLMs (Claude, GPT, Gemini, Grok)

https://trytested.com
1•pro_methe5•38m ago•0 comments

What's the value of the Biblical half shekel?

https://www.chabad.org/library/article_cdo/aid/673063/jewish/Whats-the-value-of-the-biblical-half...
1•teleforce•39m ago•0 comments

Show HN: GetBlocked – a local-only Chrome extension for blocking web trackers

https://github.com/alex-w-developer/GetBlocked/
6•sudo_cowsay•44m ago•3 comments

Indus civilisation reveals its volumetric system

https://www.thehindu.com/news/national/Indus-civilisation-reveals-its-volumetric-system/article16...
1•teleforce•46m ago•0 comments

Evolving the Node.js Release Schedule

https://nodejs.org/en/blog/announcements/evolving-the-nodejs-release-schedule
1•thunderbong•46m ago•0 comments

Greatgramps: A static site generator for GRAMPS family trees

https://bennuttall.com/blog/2026/06/greatgramps/
1•benn_88•49m ago•0 comments

Join .agent – Pre-Register Your Domain Free

https://agentcommunity.org/join
2•Anon84•55m ago•1 comments

The State of the AI Economy

https://intelligence.exponentialview.co/
2•marc__1•55m ago•0 comments

How a Star Wars Lego Dispute Triggered an Armed Police Raid in Utah

https://www.wsj.com/us-news/how-a-star-wars-lego-dispute-triggered-an-armed-police-raid-in-utah-c...
1•fortran77•56m ago•1 comments

Building While Becoming

https://pushapaula.substack.com/
2•paulawp•57m ago•0 comments

The Supreme Court's Era of Meaningless Rights

https://www.theatlantic.com/ideas/2026/06/supreme-court-decisions-unenforceable-rights/687713/
1•paulpauper•59m ago•0 comments

Show HN: I scanned 87 MCP servers for agent-authority hygiene – leaderboard

https://capframe.ai/leaderboard
1•euan21•1h ago•2 comments

Show HN: Hacker News on a Train Station Style Flip Board

https://popflame.quickish.space/hn-flipboard/
2•PaybackTony•1h ago•1 comments

The Duck Is Growing

https://gemenergyanalytics.substack.com/p/the-duck-is-growing
1•paulpauper•1h ago•1 comments

The State of the AI Economy

https://www.exponentialview.co/p/the-state-of-the-ai-economy
1•paulpauper•1h ago•0 comments

The formation of human populations in South and Central Asia

https://www.science.org/doi/10.1126/science.aat7487
1•teleforce•1h ago•0 comments

Show HN: Hatchr – Share Claude Designs with a public link

https://www.hatchr.link/
1•othmanosx•1h ago•0 comments

Show HN: Noise Lang, JIT stochastic programming language in Rust

https://noiselang.com/
1•manucorporat•1h ago•0 comments

PrismLib – semantic LLM cache and cluster mesh that cuts token spend

https://github.com/insightitsGit/prismlib
2•insightits•1h ago•0 comments

Economists have pushed for prediction markets. They're not what they'd hoped for

https://www.cnn.com/2026/06/21/business/prediction-markets-economists
2•JumpinJack_Cash•1h ago•1 comments

Routing for serverless servers with Pingora, Envoy, and Spanner

https://modal.com/blog/serverless-servers
2•birdculture•1h ago•0 comments