frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•11mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•11mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•11mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

We built a machine-readable merchant verification layer for AI shopping agents

https://github.com/warwickwood-cell/gengeo-agent-registry
2•gengeo-ai•2m ago•0 comments

Recursive Self-Improvement Delivers New SOTA Coding Performance

https://poetiq.ai/posts/recursive_self_improvement_coding/
2•icodestuff•3m ago•0 comments

Honda posts first annual loss on $9B EV writedown, scraps EV sales goals

https://www.reuters.com/business/autos-transportation/honda-books-first-annual-loss-hit-by-hefty-...
3•kristianp•7m ago•1 comments

What's in My Cert Kit?

https://blog.networkprofile.org/whats-in-my-cert-kit/
2•monstermunch•8m ago•0 comments

Accelerating Hamming Quasi-Cyclic (HQC) with Additive FFT

https://eprint.iacr.org/2026/014
1•teleforce•8m ago•0 comments

The real singularity is the friends we made along the way

https://geohot.github.io//blog/jekyll/update/2026/05/09/real-singularity.html
1•oliculipolicula•15m ago•0 comments

Raindrop – Local Agent Debugger

https://github.com/raindrop-ai/workshop
1•felixbraun•17m ago•0 comments

Windows BitLocker zero-day gives access to protected drives, PoC released

https://www.bleepingcomputer.com/news/security/windows-bitlocker-zero-day-gives-access-to-protect...
1•akyuu•17m ago•0 comments

LLM Policy for Rust Compiler

https://github.com/rust-lang/rust-forge/pull/1040
1•liyanage•18m ago•0 comments

LLMs run on top of an OS designed for code, not weights

https://github.com/matthewworner/spike
2•matthewworner•18m ago•0 comments

Sam Altman Is Taking a Lot of Punches on the Witness Stand

https://www.motherjones.com/politics/2026/05/altman-musk-openai-lawsuit-witness-questioning-ai/
2•cdrnsf•18m ago•0 comments

New Fragnesia Linux flaw lets attackers gain root privileges

https://www.bleepingcomputer.com/news/security/new-fragnesia-linux-flaw-lets-attackers-gain-root-...
1•akyuu•23m ago•0 comments

AMD EPYC CPUs Reach Record Server Revenue Share of 46.2%

https://www.techpowerup.com/349029/amd-epyc-cpus-reach-record-server-revenue-share-of-46-2
4•akyuu•28m ago•0 comments

Have a Coherent AI Policy

https://brianmeeker.me/2026/05/14/have-a-coherent-ai-policy/
3•ai_critic•30m ago•0 comments

Shareable AI Editable Visualizations

https://framejs.io/docs/intro.html
1•dionjw•32m ago•0 comments

Boeing, Toyota Donated $1M Each to Transportation Secretary's Road-Trip Show

https://www.wsj.com/business/boeing-toyota-donated-1-million-each-to-transportation-secretarys-ro...
1•impish9208•32m ago•2 comments

Decisions in the past have long running repercussions

https://www.distributedthoughts.org/2026-05-07-roman-bridge-still-determines-your-commute/
2•prosaic-hacker•36m ago•1 comments

A Professor in Every Pocket – A New Framework for Higher Education

https://lagomor.ph/2026/01/a-professor-in-every-pocket/
1•ChilledTonic•43m ago•0 comments

Isaac Newton on Laputa

https://www.historytoday.com/archive/great-debates/isaac-newton-laputa
1•hhs•51m ago•0 comments

mimalloc: A new, high-performance, scalable memory allocator for the modern era

https://www.microsoft.com/en-us/research/blog/mimalloc-a-high-performance-scalable-memory-allocat...
6•matt_d•51m ago•0 comments

A scientist made a clone of a clone of a clone of a clone

https://www.nationalgeographic.com/science/article/scientists-reclone-mice+
1•mrtedbear•51m ago•0 comments

Learn Python the Hard Way Was Right About One Thing

https://fagnerbrack.com/learn-python-the-hard-way-was-right-about-one-thing-9b6ab0b67526
2•birdculture•57m ago•0 comments

AI to infest eight in ten premium phones within two years

https://www.theregister.com/personal-tech/2026/05/14/ai-to-infest-eight-in-ten-premium-phones-wit...
2•Bender•58m ago•0 comments

Cisco to fire 4k staff and generously give them free training – on Cisco

https://www.theregister.com/networks/2026/05/14/cisco-to-fire-4000-staff-and-generously-give-them...
3•Bender•1h ago•0 comments

To gain root access at this company, all an intruder had to do was ask nicely

https://www.theregister.com/security/2026/05/14/to-gain-root-access-intruder-just-had-to-ask/5239853
1•Bender•1h ago•0 comments

Encountering the roots of mathematics

https://www.ias.edu/ideas/encountering-roots-mathematics
1•hhs•1h ago•0 comments

AI Poop Analysis App Offered to Sell Me Database of Its Users' Poops

https://www.404media.co/ai-poop-analysis-app-offered-to-sell-me-access-to-its-users-poops/
2•Cider9986•1h ago•0 comments

ICLR 2026 – Institutional Affiliations Dataset and Analysis

https://github.com/DmytroLopushanskyy/iclr2026-affiliations
4•stared•1h ago•0 comments

Do deep learning models recognize 3D shapes in the same way humans do?

https://www.santafe.edu/news-center/news/do-deep-learning-models-recognize-3d-shapes-in-the-same-...
1•hhs•1h ago•0 comments

Mirror Life's Doomsday Potential

https://www.noemamag.com/the-doomsday-organism/
1•littlexsparkee•1h ago•0 comments