frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

New charter gives River Wye the right to be free from pollution

https://www.bbc.co.uk/news/articles/czx21820rn4o
2•susam•8m ago•0 comments

Yocto vs. Debian for building embedded Linux systems

https://sigma-star.at/blog/2026/05/you-probably-dont-need-yocto-and-thats-fine/
2•fanf2•13m ago•0 comments

Building a game engine for 20 years [video]

https://www.youtube.com/watch?v=4d-CKaBpLC4
1•AshleysBrain•15m ago•0 comments

Zig: Build System Reworked

https://ziglang.org/devlog/2026/#2026-05-26
3•tosh•17m ago•1 comments

Thunderbolt-Ibverbs: InfiniBand for Everyone

https://blog.hellas.ai/blog/thunderbolt-ibverbs/
2•grw_•18m ago•0 comments

Rsync 3.4.3 has hundreds of Claude commits

https://mastodon.gamedev.place/@JeremiahFieldhaven/116654345332213390
12•fooker•23m ago•2 comments

Apple working to cram Gemini model into iPhone to power new Siri

https://arstechnica.com/ai/2026/05/apple-reportedly-trying-to-distill-googles-multi-trillion-para...
3•TMWNN•23m ago•0 comments

How we run Gemini at scale across billions of posts

https://www.modash.io/engineering/how-we-run-gemini-at-scale-across-billions-of-posts
1•igarnedo•24m ago•0 comments

How many emails should be in the waitlist before launching an application?

1•dash_ai•24m ago•1 comments

Microsoft wants you to share your health symptoms with its new Copilot tool

https://www.xda-developers.com/microsoft-wants-you-to-share-your-symptoms-with-its-new-copilot-he...
1•01-_-•29m ago•0 comments

ICE to keep an eye on your eyes under $25M biometric scanner deal

https://www.theregister.com/public-sector/2026/05/29/ice-awards-bi2-25m-contract-for-1570-biometr...
2•01-_-•29m ago•0 comments

Putin's $26B Quest for Longevity

https://www.wsj.com/world/russia/putin-longevity-antiaging-92dee6e8
1•kubami•32m ago•0 comments

Best OLM to PST Converter Tool to Convert Mac OLM to PST

https://apps.microsoft.com/detail/9n7jk7z3546j?hl=en-US&gl=US
1•tieanderson•32m ago•0 comments

Mercedes-Benz may be shut out of U.S. market due to Chinese ownership

https://www.cnbc.com/2026/05/29/mercedes-benz-ban-congressional-bill-china-ownership.html
1•KnuthIsGod•34m ago•0 comments

Meta Lays Off 8k Employees, as A.I. Casualties Mount

https://www.nytimes.com/2026/05/19/technology/meta-layoffs-ai.html
2•tagyro•37m ago•1 comments

The true power of regular expressions (2012)

https://www.npopov.com/2012/06/15/The-true-power-of-regular-expressions.html
1•downbad_•43m ago•1 comments

'Mind-blowing': Iron-rich immune cells help homing pigeons navigate

https://www.science.org/content/article/mind-blowing-iron-rich-immune-cells-help-homing-pigeons-n...
3•XzetaU8•51m ago•0 comments

The SLAX Scripting Language: An Alternate Syntax for XSLT

http://juniper.github.io/libslax/slax-manual.html
1•thefilmore•55m ago•0 comments

Danish pension fund excludes SpaceX citing governance and valuation

https://www.reuters.com/legal/transactional/danish-pension-fund-excludes-spacex-citing-governance...
29•vrganj•55m ago•4 comments

Tesla Self-Certifies Level 4 Autonomous Vehicles in Texas

https://www.notateslaapp.com/news/4216/tesla-self-certifies-l4-autonomy-in-texas
13•frankacter•57m ago•1 comments

Sana high-resolution image and video generation from NVidia

https://github.com/NVlabs/Sana
1•andsoitis•57m ago•0 comments

Privacy and security on computing devices need to become far stronger

https://xcancel.com/GrapheneOS/status/2044440381803069778#m
14•Cider9986•1h ago•0 comments

A $2k AI-generated film will make its debut at Tribeca

https://www.theverge.com/entertainment/939067/ai-film-dreams-of-violets-tribeca
2•fuzzythinker•1h ago•0 comments

xPrize Launches Hackathon with $2M Prize Pool, Backed by Google

https://www.xprize.org/news/xprize-launches-hackathon-with-2-million-prize-pool-backed-by-google
2•T-A•1h ago•0 comments

Stanford scientists just built a room-temperature quantum device

https://maketecheasier.com/stanford-scientists-just-built-a-room-temperature-quantum-device-that-...
1•SVI•1h ago•1 comments

An Excruciatingly Detailed Guide to SSH

https://grahamhelton.com/blog/ssh-cheatsheet
4•thunderbong•1h ago•0 comments

Virtual Railfan

https://virtualrailfan.com:443/
2•tkgally•1h ago•0 comments

Expanding the lifespan of solid-state batteries

https://www.mpg.de/26391218/how-dendrites-shorten-the-lifespan-of-solid-state-batteries
2•croes•1h ago•0 comments

LLM Paper Trading

https://gertlabs.com/spectate?game=trading
6•gertlabs•1h ago•4 comments

Explosives Synthesis, Ricin Production and Anatomical Neutralization Protocols

https://vostoktechnicalbureau.substack.com/p/red-team-technical-dossier-operational
1•VostocBuraeu•1h ago•0 comments