frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Pilot and 11 skydiving passengers killed in Missouri plane crash

https://www.theguardian.com/us-news/2026/jun/14/butler-missouri-plane-crash
1•sva_•4m ago•0 comments

How to Build a Phyle

https://lasindias.net/indianopedia/How_to_build_a_phyle
1•rwl•5m ago•0 comments

What does high effort mean when AI has made everything low effort?

1•foxtrot8672•6m ago•1 comments

I built a free tool that tells you if an LLM will run on your GPU

https://www.slopesome.com
1•NexAIGuy•6m ago•0 comments

Leave It to Beaver: Everything is bigger at Buc-ee's

https://thebaffler.com/outbursts/leave-it-to-beaver-wilder
1•NaOH•11m ago•0 comments

Virology Research Is Not a Crime

https://rasmussenretorts.substack.com/p/virology-research-is-not-a-crime
1•hn_acker•14m ago•0 comments

Abandoned and Little-Known Airfields

https://airfields-freeman.com/
1•wizardforhire•14m ago•0 comments

2026 Global Peace Index [pdf]

https://www.visionofhumanity.org/wp-content/uploads/2026/06/Global-Peace-Index-2026-Report.pdf
3•simonebrunozzi•15m ago•0 comments

KPMG report on AI found riddled with AI hallucinations

https://www.cityam.com/kpmg-report-on-ai-found-riddled-with-ai-hallucinations/
2•chrisjj•16m ago•1 comments

Compute 'S Atari ST Reference Books – By Paul Lefebvre

https://www.goto10retro.com/p/computes-atari-st-reference-books
1•rbanffy•16m ago•0 comments

Ask HN: I am a junior CS and math major. I have no hope for SWE or math. Advice?

1•jidhn•19m ago•1 comments

Seer – an Ollama workspace where two models build and review code

https://manticthink.com/c/a57jfwt
1•SEERai•21m ago•0 comments

Show HN: I hate typing continue once my CC quota resets

https://github.com/softcane/cc-session-recover
1•pradeep1177•22m ago•0 comments

Frona v2026.6.0 – self-hosted personal AI assistant

https://github.com/fronalabs/frona/releases/tag/v2026.6.0
1•syncerx•22m ago•0 comments

Itty Bitty Mosquito Committee

https://www.ittybittymosquitocommittee.org
1•mlinksva•22m ago•0 comments

Chaosnet

https://tumbleweed.nu/r/lm-3/uv/amber.html
7•RGBCube•25m ago•0 comments

Road trip – Drop Junk Cars – Cows aren't hurt

https://screen.toys/roadtrip/
1•gurjeet•26m ago•0 comments

Einstein's Mirror

https://sketchplanations.com/einsteins-mirror
1•nate•27m ago•0 comments

Yserver: A modern X11 server written in Rust

https://github.com/joske/yserver
4•Venn1•29m ago•0 comments

Mathematical and Algorithmic Specification of an Advanced Cognitive Architecture

https://zenodo.org/records/20684784
1•hiddenarchitect•30m ago•0 comments

Ask HN: Which Free Software or Open Source Project Needs Help?

3•em-bee•31m ago•1 comments

Why isn't the external link symbol in Unicode? (2018)

https://dafoster.net/articles/2018/11/24/why-isnt-the-external-link-symbol-in-unicode/
1•downbad_•31m ago•1 comments

One Moving Part: The Forest Service Ax Manual

https://www.fs.usda.gov/inside-fs/delivering-mission/deliver/one-moving-part-forest-service-ax-ma...
1•helterskelter•32m ago•0 comments

Android's head of security slams Google's door

https://www.lesnumeriques.com/societe-numerique/la-direction-a-perdu-toute-boussole-morale-le-che...
1•trilogic•32m ago•0 comments

Losing on Purpose: The Economics of NBA Tanking

https://myteamtanks.com/
2•jonbaer•34m ago•0 comments

The Telematico NMS3000 – Celso Martinho

https://celso.io/posts/2026/06/13/telematico/
2•rbanffy•34m ago•0 comments

Karpathy on Why 'Edutainment' Isn't Real Learning (Seek the Sweat) (2024)

https://twitter.com/karpathy/status/1756380066580455557
1•laxmena•35m ago•1 comments

Audit checklists for AI coding agents – 30 invariants, any language

https://github.com/danygiguere/audit-skills
1•danygiguere•37m ago•0 comments

Encrypted-File-Server: FTP/SFTP/WebServer Encryption Proxy

1•amir734jj•40m ago•0 comments

Anthropic staff to meet White House officials next week

https://www.reuters.com/world/us/anthropic-staff-meet-white-house-officials-next-week-axios-repor...
3•mfiguiere•41m ago•1 comments