frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Show HN: How to Use Unlimited Token Plan with OpenCode

https://twitter.com/CanopyWave_AI/status/2069957571184832525
1•Timmyzzz•1m ago•0 comments

Anthropic Claims Alibaba Ran 'Brazen' Campaign to Access Its Claude AI Model

https://www.wsj.com/tech/ai/anthropic-claims-alibaba-ran-brazen-campaign-to-access-its-claude-ai-...
1•flowerlad•3m ago•0 comments

ORA: Smaller Models. Same Intelligence

https://www.oracomputing.com/
1•doener•5m ago•0 comments

The Era of Tokenmaxxing Is Over

https://techcrunch.com/2026/06/24/companies-are-scrambling-to-stop-employees-from-maxing-out-ai-b...
1•sambcui•5m ago•0 comments

Show HN: Tree, truth, druid and tar share one Proto-Indo-European root

https://p.migdal.pl/tree-of-tree/
1•stared•5m ago•0 comments

Vibecoding a High Performance System

https://andrewkchan.dev/posts/systems.html
2•davedx•12m ago•0 comments

NextBSD – FreeBSD ABI-compatible kernel with Mach built in and launchd

https://nextbsd.org/
2•sunshine-o•15m ago•0 comments

Hollywood and Big Tech Are Preparing for War

https://www.hollywoodreporter.com/business/business-news/hollywood-big-tech-war-entertainment-pla...
1•thm•16m ago•0 comments

Show HN: Cc-preview – Browse images pasted into Claude Code sessions

https://github.com/Watari995/cc-preview
1•Watari995•16m ago•0 comments

Flatpak package for GIMP 0.54.1 (1996)

https://gitlab.gnome.org/balooii/gimp-0.54
1•birdculture•20m ago•0 comments

13 years and $500M for a stage adapter? Report justifies NASA cancellations

https://arstechnica.com/space/2026/06/analysis-finds-the-exploration-programs-nasa-recently-cance...
1•rbanffy•20m ago•0 comments

Show HN: JSON Bonsai – browser JSON viewer that stays smooth on 100k+ nodes

https://github.com/pedrosousa13/JSON-Bonsai
1•pedrosousa•21m ago•0 comments

How to Build 1-Minute OHLC Bars from Non-Uniform Market Snapshot Data

https://medium.com/@DolphinDB_Inc/how-to-build-1-minute-ohlc-bars-from-non-uniform-market-snapsho...
2•dbaa4real•22m ago•0 comments

Show HN: Best Alternative for Zendesk, Intercom, and Freshdesk

2•Daniel-Pan•23m ago•0 comments

Monolith Rift, a brutalist corridor of light and impossible scale

https://sand-morph.up.railway.app/monolith-rift
4•echohive42•25m ago•1 comments

UMLBot: Converting natural language and code excerpts to editable UML diagrams

https://www.sciencedirect.com/science/article/pii/S2352711026002815
1•geox•26m ago•0 comments

Rcarmo/kata: Repetition makes perfect

https://github.com/rcarmo/kata
1•rcarmo•33m ago•0 comments

Claude Opus 4.5 vs. GLM-5.2

https://gopeekapp.blogspot.com/2026/06/glm-52-vs-claude-opus-45.html
1•bhartipoddar•33m ago•0 comments

US fighter pilot avoids British trial after raping a woman in England

https://www.theguardian.com/uk-news/ng-interactive/2026/jun/25/us-fighter-pilot-strangled-woman-e...
4•Alien1Being•37m ago•1 comments

Qualcomm Investor Day 2026 Data Center Announcements CPUs

https://www.servethehome.com/qualcomm-investor-day-2026-data-center-announcements-cpus-ai-acceler...
1•ksec•39m ago•1 comments

A Y2K bug surfaced 26 years late today

https://old.reddit.com/r/sysadmin/comments/1uetdyw/a_y2k_bug_surfaced_26_years_late_today/
1•thunderbong•40m ago•0 comments

Calculate Real-Time Implied Volatility for Commodity Options

https://medium.com/@DolphinDB_Inc/how-we-built-a-real-time-implied-volatility-engine-for-commodit...
2•Polly_Liu•42m ago•0 comments

Warrior Cats Name Generator

https://warriorcatsnamegenerator.net/
1•woflying•43m ago•0 comments

We Made Trading Signals Microsecond-Level Easy – 100 Factors in 40µs

https://medium.com/@DolphinDB_Inc/we-just-made-trading-signals-microsecond-level-easy-100-factors...
2•CrazyTomato•43m ago•0 comments

You are leaving tech, what's next?

https://www.seattletimes.com/business/local-business/older-tech-workers-are-tapping-out-early-her...
1•ynac•43m ago•1 comments

Long distance Uber alternative (for passengers and drivers)

https://localsride.com/en
1•dr_dimitru•43m ago•0 comments

Will AI replace technical writers?

https://willaireplacetechnicalwriters.com/
1•theletterf•44m ago•0 comments

Got access to Gemini's actual thinking

1•StizzurpXDD•44m ago•0 comments

The AI Coding Era Makes Boring Tests More Valuable

https://www.vincentschmalbach.com/the-ai-coding-era-makes-boring-tests-more-valuable/
1•vincent_s•45m ago•0 comments

10x More Selective

https://yosefk.com/blog/10x-more-selective.html
1•tosh•46m ago•0 comments