frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•8mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•8mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•8mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•8mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Functional Programming in M4

https://minnie.tuhs.org/pipermail/tuhs/2020-August/022108.html
1•fanf2•1m ago•0 comments

AI makes it easier to build the wrong thing faster

https://newsletter.masilotti.com/p/ai-makes-it-easier-to-build-the-wrong
1•joemasilotti•1m ago•1 comments

Show HN: I built a macOS desktop toy that patrols while you work

https://airwolfspace.com/tinytanks
1•kailuo•1m ago•0 comments

Poison at Play: Unsafe lead levels found in half of New Orleans playgrounds

https://veritenews.org/2026/02/05/poison-at-play-playgrounds-lead-levels/
1•hn_acker•1m ago•0 comments

Unresponsive Buttons on My Fastest Hardware

https://blog.jim-nielsen.com/2026/unresponsive-buttons/
1•speckx•2m ago•0 comments

AI-First Company Memos

https://the-ai-native.company/
1•bobismyuncle•2m ago•0 comments

How to Test ProxySQL Read/Write Split with Sysbench

https://rendiment.io/mysql/proxysql/2026/02/03/sysbench-proxysql.html
1•nethalo•3m ago•0 comments

The singularity won't be gentle – by Nate Silver

https://www.natesilver.net/p/the-singularity-wont-be-gentle
1•rbanffy•4m ago•0 comments

A New Computer Could Replace Electricity with Light

https://www.popularmechanics.com/science/a70223544/computer-could-replace-electricity-with-light/
1•falcor84•5m ago•0 comments

Show HN: Health.md - Apple Health → Markdown

https://healthmd.isolated.tech/
1•codybontecou•5m ago•0 comments

PicoClaw: Ultra-Efficient AI Assistant in Go

https://github.com/sipeed/picoclaw
1•wicket•6m ago•0 comments

AITools.coffee – GitHub metrics observatory tracking 27K+ open-source AI repos

https://aitools.coffee
1•alexela84•6m ago•1 comments

AI Agents 101: From Concept to Code (No Frameworks Required)

https://medium.com/@kamil.tustanowski/ai-agents-101-from-concept-to-code-no-frameworks-required-2...
1•semerkchet•6m ago•0 comments

Databases should contain their own Metadata – Use SQL Everywhere

https://floedb.ai/blog/databases-should-contain-their-own-metadata-instrumentation-in-floe
3•matheusalmeida•7m ago•0 comments

Seeking Order in Chaos

https://garrit.xyz/posts/2026-02-11-on-seeking-order-in-chaos
3•garritfra•7m ago•0 comments

Show HN: Funxy – A typed scripting language that embeds into Go apps

https://github.com/funvibe/funxy
1•funbitty•7m ago•0 comments

The jarring experience of developing today

https://its.beer/thoughts/the-jarring-experience-of-developing-today
1•beerd•8m ago•0 comments

Kiro: DeepSeek, MiniMax, and Qwen now available as open weight model options

https://kiro.dev/changelog/models/deepseek-minimax-and-qwen-now-available-as-open-weight-model-op...
2•siegers•8m ago•0 comments

Terence Tao: Why I Co-Founded SAIR

https://www.youtube.com/watch?v=Z5GKnb4H_bM
1•nyc111•10m ago•0 comments

Maia 200: The AI accelerator built for inference

https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference/
1•MarlonPro•13m ago•0 comments

Gravity: Dynamically typed, embeddable programming language written in C

https://www.gravity-lang.org
1•klaussilveira•13m ago•0 comments

Power-User Utility to Recover, Export, Merge, Audit, and Sort Chrome Extensions

https://github.com/ZulfekarAliAgha/REMAS
1•zulali•14m ago•1 comments

Show HN: A compiled programming language for LLM-to-LLM communication [pdf]

https://sifsystemsmcrd.com/KL_White_Paper.pdf
1•tmbird•14m ago•1 comments

Show HN: See what your AI agents do under the hood

https://pingpulsehq.com
1•shafeeq2207•15m ago•0 comments

EPA to repeal its own conclusion that greenhouse gases warm the planet

https://www.nbcnews.com/science/climate-change/epa-to-repeal-endangerment-finding-climate-change-...
8•geox•15m ago•2 comments

Can you trust LastPass in 2026? Inside the quest to rebuild its security culture

https://www.zdnet.com/article/lastpass-2026-rebuilding-trust-ceo-interview/
3•arusahni•19m ago•0 comments

Show HN: Z-Image Base – Fast AI Image Generator (Open-Source, Free Tier)

https://z-imagebase.com/
1•chengai1106•20m ago•0 comments

When the Competition Is Down the Hall

https://k2xl.substack.com/p/when-the-competition-is-down-the
1•k2xl•20m ago•0 comments

The Banality of MAGA Evil

https://paulkrugman.substack.com/p/the-banality-of-maga-evil
5•rbanffy•21m ago•0 comments

Show HN: Onlybots.cam

https://onlybots.cam
1•m0rtyn•21m ago•0 comments