frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•8mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•8mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•8mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•8mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

EmuDevz – A game about building emulators

https://store.steampowered.com/app/4260720/EmuDevz/
1•evo_9•4m ago•0 comments

The Catcher in the Prompt: Day 60

https://blog.pytoshka.me/post/the-catcher-in-the-prompt/
1•kenny-opennix•4m ago•1 comments

Chinese Batteries Will Run the World

https://www.nytimes.com/2026/01/19/opinion/trump-energy-china-future.html
2•cmod•5m ago•0 comments

Show HN: A simple fork of gpodder2go for lightweight self-hosted podcast sync

https://github.com/ijustlovemath/gpodder2go
1•ijustlovemath•7m ago•0 comments

The Overcomplexity of the Shadcn Radio Button

https://maxkapur.com/2025/12/19/perfect-match-integer-programming.html
1•owenlacey•9m ago•0 comments

Hijacking Bluetooth Accessories Using Google Fast Pair

https://whisperpair.eu/
1•csmantle•12m ago•0 comments

RCS for Business

https://developers.google.com/business-communications/rcs-business-messaging
1•sshh12•13m ago•1 comments

Cyberpunk 2077 VR mod disappears after mod maker pulls the plug

https://www.pcgamer.com/games/vr/cyberpunk-2077-vr-mod-disappears-after-mod-maker-decides-hed-rat...
1•evo_9•15m ago•0 comments

The Tenth Watch for the Tenth Pitch Drop

http://thetenthwatch.com
1•mattas•16m ago•0 comments

Product Validation Launch Platform

https://www.launchradar.cc/
1•abdullah9•22m ago•0 comments

ADHD Routine Planner

https://www.habitualy.app/
1•abdullah9•23m ago•0 comments

Why isn't cindyllm blocked? Obviously a bot

https://news.ycombinator.com/user?id=cindyllm
2•raw_anon_1111•25m ago•1 comments

Who Contributed to PostgreSQL Development in 2025?

http://rhaas.blogspot.com/2026/01/who-contributed-to-postgresql.html
1•thunderbong•30m ago•0 comments

Common Lisp developer role Ravenpack

https://old.reddit.com/r/lisp/comments/1qedv4v/common_lisp_developer_role_ravenpack/
1•mike_ivanov•31m ago•0 comments

Observing Positronium Beam as a Quantum Matter Wave for the First Time

https://www.tus.ac.jp/en/mediarelations/archive/20260115_5801.htm
1•rustoo•31m ago•0 comments

Velox: A Port of Tauri to Swift by Miguel de Icaza

https://github.com/velox-apps/velox
2•wahnfrieden•35m ago•0 comments

x86 prefixes and escape opcodes flowchart

https://soc.me/interfaces/x86-prefixes-and-escape-opcodes-flowchart.html
1•gaul•40m ago•0 comments

React Native for macOS

https://github.com/microsoft/react-native-macos
1•wahnfrieden•42m ago•0 comments

Dear Jonas

https://denmark.news-pravda.com/en/world/2026/01/19/19000.html
2•barrister•43m ago•0 comments

RFC: A proposal to replace API integration with LLM Semantic Translation

https://github.com/kaylorjc-protocol/semantic-integration-layer-SIL-protocol
1•kaylorjc•44m ago•2 comments

The Good Hallucinations

https://chris-hartwig.com/blog/you-want-hallucinated-code/
2•weddpros•54m ago•0 comments

Good listeners connect more easily with strangers, study finds

https://phys.org/news/2025-12-good-easily-strangers.html
1•PaulHoule•55m ago•0 comments

Flying with Photons: Rendering Novel Views of Propagating Light

https://anaghmalik.com/FlyingWithPhotons/
1•pillars•59m ago•0 comments

The Paper 2

https://zenodo.org/records/18304357
1•KaoruAK•59m ago•0 comments

F-16 Falcon Strike, modern combat flight SIM for Atari XL/XE

https://webchrono.pl/F16FalconStrike/index.html
35•starkparker•1h ago•2 comments

Keeping 20k GPUs Healthy

https://modal.com/blog/gpu-health
1•aburan28•1h ago•0 comments

Show HN: Circe – Deterministic, offline-verifiable receipts for AI agent actions

https://github.com/wv26296-ux/circe-receipts
1•W_rey45•1h ago•1 comments

CoreSpeed: Agent Runtime Infrastructure

https://corespeed.io
1•handfuloflight•1h ago•0 comments

Open Reimplementation of Google Widevine Content Decryption Module for Browsers

https://github.com/tchebb/openwv
2•pabs3•1h ago•0 comments

Idiomatic Rust – A peer-reviewed collection of Rust articles/talks/repos

https://github.com/rust-lang-nursery/rust-cookbook
2•Brysonbw•1h ago•1 comments