frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•9mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•9mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•9mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•9mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

FlowDesk Version 1.2.1

https://www.indiehackers.com/post/flowdesk-version-1-2-1-DjQEJgd0OGdrflMb5nbL
1•AaronDeSloover•41s ago•0 comments

AnySpeech – AI Voice Cloning and Text to Speech Tool

https://anyspeech.io
1•Jasonleo•2m ago•1 comments

Show HN: I Replaced ML Anomaly Detection with Artificial Immune System in Rust

https://github.com/zot-run/zot-cell
1•nkr_hn•4m ago•1 comments

Structural signature of plasma proteins classifies status of Alzheimer's disease

https://www.nature.com/articles/s43587-026-01078-2
1•bookofjoe•4m ago•0 comments

Years After the Early Death of a Math Genius, Her Ideas Gain New Life

https://www.quantamagazine.org/years-after-the-early-death-of-a-math-genius-her-ideas-gain-new-li...
1•tzury•7m ago•0 comments

Show HN: Watchtower – Minimal, terminal-based global intelligence dashboard

https://github.com/lajosdeme/watchtower
1•lajosdeme•9m ago•0 comments

The UNIX License Plate (1980)

https://www.unix.org/license-plate.html
1•quuxplusone•10m ago•0 comments

Five Hundred PRs with Claude Code and the Future of Software Engineering

https://tobeva.com/articles/five-hundred/
2•pbw•10m ago•0 comments

Nord-Lock Bolt Washers

https://pr0gramm.com/top/6934365
1•daneel_w•11m ago•0 comments

Show HN: A calculator for your 10,000th day on Earth and other milestones

https://bonusbirthdays.com
1•eirabben•14m ago•0 comments

U.S. Military Has Used Long-Range Kamikaze Drones in Combat for the First Time

https://www.twz.com/news-features/u-s-military-has-used-long-range-kamikaze-drones-in-combat-for-...
1•breve•14m ago•0 comments

Serious Game

https://en.wikipedia.org/wiki/Serious_game
1•downboots•16m ago•0 comments

Nobody should be on-call in 2026

https://nominal.dev/
1•donutshop•17m ago•0 comments

'Can't sell house' searches are higher now than during the 2008 housing crisis

https://www.morningstar.com/news/marketwatch/20260228147/cant-sell-house-searches-are-higher-now-...
4•DocFeind•19m ago•1 comments

Apache Otava

https://otava.apache.org/
2•djoldman•19m ago•0 comments

Show HN: No Free Ride – Split Gas Bills with Friends

https://apps.apple.com/us/app/no-free-ride-split-gas/id6759541565
1•krispycreame•19m ago•0 comments

Handoff: pick up where you left off when switching between Claude Code and Codex

https://github.com/sahir2k/handoff
2•er1t0•22m ago•1 comments

The Uttar Pradesh Association of Dead People

https://economist.com/interactive/1843/2026/02/27/the-uttar-pradesh-association-of-dead-people
1•andsoitis•23m ago•0 comments

Don't go to the shoe shop to buy plates

https://naomialderman.substack.com/p/dont-go-to-the-shoe-shop-to-buy-plates
1•hairofadog•23m ago•0 comments

ACM's Expression of Concern on a 2024 paper

https://cacm.acm.org/research/reevaluating-googles-reinforcement-learning-for-ic-macro-placement/
1•azhenley•24m ago•0 comments

Show HN: MCP Playground – free MCP test servers, inspector, and 10K+ server list

https://mcpplaygroundonline.com
3•rupatiwari25•24m ago•5 comments

Open Source, SaaS, and the Silence After Unlimited Code Generation

https://worksonmymachine.ai/p/open-source-saas-and-the-silence
1•Stwerner•25m ago•0 comments

Software for One

https://koenvangilst.nl/lab/software-for-one
2•vnglst•26m ago•0 comments

Show HN: ApplyGhost – Auto-apply to jobs with quality, not quantity

https://applyghost.com
1•Gaasre•31m ago•0 comments

Show HN: The L Project- An analysis of over 1600 job rejection emails that I got

https://rohankhante.substack.com/p/thank-you-for-your-application-breakdown
1•Rohunyyy•36m ago•0 comments

How HN: Agent-Vault – A Zero-trust credential manager for AI agents

https://github.com/ewimsatt/agent-vault
1•ewimsatt•36m ago•1 comments

AI Made Writing Code Easier. It Made Being an Engineer Harder

https://www.ivanturkovic.com/2026/02/25/ai-made-writing-code-easier-engineering-harder/
51•saikatsg•36m ago•34 comments

On the emotional weight of a life in medicine

https://www.aaroncheng.me/trauma-life-medicine/
1•milkcircle•37m ago•1 comments

Pentagon Adopts Incel-Speak

https://www.theguardian.com/science/2026/mar/01/incel-slang-mainstream-government-media
10•zabzonk•37m ago•2 comments

The Norwegian Consumer Council delved into enshittification and how to resist it [video]

https://vimeo.com/1168468796
1•jahala•37m ago•1 comments