frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•9mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•9mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•9mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•9mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

MongoDB outage – AWS UAE and Bahrain datacenters

https://status.mongodb.com/incidents/7g5qmxgkc2y4
1•peterjliu•10s ago•0 comments

Show HN: Scan your dev machine for AI agents, MCP servers, and IDE extensions

https://github.com/step-security/dev-machine-guard
1•varunsharma07•33s ago•0 comments

Show HN: Mozzie – a local desktop orchestrator for AI coding agents

https://github.com/usemozzie/mozzie
1•zacharykapank•1m ago•0 comments

API Design Principles for the Agentic Era

https://www.apideck.com/blog/api-design-principles-agentic-era
1•mooreds•2m ago•0 comments

Lloyds, Bank of Scot and Halifax apps showed customers other users' transactions

https://www.bbc.co.uk/news/articles/c4g23npxpwgo
1•ColinWright•2m ago•0 comments

IonRouter (YC W26) – High throughput, low cost inference

https://ionrouter.io
1•vshah1016•3m ago•1 comments

AI Cluster Runtime: Reproducible Configs for GPU-Accelerated Kubernetes Clusters

https://developer.nvidia.com/blog/validate-kubernetes-for-gpu-infrastructure-with-layered-reprodu...
1•mchmarny•3m ago•1 comments

NM framework on Karpathy's autoresearch factory

https://nervousmachine.substack.com/p/3000-agents-are-running-experiments
1•hb_124•3m ago•0 comments

Oops, You Wrote a Database

https://dx.tips/oops-database
2•tosh•7m ago•0 comments

Engram – A distributed memory system for AI agents, with extensible architecture

https://vincents-ai.github.io/engram/
2•section_me•8m ago•1 comments

Show HN: Mingle – find and connect with people, like LinkedIn but in your chat

https://github.com/aeoess/mingle-mcp
1•Tima_fey•9m ago•0 comments

Current and former Block workers say AI can't do their jobs

https://www.theguardian.com/technology/2026/mar/08/block-ai-layoffs-jack-dorsey
4•devonnull•10m ago•0 comments

PHP-rnet – a PHP extension that mimics real browser TLS fingerprints

1•takielias•13m ago•1 comments

A small CLI for stopping Git worktrees from fighting over ports

https://github.com/johndockery/portlock
1•ilovejazz442•13m ago•0 comments

Svelte Best Practices

https://svelte.dev/docs/svelte/best-practices
1•Erenay09•13m ago•0 comments

Lowdown can translate Markdown to an mdoc manpage

https://kristaps.bsd.lv/lowdown/mdoc.html
1•fanf2•14m ago•0 comments

FSC Age Verification Bill Tracker

https://action.freespeechcoalition.com/age-verification-bills/
1•muyuu•16m ago•0 comments

Disney+ Teases Creator-Driven Content as It Launches Vertical Video Feature

https://www.hollywoodreporter.com/business/digital/disney-creator-content-launches-vertical-video...
1•andsoitis•18m ago•1 comments

The FermAI Paradox: Agents Need Their IDE Moment

https://docs.ctx.rs/blog/the-fermai-paradox
3•ripped_britches•18m ago•1 comments

New F1 regulations take bravery out of the sport, drivers say

https://www.reuters.com/sports/formula1/new-f1-regulations-take-bravery-out-sport-drivers-say-202...
2•samizdis•21m ago•0 comments

Local Agents with Llama.cpp and Pi

https://huggingface.co/docs/hub/agents-local
2•kristianpaul•21m ago•0 comments

Show HN: Aurion OS – A 32-bit GUI operating system written from scratch in C

https://github.com/Luka12-dev/AurionOS
11•Luka12-dev•22m ago•1 comments

Ask HN: Rethinking SaaS architecture for AI-native systems

2•RobertSerber•23m ago•1 comments

Weak Cyberdefenses Threaten U.S. Tech Dominance

https://www.foreignaffairs.com/united-states/americas-endangered-ai
5•fheiding•23m ago•0 comments

Anthropic invests $100M into the Claude Partner Network

https://www.anthropic.com/news/claude-partner-network
4•surprisetalk•24m ago•0 comments

gstack – Garry Tan's Claude Code Setup

https://github.com/garrytan/gstack
3•jumploops•25m ago•0 comments

The Tao of Kung Fu: The Undiscerning Mind [video]

https://www.youtube.com/watch?v=Q5J4nHdr134
1•jamesgill•26m ago•0 comments

Is MacBook Neo "The One"? [video]

https://www.youtube.com/watch?v=AwuKCgSgcR4
2•tosh•26m ago•0 comments

WebZero – a web server that serves 5k req/SEC on a 2001 Pentium III

https://github.com/davitotty/webzero
2•Davitotty1•27m ago•1 comments

'The shine has been taken off': Dubai faces existential threat

https://www.theguardian.com/world/2026/mar/11/the-shine-has-been-taken-off-dubai-faces-existentia...
4•akbarnama•27m ago•0 comments