frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•11mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•11mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•11mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Bret Taylor's Sierra Buys YC-Backed AI Startup Fragment

https://techcrunch.com/2026/04/23/bret-taylors-sierra-buys-yc-backed-ai-startup-fragment/
1•zachdotai•3m ago•0 comments

Light-activated material offers new approach to carbon dioxide conversion

https://phys.org/news/2026-03-material-approach-carbon-dioxide-conversion.html
1•PaulHoule•3m ago•0 comments

Prompt engineering is dead, but Claude still tries

https://blog.exe.dev/prompt-engineering-is-dead
1•vinipolicena•3m ago•0 comments

Launching XOXO Explore

https://xoxofest.com/blog/2026-launching-xoxo-explore/
1•benwerd•6m ago•0 comments

Dial9: A Flight Recorder for Tokio

https://tokio.rs/blog/2026-03-18-dial9
1•PaulHoule•7m ago•0 comments

Making a 3B Robot Policy Faster

https://www.hapticlabs.ai/blog/2026/04/23/my-first-week-at-haptic-making-a-3b-robot-policy-faster
4•ibero•9m ago•0 comments

Glápagos Back end – built for the Americas

https://www.glapagos.com/glapp
1•thecastroquiels•10m ago•0 comments

Cybersecurity incident leaves U.S. drivers stranded

https://spectrum.ieee.org/connected-vehicle-risks
2•pseudolus•14m ago•1 comments

New Long-Necked Dinosaur Diacovered in Patagonia

https://snsb.de/en/palaeontologists-discover-new-long-necked-dinosaur-in-patagonia/
2•gmays•15m ago•0 comments

Supply chain cracks constrain AI boom

https://www.axios.com/2026/04/23/ai-iran-supply-chain
1•petethomas•18m ago•0 comments

Shearwaters washing up dead on Australian beaches not due to 'natural' causes

https://theconversation.com/more-shearwaters-are-washing-up-dead-on-australian-beaches-its-not-du...
1•defrost•21m ago•0 comments

Tenfold: Ten Years of Ink & Switch

https://www.inkandswitch.com/
1•spiralganglion•22m ago•0 comments

Ask HN: How do solo devs protect their work in the age of vibe coding?

2•langs•23m ago•2 comments

Combatting the person who trademarked the name of silent actress Louise Brooks

https://louisebrookssociety.blogspot.com/2026/04/trademark-on-film-icon-louise-brooks.html
1•aworks•23m ago•0 comments

Show HN: MirrorNeuron – an open-source runtime for reliable on-device AI agents

https://www.mirrorneuron.io/
1•homerquan•25m ago•0 comments

Tesla Never Stopped Developing the Model S [video]

https://www.youtube.com/watch?v=IwnJzP0TlCk
1•CHB0403085482•28m ago•0 comments

The Kissinger Tapes

https://blog.oup.com/2026/04/the-kissinger-tapes/
1•jruohonen•32m ago•0 comments

Selvedge: Capture the why behind AI code changes

https://github.com/masondelan/selvedge
1•masondelan•33m ago•1 comments

Tagging Music with MusicBrainz Picard

https://lwn.net/Articles/1066384/
1•signa11•36m ago•0 comments

Deterministic arcade shooter – same game for everyone, AI coaches your replay

https://gamefilm.org
1•shettysuraj•36m ago•0 comments

The Declining Driver's License: Good, Bad, or Both?

https://maxmautner.com/2026/04/21/teen-drivers-license-decline.html
2•jez•38m ago•0 comments

Show HN: SparseLab–real sparse training(CSR+custom kernel) in PyTorch, CPU-first

https://news.ycombinator.com/from?site=github.com/darshanfofadiya
1•DARSHANFOFADIYA•41m ago•1 comments

Skopx – AI analytics platform with built-in project management

https://skopx.com
1•skopx•42m ago•1 comments

AI GTM plugin for SEO and GEO in Claude Code

https://github.com/zhizdev/overgrow
4•zhizdev•43m ago•0 comments

Writing Matters

https://blog.apaonline.org/2026/04/22/writing-matters/
3•jruohonen•45m ago•1 comments

Private health records of half a million Britons for sale on Chinese Alibaba

https://www.theguardian.com/technology/2026/apr/23/private-health-records-uk-biobank-chinese-webs...
1•0in•46m ago•0 comments

Ask HN: Why are companies so distrustful of remote employees?

2•lyfeninja•51m ago•6 comments

Meta Layoffs – 10% in May

https://www.npr.org/2026/04/23/nx-s1-5797855/meta-layoffs-10-percent-staff
1•dzonga•51m ago•1 comments

How to Win

https://nekolucifer.substack.com/p/how-to-win
2•andai•53m ago•0 comments

Scraped from ancient Roman toilets, remains expose a pathogen found earlier

https://phys.org/news/2026-04-ancient-roman-toilets-crusted-expose.html
1•wglb•55m ago•2 comments