frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Download cash counter and help me

https://play.google.com/store/apps/details?id=com.logicforge.bdcash&hl=en_US
1•bdcashcounter•2m ago•0 comments

Turn HAR Files, Claude Code, Copilot CLI, and Codex CLI Logs into ATIF

https://github.com/waldekmastykarz/atifact
1•waldekm•2m ago•0 comments

Show HN: Sidekick – The zot coding agent, one click away on macOS

https://github.com/patriceckhart/zot-sidekick
7•patriceckhart•3m ago•0 comments

How much did OpenAI pay for Tomoro?

https://www.aienablementinsider.com/p/how-much-did-openai-pay-for-tomoro
1•dylancollins•5m ago•0 comments

I Found the US Nuclear Detection System in Space (GPS)

https://www.youtube.com/watch?v=DjLnIb41DuQ
1•valeg•6m ago•0 comments

You can now use your Gmail account in Proton Mail

https://proton.me/blog/proton-mail-connect-gmail
3•Topfi•7m ago•0 comments

Integer Overflow in Postgres

https://www.crunchydata.com/blog/the-integer-at-the-end-of-the-universe-integer-overflow-in-postgres
1•tosh•8m ago•0 comments

AI coding agents use your technology

https://developer.microsoft.com/blog/how-ai-coding-agents-actually-use-your-technology
1•waldekm•9m ago•0 comments

The AX stack: what's fixed, where you can win

https://developer.microsoft.com/blog/the-ax-stack-whats-fixed-where-you-can-win
1•waldekm•10m ago•0 comments

New version of "peers" – the AI couple doing things

https://github.com/c0decave/peers
1•dash0r•18m ago•0 comments

Bitcoin back above $61,000 after rout leads to $1.6B liquidations

https://www.coindesk.com/markets/2026/06/06/bitcoin-back-above-usd61-000-after-rout-leads-to-usd1...
3•Varun-Sakhuja•23m ago•0 comments

Bitrig – The best way to build native Swift apps with AI

https://bitrig.com
1•Austin_Conlon•25m ago•0 comments

Are AI chatbots making us lose control of our brains?

https://www.technologyreview.com/2026/06/05/1138427/are-ai-chatbots-making-us-lose-control-of-our...
1•joozio•25m ago•0 comments

Adaptive Low-Rank Transformer with Dynamic Expert Routing for Continual Learning

https://zenodo.org/records/20064618
1•jballanc•30m ago•0 comments

Show HN: Summarize YT Video by pasting url into AI chat

https://www.youtube.com/watch?v=BblwmuFhhOI
1•julienreszka•36m ago•1 comments

39C3: The art of text rendering (Nicolas Rougier)

1•signa11•40m ago•0 comments

Ask HN: For non-hackers/nerds, why do you read HN?

2•throwaway2037•41m ago•2 comments

Reversible Computing

https://en.wikipedia.org/wiki/Reversible_computing
1•peter_d_sherman•45m ago•0 comments

Mega-cap IPOs: Implications for institutional investors and index managers

https://www.ssga.com/nl/en_gb/institutional/insights/mega-cap-ipos-implications-for-institutional...
1•akg_67•45m ago•0 comments

How do non-green plants work? (2013)

https://plantsandprejudice.wordpress.com/2013/08/07/how-do-non-green-plants-work/
1•downbad_•46m ago•0 comments

Canada bans Texas cattle over flesh-eating screwworm outbreak in US

https://www.bbc.com/news/articles/cevpv3r7jmpo
4•1659447091•46m ago•1 comments

List of Quantum Logic Gates

https://en.wikipedia.org/wiki/List_of_quantum_logic_gates
1•peter_d_sherman•48m ago•0 comments

Ethnic nepotism and bias in Seattle tech industry

https://old.reddit.com/r/SeattleWA/comments/1ty5xsm/ethnic_nepotism_and_bias_in_seattle_tech_indu...
4•866-RON-0-FEZ•55m ago•1 comments

Ask HN: What would you name your own LLM?

2•akashwadhwani35•55m ago•2 comments

FakeCall: An Open Source App to simulate incoming calls on Android

https://github.com/DDOneApps/FakeCall
1•thunderbong•56m ago•0 comments

Industry Coalition Letter on Memory Shortage

https://tiaonline.org/policy/industry-coalition-letter-on-memory-shortage/
1•T-A•59m ago•0 comments

A Starbucks marketing stunt spiralled into mass boycotts in South Korea

https://www.theguardian.com/world/2026/jun/06/starbucks-south-korea-tank-day-promotion-blunder
3•beardyw•59m ago•1 comments

PostgreSQL 19 Beta 1 Released

https://www.postgresql.org/about/news/postgresql-19-beta-1-released-3313/
1•molf•1h ago•0 comments

Recommended Mystery Novels

https://bactra.org/notebooks/mystery-recs.html
2•mattbit•1h ago•0 comments

Running Python code in a sandbox with MicroPython and WASM

https://simonwillison.net/2026/Jun/6/micropython-in-a-sandbox/
3•pretext•1h ago•0 comments