frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Scientists Find Intriguing Link Between Ozempic and Violent Behavior

https://gizmodo.com/scientists-find-intriguing-link-between-ozempic-and-violent-behavior-2000772629
1•akyuu•1m ago•0 comments

Structural steel estimating: the steps were never the hard part

https://bidferra.com/blog/the-honest-guide-to-structural-steel-estimating
1•fazlerocks•4m ago•0 comments

Lenovo releases new 14-inch ThinkPad with 64 GB RAM and built-in pen

https://www.notebookcheck.net/Lenovo-releases-new-14-inch-ThinkPad-with-64-GB-RAM-and-built-in-pe...
1•teleforce•5m ago•0 comments

RFC 10008: The HTTP Query Method

https://www.rfc-editor.org/info/rfc10008/
1•schappim•7m ago•0 comments

From Combinatorial Mess to Linear Elegance: Architecting a Conversion Engine

https://blog.minimal.app/conversion-engine/
1•arthurofbabylon•9m ago•0 comments

Could Earth have sent life to Jupiter's moon Europa?

https://phys.org/news/2026-06-earth-life-jupiter-moon-europa.html
1•pseudolus•11m ago•1 comments

Color Picking OKLCH for Mortals

https://hugodaniel.com/posts/color-picking-oklch/
1•hugodan•12m ago•0 comments

Is MCP a sign of the reopening of the internet?

https://bakkenbaeck.com/tech/is-mcp-the-reopening-of-the-internet
1•_n_nym__s•13m ago•0 comments

Zlib-Rs in Firefox

https://trifectatech.org/blog/zlib-rs-in-firefox/
1•mcraiha•14m ago•0 comments

Ask HN: Does your mind drift while waiting for AI prompts to finish?

1•cryptoSympozium•21m ago•7 comments

The MRV engine for carbon removal

https://www.cula.tech/
1•doener•21m ago•0 comments

Against essential and accidental complexity (2020)

https://danluu.com/essential-complexity/
1•pramodbiligiri•21m ago•0 comments

Magnetically Hovering Guitar Strings

https://www.youtube.com/watch?v=ueCO4spGNPs
1•SweetSoftPillow•22m ago•0 comments

Ask HN: How much we change since LLM era?

1•modinfo•24m ago•0 comments

Dear Pinboard, I'm breaking up with you. It's me and it's you

https://michaelharley.net/posts/2026/06/16/dear-pinboard-im-breaking-up-with-you-its-me-and-its-you/
1•shaunpud•24m ago•0 comments

The Internet Isn't in the Cloud. It's on the Ocean Floor

https://axisbrief.substack.com/p/the-internet-isnt-in-the-cloud-its
2•Axis_Brief•25m ago•0 comments

Google: The New SDLC with Vibe Coding (2026)

https://www.kaggle.com/whitepaper-the-new-SDLC-with-vibe-coding
1•kubik369•28m ago•0 comments

W Social, Public Institutions and the Theater of European Digital Sovereignty

https://blog.elenarossini.com/w-social-public-institutions-and-the-theater-of-european-digital-so...
2•rapnie•28m ago•0 comments

Mastra NPM Supply Chain Attack: 140 Packages Backdoor via easy-day-JS Typosquat

https://www.stepsecurity.io/blog/mastra-npm-packages-compromised-using-easy-day-js
1•shaunpud•30m ago•0 comments

Show HN: OpenTalk2HTML – Convert video meeting transcripts to readable HTML

https://github.com/Aimino-Tech/opentalk2html
1•xducn1•32m ago•0 comments

AI Scenarios 2030: Helping policymakers plan for the future of AI

https://www.gov.uk/government/publications/ai-scenarios-2030-helping-policymakers-plan-for-the-fu...
1•hunglee2•32m ago•0 comments

The (Fake) Long Decline of Fertility

https://lymanstone.substack.com/p/the-fake-long-decline-of-fertility
1•barry-cotter•32m ago•0 comments

Show HN: Noema64 – an open-source LLM chess engine (still in beta though)

https://github.com/ahmeddyounis/noema64
1•ahmed_duski•34m ago•1 comments

Pipkin's Light Bulb Moment

https://spark.iop.org/pipkins-light-bulb-moment
1•redbell•34m ago•0 comments

SwissJURA3D – 3D geological model of the Swiss Jura

https://www.swisstopo.admin.ch/en/jura3d-en
1•bschne•37m ago•0 comments

AI Made Internal Tools Easy to Build. Keeping Them Alive Is the Hard Part

https://www.dforge.io/blog/internal-tools-built-to-last
1•andreypt•37m ago•0 comments

Chrome Extensions: The Hidden Risks No One Talks About and How to Stay Safe

https://old.reddit.com/r/AgentContext_dev/comments/1u862iu/chrome_extensions_the_hidden_risks_no_...
1•javaeeeee•39m ago•0 comments

Nanowar of Steel – Kotlin (Official Power Point Video)

https://www.youtube.com/watch?v=BsfXZjKLT9A
2•thinker5555•40m ago•0 comments

Show HN: Open-Source RAG Security Kit for Zero-Trust Retrieval

https://blog.aetherguard.ai/building-a-zero-trust-security-layer-for-rag-pipelines
1•aamir_m•41m ago•0 comments

Engineering vs. Software

https://crackedbeefcake.com/on/engineering/
1•lazerjesus•43m ago•0 comments