frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•1y ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•1y ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•1y ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•1y ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Federal Trade Commission sues leading transgender health group

https://www.cnn.com/2026/06/17/health/ftc-transgender-care-lawsuit
1•2OEH8eoCRo0•3m ago•0 comments

Thoma Bravo hands Medallia to lenders in one of private equity's biggest losses

https://www.ft.com/content/ae4b6c77-9b3d-46fb-a7fc-59f072e7291b
1•petethomas•5m ago•0 comments

Rationale for the 2026 Iran War

https://en.wikipedia.org/wiki/Rationale_for_the_2026_Iran_war
2•doener•8m ago•0 comments

How Does One Brain Speak Two Languages?

https://www.nytimes.com/2026/06/15/science/brain-language-grammar.html
2•Anon84•9m ago•2 comments

Agent finder for GitHub Copilot now available

https://github.blog/changelog/2026-06-17-agent-finder-for-github-copilot-now-available/
2•soheilpro•10m ago•0 comments

GitHub Copilot app generally available

https://github.blog/changelog/2026-06-17-github-copilot-app-generally-available/
2•soheilpro•10m ago•0 comments

South Georgia Student Develops Method to Control Kudzu [video]

https://www.youtube.com/watch?v=jDCCuiFynrU
1•assimpleaspossi•13m ago•0 comments

Copilot individual plan sign-ups are reopening

https://github.blog/changelog/2026-06-17-copilot-individual-plan-sign-ups-are-reopening/
1•soheilpro•16m ago•0 comments

Show HN: Pitch-by-pitch baseball simulation app to simulate games and seasons

https://baseball.chesterton.tech/
2•HaxleRose•26m ago•3 comments

A Human Artist's Defense of AI Art

https://asherperlman.substack.com/p/a-human-artists-defense-of-ai-art
2•erikschoster•26m ago•0 comments

Best Laptops

https://www.wired.com/story/best-laptops/
2•adrianwaj•30m ago•0 comments

Free Will (2026)

https://gt.ms/blog/free-will/
1•geetuu•31m ago•1 comments

Bernie Sanders unveils plan to give the public direct ownership of AI companies

https://apnews.com/article/bernie-sanders-ai-public-ownership-57b9f20d96490083e2749adba0f13977
4•petethomas•32m ago•3 comments

ArsenalOS: Anduril's Digital Manufacturing Backbone

https://www.anduril.com/news/introducing-arsenalos-tm-anduril-s-digital-manufacturing-backbone
1•ilreb•32m ago•0 comments

Claude Fable 5: The harness matters more than the model

https://www.endorlabs.com/learn/claude-fable-5-take-two-same-model-different-harness-and-a-very-d...
1•bugvader•33m ago•0 comments

Estonia assigns personal ID numbers to AI agents to grant them "authorizations"

https://www.bloomberg.com/news/articles/2026-06-17/estonia-to-grant-ai-bots-legal-rights-with-per...
3•thoughtpeddler•34m ago•0 comments

USP – Write once in Markdown, post everywhere

https://github.com/adamarutyunov/usp
2•tmatme•35m ago•0 comments

Making GHC Upgrades Easy

https://blog.haskell.org/making-ghc-upgrades-easy/
2•birdculture•37m ago•0 comments

Loop Unrolling in the ML Era

https://hiraditya.github.io/posts/why-loop-unrolling-is-popular-again/
1•matt_d•38m ago•0 comments

JPEG XL Art Gallery

https://jpegxl.info/art/big_gallery.html
1•6581•38m ago•0 comments

Show HN: A free CLI coding agent, powered by ads

https://freebuff.com
6•moado•39m ago•7 comments

Seven Perfect Shuffles Randomize a Deck of Cards. But How Many Sloppy Ones?

https://www.quantamagazine.org/seven-perfect-shuffles-randomize-a-deck-of-cards-but-how-many-slop...
1•jnord•44m ago•0 comments

Why standard WER fails for Indian languages

https://www.sarvam.ai/blogs/evaluating-indian-language-asr
1•laxmena•44m ago•0 comments

Vlk: MemAct for the IDE – persistent working memory agents can prune themselves

https://github.com/aranajhonny/vlk
1•akatsutki•47m ago•0 comments

Taxonomy of the Occlupanida (parasitoids on bread bag tags)

https://www.horg.com/horg/?page_id=921
5•beatthatflight•49m ago•2 comments

StackOverflow closed my OpenClaw and paperclipAI integration q. as "irrelevant"

https://stackoverflow.com/questions/79958607/how-do-i-view-server-logs-from-paperclipai-being-run...
3•khelavastr•50m ago•1 comments

License Plate Cameras Will Soon Track Phones, Wearables, Infotainment and Pets

https://www.thedrive.com/news/license-plate-cameras-will-soon-track-phones-wearables-infotainment...
6•xoxxala•51m ago•0 comments

Clojure Hosted on Go

https://github.com/glojurelang/glojure
3•dnlo•55m ago•0 comments

Show HN: ML condenses billions of logs into a tiny snapshot your LLM can debug

https://github.com/Rocketgraph/rocketgraph
5•kvaranasi_•55m ago•2 comments

A Secret Microsoft Tool Fixed Windows Performance [video]

https://www.youtube.com/watch?v=jH0BYAkPj78
3•tambourine_man•57m ago•0 comments