frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•11mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•11mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•11mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Signing off in a world of what's next

https://om.co/2026/05/12/signing-off-in-a-world-of-whats-next/
1•herbertl•3m ago•0 comments

Thucydides Trap

https://en.wikipedia.org/wiki/Thucydides_Trap
1•theletterf•3m ago•0 comments

Hello Universe: NASA's Next-Gen RISC-V Space Processor Undergoes Testing

https://www.jpl.nasa.gov/news/hello-universe-nasas-next-gen-space-processor-undergoes-testing/
1•rbanffy•4m ago•0 comments

The Readable Mind: LLMs as Psychological Infrastructure (2026)

https://zenodo.org/records/20179361
1•dhedegreen•5m ago•0 comments

Working Hard

https://joy.ente.com/working-hard/
2•darthShadow•6m ago•0 comments

Ten Things Every Trial Lawyer Could Learn from Vincent La Guardia Gambini [pdf]

https://s3.amazonaws.com/law-media/uploads/198/35361/original/Anderson_TenThings_SU2016.pdf
1•kmstout•6m ago•0 comments

Ryan Cohen hits back at eBay, says his takeover proposal should not be dismissed

https://www.reuters.com/world/ryan-cohen-says-ebay-directors-should-not-dismiss-his-proposal-with...
2•AdmiralAsshat•7m ago•0 comments

Myths about /dev/urandom

https://www.2uo.de/myths-about-urandom/
1•signa11•8m ago•0 comments

I Made Timelapses of Artemis [video]

https://www.youtube.com/watch?v=mtwxaZDek8Y
1•db48x•11m ago•0 comments

Show HN: CrowdRank – live leaderboards for internet arguments

https://crowdrank.app
1•Skinless1501•14m ago•0 comments

Academy of Management Pulls 13,500-Person Conference Out of the U.S.

https://meetings.skift.com/2026/05/13/academy-of-management-pulls-13500-person-conference-out-of-...
2•akyuu•16m ago•0 comments

Reactionary Me: Windows ME KDE Plasma Theme

https://store.kde.org/p/2331482
1•klaussilveira•17m ago•0 comments

Pushing Local Models with Focus and Polish

https://lucumr.pocoo.org/2026/5/8/local-models/
1•swah•19m ago•0 comments

AGENTS.md — Pretending to Be a Good Human

https://gist.github.com/skorotkiewicz/127c96ccc7324aaaff949bad3ea89255
2•modinfo•20m ago•0 comments

AI Agents for Business in 2026

https://www.dhawalshah.net/article/ai-agents-for-business-2026/
1•djshah•21m ago•0 comments

Where Did All the AK-47s Go?

https://www.nytimes.com/2026/05/13/us/ak-47.html
2•klaussilveira•23m ago•0 comments

Show HN: Tlbic – A flexible, time-limited basic income credit (3rd Ed, French)

1•michikawa59•24m ago•1 comments

OpenAI Parameter Golf: what 1,100 researchers built in six weeks

https://www.runpod.io/blog/openai-parameter-golf-runpod-challenge
2•mooreds•25m ago•0 comments

Carbon-Based Textile-Structured Triboelectric Nanogenerators for Smart Wearables

https://wiley.scienceconnect.io/error?msg=ewogICJpZCIgOiAiOGEwODk4YTctNmRmNC00ZTgyLWJiMTUtMDFkMjM...
1•ludicrousdispla•25m ago•0 comments

GNU SASL gsasl 2.2.3 released with a security fix

https://lists.gnu.org/archive/html/help-gsasl/2026-05/msg00001.html
1•neustradamus•28m ago•0 comments

Automating code security review: Mythos-level capabilities at lower cost

https://www.synthesia.io/post/automating-code-security-reviews-with-claude-mythos-level-capabilities
1•alexvoica•29m ago•0 comments

The Origins of "Hello, World" [video]

https://www.youtube.com/watch?v=vLer3fRwwxE
1•arbayi•29m ago•0 comments

They Said It Would Cost $54M. We Said "No Thanks."

https://nateglubish.substack.com/p/they-said-it-would-cost-54-million
10•idw•29m ago•0 comments

The Tesla Semi could be a big deal for electric trucking

https://www.technologyreview.com/2026/05/14/1137197/tesla-semi-electric-trucking/
2•joozio•30m ago•2 comments

Scribus – open-source Desktop Publishing

https://www.scribus.net/
2•Tomte•31m ago•1 comments

AMD reaches 46% of server x86 CPU revenue

https://www.tomshardware.com/pc-components/cpus/amd-reaches-46-percent-of-server-x86-cpu-revenue-...
2•giuliomagnifico•31m ago•0 comments

What you measure depends on where you draw the boundary

https://blog.arkstack.dev/en/blog/compensation-correctness-saga-benchmark/
1•arkstack•35m ago•0 comments

65% of Girls Who Use AI-Assisted Devices See Them as "Friends"

https://www.girlscouts.org/en/footer/press-room/2026-press-announcements/research-finds-girls-vie...
2•susiecambria•36m ago•0 comments

USA's PRE-Stuxnet Cyber Weapon: What FAST16 Reveals About State-Level Malware

https://twit.tv/posts/tech/inside-americas-pre-stuxnet-cyber-weapon-what-fast16-reveals-about-sta...
2•NN88•37m ago•0 comments

Android rolling out AI 'Contextual suggestions' that learn from your habits

https://9to5google.com/2026/05/13/android-contextual-suggestions/
1•petee•39m ago•1 comments