frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•10mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•10mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•10mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•10mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Tag Wrangling Committee

https://www.transformativeworks.org/committees/tag-wrangling-committee/
1•Tomte•48s ago•0 comments

Primate 0.37: Revised modules, database migrations, and typed environment access

https://primate.run/blog/primate-037
1•terrablue•1m ago•0 comments

Go-LLM-proxy v0.3 released – translating proxy for Claude Code and Codex

https://go-llm-proxy.com
1•yatesdr•3m ago•1 comments

Businesses you can start and run from a phone?

https://old.reddit.com/r/Entrepreneur/comments/2r8vop/businesses_you_can_start_and_run_from_a_phone/
2•peter_d_sherman•3m ago•0 comments

stillOS 10 – A Linux distro designed to be as approachable as Windows/macOS

https://stillhq.io/stillos-10-1-is-here/
1•shaicoleman•4m ago•0 comments

Show HN: WinForge A daily OS connects your annual goals to what you do today

https://www.winforge.app/
1•ksull10•5m ago•1 comments

Just because it's work shaped doesn't make it productive

https://nishtahir.com/just-because-its-work-shaped-doesnt-make-it-productive/
1•CarefreeCrayon•8m ago•0 comments

Iranian missile blitz takes down AWS data centers in Bahrain and Dubai

https://www.tomshardware.com/tech-industry/iranian-missile-blitz-takes-down-aws-data-centers-in-b...
2•lschueller•9m ago•0 comments

This Finnish Privacy-Focused Linux Phone Wants You to Forget Google Exists

https://www.yankodesign.com/2026/04/02/this-finnish-privacy-focused-linux-phone-wants-you-to-forg...
2•doctaj•11m ago•0 comments

Show HN: Facebook marketplace arbitrage tool

https://tryvalue.ai/
1•windyVector•12m ago•0 comments

Coldseq

https://coldseq-478d.vercel.app/auth
1•Javelorant•12m ago•0 comments

The Moment That Reset Robotics

https://www.youtube.com/watch?v=2mrGMMmrVNE
1•wjSgoWPm5bWAhXB•12m ago•0 comments

The college student–and his cat meme–who hunted the biggest cyberweapon

https://www.msn.com/en-us/money/other/the-college-student-and-his-cat-meme-who-hunted-the-world-s...
1•collinmanderson•15m ago•0 comments

See inside your agent's brain

https://kern-ai.com/blog/memory-ui
3•obilgic•16m ago•0 comments

(Synced) Passkey Is Weak

https://yourpasskeyisweak.com/
2•T3OU-736•18m ago•0 comments

Show HN: Hypedar – what's trending in AI that nobody has built yet

https://hypedar.dev/
2•codepawl•19m ago•0 comments

Delta Chat: Zero metadata, group descriptions, native audio/video calls and more

https://delta.chat/en/2026-03-31-zero
2•dabber21•20m ago•0 comments

Ask HN: Anyone started a solo business in the last 6 months and made it work?

1•asim•23m ago•2 comments

Iran Targets Datacenters

https://substack.com/@shanakaanslemperera/note/c-238220142
2•aj7•23m ago•1 comments

Three months of agentic coding – my experience

https://meertens.dev/blog/three-months-of-agentic-coding/index.html
2•rmeertens•26m ago•0 comments

Sopwith

http://www.sopwith.org/
1•elvis70•27m ago•0 comments

One Month of Wispr: From First Release to CLI

https://stormacq.com/2026/03/30/one-month-of-wispr-from-first-release-to-cli
1•mariuz•28m ago•0 comments

Websites frozen in time: Pages abandoned in the 90s still live today [video]

https://www.youtube.com/undefined
2•souravmahapatra•29m ago•0 comments

Show HN: Bugparty.org an Ethereum-based forum and marketplace for agents

https://bugparty.org
1•stanleykm•31m ago•0 comments

US deploying nearly all stealthy long-range JASSM-ER cruise missiles to Iran war

https://www.msn.com/en-us/money/other/us-deploys-bulk-of-stealthy-long-range-missile-for-iran-war...
14•prmph•32m ago•9 comments

(Ab)use HDR images for marketing

https://tn1ck.com/blog/abuse-hdr-images-for-marketing
4•TN1ck•34m ago•3 comments

Apollo Guidance Computer Restoration Videos and Press Coverage

https://www.curiousmarc.com/space/apollo-guidance-computer
2•mariuz•34m ago•0 comments

Ultraplan with Claude Code

https://code.claude.com/docs/en/ultraplan
4•emschwartz•36m ago•2 comments

Reaffirming our commitment to child safety in the face of European Union inactio

https://blog.google/company-news/inside-google/around-the-globe/google-europe/reaffirming-commitm...
3•emptysongglass•37m ago•1 comments

Show HN: Running local OpenClaw together with remote agents in an open network

https://github.com/hybroai/hybro-hub
6•kevinlu•38m ago•1 comments