frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•12mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•12mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•12mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Hollywood Invented the Girlboss

https://jacobin.com/2026/05/criterion-office-romances-women-workplace
1•robtherobber•6m ago•0 comments

Use LLMs to get an news report from the last week without the negativity

https://gist.githubusercontent.com/223740/16fcd4b403e5671e3c2d0aff0b11642c/raw/
1•alexander8776•8m ago•0 comments

Gabelle & Sel de devoir – French tax on & mandatory purchase of salt until 1946

https://en.wikipedia.org/wiki/Gabelle
1•burnt-resistor•9m ago•1 comments

Contributor Poker and Zig's AI Ban

https://kristoff.it/blog/contributor-poker-and-ai/
1•birdculture•9m ago•0 comments

Soviet Radio-86RK/i8080 computer IDE: emulator, assembler, C and PL/M compilers

https://rk86.ru
1•begoon•10m ago•0 comments

Asymmetric Flow Models

https://arxiv.org/abs/2605.12964
1•yorwba•17m ago•0 comments

FuriPhone FLX1s Linux Phone

https://sitespawn.ai
1•thunderbong•17m ago•0 comments

Show HN: Decentralized compute API on DePIN – scraping, OCR, JavaScript sandbox

https://revolution-network.fr/
1•korn_333•18m ago•0 comments

Australia orders China-linked investors to sell Northern Minerals stake

https://www.reuters.com/business/australia-treasurer-orders-six-shareholders-divest-northern-mine...
2•leonidasrup•22m ago•0 comments

Three's a party: US, China, and now Russia are on the prowl in GEO

https://arstechnica.com/space/2026/05/threes-a-party-us-china-and-now-russia-are-on-the-prowl-in-...
2•rbanffy•23m ago•0 comments

Ask HN: Could free/low cost LLMs be a momentary thing?

1•senda•28m ago•1 comments

Arcweave: Build interactive stories at the speed of thought

https://arcweave.com
1•doener•35m ago•0 comments

Two Autonomous Humanoid Robots Seamlessly Work Together to Make the Bed

https://laughingsquid.com/robots-make-bed/
1•air7•35m ago•0 comments

Show HN: Agent-QA: Open-source AI end-to-end testing for web and mobile apps

https://vostride.com
1•pranshuchittora•35m ago•1 comments

Two Americans Arrested After Crypto Stunt in Punch the Monkey's Zoo Enclosure

https://www.tokyoweekender.com/japan-life/news-and-opinion/american-arrested-ichikawa-zoo-punch-m...
2•razorbeamz•36m ago•0 comments

Apple May Add Auto-Deleting Chats to Siri as Gemini Powers Back End AI

https://firethering.com/apple-siri-revamp-privacy-google-gemini/
4•steveharing1•37m ago•3 comments

EchoPitch analyses emotional credibility in presentations before you deliver

https://echopitch.io
1•cavefishAI•41m ago•0 comments

Ask HN: Is Java the ideal language for LLM-assisted coding?

1•fragmede•41m ago•1 comments

New Design for the FreeBSD Website

https://cgit.freebsd.org/doc/commit/?id=c9c518d9dbb70240c23810f300ce4a5ba60442c6
1•vintagedave•41m ago•1 comments

Nobody's negotiating for the people here: Charlie Berens takes on AI datacenters

https://www.theguardian.com/us-news/ng-interactive/2026/may/17/comedian-charlie-berens-ai-datacen...
2•beardyw•42m ago•0 comments

Electric Clojure: Differential Dataflow for UI [video]

https://www.youtube.com/watch?v=ML8cFrWkWeg
1•farhanhubble•42m ago•0 comments

AI Foundry – Flat-Fee Unlimited LLM Inference on Blackwell GPUs in NZ

https://app.aifoundry.co.nz/auth/login?redirectTo=%2F
1•itsjpv•43m ago•0 comments

TechForges – Zero-Infra Software via Flux AI and Vibe-Mesh

https://sites.google.com/view/techforges/
1•SharavFounder•43m ago•0 comments

Human Code Reviews Are Dead

https://craftbettersoftware.com/p/human-code-reviews-are-dead
1•TheAnkurTyagi•47m ago•0 comments

OpenClaw creator burns through $1.3 mio in OpenAI API tokens in a single month

https://www.tomshardware.com/tech-industry/artificial-intelligence/openclaw-creator-burns-through...
2•m0do1•50m ago•0 comments

The Mercury logic programming system

https://github.com/Mercury-Language/mercury
1•Antibabelic•52m ago•0 comments

AI Won't Run Your Company by Itself

https://www.caimito.net/en/blog/2026/05/18/ai-wont-run-your-company-by-itself.html
5•berlianta•53m ago•0 comments

Grills and Smokers of 2026: Smart, Portable, Pellet

https://www.wired.com/story/best-grills-and-smart-grills/
1•joozio•54m ago•0 comments

Why the Spotify icon is a disco ball

https://mashable.com/article/spotify-disco-ball-icon-20-anniversary
1•doppp•54m ago•0 comments

Surprise AI bills leave AWS and Google Cloud users aghast

https://www.theregister.com/ai-ml/2026/05/18/surprise-ai-bills-leave-aws-and-google-cloud-users-a...
4•medalblue•55m ago•0 comments