frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•11mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•11mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•11mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Day 80 of internet blackout: We gave up in freedom of information in Iran [video]

https://www.youtube.com/watch?v=dQw4w9WgXcQ
1•us321•51s ago•0 comments

I don't think AI will make your processes go faster

https://frederickvanbrabant.com/blog/2026-05-15-i-dont-think-ai-will-make-your-processes-go-faster/
1•TheEdonian•1m ago•0 comments

I use LLMs as a staff engineer in 2026

https://www.seangoedecke.com/how-i-use-llms-in-2026/
2•alexharri•4m ago•0 comments

Shellify: Turning websites into Android apps, without the telemetry tax

https://github.com/smellouk/shellify5
2•dalidx88•4m ago•0 comments

Apple Silicon costs more than OpenRouter

https://www.williamangel.net/blog/2026/05/17/offline-llm-energy-use.html
3•datadrivenangel•6m ago•0 comments

What we learned from a cringey courtroom drama between Elon Musk and Sam Altman

https://www.theguardian.com/us-news/2026/may/16/what-we-learned-elon-musk-sam-altman
2•beardyw•10m ago•0 comments

Products are out, brains are in

https://mrmarket.bearblog.dev/products-are-out-brains-are-in-new/
2•mrmarket•11m ago•0 comments

Scientists believe ibogaine can help veterans overcome PTSD

https://www.bbc.com/future/article/20260514-how-hallucinogenic-ibogaine-helps-veterans-overcome-ptsd
2•bushwart•12m ago•1 comments

Show HN: Snatch Guard – iOS theft detection with accelerometer and Screen Time

https://snatchguard.app
2•olegmmv•16m ago•0 comments

One IP, Six Crawler Identities, One Second (Detection via Nginx Logs)

https://speytech.com/insights/rotational-bot-identity-detection/
2•william1872•16m ago•0 comments

RCP8.5 Is Officially Dead

https://rogerpielkejr.substack.com/p/rcp85-is-officially-dead
1•RickJWagner•18m ago•0 comments

Show HN: Peeklens – Palantir for Marketing

https://peeklens.ai/
1•ramsono•18m ago•0 comments

Prolonging healthy aging: Longevity vitamins and proteins (Study, 2018)

https://pmc.ncbi.nlm.nih.gov/articles/PMC6205492/
1•pogue•23m ago•0 comments

Native all the way, until you need text

https://justsitandgrin.im/posts/native-all-the-way-until-you-need-text/
3•dive•25m ago•1 comments

Every AI Subscription Is a Ticking Time Bomb for Enterprise

https://www.thestateofbrand.com/news/ai-subscription-time-bomb
2•mooreds•26m ago•0 comments

How to Write Articles and Essays Quickly and Expertly (2006)

https://www.downes.ca/post/38526
1•downbad_•27m ago•0 comments

Nine Things I Learned in Ninety Years [pdf]

https://edwardpackard.com/wp-content/uploads/2026/04/Nine-Things-I-Learned-In-Ninety-Years.pdf
1•jimsojim•27m ago•0 comments

Kaiden: Workstation AI Sandbox Desktop Application

https://openkaiden.ai/
1•illusive4080•27m ago•1 comments

Ebola epidemic in DRC, Uganda public health emergency of international concern

https://www.who.int/news/item/17-05-2026-epidemic-of-ebola-disease-in-the-democratic-republic-of-...
3•JumpCrisscross•30m ago•0 comments

How Agile became a mis-Agile Disaster

https://medium.com/@andvgal/how-agile-became-a-mis-agile-disaster-1c1905cba329
1•andvgal•32m ago•0 comments

The age of thin clients and middle managers

https://kixpanganiban.bearblog.dev/the-age-of-thin-clients-and-middle-managers/
2•kixpanganiban•37m ago•0 comments

Claude Code Did the Heavy Lifting to Get Adobe Lightroom CC Running on Linux

https://www.phoronix.com/news/Adobe-Lightroom-CC-Linux
2•bno1•38m ago•0 comments

Your browser probably lies to the big sites (blame Chrome)

https://hackaday.com/2026/05/16/your-browser-probably-lies-to-the-big-sites-blame-chrome/
1•notpushkin•43m ago•0 comments

China bypasses US GPU bans with 1.54-exaflops 'LineShine' supercomputer

https://www.tomshardware.com/tech-industry/artificial-intelligence/china-bypasses-us-gpu-bans-wit...
2•giuliomagnifico•43m ago•0 comments

Mnemonicai – AI that learns from your company's work, not your docs

https://mnemonic.nishantvanawala6118.workers.dev
1•Nishvana•46m ago•0 comments

AI in Finance: What Is Working Today

https://members.sigmazero.cc/posts/ai-in-finance-is-157955538?postId=ai-in-finance-is-157955538
2•sigmazero•46m ago•0 comments

Pixal3D: Pixel-Aligned 3D Generation from Images

https://ldyang694.github.io/projects/pixal3d/
2•steveharing1•50m ago•0 comments

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

https://github.com/Diolinux/PhotoGIMP
1•SockThief•56m ago•0 comments

Private Networking on Hetzner Cloud with Tailscale

https://onatm.dev/2026/01/28/private-networking-on-hetzner-cloud-with-tailscale/
1•onatm•57m ago•0 comments

Agent skill for UB detection in Rust

https://twitter.com/i/status/2055439039692452106
1•Dowwie•58m ago•1 comments