frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•11mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•11mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•11mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•11mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Show HN: Platypus – Local meeting transcription, notes, and chat (Tauri, Rust)

https://platypusnotes.com/
1•pixelmash13•27s ago•0 comments

CVE-2026-42167: SQLi and possible auth bypass or RCE in ProFTPD

https://zeropath.com/blog/proftpd-cve-2026-42167-auth-bypass-privesc-rce
1•AllAlongTheWay•36s ago•1 comments

Tell HN: GitHub Issues Issues

1•debarshri•37s ago•0 comments

Copilot-arewecooked – Know your AI credit cost before June first

https://github.com/PanAchy/copilot-arewecooked
1•panachy•1m ago•1 comments

Pigtikal (puzzles in geometry that I know and love)

https://arxiv.org/abs/0906.0290
1•t-3•2m ago•0 comments

NIS2 compliance for your company? Checklist before you start

https://www.getprobo.com/hub/nis2-compliance-checklist-tech-companies-2026
1•arthurmyx•3m ago•0 comments

Tell HN: Fastmod Is Nice

1•gchamonlive•3m ago•0 comments

Uber Expands to Hotel Bookings with Expedia Partnership

https://www.nytimes.com/2026/04/29/travel/uber-hotel-booking-expedia.html
1•xnx•3m ago•0 comments

The 'expert' who said 'globalization would end war' – 5 years before WWI

https://www.schiffsovereign.com/investing/the-expert-who-said-globalization-would-end-war-5-years...
1•speckx•3m ago•0 comments

Supreme Court appears split over controversial use of 'geofence' search warrants

https://techcrunch.com/2026/04/28/scotus-chatrie-geofence-search-warrant-ruling-arguments/
2•pseudolus•3m ago•0 comments

Show HN: Label Design App for BT Thermal Printers – Niimbot, "Cat Printers"

https://github.com/lukaszliniewicz/catlabel
2•starry_air•3m ago•1 comments

Show HN: Cogito The Strava for Reading

https://www.cogito-app.io
1•hugobeey•4m ago•0 comments

A giant succession wave is coming for family businesses

https://www.economist.com/interactive/business/2026/04/09/a-giant-succession-wave-is-coming-for-f...
1•gmays•5m ago•0 comments

Cursor Camp

https://neal.fun/cursor-camp/
1•meetpateltech•6m ago•0 comments

Show HN: TreasuryFlow – AI CFO that runs in your spreadsheet, from $49/mo

https://treasuryflow.pantollventures.com/from-bench
1•michaelgdwn•6m ago•0 comments

Ask HN: Mining Scientific Papers

1•davidbjaffe•8m ago•0 comments

Show HN: Is Opus Ok Today?

https://isopusok.today/
2•balajmarius•8m ago•0 comments

Dreamware, Judgeware, Actware

https://valand.dev/blog/post/dreamware-judgeware-actware
1•valand•9m ago•0 comments

Put it in pencil: NASA's Artemis III mission to launch no earlier than late 2027

https://arstechnica.com/space/2026/04/put-it-in-pencil-nasas-artemis-iii-mission-will-launch-no-e...
1•rbanffy•10m ago•0 comments

A billion miles in less than a decade: GM's Super Cruise reaches a milestone

https://arstechnica.com/cars/2026/04/gms-super-cruise-passes-a-billion-driven-miles-since-2017/
1•rbanffy•11m ago•1 comments

Ask HN: Is it still worth it to try to get a job in IT

3•morpheos137•12m ago•2 comments

BeddyColor – Free Printable Coloring Pages for Kids

https://beddycolor.com
1•yimiqidage001•12m ago•0 comments

Oil price jumps to $117 after reports of 'extended' Iran blockade

https://www.bbc.com/news/articles/cj4pxr0gr02o
3•tartoran•12m ago•1 comments

Designers Are Having Fun. Again

https://metedata.substack.com/p/010-designers-are-having-fun-again
1•young_mete•12m ago•0 comments

Show HN: Plume – One hand-picked word a day, with etymology and pronunciation

https://apps.apple.com/us/app/plume-word-of-the-day/id6762819388
1•claudiusa•12m ago•0 comments

Amazon is offering new OpenAI products on AWS

https://techcrunch.com/2026/04/28/amazon-is-already-offering-new-openai-products-on-aws/
2•Brajeshwar•14m ago•1 comments

Show HN: BlAST Engine: AST-free static analyzer to generate agents.md in CI pipe

https://github.com/squid-protocol/gitgalaxy
1•squid-protocol•14m ago•0 comments

Only Elon Musk can fire Elon Musk from SpaceX, filing shows

https://www.reuters.com/world/only-elon-musk-can-fire-elon-musk-spacex-filing-shows-2026-04-29/
8•spenvo•14m ago•1 comments

The Right Amount of Automation

https://joelholmes.dev/blog/2026-04-29-right-amount-of-automation/
1•holmes89•15m ago•0 comments

I built an open source Harvey/Legora in two weeks

https://mikeoss.com
1•willchennn•16m ago•1 comments