frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•7mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•7mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•7mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•7mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Ban HN posts with links behind paywalls – YES or NO?

1•JimmyJamesJames•29s ago•0 comments

Bill Ackman Proposes SpaceX IPO Through Sparc, Prioritizing Tesla Shareholders

https://finance.yahoo.com/news/bill-ackman-proposes-elon-musks-193012345.html
1•SilverElfin•1m ago•0 comments

Origami on Another Level with 3D Printing

https://hackaday.com/2025/12/21/origami-on-another-level-with-3d-printing/
1•fangpenlin•4m ago•0 comments

Britons are less well off than they were in 2019 – and these figures show it

https://news.sky.com/story/britons-poorer-than-they-were-in-2019-as-living-standards-continue-to-...
2•ivewonyoung•5m ago•0 comments

The Duodecimal Bulletin, Vol. 55, No. 1, Year 1209 [pdf]

https://dozenal.org/drupal/sites_bck/default/files/DuodecimalBulletinIssue551.pdf
2•susam•7m ago•0 comments

NYC Spends $200 Million on Cell Service for School Chromebooks

https://nysfocus.com/2025/12/22/eric-adams-school-chromebooks-contract
5•h2si•9m ago•0 comments

I Stopped Reading and Embraced Audiobooks

https://www.nytimes.com/2025/12/21/books/review/audiobooks-reading-listening-habits.html
3•lxm•10m ago•0 comments

Administration suspends 5 wind projects off East Coast, cites security concerns

https://apnews.com/article/trump-offshore-wind-energy-climate-c0ac1e447c93126327f1922327921aa0
3•JKCalhoun•11m ago•1 comments

I Died on DMT

https://rebeccadai.substack.com/p/i-died-on-dmt
4•randomparticlez•15m ago•0 comments

Running Claude Code from my phone via SSH with Tailscale and tmux

https://www.qu8n.com/posts/running-claude-code-from-my-phone
3•quanwinn•15m ago•0 comments

In stock on Framework Desktop and updates on the industry-wide silicon crunch

https://community.frame.work/t/in-stock-on-framework-desktop-and-updates-on-the-industry-wide-sil...
2•nateb2022•20m ago•0 comments

Data 2025: The year in review with Mike Stonebraker and Andy Pavlo

https://www.dbos.dev/webcast-2025-in-review-with-mike-stonebraker-and-andy-pavlo
2•teleforce•22m ago•0 comments

Antimeme Antichrist

https://www.johnnychang.com/antimeme/
3•zcase•24m ago•1 comments

FDA Approves Pill Version of Wegovy

https://www.wired.com/story/fda-approves-pill-version-of-wegovy/
4•stein1946•24m ago•2 comments

Release Trains Aren't About Releases

https://cameronwestland.com/release-trains-arent-about-releases/
2•camwest•25m ago•1 comments

Wozmonc64.bas

https://github.com/gabrielsroka/gabrielsroka.github.io/blob/master/wozmon/wozmonc64.bas
2•gabrielsroka•26m ago•1 comments

State of the Geomagnetic Field, December 2025 (PDF 19 pages)

https://www.ncei.noaa.gov/sites/default/files/2025-12/WMM%20SoGF%20Dec2025%20508.pdf
2•defrost•30m ago•0 comments

A House Ahead of Its Time (2024)

https://www.nist.gov/feature-stories/house-ahead-its-time
2•1659447091•33m ago•0 comments

Exposing a $10B Fraudulent Debt Relief Industry [video]

https://www.youtube.com/watch?v=hBNWInRS6hM
3•fortran77•36m ago•0 comments

President Trump Announces New Trump Class Battleship

https://www.navy.mil/Press-Office/Press-Releases/display-pressreleases/Article/4366856/president-...
4•duxup•37m ago•2 comments

Show HN: StringTune-3D – Control Three.js scenes via CSS variables

https://github.com/penev-palemiya/StringTune-3D
2•penev_tech•46m ago•0 comments

We removed 80% of our agent's tools

https://vercel.com/blog/we-removed-80-percent-of-our-agents-tools
2•forks•48m ago•0 comments

Jim Beam halts production at main distillery for a year

https://www.bbc.com/news/articles/cy5gv5z24n2o
4•geox•53m ago•0 comments

Snitch – a friendly netstat alternative for humans

https://github.com/karol-broda/snitch
3•karol-broda•53m ago•0 comments

Grok Collections API

https://x.ai/news/grok-collections-api
2•lmariscal•55m ago•1 comments

San Francisco power won't be fully restored until Tuesday, PG&E says

https://www.sfgate.com/bayarea/article/san-francisco-power-pge-outage-21255221.php
2•bryan0•59m ago•0 comments

Colorado Windstorm Causes 4.8 Microsecond Glitch in Official US Time

https://gizmodo.com/colorado-windstorm-causes-4-8-microsecond-glitch-in-official-u-s-time-2000702524
2•Stratoscope•59m ago•1 comments

Salisbury Steak is super weird

https://www.youtube.com/watch?v=d9z8DK_VWKs
2•bane•59m ago•0 comments

Document search using Claude and Inverted Index

https://annanay.dev/claude-inverted-index-search/
2•annanay•1h ago•0 comments

Aspartame study suggests that current guidelines should be re-examined

https://www.sciencedirect.com/science/article/pii/S0753332225010856
2•shaggie76•1h ago•0 comments