frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•5mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•5mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•5mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•5mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Show HN: Constantine Bytensky's 9x20 Font

https://github.com/cbytensky/cnxt
1•kazinator•3m ago•0 comments

Musci.io – Text-to-Music AI Generator (20-30 second generation time)

https://musci.io/
1•xbaicai•4m ago•1 comments

Google DeepMind Announces Gempix2

https://gempix2.io
1•xbaicai•5m ago•1 comments

FireAI: One Platform to Chat, Create Images, and Design Posters

https://www.bedpage.com/
1•icefunc•13m ago•1 comments

Meta can't afford its $600B love letter to Trump

https://www.theregister.com/2025/11/08/meta_cant_afford_its_600b/
3•raybb•15m ago•0 comments

Why Sam Altman was booted from OpenAI, according to new testimony

https://www.theverge.com/ai-artificial-intelligence/814876/ilya-sutskever-deposition-openai-sam-a...
1•paladin314159•20m ago•0 comments

Where You See a Fancy Fish, Engineers See Alan Turing's Math

https://www.nytimes.com/2025/11/06/science/alan-turing-patterns-boxfish.html
1•mikhael•23m ago•0 comments

Elon Musk says building his own 'TeraFab' chip fab may be the only answer

https://www.tomshardware.com/tech-industry/semiconductors/elon-musk-says-terafab-chip-fab-may-be-...
2•SanjayMehta•28m ago•2 comments

How to make government work: Lessons from a rare British success story

https://samf.substack.com/p/how-to-make-government-work
1•rorylawless•29m ago•0 comments

The Geographic Distribution of China's Last Names, in Maps (2013)

https://www.theatlantic.com/china/archive/2013/10/the-geographic-distribution-of-chinas-last-name...
1•fzliu•35m ago•0 comments

Is Fast Charging Killing the Battery? A 2-Year Test on 40 Phones [video]

https://www.youtube.com/watch?v=kLS5Cg_yNdM
1•htk•36m ago•0 comments

Bootc for Workstation Use

https://lwn.net/SubscriberLink/1042708/90b68e222a964524/
1•todsacerdoti•37m ago•0 comments

Is microwave cooking nuking all the nutrients?

https://www.popsci.com/health/do-microwaves-destroy-nutrients/
2•wjb3•38m ago•1 comments

Show HN: I gave ChatGPT access to live stock market data

https://rallies.ai/
1•rallies•38m ago•0 comments

Trump Says U.S. Visas Can Be Denied to Fat People from Now On

https://newrepublic.com/post/202898/trump-us-visas-deny-fat-people-obesity
7•c420•39m ago•1 comments

Post Perihelion Data on 3I/Atlas

https://avi-loeb.medium.com/post-perihelion-data-on-3i-atlas-3d1e72be2bb4
1•ojosilva•42m ago•0 comments

Sam Altman's pants are on fire

https://garymarcus.substack.com/p/sam-altmans-pants-are-totally-on
14•toomuchtodo•42m ago•1 comments

Israel dumps millions into geo targeting evangelicals in churches and ChatGPT

https://www.disclose.tv/id/wrbhq1fa5c/
13•cramsession•46m ago•5 comments

Jensen Huang Gets It Wrong, Claude Gets It Right

https://www.oreilly.com/radar/jensen-huang-gets-it-wrong/
2•ubasu•50m ago•0 comments

Show HN: Hacker Reader – A clean, open-source Hacker News client for iOS

https://apps.apple.com/us/app/hacker-reader/id6754137305
1•danielcspaiva•51m ago•0 comments

Running a 68060 CPU in Quadra 650

https://github.com/ZigZagJoe/Macintosh-Q650-68060
3•zdw•1h ago•0 comments

How Press Photos Were Transmitted Back in the 1970s (2015)

https://petapixel.com/2015/07/26/this-is-how-press-photos-were-transmitted-back-in-the-1970s/
4•zdw•1h ago•0 comments

What is the sense behind ZFS's limits

https://unix.stackexchange.com/questions/336961/what-is-the-sense-behind-zfss-limits
3•caminanteblanco•1h ago•0 comments

What happens to your body after you drink a can of Coke

https://www.telegraph.co.uk/health-fitness/diet/nutrition/what-cola-does-to-your-body/
3•wjb3•1h ago•0 comments

Why I stopped proofreading and started to listen

https://refp.se/articles/I-stopped-proofreading-and-started-to-listen
2•refp•1h ago•0 comments

Older Adults Outnumber Children in 11 States

https://www.census.gov/newsroom/press-releases/2025/older-adults-outnumber-children.html
19•geox•1h ago•2 comments

Midjourney Powers Web Stack on Bun with Five Engineers Serving Millions

https://twitter.com/_chenglou/status/1986583136369844608
2•dvrp•1h ago•0 comments

Data-formulator.ai from Microsoft Research – free to play data analysis agent

https://data-formulator.ai/
2•flyingglobox•1h ago•0 comments

It Is All about Token: Towards Semantic Information Theory for LLMs

https://arxiv.org/abs/2511.01202
3•dboreham•1h ago•0 comments

Diving into Rama: A Clojure LSH Vector Search Experiment

https://shtanglitza.ai/public/blog/rama-lsh.html
2•nathanmarz•1h ago•0 comments