frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•7mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•7mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•7mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•7mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.

Global ripple effects of corporate tax reforms

https://www.nber.org/papers/w34627
1•hhs•3m ago•0 comments

What are you using today to monitor uptime for small or personal projects?

https://updown.fly.dev/
1•ejncman•9m ago•1 comments

Get Salty [video]

https://www.youtube.com/watch?v=NfxHiT-0inM
2•marysminefnuf•10m ago•1 comments

CES 2026: These 32 Tech Products Made Some of the Biggest Impressions

https://www.cnet.com/pictures/ces-2026-overall-products/
1•SilverElfin•11m ago•0 comments

Carina Hong of Axiom Math at the Neuron

https://www.youtube.com/watch?v=xldMXTPGMGI
1•rasengan0•12m ago•0 comments

Show HN: GlyphLang – An AI-first programming language

1•goose0004•12m ago•0 comments

Show HN: TheTabber – Create, repurpose, and post across 9+ platforms

https://thetabber.com/
1•dibasdauliya•13m ago•0 comments

Show HN: Librario, a book metadata API that aggregates G Books, ISBNDB, and more

5•jamesponddotco•14m ago•0 comments

Cocopilot: Self-Updating Repository

https://acbart.github.io/cocopilot/
1•acbart•15m ago•0 comments

Disaggregated machine learning via in-physics computing at radio frequency

https://www.science.org/doi/10.1126/sciadv.adz0817
2•gnabgib•19m ago•0 comments

Polymaths: An Argument for Analogies

https://nonzerosum.games/polymaths1.html
1•samixg•20m ago•1 comments

The "Good Will Hunting" Problem in Generative AI

https://medium.com/@chipmunkworks/ai-the-will-hunting-of-our-age-59952c1744f1
2•treelover•20m ago•1 comments

Show HN: Embex – 9K downloads in 2 weeks, a universal ORM for vector databases

https://www.bridgerust.dev/embex/introduction/
1•mimchak•22m ago•0 comments

MCP Joins the Linux Foundation

https://github.blog/open-source/maintainers/mcp-joins-the-linux-foundation-what-this-means-for-de...
1•raju•23m ago•1 comments

Private equity firms acquired more than 500 autism centers in past decade: study

https://www.brown.edu/news/2026-01-07/private-equity-autism-centers
5•hhs•23m ago•0 comments

Mapping and editing learned functional geometry inside a CNN (with controls)

https://github.com/boglim1984/functional-geometry-hebbian-manifold
1•boglim1984•24m ago•1 comments

CFT: "sqawk" 0.8.0 – optimized SQL Awk utility with Rust's sqlparser

https://github.com/jgarzik/sqawk/tree/v0.8.0
1•jgarzik•25m ago•1 comments

The da Vinci Code: Quest to Identify Leonardo da Vinci's DNA

https://www.science.org/content/article/have-scientists-found-leonardo-da-vinci-s-dna
1•bookmtn•25m ago•0 comments

Using the physics of radio waves to empower smarter edge devices

https://pratt.duke.edu/news/using-the-physics-of-radio-waves-to-empower-smarter-edge-devices/
1•hhs•28m ago•0 comments

The Celtic Tiger bridge that wouldn't open because of a lost remote control

https://www.thejournal.ie/sean-ocasey-bridge-remote-1713102-Oct2014/
2•JumpCrisscross•32m ago•0 comments

The Robot Cars Have Come for the Kids

https://www.nytimes.com/2026/01/05/us/waymo-kids-los-angeles.html
1•JumpCrisscross•33m ago•0 comments

Iranian Crown Prince in Exile – Interview with Reza Pahlavi (2025) [video]

https://www.youtube.com/watch?v=VwWQ3hnJLZQ
1•thomassmith65•34m ago•1 comments

Philosopher of Pride

https://aeon.co/essays/the-hidden-role-of-pride-and-shame-in-the-human-hive
1•benbreen•35m ago•0 comments

Show HN: Chordle. Learn to identify pitch by playing Wordle with chords

https://codepen.io/tehryanx/full/RNRGGEQ
1•tehryanx•36m ago•0 comments

The Manifold Mind of Saul Bellow

https://www.metropolitanreview.org/p/the-manifold-mind-of-saul-bellow
1•samclemens•36m ago•0 comments

People are abusing Facebook's deceased persons account hacked request form

https://infosec.exchange/@teriradichel/115873364828247139
2•gpi•38m ago•1 comments

[Claude Code Plugin Proposal] Add agent-session-commit to iterate on AGENTS.md

https://github.com/anthropics/claude-code/pull/17395
1•Olshansky•39m ago•0 comments

Tcl Nxtpaper 70 Pro phone has dedicated reading modes that help reduce strain

https://www.pcmag.com/news/tcl-nxtpaper-70-pro-phone-dials-up-the-specs-we-go-hands-on-at-ces-2026
1•teleforce•40m ago•0 comments

Show HN: Lolodex turns email threads and attachments into clean/searchable notes

https://lolodex.com
1•yungookim•49m ago•0 comments

The Wren Stack

https://speakez.tech/blog/wren-stack/
1•Multicomp•52m ago•1 comments