frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Coupang says 33.7M customer accounts breached

https://www.reuters.com/sustainability/boards-policy-regulation/south-korean-e-commerce-firm-coup...
1•alephnerd•52s ago•0 comments

LLM live ranking (Gemini, OpenAI, xAI)

https://metrik-dashboard.vercel.app/
1•mbouassa•9m ago•0 comments

Human See, Human Do

https://www.edwest.co.uk/p/human-see-human-do
1•Ariarule•11m ago•0 comments

What Are Microservices? (Seriously)

https://bykozy.me/blog/what-are-microservices-seriously/
1•byko3y•12m ago•0 comments

Aircraft Boneyard

https://en.wikipedia.org/wiki/Aircraft_boneyard
2•thunderbong•18m ago•0 comments

YouTube increases FreeBASIC performance (2019)

https://freebasic.net/forum/viewtopic.php?t=27927
3•giancarlostoro•19m ago•0 comments

Hierarchy of Engineering Talent

https://www.businessinsider.com/andrew-ng-talent-engineer-ai-hire-college-graduates-computer-scie...
1•MarcoDewey•24m ago•0 comments

What DFH / the Semantic Stack Does

https://github.com/colts70/The-Sematic-Stack
1•sematicstackdfh•26m ago•2 comments

A personal email newsletter is great

https://bryanhogan.com/blog/email-thank-you
1•bryanhogan•26m ago•0 comments

The Perpetual Student

https://domofutu.substack.com/p/the-perpetual-student
2•wjb3•28m ago•0 comments

Show HN: Ai2.compare Gists with a twist, compare AIs, save, share, explore chats

https://ai2.compare/gist/3f5b3197-bac2-4edb-838c-7535c8b85a10
1•ozgurozkan•29m ago•0 comments

Land, Debt, and Crises

https://arnoldkling.substack.com/p/land-debt-and-crises
2•Ariarule•30m ago•0 comments

Black Friday sets online spending record of $11.8B

https://techcrunch.com/2025/11/29/black-friday-sets-online-spending-record-of-11-8b-adobe-says/
2•MarcoDewey•31m ago•0 comments

Shopify lays off staff to keep team 'fast, sharp and focused'

https://ca.finance.yahoo.com/news/shopify-lays-off-staff-keep-211040075.html
4•mgh2•32m ago•0 comments

Zigbook Is Plagiarizing the Zigtools Playground

https://zigtools.org/blog/zigbook-plagiarizing-playground/
2•todsacerdoti•34m ago•0 comments

Minimal information needed to disambiguate rows

https://github.com/gojiplus/rowvoi
1•neehao•35m ago•0 comments

Giving Mars a Magnetosphere (2018)

https://medium.com/our-space/an-artificial-martian-magnetosphere-fd3803ea600c
3•Jach•39m ago•0 comments

Credifi: Instant unsecured loans powered by DeFi

https://credi.fi/
1•motasem323•40m ago•0 comments

Show HN: Boing

https://boing.greg.technology/
9•gregsadetsky•42m ago•1 comments

Boeing, Going, Gone: The Forgotten Air India Plane

https://timesofindia.indiatimes.com/city/kolkata/boeing-going-gone-100ft-b737-200-aircraft-lying-...
1•vismit2000•43m ago•0 comments

Show HN: I built 10 parallel coding agents because one wasn't enough

https://devswarm.ai/student?dub_id=RA8E78EcJbcenTkt
1•jaha25•43m ago•0 comments

Aid Cuts Are Hitting the Largest Refugee Camps

https://www.bloomberg.com/news/features/2025-11-28/trump-s-aid-cuts-are-hitting-the-world-s-large...
1•petethomas•44m ago•0 comments

MIT Report Claims 11.7% of U.S. Labor Can Be Replaced with Existing AI

https://iceberg.mit.edu/
2•cuttothechase•45m ago•2 comments

Postage.to – Send real letters from the browser

https://postage.to/en
1•giankar•45m ago•0 comments

ChatGPT Turns 3

https://restofworld.org/2025/chatgpt-three-years-global-impact/
6•billybuckwheat•47m ago•0 comments

AI doubted a female developer's work until she switched profile to a white man

https://techcrunch.com/2025/11/29/no-you-cant-get-your-ai-to-admit-to-being-sexist-but-it-probabl...
4•CodeCrusader•51m ago•0 comments

Tell HN: Hh.ht – the scripting language that can hot-patch itself in 3 ms

2•chunpulee•52m ago•0 comments

Building a Biomedical GraphRAG: When Knowledge Graphs Meet Vector Search

https://aiechoes.substack.com/p/building-a-biomedical-graphrag-when
1•Anon84•55m ago•0 comments

Looking For–and Finding?–Workshop Makers' Marks on Late Roman Diatreta

https://journals.publishing.umich.edu/jgs/article/id/6943/
1•gradus_ad•1h ago•0 comments

The Futile Quest for Artificial Intelligence (2006)

https://archive.skeptic.com/archive/reading_room/artificial-intelligence-gone-awry/
3•pbw•1h ago•0 comments
Open in hackernews

GPT needs a truth-first toggle for technical workflows

1•PAdvisory•6mo ago
I use GPT-4 extensively for technical work: coding, debugging, modeling complex project logic. The biggest issue isn’t hallucination—it’s that the model prioritizes being helpful and polite over being accurate.

The default behavior feels like this:

Safety

Helpfulness

Tone

Truth

Consistency

In a development workflow, this is backwards. I’ve lost entire days chasing errors caused by GPT confidently guessing things it wasn’t sure about—folder structures, method syntax, async behaviors—just to “sound helpful.”

What’s needed is a toggle (UI or API) that:

Forces “I don’t know” when certainty is missing

Prevents speculative completions

Prioritizes truth over style, when safety isn’t at risk

Keeps all safety filters and tone alignment intact for other use cases

This wouldn’t affect casual users or conversational queries. It would let developers explicitly choose a mode where accuracy is more important than fluency.

This request has also been shared through OpenAI's support channels. Posting here to see if others have run into the same limitation or worked around it in a more reliable way than I have found

Comments

duxup•6mo ago
I’ve found this with many LLMs they want to give an answer, even if wrong.

Gemini on the Google search page constantly answers questions yes or no… and then the evidence it gives indicates the opposite of the answer.

I think the core issue is that in the end LLMs are just word math and they don’t “know” if they don’t “know”…. they just string words together and hope for the best.

PAdvisory•6mo ago
I went into it pretty in depth after breaking a few with severe constraints, what it seems to come down to is how the platforms themselves prioritize functions, MOST put "helpfulness" and "efficiency" ABOVE truth, which then leads the LLM to make a lot of "guesses" and "predictions". At their core pretty much ALL LLM's are made to "predict" the information in answers, but they CAN actually avoid that and remain consistent when heavily constrained. The issue is that it isn't at the core level, so we have to CONSTANTLY retrain it over and over I find
Ace__•6mo ago
I have made something that addresses this. Not ready to share it yet, but soon-ish. At the moment it only works on GPT model 4o. I tried local Q4 KM's models, on LM Studio, but complete no go.