frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Seen the same LLM prompt break invariants weeks later in prod?

2•ritwikkar•1h ago
I’m asking specifically about LLM calls embedded inside real production workflows, not demos, side projects, or exploratory prompt work.

Think backend pipelines like: step 1 → LLM → step 2 → LLM → step 3 where users depend on the output and nothing technically “crashes.”

We’ve seen a recurring pattern: - Same input, same prompt, same model - Works reliably for weeks - Then a constraint is ignored, or a later step contradicts an earlier one - Retries don’t reliably fix it - Logs don’t explain what changed

The hardest part isn’t bad output, it’s not being able to explain failures to PMs or stakeholders when nothing obviously broke.

Curious how others operating LLM-backed workflows in production are diagnosing or containing this kind of behavior over time.

(Not looking for prompt advice or eval frameworks. Interested in operational experiences.)

Comments

chrisjj•52m ago
> The hardest part isn’t bad output, it’s not being able to explain failures to PMs or stakeholders when nothing obviously broke.

Try: The known unreliability of stochastic LLM tech caused obviously predictable failure of output depended upon by the user.

Perhaps present the analogy of a random number generator feeding the calculation of a company's statutory financial accounts.

Is Groove (by OpenAI) a scam that no one talks about?

1•ainthusiast•53s ago•0 comments

Kotlin Multiplatform

https://kmp.rrtutors.com/
1•rrtutors•7m ago•0 comments

After 25 years, Wikipedia has proved that news doesn't need to look like news

https://www.niemanlab.org/2026/01/after-25-years-wikipedia-has-proved-that-news-doesnt-need-to-lo...
3•giuliomagnifico•16m ago•0 comments

An explanation of cheating in Doom2 Deathmatch (1999)

https://www.doom2.net/doom2/cheating.html
1•Lammy•16m ago•1 comments

US electricity demand surged in 2025 – solar handled 61% of it

https://electrek.co/2026/01/16/us-electricity-demand-surged-in-2025-solar-handled-61-percent/
2•doener•17m ago•0 comments

Show HN: PolyMCP – structured skills from MCP tools for efficient agent usage

1•justvugg•18m ago•0 comments

TLDR: Code Analysis for AI Agents

https://github.com/parcadei/llm-tldr
1•handfuloflight•19m ago•0 comments

Don't Waste Your Back Pressure

https://banay.me/dont-waste-your-backpressure/
1•ghuntley•19m ago•0 comments

GCD of Fibonacci Numbers

https://www.cut-the-knot.org/arithmetic/algebra/FibonacciGCD.shtml
1•vismit2000•19m ago•1 comments

Show HN: I made a TIDAL client that runs in the terminal

https://github.com/results-may-vary-org/ttydal
1•a2nb•23m ago•0 comments

TidesDB v7.2.3 and RocksDB v10.9.1 Benchmark Analysis

https://tidesdb.com/articles/benchmark-analysis-tidesdb-v7-2-3-rocksdb-v10-9-1/
1•alexpadula•24m ago•0 comments

Map To Poster – Create Art of your favourite city

https://github.com/originalankur/maptoposter
4•originalankur•32m ago•2 comments

Hypixel released Hytale Early Access

https://hytale.com/news/2026/1/hytale-is-finally-here
1•ssernikk•34m ago•0 comments

Jaw health campaign – looking for funding

1•gushogg-blake•40m ago•0 comments

Docker Releases Hardened Images for Free – What Does It Do Differently?

https://www.i-programmer.info/news/240-devops/18579-docker-releases-hardened-images-for-free-what...
1•birdculture•40m ago•0 comments

30 Years

https://www.charlespetzold.com/blog/2026/01/30-Years.html
1•_hao•42m ago•0 comments

I Render 10MB Markdown Files in the Browser

https://igorstechnoclub.com/how-i-render-10mb-markdown-files-in-the-browser/
1•Igor_Wiwi•43m ago•0 comments

Removing Gemini AI Watermarks: A Deep Dive into Reverse Alpha Blending

https://allenkuo.medium.com/removing-gemini-ai-watermarks-a-deep-dive-into-reverse-alpha-blending...
1•diginova•43m ago•0 comments

ClickHouse raises $400M Series D

https://clickhouse.com/blog/clickhouse-raises-400-million-series-d-acquires-langfuse-launches-pos...
3•ushakov•48m ago•0 comments

Show HN: I built a tool to assist AI agents to know when a PR is good to go

https://dsifry.github.io/goodtogo/
2•dsifry•50m ago•1 comments

The Costs of Studying China from a Distance

https://www.pekingnology.com/p/diao-daming-the-costs-of-studying
1•taiwandongsuan•53m ago•0 comments

Steps How to Delete Yourself from the Internet

https://vpnspin.com/how-to-delete-yourself-from-the-internet/
1•mariusme•57m ago•0 comments

Is Plane Wi-Fi Safe? 3 Critical Dangers Exposed

https://vpnspin.com/is-wifi-on-planes-truly-safe/
1•mariusme•59m ago•0 comments

Study debunks Trump claim that paracetamol causes autism

https://www.theguardian.com/society/2026/jan/16/study-debunks-trump-claim-paracetamol-causes-auti...
1•chrisjj•1h ago•0 comments

Show HN: ReFlow Studio – An offline tool to dub, translate, and censor videos

https://github.com/ananta-sj/ReFlow-Studio
1•linearAmend•1h ago•0 comments

That's FAR-out, man: a kernel infoleak in Mac OS XNU

https://blog.dfsec.com/ios/2023/11/19/thats-far-out-man/
1•fanf2•1h ago•0 comments

Makefile Concepts

https://blog.dbdo.website/posts/makefile/?
1•dbdo•1h ago•0 comments

In the coming weeks, we plan to start testing ads in ChatGPT free and Go tiers

https://twitter.com/OpenAI/status/2012223373489614951
4•Garbage•1h ago•1 comments

Show HN: Partner – An AI co-founder that remembers you

https://getpartner.ai
1•ianberdin•1h ago•0 comments

Bridges – By Kent Beck

https://tidyfirst.substack.com/p/bridges
1•Garbage•1h ago•0 comments