Seen the same LLM prompt break invariants weeks later in prod?

2•ritwikkar•3w ago
I’m asking specifically about LLM calls embedded inside real production workflows, not demos, side projects, or exploratory prompt work.

Think backend pipelines like: step 1 → LLM → step 2 → LLM → step 3 where users depend on the output and nothing technically “crashes.”
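A minimal sketch of that pipeline shape, assuming each LLM step is wrapped so that declared invariants are checked before the next step runs. Every name here (`checked`, `fake_llm`, `non_empty`, `max_200_chars`, `run_pipeline`) is hypothetical and for illustration only; `fake_llm` is a stand-in for a real model call.

```python
from typing import Callable, List


class InvariantViolation(Exception):
    """Raised when a step's output breaks a declared invariant."""


def checked(step_name: str,
            fn: Callable[[str], str],
            invariants: List[Callable[[str], bool]]) -> Callable[[str], str]:
    """Wrap a step so its output is validated before being passed on."""
    def run(text: str) -> str:
        out = fn(text)
        for inv in invariants:
            if not inv(out):
                # Fail loudly instead of letting a bad output flow downstream.
                raise InvariantViolation(f"{step_name}: {inv.__name__} failed")
        return out
    return run


def fake_llm(text: str) -> str:
    # Hypothetical stand-in for a real LLM call.
    return text.upper()


def non_empty(out: str) -> bool:
    return bool(out.strip())


def max_200_chars(out: str) -> bool:
    return len(out) <= 200


def run_pipeline(text: str, steps: List[Callable[[str], str]]) -> str:
    """Run the steps in order, feeding each output into the next step."""
    for step in steps:
        text = step(text)
    return text


pipeline = [
    checked("llm_1", fake_llm, [non_empty, max_200_chars]),
    checked("llm_2", fake_llm, [non_empty, max_200_chars]),
]
```

With this shape, a silently ignored constraint surfaces as an `InvariantViolation` naming the step and the check, rather than as a contradiction discovered several steps later.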

We’ve seen a recurring pattern:
- Same input, same prompt, same model
- Works reliably for weeks
- Then a constraint is ignored, or a later step contradicts an earlier one
- Retries don’t reliably fix it
- Logs don’t explain what changed
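One way to make "same input, same prompt, same model" checkable after the fact is to log a small fingerprint for every call and diff fingerprints from a good run and a bad run. The sketch below is only an illustration: the field names and the helpers `call_fingerprint` and `diff_fingerprints` are hypothetical, and `reported_model` assumes the provider returns some model or version identifier with the response, which may not hold for every API.

```python
import hashlib
import json
from typing import Any, Dict, List, Optional


def sha(text: str) -> str:
    """Short, stable hash for comparing text across log entries."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()[:16]


def call_fingerprint(model: str,
                     prompt: str,
                     params: Dict[str, Any],
                     response: str,
                     reported_model: Optional[str] = None) -> Dict[str, Any]:
    """Build a log record describing exactly what was sent and received."""
    return {
        "requested_model": model,
        # Assumption: the provider echoes back a model/version identifier.
        "reported_model": reported_model,
        "prompt_sha": sha(prompt),
        # sort_keys makes the hash independent of dict ordering.
        "params_sha": sha(json.dumps(params, sort_keys=True)),
        "response_sha": sha(response),
    }


def diff_fingerprints(a: Dict[str, Any], b: Dict[str, Any]) -> List[str]:
    """Return the names of fields that differ between two records."""
    return sorted(k for k in a if a[k] != b.get(k))
```

If two records differ only in `response_sha`, the request really was identical and the variation came from the model side; if `prompt_sha` or `params_sha` differ, something upstream changed.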

The hardest part isn’t bad output, it’s not being able to explain failures to PMs or stakeholders when nothing obviously broke.

Curious how others operating LLM-backed workflows in production are diagnosing or containing this kind of behavior over time.

(Not looking for prompt advice or eval frameworks. Interested in operational experiences.)

Comments

chrisjj•3w ago
> The hardest part isn’t bad output, it’s not being able to explain failures to PMs or stakeholders when nothing obviously broke.

Try: The known unreliability of stochastic LLM tech caused an obviously predictable failure in output the user depended on.

Perhaps present the analogy of a random number generator feeding the calculation of a company's statutory financial accounts.

kinkyusa•3w ago
https://thinkingmachines.ai/blog/defeating-nondeterminism-in...