We deployed AI agents across 25 hospitality properties and logged ~46,000 guest conversations. The main failure mode wasn’t tone or retrieval. It was “confident gap-filling”: the model promising operational outcomes nobody had verified. This post is about the production failures we saw and the constraints we added to stop them.
wastemaster•2h ago
Happy to answer questions about failure modes, where we draw the line between retrieval and operational decisions, and which constraints actually reduced risk in production. The main lesson for us was that “don’t hallucinate” is too soft for real operations. We had to replace soft prompting with hard boundaries: verified data only, checks for critical actions, and escalation before collecting fulfillment details for anything unverified.
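The three boundaries described (verified data only, explicit checks for critical actions, escalation before collecting fulfillment details for anything unverified) can be sketched as a simple gating function. This is a minimal illustration, not our actual implementation; all names (`GuestRequest`, `handle_request`, `VERIFIED_INTENTS`) are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class GuestRequest:
    intent: str          # e.g. "late_checkout", "room_upgrade" (illustrative)
    is_critical: bool    # would fulfilling this change real-world operations?

# Intents backed by verified property data (hypothetical set)
VERIFIED_INTENTS = {"late_checkout"}

def handle_request(req: GuestRequest) -> str:
    if req.intent not in VERIFIED_INTENTS:
        # Unverified: never improvise, never collect fulfillment details.
        return "escalate_to_staff"
    if req.is_critical:
        # Verified but operational: require an explicit confirmation step
        # before the agent commits to anything.
        return "confirm_before_acting"
    # Verified and informational: safe to answer directly.
    return "answer_from_verified_data"
```

The point is that the ordering is architectural: the unverified check runs first, so no amount of persuasive prompting can route an unverified request into the fulfillment path.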
xxwink•1h ago
"if a request is not grounded in verified data, the agent must not improvise." - This is an instruction salespeople across the globe could also benefit from. And the LLM is trained on content humans made, so it makes sense it needs the same instruction.
wastemaster•1h ago
Spot on. We often joke that a raw LLM acts exactly like an over-eager junior sales rep: it desperately wants to say "yes" to please the customer. Because it learned from us, it inherits the bad human habit of equating "helpfulness" with agreement. The difference is that an AI will scale those broken promises instantly, which is why the constraints have to be architectural.