Anyone else losing tokens to hallucinated MCP tool calls in production?

1•Mahjabinbm•1h ago
I have been building an agentic system on a custom internal platform, and the LLM keeps calling tools with identifiers that don't exist: wrong namespace, wrong handle, wrong enum. It gets back an error, retries, and is still wrong. Every bad call is tokens down the drain. I ended up writing a big system prompt to fix it, which took weeks of trial and error. It's working, but I still don't fully trust it. I'm curious whether others are hitting this or if it's just me. If you're running MCP servers on internal platforms the LLM has never seen before (the problem occurs most with less powerful models), do you have a go-to solution? Thanks.
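For illustration only, here is a minimal sketch of one way to shorten the error-and-retry loop described above: check each identifier against the set of known values before the tool runs, and put the valid options (plus a nearest match) in the error text so the model has something concrete to correct against. This is not the system prompt approach from the post, and it does not use any MCP SDK. The namespace values and the names `VALID_NAMESPACES`, `ValidationResult`, and `validate_identifier` are all hypothetical.

```python
import difflib
from dataclasses import dataclass

# Hypothetical set of namespaces on an internal platform (assumption, not from the post).
VALID_NAMESPACES = {"billing", "inventory", "support"}


@dataclass
class ValidationResult:
    ok: bool
    message: str


def validate_identifier(value: str, allowed: set[str], field: str) -> ValidationResult:
    """Check one tool-call argument against an allowlist.

    On failure, the message lists every valid value and, when one is
    close enough, a "did you mean" hint, so a retry has real options
    to pick from instead of guessing again.
    """
    if value in allowed:
        return ValidationResult(True, f"{field} ok")

    options = sorted(allowed)
    # difflib.get_close_matches ranks candidates by similarity ratio.
    suggestions = difflib.get_close_matches(value, options, n=3, cutoff=0.6)
    hint = f" Did you mean: {', '.join(suggestions)}?" if suggestions else ""
    return ValidationResult(
        False,
        f"Unknown {field} '{value}'. Valid values: {', '.join(options)}.{hint}",
    )


if __name__ == "__main__":
    # A misspelled namespace, as a model might produce it.
    print(validate_identifier("biling", VALID_NAMESPACES, "namespace").message)
```

Whether this reduces wasted tokens in practice would depend on the model and on how the error text is fed back to it; the sketch only shows the validation step itself.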

Comments

dmilicev2•1h ago
Yes, and that's one of the reasons I started working on this tool: to elicit desirable behavior from AIs before turn 1. AI is still non-deterministic and won't follow instructions 100% of the time, but with this tool, I intend to narrow that gap.

Would love to hear your thoughts on it: https://github.com/turnzero-ai/turnzero

Show HN: A Mutating Webhook to automatically strip PII from K8s logs

https://github.com/aragossa/pii-shield
1•aragoss•37s ago•0 comments

MCP server that lets Claude query your Google Calendar

https://github.com/zimdarsj/ai-side-hustle/tree/main/projects/personal-mcp-suite
1•zimdarsj•1m ago•0 comments

Show HN: Codeberg (Forgejo) CLI

https://codeberg.org/thatxliner/codeberg-cli
1•thatxliner•2m ago•0 comments

An AI-native approach to personalized marketing

https://usereachout.com/blog/an-ai-native-approach-to-personalized-marketing
2•killer1loop•3m ago•1 comment

London Is Still Paying Rent to the Queen on a Property Leased in 1211

https://www.atlasobscura.com/articles/london-is-still-paying-rent-to-the-queen-on-a-property-leas...
1•thunderbong•3m ago•0 comments

HAM Radio Is Not Just for Talking

https://rfcorner.in/posts/ham-radio-is-not-just-for-talking/
1•speckx•4m ago•0 comments

Agents for Financial Services and Insurance

https://www.anthropic.com/news/finance-agents
2•louiereederson•4m ago•0 comments

ESP32 Hosts Solarpunk Message Board

https://hackaday.com/2026/05/04/esp32-hosts-solarpunk-message-board/
1•iamnothere•4m ago•0 comments

I tried making my own AG Grid, and it took 9 months

https://visualeaf.com/blog/why-my-custom-table-took-9-months/
1•Jacky101•5m ago•0 comments

I built a tagging system where you don't have to remember your tags (no AI)

https://www.supertags.app/
2•keyes343•5m ago•0 comments

AI systems are about to start building themselves

https://importai.substack.com/p/import-ai-455-automating-ai-research
3•JumpCrisscross•6m ago•0 comments

Show HN: Airbyte Agents – context for agents across multiple data sources

4•mtricot•6m ago•0 comments

Postgres – Asynchronous Commits

https://www.postgresql.org/docs/current/wal-async-commit.html
1•Brysonbw•7m ago•0 comments

AI inference infrastructure built on small and nano models

https://www.youtube.com/watch?v=C-6Zo1JvZkE
1•its_maddy_a•8m ago•1 comment

The ultimate guide to RL environments: building and scaling them in the LLM era

https://huggingface.co/spaces/AdithyaSK/rl-environments-guide
3•kashifr•8m ago•0 comments

It's official: Utah is the U.S. state closest to banning VPNs

https://tech.yahoo.com/vpn/article/its-official-utah-is-the-us-state-closest-to-banning-vpns-1535...
5•giantg2•8m ago•0 comments

Show HN: Claude-smart – Make Claude Code self-improve from every session

https://github.com/ReflexioAI/claude-smart
1•yilu331•9m ago•0 comments

LLM-test-kit – Test consistency, latency, cost and behavior of LLM apps

https://github.com/muskanjoshi01/llm-test-kit
1•muskanjo•10m ago•1 comment

Notes from Optimizing CPU-Bound Go Hot Paths

https://blog.andr2i.com/posts/2026-05-03-notes-from-optimizing-cpu-bound-go-hot-paths
1•molecularman•10m ago•0 comments

Show HN: I used accounting controls to build a governed AI coding tool

https://github.com/CodeMaestro-AI/CodeMaestro
1•lw1981•11m ago•0 comments

I love AI assistants but objectively they're still terrible. (A Lefos review)

https://techstackups.com/articles/lefos-earendil-review/
1•sixhobbits•12m ago•0 comments

Show HN: Memopt: Open-source GPU memory fabric for AI infrastructure

https://github.com/basnetlachu/memopt
1•lachu_536•12m ago•0 comments

HN: AquaLens – Real-time NOAA and GEBCO ocean dashboard for vessel operations

https://research-vessel-ops-4.emergent.host
1•stefymaestro•12m ago•0 comments

AI won't speed up software delivery – nothing has

https://thenewstack.io/feedback-driven-ai-adoption/
1•Brajeshwar•13m ago•0 comments

The Great Pacific Garbage Patch could be part of a hidden problem

https://www.cnn.com/2026/05/04/climate/microplastics-nanoplastics-air-global-warming
1•Tomte•13m ago•0 comments

Whetstone: AI agents don't lack capability, they lack process

https://ilia.ws/blog/ai-agents-dont-lack-capability-they-lack-process
2•ilia-a•14m ago•0 comments

Charting the AI perception gap between experts and the public

https://link.springer.com/article/10.1007/s00146-026-03023-8
1•i7l•15m ago•0 comments

Gmail Storage Full? Find and Delete Large Emails to Fix It Fast

https://clearmailapp.com/blog/gmail-storage-full-delete-large-emails/
1•raghukumar•15m ago•0 comments

When merchandise crowds the aisle and carts crowd the shopper: Effects on sales

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0346492
1•ndr42•15m ago•2 comments

Airborne Microplastics May Be Warming the Planet

https://e360.yale.edu/digest/airborne-microplastics-climate-change
3•speckx•16m ago•0 comments