While experimenting with LLM APIs, I noticed that the same prompt can be sent repeatedly from different parts of an application, leading to unnecessary token usage and higher API costs.
For teams using OpenAI, Anthropic, or similar APIs in production: How do you currently detect or prevent duplicate prompts or redundant calls? Do you rely on logging and dashboards, caching layers, internal proxy services, or something else? Or is this generally considered a minor issue that most teams just accept as part of normal usage?
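For context, by "caching layer" I mean something roughly like the sketch below: an in-memory cache keyed by a hash of the model and prompt, so an exact duplicate never reaches the API. Names like `PromptCache` and `call_llm` are placeholders, not any real library's API.

```python
import hashlib

class PromptCache:
    """Naive in-memory dedup cache keyed by a hash of (model, prompt).

    `call_llm` is a stand-in for whatever client function actually
    hits the provider's API.
    """

    def __init__(self, call_llm):
        self._call_llm = call_llm
        self._store = {}
        self.hits = 0
        self.misses = 0

    def _key(self, model, prompt):
        # Hash model and prompt together so the same prompt against
        # different models is not treated as a duplicate.
        return hashlib.sha256(f"{model}\x00{prompt}".encode()).hexdigest()

    def complete(self, model, prompt):
        key = self._key(model, prompt)
        if key in self._store:
            self.hits += 1  # duplicate prompt: served from cache, no API call
            return self._store[key]
        self.misses += 1
        result = self._call_llm(model, prompt)
        self._store[key] = result
        return result


# Demo with a fake client: the second identical call never hits the "API".
calls = []

def fake_llm(model, prompt):
    calls.append(prompt)
    return f"response to: {prompt}"

cache = PromptCache(fake_llm)
cache.complete("some-model", "hello")
cache.complete("some-model", "hello")  # duplicate -> cached
print(len(calls), cache.hits)  # 1 1
```

Obviously this ignores real-world concerns (eviction, TTLs, non-deterministic sampling, streaming responses), which is partly why I'm asking how production teams handle it.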