frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you find out if the LLM API is giving degraded responses?

3•imviky•1d ago
If you are building on top of multiple LLM APIs or even a single one amongst OpenAI, Claude, Gemini, etc. what do you do when the API starts degrading (slow TTFT, elevated error rates, timeouts). Or even worse, when there are responses but the model is drifting. How do you find this out? I'm trying to understand if this is a widespread pain or just something I've been unlucky with.

Four specific questions:

1. When an LLM API starts silently degrading, how do you currently find out? (Your own monitoring? User complaints? Checking the status page? Reddit?)

2. How long does it typically take you to confirm "this is the provider, not my code"?

3. If something told you before you noticed, that Claude API was showing elevated TTFT on Sonnet right now, would that change anything about how you operate? Or would you just retry and move on regardless?

4. Would you pay for an independent alert service that tells you when an LLM's behaviour has drifted, before your users notice?

If this isn't actually a problem for you, I think that also would be the most useful answer I can get.

Ask HN: Will programmers write more efficient code during the memory shortage?

80•amichail•8h ago•132 comments

Ask HN: How do you separate intentional test boilerplate from real duplication?

10•rafaepta•2d ago•7 comments

Ask HN: Using OPA/Rego to secure MCP tool execution. Does it make sense?

4•wmolino•1h ago•0 comments

Ask HN: Is anyone using the A2A protocol?

91•asim•1d ago•41 comments

Ask HN: What's a simple app you'd build if you had a weekend?

3•akashwadhwani35•1h ago•2 comments

Ask HN: What tools are you using for AI-assisted code review?

21•agos•1d ago•22 comments

Ask HN: What is the coolest tech progress outside AI?

10•vantareed•13h ago•6 comments

Ask HN: Open-Source Intelligence

3•silent_butagrim•17h ago•4 comments

Ask HN: Is there a recognized standard for swarm intelligence benchmarking?

5•stephanieriggs•17h ago•1 comments

Ask HN: Is anyone else leaving AUR?

6•lordkrandel•1d ago•6 comments

Self-adapting and mutating LLM based viruses/worms

3•rozumbrada•20h ago•4 comments

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

5•snowhy•1d ago•6 comments

Trillions of dollars spent just to work on customer services?

8•YihaoZhang•22h ago•2 comments

Ask HN: Is there a way to stop the animated Google Doodles?

11•arnejenssen•1d ago•12 comments

Ask HN: How do you effectively communicate or present?

8•hnthrow10282910•1d ago•5 comments

Meetup.com login appears to be exceeding its reCAPTCHA Enterprise quota

4•infl8ed•1d ago•0 comments

Ask HN: Conflicted about founding engineer role

7•gondolin1683•1d ago•18 comments

Ask HN: Do you find vibe coding / agentic engineering to be fulfilling?

8•uejfiweun•1d ago•11 comments

Ask HN: What's a prompt you've written that you're genuinely proud of?

10•akashwadhwani35•2d ago•7 comments

Ask HN: Has anyone had success with SBIR grants and what is the process like?

11•lyfeninja•2d ago•8 comments

Ask HN: Are other people seeing a spike in IT problems with businesses?

14•PaulHoule•2d ago•11 comments

Reviews have become expensive, rewrites have become cheap

82•_z6bq•4d ago•74 comments

Ask HN: How do you find new books to read?

5•ahmedfromtunis•1d ago•6 comments

Anthropic pauses credit change for Claude Code

35•fabianlindfors•4d ago•12 comments

Ask HN: Opus and regression with patterns not included in trainng data

2•dleech•1d ago•5 comments

Ask HN: Do we even need code anymore?

5•lasky•1d ago•19 comments

How much $ you spend for AI to code?

4•raghuu•1d ago•7 comments

Ask HN: Best resources for learning how to build a forum back end?

3•jupr•1d ago•3 comments

Ask HN: Whats the best and small open source model?

3•hairymouse•1d ago•3 comments

Ask HN: Looking for a CI/CD project for my local lab

5•q8zd3•1d ago•10 comments