The ONLY thing we care about is the ability to: - Log an LLM completion and press a button that re-runs the exact same completion in a UI (the industry seems to call this a "playground"), so we can replay the completion exactly as it ran in production.
What we DO NOT care about: - "datasets" - "scores" - "prompt enhancers"
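A minimal sketch of what "log and replay exactly" could look like, assuming a JSONL log file: store the full request payload (model, messages, sampling parameters, seed) verbatim alongside the response, then hand that payload back untouched for a playground re-run. Function names and the record shape here are hypothetical, not any particular vendor's API.

```python
import json

def log_completion(path, request, response):
    # Append the complete request payload plus the response, one JSON
    # object per line, so the request can later be replayed verbatim.
    record = {"request": request, "response": response}
    with open(path, "a") as f:
        f.write(json.dumps(record) + "\n")

def load_replay_payload(path, index=0):
    # Return the exact request payload for a logged completion; feeding
    # this back to the same endpoint reproduces the production call.
    with open(path) as f:
        records = [json.loads(line) for line in f]
    return records[index]["request"]
```

The key design point is that nothing is normalized or summarized on the way in: the bytes you send to the playground are the bytes production sent.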
uaas•1d ago
muzani•3h ago
LLM outputs are qualitative; they can't really be scored automatically, and prompt enhancements tend to multiply bugs: they may solve one problem but introduce a new one. It's more practical to just do it manually.