The whole observability industry is built on a flawed assumption: that you can bolt it on after the fact, ship the data somewhere else, and then charge an arm and a leg to access it. That breaks down quickly in any non-trivial distributed system. Even ignoring the issues with sampling, it is cost-prohibitive.
zarathustra333•3mo ago
idea0rbit•3mo ago
https://www.linkedin.com/pulse/observability-broken-its-time...
tanelpoder•3mo ago
I'm using eBPF-based dimensional data analysis, starting from the bottom up (every system is a bunch of threads, including distributed systems) and moving up from there. This doesn't replace existing distributed tracing approaches for an end-to-end request view, but it gives you deep observability all the way down to each service's underlying threads' wall-clock time (where they are blocked or sleeping, and why, etc).
At this year's P99CONF I will launch the first GA release of my (open source) 0x.tools xcapture eBPF collectors, along with xtop, a reference implementation of a TUI tool that shows dimensional performance modeling on these new thread sampling signals.
A couple of 1-minute asciicasts of xtop are here: https://tanelpoder.com/posts/xcapture-xtop-beta/
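For anyone unfamiliar with the idea, here is a minimal sketch of what "dimensional analysis on thread samples" can look like. The assumption is that each thread's state is sampled at a fixed interval and then grouped by whichever dimensions you care about. The field names, sample rows and the `top_by` helper are made up for illustration. They are not xcapture's actual output schema or xtop's code.

```python
from collections import Counter

# Hypothetical thread-state samples, one row per thread per sampling tick.
# Field names are illustrative only, not xcapture's real schema.
FIELDS = ("tick", "tid", "comm", "state", "syscall")
ROWS = [
    (0, 101, "api", "SLEEP", "read"),
    (0, 102, "api", "RUN", "-"),
    (0, 201, "db", "DISK", "pread64"),
    (1, 101, "api", "SLEEP", "read"),
    (1, 201, "db", "DISK", "pread64"),
    (2, 101, "api", "SLEEP", "read"),
    (2, 201, "db", "DISK", "pread64"),
    (3, 101, "api", "SLEEP", "read"),
    (3, 102, "api", "RUN", "-"),
]
SAMPLES = [dict(zip(FIELDS, row)) for row in ROWS]


def top_by(samples, dims, interval_s):
    """Group samples by the chosen dimensions.

    Returns (key, sample_count, estimated_seconds) tuples, largest first.
    Each sample is treated as roughly `interval_s` of wall-clock time
    spent by one thread in that state.
    """
    counts = Counter(tuple(s[d] for d in dims) for s in samples)
    return [(key, n, n * interval_s) for key, n in counts.most_common()]


if __name__ == "__main__":
    # Same samples, two different "dimensional" views.
    for dims in (("comm", "state"), ("state", "syscall")):
        print(dims)
        for key, n, secs in top_by(SAMPLES, dims, interval_s=0.5):
            print("  ", key, n, secs)
```

The point of the sketch is that the samples are collected once and the grouping dimensions are picked at query time, so the same data can be sliced by program, thread state, syscall, or any combination of them.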