I’ve built this product called Rivora.
The idea came from frustrations I kept hitting while running modern infra. Reliability is still mostly reactive. Something breaks, alerts fire, and someone ends up jumping between dashboards trying to piece together what actually changed, hopefully not at 3am.
But infrastructure itself is becoming more autonomous: constant deploys, lots of services, and more and more AI devtools.
Rivora is my attempt to build a reliability layer for autonomous infrastructure that actually understands what’s happening in the system and surfaces risk before things break.
It connects to your cloud environment, CI/CD, and observability stack to watch how the system evolves, and it explains why reliability risk is increasing instead of just sending endless alerts.
Still early and definitely rough around the edges.
Curious to hear feedback from anyone running production infrastructure.