frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Launch HN: Relvy (YC F24) – On-call runbooks, automated

https://www.relvy.ai
13•behat•2h ago
Hey HN! We are Bharath, and Simranjit from Relvy AI (https://www.relvy.ai). Relvy automates on-call runbooks for software engineering teams. It is an AI agent equipped with tools that can analyze telemetry data and code at scale, helping teams debug and resolve production issues in minutes. Here’s a video: [[[https://www.youtube.com/watch?v=BXr4_XlWXc0]]]

A lot of teams are using AI in some form to reduce their on-call burden. You may be pasting logs into Cursor, or using Claude Code with Datadog’s MCP server to help debug. What we’ve seen is that autonomous root cause analysis is a hard problem for AI. This shows up in benchmarks - Claude Opus 4.6 is currently at 36% accuracy on the OpenRCA dataset, in contrast to coding tasks.

There are three main reasons for this: (1) Telemetry data volume can drown the model in noise; (2) Data interpretation / reasoning is enterprise context dependent; (3) On-call is a time-constrained, high-stakes problem, with little room for AI to explore during investigation time. Errors that send the user down the wrong path are not easily forgiven.

At Relvy, we are tackling these problems by building specialized tools for telemetry data analysis. Our tools can detect anomalies and identify problem slices from dense time series data, do log pattern search, and reason about span trees, all without overwhelming the agent context.

Anchoring the agent around runbooks leads to less agentic exploration and more deterministic steps that reflect the most useful steps that an experienced engineer would take. That results in faster analysis, and less cognitive load on engineers to review and understand what the AI did.

How it works: Relvy is installed on a local machine via docker-compose (or via helm charts, or sign up on our cloud), connect your stack (observability and code), create your first runbook and have Relvy investigate a recent alert.

Each investigation is presented as a notebook in our web UI, with data visualizations that help engineers verify and build trust with the AI. From there on, Relvy can be configured to automatically respond to alerts from Slack

Some example runbook steps that Relvy automates: - Check so-and-so dashboard, see if the errors are isolated to a specific shard. - Check if there’s a throughput surge on the APM page, and if so, is it from a few IPs? - Check recent commits to see if anything changed for this endpoint.

You can also configure AWS CLI commands that Relvy can run to automate mitigation actions, with human approval.

A little bit about us - We did YC back in fall 2024. We started our journey experimenting with continuous log monitoring with small language models - that was too slow. We then invested deeply into solving root cause analysis effectively, and our product today is the result of about a year of work with our early customers.

Give us a try today. Happy to hear feedback, or about how you are tackling on-call burden at your company. Appreciate any comments or suggestions!

Comments

ramon156•2h ago
Congrats on the launch! I dig the concept, seems like a good tool :)
behat•2h ago
Thank you :)
hrimfaxi•2h ago
How does this differ from cursor cloud agents where I can hook up MCPs, etc and even launch the agent in my own cloud to connect directly to internal hosts like dbs?
behat•2h ago
Thanks. Yeah, Cursor / Claude code + MCP is powerful. We differentiate on two fronts, mainly:

1) Greater accuracy with our specialized tools: Most MCP tools allow agents to query data, or run *ql queries - this overwhelms context windows given the scale of telemetry data. Raw data is also not great for reasoning - we’ve designed our tools to ensure that models get data in the right format, enriched with statistical summaries, baselines, and correlation data, so LLMs can focus on reasoning.

2) Product UX: You’ll also find that text based outputs from general purpose agents are not sufficient for this task - our notebook UX offers a great way to visualize the underlying data so you can review and build trust with the AI.

hrimfaxi•2h ago
To be clear, are the main differentiators basically better built-in MCPs and better UX? Not knocking just trying to understand the differences.

I have had incredible success debugging issues by just hooking up Datadog MCP and giving agents access to it. Claude/cursor don't seem to have any issues pulling in the raw data they need in amounts that don't overload their context.

Do you consider this a tool to be used in addition to something like cursor cloud agents or to replace it?

behat•1h ago
For the debugging workflow you described, we would be a standalone replacement for cursor or other agents. We don't yet write code so can't replace your cursor agents entirely.

Re: diffentiation - yes, faster, more accurate and more consistent. Partially because of better tools and UX, and partially because we anchor on runbooks. On-call engineers can quickly map out that the AI ran so-and-so steps, and here's what it found for each, and here's the time series graph that supports this.

Interesting that you have had great success with Datadog MCP. Do you mainly look at logs?

esafak•1h ago
They claim a 12% lead (from 36% to 48%) over Opus 4.6 in a RCA benchmark: https://www.relvy.ai/blog/relvy-improves-claude-accuracy-by-...
behat•1h ago
heh, I was just about to post the following on your previous comment re: reproducible benchmark results. Thanks for posting the blog.

With the docker images that we offer, in theory, people can re-run the benchmark themselves with our agent. But we should document and make that easier.

At the end of it, you really would have to evaluate on your own production alerts. Hopefully the easy install + set up helps.

rishav•1h ago
Woohoo!!! Congrats on the big launch y'all
sanghyunp•45m ago
Runbook automation is one of those things every team says they'll build internally and never does. After 6 years on backend teams, our "runbooks" were always a Notion page nobody updated. The hard part is always the boundary between what can be automated and what still needs human judgment.
behat•38m ago
Yes! That boundary between what can be automated and what still needs human judgement has shifted so much this last year. Things like 'go check this dashboard' can now be automated.

ROI on runbooks (or good documentation in general) is much higher now if you have AI agents running them autonomously in the background. Makes it worth it to write/maintain runbooks.

Show HN: 41 years sea surface temperature anomalies

https://ssta.willhelps.org
75•willmeyers•2h ago•24 comments

LittleSnitch for Linux

https://obdev.at/products/littlesnitch-linux/index.html
1024•pluc•14h ago•356 comments

Help Keep Thunderbird Alive

https://updates.thunderbird.net/en-US/thunderbird/140.0/apr26-1e/donate/
283•playfultones•7h ago•185 comments

A WebGPU Implementation of Augmented Vertex Block Descent

https://github.com/jure/webphysics
27•juretriglav•2h ago•0 comments

Meta removes ads for social media addiction litigation

https://www.axios.com/2026/04/09/meta-social-media-addiction-ads
81•giuliomagnifico•1h ago•37 comments

Introduction to Nintendo DS Programming

https://www.patater.com/files/projects/manual/manual.html
63•medbar•1d ago•10 comments

How Pizza Tycoon simulated traffic on a 25 MHz CPU

https://pizzalegacy.nl/blog/traffic-system.html
78•FinnKuhn•1h ago•18 comments

Open Source Security at Astral

https://astral.sh/blog/open-source-security-at-astral
273•vinhnx•10h ago•63 comments

FreeBSD Laptop Compatibility: Top Laptops to Use with FreeBSD

https://freebsdfoundation.github.io/freebsd-laptop-testing/
57•fork-bomber•5h ago•20 comments

Wit, unker, Git: The lost medieval pronouns of English intimacy

https://www.bbc.com/future/article/20260408-the-extinct-english-words-for-just-the-two-of-us
81•eigenspace•4h ago•44 comments

Building a framework-agnostic Ruby gem (and making sure it doesn't break)

https://newsletter.masilotti.com/p/on-building-a-framework-agnostic
7•joemasilotti•1d ago•0 comments

Launch HN: Relvy (YC F24) – On-call runbooks, automated

https://www.relvy.ai
13•behat•2h ago•11 comments

Haunted Paper Toys

http://ravensblight.com/papertoys.html
166•exvi•3d ago•21 comments

Tree Calculus

https://treecalcul.us/
40•tosh•6d ago•11 comments

Am I German or Autistic?

https://german.millermanschool.com/
152•doener•1h ago•117 comments

Lichess and Take Take Take Sign Cooperation Agreement

https://lichess.org/@/Lichess/blog/lichess-and-take-take-take-sign-cooperation-agreement/DZS0S0Dy
33•stevage•2h ago•2 comments

Small Engines

https://scottlocklin.wordpress.com/2026/03/25/very-small-engines/
18•surprisetalk•2d ago•3 comments

Show HN: Moon simulator game, ray-casting

https://mooncraft2000.com
55•JKCalhoun•2d ago•13 comments

Dr. Dobb's Developer Library DVD 6

https://archive.org/details/DDJDVD6
99•kristianp•4d ago•39 comments

Creating the Futurescape for the Fifth Element [2019]

https://theasc.com/articles/fantastic-voyage-creating-the-futurescape-for-the-fifth-element
70•nixass•5h ago•44 comments

USB for Software Developers: An introduction to writing userspace USB drivers

https://werwolv.net/posts/usb_for_sw_devs/
358•WerWolv•19h ago•40 comments

Improving storage efficiency in Magic Pocket, Dropbox's immutable blob store

https://dropbox.tech/infrastructure/improving-storage-efficiency-in-magic-pocket-our-immutable-bl...
47•laluser•5d ago•7 comments

I ported Mac OS X to the Nintendo Wii

https://bryankeller.github.io/2026/04/08/porting-mac-os-x-nintendo-wii.html
1736•blkhp19•22h ago•296 comments

Understanding the Kalman filter with a simple radar example

https://kalmanfilter.net
387•alex_be•21h ago•49 comments

They're made out of meat (1991)

http://www.terrybisson.com/theyre-made-out-of-meat-2/
595•surprisetalk•1d ago•160 comments

Claude mixes up who said what and that's not OK

https://dwyer.co.za/static/claude-mixes-up-who-said-what-and-thats-not-ok.html
245•sixhobbits•5h ago•233 comments

ML promises to be profoundly weird

https://aphyr.com/posts/411-the-future-of-everything-is-lies-i-guess
557•pabs3•1d ago•541 comments

The Importance of Being Idle

https://theamericanscholar.org/the-importance-of-being-idle/
242•Caiero•2d ago•141 comments

Git commands I run before reading any code

https://piechowski.io/post/git-commands-before-reading-code/
2130•grepsedawk•1d ago•456 comments

Muse Spark: Scaling towards personal superintelligence

https://ai.meta.com/blog/introducing-muse-spark-msl/?_fb_noscript=1
372•chabons•22h ago•352 comments