My cofounder and I left Roblox to build IncidentFox (https://github.com/incidentfox/incidentfox) — an open-source AI SRE that investigates incidents and finds root causes.
We know AI SREs exist, and you’ve probably seen a hundred of them pitched. In our experience, they don’t work because they lack context about your systems and ask you to spend weeks building integrations. Who has time to build their own MCP servers?
Our take: context is everything, and UX matters more than people think. When things are on fire at 3am, you don’t want to open another tab. So we keep everything in Slack — paste a screenshot, drop a log file, view full traces, all without leaving the thread. On setup, we analyze your codebase and past incidents to understand your stack, then auto-build the integrations so things work out of the box.
Try it in our Slack (no setup): https://join.slack.com/t/incidentfox/shared_invite/zt-3ojlxv...
Or self-host the whole thing (Apache 2.0): https://github.com/incidentfox/incidentfox
We’d love to hear what you think!
Main website: https://www.incidentfox.ai/