SRE-skills-bench
GitHub
A benchmark suite for evaluating AI agents on real-world SRE tasks — incident diagnosis, runbook execution, and infrastructure troubleshooting.
Topics
SREAI AgentsIncident Management
12 stars
Tech Stack
PythonShell
A benchmark suite for evaluating AI agents on real-world SRE tasks — incident diagnosis, runbook execution, and infrastructure troubleshooting.
Topics
Tech Stack