Resolve and prevent operational incidents with AWS DevOps Agent and New Relic
Blog post from New Relic
Modern distributed systems are inherently complex, generating vast amounts of metrics, traces, and logs, which pose a challenge for manual root cause analysis and extend the mean time to detect and resolve incidents. By leveraging Agentic AI, Site Reliability Engineers (SREs) and DevOps teams can automate and enhance the incident resolution process. New Relic has partnered with AWS to integrate its Model Context Protocol (MCP) server with the AWS DevOps Agent, providing automated root cause analysis and actionable recommendations through AI. The AWS DevOps Agent acts like an experienced DevOps engineer, investigating incidents, identifying operational improvements, and correlating telemetry, code, and deployment data to expedite incident resolution. This integration allows for faster issue resolution and proactive incident prevention, significantly reducing Mean Time to Resolution (MTTR) and driving operational excellence. Through a use case involving a retail chain's shopping cart service experiencing high latency, the blog demonstrates how these tools can quickly identify and remediate issues, minimizing technical disruptions and ensuring business continuity.