The rise of agentic AI in production: Can observability systems run themselves?
Blog post from Grafana Labs
In a recent episode of the "Grafana's Big Tent" podcast, experts including Tom Wilkie, Spiros Xanthos, Manoj Acharya, and Cyril Tovena discuss the potential of agentic AI in observability and whether it can autonomously manage production systems. The conversation explores automated root cause analysis, knowledge graphs, and the skepticism faced by site reliability engineers (SREs) towards AI, emphasizing that while AI can assist in debugging and troubleshooting, it is not yet poised to replace human engineers. The panelists highlight the evolution of AI tools in addressing complex production issues, such as a recent incident where AI identified a latent deadlock bug in a system. They also discuss the importance of trust and transparency in AI systems, as well as the challenges related to pricing in an agent-driven world. The conversation concludes with optimism about the future, suggesting that AI's role in observability will grow, potentially achieving autonomy in resolving incidents by the end of the year, akin to the progress seen in AI for coding.