Company
Date Published
Author
Wei Li
Word count
1834
Language
English
Hacker News points
None

Summary

Grafana Cloud has integrated Asserts.ai, an AI/ML-driven tool that automates the correlation of anomalies across application and infrastructure layers, to reduce the mean time to resolution (MTTR) for complex microservice-based applications. The integration, announced at ObservabilityCON 2024, introduces unified workflows that automate root cause analysis (RCA), so that even junior engineers can diagnose issues effectively. Asserts is designed for Prometheus and OpenTelemetry instrumentation; it offers real-time monitoring and SLO-based alerts, adds a contextual layer to telemetry data, and reduces alert fatigue. Its RCA Workbench helps visualize and prioritize system anomalies, and it integrates with Grafana Cloud's observability solutions. The integration is available to Grafana Cloud Advanced customers as of October 2 and supports domain-specific investigations through detailed dashboards and bi-directional workflows for resolving application and infrastructure issues.