Observability in Practice: Running a Game Day with New Relic One and Gremlin

Blog post from New Relic

Post Details
Company: New Relic
Author: Drew Decker
Word Count: 1,056
Language: English
Summary

New Relic and Gremlin jointly ran a controlled outage, known as a "game day," to demonstrate New Relic's observability platform. A game day simulates real-world failure scenarios in order to test and improve incident response. During the exercise the team introduced controlled chaos, such as adding latency to a customer's Cassandra database. New Relic's platform detected the resulting issues, alerted the responsible teams, and let them trace failures back to their root causes. The exercise showed how chaos engineering uncovers system weaknesses before they affect users, and why robust monitoring and alerting matter. It also improved the customer's incident management processes and increased their confidence in New Relic's capabilities, illustrating the practical benefit of combining observability with chaos engineering to make systems more resilient.
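
The core loop of a game day (inject a fault, then check that alerting notices it) can be illustrated with a small, self-contained simulation. This is only a sketch under stated assumptions: it does not call Gremlin or New Relic, and the function names, baseline latencies, injected delay, and alert threshold are all hypothetical values chosen for illustration.

```python
import statistics

def query_latency_ms(base_ms, injected_ms=0.0):
    """Hypothetical stand-in for a database call: returns a simulated latency."""
    return base_ms + injected_ms

def alert_fires(samples_ms, threshold_ms):
    """Toy alert condition: fire when the median latency exceeds the threshold."""
    return statistics.median(samples_ms) > threshold_ms

# All numbers below are made up for the sketch, not taken from the exercise.
BASELINE_MS = [12.0, 15.0, 11.0, 14.0, 13.0]  # assumed healthy query latencies
THRESHOLD_MS = 100.0                           # assumed alert threshold
INJECTED_MS = 250.0                            # assumed latency added by the experiment

# Steady state: no fault injected.
healthy = [query_latency_ms(b) for b in BASELINE_MS]

# Game day: the same queries with extra latency added to each one.
degraded = [query_latency_ms(b, INJECTED_MS) for b in BASELINE_MS]

print("alert before injection:", alert_fires(healthy, THRESHOLD_MS))
print("alert during injection:", alert_fires(degraded, THRESHOLD_MS))
```

In a real game day the injected latency would come from a chaos engineering tool and the alert condition would live in the monitoring platform. The point of the sketch is the expectation being tested: the alert stays quiet in steady state and fires once the fault is active.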