Home / Companies / Gremlin / Blog / Post Details
Content Deep Dive

Adrian Cockroft: "Chaos Engineering - What it is, and where it's going" - Chaos Conf 2018

Blog post from Gremlin

Post Details
Company
Date Published
Author
Gremlin
Word Count
10,647
Language
English
Hacker News Points
-
Summary

Adrian Cockroft's keynote at Chaos Conf 2018 focuses on Chaos Engineering, emphasizing its role in preparing systems to handle failures effectively. He discusses the evolution of Chaos Engineering from traditional disaster recovery practices, highlighting its importance in today's cloud-based infrastructure where systems must be resilient to a range of failures, from hardware malfunctions to software bugs and operational missteps. Cockroft emphasizes the need for a culture that encourages reporting and learning from small incidents to prevent larger failures, advocating for continuous automated testing rather than annual disaster recovery exercises. He also highlights the importance of observability and the role of human judgment in managing unforeseen failures, underscoring the integration of Chaos Engineering into organizational processes to enhance the reliability and safety of complex systems.