Company
Date Published
Author
Andre Newman
Word count
1058
Language
English
Hacker News points
None

Summary

Gremlin has released a set of updates and new features for system testing and reliability management. The new Process Exhaustion experiment simulates massively parallel workloads to test system stability under high process loads, and an integration with AWS Key Management Service (KMS) simplifies and secures deployments. The platform now supports restricted time windows that prevent testing during critical periods, and it uses DNS-based methods to better discover and track service dependencies. Gremlin has also enhanced its auditing tools with new API endpoints for retrieving log data and refined its web app interface for a smoother user experience. Agent updates include new container drivers that reduce CPU and I/O usage, along with better handling of network-related experiments. Together, these changes are intended to help users identify and mitigate availability risks more effectively.