Company
Date Published
Author
Redis
Word count
365
Language
English
Hacker News points
None

Summary

For the second time during June 2012, the AWS us-east-1 region failed due to a power outage caused by extreme weather conditions. This resulted in potential data loss for users of in-memory data stores like Memcached or services like ElastiCache, potentially causing dramatic performance degradation and crashes. However, the company Garantia Data implemented its own replication and auto-failover processes to ensure minimal downtime during such events, allowing it to recover from a failed node without damage. The use of persistent storage, daily S3 backups, and robust data-persistence mechanisms enabled Garantia Data to successfully recover all users' datasets from the outage, ensuring business continuity despite the AWS failure.