Disaster recovery plans are essential for any business, and especially in sectors like banking, where maintaining service during outages is critical. Operational resilience regulations across jurisdictions, such as APRA CPS 230 (Australia), DORA (EU), FCA PS21/3 (UK), and OSFI Guideline E-21 (Canada), reinforce their importance. Reliability engineering, using tools like Gremlin, can test these plans by simulating failure scenarios such as network outages, shifts in traffic patterns, and partial resource failures. Running these simulations shows whether systems can keep operating under real-world conditions, so businesses can verify that their plans are both effective and compliant. By standardizing and automating the tests, organizations can confirm that they stay within their service thresholds and produce documented proof of compliance, turning disaster recovery from a manual exercise into an automated, repeatable practice.
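To make the idea of a standardized, automated test suite concrete, here is a minimal Python sketch. It does not call Gremlin or any real fault-injection API. It models each failure scenario as a simple change in available capacity and demand, checks the result against a service threshold, and emits a JSON record that could serve as evidence of the test run. Every name and number in it (`Scenario`, `run_suite`, the capacity figures, the 99% threshold) is an assumption made for illustration.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Scenario:
    """A hypothetical failure scenario, reduced to two numbers."""
    name: str
    healthy_fraction: float    # share of capacity still usable during the failure
    traffic_multiplier: float  # demand relative to normal load


@dataclass(frozen=True)
class Result:
    scenario: str
    served_fraction: float
    threshold: float
    passed: bool


def served_fraction(scenario: Scenario, capacity: float, normal_load: float) -> float:
    """Fraction of requests the degraded system can still serve (capped at 1.0)."""
    available = capacity * scenario.healthy_fraction
    demand = normal_load * scenario.traffic_multiplier
    if demand <= 0:
        return 1.0
    return min(1.0, available / demand)


def run_suite(scenarios, capacity: float, normal_load: float, threshold: float):
    """Run every scenario the same way and record pass/fail against the threshold."""
    results = []
    for s in scenarios:
        frac = served_fraction(s, capacity, normal_load)
        results.append(Result(s.name, round(frac, 4), threshold, frac >= threshold))
    return results


def report(results) -> str:
    """Serialize results as JSON so each run leaves a documented record."""
    return json.dumps([asdict(r) for r in results], indent=2)


# Illustrative scenarios mirroring the three failure types named above.
SCENARIOS = [
    Scenario("network outage", healthy_fraction=0.5, traffic_multiplier=1.0),
    Scenario("traffic pattern shift", healthy_fraction=1.0, traffic_multiplier=1.5),
    Scenario("partial resource failure", healthy_fraction=0.9, traffic_multiplier=1.0),
]

if __name__ == "__main__":
    # Assumed figures: 1200 req/s of capacity, 1000 req/s normal load, 99% target.
    results = run_suite(SCENARIOS, capacity=1200.0, normal_load=1000.0, threshold=0.99)
    print(report(results))
```

In a real setup the `served_fraction` model would be replaced by measurements taken while a fault-injection tool runs the scenario against the actual system. The structure stays the same: a fixed list of scenarios, one threshold, and a machine-readable record of each run.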