Code Orange: Fail Small — our resilience plan following recent incidents
Blog post from Cloudflare
In response to two significant network outages in late 2025, Cloudflare has launched a project called "Code Orange: Fail Small" to improve the resilience and reliability of its network. Both outages were caused by configuration changes that propagated rapidly and without control, exposing weaknesses in the company's deployment processes. Code Orange prioritizes controlled rollouts for configuration changes, similar to those Cloudflare already uses for software updates, so that a bad change is contained before it causes widespread disruption. The initiative also covers reviewing and improving how systems behave when they fail, and adjusting internal procedures so that the necessary tools remain quickly accessible during emergencies while strict security protocols stay in place. Cloudflare aims to implement these changes by the end of the first quarter of 2026, and will continue refining its processes and technologies to prevent future incidents.
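The idea of a controlled rollout for configuration changes can be illustrated with a minimal sketch. This is not Cloudflare's actual tooling: the stage names, the `apply` and `healthy` callbacks, and the `staged_rollout` function are all hypothetical, chosen only to show the general pattern of pushing a change to a small slice first, checking health, and reverting before the change can spread.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class RolloutResult:
    completed: bool            # True if every stage passed its health check
    stages_applied: List[str]  # stages that received the new config, in order
    rolled_back: bool          # True if the change was reverted


def staged_rollout(
    stages: Sequence[str],
    new_config: Dict[str, str],
    old_config: Dict[str, str],
    apply: Callable[[str, Dict[str, str]], None],
    healthy: Callable[[str], bool],
) -> RolloutResult:
    """Apply new_config one stage at a time (hypothetical illustration).

    After each stage, the health check runs. On the first failure, every
    stage touched so far is reverted to old_config and the rollout stops,
    so later (larger) stages never see the bad change.
    """
    applied: List[str] = []
    for stage in stages:
        apply(stage, new_config)
        applied.append(stage)
        if not healthy(stage):
            # Revert in reverse order, most recently changed stage first.
            for done in reversed(applied):
                apply(done, old_config)
            return RolloutResult(False, applied, True)
    return RolloutResult(True, applied, False)


if __name__ == "__main__":
    # In-memory stand-in for a fleet: stage name -> active config.
    fleet: Dict[str, Dict[str, str]] = {}

    def apply_to_fleet(stage: str, config: Dict[str, str]) -> None:
        fleet[stage] = dict(config)

    def fleet_is_healthy(stage: str) -> bool:
        # Assumed failure signal for the demo: a config value of "bad".
        return fleet[stage].get("limit") != "bad"

    result = staged_rollout(
        ["canary", "one-region", "global"],  # hypothetical stage names
        {"limit": "bad"},
        {"limit": "ok"},
        apply_to_fleet,
        fleet_is_healthy,
    )
    print(result)
```

In this sketch a faulty change stops at the first stage and is reverted there, which is the "fail small" property: the blast radius is limited to the smallest slice of the rollout rather than the whole network.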