Company
Date Published
Author
Ash Pallarito Joe Abley
Word count
2045
Language
English
Hacker News points
None

Summary

On July 14, 2025, Cloudflare experienced a significant outage affecting its 1.1.1.1 public DNS Resolver, causing a global service disruption for 62 minutes. This incident was triggered by a misconfiguration during a service topology update, inadvertently linking the 1.1.1.1 Resolver's IP addresses to a non-production service, leading to their withdrawal from Cloudflare's data centers globally. The error originated from a June 6 configuration change meant for a new Data Localization Suite service, which remained dormant until the July update prompted a global configuration refresh. The outage significantly impacted Internet services for users relying on 1.1.1.1, although DNS-over-HTTPS traffic remained largely unaffected due to its reliance on different IP addresses. Cloudflare rapidly deployed a fix to restore service, while also committing to phasing out legacy systems and improving deployment methodologies to prevent future occurrences. This incident underscored the need for robust configuration management and highlighted Cloudflare's efforts to enhance the resilience of its infrastructure.