GitHub Availability Report: March 2022
Blog post from GitHub
In March 2022, GitHub experienced several incidents that degraded the availability of core services. Most stemmed from resource contention in the mysql1 database cluster during peak load, which affected write operations and delayed GitHub Actions. GitHub responded by proactively managing load, optimizing queries, and increasing database capacity, which significantly reduced query rates and transaction volumes. It also implemented maintenance windows and paused certain operations to minimize user impact, while accelerating work to shard the affected cluster and improve alerting thresholds. A later incident, during a database migration for GitHub Actions, exposed misconfigurations that temporarily delayed jobs and prompted a reevaluation of operational workflows and permissions checks. GitHub continues to focus on transparency and resilience, sharing these updates to reinforce customer trust and accountability.