Chef at PagerDuty
Blog post from PagerDuty
PagerDuty's Operations Engineering team, comprising four generalist engineers, focuses on infrastructure automation, host-level security, data stores, and productivity tools, and ensures high availability by utilizing resilient software on resilient infrastructure. The team automates its infrastructure using Chef, a ruby-based configuration management tool, and follows a feature branch workflow with unit testing via Jenkins and integration testing in a staging environment. They prioritize using community cookbooks, supplemented by custom wrapper cookbooks with the "pd" prefix, managed by Berkshelf, and employ custom knife plugins for tasks such as server backups and spawns. Despite challenges like cross-component versioning and code quality issues in community cookbooks, the team remains confident in their infrastructure's reliability, thanks to a test-driven development approach and the adoption of security testing automation tools.