Company
Date Published
Author
Giovanni Rago
Word count
1148
Language
English
Hacker News points
None

Summary

We recently released the first version of our OpsGenie integration, allowing users to leverage Checkly with the popular alerting and incident management platform more easily. Our plan is to add additional work to enhance the experience. We use public dashboards to monitor checks running against production and staging environments, setting up alerts via email, Slack, and OpsGenie for check priority. To integrate OpsGenie using webhooks, users can create an alert on failure and automatically close it when the check recovers. If Checkly goes down, we continuously monitor lower-level components like queue and notification services to prevent issues. We use Amazon CloudWatch alerts and AppOptics to handle spikes in incoming messages and ensure timely results. Our backend systems are separated into multiple services, allowing for independent deployment, operation, and functional decoupling.