Home / Companies / Datadog / Blog / Post Details
Content Deep Dive

Introducing cluster-level service monitoring

Blog post from Datadog

Post Details
Company
Date Published
Author
Conor Branagan
Word Count
535
Language
English
Hacker News Points
-
Summary

In late 2014, Datadog introduced Availability Monitoring to monitor hosts, applications, and services, allowing customization of alerts for availability issues. A major update now enables setting alerts when a percentage of servers in a cluster experience availability problems. This feature is particularly useful for cloud-based applications where the focus is on overall application or service availability rather than individual server statuses. Datadog allows users to set two types of alerts: Warning and Critical, and monitor by various groupings such as availability zone, environment, roles, etc., with different alert threshold percentages specified for each grouping.