Home / Companies / Datadog / Blog / Post Details
Content Deep Dive

How we saved over $3 million in idle compute costs with Datadog Kubernetes Autoscaling

Blog post from Datadog

Post Details
Company
Date Published
Author
Jacob Simonov, Danny Driscoll, Jesse Feinman
Word Count
1,065
Company Posts That Month
57
Language
English
Hacker News Points
-
Summary

Datadog's introduction of Datadog Kubernetes Autoscaling (DKA) has significantly optimized its Kubernetes resource management by intelligently automating both horizontal and vertical workload scaling. The implementation has helped eliminate over $3 million in annual idle compute costs, while improving reliability across its services. A central part of this success was demonstrated by Rapid, a platform team at Datadog, which adopted DKA to manage over 1,800 services and 20,000 deployments. DKA's multidimensional scaling mode addressed previous challenges with separate autoscaling tools, simplifying configurations and reducing both overprovisioning and underprovisioning issues. This allowed Rapid to fine-tune resource allocations based on actual usage, cutting costs by over 50% in some data centers and improving service reliability. DKA's successful rollout not only streamlined resource management but also fostered a culture of cost ownership among Datadog's engineering teams, enabling them to optimize services and share improvements company-wide, creating a cycle of continuous enhancement.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Kubernetes 7 1,993 294 100 +1%
Platform Engineering 1 1,249 211 81 -3%