Challenges using Prometheus at scale
Blog post from Sysdig
Prometheus, a widely-used monitoring tool in Kubernetes environments, faces several challenges when scaled for production needs, including lack of global visibility, complex configuration management, and horizontal scaling limitations. As organizations grow and the number of services increases, Prometheus struggles with memory issues due to the growing number of time series, and its inability to scale horizontally becomes evident. Workarounds like sharding and long-term storage solutions such as Cortex, Thanos, and M3 exist but come with their own complexities and high operational costs. Sysdig aims to address these scaling challenges by evolving its platform to be fully compatible with Prometheus, offering a scalable and secure solution that retains data for longer periods, thereby allowing developers to focus on innovation rather than maintaining complex monitoring infrastructures.