Home / Companies / Sysdig / Blog / Post Details
Content Deep Dive

How to monitor etcd

Blog post from Sysdig

Post Details
Company
Date Published
Author
Victor Hernando
Word Count
2,344
Language
English
Hacker News Points
-
Summary

Etcd is a distributed key-value open-source database that serves as a critical component in Kubernetes infrastructure by storing cluster-related data and leveraging the RAFT algorithm for leader election and consensus. Proper monitoring of etcd is essential to prevent issues such as loss of quorum, which can halt changes within a Kubernetes cluster. The article outlines how to monitor etcd using its built-in metrics endpoint accessible via the client port, requiring appropriate certificates for access. It details setting up Prometheus to scrape these metrics and highlights key metrics to monitor, including node availability, leader status, proposal handling, disk performance, and network latency. By properly monitoring these metrics, users can ensure the stability and performance of their Kubernetes clusters. The article also introduces the Sysdig Monitor tool as a resource for accelerating monitoring and troubleshooting processes.