Handling Termination Grace Period
Blog post from Kestra
In a recent blog post, Kestra introduces a new liveness and heartbeat mechanism aimed at enhancing the reliability of its distributed orchestration platform. This mechanism allows Kestra's services to periodically send heartbeat messages to its backend, ensuring the timely detection of server failures and seamless continuity in workflow executions. The updated system moves from a binary state model to a more nuanced lifecycle approach, offering greater flexibility and configurability for different service types and workloads. This development addresses challenges such as resource saturation and duplicate executions by allowing for individualized configurations and the introduction of a termination grace period. By adjusting properties like the worker task restart strategy, users can tailor the balance between availability and consistency according to their operational needs. Additionally, Kestra Enterprise Edition provides a Cluster Monitor dashboard for enhanced visibility into service states and transitions, further supporting Kestra’s commitment to building a reliable orchestration platform for business-critical workflows.