Implement auto-remediation using New Relic and Amazon EventBridge
Blog post from New Relic
Auto-remediation, which aims to reduce human intervention and minimize mean time to resolve (MTTR), is effectively implemented through the integration of New Relic and AWS EventBridge, as illustrated in a comprehensive guide using an Amazon EC2 instance with high memory utilization. The process involves monitoring the EC2 instance with New Relic's infrastructure agent to trigger an alert when memory usage surpasses a predefined threshold. This alert sends a notification to EventBridge, which then executes a rule to restart the instance automatically. The guide emphasizes configuring the New Relic to EventBridge integration, including dynamically supplying the EC2 instance ID from alert notifications to ensure accurate remediation, and creating an SSM Automation Role in AWS IAM for rebooting EC2 instances. It also details setting up EventBridge rules to parse notifications from New Relic and restart the correct instance, highlighting the importance of specifying automation parameters and execution roles. Testing and validation steps are included to ensure the auto-remediation process functions correctly, with suggestions for troubleshooting and additional integration options provided for users interested in leveraging New Relic's capabilities further.