Groundcover Blog Summaries 2025

December 2025: 3 posts

DevOpsDays Tel Aviv

DevOpsDays Tel Aviv, scheduled for December 11, 2025, at Expo Tel Aviv, Pavilion 10, will feature sessions aimed at enhancing cloud monitoring and cost management. A notable speaking session by Aviv Zohari, Field CTO at groundcover, will address the issue of cross-availability zone (AZ) traffic that can inflate cloud bills by using eBPF for real-time traffic analysis without sidecars or packet drops. This session will provide insights on tracing network flows back to specific services, nodes, and zones, offering platform teams the visibility needed to manage cloud costs effectively. Additionally, a demo theater session led by Shahar Azulay, Co-founder & CEO at groundcover, will explore full-stack observability for complex AI systems, such as large language models (LLMs) and autonomous agents, emphasizing the importance of visibility and control in these environments.

Dec 08, 2025 312 words in the original blog post.

What is Security Context? Misconfigurations & Best Practices

Security context is a crucial concept in IT security that defines access rights and privileges for resources within applications, operating systems, and container orchestration platforms like Kubernetes. It allows organizations to establish granular access controls, helping to mitigate security risks by ensuring that IT resources only have the permissions necessary for their specific roles, akin to giving different keys to different people in an office building based on their needs. Implementing security context involves configuring rules that govern each resource's access capabilities, which are then automatically enforced by controllers to prevent unauthorized actions. Misconfigurations, such as allowing containers to run with root privileges or neglecting to set filesystems as read-only, can undermine the effectiveness of security context, highlighting the importance of proper configuration and adherence to best practices like enforcing least privilege and robust authentication. Emerging trends include the use of AI to automate the generation and management of security contexts, enhancing efficiency and adaptability in environments with dynamic access needs. Additionally, observability solutions like groundcover complement security contexts by providing insights into resource activities, enabling organizations to detect anomalies and ensure that security policies are enforced properly, thereby enhancing the overall security posture.

Dec 05, 2025 2,361 words in the original blog post.

Adding AWS Bedrock Support to LLM Observability

Groundcover addresses the challenges of monitoring Large Language Models (LLMs) in production environments by providing comprehensive observability tools that offer insights into aspects like request volume, latency, errors, and costs. Traditional monitoring tools fail to provide the necessary visibility into LLMs, which often function as black boxes in the technology stack. Groundcover's solution leverages eBPF technology, enabling developers to monitor LLM activity without the need for additional instrumentation or SDKs, thus eliminating blind spots. The platform supports AWS Bedrock alongside OpenAI and Anthropic, providing a unified view of LLM performance across different providers, and ensures data residency and security within an AWS instance. This observability is crucial for understanding and controlling costs, as LLMs can quickly become expensive if not monitored properly. Moreover, the integration of LLM observability with other monitoring metrics like CPU, memory, and network allows teams to identify and address issues efficiently, preventing costly outages and ensuring data safety by flagging sensitive information. As the AI landscape rapidly evolves, having such visibility is essential for teams to manage AI systems effectively and maintain operational integrity from the onset of LLM adoption.

Dec 02, 2025 1,098 words in the original blog post.

November 2025: 13 posts

The Datadog Migration Tool Is Now GA and Switching Observability Platforms Just Became Easy

Groundcover has officially launched its Datadog Migration feature, making it generally available to simplify the process of transitioning from legacy observability vendors to their platform. This tool addresses the significant challenge of migration fear by offering a fully automated, self-service solution that eliminates the need for consultants and ensures zero downtime and switching costs. It efficiently maps the entire environment, including dashboards, monitors, and integrations, while ensuring pixel-perfect dashboard translations and clear validation flows to maintain workflow integrity. The new Integrations Center supports multiple platforms like AWS, GCP, and Azure, allowing teams to configure setups directly through the UI or via API and Terraform. The tool promises significant improvements, such as 90% faster migration times and up to 70% lower total cost of ownership, positioning it as a viable and fear-free option for teams seeking to modernize their observability infrastructure.

Nov 30, 2025 805 words in the original blog post.

LLM Observability for AWS Bedrock & AgentCore: Zero Friction

Shahar Azulay, CEO of groundcover, announces the integration of Amazon Bedrock into their LLM Observability feature, enhancing real-time visibility for engineering and platform teams without requiring any SDKs or code changes. This feature supports Bedrock's approach to data handling, where AWS ensures data security and ownership by keeping traffic within a user's VPC and creating dedicated model copies for enterprise data. Groundcover's BYOC model guarantees that all telemetry stays within the user's cloud environment, ensuring data privacy and control. The use of eBPF allows for comprehensive, frictionless observability at the kernel layer, providing full visibility of AI interactions in real time. Furthermore, groundcover's support for Bedrock AgentCore offers insights into the logic and decision-making of AI workflows, moving beyond simple model interactions to understanding the reasoning paths of AI agents. As LLM applications evolve into more autonomous systems, the need for such visibility becomes crucial for operating AI with confidence, marking a shift away from the opaque "black box" era of AI.

Nov 30, 2025 889 words in the original blog post.

Kubernetes Secrets: How They Work, Types, and Best Practices

Kubernetes Secrets offer a secure method to store and manage sensitive data, such as passwords and certificates, within a Kubernetes cluster, thereby simplifying the deployment and management of workloads requiring authentication or encryption. These Secrets are stored as key-value pairs in Etcd and managed declaratively, but they are only encoded using Base64, not encrypted, necessitating additional security measures like enabling encryption in Etcd or using external secrets managers like HashiCorp Vault for enhanced security. Various types of Kubernetes Secrets exist, including opaque, basic authentication, TLS, registry, and bootstrap token Secrets, each serving specific use cases, from managing authentication data to encrypting network traffic. Best practices for managing Kubernetes Secrets involve selecting the appropriate type of Secret, updating and removing outdated Secrets regularly, enabling encryption, and monitoring events related to Secrets to mitigate risks such as unauthorized access. Groundcover enhances visibility into Kubernetes clusters by tracking metrics and performance trends, aiding in efficient Secrets management and troubleshooting.

Nov 27, 2025 2,415 words in the original blog post.

Why Datadog Broke Your Heart (and How groundcover Helps You Move On)

Engineering teams often start using Datadog, a popular observability tool, during their early growth stages due to its recognition and effectiveness in monitoring metrics, traces, and alerts. However, as companies scale, they encounter high costs associated with Datadog's volume-based pricing model, leading to limited visibility and expensive bills. In response, some teams consider switching to open-source solutions but face challenges in maintaining these complex systems. The text suggests that the real problem lies in the SaaS observability model, which isn't suited for rapidly growing companies. It proposes using Bring Your Own Cloud (BYOC) and eBPF technology as alternatives, offering flat costs and full visibility without the drawbacks of traditional SaaS tools, thus advocating for a shift to more scalable observability solutions that align with company growth.

Nov 27, 2025 916 words in the original blog post.

Meet groundcover at re:Invent 2025

At re:Invent 2025 in Las Vegas, groundcover invites attendees to engage in discussions about observability needs and to participate in a speaking session titled "Tracing the Untraceable: Full-Stack Observability for LLMs and Agents," which addresses the challenges organizations face when deploying LLM workflows without adequate monitoring. The session will focus on monitoring techniques for LLM applications, including token usage and response latency, using AWS Bedrock and other commercial LLM stacks, without requiring instrumentation. Groundcover offers an AWS-native solution, providing complete Kubernetes stack visibility with eBPF instrumentation, allowing teams to reduce costs with predictable pricing and troubleshoot infrastructure issues efficiently. The platform integrates with AWS CloudWatch and is compatible with various Kubernetes and Linux distributions, ensuring comprehensive observability across applications and infrastructure.

Nov 21, 2025 533 words in the original blog post.

More Metrics, Without Lifting a Finger

Groundcover seeks to streamline the migration from legacy observability vendors by minimizing user effort, particularly in mapping and validating metrics across various resources. To address challenges such as closing gaps in metric collection and ensuring comprehensive mapping, Groundcover adheres to the Prometheus convention with added prefixes for its metrics. A significant technical hurdle involved transitioning to cAdvisor for container metrics, which initially posed performance issues due to frequent file operations. By creatively using the Go compiler directive to modify cAdvisor's private function pointer at runtime, Groundcover improved performance, achieving a 90% reduction in file operations while benefiting from cAdvisor's extensive coverage. The company also automates the translation of legacy queries and metrics into their format, simplifying transitions without manual intervention.

Nov 12, 2025 795 words in the original blog post.

Dashboards: The Crux of Observability Migrations

Migrating dashboards in observability platforms is a complex process due to numerous nuances and configurations, akin to rearranging the layout and furniture of a house to fit a new space. This process involves translating metrics, queries, and layouts accurately to ensure functionality and maintain the user's preferences, which can be challenging without careful planning and expertise. Groundcover addresses these challenges by making opinionated, deterministic decisions to facilitate seamless translations of other vendors' dashboards, such as adopting a 24-column layout for easier coordinate and width translation. Particular attention is given to migrating variables, requiring reverse engineering of value calculations to prevent discrepancies. The goal is to provide a quick and painless migration experience that enables users to enjoy their new platform without dwelling on the complexities involved in the transition.

Nov 12, 2025 711 words in the original blog post.

Kubernetes Secrets: How They Work, Types, and Best Practices

Kubernetes Secrets are a crucial component for securely managing sensitive information like passwords, tokens, and certificates within a Kubernetes cluster, helping to simplify tasks that require authentication, authorization, or encryption by storing data as key-value pairs in Etcd. Kubernetes Secrets can be categorized into types such as Opaque, Basic Authentication, TLS, Registry, and Bootstrap Token, each serving specific use cases like HTTP authentication or traffic encryption. Despite their utility, Kubernetes Secrets face limitations, including the lack of default encryption and the complexity of access control, prompting some administrators to use external solutions like HashiCorp Vault for enhanced security features such as dynamic Secrets generation and encryption. Best practices for managing Kubernetes Secrets include enabling encryption in Etcd, choosing the appropriate type of Secret based on use case, and regularly updating and removing outdated Secrets, while monitoring and alerting can be achieved through Kubernetes's audit logging feature. Groundcover provides visibility into Kubernetes clusters to assist in troubleshooting and efficiently managing Secrets by tracking metrics and performance trends, ensuring that workloads can access the necessary Secrets securely.

Nov 11, 2025 2,423 words in the original blog post.

The Framework Behind the Query Language

Amir Sheffer, a Product Manager, discusses the complexities of data querying as groundcover develops a migration tool from legacy vendors to its platform. The effort involved revisiting their querying framework to handle every edge case and translate queries from other observability vendors into groundcover's languages. Utilizing ClickHouse for logs, traces, and events, and VictoriaMetrics for metrics, they settled on MetricsQL for metrics querying due to its power and expressiveness. They introduced a MetricsQL query builder that seamlessly switches between builder and code modes to enhance user experience. For logs, traces, and events, they created a middleware query language inspired by Victoria's LogsQL to optimize both simple and complex queries, ensuring the entire platform speaks the same language. The development process involved extensive optimization and deep-dive sessions, resulting in a robust framework that serves 95% of use cases while providing flexibility for the remaining 5%. As they continue to expand the framework's reach into areas like RUM session analytics and security use cases, they acknowledge that data querying remains an unsolved problem, but believe they have crafted a strong solution.

Nov 11, 2025 683 words in the original blog post.

The New Age of Self-Integration Management | Empowering Teams with Smarter Control

Groundcover introduces a new standard for integration management by enhancing observability platforms with streamlined configurations and comprehensive data correlation. By focusing on empowering customers to manage their ecosystems, the platform simplifies the integration setup process, allowing users to add, duplicate, and manage configurations directly from a SaaS interface. It supports Infrastructure-as-Code management through Terraform and offers the flexibility to pause and resume integrations based on specific needs, such as temporary testing environments. This approach aims to provide full visibility into performance and the ability to set up monitors and dashboards, facilitating dynamic control over the environment in line with evolving requirements. This initiative marks the beginning of a journey to reshape integration management, reflecting a commitment to data-driven leadership and innovative solutions for scaling startups.

Nov 10, 2025 579 words in the original blog post.

Migrations are here: Move from Datadog to groundcover

Groundcover has launched an automated migration experience for transitioning from other vendors, beginning with Datadog, to its own observability platform. This move comes after much internal debate, with the company ultimately deciding that the consolidation and improvement of observability tools are crucial in an era where AI rapidly evolves software. The migration process not only transfers assets but also aims to enhance existing systems by adopting a modern, open-source, and cost-efficient ecosystem, leveraging technologies like eBPF. Groundcover's approach seeks to address issues such as the escalating costs and vendor lock-in associated with traditional observability solutions, offering a path to seamlessly adopt updated standards without the constraints of legacy systems. This development is a strategic attempt to redefine observability practices by ensuring all data is centralized and accessible without proprietary limitations, thereby enabling future scalability and adaptability.

Nov 10, 2025 1,022 words in the original blog post.

Escaping Datadog: How We Built an Automated Observability Migration Tool

Groundcover, a company specializing in observability solutions, has developed an AI-powered migration tool designed to automate the transition from legacy observability vendors like Datadog to their platform. This tool addresses the significant challenges and costs associated with migrating enterprise systems, which often involve extensive manual processes, platform differences, and integration complexities. By offering a self-service, automated solution, Groundcover aims to eliminate the traditional barriers to switching vendors, such as high costs and long migration times, thereby providing a cost-effective alternative to expensive legacy systems. The company plans to showcase this tool at industry events, emphasizing its potential to disrupt the observability market by reducing vendor lock-in and enabling enterprises to adopt more competitive pricing models without compromising on data visibility and operational efficiency.

Nov 09, 2025 2,145 words in the original blog post.

Migration Doesnât Have to Be Scary

Groundcover has introduced a new Migrations feature designed to ease the transition from legacy observability systems, focusing on transparency, clarity, and control to instill user confidence. This feature employs preview modes to prevent surprises and insights panels that streamline decision-making, while a modern visual design makes complex steps more approachable. Initially starting with a linear wizard flow, the design evolved to allow non-linear progress, recognizing that migrations can be complex and require flexibility. The tool accommodates multi-threaded workflows with saved states and visual indicators, acknowledging that migrations often involve multiple stakeholders and extended discussions. Instead of a binary progress state, a battery-style progress graph provides a more nuanced view of migration status, emphasizing quality over quantity. Groundcover's approach is innovative, creating a fresh design system tailored to current team workflows rather than replicating outdated patterns. Overall, the Migrations tool aims to ensure clarity and safety throughout the migration process, aligning with the fast-paced and dynamic nature of the cloud-native world.

Nov 09, 2025 712 words in the original blog post.

October 2025: 1 post

Weâre officially Cool! groundcover Named a GartnerÂ® Cool Vendor in Container Management

Groundcover has been recognized as a 2025 Gartner Cool Vendor in Container Management, highlighting its innovative approach to cloud-native observability. The company's Bring Your Own Cloud (BYOC) model combines the simplicity of SaaS with the control of in-house cloud operations, allowing for efficient management of deployment, scaling, and data retention. Groundcover's architecture leverages high-performance storage and cloud provider services, ensuring cost-effective and resilient observability with complete data privacy and compliance. This recognition reflects Groundcover's commitment to proactive innovation and positions it as a leader in redefining observability for modern cloud environments.

Oct 20, 2025 763 words in the original blog post.

August 2025: 2 posts

Why We Monitor the Wrong Code & How to Fix It

In the blog post, Aviv Zohari explores the challenges developers face in effectively monitoring code, highlighting the irony that the most scrutinized parts of code, often monitored thoroughly, tend to be robust, while overlooked areas can fail silently due to insufficient attention. Zohari discusses the concept of "unknown unknowns" in system behavior, emphasizing the need for comprehensive monitoring to capture unexpected failures, which traditional methods often miss. The article introduces eBPF as a tool to achieve kernel-level observability without performance overhead, allowing developers to monitor every aspect of their systems. It also presents the Bring Your Own Cloud (BYOC) approach to mitigate the financial burden of storing massive amounts of monitoring data, advocating for storing data in proprietary cloud infrastructure to reduce costs. The piece concludes by promoting the groundcover platform, which uses eBPF for wide monitoring coverage, helping developers anticipate and respond to system anomalies without predicting specific failure points in advance.

Aug 20, 2025 1,167 words in the original blog post.

groundcover Zero-Instrumentation Observability for LLM Apps

Large Language Models (LLMs) have quickly become integral to modern software applications, enhancing capabilities across industries by providing AI-driven functionalities such as customer support and code generation. While these advancements offer significant business value, they also introduce complexities and risks, including performance volatility, cost unpredictability, quality drift, and security concerns. Traditional Application Performance Monitoring (APM) methods, which rely on heavy instrumentation, are inadequate for the dynamic nature of LLMs. In response, groundcover has developed a zero-instrumentation observability solution using eBPF technology, allowing organizations to monitor LLM interactions without code changes, ensuring security and compliance by keeping all data within the organization's cloud environment. This approach provides comprehensive insights into token usage, latency, and error patterns, helping teams optimize performance and manage costs effectively.

Aug 18, 2025 1,603 words in the original blog post.

July 2025: 4 posts

Postgres in Kubernetes: How to Deploy, Scale, and Manage

Running PostgreSQL databases in Kubernetes, despite being seen as complex due to their stateful nature, offers advantages such as simplified management, scalability, and high availability. While traditionally PostgreSQL might be hosted externally to Kubernetes, deploying it within a Kubernetes cluster allows for efficient resource management and connectivity, especially for applications running in the same cluster. Key components for this setup include Pods for hosting, Persistent Volume Claims for storage, StatefulSets for stable identity, and Services for consistent network access. While advantages include easier scaling and high availability, drawbacks include increased complexity and monitoring challenges. Various deployment methods such as using Helm, operators, or manual configurations are available, with operators generally recommended for their flexibility and automation capabilities. Kubernetes-native observability solutions like groundcover can enhance monitoring efficiency by utilizing eBPF, providing deep insights with minimal overhead, thereby optimizing the performance and reliability of PostgreSQL databases in Kubernetes environments.

Jul 20, 2025 3,356 words in the original blog post.

Fixing Kubernetes Pod Pending: Causes & Solutions

Kubernetes pod pending issues arise when a pod fails to transition from the pending to the running state, primarily due to resource constraints, scheduling conflicts, or configuration errors within the Kubernetes environment. The pending state indicates that a pod is waiting to start, which can occur if there are insufficient nodes, improper node selector or affinity rules, unfulfilled storage requirements, or failed container image pulls. Troubleshooting involves using commands like `kubectl describe pod` to diagnose issues, inspecting node availability, checking persistent volume configurations, and verifying pod specifications. Solutions include adjusting resource requests, correcting scheduling rules, ensuring adequate storage, and validating image paths. Best practices to prevent pending issues include monitoring node capacity, using configuration linting tools, testing deployments in staging environments, and implementing autoscaling. Groundcover is a tool that helps in troubleshooting these issues by providing comprehensive monitoring and visibility into the Kubernetes clusters, assisting in quickly identifying the root causes of pending pods.

Jul 16, 2025 2,596 words in the original blog post.

Why BYOC Is the Future of Observability

Bring Your Own Cloud (BYOC) is emerging as a vital model for observability, addressing the challenges of scale, cost, and data ownership faced by engineering teams. Traditional SaaS observability models, often expensive and non-compliant with data regulations like GDPR and HIPAA, are becoming less viable as data volumes grow. BYOC allows companies to deploy a vendor's backend infrastructure within their own cloud environment, maintaining control and compliance while benefiting from managed services. Unlike traditional on-premises solutions, which are typically costly and complex, BYOC provides the control and privacy of on-prem with the flexibility of cloud-native tools, making it accessible for organizations of all sizes. Groundcover has built its platform on the BYOC model, ensuring telemetry data remains within customer environments and offering a fully managed experience without the unpredictability of ingestion-based pricing. This approach provides the economic benefits of self-hosting without the operational complexities, making BYOC a necessary evolution in the observability landscape to accommodate modern infrastructure needs.

Jul 09, 2025 942 words in the original blog post.

OPA with Kubernetes: How It Works & Benefits of Use

Kubernetes allows users to define and deploy resources using code, but lacks inherent mechanisms to validate these configurations for errors or security issues. Open Policy Agent (OPA) provides a solution by enabling the definition and enforcement of policies through code, using a language called Rego, to ensure that Kubernetes resource definitions meet specific criteria. OPA can be deployed in Kubernetes using either sidecar containers or the Gatekeeper add-on, with Gatekeeper offering a more seamless integration by using custom resource definitions. This allows for scalable policy evaluations and centralized policy management, enhancing security and compliance. OPA policies, once set up, automatically validate resource configurations during deployment to prevent violations, such as missing memory limits or unapproved container registries. Moreover, the integration of OPA with tools like groundcover enhances Kubernetes observability and troubleshooting capabilities, providing comprehensive assurance for resource configuration and performance monitoring.

Jul 03, 2025 2,421 words in the original blog post.

June 2025: 1 post

AI-Ready Observability: Build High-Quality Data Pipelines

The convergence of AI agents and observability is transforming software development and operations by requiring a shift from traditional observability tools, which generate vast amounts of telemetry data, to systems engineered specifically for AI consumption. This transformation necessitates the extraction of intelligent insights and patterns from raw data to avoid overwhelming AI systems, which have limited context windows. As AI agents increasingly take on roles traditionally held by developers, such as code generation and troubleshooting, engineering roles are evolving to focus more on high-level concerns like system architecture and observability infrastructure. Companies like groundcover are addressing these challenges by developing features like Log Patterns and Log Insights, which identify recurring structures in logs and provide structured, contextual data for AI to work with efficiently. This approach enhances developer productivity by allowing AI to quickly identify patterns and surface relevant context, thereby enabling a new model of autonomous operations where AI systems provide immediate feedback on code changes and operational impacts. Organizations that adopt AI-ready observability strategies will be better positioned to leverage AI for operational excellence in the emerging era of autonomous development.

Jun 29, 2025 1,533 words in the original blog post.

May 2025: 4 posts

How groundcover Powers MCP with Digestible Observability Data

Large Language Models (LLMs) face challenges in processing the vast, complex streams of observability data, which include logs, traces, and metrics essential for system behavior analysis. The Model Context Protocol (MCP), introduced by Anthropic, addresses these challenges by standardizing how AI assistants retrieve the necessary context, regardless of the data source or LLM vendor, thus avoiding the need for multiple bespoke integrations. Groundcover's innovative MCP server transforms these raw data streams into AI-ready insights, utilizing purpose-built design choices such as log pattern summarization, drilldown mode for focusing on key attributes, and anomaly detection to provide distilled and structured insights. This approach enhances AI effectiveness by delivering curated, high-value input that aligns with AI reasoning processes, facilitated by a unique architecture combining eBPF sensors with Bring Your Own Cloud (BYOC) capabilities. As a result, AI becomes deeply integrated into observability systems, enabling developers and support teams to conduct investigations, run tests, and debug with greater efficiency and accuracy.

May 27, 2025 1,563 words in the original blog post.

Top 12 Docker Alternatives in 2025: Features & Comparisons

Docker, while often synonymous with container technology, is just one of many platforms for container orchestration, and in 2025, several alternatives offer varied benefits depending on specific use cases. Docker's popularity since its 2013 launch is due to its end-to-end container platform capabilities, but other tools like Kubernetes, Podman, and CRI-O provide alternatives that may integrate better with existing systems, offer enhanced performance, or are easier to learn. Kubernetes, for example, is a popular alternative for orchestration with greater scalability than Docker's Swarm, though it requires a separate container runtime. Podman offers a lightweight, compatible alternative, while BuildKit and Kaniko provide efficient image-building capabilities without needing a Docker daemon. Other tools like Rancher Desktop and OpenShift enhance user experience and integration with specific ecosystems, while solutions like Linux Containers (LXC) and Apache Mesos offer different operational paradigms. Transitioning from Docker can improve resource efficiency and tailor the container management experience but may introduce a learning curve and compatibility challenges. Tools like groundcover can aid in monitoring and observing infrastructure beyond Docker, ensuring seamless transitions to alternatives.

May 04, 2025 3,204 words in the original blog post.

Enhancing Log Management with Log Insights and Log Patterns

Groundcover has introduced two new features, Log Insights and Log Patterns, to enhance log management in cloud-native environments, addressing the challenges of overwhelming log volumes. Log Insights focuses on detecting error anomalies using statistical algorithms and user interactions to improve accuracy over time, thereby reducing time spent on troubleshooting and storage costs. Log Patterns streamlines logs by abstracting dynamic components like timestamps and IP addresses, enabling users to identify recurring issues and trends more efficiently. These features leverage advanced machine learning techniques and are supported by groundcover’s eBPF sensor, which captures logs efficiently at the kernel level, reducing traditional log collection overhead. As groundcover continues to expand its suite of insights, it aims to integrate AI and large language models (LLMs) to further transform observability, promising more precise and less labor-intensive log management in the future.

May 04, 2025 874 words in the original blog post.

Crossplane in Kubernetes: Features & How to Implement It

Crossplane, an open-source project by the Cloud Native Computing Foundation, extends Kubernetes' capabilities by allowing it to manage external IT resources in the same way as internal Kubernetes resources, using familiar tools and configuration strategies. This cloud-native framework serves as a universal control plane, managing not only pods and nodes within a Kubernetes cluster but also external servers and applications through APIs. Unlike traditional Infrastructure-as-Code tools like Terraform, which use standalone frameworks, Crossplane leverages Kubernetes' declarative approach for resource management, enabling developers to describe desired states and allowing Kubernetes to automatically reconcile them. Key features of Crossplane include extensibility through custom resource definitions (CRDs), multi-cloud and hybrid cloud support, continuous reconciliation, policy enforcement, and governance, making it a powerful solution for multi-cloud resource provisioning, infrastructure automation in CI/CD pipelines, and simplifying tooling and account management. Despite its advantages, Crossplane poses a learning curve, relies on provider APIs for functionality, and adds complexity to monitoring and observability, with solutions like groundcover recommended for enhancing visibility. Although not suited for everyone, Crossplane offers a consolidated and declarative resource management experience for those familiar with Kubernetes tooling.

May 04, 2025 3,146 words in the original blog post.

April 2025: 4 posts

Kubernetes DaemonSet: Examples, Use Cases & Best Practices

Kubernetes DaemonSets are a deployment method that ensures certain pods run on specific nodes within a cluster, offering precise control over pod placement, unlike the default Kubernetes behavior which automatically balances workloads across nodes. DaemonSets are particularly useful for deploying monitoring, logging, or security agents across all nodes, enabling consistent observability and performance management. They differ from Deployments and StatefulSets, which serve other purposes like running applications without node-specific constraints or managing stateful storage. Although powerful, DaemonSets can introduce challenges such as resource overhead and update complexity, which need careful management to avoid performance issues. Innovations like groundcover leverage eBPF technology to optimize observability and reduce the resource impact traditionally associated with DaemonSets, offering deeper insights and efficiency. By using DaemonSets with eBPF, users can achieve enhanced visibility without the typical drawbacks of user-space monitoring agents.

Apr 27, 2025 2,660 words in the original blog post.

Kubernetes ConfigMaps: Examples, Benefits & How to Use Them

Containers are often touted for their ability to allow applications to "build once, run anywhere," but this claim can be misleading due to environment-specific configuration settings. Kubernetes ConfigMaps address this issue by separating these settings from the containers themselves, enhancing portability without needing to rebuild container images for each environment. ConfigMaps store configuration data in key-value pairs, making it easy to update settings dynamically without redeploying applications. Unlike environment variables, ConfigMaps offer more convenience as they don't require container redeployment for changes. While similar to Kubernetes Secrets, ConfigMaps are intended for non-sensitive data, as they do not encrypt stored information. ConfigMaps' main advantages include decoupling configuration from applications, enabling dynamic updates, and providing centralized configuration management, though they lack built-in version control and encryption. Advanced uses include facilitating application migration between clusters and environments, but best practices like avoiding storing sensitive data in ConfigMaps, documenting ConfigMaps, and using live edits for updates are crucial for maximizing their utility. Tools like groundcover can aid in monitoring ConfigMap-related issues that might affect application performance, thus supporting the "build once, run anywhere" promise.

Apr 24, 2025 3,021 words in the original blog post.

8 Kubernetes Deployment Strategies: Pros, Cons & Use Cases

Kubernetes offers a range of deployment strategies that provide flexibility in application lifecycle management, each suited to different needs based on factors such as application availability, testing requirements, and workload management. The article compares eight popular Kubernetes deployment strategies, including recreate, rolling, blue/green, canary, A/B testing, shadow, best-effort controlled rollout, and ramped slow rollout, explaining their advantages and disadvantages. These strategies are typically configured in YAML files and are chosen based on the specific requirements of the application, such as stateless or stateful nature, traffic load, and downtime tolerance. Factors like security, compliance, scalability, and auto-healing are also considered when selecting an appropriate strategy. The article emphasizes the importance of simplicity, testing, monitoring, and resource management in optimizing Kubernetes deployments and highlights the role of tools like groundcover in providing observability data to make informed decisions.

Apr 17, 2025 2,991 words in the original blog post.

Kubernetes Network Policy: Benefits, Use Cases & Best Practices

Kubernetes Network Policies are essential for managing traffic flow within a cluster, enhancing both performance and security. Without these policies, a cluster can suffer from excessive and unnecessary network traffic, leading to potential performance degradation and security vulnerabilities. Network Policies define rules for pod communication, specifying ingress and egress traffic, and can be configured using YAML code. They rely on Container Network Interface (CNI) plugins for enforcement, which support basic features like IP filtering, though advanced functionalities depend on the specific CNI used. The benefits of implementing Network Policies include reduced bandwidth usage, improved troubleshooting efficiency, network segmentation, and compliance with data protection regulations. However, challenges such as policy complexity, conflicts, and performance overhead must be managed carefully. Best practices involve setting default policies for the entire cluster, using namespace-level policies to reduce complexity, and ensuring knowledge of the CNI in use. Tools like eBPF and platforms like groundcover enhance the management and troubleshooting of Network Policies by providing visibility into network behavior and traffic flows.

Apr 16, 2025 2,807 words in the original blog post.

March 2025: 7 posts

groundcover enters frontend observability with Real User Monitoring (RUM)

Groundcover has entered frontend observability with its Real User Monitoring (RUM) capability, expanding its Bring Your Own Cloud (BYOC) deployment model to include eBPF and BYOC on the client side. This allows for complete privacy of user data, leveraging groundcover's unique backend observability capabilities in a single platform with frontend observability, facilitating end-to-end correlation and visibility over the entire experience. Frontend teams can now benefit from real-time performance insights, improved user experience and engagement, faster debugging and issue resolution, by monitoring real user behavior and correlating it with backend telemetry. The RUM capability is built into a lightweight custom SDK that seamlessly integrates into existing groundcover backends, collecting detailed data on navigation events, user interactions, custom events, frontend errors and exceptions, and comprehensive session context. This unified view of frontend user experiences alongside backend metrics, logs, and traces accelerates troubleshooting and improves user satisfaction.

Mar 31, 2025 1,406 words in the original blog post.

RBAC in groundcover: Enterprise-Grade Observability Security

At groundcover, security and observability are deeply intertwined. The company has built its sensor with eBPF at its core and uses a unique BYOC deployment model to ensure secure data ingestion and storage. To address this, groundcover is introducing Role-Based Access Control (RBAC), which provides granular control over user access to observability data. This feature allows organizations to define fine-grained access policies, enforce data limitations, seamlessly integrate with Single Sign-On (SSO) solutions, limit access to only necessary data, and ensure real-time policy enforcement. With RBAC, administrators can create custom access policies directly within the groundcover UI, giving them total control over how observability data is accessed across their organization. The system also integrates with SSO providers like Okta, Azure AD, and Google Workspace, ensuring seamless access management for large enterprises.

Mar 27, 2025 759 words in the original blog post.

Introducing groundcover's One-Click Kubernetes Dashboards: Simplifying Observability with eBPF | KubeCon London Announcement

Groundcover has introduced One-Click Kubernetes Dashboards, powered by eBPF technology, to simplify observability and troubleshooting for engineering teams. These dashboards provide instant, context-rich insights into Kubernetes node and cluster health, allowing teams to quickly respond to alerts or proactively monitor system performance. Powered by eBPF, these dashboards offer real-time data collection with minimal overhead, enabling teams to gather granular insights without impacting system performance. The integration also includes seamless data exploration, making it easy for engineers to pivot from visualization to deep-dive data exploration. Additionally, the dashboards are integrated with OpenTelemetry, providing trace and analysis capabilities. Groundcover's platform prioritizes user experience, offering a simplified interface for generating and exploring queries, eliminating the need to master complex query languages or manually configure dashboards. The company aims to build a unified observability experience that empowers teams to monitor, troubleshoot, and optimize their systems effortlessly.

Mar 27, 2025 937 words in the original blog post.

Guide to Kubernetes Resource Quota: Examples & Pros and Cons

In Kubernetes, resource quotas are a key method for managing resources and preventing applications from consuming excessive amounts of CPU, memory, or storage. They work by setting limits on the total number of objects, total resource requests, and total resource usage levels permissible within a given namespace. Resource quotas can be used to manage compute resources, storage resources, object count resources, and extended resources. They offer several advantages, including improved resource utilization, simple limit configurations, and granular controls, but also have some drawbacks, such as complexity, limitations on application performance, and limited granularity. To get the most out of resource quotas, it's recommended to set up namespaces strategically, start high and scale down, monitor and adjust quotas, include both resource requests and limits, and use tools like groundcover for comprehensive visibility into resource usage.

Mar 20, 2025 2,312 words in the original blog post.

Lessons from Yet Another PM Building Dashboards

We built a data visualization platform, Dashboards and Data Exploration, from scratch in just 12 weeks. We started by building the Query Bar, which became the backbone of our platform, allowing users to generate PromQL queries easily. The Query Bar was expanded to support Logs and Traces, and later Infra Metrics, showcasing one of our biggest advantages over competitors: great Infra Monitoring out of the box. We prioritized user feedback, releasing features in quick cycles and iterating in days instead of months. Our platform is now fully integrated with Grafana, but we're shifting focus to building a custom graphing library for full control over visualizations. Future plans include driving adoption, closing feature gaps, and reimagining what dashboards can be, including enhanced sharing capabilities and intelligent automation.

Mar 16, 2025 1,532 words in the original blog post.

How to Drain a Node with kubectl: Step-by-Step Tutorial

In Kubernetes, draining a node is an essential process that allows you to remove pods from a node without disrupting your applications and services. The kubectl drain command provides a way to do this, but it requires careful planning and execution to avoid issues such as downtime, data loss, or application failures. By understanding the benefits of draining nodes, how to use the kubectl drain command, and how to troubleshoot common challenges, you can ensure a smooth and efficient node draining process that minimizes risks and maximizes availability for your applications.

Mar 02, 2025 2,499 words in the original blog post.

Kubernetes Liveness Probes: Configuration & Best Practices

Kubernetes liveness probes are a crucial component of Kubernetes monitoring and observability, allowing you to determine whether a container is "alive" or running normally. By providing visibility into what's happening inside a pod, probes play an important role in detecting issues before they become major problems. Liveness checks can be categorized into four types: command execution, TCP socket, HTTP GET, and gRPC. Probes offer several benefits, including improved application availability, efficient health checks, automated restarts, and customizability. However, liveness probes also have limitations, such as providing little insight into why a container is not working normally and not assessing performance. To configure Kubernetes liveness probes, you can use YAML code that defines the type of probe to run, parameters like initial delay and period seconds, and failure threshold. It's essential to test probes manually before deploying them automatically and to update probes as applications change. By combining liveness probes with observability tools, such as groundcover, you can gain deeper insights into container performance issues and fix problems permanently.

Mar 02, 2025 2,559 words in the original blog post.

February 2025: 1 post

Kubernetes Secrets Management: Limitations & Best Practices

Kubernetes secrets are objects that provide access to and manage sensitive data, allowing workloads and services to authenticate with each other and unlock resources protected by access controls. They help avoid embedding sensitive data directly into configuration files or commands, which would be insecure. Secrets can store any type of data defined by users, including authentication secrets, service account tokens, encryption secrets, SSH credentials, and opaque secrets. Kubernetes secrets use cases include supporting logins and data encryption for cloud-native applications and managing interactions within the cluster using service accounts. Creating secrets in Kubernetes involves using kubectl to generate or reference a file containing secret values. Once created, secrets can be used by injecting them into workloads as environment variables or mounting them as data volumes. However, Kubernetes secrets have limitations, including lack of encryption, challenges with secret rotation, misconfiguration risks, limited integration with external systems, lack of scalability, and limited auditing. External secrets managers like HashiCorp Vault or AWS Secrets Manager can address these limitations by providing stronger security, comprehensive secrets management, enhanced scalability, and improved auditing capabilities. Best practices for managing Kubernetes secrets include encrypting secrets, using short-lived secrets, auditing RBAC configurations, implementing secrets rotation, avoiding hard-coded credentials, limiting secrets access to specific containers, and securing secret data after access.

Feb 25, 2025 1,945 words in the original blog post.

2025 Blog Summaries