KubeCon EU 2026 Recap: The Year AI Moved Into Production on Kubernetes
Blog post from Pulumi
KubeCon + CloudNativeCon EU 2026 in Amsterdam highlighted the growing focus on AI, particularly the challenges and developments surrounding AI workloads on Kubernetes. Despite 66% of organizations using Kubernetes for generative AI, only 7% manage daily production deployments, revealing a significant gap between experimentation and real-world application. The conference showcased several advancements, including NVIDIA's contributions to GPU infrastructure with projects like the DRA driver and KAI Scheduler, which enhance GPU management in Kubernetes. The introduction of llm-d, a collaboration between major tech companies, signifies a move towards an AI runtime for Kubernetes, addressing inference complexity. There was a strong emphasis on agentic AI, with projects like kagent and kagenti focusing on agent management and security. The conference also underscored the importance of sovereignty in infrastructure architecture due to regulatory pressures, with data residency and GPU multi-tenancy being key considerations. Overall, the event demonstrated that while significant progress is being made, there is still a considerable gap between current capabilities and the demands of production-level AI on Kubernetes.