Home / Companies / Pulumi / Blog / Post Details
Content Deep Dive

KubeCon EU 2026 Recap: The Year AI Moved Into Production on Kubernetes

Blog post from Pulumi

Post Details
Company
Date Published
Author
Engin Diri
Word Count
1,795
Language
English
Hacker News Points
-
Summary

KubeCon + CloudNativeCon EU 2026 in Amsterdam highlighted the growing focus on AI, particularly the challenges and developments surrounding AI workloads on Kubernetes. Despite 66% of organizations using Kubernetes for generative AI, only 7% manage daily production deployments, revealing a significant gap between experimentation and real-world application. The conference showcased several advancements, including NVIDIA's contributions to GPU infrastructure with projects like the DRA driver and KAI Scheduler, which enhance GPU management in Kubernetes. The introduction of llm-d, a collaboration between major tech companies, signifies a move towards an AI runtime for Kubernetes, addressing inference complexity. There was a strong emphasis on agentic AI, with projects like kagent and kagenti focusing on agent management and security. The conference also underscored the importance of sovereignty in infrastructure architecture due to regulatory pressures, with data residency and GPU multi-tenancy being key considerations. Overall, the event demonstrated that while significant progress is being made, there is still a considerable gap between current capabilities and the demands of production-level AI on Kubernetes.