The model is fine. The session is broken.
Blog post from Ably
AI agent demos often falter in production because of unreliable connections and poor user experience, not because of model deficiencies. Even as model capabilities advance, talking to an agent across devices remains fragile: streams break mid-response, sessions cannot move from one device to another, and agents fail silently. The problem lies in the delivery layer, where current frameworks are not built for durable sessions that survive network drops and device changes. Companies like Ably are addressing this gap by building infrastructure for persistent, multi-device connections, aiming for the kind of seamless experience that Uber and Deliveroo achieved through reliable service delivery. With that infrastructure in place, even smaller teams without extensive engineering resources can offer robust AI services.
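
To make "durable session" concrete, here is a minimal in-memory sketch of one common way to get there: the session keeps an ordered log of streamed chunks, and each device remembers the last sequence number it saw, so a reconnect or a new device can replay whatever it missed. The names `DurableSession`, `Client`, `publish`, `replay_after`, and `sync` are hypothetical and chosen for illustration; this is not Ably's API, and a real delivery layer would also need persistence, ordering guarantees across servers, and authentication.

```python
from dataclasses import dataclass, field


@dataclass
class DurableSession:
    """Hypothetical in-memory session log (illustration only, not a vendor API)."""

    events: list = field(default_factory=list)

    def publish(self, chunk: str) -> int:
        """Append a streamed chunk and return its 1-based sequence number."""
        self.events.append(chunk)
        return len(self.events)

    def replay_after(self, last_seen: int) -> list:
        """Return (seq, chunk) pairs with a sequence number above last_seen."""
        return [
            (index + 1, chunk)
            for index, chunk in enumerate(self.events)
            if index + 1 > last_seen
        ]


@dataclass
class Client:
    """A device that remembers the last sequence number it received."""

    last_seen: int = 0
    received: list = field(default_factory=list)

    def sync(self, session: DurableSession) -> None:
        """Catch up on every chunk published since this client last synced."""
        for seq, chunk in session.replay_after(self.last_seen):
            self.received.append(chunk)
            self.last_seen = seq


def demo() -> tuple:
    session = DurableSession()
    phone = Client()

    session.publish("Hello")
    phone.sync(session)       # phone is online and receives the first chunk

    session.publish(", ")     # phone drops offline; the agent keeps streaming
    session.publish("world")

    phone.sync(session)       # reconnect: only the missed chunks are replayed

    laptop = Client()         # a second device joins the same session
    laptop.sync(session)      # and replays the log from the beginning

    return "".join(phone.received), "".join(laptop.received)


if __name__ == "__main__":
    print(demo())
```

The design choice worth noting is that the session, not the connection, owns the state: a dropped socket loses nothing because the log outlives it, and a device change is just another client syncing from its own last-seen position.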