Build a Realtime Video Restyling Agent with Gemini 3 + Decart AI
Blog post from Stream
Released on November 18, 2025, Google's Gemini 3 enables the creation of AI applications with advanced multimodal reasoning and tool-use capabilities, which can be combined with Decart AI's Mirage LSD to produce zero-latency, real-time video restyling agents. This technology allows users to transform their live camera feeds into various artistic styles, such as Neon Nostalgia or Studio Ghibli, using simple voice commands, without any additional scaffolding. The integration of speech recognition and synthesis models further enhances the user experience by allowing seamless and instantaneous video style changes in response to spoken prompts. The system utilizes a stack of technologies, including Google's Gemini 3 for prompt understanding, Decart AI for video processing, and additional tools for speech-to-text and text-to-speech functionalities. The open-source Vision Agents framework powers the entire setup, enabling developers to rapidly deploy a fully-functional, low-latency video AI agent in pure Python with minimal code.