Build a Realtime Video Restyling Agent with Gemini 3 + Decart AI

Post Details

Company

Stream

Date Published

Dec. 16, 2025

Author

Amos G.

Word Count

1,061

Company Posts That Month

32

Language

English

Hacker News Points

-

Source URL

getstream.io/blog/gemini-decart-restyling-agent

Summary

Google's Gemini 3, released on November 18, 2025, offers multimodal reasoning and tool-use capabilities that can be combined with Decart AI's Mirage LSD to build real-time video restyling agents. Such an agent transforms a user's live camera feed into artistic styles, such as Neon Nostalgia or Studio Ghibli, in response to simple voice commands and without additional scaffolding. Speech recognition and speech synthesis models let users change the video style by speaking a prompt, with the new style applied to the live feed at low latency. The stack consists of Gemini 3 for prompt understanding, Decart AI for video processing, and separate tools for speech-to-text and text-to-speech. The open-source Vision Agents framework ties these components together, letting developers deploy a fully functional, low-latency video AI agent in pure Python with minimal code.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
Real-time	9	7,285	1,202	224	+60%
LLM	7	3,775	638	202	-32%
AI Agents	1	2,834	598	185	-18%
Voice AI	1	552	97	35	-50%