Lemon Slice is building the next generation of video foundation models focused on humans, with a platform that allows anyone to create videos of expressive, talking characters. Their goal is to envision AI video not just as a creator tool, but as the future of interactive media and embodied AI. To achieve this, they developed a zero-shot model that supports 25fps streaming, which can generate real-time video of a character speaking with just a single image and audio input. Lemon Slice partnered with Daily to productize their model into a complete interactive experience, leveraging Pipecat's open-source framework for building multi-modal AI applications. The collaboration enabled Lemon Slice to focus on pushing the boundaries of AI video generation while handling complex infrastructure and architectural challenges.