Vibe Draw is an innovative voice-first creative tool developed as a weekend project by Ryan Morrison, combining ElevenLabs' voice AI with FLUX Kontext for voice-powered image creation. This tool allows users to create and manipulate images by simply describing them out loud, leveraging FLUX Kontext's ability to generate and edit images based on spoken prompts. Vibe Draw utilizes various technologies, including the Web Speech API for speech recognition and ElevenLabs' text-to-speech API for responsive voice interactions, all running client-side for lightweight functionality, though it recommends server-side handling for production security. The system tackles challenges like natural language understanding and contextual awareness to distinguish between new creations and edits, ensuring seamless user experience with an audio queue system to manage responses. The project demonstrates the potential of conversational AI in visual creativity, removing barriers between imagination and execution, and opens possibilities for new capabilities like multimodal input and collaborative sessions.