Prompting Google Bard with Images & How it Compares to Bing
Blog post from Roboflow
Google's Bard chatbot has introduced a new feature allowing it to accept image prompts, positioning it as a multimodal tool similar to Microsoft's Bing chat powered by OpenAI's GPT-4. While Bard shows promising results in image captioning and classification tasks with a high accuracy rate, it struggles with tasks like counting objects and entirely rejects images containing human faces, unlike Bing which blurs them. The analysis suggests that Bard's image capabilities may be based on a combination of Google Lens and other Google services, indicating potential for generalized search and lookup tasks rather than specific computer vision applications. Despite its limitations in handling complex computer vision tasks, Bard's integration of Google's extensive features makes it a useful tool for consumer-friendly image tasks and suggests its strength lies in zero-shot image-to-text applications and general image classification without prior training.