Home / Companies / Voxel51 / Blog / Post Details
Content Deep Dive

Ask Your Images Anything

Blog post from Voxel51

Post Details
Company
Date Published
Author
Jacob Marks
Word Count
1,069
Language
English
Hacker News Points
-
Summary

This week's FiftyOne Plugin is the Visual Question Answering (VQA) Plugin. It allows users to ask open-ended questions about their images within the FiftyOne App, without writing any code. The plugin supports two models: a Vision-Language Transformer and BLIP2 from Salesforce. Users can install the plugin using pip and download it from GitHub. The VQA Plugin is a Python Plugin consisting of four files and doesn't write anything onto the samples themselves, but returns the model's answer as output.