Home / Companies / Voxel51 / Blog / Post Details
Content Deep Dive

Natural-Language Video Search in FiftyOne with TwelveLabs

Blog post from Voxel51

Post Details
Company
Date Published
Author
Jimmy Guerrero
Word Count
1,248
Company Posts That Month
2
Language
English
Hacker News Points
-
Summary

TwelveLabs and FiftyOne have integrated their technologies to enhance video data handling through natural-language video search, transforming how users interact with video datasets. TwelveLabs provides video foundation models like Marengo and Pegasus, which enable embedding video clips into a shared space with text for easy searching by description, and generating natural-language captions and answers for video content. This integration allows users to embed, search, and caption videos using plain English without the need for local GPUs, as computations occur server-side via the TwelveLabs API. FiftyOne, an open-source toolkit, complements this by offering a visual app for exploring images and video, employing a query language for data slicing, and a "Brain" layer for advanced analysis, thereby facilitating better data curation. This partnership unlocks enhanced functionalities such as natural-language video search, zero-shot captioning, similarity detection, and embedding visualizations, enabling users to manage video datasets more efficiently and intuitively.

Trends Found in this Post

No tracked trend matches for this post yet.