MediaPipe on the Web

Blog post from Google Cloud

Post Details
Company:
Date Published:
Author: -
Word Count: 797
Language: English
Hacker News Points: -
Summary

MediaPipe is a framework for building cross-platform multimodal machine learning pipelines. Its team, led by Michael Hays and Tyler Mullen, has developed a way to run these pipelines directly in web browsers using WebAssembly and the XNNPack ML inference library. The capability is integrated into the MediaPipe Visualizer tool, so users can design and test MediaPipe graphs live in the browser, with WebGL handling rendering and a simple API handling communication between JavaScript and C++. WebAssembly generally executes faster than JavaScript but slower than native C++, which prompted optimizations such as running image operations on the GPU and using XNNPack to speed up inference. Current limitations include a restricted set of graph calculators and single-threaded execution; the team plans to expand functionality and remove these constraints. The article acknowledges contributions from several individuals and encourages readers to follow the Google Developer blog and Twitter for updates.
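
To make the JavaScript-to-WebAssembly call path mentioned in the summary concrete, here is a minimal sketch that uses only the standard WebAssembly JavaScript API. It is not MediaPipe's API: the tiny hand-assembled module and the names `wasmBytes`, `loadAdd`, and `add` are hypothetical stand-ins for a compiled C++ function and its binding.

```typescript
// Hand-assembled WebAssembly module exporting add(a: i32, b: i32) -> i32.
// It stands in for a compiled C++ function; it is not MediaPipe code.
const wasmBytes = new Uint8Array([
  0x00, 0x61, 0x73, 0x6d, 0x01, 0x00, 0x00, 0x00, // magic "\0asm", version 1
  0x01, 0x07, 0x01, 0x60, 0x02, 0x7f, 0x7f, 0x01, 0x7f, // type section: (i32, i32) -> i32
  0x03, 0x02, 0x01, 0x00, // function section: one function using type 0
  0x07, 0x07, 0x01, 0x03, 0x61, 0x64, 0x64, 0x00, 0x00, // export section: "add" is function 0
  0x0a, 0x09, 0x01, 0x07, 0x00, // code section: one body, no locals
  0x20, 0x00, 0x20, 0x01, 0x6a, 0x0b, // local.get 0, local.get 1, i32.add, end
]);

type AddFn = (a: number, b: number) => number;

// Compile and instantiate synchronously (acceptable for a module this small),
// then hand the exported function back to the JavaScript side.
function loadAdd(): AddFn {
  const wasmModule = new WebAssembly.Module(wasmBytes);
  const instance = new WebAssembly.Instance(wasmModule, {});
  return instance.exports.add as AddFn;
}

const add = loadAdd();
console.log(add(2, 3)); // prints 5
```

A real pipeline would load a much larger compiled module asynchronously and pass image or tensor data through shared memory rather than plain integers, but the shape of the call is the same: JavaScript invokes an exported function and receives its result.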