Building real-world on-device AI with LiteRT and NPU
Blog post from Google Cloud
LiteRT is a cross-platform framework designed to enhance on-device AI performance by leveraging Neural Processing Units (NPUs) across various platforms such as mobile, desktop, and IoT. It allows developers to integrate advanced AI models without the need for vendor-specific code, providing both CPU, GPU, and NPU acceleration. This framework is utilized by industry leaders like Google Meet, Epic Games, and Argmax Inc. for applications ranging from real-time video effects and facial animation to speech recognition, demonstrating significant improvements in efficiency and responsiveness. LiteRT simplifies the deployment of AI features by abstracting complex NPU integrations and supporting a wide range of hardware, thus enabling developers to optimize performance across different devices. The Google AI Edge Gallery App and Google AI Edge Portal offer developers tools to test, validate, and benchmark their AI models, ensuring optimal performance across various configurations.