Clarifai Release 8.4

Post Details

Company

Clarifai

Date Published

May 3, 2022

Author

Ian Kelk

Word Count

425

Language

English

Hacker News Points

-

Source URL

www.clarifai.com/blog/clarifai-release-8.4

Summary

In the latest Clarifai Release 8.4, several advanced AI models have been introduced for diverse applications, including logo detection, image captioning, optical character recognition (OCR), and grammar correction. The Logo Detection model, utilizing YOLOv5, is trained on a comprehensive dataset to identify nearly 3,500 logos in images and videos. The Image Captioning model, based on Salesforce's BLIP framework, offers state-of-the-art vision-language understanding and generation capabilities. For image recognition, the Vision Transformer model provides a versatile solution with leading performance metrics. The OCR model employs Microsoft's Transformer OCR fine-tuned on the SROIE dataset to accurately recognize printed text, using a combination of image and text Transformers. Lastly, the English Grammar Correction model, known as Gramformer, leverages the T5 architecture to detect and correct grammatical errors with high precision, supported by a quality estimator. Each model is accessible for users to explore and integrate into various applications through the Clarifai platform.