
H Company's new Holo2 model takes the lead in UI Localization

Blog post from Hugging Face

Post Details
Author: Ramzi De Coster, Hamza Benchekroun, and Aurélien Lac
Word Count: 214
Summary

H Company has introduced Holo2-235B-A22B Preview, a UI localization model that sets a new state of the art with 78.5% accuracy on ScreenSpot-Pro and 79.0% on OSWorld-G. Released two months after the first Holo2 models, this version focuses on localizing UI elements more accurately, particularly on high-resolution 4K interfaces where small elements are hard to detect. The model uses agentic localization, which lets it iteratively refine its predictions and yields significant relative gains across all Holo2 model sizes. In a single step, Holo2-235B-A22B Preview reaches 70.6% accuracy on ScreenSpot-Pro; in agent mode, it reaches 78.5% within three steps. The model is available on Hugging Face as a research release focused on UI element localization.
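The summary does not say how agentic localization refines a prediction. As an illustration only, the sketch below assumes a crop-and-zoom loop: the model is queried again on a smaller window centred on its previous guess. The names `refine_click`, `toy_predictor`, and `relative_gain` are hypothetical, and the toy predictor is a stand-in for a real model, not the Holo2 API.

```python
from typing import Callable, Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom) in pixels
Point = Tuple[float, float]


def refine_click(predict: Callable[[Box], Point], screen: Box,
                 steps: int = 3, zoom: float = 0.5) -> Point:
    """Assumed crop-and-zoom loop: re-query the predictor on a smaller
    window centred on its previous guess. Not confirmed by the source."""
    region = screen
    point = predict(region)
    for _ in range(steps - 1):
        width = (region[2] - region[0]) * zoom
        height = (region[3] - region[1]) * zoom
        # Centre the new window on the last guess, clamped to the screen.
        left = min(max(point[0] - width / 2, screen[0]), screen[2] - width)
        top = min(max(point[1] - height / 2, screen[1]), screen[3] - height)
        region = (left, top, left + width, top + height)
        point = predict(region)
    return point


def toy_predictor(target: Point) -> Callable[[Box], Point]:
    """Stand-in for a model: its error is 1% of the window size."""
    def predict(region: Box) -> Point:
        err_x = 0.01 * (region[2] - region[0])
        err_y = 0.01 * (region[3] - region[1])
        return (target[0] + err_x, target[1] + err_y)
    return predict


def relative_gain(before: float, after: float) -> float:
    """Relative improvement of `after` over `before`, in percent."""
    return (after - before) / before * 100.0
```

With the reported ScreenSpot-Pro scores, `relative_gain(70.6, 78.5)` comes to roughly 11.2%, which is the relative improvement of three-step agent mode over the single-step result. On a 3840x2160 screen, the toy predictor's error halves with each zoom step, which is the kind of behaviour that would help with small elements on 4K interfaces.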