NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI
Blog post from HuggingFace
NVIDIA has unveiled Cosmos Reason 2, an advanced open reasoning vision-language model designed to enhance physical AI by enabling robots and AI agents to perform complex tasks in the real world with human-like understanding and planning capabilities. This model surpasses its predecessor in accuracy, ranking as the top open model on Physical AI and Physical Reasoning leaderboards, and supports improved spatio-temporal understanding, timestamp precision, and an expanded set of spatial and visual perception capabilities. Cosmos Reason 2 is adaptable to various use cases, including video analytics, data annotation, and robot planning, with successful applications reported in industries such as autonomous driving and workplace safety. It supports flexible deployment options, from edge to cloud, and is available in different model sizes. Users can explore its features on NVIDIA's platform and download models from Hugging Face, with further availability on major cloud services anticipated.