Building the Next Generation of Physical Agents with Gemini Robotics-ER 1.5

Post Details

Company

Google Cloud

Date Published

Sept. 25, 2025

Author

Kendra Byrne, and Fei Xia

Word Count

1,960

Company Posts That Month

20

Language

English

Hacker News Points

-

Source URL

developers.googleblog.com/building-the-next-generation-of-physical-agents-with-gemini-robotics-er-15

Summary

Gemini Robotics-ER 1.5, now available in preview via Google AI Studio and the Gemini API, is a pioneering model designed to enhance robotics with advanced embodied reasoning capabilities. This model excels in visual and spatial understanding, task planning, and progress estimation, making it adept at handling complex tasks that require contextual information and multiple steps, such as sorting objects into recycling bins based on local guidelines. It is optimized for rapid spatial reasoning, generating precise 2D points, and orchestrating advanced agentic behaviors through spatial and temporal reasoning. Users can control the latency versus accuracy trade-off, allowing the model to think longer for complex tasks or respond quickly for simpler ones. Additionally, Gemini Robotics-ER 1.5 includes improved semantic safety filters and physical constraint awareness, ensuring safer operation within defined parameters. As a high-level reasoning engine for robots, it integrates with various tools and APIs to execute sophisticated tasks, demonstrating significant performance on both academic and internal benchmarks.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	1	3,636	538	190	-7%