Google DeepMind Unveils Gemini Robotics-ER 1.6: Teaching Robots to Read Industrial Instruments
Google DeepMind has released Gemini Robotics-ER 1.6, a specialized embodied reasoning model designed to function as a sophisticated high-level brain for robotic systems. The update introduces substantial advancements in spatial reasoning, multi-view perception, and a novel capability for interpreting complex industrial instruments. In benchmarking tests, the model achieved a 93% success rate on instrument-reading tasks, compared to just 23% with the previous version. The model utilizes a process called 'agentic vision' to autonomously zoom into relevant areas, identify key points such as needles and tick marks, and execute internal code to calculate precise values based on dial proportions. This breakthrough transforms a robot from a mobile camera into a functional inspector capable of making real-time decisions based on the data it sees.