Gemini Robotics 1.5: Thinking while acting

Gemini Robotics 1.5 showcases an advanced robotic system capable of perceiving its environment, verbalizing its thought process, and adapting in real-time to perform complex tasks like fruit sorting and laundry organization with emergent problem-solving abilities. Demonstrating flexibility and generalization even with novel objects, the robot highlights significant progress toward autonomous, intelligent robots capable of integrated thinking and acting in dynamic, real-world settings.

The video showcases the capabilities of Gemini Robotics 1.5, an advanced robotic system designed to think while acting. Initially, the robot is demonstrated performing a simple task of sorting fruits by color into matching plates. This task, while seemingly straightforward, highlights the robot’s ability to perceive its environment, recognize different colors of objects and plates, and execute a multi-step process by thinking through each step before completing the task. This foundational ability is significant as it underpins the potential for more complex applications.

The presenter then introduces a more sophisticated task involving laundry sorting. The robot, named Apollo, is asked to sort clothes into white and dark bins. What makes this demonstration impressive is that Apollo not only performs the task but also verbalizes its thought process in real-time. For example, it identifies which item to pick up next and decides where to place it. This transparency allows observers to understand the robot’s decision-making and see how it chains multiple actions together seamlessly, moving beyond simple one-step commands to a continuous, adaptive workflow.

A particularly notable feature of Gemini Robotics 1.5 is its emergent problem-solving ability. During the laundry sorting task, Apollo encounters a situation where it needs to adjust the position of a bin to successfully pick up a cloth. This adjustment was not explicitly programmed but emerged naturally during the robot’s inference process, demonstrating a level of autonomy and reactivity that is rare in robotic systems. The robot’s ability to adapt to changes in its environment in real-time is a key advancement.

To further test the robot’s reactivity and perception, the presenter challenges Apollo with packing tasks involving familiar objects placed in varying locations. The robot successfully adapts to these changes, showing that its responses are not hardcoded but flexible within the domain it has learned. This adaptability is crucial for real-world applications where environments are dynamic and unpredictable.

Finally, the video pushes the boundaries by introducing completely new objects that Apollo has never encountered before, such as a brown football and a loofah. Despite the novelty, Apollo manages to handle these objects effectively, demonstrating generalization capabilities. The video concludes with a successful completion of the laundry sorting task, emphasizing Gemini Robotics 1.5’s potential to power more sophisticated robots, including humanoids, to perform complex daily tasks with integrated thinking and acting abilities.