Gemini Robotics

Google DeepMind's vision-language-action model for embodied reasoning, tool use, and physical interaction.

Closed source, multimodal, Gemini Robotics, generally available, Google DeepMind

Model metadata

Modalities: text, image, action
