Model
Gemini Robotics
Google DeepMind's vision-language-action model for embodied reasoning, tool use, and physical interaction.
Model metadata
Modalities: text, image, action
Model
Google DeepMind's vision-language-action model for embodied reasoning, tool use, and physical interaction.