VITRA Redefines VLA Pre-training Paradigms via Human Video Reconstruction
When you see robots participating in running races or performing folk dances on stage, you might envision a future where a simple natural language command is all it takes for a robot to tidy up a desk, clean a room, or even serve tea. For…