Finding Actors and Actions in Movies

  • P. Bojanowski ,
  • F. Bach ,
  • I. Laptev ,
  • J. Ponce ,
  • C. Schmid ,
  • J. Sivic

International Conference on Computer Vision |

Published by IEEE

PDF | PDF | PDF | Publication | Publication | Publication | Publication | Publication | Publication | Publication

We address the problem of learning a joint model of actors and actions in movies using weak supervision provided by scripts. Specifically, we extract actor/action pairs from the script and use them as constraints in a discriminative clustering framework. The corresponding optimization problem is formulated as a quadratic program under linear constraints. People in video are represented by automatically extracted and tracked faces together with corresponding motion features. First, we apply the proposed framework to the task of learning names of characters in the movie and demonstrate significant improvements over previous methods used for this task. Second, we explore the joint actor/action constraint and show its advantage for weakly supervised action learning. We validate our method in the challenging setting of localizing and recognizing characters and their actions in feature length movies Casablanca and American Beauty.