Microsoft Research Video Description Corpus

Language:
English
This data consists of about 120K sentences collected during the summer of 2010. Last published: November 12, 2010.
  • Version:

    1.0

    File Name:

    VideoDescriptions.msi

    Date Published:

    5/12/2016

    File Size:

    2.6 MB

      This data consists of about 120K sentences collected during the summer of 2010. Workers on Mechanical Turk were paid to watch a short video snippet and then summarize the action in a single sentence. The result is a set of roughly parallel descriptions of more than 2,000 video snippets. Because the workers were urged to complete the task in the language of their choice, both paraphrase and bilingual alternations are captured in the data. We expect this data to be useful for training and testing translation and paraphrase algorithms. A paper describing how the data was created and used is in progress.
  • Supported Operating System

    Windows 10 , Windows 7, Windows 8

      • Windows 7, Windows 8, or Windows 10
      • Click Download and follow the instructions.
Site feedback
Microsoft

What category would you like to give web site feedback on?



Rate your level of satisfaction with this web page today:

Comments:

Submit