Microsoft Research Question-Answering Corpus

Language: English
This download consists of data only: a text file containing 1.4K questions aimed at the text of Encarta 98, the full text of Encarta 98, and a set of human annotations identifying pieces of text in Encarta that fully or partially answer each question. Last published: November 13, 2008.
  • Version: 1.0.0
    File Name: MSR Encarta QA Corpus.msi
    Date Published: 5/12/2016
    File Size: 36.8 MB

      The annotations also record the precise nature of each match, such as whether the linguistic forms of the question and the answer are similar. The annotation data has been split in two different ways to support different algorithm-training methodologies: 1) 10 files, each containing 10 percent of the original 1.4K questions, along with the full set of answers for each question, and 2) 10 files, each containing 10 percent of the full, pooled set of 10K+ question/answer pairs.
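
The difference between the two splits can be illustrated with a minimal Python sketch. It is not the procedure used to build the corpus: the `(question_id, answer_id)` tuple layout, the function names, and the round-robin assignment are all assumptions made for illustration, and the actual file format is not described here.

```python
from collections import defaultdict


def split_by_question(pairs, n_folds=10):
    """Split 1 (hypothetical): deal whole questions into folds.

    Every answer for a given question lands in the same fold.
    """
    by_question = defaultdict(list)
    for question_id, answer_id in pairs:
        by_question[question_id].append((question_id, answer_id))

    folds = [[] for _ in range(n_folds)]
    # Round-robin over sorted question ids; the real assignment is unknown.
    for i, question_id in enumerate(sorted(by_question)):
        folds[i % n_folds].extend(by_question[question_id])
    return folds


def split_by_pair(pairs, n_folds=10):
    """Split 2 (hypothetical): deal pooled question/answer pairs into folds.

    Answers to one question may be spread across several folds.
    """
    folds = [[] for _ in range(n_folds)]
    for i, pair in enumerate(pairs):
        folds[i % n_folds].append(pair)
    return folds


# Toy data standing in for the annotations: 20 questions with 1 to 3 answers each.
pairs = [(q, a) for q in range(20) for a in range(q % 3 + 1)]

question_folds = split_by_question(pairs)
pair_folds = split_by_pair(pairs)
```

With the first split, a model evaluated on one held-out file never sees any answers for the held-out questions during training. With the second, the files are nearly equal in size, but the same question can appear in both training and held-out files.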
  • Supported Operating System

    Windows 7, Windows 8, Windows 10

      • Click Download and follow the instructions.