Download Microsoft Research Paraphrase Corpus from Official Microsoft Download Center

The Surface family of devices

Surface devices

Anything but ordinary

Person using Power BI Desktop

Power BI

Transform data into actionable insights with dashboards and reports

Microsoft Research Paraphrase Corpus

Important! Selecting a language below will dynamically change the complete page content to that language.

Language:
English
This download consists of data only: a text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship. Last published: March 3, 2005.