MSR Abstractive Text Compression Dataset
This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality.
Version: 1.0
Date Published: 11/14/2016
File Name: Release.zip
File Size: 17.5 MB
The dataset was built from source texts in the Open American National Corpus (www.anc.org) and crowd-sourced compressions. More details can be found in the included README and in the paper “A dataset and evaluation metrics for abstractive compression of sentences and short paragraphs” [Toutanova, Brockett, Tran, and Amershi, EMNLP 2016].
Supported Operating Systems
Android, Apple Mac OS X, Linux, Windows 10, Windows 8
- Click Download and follow the instructions.
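Once Release.zip has been downloaded, its contents can be inspected and unpacked with a few lines of Python. The following is a minimal sketch using only the standard library. It assumes the archive sits in the current directory, and the output folder name `msr_compression` is a hypothetical choice. It makes no assumption about the files inside the archive, so consult the included README for the actual layout.

```python
import zipfile
from pathlib import Path


def list_members(archive_path):
    """Return the names of all entries in the zip archive."""
    with zipfile.ZipFile(archive_path) as zf:
        return zf.namelist()


def extract_all(archive_path, dest_dir):
    """Extract every entry into dest_dir and return the extracted file paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as zf:
        zf.extractall(dest)
    return sorted(p for p in dest.rglob("*") if p.is_file())


if __name__ == "__main__":
    archive = Path("Release.zip")  # assumed location: current directory
    if archive.exists():
        # Print the archive listing first, then unpack everything.
        for name in list_members(archive):
            print(name)
        extract_all(archive, "msr_compression")  # hypothetical output folder
    else:
        print("Release.zip not found; download it first.")
```

Listing the entries before extracting is a convenient way to locate the README without unpacking the full 17.5 MB archive.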