Microsoft Research Asia Chinese Word-Segmentation Data Set
A set of manually annotated Chinese word-segmentation data and specifications for training and testing a Chinese word-segmentation system for research purposes. The data was extracted from the People’s Daily, which we have licensed for commercial…