ClueWeb 09 Labeled Near-Duplicate News Articles

Language: English
This data release is a companion to the paper "Duplicate News Story Detection Revisited" by Omar Alonso, Dennis Fetterly, and Mark Manasse, published at the Ninth Asia Information Retrieval Societies Conference (AIRS 2013) in December 2013.

Last published: August 28, 2013.