Microsoft Research Video Description Corpus
This data consists of about 120K sentences collected during the summer of 2010. Workers on Mechanical Turk were paid to watch a short video snippet and then summarize the action in a single sentence. The…
WikiBhasha: Enhancing Multilingual Content in Wikipedia
Making Wikipedia more multilingual inspired creation of the WikiBhasha tool. WikiBhasha—“Wiki,” signifying its community-oriented approach; “Bhasha,” a Sanskrit word meaning “language”—features a content-creation platform that combines linguistic services, such as machine translation, with a Wikipedia-friendly…
Enhancing Multilingual Content in Wikipedia
By Douglas Gantenbein, Senior Writer, Microsoft News Center
Wikipedia has become one of the world’s largest and perhaps most powerful information repositories. But it is heavily English-centric. Making Wikipedia more multilingual inspired a Microsoft Research…
Software Aids Language Learners
By Gary Alt, Writer, Microsoft
Imagine mining the web to learn a language. No, not the jargon of webspeak, where IMHO means “in my humble opinion” or F2F is “face to face,” but real, spoken…