This is the Trace Id: 7465622223f0b802d932606c18a3bf2f
Skip to main content Microsoft 365 Office Azure Copilot Windows Support Windows Apps OneDrive Outlook Moving from Skype to Teams OneNote Microsoft Teams Accessories PC games Microsoft AI Microsoft Security Azure Dynamics 365 Microsoft 365 for business Microsoft Power Platform Windows 365 Digital Sovereignty Microsoft Developer Microsoft Learn Support for AI marketplace apps Microsoft Tech Community Microsoft Marketplace Visual Studio Marketplace Rewards Free downloads & security Education Gift cards View Sitemap

Data Set of English-Spanish Term Vectors from Wikipedia

This data set consists of the term vectors extracted from 60,730 Wikipedia English articles and their comparable Spanish articles, sampled in 2009. Last published: August 8, 2011.

Important! Selecting a language below will dynamically change the complete page content to that language.

Download
  • Version:

    1.0.0

    Date Published:

    12/15/2023

    File Name:

    EN-ES_Wiki.zip

    File Size:

    218.4 MB

    This data set consists of the term vectors extracted from 60,730 Wikipedia English articles and their comparable Spanish articles, sampled in 2009. We used this data set to test various models for creating translingual document representations, work published in [Platt et al. EMNLP-2010] and [Yih et al. CoNLL-2011]. More detail of this data set can be found in the ReadMe file.
  • Supported Operating Systems

    Windows 10, Windows 7, Windows 8

    • Windows 7, Windows 8, or Windows 10
    • Click Download and follow the instructions.