{"id":170480,"date":"2010-06-09T20:07:32","date_gmt":"2010-06-09T20:07:32","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/project\/microsoft-learning-to-rank-datasets\/"},"modified":"2022-01-13T03:51:12","modified_gmt":"2022-01-13T11:51:12","slug":"mslr","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/mslr\/","title":{"rendered":"Microsoft Learning to Rank Datasets"},"content":{"rendered":"<p>We released two large scale datasets for research on learning to rank: MSLR-WEB30k with more than 30,000 queries and a random sampling of it MSLR-WEB10K with 10,000 queries.<\/p>\n<p>&nbsp;<\/p>\n<h2>Dataset Descriptions<\/h2>\n<p>The datasets are machine learning data, in which queries and urls are represented by IDs. The datasets consist of feature vectors extracted from query-url pairs along with relevance judgment labels:<\/p>\n<p>(1) The relevance judgments are obtained from a retired labeling set of a commercial web search engine (<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/www.bing.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Microsoft Bing<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>), which take 5 values from 0 (irrelevant) to 4 (perfectly relevant).<\/p>\n<p>(2) The features are basically extracted by us, and are those widely used in the research community.<\/p>\n<p>In the data files, each row corresponds to a query-url pair. The first column is relevance label of the pair, the second column is query id, and the following columns are features. The larger value the relevance label has, the more relevant the query-url pair is. A query-url pair is represented by a 136-dimensional feature vector.<\/p>\n<p>Below are two rows from MSLR-WEB10K dataset:<\/p>\n<p>==============================================<\/p>\n<p>0 qid:1 1:3 2:0 3:2 4:2 &#8230; 135:0 136:0<\/p>\n<p>2 qid:1 1:3 2:3 3:0 4:0 &#8230; 135:0 136:0<\/p>\n<p>==============================================<\/p>\n<h2>Dataset Partition<\/h2>\n<p>We have partitioned each dataset into five parts with about the same number of queries, denoted as S1, S2, S3, S4, and S5, for five-fold cross validation. In each fold, we propose using three parts for training, one part for validation, and the remaining part for test (see the following table). The training set is used to learn ranking models. The validation set is used to tune the hyper parameters of the learning algorithms, such as the number of iterations in RankBoost and the combination coefficient in the objective function of Ranking SVM. The test set is used to evaluate the performance of the learned ranking models.<\/p>\n<table style=\"height: 151px\" width=\"564\">\n<tbody>\n<tr style=\"background-color: #eeeeee\">\n<td>\u00a0<strong>Folds<\/strong><\/td>\n<td><strong>\u00a0Training Set<\/strong><\/td>\n<td><strong>Validation Set<\/strong><\/td>\n<td><strong>Test Set<\/strong><\/td>\n<\/tr>\n<tr>\n<td>\u00a0Fold1<\/td>\n<td>\u00a0{S1,S2,S3}<\/td>\n<td>\u00a0S4<\/td>\n<td>\u00a0S5<\/td>\n<\/tr>\n<tr style=\"background-color: #eeeeee\">\n<td>\u00a0Fold2<\/td>\n<td>\u00a0{S2,S3,S4}<\/td>\n<td>\u00a0S5<\/td>\n<td>\u00a0S1<\/td>\n<\/tr>\n<tr>\n<td>\u00a0Fold3<\/td>\n<td>\u00a0{S3,S4,S5}<\/td>\n<td>\u00a0S1<\/td>\n<td>\u00a0S2<\/td>\n<\/tr>\n<tr style=\"background-color: #eeeeee\">\n<td>\u00a0Fold4<\/td>\n<td>\u00a0{S4,S5,S1}<\/td>\n<td>\u00a0S2<\/td>\n<td>\u00a0S3<\/td>\n<\/tr>\n<tr>\n<td>\u00a0Fold5<\/td>\n<td>\u00a0{S5,S1,S2}<\/td>\n<td>\u00a0S3<\/td>\n<td>\u00a0S4<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<h2>Datasets<\/h2>\n<p>The datasets were released on June 16, 2010.<\/p>\n<p>To use the datasets, you must read and accept the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/1drv.ms\/t\/s!AtsMfWUz5l8na0O2EkXxl3fwYek?e=eB9hmL\">online agreement<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. By using the datasets, you agree to be bound by the terms of its license.<\/p>\n<table style=\"height: 95px\" width=\"538\">\n<thead><\/thead>\n<tbody>\n<tr class=\"teal-stripeTableHeaderRow\" style=\"background-color: #eeeeee\">\n<td class=\"teal-stripeTableHeaderEvenCol\"><strong>Datasets<\/strong><\/td>\n<td class=\"teal-stripeTableHeaderOddCol\">\u00a0\u00a0\u00a0\u00a0 <strong>Size<\/strong><\/td>\n<td class=\"teal-stripeTableHeaderOddCol\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<strong> MD5<\/strong><\/td>\n<\/tr>\n<tr class=\"teal-stripeTableOddRow\">\n<td class=\"teal-stripeTableEvenCol\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/1drv.ms\/u\/s!AtsMfWUz5l8nbOIoJ6Ks0bEMp78\">MSLR-WEB10K<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/td>\n<td class=\"teal-stripeTableOddCol\">\u00a0\u00a0\u00a0\u00a0 ~ 1.2G<\/td>\n<td class=\"teal-stripeTableOddCol\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 97c5d4e7c171e475c91d7031e4fd8e79<\/td>\n<\/tr>\n<tr class=\"teal-stripeTableEvenRow\" style=\"background-color: #eeeeee\">\n<td class=\"teal-stripeTableEvenCol\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/1drv.ms\/u\/s!AtsMfWUz5l8nbXGPBlwD1rnFdBY\">MSLR-WEB30K<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/td>\n<td class=\"teal-stripeTableOddCol\">\u00a0\u00a0\u00a0\u00a0 ~ 3.7G<\/td>\n<td class=\"teal-stripeTableOddCol\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 4beae4bee0cd244fc9b2aff355a61555<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2>Evaluation tools<\/h2>\n<p>The evaluation script was updated on Jan. 13, 2011. Thank you to Yasser Ganjisaffar for pointing out the bug.<\/p>\n<ul>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" title=\"\" href=\"https:\/\/1drv.ms\/t\/s!AtsMfWUz5l8nbo-k8ceNDf8o5Vc\" target=\"_blank\" rel=\"noopener noreferrer\">Evaluation script<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> for NDCG(meanNDCG) and Precision(MAP)<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" title=\"\" href=\"https:\/\/1drv.ms\/t\/s!AtsMfWUz5l8nb9AKwTCM--9y0Ro\" target=\"_blank\" rel=\"noopener noreferrer\">Significance test script<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> for algorithm comparison<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h2>Feature List<\/h2>\n<p>Each query-url pair is represented by a 136-dimensional vector.<\/p>\n<table style=\"height: 4112px\" width=\"748\" cellspacing=\"0\" cellpadding=\"0\">\n<thead><\/thead>\n<tbody>\n<tr class=\"blue-stripeTableHeaderRow\" style=\"background-color: #eeeeee\">\n<td class=\"blue-stripeTableHeaderEvenCol\" colspan=\"4\"><b>Feature List of Microsoft Learning to Rank Datasets<\/b><\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\"><b>feature id<\/b><\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\"><b>\u00a0\u00a0 feature description<\/b><\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\"><b>\u00a0\u00a0\u00a0\u00a0 stream<\/b><\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"156\"><b>comments<\/b><\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">1<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 covered query term number<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" rowspan=\"95\" width=\"156\"><\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">2<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">3<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">4<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">5<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">6<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 covered query term ratio<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">7<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">8<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">9<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">10<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">11<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 stream length<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">12<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">13<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">14<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">15<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">16<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 IDF(Inverse document frequency)<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">17<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">18<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">19<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">20<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">21<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 sum of term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">22<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">23<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">24<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">25<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">26<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 min of term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">27<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">28<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">29<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">30<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\" style=\"background-color: #eeeeee\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">31<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 max of term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">32<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">33<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">34<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">35<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">36<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 mean of term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">37<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">38<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">39<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">40<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">41<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 variance of term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">42<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">43<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">44<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">45<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">46<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 sum of stream length normalized term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">47<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">48<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">49<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">50<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">51<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 min of stream length normalized term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">52<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">53<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">54<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">55<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">56<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 max of stream length normalized term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">57<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">58<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">59<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">60<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">61<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 mean of stream length normalized term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">62<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">63<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">64<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">65<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">66<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 variance of stream length normalized term frequency<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">67<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">68<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">69<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">70<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">71<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 sum of tf*idf<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">72<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">73<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">74<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">75<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">76<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 min of tf*idf<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">77<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">78<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">79<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">80<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">81<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 max of tf*idf<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">82<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">83<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">84<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">85<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">86<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 mean of tf*idf<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">87<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">88<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">89<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">90<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">91<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 variance of tf*idf<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">92<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">93<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">94<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">95<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">96<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 boolean model<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"15\" width=\"156\"><\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">97<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">98<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">99<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">100<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">101<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 vector space model<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">102<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">103<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">104<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">105<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">106<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 BM25<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">107<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">108<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">109<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">110<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">111<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 LMIR.ABS<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" rowspan=\"5\" width=\"156\">Language model approach for information retrieval (IR) with absolute discounting smoothing<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">112<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">113<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">114<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0\u00a0url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">115<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">116<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 LMIR.DIR<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"156\">Language model approach for IR with Bayesian smoothing using Dirichlet priors<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">117<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">118<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">119<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">120<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">121<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" rowspan=\"5\" width=\"203\">\u00a0\u00a0 LMIR.JM<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 body<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" rowspan=\"5\" width=\"156\">Language model approach for IR with Jelinek-Mercer smoothing<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">122<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 anchor<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">123<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 title<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">124<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"178\">\u00a0\u00a0\u00a0\u00a0 url<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">125<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 178px;background-color: #eeeeee\">\u00a0\u00a0\u00a0\u00a0 whole document<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">126<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 Number of slash in URL<\/td>\n<td class=\"blue-stripeTableEvenCol\" rowspan=\"5\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" rowspan=\"5\" width=\"156\"><\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">127<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" width=\"203\">\u00a0\u00a0 Length of URL<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">128<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 Inlink number<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">129<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" width=\"203\">\u00a0\u00a0 Outlink number<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">130<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 PageRank<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">131<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" width=\"203\">\u00a0\u00a0 SiteRank<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" width=\"156\">Site level PageRank<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">132<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 QualityScore<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"156\">The quality score of a web page. The score is outputted by a web page quality classifier.<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">133<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" width=\"203\">\u00a0\u00a0 QualityScore2<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" width=\"156\">The quality score of a web page. The score is outputted by a web page quality classifier, which measures the badness of a web page.<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">134<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 Query-url click count<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"156\">The click count of a query-url pair at a search engine in a period<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableEvenRow\">\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 64px;background-color: #eeeeee\" width=\"64\">135<\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 203px;background-color: #eeeeee\" width=\"203\">\u00a0\u00a0 url click count<\/td>\n<td class=\"blue-stripeTableEvenCol\" style=\"width: 178px;background-color: #eeeeee\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" style=\"width: 156px;background-color: #eeeeee\" width=\"156\">The click count of a url aggregated from user browsing data in a period<\/td>\n<\/tr>\n<tr class=\"blue-stripeTableOddRow\">\n<td class=\"blue-stripeTableEvenCol\" width=\"64\">136<\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"203\">\u00a0\u00a0 url dwell time<\/td>\n<td class=\"blue-stripeTableEvenCol\" width=\"178\"><\/td>\n<td class=\"blue-stripeTableOddCol\" width=\"156\">The average dwell time of a url aggregated from\u00a0user browsing data in a period<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Reference<\/h2>\n<p>You can cite this dataset as below.<\/p>\n<pre class=\"verbatim select-on-click\" title=\"click to select text\">@article{DBLP:journals\/corr\/QinL13,\r\n  author    = {Tao Qin and\r\n               Tie{-}Yan Liu},\r\n  title     = {Introducing {LETOR} 4.0 Datasets},\r\n  journal   = {CoRR},\r\n  volume    = {abs\/1306.2597},\r\n  year      = {2013},\r\n  url       = {http:\/\/arxiv.org\/abs\/1306.2597},\r\n  timestamp = {Mon, 01 Jul 2013 20:31:25 +0200},\r\n  biburl    = {http:\/\/dblp.uni-trier.de\/rec\/bib\/journals\/corr\/QinL13},\r\n  bibsource = {dblp computer science bibliography, http:\/\/dblp.org}\r\n}<\/pre>\n<h2>Release Notes<\/h2>\n<ul>\n<li>The following people have contributed to the construction of the data: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/taoqin\/\">Tao Qin<\/a>, <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/tyliu\/\">Tie-Yan Liu<\/a>, Wenkui Ding, Jun Xu, Hang Li.<\/li>\n<li>We would like to thank Bing team for the support in dataset creation. We would also like to thank Nick Craswell for the help in dataset release.<\/li>\n<li>If you have any questions or suggestions, please kindly <a title=\"\" href=\"mailto:taoqin%40microsoft.com?subject=Questions about Microsoft Learning to Rank Datasets\" target=\"_self\" rel=\"noopener\">let us know<\/a>.<\/li>\n<li>Related links: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/letor-learning-rank-information-retrieval\/\">LETOR3.0 and LETOR4.0 datasets<\/a>.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>We released two large scale datasets for research on learning to rank: MSLR-WEB30k with more than 30,000 queries and a random sampling of it MSLR-WEB10K with 10,000 queries. &nbsp; Dataset Descriptions The datasets are machine learning data, in which queries and urls are represented by IDs. The datasets consist of feature vectors extracted from query-url [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13556,13555],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-170480","msr-project","type-msr-project","status-publish","hentry","msr-research-area-artificial-intelligence","msr-research-area-search-information-retrieval","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"2010-06-10","related-publications":[],"related-downloads":[],"related-videos":[],"related-groups":[],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[],"slides":[],"related-researchers":[],"msr_research_lab":[199560],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170480","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":4,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170480\/revisions"}],"predecessor-version":[{"id":811423,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/170480\/revisions\/811423"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=170480"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=170480"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=170480"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=170480"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=170480"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}