{"id":471867,"date":"2018-03-07T10:45:37","date_gmt":"2018-03-07T18:45:37","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=471867"},"modified":"2018-03-07T13:53:13","modified_gmt":"2018-03-07T21:53:13","slug":"feature-improvement-related-publications","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/feature-improvement-related-publications\/","title":{"rendered":"Feature improvement: Related Publications"},"content":{"rendered":"<p>Some of us remember walking into a library to look for a book or journal article and leaving with an armful of books. Browsing the materials in physical proximity to the one we were looking for was a form of research, as it helped us discover related publications that we may not have come across otherwise. A lot of this serendipity is lost in online search, but we gladly give it up for the many other advantages it offers. With this week\u2019s graph update, we bring the best of both worlds, by enhancing our powerful semantic search with an improved Related Publications feature.<\/p>\n<p>On Microsoft Academic, related publications are papers that are not necessarily cited by or citing a paper but are sufficiently relevant that readers interested in the original paper will likely be interested in these publications as well. You can access related publications from a paper\u2019s detail page. Just click a publication\u2019s title in the list of search results to navigate to its detail page, which currently has three sections: References, Citations, and Related Publications.<\/p>\n<p>While related publications have existed on Microsoft Academic for a while, they are now much improved. First, related paper relevance is now truly semantic. Second, we increased the number of papers that have related publications listed on their detail page. 
Third, we increased the number of related publications for a paper in our graph.<\/p>\n<p><strong>Semantic relevance of Related Publications<\/strong><\/p>\n<p>The paper selection appearing in the Related Publications section has improved as we have changed the method for identifying relevant papers and computing their similarity. Previously, we identified related publications using only citation data. For example, if paper A cited paper B, and paper C also cited paper B, then we could infer that readers interested in paper A were likely to also be interested in paper C. This approach had two drawbacks: accurate citation data is complicated to acquire, and citation data is inherently biased towards publications within the same research domain \u2013 we rarely see papers citing other papers outside of their main research area even if other research areas are working on the same problem.<\/p>\n<p>The new method we are using is very different, and truly semantic. By using a technique known as <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=Composite%28F.FN%3D%3D%27word%20embedding%27%29&q=word%20embedding&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">word embedding<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, we transform meaning into mathematics. Word embeddings transform each word into a multi-dimensional vector. If two multi-dimensional vectors are similar, then the words they represent tend to be used in similar contexts and are likely to be synonyms or closely related in meaning. So, the vectors for car and automobile will be very similar. To calculate relevance among papers, we go beyond individual words and create an embedding for an entire document by using its title, abstract and keywords. The relevance among papers shown in Related Publications is now computed using a combination of co-citation data and document embeddings. 
Citation data provides us with a human behavior-based signal for relevance, while document embeddings provide a content-based signal. The results are lists of related publications that are more relevant to the original paper but may not necessarily use the same words or even be from the same research domain although they present similar underlying concepts.<\/p>\n<p>Take, for example <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/detail\/2150312211\" target=\"_blank\" rel=\"noopener noreferrer\">this paper<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> about computer animation, which discusses animating the behavior of flocks and herds, and<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/academic\/articles\/microsoft-academic-increases-power-semantic-search-adding-fields-study\/\" target=\"_blank\" rel=\"noopener\"> has been tagged with fields of study<\/a> such as <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=Composite%28F.FN%3D%3D%27computer%20animation%27%29&q=computer%20animation&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Computer animation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=Composite%28F.FN%3D%3D%27simulation%27%29&q=simulation&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Simulation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=@Computer%20Science@&q=Computer%20Science&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Computer science<span 
class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-471876 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs1-1024x485.png\" alt=\"Screenshot of paper Flocks, herds and schools: A distributed behavioral model showing fields of study tagged onto paper.\" width=\"1024\" height=\"485\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs1-1024x485.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs1-300x142.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs1-768x364.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>As we click on &#8220;Related Publications,&#8221; we scroll down the page past the list of references and papers citing this paper, to the Related Publications section:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-471879 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs2-663x1024.png\" alt=\"Screenshot of list of Related Papers\" width=\"663\" height=\"1024\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs2-663x1024.png 663w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs2-194x300.png 194w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs2-768x1185.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs2.png 975w\" sizes=\"auto, (max-width: 663px) 100vw, 663px\" \/><\/p>\n<p>As we browse related publications, we see the paper,\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/detail\/2143969246\" 
target=\"_blank\" rel=\"noopener noreferrer\">Collective Memory and Spatial Sorting in Animal Groups<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, published in the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/detail\/109682412\" target=\"_blank\" rel=\"noopener noreferrer\">Journal of Theoretical Biology<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and tagged with fields of study in <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=Composite%28F.FN%3D%3D%27biology%27%29&q=biology&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Biology<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=Composite%28F.FN%3D%3D%27ecology%27%29&q=ecology&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Ecology<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=@Self-organization@&q=Self-organization&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\">Self-organization<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, which are very different from the fields of study tagged onto our initial paper. 
This example shows how the improved Related Publications feature can help you discover relevant scholarship outside your own discipline.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-471873 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs3-1024x561.png\" alt=\"Screenshot of paper, Collective Memory and Spatial Sorting in Animal Groups \" width=\"1024\" height=\"561\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs3-1024x561.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs3-300x164.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/03\/RelatedPubs3-768x421.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><strong>Increased number of papers with Related Publications<\/strong><\/p>\n<p>Not all papers have related publications. This is a feature only available for papers in the English language as word embeddings do not work well across languages. Even for English papers there are cases where paper relevance cannot be computed &#8211; for example, if the abstract is missing. There is also the case where the embedding generated for a document may not be easily clustered with other documents (an important step in computing relevance), which prevents us from finding related papers. Very recent papers published in the last 6-8 months do not yet have related publications identified with the new method. 
Overall, however, the number of papers for which we offer related publications has roughly quadrupled in this release, from about 30 million to about 120 million.<\/p>\n<p>We are experimenting with new technologies that have shown promise to further improve related publications, such as the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/detail\/2761896323\" target=\"_blank\" rel=\"noopener noreferrer\">NetMF algorithm<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0we presented at<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/academic.microsoft.com\/#\/search?iq=And%28Composite%28C.CN%3D%3D%27wsdm%27%29%2CY%3D2018%29&q=wsdm%202018&filters=&from=0&sort=0\" target=\"_blank\" rel=\"noopener noreferrer\"> WSDM 2018<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. Once we complete further tests and validation, we hope to bring even more power to related papers, and potentially, other related entities on the site.<\/p>\n<p><strong>Higher number of Related Publications<\/strong><\/p>\n<p>The new method for calculating publication relevance enables us to generate a higher number of related publications. We cap the number of related publications at 20 on our website to ensure relevance quality and avoid overwhelming users. That said, if users want to see all the related publications for a paper, they are encouraged to use our graph for further analysis. 
The <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/microsoft-academic-graph\/\" target=\"_blank\" rel=\"noopener\">Microsoft Academic Graph<\/a> can be accessed through the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/labs.cognitive.microsoft.com\/en-us\/project-academic-knowledge\" target=\"_blank\" rel=\"noopener noreferrer\">Academic Knowledge API<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, or, for power users, we offer another option through <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/azure.microsoft.com\/en-us\/solutions\/data-lake\/\" target=\"_blank\" rel=\"noopener noreferrer\">Azure Data Lake<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (please contact us if you are interested in the latter).<\/p>\n<p>We hope these improvements enable you to discover more research faster, and that you find the serendipity of our improved Related Papers section useful and enjoyable.<\/p>\n<p>How do you unleash the power of semantic search? As always, we would like to hear from you either through the feedback link at the bottom right of the\u00a0website, or on\u00a0Twitter. You can also find our project home page with this blog on the Microsoft Research site at\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/aka.ms\/msracad\" target=\"_blank\" rel=\"noopener noreferrer\">aka.ms\/msracad<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p>Happy researching!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Some of us remember walking into a library to look for a book or journal article and leaving with an armful of books. 
Browsing the materials in physical proximity to the one we were looking for was a form of research, as it helped us discover related publications that we may not have come across [&hellip;]<\/p>\n","protected":false},"author":36804,"featured_media":471879,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":170262,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-471867","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-locale-en_us"],"msr_assoc_parent":{"id":170262,"type":"project"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/471867","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/36804"}],"version-history":[{"count":6,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/471867\/revisions"}],"predecessor-version":[{"id":471924,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/471867\/revisions\/471924"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/471879"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=471867"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=471867"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp
-json\/wp\/v2\/msr-locale?post=471867"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=471867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}