{"id":321,"date":"2013-06-19T09:00:00","date_gmt":"2013-06-19T09:00:00","guid":{"rendered":"https:\/\/blogs.technet.microsoft.com\/inside_microsoft_research\/2013\/06\/19\/big-data-analytics-from-theory-to-systems\/"},"modified":"2016-07-20T07:31:33","modified_gmt":"2016-07-20T14:31:33","slug":"big-data-analytics-from-theory-to-systems","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/big-data-analytics-from-theory-to-systems\/","title":{"rendered":"Big-Data Analytics, from Theory to Systems"},"content":{"rendered":"<p class=\"posted-by\">Posted by <span class=\"author\">Rob Knies<\/span><\/p>\n<p 
class=\"posted-by\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/msdnshared.blob.core.windows.net\/media\/TNBlogsFS\/prod.evol.blogs.technet.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/00\/90\/35\/1261.Big%20Data%20Analytics.PNG\" original-url=\"http:\/\/blogs.technet.com\/cfs-file.ashx\/__key\/communityserver-blogs-components-weblogfiles\/00-00-00-90-35\/1261.Big-Data-Analytics.PNG\"><img decoding=\"async\" style=\"margin: 10px; border: 0px currentColor; float: left;\" title=\"Big Data Analytics 2013 logo\" src=\"https:\/\/msdnshared.blob.core.windows.net\/media\/TNBlogsFS\/prod.evol.blogs.technet.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/00\/90\/35\/1261.Big%20Data%20Analytics.PNG\" original-url=\"http:\/\/blogs.technet.com\/resized-image.ashx\/__size\/550x0\/__key\/communityserver-blogs-components-weblogfiles\/00-00-00-90-35\/1261.Big-Data-Analytics.PNG\" alt=\"Big Data Analytics 2013 logo\" width=\"149\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p class=\"posted-by\"><span class=\"author\">Computing today is generating and capturing a wealth of data previously unimaginable. 
Such information has great promise for unlocking some of society&rsquo;s most elusive secrets, but how can those secrets be unearthed and identified?<\/p>\n<p>That pursuit provided the impetus behind <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Big Data Analytics 2013\" href=\"http:\/\/research.microsoft.com\/en-US\/events\/bda2013\/default.aspx\" target=\"_blank\">Big Data Analytics 2013<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, a first-ever workshop held at <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Microsoft Research Cambridge\" href=\"http:\/\/research.microsoft.com\/en-us\/labs\/cambridge\/default.aspx\" target=\"_blank\">Microsoft Research Cambridge<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> on May 23-24. More than 130 participants from academia and industry&mdash;including a strong contingent from the hosting lab, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Microsoft Research Redmond\" href=\"http:\/\/research.microsoft.com\/en-us\/labs\/redmond\/default.aspx\" target=\"_blank\">Microsoft Research Redmond<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Microsoft Research Silicon Valley\" href=\"http:\/\/research.microsoft.com\/en-us\/labs\/siliconvalley\/default.aspx\" target=\"_blank\">Microsoft Research Silicon Valley<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Advanced Technology Labs Europe\" href=\"http:\/\/research.microsoft.com\/en-us\/labs\/atle\/default.aspx\" target=\"_blank\">Advanced 
Technology Labs Europe<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>&mdash;gathered to discuss and identify the most important and challenging directions for the evolution of algorithms and systems for big data.<\/p>\n<p>&ldquo;The organization of the workshop was prompted by a surge of interest and activity in the area of big-data analytics,&rdquo; says <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Milan Vojnovic\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/milanv\/\" target=\"_blank\">Milan Vojnovic<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, co-organizer of the event and senior researcher in the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Cambridge Systems and Networking\" href=\"http:\/\/research.microsoft.com\/en-us\/groups\/camsys\/\" target=\"_blank\">Cambridge Systems and Networking<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> group, &ldquo;including platforms for various kinds of processing, such as batch processing and querying of massive data sets, real-time analytics, streaming computations, and analytics on special data structures such as graphical data.<\/p>\n<p>&ldquo;The organization was also prompted by the rising activity in the big-data-analytics space across diverse communities, such as the theory of computation, working on the foundations of algorithms, and the systems community, working on the design of new platforms and infrastructures.&rdquo;<\/p>\n<p>The workshop was co-organized by <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Artur Czumaj\" href=\"http:\/\/www.dcs.warwick.ac.uk\/~czumaj\/\" target=\"_blank\">Artur Czumaj<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, head of the Department of Computer Science at the University of 
Warwick, just outside of Coventry, U.K., and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jingren Zhou\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/jrzhou\/\" target=\"_blank\">Jingren Zhou<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, partner development manager for the Bing Search Infrastructure team. They helped attract experts with varied backgrounds to discuss interesting challenges in big data.<\/p>\n<p>&ldquo;One of the goals was to bring together experts working in the area of big-data analytics to discuss the state-of-the-art research and the most important challenges for future research,&rdquo; Vojnovic says, &ldquo;bringing in one place those working on the theory side with those on the systems side who usually do not often meet.<\/p>\n<p>&ldquo;I think that this mix of profiles, which is rather unusual at standard conference venues, worked rather well and everybody appreciated and learned something new.&rdquo;<\/p>\n<p>The event featured three keynotes, from:<\/span>&nbsp;<\/p>\n<ul>\n<li>\n<div class=\"posted-by\"><span class=\"author\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Surajit Chaudhuri\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/surajitc\/\" target=\"_blank\">Surajit Chaudhuri<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, Microsoft distinguished scientist and managing director of Microsoft Research&rsquo;s <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"eXtreme Computing Group\" href=\"http:\/\/research.microsoft.com\/en-us\/labs\/xcg\/default.aspx\" target=\"_blank\">eXtreme Computing Group<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. 
His talk, titled Big Data and Enterprise Analytics, discussed key trends that characterize the field of big data with respect to enterprise requirements.<\/span><\/div>\n<\/li>\n<li>\n<div class=\"posted-by\"><span class=\"author\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Sanjeev Khanna\" href=\"http:\/\/www.cis.upenn.edu\/~sanjeev\/\" target=\"_blank\">Sanjeev Khanna<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, Henry Salvatori Professor of Computer and Information Science at the University of Pennsylvania, who spoke on Fast Algorithms for Perfect Matchings in Regular Bipartite Graphs. Khanna explained how a sequence of improvements over the years has culminated in a linear-time algorithm to solve the problem of finding a perfect matching in a regular bipartite graph.<\/span><\/div>\n<\/li>\n<li>\n<div class=\"posted-by\"><span class=\"author\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Gerhard Weikum\" href=\"http:\/\/www.mpi-inf.mpg.de\/~weikum\/\" target=\"_blank\">Gerhard Weikum<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, research director at the Max-Planck Institute for Informatics, located in Saarbr&uuml;cken, Germany. 
In a keynote called From Text to Entities and from Entities to Insight: a Perspective on Unstructured Big Data, Weikum addressed the huge amounts of valuable content in the form of speech and text produced by news, social media, websites, and enterprise sources.<\/span><\/div>\n<\/li>\n<\/ul>\n<p class=\"posted-by\"><span class=\"author\">&ldquo;Another goal,&rdquo; Vojnovic says, &ldquo;was to serve as a summit for researchers across Microsoft Research&rsquo;s worldwide labs working in this area, with a strong participation from Microsoft and universities&rsquo; computer-science and other departments.&rdquo;<\/p>\n<p>That goal certainly seems to have been met. In addition to the keynotes, the workshop featured 17 presentations, ranging from big-data analytics in life sciences to foundations of algorithms for large-scale graph analysis. Posters were on display, and attendees got an opportunity to browse through a set of technical demonstrations. A highlight of the second day was a panel discussion called Big-Data Analytics: A Happy Marriage of Systems and Theory?, moderated by <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Graham Cormode\" href=\"http:\/\/dimacs.rutgers.edu\/~graham\/\" target=\"_blank\">Graham Cormode<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> of the University of Warwick and featuring Chaudhuri, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Sudipto Guha\" href=\"http:\/\/www.cis.upenn.edu\/~sudipto\/\" target=\"_blank\">Sudipto Guha<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> of the University of Pennsylvania, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Sergei Vassilvitskii\" href=\"http:\/\/research.google.com\/pubs\/SergeiVassilvitskii.html\" target=\"_blank\">Sergei 
Vassilvitskii<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> of Google, and Zhou.<\/p>\n<p>&ldquo;The top-level takeaway for attendees was that big-data analytics is an area where important innovations can happen by a joint effort of the theory and systems community,&rdquo; Vojnovic says. &ldquo;It was appreciated that there is a need for developing suitable abstractions both in analyzing important theoretical problems, as well on the side of computation and programming.<\/p>\n<p>&ldquo;The event reconfirmed my belief that impactful research and innovation would result from a marriage of systems and theory. The event turned out to be a great success, and I am looking forward to new editions.&rdquo;<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Posted by Rob Knies Computing today is generating and capturing a wealth of data previously unimaginable. Such information has great promise for unlocking some of society&rsquo;s most elusive secrets, but how can those secrets be unearthed and identified? 
That pursuit provided the impetus behind Big Data Analytics 2013, a first-ever workshop held at Microsoft [&hellip;]<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[200281,186833,200517,186831,200655,200659,200667,200693,201481,201557,201647,201715,201749,202203,202627,196435,196463,202777,202787,202827,203655,203745,204087,204099,204415,204431],"research-area":[],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-321","post","type-post","status-publish","format-standard","hentry","category-research-blog","tag-advanced-technology-labs-europe","tag-analytics","tag-artur-czumaj","tag-big-data","tag-big-data-analytics-2013","tag-big-data-and-enterprise-analytics","tag-big-data-analytics-a-happy-marriage-of-systems-and-theory","tag-bing-search-infrastructure","tag-enterprise","tag-fast-algorithms-for-perfect-matchings-in-regular-bipartite-graphs","tag-from-text-to-entities-and-from-entities-to-insight-a-perspective-on-unstructured-big-data","tag-gerhard-weikum","tag-graham-cormode","tag-jingren-zhou","tag-max-planck-institute-for-informatics","tag-microsoft-research-cambridge","tag-microsoft-research-redmond","tag-microsoft-research-silicon-valley","tag-microsoft-research-xcg","tag-milan-vojnovic","tag-sanjeev-khanna","tag-sergei-vassilvitskii","tag-sudipto-guha","tag-surajit-chaudhuri","tag-university-of-pennsylvania","tag-university-of-warwick","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"rela
ted-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"June 19, 2013","formattedExcerpt":"Posted by Rob Knies Computing today is generating and capturing a wealth of data previously unimaginable. Such information has great promise for unlocking some of society&rsquo;s most elusive secrets, but how can those secrets be unearthed and identified? That pursuit provided the impetus behind Big&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/321","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=321"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/321\/revisions"}],"predecessor-version":[{"id":261525,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/321\/revisions\/261525"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=321"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=321"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=321"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=321"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-jso
n\/wp\/v2\/msr-region?post=321"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=321"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=321"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=321"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=321"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=321"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=321"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}