{"id":283451,"date":"2016-08-26T15:36:58","date_gmt":"2016-08-26T22:36:58","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-event&#038;p=283451"},"modified":"2025-08-06T11:59:32","modified_gmt":"2025-08-06T18:59:32","slug":"big-data-analytics-2013","status":"publish","type":"msr-event","link":"https:\/\/www.microsoft.com\/en-us\/research\/event\/big-data-analytics-2013\/","title":{"rendered":"Big Data Analytics 2013"},"content":{"rendered":"\n\n<p>Related Links:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/cambridge-systems-and-networking\/\">Cambridge Systems and Networking<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/microsoft-research-cambridge\/\">Microsoft Research Cambridge<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<\/ul>\n<p>Organizers:<\/p>\n<ul>\n<li>Artur Czumaj, University of Warwick<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/milanv\/\">Milan Vojnovic<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, Microsoft Research<\/li>\n<li>Jingren Zhou, Microsoft<\/li>\n<\/ul>\n<p>Local Arrangements:<\/p>\n<ul>\n<li>Cara Freeman, Microsoft Research<\/li>\n<\/ul>\n<p>Sponsors:<\/p>\n<ul>\n<li>Microsoft Research Ltd<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www2.warwick.ac.uk\/fac\/cross_fac\/dimap\">Centre for Discrete Mathematics and its Application<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>s, University of Warwick<\/li>\n<\/ul>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-283463 size-full alignnone\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/banner.jpg\" alt=\"banner\" width=\"595\" height=\"200\" \/><\/p>\n<div>\n<div class=\"center sw mt16\">\n<div id=\"contentArea\">\n<div id=\"mainZone\" class=\"zone\">\n<div class=\"compositeModule_2_1Zone \">\n<div class=\"zone\">\n<div class=\"conM \">\n<p align=\"justify\">We witness a rapid development of the research and technology for efficient processing of big data. There is a surge of commercial and open source platforms for big data analytics, including platforms for querying of massive datasets, batch processing, real-time analytics, streaming computations, iterative computations, graph data processing, and distributed machine learning. There have been some remarkable achievements on the side of designing scalable and efficient algorithms for processing of massive amounts of data, as well as on the side of architecture of systems and infrastructure. The goal of this workshop is to bring together researchers and system architects to discuss and identify the most important and challenging directions to push forward the area of algorithms and systems for big data. The topics of the workshop include but are not limited to computation and storage platforms, querying of massive datasets, sketches, streaming, scaling up distributed machine learning, iterative computations, and large-scale graph processing.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p><strong>Day 1<\/strong><\/p>\n<p>8:00 &#8211; 8:45<br \/>\nContinental breakfast<\/p>\n<p>8:45 &#8211; 9:00<br \/>\nWelcome note<\/p>\n<p>9:00 &#8211; 10:00 | <a href=\"http:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Text-to-Entities-and-from-Entities-to-Insight-a-Perspective-on-Unstructured-Big-Data.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/weikum-msr-cambridge-23may2013-posted.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/from-text-to-entities-and-from-entities-to-insight-a-perspective-on-unstructured-big-data\/\">Video<\/a><br \/>\nKeynote talk, Gerhard Weikum, Max Planck Institute for Informatics<br \/>\n<em>From Text to Entities and from Entities to Insight: a Perspective on Unstructured Big Data<\/em><\/p>\n<p>10:00 &#8211; 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData.pdf\">Abstract<\/a><br \/>\nAnastasia Ailamaki, EPFL<br \/>\n<em>Querying and Exploring Big Brain Data<\/em><\/p>\n<p>10:30 &#8211; 11:00<br \/>\nRefreshments<\/p>\n<p><em>Programming and Computation Models<\/em><\/p>\n<p>11:00 &#8211; 11:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/BigDataAnalyticswithStratosphere.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/stratosphere-aplatform-for-bda_msft-Copy.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/big-data-with-stratosphere\/\">Video<\/a><br \/>\nVolker Markl, TU-Berlin<br \/>\n<em>Big Data with Stratosphere<\/em><\/p>\n<p>11:30 &#8211; 12:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/SCOPEParallelDatabasesMeetMapReduce.pdf\">Abstract<\/a><br \/>\nJingren Zhou, Microsoft Corporation<br \/>\n<em>SCOPE: Parallel Databases Meet MapReduce<\/em><\/p>\n<p>12:00 &#8211; 13:00<br \/>\nLunch<\/p>\n<p><em>Algorithms, Infrastructure, and Platforms<\/em><\/p>\n<p>13:00 &#8211; 13:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/OnlineDataProcessingwithS4andOmid.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/big-data-workshop-fpj.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/online-data-processing-with-s4-and-omid\/\">Video<\/a><br \/>\nFlavio P. Junqueira, Microsoft Research<br \/>\n<em>Online Data Processing with S4 and Omid<\/em><\/p>\n<p>13:30 &#8211; 14:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/PreditableDataCenters.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bigdataworkshopPreditableDataCenters.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/predictable-data-centers-3\/\">Video<\/a><br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/thomkar\/\">Thomas Karagiannis<\/a>, Microsoft Research<br \/>\n<em>Predictable Data Centers<\/em><\/p>\n<p>14:00 &#8211; 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Terabytes-to-Megabytes-Finding-the-needle-by-shrinking-the-haystack.pdf\">Abstract<\/a><br \/>\nSergei Vassilvitskii, Google<br \/>\n<em>From Terabytes to Megabytes: Finding the Needle by Shrinking the Haystack<\/em><\/p>\n<p>14:30 &#8211; 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/incremental-iterative-and-interactive-computation-using-differential-dataflow\/\">Video<\/a><br \/>\nFrank McSherry, Microsoft Research<br \/>\n<em>Incremental, Iterative, and Interactive Computation using Differential Dataflow<\/em><\/p>\n<p>15:00 &#8211; 16:00 | <span style=\"text-decoration: underline\"><span style=\"color: #0000ff;text-decoration: underline\">See Demos and Posters Tab<\/span><\/span><br \/>\nRefreshments + Demo and Poster Session<\/p>\n<p><em>Large-scale Graph Analytics<\/em><\/p>\n<p>16:00 &#8211; 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines-Slides.pdf\">Slides<\/a><br \/>\nGuy Blelloch, Carnegie Mellon University<br \/>\n<em>Big Data on Small Machines<\/em><\/p>\n<p>16:30 &#8211; 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Graphs-and-Linear-Measurements.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/graphs-and-linear-measurements-slides.pdf\">Slides<\/a><br \/>\nSudipto Guha, University of Pennsylvania<br \/>\n<em>Graphs and Linear Measurements<\/em><\/p>\n<p>17:00 &#8211; 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs-slides.pdf\">Slides<\/a><br \/>\nGeorge Karypis, University of Minnesota<br \/>\n<em>Partitioning & Clustering Big Graphs<\/em><\/p>\n<p>17:30 &#8211; 18:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks-Slides.pdf\">Slides<\/a><br \/>\nStefano Leonardi, Sapienza University of Rome<br \/>\n<em>Online Team Formation in Social Networks<\/em><\/p>\n<p>19:00 &#8211; 21:00<br \/>\nWorkshop reception and dinner at Emmanuel College<\/p>\n<p><strong>Day 2<\/strong><\/p>\n<p>8:30 &#8211; 9:00<br \/>\nCoffee and pastries<\/p>\n<p>9:00 &#8211; 10:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-and-Enterprise-Analytics-Abstract.pdf\">Abstract<\/a><br \/>\nKeynote talk<br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research<br \/>\n<em>Big Data and Enterprise Analytics<\/em><\/p>\n<p>10:00 &#8211; 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/streaming-verification-of-outsourced-computation\/\">Video<\/a><br \/>\nGraham Cormode, University of Warwick<br \/>\n<em>Streaming Verification of Outsourced Computation<\/em><\/p>\n<p>10:30 &#8211; 11:00<br \/>\nRefreshments + Group photo<\/p>\n<p>11:00 &#8211; 12:00 | Panel<br \/>\nModerator:<br \/>\nGraham Cormode, University of Warwick<br \/>\nPanelists:<br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research<br \/>\nSudipto Guha, University of Pennsylvania<br \/>\nSergei Vassilvitskii, Google<br \/>\nJingren Zhou, Microsoft Corporation<br \/>\n<em>Big Data Analytics: A Happy Marriage of Systems and Theory?<\/em><\/p>\n<p>12:00 &#8211; 13:30<br \/>\nLunch<\/p>\n<p>13:30 &#8211; 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Fast-Algorithms-for-Perfect-Matching-in-Regular-Bipartite-Graphs-Abstract.pdf\">Abstract<\/a><br \/>\nKeynote talk, Sanjeev Khanna, University of Pennsylvania<br \/>\n<em>Fast Algorithms for Perfect Matchings in Regular Bipartite Graphs<\/em><\/p>\n<p><em>Large-Scale Graph Analytics Cont&#8217;d<\/em><\/p>\n<p>14:30 &#8211; 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-PPT.pps\">Slides<\/a><br \/>\nAleksandar Madry, EPFL<br \/>\n<em>Cuts, Trees, and Electrical Flows<\/em><\/p>\n<p>15:00 &#8211; 15:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/neighborhood-sampling-for-estimating-local-properties-on-a-graph-stream\/\">Video<\/a><br \/>\nSrikanta Tirthapura, Iowa State University<br \/>\n<em>Neighborhood Sampling for Estimating Local Properties on a Graph Stream<\/em><\/p>\n<p>15:30 &#8211; 16:00<br \/>\nRefreshments<\/p>\n<p><em>Streaming and Massive Distributed Data<\/em><\/p>\n<p>16:00 &#8211; 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/What-Cant-We-Compute-on-Data-Streams.pdf\">Abstract<\/a> |\u00a0 <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/what-cant-we-compute-on-data-streams\/\">Video<\/a><br \/>\nAmit Chakrabarti, Dartmouth College<br \/>\n<em>What Can&#8217;t We Compute on Data Streams?<\/em><\/p>\n<p>16:30 &#8211; 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData-1.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Querying-Big-Dynamic-Distributed-Data-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/querying-big-dynamic-distributed-data\/\">Video<\/a><br \/>\nMinos Garofalakis, Technical University of Crete<br \/>\n<em>Querying Big, Dynamic, Distributed Data<\/em><\/p>\n<p>17:00 &#8211; 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergeable-Summaries-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergable-Summaries-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/mergable-summaries\/\">Video<\/a><br \/>\nKe Yi, Hong Kong University of Science and Technology<br \/>\n<em>Mergable Summaries<\/em><\/p>\n<p>17:30 &#8211; 18:00<br \/>\nClosing session<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3854\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3854\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3853\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSurajit Chaudhuri\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3853\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3854\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283511\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc-1.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/><em>Distinguished Scientist and Managing Director, Microsoft Research<\/em><\/p>\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Surajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3856\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3856\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3855\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSanjeev Khanna\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3855\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3856\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283514\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/sanjeev_pic.jpg\" alt=\"sanjeev_pic\" width=\"75\" height=\"75\" \/><em>Professor, University of Pennsylvania<\/em><\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www.cis.upenn.edu\/~sanjeev\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Sanjeev Khanna is the Henry Salvatori Professor of Computer and Information Science at the University of Pennsylvania. His primary research interests are in the design and analysis of algorithms for combinatorial optimisation and in complexity theory. His research has been supported by National Science Foundation, an Alfred P. Sloan Fellowship, a Guggenheim Fellowship, and an IBM Faculty Award. Sanjeev serves on the Editorial boards of SICOMP, ACM TALG, and Foundations and Trends in Theoretical Computer Science and served as an area editor for Encyclopaedia of Algorithms.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3858\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3858\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3857\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tGerhard Weikum\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3857\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3858\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283517\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/gerhardweikum.jpg\" alt=\"gerhardweikum\" width=\"75\" height=\"75\" \/><em>Scientific Director, Max Planck Institute for Informatics<\/em><\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/people.mpi-inf.mpg.de\/~weikum\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Gerhard Weikum is a Scientific Director at the Max Planck Institute for Informatics in Saarbruecken, Germany, and also an Adjunct Professor at Saarland University. His research spans transactional and distributed systems, self-tuning database systems, DB&IR integration, and automatic knowledge harvesting from Web and text sources. He co-authored a comprehensive textbook on transactional systems, received the VLDB 10-Year Award for his work on automatic DB tuning, and is one of the creators of the Yago knowledge base. Gerhard Weikum is an ACM Fellow and a member of the German Academy of Science and Engineering. He received a Google Focused Research Award in 2010, and the ACM SIGMOD Contributions Award in 2011.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3860\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3860\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3859\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSurajit Chaudhuri\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3859\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3860\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283493\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/>Surajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3862\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3862\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3861\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSudipto Guha\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3861\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3862\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283496\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/guha.png\" alt=\"guha\" width=\"72\" height=\"72\" \/>Guha is an Associate Professor in the Department of Computer and Information Sciences at University of Pennsylvania since Fall 2001. He completed his Ph.D. in 2000 at Stanford University working on approximation algorithms. He is a recipient of the NSF CAREER award in 2007, and the Alfred P. Sloan Foundation fellowship.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3864\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3864\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3863\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSergei Vassilvitskii\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3863\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3864\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283502\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/vassilvitskii.png\" alt=\"vassilvitskii\" width=\"72\" height=\"72\" \/>Sergei Vassilvitskii is a Research Scientist at Google and an Adjunct Assistant Professor of Computer Science at Columbia University. He received his M.S. and Ph.D. in Computer Science from Stanford University and a B.S. from Cornell University. He works on problems at the intersection of economics, data mining and computer science.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3866\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3866\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3865\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tJingren Zhou\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3865\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3866\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283508\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/jingren.png\" alt=\"jingren\" width=\"72\" height=\"72\" \/>Jingren Zhou is a Partner Development Manager in the Bing&#8217;s Infrastructure Team. Jingren manages a team to develop a cloud-scale distributed computation system, called SCOPE, targeted for massive data analysis over tens of thousands of machines at Microsoft Bing. The SCOPE system combines benefits from both traditional parallel databases and MapReduce execution engines to allow easy programmability and deliver massive scalability and high performance through advanced optimization. The system processes petabytes of data daily for a variety of data analysis and data mining applications, powering Bing and other online services at Microsoft. Before he joined Bing Search in 2008, Jingren was a researcher in the database research group at Microsoft Research. He has published many articles in premier database conferences and journals. Jingren received a Ph.D. in Computer Science from Columbia University.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3868\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3868\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3867\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tGraham Cormode\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3867\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3868\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283505\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/cormode.png\" alt=\"cormode\" width=\"72\" height=\"72\" \/>Graham Cormode is a professor of Computer Science at the University of Warwick, UK. He works on topics around big data, streaming analytics, privacy and data mining. From 2006-2013, he was a researcher at AT&T Labs, and prior to that he was at Bell Labs<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<div>\n<div class=\"center sw mt16\">\n<div id=\"contentArea\">\n<div id=\"mainZone\" class=\"zone\">\n<div class=\"compositeModule_2_1Zone \">\n<div class=\"zone\">\n<div class=\"conM \">\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283490\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bda_2013.jpg\" alt=\"bda_2013\" width=\"918\" height=\"571\" \/><\/p>\n<ul>\n<li>Anastasia Ailamaki, <em>EPFL<\/em><\/li>\n<li>Dan Alistarh, <em>MIT<\/em><\/li>\n<li>Hitesh Ballani, <em>Microsoft Research<\/em><\/li>\n<li>Alexandre de Baynast, <em>Microsoft ATL Europe<\/em><\/li>\n<li>Guy Blelloch, <em>Carnegie Mellon University<\/em><\/li>\n<li>Lucas Bordeaux, <em>Microsoft Research<\/em><\/li>\n<li>Florian Bourse, <em>ENS Paris<\/em><\/li>\n<li>John Bronskill, <em>Microsoft Research<\/em><\/li>\n<li>Miguel Castro, <em>Microsoft Research<\/em><\/li>\n<li>Nuno Cerquiera, <em>Microsoft (Data at Skype)<\/em><\/li>\n<li>Amit Chakrabarti, <em>Dartmouth College<\/em><\/li>\n<li>Stephen Clark, <em>University of Cambridge<\/em><\/li>\n<li>Raphael Clifford, <em>Bristol University<\/em><\/li>\n<li>Graham Cormode, <em>University of Warwick<\/em><\/li>\n<li>Paolo Costa, <em>Microsoft Research<\/em><\/li>\n<li>Artur Czumaj, <em>University of Warwick<\/em><\/li>\n<li>Valentin Dalibard, <em>University of Cambridge<\/em><\/li>\n<li>Bryan Dove, <em>Microsoft (Data at Skype)<\/em><\/li>\n<li>Moez Draief, <em>Imperial College<\/em><\/li>\n<li>Minos Garofalakis, <em>Technical University of Crete<\/em><\/li>\n<li>Richard Gibbens, <em>University of Cambridge<\/em><\/li>\n<li>Christos Gkantsidis, <em>Microsoft Research<\/em><\/li>\n<li>Thore Graepel, <em>Microsoft Research<\/em><\/li>\n<li>Peter Grindrod, <em>University of Reading<\/em><\/li>\n<li>Sudipto Guha, <em>University of Pennsylvania<\/em><\/li>\n<li>Mehdi Hosseini, <em>University of Cambridge<\/em><\/li>\n<li>Markus Jalsenius, <em>Bristol University<\/em><\/li>\n<li>Flavio Paiva Junqueira, <em>Microsoft Research<\/em><\/li>\n<li>Thomas Karagiannis, <em>Microsoft Research<\/em><\/li>\n<li>George Karypis, <em>University of Minnesota<\/em><\/li>\n<li>Ian Kash, <em>Microsoft Research<\/em><\/li>\n<li>Frank Kelly, <em>University of Cambridge<\/em><\/li>\n<li>Peter Key, <em>Microsoft Research<\/em><\/li>\n<li>Pushmeet Kohli, <em>Microsoft Research<\/em><\/li>\n<li>Philipp Kranen, <em>MSR ATL Europe<\/em><\/li>\n<li>Sergey Legtchenko, <em>Microsoft Research<\/em><\/li>\n<li>Marc Lelarge, <em>INRIA-ENS Paris<\/em><\/li>\n<li>Stefano Leonardi, <em>Sapienza University of Rome<\/em><\/li>\n<li>Aleksander Madry, <em>EPFL<\/em><\/li>\n<li>Volker Markl, <em>TU Berlin<\/em><\/li>\n<li>Laurent Massoulie,<em> INRIA<\/em><\/li>\n<li>Frank McSherry, <em>Microsoft Research<\/em><\/li>\n<li>Derek Murray, <em>Microsoft Research<\/em><\/li>\n<li>Dushyanth Narayanan, <em>Microsoft Research<\/em><\/li>\n<li>Karthik Nilakant<em>, University of Cambridge<\/em><\/li>\n<li>James Norris, <em>University of Cambridge<\/em><\/li>\n<li>Alexandre Proutiere, <em>KTH Stockholm<\/em><\/li>\n<li>Ant Rowstron, <em>Microsoft Research<\/em><\/li>\n<li>Benjamin Sach, <em>University of Warwick<\/em><\/li>\n<li>Richard Samworth, <em>University of Cambridge<\/em><\/li>\n<li>Eirini Spyropoulou, <em>University of Bristol<\/em><\/li>\n<li>Maxim Sviridenko, <em>University of Warwick<\/em><\/li>\n<li>Don Syme<em>, Microsoft Research<\/em><\/li>\n<li>Srikanta Tirthapura, <em>Iowa State University<\/em><\/li>\n<li>Andy Twigg, <em>University of Oxford<\/em><\/li>\n<li>Sergei Vassilvitskii, <em>Google<\/em><\/li>\n<li>Milan Vojnovic, <em>Microsoft Research<\/em><\/li>\n<li>Gerhard Weikum<em>, Max Planck Institute for Informatics<\/em><\/li>\n<li>Ken Woodberry, <em>Microsoft Research<\/em><\/li>\n<li>Ke Yi, <em>Hong Kong University of Science and Technology<\/em><\/li>\n<li>Eiko Yoneki, <em>University of Cambridge<\/em><\/li>\n<li>Reynold Xin, <em>University of California, Berkeley<\/em><\/li>\n<li>Jingren Zhou, <em>Microsoft Corporation (Bing)<\/em><\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3870\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3870\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3869\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tBig Metadata &#8211; Programming at Internet-Scale\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3869\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3870\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Don Syme, Kenji Takeda, and Keith Battocchi (Microsoft Research)<\/em><\/p>\n<p>We live in an information society. Programming languages are not designed for this. As we move into a world of &#8220;Devices + Services&#8221;, it is vital that developers can productively integrate information at internet-scale into their everyday programming environment. Effectively exploring, navigating, understanding, analysing and presenting data \u2013 both big and broad \u2013 is a key to success for developers. Current experiences for this are cumbersome & labour-intensive. In this new demo we explain the latest research behind F# Type Providers and show how they can bring Internet-Scale Metadata to the fingers of developers. IntelliSense for Internet-Scale Metadata creates a completely new way of rapidly developing applications on Azure, when using Hadoop, and when programming against the web. We explain how this can transform F# into a uniquely powerful and developer-friendly Data Science scripting language for Azure.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3872\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3872\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3871\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tIncremental, Iterative and Interactive Computation in Naiad\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3871\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3872\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Frank McSherry, Derek Murray, Rebecca Isaacs, and Michael Isard (Microsoft Research)<\/em><\/p>\n<p>We are developing a system for scalable data analysis designed to support iterative queries over continuously changing inputs at interactive timescales. The system is based on a new computational framework, differential dataflow, that generalizes standard incremental dataflow for far greater reuse of previous results when collections change. Differential dataflow distinguishes between the multiple reasons a collection might change, including both loop feedback and new input data, allowing a system to re-use the most appropriate results from previously performed work when an incremental update arrives.<\/p>\n<p>Our prototype system, Naiad, demonstrates the practical application of differential dataflow. Like many systems, Naiad supports high-level declarative queries, data-parallel execution, and transparent distribution. Unlike other systems, Naiad efficiently executes queries with multiple (possibly nested) loops, while simultaneously responding with low latency to incremental changes to the inputs. We show how differential dataflow enables orders of magnitude speedups for a variety of complex workloads on real streaming data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3874\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3874\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3873\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tDynamic Task Allocation in Asynchronous Shared Memory\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3873\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3874\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Dan Alistarh (MIT), James Aspnes (Yale University), Michael A. Bender (Stony Brook University), Rati Gelashvili (MIT), and Seth Gilbert (NUS)<\/em><\/p>\n<p>Task allocation is a classic distributed problem in which a set of p potentially faulty processes must perform a set of m tasks. The problem is all the more challenging when there is heterogeneity, either on the workers&#8217; side, since individual agents may have varying speed and robustness, or because of uneven workloads. In large-scale systems, heterogeneity is the norm. Our work considers the dynamic version of task allocation, in which tasks are injected adversarially during an asynchronous execution. Intuitively, a major challenge in this setting is the fact that, at the same time, the adversary controls scheduling, process crashes, and the input.<\/p>\n<p>We give a new shared-memory algorithm for dynamic task allocation which is within logarithmic factors of optimal. The main algorithmic idea is a randomized data structure called a dynamic to-do tree, which allows processes to pick new tasks to perform uniformly at random from the set of available tasks, and to insert tasks at uniform random locations in the data structure. We show that these properties avoid duplicating work unnecessarily in well-behaved executions, and that work duplication is inherent under certain input-schedule combinations. This is the first algorithm to allow efficient work sharing in this challenging setting, and the result has applications to other problems, such as producer-consumer buffers, and distributed renaming.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3876\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3876\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3875\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tRASP: Large-Scale Graph Traversal with SSD Prefetching\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3875\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3876\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Eiko Yoneki (University of Cambridge), Karthik Nilakant (University of Cambridge), Valentin Dalibard (University of Cambridge), and Amitabha Roy (EPFL)<\/em><\/p>\n<p>Mining large graphs has now become an important aspect of multiple diverse applications and a number of computer systems have been proposed to efficiently execute graph algorithms. Recent interest in this area has led to the construction of single machine graph computation systems that use solid state drives (SSDs) to store the graph. This approach reduces the cost and simplifies the implementation of graph algorithms, making computations on large graphs available to the average user. However, SSDs are slower than main memory, and making full use of their bandwidth is crucial for executing graph algorithms in a reasonable amount of time. We present RASP (the (R)un(A)head (S)SD(P)refetcher) for graph algorithms that parallelises requests to derive maximum throughput from SSDs. RASP combines a judicious distribution of graph state between main memory and SSDs with an innovative run-ahead algorithm to prefetch needed data in parallel. This is in contrast to existing approaches that depend on multi-threading the graph algorithms to saturate available bandwidth. Our experiments on graph algorithms using random access<\/p>\n<p>show that RASP not only is capable of maximising the throughput from SSDs but is also able to almost hide the effect of I\/O latency. The improvements in runtime for graph algorithms is up to 14 X when compared to a single threaded baseline. When compared to sophisticated multi-threaded implementations, RASP performs up to 80% faster without the program complexity and the programmer effort needed for multithreaded graph algorithms.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3878\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3878\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3877\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tActive Data Management for Distributed Graph Processing\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3877\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3878\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Karthik Nilakant (University of Cambridge) and Eiko Yoneki (University of Cambridge)<\/em><\/p>\n<p>Graph processing systems are subject to two conflicting concerns: firstly, a large number of algorithms for processing graphs exist, most of which exhibit poor locality of accesses. Secondly, the increasing scale of data has resulted in a need to distribute the data across multiple machines, further affecting locality. Most existing systems tend to consider these factors separately. We propose schemes for pro-active or reactive active graph management. Pro-actively, one could instrument algorithmic code fragments to allow the processing engine to predict future behaviour of the graph algorithm. Producing \u201cbalanced cuts\u201d on a graph is computationally expensive, and infeasible for large datasets. Instead, lightweight graph analytics could be deployed to re-arrange the graph into less tightly coupled clusters. We propose a framework for considering these factors in unison. Ultimately, the choice of method to use in a given scenario will depend on the structure of the dataset and the processing algorithm. Our focus will be on identifying the minimum characteristics that must be gleaned from the code or data to be processed (or alternatively provided by the user), in order to boost the performance of such a system.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3880\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3880\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3879\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tInfrastructure for real-time analytics on modern cluster networks\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3879\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3880\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Aleksandar Dragojevic, Dushyanth Narayanan, Orion Hodson, Miguel Castro (Microsoft Research)<\/em><\/p>\n<p>Two hardware trends present a great opportunity for building efficient infrastructure for real-time analytics: (1) servers today have tens to hundreds of gigabytes of RAM making it possible to keep large data sets in RAM of a moderately-sized cluster of servers and (2) modern cluster networks, such as InfiniBand and RoCE, support access to memory of remote servers with just a few micro-seconds of latency. By keeping the whole data set in main memory of a cluster and using direct remote memory access (RDMA) primitives of the network it is possible to support tens to hundreds of millions of point lookups in a terabyte-scale key-value store on a cluster of just several tens of servers.<\/p>\n<p>We present Farm, a platform for building infrastructure for supporting in-memory applications that efficiently use modern cluster networks. Farm exposes the memory of all servers in a cluster as a single, fault-tolerant shared memory; it supports atomic reads and writes of small user-defined objects, efficient short transactions for easier synchronization, and exposes the location of objects to support locality-aware optimizations. We describe how to use Farm to build efficient and scalable B-trees and hashtables, which can be used as key-value stores or building blocks for more complex systems.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3882\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3882\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3881\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tBalanced Edge-Partitioning of Massive Scale Graphs\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3881\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3882\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Florian Bourse (Ecole Normale Sup\u00e9rieure, Paris), Marc Lelarge (INRIA- Ecole Normale Sup\u00e9rieure, Paris), and Milan Vojnovic (Microsoft Research)<\/em><\/p>\n<p>Many computation tasks are performed on large scale graph data which may amount to as many as billions or even trillions of vertices or edges (e.g. online social network graphs, knowledge graphs, and web graphs). A standard way to scale up such computations is to partition an input graph into clusters of balanced sizes and small cut costs. Graph partitioning is of interest for a wide range of systems including iterative computation platforms, distributed graph databases, and stream processing platforms. Besides its practical importance, graph partitioning is one of the most fundamental and challenging theoretical computer science problems.<\/p>\n<p>In our work we consider novel graph partitioning problems that arise under some computation models of interest &#8212; which specify how the size of a cluster and the cost of a cut are defined. We devote special focus to online assignment algorithms that require a single pass through vertices or edges. Our preliminary performance evaluation results demonstrate significant benefits of using graph partitioning algorithms that are specifically tailored for given computation model.<\/p>\n<p>This is a collaboration project with MSR-INRIA joint research centre.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3884\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3884\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3883\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tMining Interesting Patterns in Multi-Relational Data\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3883\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3884\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Eirini Spyropoulou (University of Bristol) and Tijl De Bie (University of Bristol)<\/em><\/p>\n<p>Research on local pattern mining algorithms has focused for years on mining one table of transactions and items (market basket data). However, a large amount of real-world datasets are multi-relational, i.e., they contain more than two entities and more than one relationships. There is therefore increasing interest in developing techniques for mining such complex types of data. Defining a multi-relational pattern syntax that is suitably expressive and finding an efficient mining algorithm, is a challenging problem. Furthermore, since local pattern mining methods usually suffer from the combinatorial explosion of patterns in the output, assessing the quality of patterns is also very important in order to make them useful to the end user.<\/p>\n<p>In this work we introduce a novel approach for mining patterns in multi-relational data. We propose the pattern syntax of Complete Connected Subsets, a new syntax for multi-relational patterns. While this pattern syntax is generally applicable to multi-relational data it reduces to well known itemsets when the data is a simple transaction table. We also introduce RMiner, a simple yet practically efficient algorithm to mine such patterns. We show how the interestingness of patterns of the proposed syntax can be conveniently quantified using a framework for quantifying the subjective interestingness of patterns according to the user&#8217;s prior knowledge. Finally, we present results on real world data sets.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Related Links: Cambridge Systems and Networking (opens in new tab) Microsoft Research Cambridge (opens in new tab) Organizers: Artur Czumaj, University of Warwick Milan Vojnovic (opens in new tab), Microsoft Research Jingren Zhou, Microsoft Local Arrangements: Cara Freeman, Microsoft Research Sponsors: Microsoft Research Ltd Centre for Discrete Mathematics and its Application (opens in new tab)s, [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_startdate":"2013-05-23","msr_enddate":"2013-05-24","msr_location":"Microsoft Research, Cambridge, U.K.","msr_expirationdate":"","msr_event_recording_link":"","msr_event_link":"","msr_event_link_redirect":false,"msr_event_time":"","msr_hide_region":false,"msr_private_event":true,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-region":[],"msr-event-type":[],"msr-video-type":[],"msr-locale":[268875],"msr-program-audience":[],"msr-post-option":[],"msr-impact-theme":[],"class_list":["post-283451","msr-event","type-msr-event","status-publish","hentry","msr-locale-en_us"],"msr_about":"<!-- wp:msr\/event-details {\"title\":\"Big Data Analytics 2013\",\"backgroundColor\":\"grey\"} \/-->\n\n<!-- wp:msr\/content-tabs --><!-- wp:msr\/content-tab {\"title\":\"About\"} --><!-- wp:freeform --><p>Related Links:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/cambridge-systems-and-networking\/\">Cambridge Systems and Networking<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/microsoft-research-cambridge\/\">Microsoft Research Cambridge<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<\/ul>\n<p>Organizers:<\/p>\n<ul>\n<li>Artur Czumaj, University of Warwick<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/milanv\/\">Milan Vojnovic<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, Microsoft Research<\/li>\n<li>Jingren Zhou, Microsoft<\/li>\n<\/ul>\n<p>Local Arrangements:<\/p>\n<ul>\n<li>Cara Freeman, Microsoft Research<\/li>\n<\/ul>\n<p>Sponsors:<\/p>\n<ul>\n<li>Microsoft Research Ltd<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www2.warwick.ac.uk\/fac\/cross_fac\/dimap\">Centre for Discrete Mathematics and its Application<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>s, University of Warwick<\/li>\n<\/ul>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-283463 size-full alignnone\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/banner.jpg\" alt=\"banner\" width=\"595\" height=\"200\" \/><\/p>\n<div>\n<div class=\"center sw mt16\">\n<div id=\"contentArea\">\n<div id=\"mainZone\" class=\"zone\">\n<div class=\"compositeModule_2_1Zone \">\n<div class=\"zone\">\n<div class=\"conM \">\n<p align=\"justify\">We witness a rapid development of the research and technology for efficient processing of big data. There is a surge of commercial and open source platforms for big data analytics, including platforms for querying of massive datasets, batch processing, real-time analytics, streaming computations, iterative computations, graph data processing, and distributed machine learning. There have been some remarkable achievements on the side of designing scalable and efficient algorithms for processing of massive amounts of data, as well as on the side of architecture of systems and infrastructure. The goal of this workshop is to bring together researchers and system architects to discuss and identify the most important and challenging directions to push forward the area of algorithms and systems for big data. The topics of the workshop include but are not limited to computation and storage platforms, querying of massive datasets, sketches, streaming, scaling up distributed machine learning, iterative computations, and large-scale graph processing.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Programme\"} --><!-- wp:freeform --><p>&lt;strong&gt;Day 1&lt;\/strong&gt;<\/p>\n<p>8:00 &#8211; 8:45<br \/>\nContinental breakfast<\/p>\n<p>8:45 &#8211; 9:00<br \/>\nWelcome note<\/p>\n<p>9:00 &#8211; 10:00 | <a href=\"http:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Text-to-Entities-and-from-Entities-to-Insight-a-Perspective-on-Unstructured-Big-Data.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/weikum-msr-cambridge-23may2013-posted.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/from-text-to-entities-and-from-entities-to-insight-a-perspective-on-unstructured-big-data\/\">Video<\/a><br \/>\nKeynote talk, Gerhard Weikum, Max Planck Institute for Informatics<br \/>\n&lt;em&gt;From Text to Entities and from Entities to Insight: a Perspective on Unstructured Big Data&lt;\/em&gt;<\/p>\n<p>10:00 &#8211; 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData.pdf\">Abstract<\/a><br \/>\nAnastasia Ailamaki, EPFL<br \/>\n&lt;em&gt;Querying and Exploring Big Brain Data&lt;\/em&gt;<\/p>\n<p>10:30 &#8211; 11:00<br \/>\nRefreshments<\/p>\n<p>&lt;em&gt;Programming and Computation Models&lt;\/em&gt;<\/p>\n<p>11:00 &#8211; 11:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/BigDataAnalyticswithStratosphere.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/stratosphere-aplatform-for-bda_msft-Copy.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/big-data-with-stratosphere\/\">Video<\/a><br \/>\nVolker Markl, TU-Berlin<br \/>\n&lt;em&gt;Big Data with Stratosphere&lt;\/em&gt;<\/p>\n<p>11:30 &#8211; 12:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/SCOPEParallelDatabasesMeetMapReduce.pdf\">Abstract<\/a><br \/>\nJingren Zhou, Microsoft Corporation<br \/>\n&lt;em&gt;SCOPE: Parallel Databases Meet MapReduce&lt;\/em&gt;<\/p>\n<p>12:00 &#8211; 13:00<br \/>\nLunch<\/p>\n<p>&lt;em&gt;Algorithms, Infrastructure, and Platforms&lt;\/em&gt;<\/p>\n<p>13:00 &#8211; 13:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/OnlineDataProcessingwithS4andOmid.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/big-data-workshop-fpj.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/online-data-processing-with-s4-and-omid\/\">Video<\/a><br \/>\nFlavio P. Junqueira, Microsoft Research<br \/>\n&lt;em&gt;Online Data Processing with S4 and Omid&lt;\/em&gt;<\/p>\n<p>13:30 &#8211; 14:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/PreditableDataCenters.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bigdataworkshopPreditableDataCenters.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/predictable-data-centers-3\/\">Video<\/a><br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/thomkar\/\">Thomas Karagiannis<\/a>, Microsoft Research<br \/>\n&lt;em&gt;Predictable Data Centers&lt;\/em&gt;<\/p>\n<p>14:00 &#8211; 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Terabytes-to-Megabytes-Finding-the-needle-by-shrinking-the-haystack.pdf\">Abstract<\/a><br \/>\nSergei Vassilvitskii, Google<br \/>\n&lt;em&gt;From Terabytes to Megabytes: Finding the Needle by Shrinking the Haystack&lt;\/em&gt;<\/p>\n<p>14:30 &#8211; 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/incremental-iterative-and-interactive-computation-using-differential-dataflow\/\">Video<\/a><br \/>\nFrank McSherry, Microsoft Research<br \/>\n&lt;em&gt;Incremental, Iterative, and Interactive Computation using Differential Dataflow&lt;\/em&gt;<\/p>\n<p>15:00 &#8211; 16:00 | <span style=\"text-decoration: underline\"><span style=\"color: #0000ff;text-decoration: underline\">See Demos and Posters Tab<\/span><\/span><br \/>\nRefreshments + Demo and Poster Session<\/p>\n<p>&lt;em&gt;Large-scale Graph Analytics&lt;\/em&gt;<\/p>\n<p>16:00 &#8211; 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines-Slides.pdf\">Slides<\/a><br \/>\nGuy Blelloch, Carnegie Mellon University<br \/>\n&lt;em&gt;Big Data on Small Machines&lt;\/em&gt;<\/p>\n<p>16:30 &#8211; 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Graphs-and-Linear-Measurements.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/graphs-and-linear-measurements-slides.pdf\">Slides<\/a><br \/>\nSudipto Guha, University of Pennsylvania<br \/>\n&lt;em&gt;Graphs and Linear Measurements&lt;\/em&gt;<\/p>\n<p>17:00 &#8211; 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs-slides.pdf\">Slides<\/a><br \/>\nGeorge Karypis, University of Minnesota<br \/>\n&lt;em&gt;Partitioning &amp; Clustering Big Graphs&lt;\/em&gt;<\/p>\n<p>17:30 &#8211; 18:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks-Slides.pdf\">Slides<\/a><br \/>\nStefano Leonardi, Sapienza University of Rome<br \/>\n&lt;em&gt;Online Team Formation in Social Networks&lt;\/em&gt;<\/p>\n<p>19:00 &#8211; 21:00<br \/>\nWorkshop reception and dinner at Emmanuel College<\/p>\n<p>&lt;strong&gt;Day 2&lt;\/strong&gt;<\/p>\n<p>8:30 &#8211; 9:00<br \/>\nCoffee and pastries<\/p>\n<p>9:00 &#8211; 10:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-and-Enterprise-Analytics-Abstract.pdf\">Abstract<\/a><br \/>\nKeynote talk<br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research<br \/>\n&lt;em&gt;Big Data and Enterprise Analytics&lt;\/em&gt;<\/p>\n<p>10:00 &#8211; 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/streaming-verification-of-outsourced-computation\/\">Video<\/a><br \/>\nGraham Cormode, University of Warwick<br \/>\n&lt;em&gt;Streaming Verification of Outsourced Computation&lt;\/em&gt;<\/p>\n<p>10:30 &#8211; 11:00<br \/>\nRefreshments + Group photo<\/p>\n<p>11:00 &#8211; 12:00 | Panel<br \/>\nModerator:<br \/>\nGraham Cormode, University of Warwick<br \/>\nPanelists:<br \/>\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research<br \/>\nSudipto Guha, University of Pennsylvania<br \/>\nSergei Vassilvitskii, Google<br \/>\nJingren Zhou, Microsoft Corporation<br \/>\n&lt;em&gt;Big Data Analytics: A Happy Marriage of Systems and Theory?&lt;\/em&gt;<\/p>\n<p>12:00 &#8211; 13:30<br \/>\nLunch<\/p>\n<p>13:30 &#8211; 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Fast-Algorithms-for-Perfect-Matching-in-Regular-Bipartite-Graphs-Abstract.pdf\">Abstract<\/a><br \/>\nKeynote talk, Sanjeev Khanna, University of Pennsylvania<br \/>\n&lt;em&gt;Fast Algorithms for Perfect Matchings in Regular Bipartite Graphs&lt;\/em&gt;<\/p>\n<p>&lt;em&gt;Large-Scale Graph Analytics Cont&#8217;d&lt;\/em&gt;<\/p>\n<p>14:30 &#8211; 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-PPT.pps\">Slides<\/a><br \/>\nAleksandar Madry, EPFL<br \/>\n&lt;em&gt;Cuts, Trees, and Electrical Flows&lt;\/em&gt;<\/p>\n<p>15:00 &#8211; 15:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/neighborhood-sampling-for-estimating-local-properties-on-a-graph-stream\/\">Video<\/a><br \/>\nSrikanta Tirthapura, Iowa State University<br \/>\n&lt;em&gt;Neighborhood Sampling for Estimating Local Properties on a Graph Stream&lt;\/em&gt;<\/p>\n<p>15:30 &#8211; 16:00<br \/>\nRefreshments<\/p>\n<p>&lt;em&gt;Streaming and Massive Distributed Data&lt;\/em&gt;<\/p>\n<p>16:00 &#8211; 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/What-Cant-We-Compute-on-Data-Streams.pdf\">Abstract<\/a> |\u00a0 <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/what-cant-we-compute-on-data-streams\/\">Video<\/a><br \/>\nAmit Chakrabarti, Dartmouth College<br \/>\n&lt;em&gt;What Can&#8217;t We Compute on Data Streams?&lt;\/em&gt;<\/p>\n<p>16:30 &#8211; 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData-1.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Querying-Big-Dynamic-Distributed-Data-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/querying-big-dynamic-distributed-data\/\">Video<\/a><br \/>\nMinos Garofalakis, Technical University of Crete<br \/>\n&lt;em&gt;Querying Big, Dynamic, Distributed Data&lt;\/em&gt;<\/p>\n<p>17:00 &#8211; 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergeable-Summaries-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergable-Summaries-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/mergable-summaries\/\">Video<\/a><br \/>\nKe Yi, Hong Kong University of Science and Technology<br \/>\n&lt;em&gt;Mergable Summaries&lt;\/em&gt;<\/p>\n<p>17:30 &#8211; 18:00<br \/>\nClosing session<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Keynote Speakers\"} --><!-- wp:freeform --><p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3854\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3854\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3853\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSurajit Chaudhuri\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3853\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3854\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283511\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc-1.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/><em>Distinguished Scientist and Managing Director, Microsoft Research<\/em><\/p>\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Surajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3856\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3856\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3855\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSanjeev Khanna\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3855\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3856\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283514\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/sanjeev_pic.jpg\" alt=\"sanjeev_pic\" width=\"75\" height=\"75\" \/><em>Professor, University of Pennsylvania<\/em><\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www.cis.upenn.edu\/~sanjeev\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Sanjeev Khanna is the Henry Salvatori Professor of Computer and Information Science at the University of Pennsylvania. His primary research interests are in the design and analysis of algorithms for combinatorial optimisation and in complexity theory. His research has been supported by National Science Foundation, an Alfred P. Sloan Fellowship, a Guggenheim Fellowship, and an IBM Faculty Award. Sanjeev serves on the Editorial boards of SICOMP, ACM TALG, and Foundations and Trends in Theoretical Computer Science and served as an area editor for Encyclopaedia of Algorithms.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3858\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3858\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3857\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tGerhard Weikum\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3857\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3858\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283517\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/gerhardweikum.jpg\" alt=\"gerhardweikum\" width=\"75\" height=\"75\" \/><em>Scientific Director, Max Planck Institute for Informatics<\/em><\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/people.mpi-inf.mpg.de\/~weikum\/\">Web Link<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n<p>Gerhard Weikum is a Scientific Director at the Max Planck Institute for Informatics in Saarbruecken, Germany, and also an Adjunct Professor at Saarland University. His research spans transactional and distributed systems, self-tuning database systems, DB&amp;IR integration, and automatic knowledge harvesting from Web and text sources. He co-authored a comprehensive textbook on transactional systems, received the VLDB 10-Year Award for his work on automatic DB tuning, and is one of the creators of the Yago knowledge base. Gerhard Weikum is an ACM Fellow and a member of the German Academy of Science and Engineering. He received a Google Focused Research Award in 2010, and the ACM SIGMOD Contributions Award in 2011.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Panel\"} --><!-- wp:freeform --><p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3860\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3860\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3859\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSurajit Chaudhuri\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3859\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3860\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283493\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/>Surajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3862\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3862\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3861\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSudipto Guha\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3861\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3862\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283496\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/guha.png\" alt=\"guha\" width=\"72\" height=\"72\" \/>Guha is an Associate Professor in the Department of Computer and Information Sciences at University of Pennsylvania since Fall 2001. He completed his Ph.D. in 2000 at Stanford University working on approximation algorithms. He is a recipient of the NSF CAREER award in 2007, and the Alfred P. Sloan Foundation fellowship.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3864\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3864\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3863\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tSergei Vassilvitskii\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3863\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3864\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283502\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/vassilvitskii.png\" alt=\"vassilvitskii\" width=\"72\" height=\"72\" \/>Sergei Vassilvitskii is a Research Scientist at Google and an Adjunct Assistant Professor of Computer Science at Columbia University. He received his M.S. and Ph.D. in Computer Science from Stanford University and a B.S. from Cornell University. He works on problems at the intersection of economics, data mining and computer science.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3866\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3866\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3865\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tJingren Zhou\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3865\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3866\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283508\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/jingren.png\" alt=\"jingren\" width=\"72\" height=\"72\" \/>Jingren Zhou is a Partner Development Manager in the Bing&#8217;s Infrastructure Team. Jingren manages a team to develop a cloud-scale distributed computation system, called SCOPE, targeted for massive data analysis over tens of thousands of machines at Microsoft Bing. The SCOPE system combines benefits from both traditional parallel databases and MapReduce execution engines to allow easy programmability and deliver massive scalability and high performance through advanced optimization. The system processes petabytes of data daily for a variety of data analysis and data mining applications, powering Bing and other online services at Microsoft. Before he joined Bing Search in 2008, Jingren was a researcher in the database research group at Microsoft Research. He has published many articles in premier database conferences and journals. Jingren received a Ph.D. in Computer Science from Columbia University.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3868\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3868\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3867\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tGraham Cormode\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3867\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3868\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283505\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/cormode.png\" alt=\"cormode\" width=\"72\" height=\"72\" \/>Graham Cormode is a professor of Computer Science at the University of Warwick, UK. He works on topics around big data, streaming analytics, privacy and data mining. From 2006-2013, he was a researcher at AT&amp;T Labs, and prior to that he was at Bell Labs<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Participants\"} --><!-- wp:freeform --><div>\n<div class=\"center sw mt16\">\n<div id=\"contentArea\">\n<div id=\"mainZone\" class=\"zone\">\n<div class=\"compositeModule_2_1Zone \">\n<div class=\"zone\">\n<div class=\"conM \">\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-283490\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bda_2013.jpg\" alt=\"bda_2013\" width=\"918\" height=\"571\" \/><\/p>\n<ul>\n<li>Anastasia Ailamaki, <em>EPFL<\/em><\/li>\n<li>Dan Alistarh, <em>MIT<\/em><\/li>\n<li>Hitesh Ballani, <em>Microsoft Research<\/em><\/li>\n<li>Alexandre de Baynast, <em>Microsoft ATL Europe<\/em><\/li>\n<li>Guy Blelloch, <em>Carnegie Mellon University<\/em><\/li>\n<li>Lucas Bordeaux, <em>Microsoft Research<\/em><\/li>\n<li>Florian Bourse, <em>ENS Paris<\/em><\/li>\n<li>John Bronskill, <em>Microsoft Research<\/em><\/li>\n<li>Miguel Castro, <em>Microsoft Research<\/em><\/li>\n<li>Nuno Cerquiera, <em>Microsoft (Data at Skype)<\/em><\/li>\n<li>Amit Chakrabarti, <em>Dartmouth College<\/em><\/li>\n<li>Stephen Clark, <em>University of Cambridge<\/em><\/li>\n<li>Raphael Clifford, <em>Bristol University<\/em><\/li>\n<li>Graham Cormode, <em>University of Warwick<\/em><\/li>\n<li>Paolo Costa, <em>Microsoft Research<\/em><\/li>\n<li>Artur Czumaj, <em>University of Warwick<\/em><\/li>\n<li>Valentin Dalibard, <em>University of Cambridge<\/em><\/li>\n<li>Bryan Dove, <em>Microsoft (Data at Skype)<\/em><\/li>\n<li>Moez Draief, <em>Imperial College<\/em><\/li>\n<li>Minos Garofalakis, <em>Technical University of Crete<\/em><\/li>\n<li>Richard Gibbens, <em>University of Cambridge<\/em><\/li>\n<li>Christos Gkantsidis, <em>Microsoft Research<\/em><\/li>\n<li>Thore Graepel, <em>Microsoft Research<\/em><\/li>\n<li>Peter Grindrod, <em>University of Reading<\/em><\/li>\n<li>Sudipto Guha, <em>University of Pennsylvania<\/em><\/li>\n<li>Mehdi Hosseini, <em>University of Cambridge<\/em><\/li>\n<li>Markus Jalsenius, <em>Bristol University<\/em><\/li>\n<li>Flavio Paiva Junqueira, <em>Microsoft Research<\/em><\/li>\n<li>Thomas Karagiannis, <em>Microsoft Research<\/em><\/li>\n<li>George Karypis, <em>University of Minnesota<\/em><\/li>\n<li>Ian Kash, <em>Microsoft Research<\/em><\/li>\n<li>Frank Kelly, <em>University of Cambridge<\/em><\/li>\n<li>Peter Key, <em>Microsoft Research<\/em><\/li>\n<li>Pushmeet Kohli, <em>Microsoft Research<\/em><\/li>\n<li>Philipp Kranen, <em>MSR ATL Europe<\/em><\/li>\n<li>Sergey Legtchenko, <em>Microsoft Research<\/em><\/li>\n<li>Marc Lelarge, <em>INRIA-ENS Paris<\/em><\/li>\n<li>Stefano Leonardi, <em>Sapienza University of Rome<\/em><\/li>\n<li>Aleksander Madry, <em>EPFL<\/em><\/li>\n<li>Volker Markl, <em>TU Berlin<\/em><\/li>\n<li>Laurent Massoulie,<em> INRIA<\/em><\/li>\n<li>Frank McSherry, <em>Microsoft Research<\/em><\/li>\n<li>Derek Murray, <em>Microsoft Research<\/em><\/li>\n<li>Dushyanth Narayanan, <em>Microsoft Research<\/em><\/li>\n<li>Karthik Nilakant<em>, University of Cambridge<\/em><\/li>\n<li>James Norris, <em>University of Cambridge<\/em><\/li>\n<li>Alexandre Proutiere, <em>KTH Stockholm<\/em><\/li>\n<li>Ant Rowstron, <em>Microsoft Research<\/em><\/li>\n<li>Benjamin Sach, <em>University of Warwick<\/em><\/li>\n<li>Richard Samworth, <em>University of Cambridge<\/em><\/li>\n<li>Eirini Spyropoulou, <em>University of Bristol<\/em><\/li>\n<li>Maxim Sviridenko, <em>University of Warwick<\/em><\/li>\n<li>Don Syme<em>, Microsoft Research<\/em><\/li>\n<li>Srikanta Tirthapura, <em>Iowa State University<\/em><\/li>\n<li>Andy Twigg, <em>University of Oxford<\/em><\/li>\n<li>Sergei Vassilvitskii, <em>Google<\/em><\/li>\n<li>Milan Vojnovic, <em>Microsoft Research<\/em><\/li>\n<li>Gerhard Weikum<em>, Max Planck Institute for Informatics<\/em><\/li>\n<li>Ken Woodberry, <em>Microsoft Research<\/em><\/li>\n<li>Ke Yi, <em>Hong Kong University of Science and Technology<\/em><\/li>\n<li>Eiko Yoneki, <em>University of Cambridge<\/em><\/li>\n<li>Reynold Xin, <em>University of California, Berkeley<\/em><\/li>\n<li>Jingren Zhou, <em>Microsoft Corporation (Bing)<\/em><\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Demos and Posters\"} --><!-- wp:freeform --><p>\t<div data-wp-context='{\"items\":[]}' data-wp-interactive=\"msr\/accordion\">\n\t\t\t\t\t<div class=\"clearfix\">\n\t\t\t\t<div\n\t\t\t\t\tclass=\"btn-group align-items-center mb-g float-sm-right\"\n\t\t\t\t\tdata-bi-aN=\"accordion-collapse-controls\"\n\t\t\t\t>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Expand all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllExpanded\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onExpandAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tExpand all\t\t\t\t\t<\/button>\n\t\t\t\t\t<span aria-hidden=\"true\"> | <\/span>\n\t\t\t\t\t<button\n\t\t\t\t\t\tclass=\"btn btn-link m-0\"\n\t\t\t\t\t\tdata-bi-cN=\"Collapse all\"\n\t\t\t\t\t\tdata-wp-bind--aria-controls=\"state.ariaControls\"\n\t\t\t\t\t\tdata-wp-bind--aria-expanded=\"state.ariaExpanded\"\n\t\t\t\t\t\tdata-wp-bind--disabled=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-class--inactive=\"state.isAllCollapsed\"\n\t\t\t\t\t\tdata-wp-on--click=\"actions.onCollapseAll\"\n\t\t\t\t\t\ttype=\"button\"\n\t\t\t\t\t>\n\t\t\t\t\t\tCollapse all\t\t\t\t\t<\/button>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t<ul class=\"msr-accordion\">\n\t\t\t\t\t\t\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3870\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3870\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3869\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tBig Metadata - Programming at Internet-Scale\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3869\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3870\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Don Syme, Kenji Takeda, and Keith Battocchi (Microsoft Research)<\/em><\/p>\n<p>We live in an information society. Programming languages are not designed for this. As we move into a world of &#8220;Devices + Services&#8221;, it is vital that developers can productively integrate information at internet-scale into their everyday programming environment. Effectively exploring, navigating, understanding, analysing and presenting data \u2013 both big and broad \u2013 is a key to success for developers. Current experiences for this are cumbersome &amp; labour-intensive. In this new demo we explain the latest research behind F# Type Providers and show how they can bring Internet-Scale Metadata to the fingers of developers. IntelliSense for Internet-Scale Metadata creates a completely new way of rapidly developing applications on Azure, when using Hadoop, and when programming against the web. We explain how this can transform F# into a uniquely powerful and developer-friendly Data Science scripting language for Azure.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3872\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3872\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3871\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tIncremental, Iterative and Interactive Computation in Naiad\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3871\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3872\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Frank McSherry, Derek Murray, Rebecca Isaacs, and Michael Isard (Microsoft Research)<\/em><\/p>\n<p>We are developing a system for scalable data analysis designed to support iterative queries over continuously changing inputs at interactive timescales. The system is based on a new computational framework, differential dataflow, that generalizes standard incremental dataflow for far greater reuse of previous results when collections change. Differential dataflow distinguishes between the multiple reasons a collection might change, including both loop feedback and new input data, allowing a system to re-use the most appropriate results from previously performed work when an incremental update arrives.<\/p>\n<p>Our prototype system, Naiad, demonstrates the practical application of differential dataflow. Like many systems, Naiad supports high-level declarative queries, data-parallel execution, and transparent distribution. Unlike other systems, Naiad efficiently executes queries with multiple (possibly nested) loops, while simultaneously responding with low latency to incremental changes to the inputs. We show how differential dataflow enables orders of magnitude speedups for a variety of complex workloads on real streaming data.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3874\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3874\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3873\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tDynamic Task Allocation in Asynchronous Shared Memory\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3873\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3874\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Dan Alistarh (MIT), James Aspnes (Yale University), Michael A. Bender (Stony Brook University), Rati Gelashvili (MIT), and Seth Gilbert (NUS)<\/em><\/p>\n<p>Task allocation is a classic distributed problem in which a set of p potentially faulty processes must perform a set of m tasks. The problem is all the more challenging when there is heterogeneity, either on the workers&#8217; side, since individual agents may have varying speed and robustness, or because of uneven workloads. In large-scale systems, heterogeneity is the norm. Our work considers the dynamic version of task allocation, in which tasks are injected adversarially during an asynchronous execution. Intuitively, a major challenge in this setting is the fact that, at the same time, the adversary controls scheduling, process crashes, and the input.<\/p>\n<p>We give a new shared-memory algorithm for dynamic task allocation which is within logarithmic factors of optimal. The main algorithmic idea is a randomized data structure called a dynamic to-do tree, which allows processes to pick new tasks to perform uniformly at random from the set of available tasks, and to insert tasks at uniform random locations in the data structure. We show that these properties avoid duplicating work unnecessarily in well-behaved executions, and that work duplication is inherent under certain input-schedule combinations. This is the first algorithm to allow efficient work sharing in this challenging setting, and the result has applications to other problems, such as producer-consumer buffers, and distributed renaming.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3876\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3876\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3875\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tRASP: Large-Scale Graph Traversal with SSD Prefetching\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3875\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3876\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Eiko Yoneki (University of Cambridge), Karthik Nilakant (University of Cambridge), Valentin Dalibard (University of Cambridge), and Amitabha Roy (EPFL)<\/em><\/p>\n<p>Mining large graphs has now become an important aspect of multiple diverse applications and a number of computer systems have been proposed to efficiently execute graph algorithms. Recent interest in this area has led to the construction of single machine graph computation systems that use solid state drives (SSDs) to store the graph. This approach reduces the cost and simplifies the implementation of graph algorithms, making computations on large graphs available to the average user. However, SSDs are slower than main memory, and making full use of their bandwidth is crucial for executing graph algorithms in a reasonable amount of time. We present RASP (the (R)un(A)head (S)SD(P)refetcher) for graph algorithms that parallelises requests to derive maximum throughput from SSDs. RASP combines a judicious distribution of graph state between main memory and SSDs with an innovative run-ahead algorithm to prefetch needed data in parallel. This is in contrast to existing approaches that depend on multi-threading the graph algorithms to saturate available bandwidth. Our experiments on graph algorithms using random access<\/p>\n<p>show that RASP not only is capable of maximising the throughput from SSDs but is also able to almost hide the effect of I\/O latency. The improvements in runtime for graph algorithms is up to 14 X when compared to a single threaded baseline. When compared to sophisticated multi-threaded implementations, RASP performs up to 80% faster without the program complexity and the programmer effort needed for multithreaded graph algorithms.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3878\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3878\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3877\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tActive Data Management for Distributed Graph Processing\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3877\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3878\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Karthik Nilakant (University of Cambridge) and Eiko Yoneki (University of Cambridge)<\/em><\/p>\n<p>Graph processing systems are subject to two conflicting concerns: firstly, a large number of algorithms for processing graphs exist, most of which exhibit poor locality of accesses. Secondly, the increasing scale of data has resulted in a need to distribute the data across multiple machines, further affecting locality. Most existing systems tend to consider these factors separately. We propose schemes for pro-active or reactive active graph management. Pro-actively, one could instrument algorithmic code fragments to allow the processing engine to predict future behaviour of the graph algorithm. Producing \u201cbalanced cuts\u201d on a graph is computationally expensive, and infeasible for large datasets. Instead, lightweight graph analytics could be deployed to re-arrange the graph into less tightly coupled clusters. We propose a framework for considering these factors in unison. Ultimately, the choice of method to use in a given scenario will depend on the structure of the dataset and the processing algorithm. Our focus will be on identifying the minimum characteristics that must be gleaned from the code or data to be processed (or alternatively provided by the user), in order to boost the performance of such a system.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3880\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3880\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3879\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tInfrastructure for real-time analytics on modern cluster networks\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3879\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3880\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Aleksandar Dragojevic, Dushyanth Narayanan, Orion Hodson, Miguel Castro (Microsoft Research)<\/em><\/p>\n<p>Two hardware trends present a great opportunity for building efficient infrastructure for real-time analytics: (1) servers today have tens to hundreds of gigabytes of RAM making it possible to keep large data sets in RAM of a moderately-sized cluster of servers and (2) modern cluster networks, such as InfiniBand and RoCE, support access to memory of remote servers with just a few micro-seconds of latency. By keeping the whole data set in main memory of a cluster and using direct remote memory access (RDMA) primitives of the network it is possible to support tens to hundreds of millions of point lookups in a terabyte-scale key-value store on a cluster of just several tens of servers.<\/p>\n<p>We present Farm, a platform for building infrastructure for supporting in-memory applications that efficiently use modern cluster networks. Farm exposes the memory of all servers in a cluster as a single, fault-tolerant shared memory; it supports atomic reads and writes of small user-defined objects, efficient short transactions for easier synchronization, and exposes the location of objects to support locality-aware optimizations. We describe how to use Farm to build efficient and scalable B-trees and hashtables, which can be used as key-value stores or building blocks for more complex systems.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3882\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3882\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3881\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tBalanced Edge-Partitioning of Massive Scale Graphs\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3881\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3882\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Florian Bourse (Ecole Normale Sup\u00e9rieure, Paris), Marc Lelarge (INRIA- Ecole Normale Sup\u00e9rieure, Paris), and Milan Vojnovic (Microsoft Research)<\/em><\/p>\n<p>Many computation tasks are performed on large scale graph data which may amount to as many as billions or even trillions of vertices or edges (e.g. online social network graphs, knowledge graphs, and web graphs). A standard way to scale up such computations is to partition an input graph into clusters of balanced sizes and small cut costs. Graph partitioning is of interest for a wide range of systems including iterative computation platforms, distributed graph databases, and stream processing platforms. Besides its practical importance, graph partitioning is one of the most fundamental and challenging theoretical computer science problems.<\/p>\n<p>In our work we consider novel graph partitioning problems that arise under some computation models of interest &#8212; which specify how the size of a cluster and the cost of a cut are defined. We devote special focus to online assignment algorithms that require a single pass through vertices or edges. Our preliminary performance evaluation results demonstrate significant benefits of using graph partitioning algorithms that are specifically tailored for given computation model.<\/p>\n<p>This is a collaboration project with MSR-INRIA joint research centre.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t<li class=\"m-0\" data-wp-context='{\"id\":\"accordion-content-3884\"}' data-wp-init=\"callbacks.init\">\n\t\t<div class=\"accordion-header\">\n\t\t\t<button\n\t\t\t\taria-controls=\"accordion-content-3884\"\n\t\t\t\tclass=\"btn btn-collapse\"\n\t\t\t\tdata-wp-bind--aria-expanded=\"state.isExpanded\"\n\t\t\t\tdata-wp-on--click=\"actions.onClick\"\n\t\t\t\tid=\"accordion-button-3883\"\n\t\t\t\ttype=\"button\"\n\t\t\t>\n\t\t\t\tMining Interesting Patterns in Multi-Relational Data\t\t\t<\/button>\n\t\t<\/div>\n\t\t<div\n\t\t\taria-labelledby=\"accordion-button-3883\"\n\t\t\tclass=\"msr-accordion__content\"\n\t\t\tdata-wp-bind--inert=\"!state.isExpanded\"\n\t\t\tdata-wp-run=\"callbacks.run\"\n\t\t\tid=\"accordion-content-3884\"\n\t\t>\n\t\t\t<div class=\"msr-accordion__body\">\n\t\t\t\t<p><em>Eirini Spyropoulou (University of Bristol) and Tijl De Bie (University of Bristol)<\/em><\/p>\n<p>Research on local pattern mining algorithms has focused for years on mining one table of transactions and items (market basket data). However, a large amount of real-world datasets are multi-relational, i.e., they contain more than two entities and more than one relationships. There is therefore increasing interest in developing techniques for mining such complex types of data. Defining a multi-relational pattern syntax that is suitably expressive and finding an efficient mining algorithm, is a challenging problem. Furthermore, since local pattern mining methods usually suffer from the combinatorial explosion of patterns in the output, assessing the quality of patterns is also very important in order to make them useful to the end user.<\/p>\n<p>In this work we introduce a novel approach for mining patterns in multi-relational data. We propose the pattern syntax of Complete Connected Subsets, a new syntax for multi-relational patterns. While this pattern syntax is generally applicable to multi-relational data it reduces to well known itemsets when the data is a simple transaction table. We also introduce RMiner, a simple yet practically efficient algorithm to mine such patterns. We show how the interestingness of patterns of the proposed syntax can be conveniently quantified using a framework for quantifying the subjective interestingness of patterns according to the user&#8217;s prior knowledge. Finally, we present results on real world data sets.<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t<\/div>\n\t<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- \/wp:msr\/content-tabs -->","tab-content":[{"id":0,"name":"About","content":"<img class=\"wp-image-283463 size-full alignnone\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/banner.jpg\" alt=\"banner\" width=\"595\" height=\"200\" \/>\r\n<div>\r\n<div class=\"center sw mt16\">\r\n<div id=\"contentArea\">\r\n<div id=\"mainZone\" class=\"zone\">\r\n<div class=\"compositeModule_2_1Zone \">\r\n<div class=\"zone\">\r\n<div class=\"conM \">\r\n<p align=\"justify\">We witness a rapid development of the research and technology for efficient processing of big data. There is a surge of commercial and open source platforms for big data analytics, including platforms for querying of massive datasets, batch processing, real-time analytics, streaming computations, iterative computations, graph data processing, and distributed machine learning. There have been some remarkable achievements on the side of designing scalable and efficient algorithms for processing of massive amounts of data, as well as on the side of architecture of systems and infrastructure. The goal of this workshop is to bring together researchers and system architects to discuss and identify the most important and challenging directions to push forward the area of algorithms and systems for big data. The topics of the workshop include but are not limited to computation and storage platforms, querying of massive datasets, sketches, streaming, scaling up distributed machine learning, iterative computations, and large-scale graph processing.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n&nbsp;"},{"id":1,"name":"Programme","content":"&lt;strong&gt;Day 1&lt;\/strong&gt;\r\n\r\n8:00 - 8:45\r\nContinental breakfast\r\n\r\n8:45 - 9:00\r\nWelcome note\r\n\r\n9:00 - 10:00 | <a href=\"http:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Text-to-Entities-and-from-Entities-to-Insight-a-Perspective-on-Unstructured-Big-Data.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/weikum-msr-cambridge-23may2013-posted.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/from-text-to-entities-and-from-entities-to-insight-a-perspective-on-unstructured-big-data\/\">Video<\/a>\r\nKeynote talk, Gerhard Weikum, Max Planck Institute for Informatics\r\n&lt;em&gt;From Text to Entities and from Entities to Insight: a Perspective on Unstructured Big Data&lt;\/em&gt;\r\n\r\n10:00 - 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData.pdf\">Abstract<\/a>\r\nAnastasia Ailamaki, EPFL\r\n&lt;em&gt;Querying and Exploring Big Brain Data&lt;\/em&gt;\r\n\r\n10:30 - 11:00\r\nRefreshments\r\n\r\n&lt;em&gt;Programming and Computation Models&lt;\/em&gt;\r\n\r\n11:00 - 11:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/BigDataAnalyticswithStratosphere.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/stratosphere-aplatform-for-bda_msft-Copy.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/big-data-with-stratosphere\/\">Video<\/a>\r\nVolker Markl, TU-Berlin\r\n&lt;em&gt;Big Data with Stratosphere&lt;\/em&gt;\r\n\r\n11:30 - 12:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/SCOPEParallelDatabasesMeetMapReduce.pdf\">Abstract<\/a>\r\nJingren Zhou, Microsoft Corporation\r\n&lt;em&gt;SCOPE: Parallel Databases Meet MapReduce&lt;\/em&gt;\r\n\r\n12:00 - 13:00\r\nLunch\r\n\r\n&lt;em&gt;Algorithms, Infrastructure, and Platforms&lt;\/em&gt;\r\n\r\n13:00 - 13:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/OnlineDataProcessingwithS4andOmid.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/big-data-workshop-fpj.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/online-data-processing-with-s4-and-omid\/\">Video<\/a>\r\nFlavio P. Junqueira, Microsoft Research\r\n&lt;em&gt;Online Data Processing with S4 and Omid&lt;\/em&gt;\r\n\r\n13:30 - 14:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/PreditableDataCenters.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bigdataworkshopPreditableDataCenters.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/predictable-data-centers-3\/\">Video<\/a>\r\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/thomkar\/\">Thomas Karagiannis<\/a>, Microsoft Research\r\n&lt;em&gt;Predictable Data Centers&lt;\/em&gt;\r\n\r\n14:00 - 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/From-Terabytes-to-Megabytes-Finding-the-needle-by-shrinking-the-haystack.pdf\">Abstract<\/a>\r\nSergei Vassilvitskii, Google\r\n&lt;em&gt;From Terabytes to Megabytes: Finding the Needle by Shrinking the Haystack&lt;\/em&gt;\r\n\r\n14:30 - 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Incremental-Iterative-and-Interactive-Computation-using-Differential-Dataflow.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/incremental-iterative-and-interactive-computation-using-differential-dataflow\/\">Video<\/a>\r\nFrank McSherry, Microsoft Research\r\n&lt;em&gt;Incremental, Iterative, and Interactive Computation using Differential Dataflow&lt;\/em&gt;\r\n\r\n15:00 - 16:00 | <span style=\"text-decoration: underline\"><span style=\"color: #0000ff;text-decoration: underline\">See Demos and Posters Tab<\/span><\/span>\r\nRefreshments + Demo and Poster Session\r\n\r\n&lt;em&gt;Large-scale Graph Analytics&lt;\/em&gt;\r\n\r\n16:00 - 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-on-Small-Machines-Slides.pdf\">Slides<\/a>\r\nGuy Blelloch, Carnegie Mellon University\r\n&lt;em&gt;Big Data on Small Machines&lt;\/em&gt;\r\n\r\n16:30 - 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Graphs-and-Linear-Measurements.pdf\">Abstracts<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/graphs-and-linear-measurements-slides.pdf\">Slides<\/a>\r\nSudipto Guha, University of Pennsylvania\r\n&lt;em&gt;Graphs and Linear Measurements&lt;\/em&gt;\r\n\r\n17:00 - 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Partitioning-Clustering-Big-Graphs-slides.pdf\">Slides<\/a>\r\nGeorge Karypis, University of Minnesota\r\n&lt;em&gt;Partitioning &amp; Clustering Big Graphs&lt;\/em&gt;\r\n\r\n17:30 - 18:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Online-Team-Formation-in-Social-Networks-Slides.pdf\">Slides<\/a>\r\nStefano Leonardi, Sapienza University of Rome\r\n&lt;em&gt;Online Team Formation in Social Networks&lt;\/em&gt;\r\n\r\n19:00 - 21:00\r\nWorkshop reception and dinner at Emmanuel College\r\n\r\n&lt;strong&gt;Day 2&lt;\/strong&gt;\r\n\r\n8:30 - 9:00\r\nCoffee and pastries\r\n\r\n9:00 - 10:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Big-Data-and-Enterprise-Analytics-Abstract.pdf\">Abstract<\/a>\r\nKeynote talk\r\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research\r\n&lt;em&gt;Big Data and Enterprise Analytics&lt;\/em&gt;\r\n\r\n10:00 - 10:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Streaming-Verification-of-Outsourced-Computation-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/streaming-verification-of-outsourced-computation\/\">Video<\/a>\r\nGraham Cormode, University of Warwick\r\n&lt;em&gt;Streaming Verification of Outsourced Computation&lt;\/em&gt;\r\n\r\n10:30 - 11:00\r\nRefreshments + Group photo\r\n\r\n11:00 - 12:00 | Panel\r\nModerator:\r\nGraham Cormode, University of Warwick\r\nPanelists:\r\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Surajit Chaudhuri<\/a>, Microsoft Research\r\nSudipto Guha, University of Pennsylvania\r\nSergei Vassilvitskii, Google\r\nJingren Zhou, Microsoft Corporation\r\n&lt;em&gt;Big Data Analytics: A Happy Marriage of Systems and Theory?&lt;\/em&gt;\r\n\r\n12:00 - 13:30\r\nLunch\r\n\r\n13:30 - 14:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Fast-Algorithms-for-Perfect-Matching-in-Regular-Bipartite-Graphs-Abstract.pdf\">Abstract<\/a>\r\nKeynote talk, Sanjeev Khanna, University of Pennsylvania\r\n&lt;em&gt;Fast Algorithms for Perfect Matchings in Regular Bipartite Graphs&lt;\/em&gt;\r\n\r\n&lt;em&gt;Large-Scale Graph Analytics Cont'd&lt;\/em&gt;\r\n\r\n14:30 - 15:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Cuts-Trees-and-Electrical-Flows-PPT.pps\">Slides<\/a>\r\nAleksandar Madry, EPFL\r\n&lt;em&gt;Cuts, Trees, and Electrical Flows&lt;\/em&gt;\r\n\r\n15:00 - 15:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Neighborhood-Sampling-for-Estimating-Local-Properties-on-a-Graph-Stream-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/neighborhood-sampling-for-estimating-local-properties-on-a-graph-stream\/\">Video<\/a>\r\nSrikanta Tirthapura, Iowa State University\r\n&lt;em&gt;Neighborhood Sampling for Estimating Local Properties on a Graph Stream&lt;\/em&gt;\r\n\r\n15:30 - 16:00\r\nRefreshments\r\n\r\n&lt;em&gt;Streaming and Massive Distributed Data&lt;\/em&gt;\r\n\r\n16:00 - 16:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/What-Cant-We-Compute-on-Data-Streams.pdf\">Abstract<\/a> |\u00a0 <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/what-cant-we-compute-on-data-streams\/\">Video<\/a>\r\nAmit Chakrabarti, Dartmouth College\r\n&lt;em&gt;What Can't We Compute on Data Streams?&lt;\/em&gt;\r\n\r\n16:30 - 17:00 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/QueryingandExploringBigBrainData-1.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Querying-Big-Dynamic-Distributed-Data-Slides.pdf\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/querying-big-dynamic-distributed-data\/\">Video<\/a>\r\nMinos Garofalakis, Technical University of Crete\r\n&lt;em&gt;Querying Big, Dynamic, Distributed Data&lt;\/em&gt;\r\n\r\n17:00 - 17:30 | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergeable-Summaries-Abstract.pdf\">Abstract<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/Mergable-Summaries-PPT.pptx\">Slides<\/a> | <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/video\/mergable-summaries\/\">Video<\/a>\r\nKe Yi, Hong Kong University of Science and Technology\r\n&lt;em&gt;Mergable Summaries&lt;\/em&gt;\r\n\r\n17:30 - 18:00\r\nClosing session"},{"id":2,"name":"Keynote Speakers","content":"[accordion]\r\n\r\n[panel header=\"Surajit Chaudhuri\"]\r\n<img class=\"alignleft size-full wp-image-283511\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc-1.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/><em>Distinguished Scientist and Managing Director, Microsoft Research<\/em>\r\n\r\n<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/surajitc\/\">Web Link<\/a>\r\n\r\nSurajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.\r\n[\/panel]\r\n\r\n[panel header=\"Sanjeev Khanna\"]\r\n<img class=\"alignleft size-full wp-image-283514\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/sanjeev_pic.jpg\" alt=\"sanjeev_pic\" width=\"75\" height=\"75\" \/><em>Professor, University of Pennsylvania<\/em>\r\n\r\n<a href=\"http:\/\/www.cis.upenn.edu\/~sanjeev\/\">Web Link<\/a>\r\n\r\nSanjeev Khanna is the Henry Salvatori Professor of Computer and Information Science at the University of Pennsylvania. His primary research interests are in the design and analysis of algorithms for combinatorial optimisation and in complexity theory. His research has been supported by National Science Foundation, an Alfred P. Sloan Fellowship, a Guggenheim Fellowship, and an IBM Faculty Award. Sanjeev serves on the Editorial boards of SICOMP, ACM TALG, and Foundations and Trends in Theoretical Computer Science and served as an area editor for Encyclopaedia of Algorithms.[\/panel]\r\n\r\n[panel header=\"Gerhard Weikum\"]\r\n<img class=\"alignleft size-full wp-image-283517\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/gerhardweikum.jpg\" alt=\"gerhardweikum\" width=\"75\" height=\"75\" \/><em>Scientific Director, Max Planck Institute for Informatics<\/em>\r\n\r\n<a href=\"http:\/\/people.mpi-inf.mpg.de\/~weikum\/\">Web Link<\/a>\r\n\r\nGerhard Weikum is a Scientific Director at the Max Planck Institute for Informatics in Saarbruecken, Germany, and also an Adjunct Professor at Saarland University. His research spans transactional and distributed systems, self-tuning database systems, DB&amp;IR integration, and automatic knowledge harvesting from Web and text sources. He co-authored a comprehensive textbook on transactional systems, received the VLDB 10-Year Award for his work on automatic DB tuning, and is one of the creators of the Yago knowledge base. Gerhard Weikum is an ACM Fellow and a member of the German Academy of Science and Engineering. He received a Google Focused Research Award in 2010, and the ACM SIGMOD Contributions Award in 2011.[\/panel]\r\n\r\n[\/accordion]"},{"id":3,"name":"Panel","content":"[accordion]\r\n\r\n[panel header=\"Surajit Chaudhuri\"]\r\n<img class=\"alignleft size-full wp-image-283493\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/surajitc.jpg\" alt=\"surajitc\" width=\"75\" height=\"75\" \/>Surajit Chaudhuri, Distinguished Scientist and a Managing Director at Microsoft Research, will discuss the key secular trends that characterize the field of Big Data with respect to enterprise analytics. He will describe some of the open challenges for enterprise analytics in the context of Big Data.\r\n[\/panel]\r\n\r\n[panel header=\"Sudipto Guha\"]\r\n<img class=\"alignleft size-full wp-image-283496\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/guha.png\" alt=\"guha\" width=\"72\" height=\"72\" \/>Guha is an Associate Professor in the Department of Computer and Information Sciences at University of Pennsylvania since Fall 2001. He completed his Ph.D. in 2000 at Stanford University working on approximation algorithms. He is a recipient of the NSF CAREER award in 2007, and the Alfred P. Sloan Foundation fellowship.[\/panel]\r\n\r\n[panel header=\"Sergei Vassilvitskii\"]\r\n<img class=\"alignleft size-full wp-image-283502\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/vassilvitskii.png\" alt=\"vassilvitskii\" width=\"72\" height=\"72\" \/>Sergei Vassilvitskii is a Research Scientist at Google and an Adjunct Assistant Professor of Computer Science at Columbia University. He received his M.S. and Ph.D. in Computer Science from Stanford University and a B.S. from Cornell University. He works on problems at the intersection of economics, data mining and computer science.\r\n[\/panel]\r\n\r\n[panel header=\"Jingren Zhou\"]\r\n<img class=\"alignleft size-full wp-image-283508\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/jingren.png\" alt=\"jingren\" width=\"72\" height=\"72\" \/>Jingren Zhou is a Partner Development Manager in the Bing's Infrastructure Team. Jingren manages a team to develop a cloud-scale distributed computation system, called SCOPE, targeted for massive data analysis over tens of thousands of machines at Microsoft Bing. The SCOPE system combines benefits from both traditional parallel databases and MapReduce execution engines to allow easy programmability and deliver massive scalability and high performance through advanced optimization. The system processes petabytes of data daily for a variety of data analysis and data mining applications, powering Bing and other online services at Microsoft. Before he joined Bing Search in 2008, Jingren was a researcher in the database research group at Microsoft Research. He has published many articles in premier database conferences and journals. Jingren received a Ph.D. in Computer Science from Columbia University.\r\n[\/panel]\r\n\r\n[panel header=\"Graham Cormode\"]\r\n<img class=\"alignleft size-full wp-image-283505\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/cormode.png\" alt=\"cormode\" width=\"72\" height=\"72\" \/>Graham Cormode is a professor of Computer Science at the University of Warwick, UK. He works on topics around big data, streaming analytics, privacy and data mining. From 2006-2013, he was a researcher at AT&amp;T Labs, and prior to that he was at Bell Labs\r\n[\/panel]\r\n\r\n[\/accordion]"},{"id":4,"name":"Participants","content":"<div>\r\n<div class=\"center sw mt16\">\r\n<div id=\"contentArea\">\r\n<div id=\"mainZone\" class=\"zone\">\r\n<div class=\"compositeModule_2_1Zone \">\r\n<div class=\"zone\">\r\n<div class=\"conM \">\r\n\r\n<img class=\"alignleft size-full wp-image-283490\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/08\/bda_2013.jpg\" alt=\"bda_2013\" width=\"918\" height=\"571\" \/>\r\n<ul>\r\n \t<li>Anastasia Ailamaki, <em>EPFL<\/em><\/li>\r\n \t<li>Dan Alistarh, <em>MIT<\/em><\/li>\r\n \t<li>Hitesh Ballani, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Alexandre de Baynast, <em>Microsoft ATL Europe<\/em><\/li>\r\n \t<li>Guy Blelloch, <em>Carnegie Mellon University<\/em><\/li>\r\n \t<li>Lucas Bordeaux, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Florian Bourse, <em>ENS Paris<\/em><\/li>\r\n \t<li>John Bronskill, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Miguel Castro, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Nuno Cerquiera, <em>Microsoft (Data at Skype)<\/em><\/li>\r\n \t<li>Amit Chakrabarti, <em>Dartmouth College<\/em><\/li>\r\n \t<li>Stephen Clark, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Raphael Clifford, <em>Bristol University<\/em><\/li>\r\n \t<li>Graham Cormode, <em>University of Warwick<\/em><\/li>\r\n \t<li>Paolo Costa, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Artur Czumaj, <em>University of Warwick<\/em><\/li>\r\n \t<li>Valentin Dalibard, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Bryan Dove, <em>Microsoft (Data at Skype)<\/em><\/li>\r\n \t<li>Moez Draief, <em>Imperial College<\/em><\/li>\r\n \t<li>Minos Garofalakis, <em>Technical University of Crete<\/em><\/li>\r\n \t<li>Richard Gibbens, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Christos Gkantsidis, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Thore Graepel, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Peter Grindrod, <em>University of Reading<\/em><\/li>\r\n \t<li>Sudipto Guha, <em>University of Pennsylvania<\/em><\/li>\r\n \t<li>Mehdi Hosseini, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Markus Jalsenius, <em>Bristol University<\/em><\/li>\r\n \t<li>Flavio Paiva Junqueira, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Thomas Karagiannis, <em>Microsoft Research<\/em><\/li>\r\n \t<li>George Karypis, <em>University of Minnesota<\/em><\/li>\r\n \t<li>Ian Kash, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Frank Kelly, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Peter Key, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Pushmeet Kohli, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Philipp Kranen, <em>MSR ATL Europe<\/em><\/li>\r\n \t<li>Sergey Legtchenko, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Marc Lelarge, <em>INRIA-ENS Paris<\/em><\/li>\r\n \t<li>Stefano Leonardi, <em>Sapienza University of Rome<\/em><\/li>\r\n \t<li>Aleksander Madry, <em>EPFL<\/em><\/li>\r\n \t<li>Volker Markl, <em>TU Berlin<\/em><\/li>\r\n \t<li>Laurent Massoulie,<em> INRIA<\/em><\/li>\r\n \t<li>Frank McSherry, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Derek Murray, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Dushyanth Narayanan, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Karthik Nilakant<em>, University of Cambridge<\/em><\/li>\r\n \t<li>James Norris, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Alexandre Proutiere, <em>KTH Stockholm<\/em><\/li>\r\n \t<li>Ant Rowstron, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Benjamin Sach, <em>University of Warwick<\/em><\/li>\r\n \t<li>Richard Samworth, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Eirini Spyropoulou, <em>University of Bristol<\/em><\/li>\r\n \t<li>Maxim Sviridenko, <em>University of Warwick<\/em><\/li>\r\n \t<li>Don Syme<em>, Microsoft Research<\/em><\/li>\r\n \t<li>Srikanta Tirthapura, <em>Iowa State University<\/em><\/li>\r\n \t<li>Andy Twigg, <em>University of Oxford<\/em><\/li>\r\n \t<li>Sergei Vassilvitskii, <em>Google<\/em><\/li>\r\n \t<li>Milan Vojnovic, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Gerhard Weikum<em>, Max Planck Institute for Informatics<\/em><\/li>\r\n \t<li>Ken Woodberry, <em>Microsoft Research<\/em><\/li>\r\n \t<li>Ke Yi, <em>Hong Kong University of Science and Technology<\/em><\/li>\r\n \t<li>Eiko Yoneki, <em>University of Cambridge<\/em><\/li>\r\n \t<li>Reynold Xin, <em>University of California, Berkeley<\/em><\/li>\r\n \t<li>Jingren Zhou, <em>Microsoft Corporation (Bing)<\/em><\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>"},{"id":5,"name":"Demos and Posters","content":"[accordion]\r\n\r\n[panel header=\"Big Metadata - Programming at Internet-Scale\"]\r\n<em>Don Syme, Kenji Takeda, and Keith Battocchi (Microsoft Research)<\/em>\r\n\r\nWe live in an information society. Programming languages are not designed for this. As we move into a world of \"Devices + Services\", it is vital that developers can productively integrate information at internet-scale into their everyday programming environment. Effectively exploring, navigating, understanding, analysing and presenting data \u2013 both big and broad \u2013 is a key to success for developers. Current experiences for this are cumbersome &amp; labour-intensive. In this new demo we explain the latest research behind F# Type Providers and show how they can bring Internet-Scale Metadata to the fingers of developers. IntelliSense for Internet-Scale Metadata creates a completely new way of rapidly developing applications on Azure, when using Hadoop, and when programming against the web. We explain how this can transform F# into a uniquely powerful and developer-friendly Data Science scripting language for Azure.\r\n[\/panel]\r\n\r\n[panel header=\"Incremental, Iterative and Interactive Computation in Naiad\r\n\"]\r\n<em>Frank McSherry, Derek Murray, Rebecca Isaacs, and Michael Isard (Microsoft Research)<\/em>\r\n\r\nWe are developing a system for scalable data analysis designed to support iterative queries over continuously changing inputs at interactive timescales. The system is based on a new computational framework, differential dataflow, that generalizes standard incremental dataflow for far greater reuse of previous results when collections change. Differential dataflow distinguishes between the multiple reasons a collection might change, including both loop feedback and new input data, allowing a system to re-use the most appropriate results from previously performed work when an incremental update arrives.\r\n\r\nOur prototype system, Naiad, demonstrates the practical application of differential dataflow. Like many systems, Naiad supports high-level declarative queries, data-parallel execution, and transparent distribution. Unlike other systems, Naiad efficiently executes queries with multiple (possibly nested) loops, while simultaneously responding with low latency to incremental changes to the inputs. We show how differential dataflow enables orders of magnitude speedups for a variety of complex workloads on real streaming data.\r\n[\/panel]\r\n\r\n[panel header=\"Dynamic Task Allocation in Asynchronous Shared Memory\"]\r\n<em>Dan Alistarh (MIT), James Aspnes (Yale University), Michael A. Bender (Stony Brook University), Rati Gelashvili (MIT), and Seth Gilbert (NUS)<\/em>\r\n\r\nTask allocation is a classic distributed problem in which a set of p potentially faulty processes must perform a set of m tasks. The problem is all the more challenging when there is heterogeneity, either on the workers' side, since individual agents may have varying speed and robustness, or because of uneven workloads. In large-scale systems, heterogeneity is the norm. Our work considers the dynamic version of task allocation, in which tasks are injected adversarially during an asynchronous execution. Intuitively, a major challenge in this setting is the fact that, at the same time, the adversary controls scheduling, process crashes, and the input.\r\n\r\nWe give a new shared-memory algorithm for dynamic task allocation which is within logarithmic factors of optimal. The main algorithmic idea is a randomized data structure called a dynamic to-do tree, which allows processes to pick new tasks to perform uniformly at random from the set of available tasks, and to insert tasks at uniform random locations in the data structure. We show that these properties avoid duplicating work unnecessarily in well-behaved executions, and that work duplication is inherent under certain input-schedule combinations. This is the first algorithm to allow efficient work sharing in this challenging setting, and the result has applications to other problems, such as producer-consumer buffers, and distributed renaming.\r\n[\/panel]\r\n\r\n[panel header=\"RASP: Large-Scale Graph Traversal with SSD Prefetching\"]\r\n<em>Eiko Yoneki (University of Cambridge), Karthik Nilakant (University of Cambridge), Valentin Dalibard (University of Cambridge), and Amitabha Roy (EPFL)<\/em>\r\n\r\nMining large graphs has now become an important aspect of multiple diverse applications and a number of computer systems have been proposed to efficiently execute graph algorithms. Recent interest in this area has led to the construction of single machine graph computation systems that use solid state drives (SSDs) to store the graph. This approach reduces the cost and simplifies the implementation of graph algorithms, making computations on large graphs available to the average user. However, SSDs are slower than main memory, and making full use of their bandwidth is crucial for executing graph algorithms in a reasonable amount of time. We present RASP (the (R)un(A)head (S)SD(P)refetcher) for graph algorithms that parallelises requests to derive maximum throughput from SSDs. RASP combines a judicious distribution of graph state between main memory and SSDs with an innovative run-ahead algorithm to prefetch needed data in parallel. This is in contrast to existing approaches that depend on multi-threading the graph algorithms to saturate available bandwidth. Our experiments on graph algorithms using random access\r\n\r\nshow that RASP not only is capable of maximising the throughput from SSDs but is also able to almost hide the effect of I\/O latency. The improvements in runtime for graph algorithms is up to 14 X when compared to a single threaded baseline. When compared to sophisticated multi-threaded implementations, RASP performs up to 80% faster without the program complexity and the programmer effort needed for multithreaded graph algorithms.\r\n[\/panel]\r\n\r\n[panel header=\"Active Data Management for Distributed Graph Processing\"]\r\n<em>Karthik Nilakant (University of Cambridge) and Eiko Yoneki (University of Cambridge)<\/em>\r\n\r\nGraph processing systems are subject to two conflicting concerns: firstly, a large number of algorithms for processing graphs exist, most of which exhibit poor locality of accesses. Secondly, the increasing scale of data has resulted in a need to distribute the data across multiple machines, further affecting locality. Most existing systems tend to consider these factors separately. We propose schemes for pro-active or reactive active graph management. Pro-actively, one could instrument algorithmic code fragments to allow the processing engine to predict future behaviour of the graph algorithm. Producing \u201cbalanced cuts\u201d on a graph is computationally expensive, and infeasible for large datasets. Instead, lightweight graph analytics could be deployed to re-arrange the graph into less tightly coupled clusters. We propose a framework for considering these factors in unison. Ultimately, the choice of method to use in a given scenario will depend on the structure of the dataset and the processing algorithm. Our focus will be on identifying the minimum characteristics that must be gleaned from the code or data to be processed (or alternatively provided by the user), in order to boost the performance of such a system.\r\n[\/panel]\r\n\r\n[panel header=\"Infrastructure for real-time analytics on modern cluster networks\"]\r\n<em>Aleksandar Dragojevic, Dushyanth Narayanan, Orion Hodson, Miguel Castro (Microsoft Research)<\/em>\r\n\r\nTwo hardware trends present a great opportunity for building efficient infrastructure for real-time analytics: (1) servers today have tens to hundreds of gigabytes of RAM making it possible to keep large data sets in RAM of a moderately-sized cluster of servers and (2) modern cluster networks, such as InfiniBand and RoCE, support access to memory of remote servers with just a few micro-seconds of latency. By keeping the whole data set in main memory of a cluster and using direct remote memory access (RDMA) primitives of the network it is possible to support tens to hundreds of millions of point lookups in a terabyte-scale key-value store on a cluster of just several tens of servers.\r\n\r\nWe present Farm, a platform for building infrastructure for supporting in-memory applications that efficiently use modern cluster networks. Farm exposes the memory of all servers in a cluster as a single, fault-tolerant shared memory; it supports atomic reads and writes of small user-defined objects, efficient short transactions for easier synchronization, and exposes the location of objects to support locality-aware optimizations. We describe how to use Farm to build efficient and scalable B-trees and hashtables, which can be used as key-value stores or building blocks for more complex systems.\r\n[\/panel]\r\n\r\n[panel header=\"Balanced Edge-Partitioning of Massive Scale Graphs\"]\r\n<em>Florian Bourse (Ecole Normale Sup\u00e9rieure, Paris), Marc Lelarge (INRIA- Ecole Normale Sup\u00e9rieure, Paris), and Milan Vojnovic (Microsoft Research)<\/em>\r\n\r\nMany computation tasks are performed on large scale graph data which may amount to as many as billions or even trillions of vertices or edges (e.g. online social network graphs, knowledge graphs, and web graphs). A standard way to scale up such computations is to partition an input graph into clusters of balanced sizes and small cut costs. Graph partitioning is of interest for a wide range of systems including iterative computation platforms, distributed graph databases, and stream processing platforms. Besides its practical importance, graph partitioning is one of the most fundamental and challenging theoretical computer science problems.\r\n\r\nIn our work we consider novel graph partitioning problems that arise under some computation models of interest -- which specify how the size of a cluster and the cost of a cut are defined. We devote special focus to online assignment algorithms that require a single pass through vertices or edges. Our preliminary performance evaluation results demonstrate significant benefits of using graph partitioning algorithms that are specifically tailored for given computation model.\r\n\r\nThis is a collaboration project with MSR-INRIA joint research centre.\r\n[\/panel]\r\n\r\n[panel header=\"Mining Interesting Patterns in Multi-Relational Data\"]\r\n<em>Eirini Spyropoulou (University of Bristol) and Tijl De Bie (University of Bristol)<\/em>\r\n\r\nResearch on local pattern mining algorithms has focused for years on mining one table of transactions and items (market basket data). However, a large amount of real-world datasets are multi-relational, i.e., they contain more than two entities and more than one relationships. There is therefore increasing interest in developing techniques for mining such complex types of data. Defining a multi-relational pattern syntax that is suitably expressive and finding an efficient mining algorithm, is a challenging problem. Furthermore, since local pattern mining methods usually suffer from the combinatorial explosion of patterns in the output, assessing the quality of patterns is also very important in order to make them useful to the end user.\r\n\r\nIn this work we introduce a novel approach for mining patterns in multi-relational data. We propose the pattern syntax of Complete Connected Subsets, a new syntax for multi-relational patterns. While this pattern syntax is generally applicable to multi-relational data it reduces to well known itemsets when the data is a simple transaction table. We also introduce RMiner, a simple yet practically efficient algorithm to mine such patterns. We show how the interestingness of patterns of the proposed syntax can be conveniently quantified using a framework for quantifying the subjective interestingness of patterns according to the user's prior knowledge. Finally, we present results on real world data sets.\r\n[\/panel]\r\n\r\n[\/accordion]"}],"msr_startdate":"2013-05-23","msr_enddate":"2013-05-24","msr_event_time":"","msr_location":"Microsoft Research, Cambridge, U.K.","msr_event_link":"","msr_event_recording_link":"","msr_startdate_formatted":"May 23, 2013","msr_register_text":"Watch now","msr_cta_link":"","msr_cta_text":"","msr_cta_bi_name":"","featured_image_thumbnail":null,"event_excerpt":"We witness a rapid development of the research and technology for efficient processing of big data. There is a surge of commercial and open source platforms for big data analytics, including platforms for querying of massive datasets, batch processing, real-time analytics, streaming computations, iterative computations, graph data processing, and distributed machine learning. There have been some remarkable achievements on the side of designing scalable and efficient algorithms for processing of massive amounts of data, as&hellip;","msr_research_lab":[],"related-researchers":[],"msr_impact_theme":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-opportunities":[],"related-publications":[],"related-videos":[],"related-posts":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/283451","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-event"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/283451\/revisions"}],"predecessor-version":[{"id":1147279,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/283451\/revisions\/1147279"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=283451"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=283451"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=283451"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=283451"},{"taxonomy":"msr-video-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video-type?post=283451"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=283451"},{"taxonomy":"msr-program-audience","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-program-audience?post=283451"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=283451"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=283451"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}