{"id":2123,"date":"2012-01-25T16:30:00","date_gmt":"2012-01-25T16:30:00","guid":{"rendered":"https:\/\/blogs.msdn.microsoft.com\/msr_er\/2012\/01\/25\/managing-the-scientific-data-explosion-a-response-to-the-ostp-digital-data-rfi\/"},"modified":"2016-07-20T07:33:11","modified_gmt":"2016-07-20T14:33:11","slug":"managing-the-scientific-data-explosion-a-response-to-the-ostp-digital-data-rfi","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/managing-the-scientific-data-explosion-a-response-to-the-ostp-digital-data-rfi\/","title":{"rendered":"Managing the Scientific Data Explosion: a Response to the OSTP Digital Data RFI"},"content":{"rendered":"<p><span style=\"font-family: verdana,geneva; font-size: medium;\">Scientists can agree that there&rsquo;s a lot of data out there, and that we could be using it more efficiently. Now the White House has asked for input on how to do just that.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/msdnshared.blob.core.windows.net\/media\/MSDNBlogsFS\/prod.evol.blogs.msdn.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/01\/32\/81\/4426.ostp.png\" original-url=\"http:\/\/blogs.msdn.com\/cfs-file.ashx\/__key\/communityserver-blogs-components-weblogfiles\/00-00-01-32-81\/4426.ostp.png\"><img decoding=\"async\" style=\"border: 0px currentColor; margin-right: 10px; margin-left: 10px; float: left;\" title=\"Data intensive research\" alt=\"Data intensive research\" src=\"https:\/\/msdnshared.blob.core.windows.net\/media\/MSDNBlogsFS\/prod.evol.blogs.msdn.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/01\/32\/81\/4426.ostp.png\" original-url=\"http:\/\/blogs.msdn.com\/resized-image.ashx\/__size\/222x150\/__key\/communityserver-blogs-components-weblogfiles\/00-00-01-32-81\/4426.ostp.png\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a>Data from scientific research is important to a diverse array of user communities from researchers, governments, and companies to wildlife managers, transportation managers, hospitals, and teachers. As the quantity of data in individual and community collections grows, its potential value also increases but, unfortunately, so do the associated challenges of data access, privacy, storage, and archiving. These challenges are social, economic, and technical, and the solutions will require collaborative contributions from universities, federal agencies, companies, scientific societies, and other organizations.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">Effective approaches to realizing the benefits of scientific data are likely to require many elements, including:<\/span><\/p>\n<ul>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\">Providing incentives and rewards for sharing data<\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\">Creating and disseminating software tools and online services that enable users to find and analyze data of interest<\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\">Developing and using standard metadata schemas, well-documented data formats, and access protocols to enable data re-use and cross-domain fusing of data<\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\">Facilitating systems by which funding agencies and users can contribute to the costs of data storage, sharing, and analysis<\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\">Developing systems and metrics to determine when and how data is worth preserving and sharing<\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">Microsoft believes that these are challenges worth tackling, and that coordinated efforts are urgently needed to advance our ability to curate, preserve, and use digital scientific data to maximize the societal and economic impact of research. Therefore, on January 12, 2012, Microsoft submitted our input in response to the White House Office of Science and Technology Policy (OSTP) request for information (RFI) on <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.federalregister.gov\/articles\/2011\/11\/04\/2011-28621\/request-for-information-public-access-to-digital-data-resulting-from-federally-funded-scientific\" target=\"_blank\">Public Access to Digital Data Resulting From Federally Funded Scientific Research<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">The <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/UM\/redmond\/about\/collaboration\/ERblog\/Microsoft_Response_to_OSTP_RFI_on_Digital_Data_01-12-2012.pdf\" target=\"_blank\">Microsoft response<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> emphasizes two areas: Economic Models and Software Tools and Online Services. We discuss that nations, to facilitate research and realize societal benefits of that research, should create environments in which innovation can occur around the critical elements that enable data sharing, retention, and use, and the costs should be shared among the various groups that receive benefits from the data and associated discoveries. In some cases, dissemination and use of specific data sets are necessary to meet high priority scientific, policy, economic, or societal goals, and thus should be supported by relevant government agencies. In other cases, there are opportunities to create a tool or service infrastructure that enhances the value of data and allows the provider to monetize access at a level sufficient to cover the investment made in creating or maintaining the data archive. We emphasize that in determining which data to share and how, it is important to recognize that consumers of a particular data set may be outside of the research community that created it (for example, in another scientific field or at a commercial enterprise). These consumers should still help define the value of the data and drive the creation of tools to facilitate its cross-domain use. They must also share in paying for its maintenance costs. Overall, we stress the value that innovations in information technology, including emerging cloud services, can bring to facilitating data sharing and analysis and enabling collaborative, multi-disciplinary, and international science.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">While the Microsoft response to the OSTP RFI on access to digital scientific data focuses on a few specific areas, it builds on collaborative work already done by the research community and Federal agencies in this area. Experts from Microsoft participate regularly in and support such efforts. In particular, we remain committed to the conclusions of the National Science Foundation&rsquo;s Advisory Committee for Cyberinfrastructure&rsquo;s <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.nsf.gov\/od\/oci\/taskforces\/TaskForceReport_Data.pdf\" target=\"_blank\">Task Force on Data and Visualization<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and the Blue Ribbon <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/brtf.sdsc.edu\/\" target=\"_blank\">Task Force on Sustainable Digital Preservation and Access<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. We also agree with many of the challenges described and conclusions reached in the National Science Board&#8217;s draft <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.nsf.gov\/nsb\/news\/news_summ.jsp?cntn_id=122702\" target=\"_blank\">Data Policies Report<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> released on January 5, 2012.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">The above reports and activities focus on the policy side of realizing the value of scientific data. Microsoft is also working to create, demonstrate, and implement the technical side of these challenges. In the book <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/collaboration\/fourthparadigm\/\" target=\"_blank\">The Fourth Paradigm<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/em>, the authors identify a range of opportunities where access to data is fundamentally changing the way science is conducted. Microsoft, in partnership with the academic community, is working to put these ideas into practice. Examples include <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.worldwidetelescope.org\/Home.aspx\" target=\"_blank\">WorldWide Telescope<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>; the new earth-science data explorer, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/blogs.msdn.com\/b\/msr_er\/archive\/2011\/12\/06\/layerscape-for-earth-science-storytelling.aspx\" target=\"_blank\">Layerscape<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>; the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/blogs.msdn.com\/b\/see\/archive\/2011\/12\/01\/microsoft-at-cop17-working-together-to-organize-environmental-data.aspx\" target=\"_blank\">Eye on Earth<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> network for environmental maps; and data analytics tools such as <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/projects\/daytona\/default.aspx\" target=\"_blank\">Daytona<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/projects\/exceldatascope\/default.aspx\" target=\"_blank\">Excel DataScope<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\">&mdash;<em>Elizabeth Grossman, Technology Policy Group, Microsoft Corporation<\/em><\/span><\/p>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\"><strong>January 31, 2012, update:<\/strong> The White House Office of Science & Technology Policy (OSTP) has publicly posted all of the responses to the RFI. <\/span><\/p>\n<ul>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.whitehouse.gov\/administration\/eop\/ostp\/library\/digitaldata\" target=\"_blank\">Read the responses<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: verdana,geneva; font-size: medium;\"><strong>Learn More<\/strong><\/span><\/p>\n<ul>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"https:\/\/www.federalregister.gov\/articles\/2011\/11\/04\/2011-28621\/request-for-information-public-access-to-digital-data-resulting-from-federally-funded-scientific\" target=\"_blank\">OSTP RFI: Public Access to Digital Data Resulting From Federally Funded Scientific Research<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/UM\/redmond\/about\/collaboration\/ERblog\/Microsoft_Response_to_OSTP_RFI_on_Digital_Data_01-12-2012.pdf\" target=\"_blank\">Microsoft Corporation Response to OSTP RFI<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.nsf.gov\/od\/oci\/taskforces\/TaskForceReport_Data.pdf\" target=\"_blank\">NSF Task Force on Data and Visualization Final Report, March 2011<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/brtf.sdsc.edu\/\" target=\"_blank\">Blue Ribbon Task Force on Sustainable Digital Preservation and Access<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.nsf.gov\/nsb\/news\/news_summ.jsp?cntn_id=122702\" target=\"_blank\">National Science Board&#8217;s Data Policies Report<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/collaboration\/fourthparadigm\/\" target=\"_blank\">The Fourth Paradigm: Data-Intensive Scientific Discovery<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<li><span style=\"font-family: verdana,geneva; font-size: small;\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/collaboration\/focus\/education\/default.aspx\" target=\"_blank\">Education and Scholarly Communication at Microsoft Research Connections<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/span><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Scientists can agree that there&rsquo;s a lot of data out there, and that we could be using it more efficiently. Now the White House has asked for input on how to do just that. Data from scientific research is important to a diverse array of user communities from researchers, governments, and companies to wildlife managers, [&hellip;]<\/p>\n","protected":false},"author":32627,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[193598,195275,195299,195532,195542,193594,196439,196615,196718,197121,197260,197427,197755,187311],"research-area":[],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-2123","post","type-post","status-publish","format-standard","hentry","category-research-blog","tag-data","tag-data-intensive-science","tag-daytona","tag-excel-datascope","tag-eye-on-earth","tag-layerscape","tag-microsoft-research-connections","tag-national-science-foundation","tag-online-services","tag-scientific-data","tag-software-tools","tag-the-fourth-paradigm","tag-white-house-office-of-science-and-technology-policy","tag-worldwide-telescope","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"January 25, 2012","formattedExcerpt":"Scientists can agree that there&rsquo;s a lot of data out there, and that we could be using it more efficiently. Now the White House has asked for input on how to do just that. Data from scientific research is important to a diverse array of&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/2123","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/32627"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=2123"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/2123\/revisions"}],"predecessor-version":[{"id":262104,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/2123\/revisions\/262104"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=2123"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=2123"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=2123"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=2123"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=2123"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=2123"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=2123"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=2123"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=2123"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=2123"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=2123"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}