{"id":858204,"date":"2022-07-07T06:32:00","date_gmt":"2022-07-07T13:32:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&#038;p=858204"},"modified":"2023-06-08T11:42:58","modified_gmt":"2023-06-08T18:42:58","slug":"reverse-homology","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/reverse-homology\/","title":{"rendered":"Reverse Homology"},"content":{"rendered":"<section class=\"mb-3 moray-highlight\">\n\t<div class=\"card-img-overlay mx-lg-0\">\n\t\t<div class=\"card-background  has-background- card-background--full-bleed\">\n\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"1400\" height=\"788\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3.jpg\" class=\"attachment-full size-full\" alt=\"reverse homology logo\" style=\"\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3.jpg 1400w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-300x169.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-1024x576.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-768x432.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-1066x600.jpg 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-655x368.jpg 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-343x193.jpg 343w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-240x135.jpg 240w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-640x360.jpg 640w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-960x540.jpg 960w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/reverse_homology_v3-1280x720.jpg 1280w\" sizes=\"auto, (max-width: 1400px) 100vw, 1400px\" \/>\t\t<\/div>\n\t\t<!-- Foreground -->\n\t\t<div class=\"card-foreground d-flex mt-md-n5 my-lg-5 px-g px-lg-0\">\n\t\t\t<!-- Container -->\n\t\t\t<div class=\"container d-flex mt-md-n5 my-lg-5 align-self-center\">\n\t\t\t\t<!-- Card wrapper -->\n\t\t\t\t<div class=\"w-100 w-lg-col-5\">\n\t\t\t\t\t<!-- Card -->\n\t\t\t\t\t<div class=\"card material-md-card py-5 px-md-5\">\n\t\t\t\t\t\t<div class=\"card-body \">\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n<h1 class=\"wp-block-heading\" id=\"reverse-homology\">Reverse Homology<\/h1>\n\n\n\n<h2 class=\"wp-block-heading is-style-s\" id=\"hypothesis-discovery-for-intrinsically-disordered-regions\">Hypothesis Discovery for Intrinsically Disordered Regions<\/h2>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n<h2 class=\"wp-block-heading is-style-m\" id=\"intrinsically-disordered-regions-play-critical-roles-in-protein-function\">Intrinsically disordered regions play critical roles in protein function<\/h2>\n\n\n\n<p>Intrinsically disordered regions (IDRs)&nbsp;are widespread in proteins, making up around 40% of the human proteome, and carry out critical functions in signaling, protein-protein interactions, phase separation, and more. Despite their <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0959440X19300041?via%3Dihub\" target=\"_blank\" rel=\"noopener noreferrer\">critical role in protein function and their widespread prevalence in the proteome<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, the systematic understanding of intrinsically disordered regions (IDRs) remains elusive. IDRs do not fold into a stable secondary or tertiary structure, enabling them to mediate functions distinct from structured regions. For example, some IDRs are essential to \u201chub\u201d proteins, as the lack of structure enables adaptation of conformation to different interaction partners. As research on IDRs grow, biologists have come to increasingly <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.cell.com\/cell\/fulltext\/S0092-8674(20)31622-6\" target=\"_blank\" rel=\"noopener noreferrer\">appreciate their role in human disease<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>: for example, mutations in IDRs can be implicated in neurological diseases or cancers.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-style-spectrum\"><img loading=\"lazy\" decoding=\"async\" width=\"710\" height=\"340\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/07\/1a5r_SUMO-1_protein.gif\" alt=\"SUMO-1 morphing based on NMR structure 1a5r. Ten alternative NMR models were morphed. Secondary structure elements: \u03b1-helices (red), \u03b2-strands (blue arrows).\" class=\"wp-image-859434\" \/><\/figure>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading is-style-m\" id=\"idrs-are-underserved-by-current-bioinformatics-resources\">IDRs are underserved by current bioinformatics resources<\/h2>\n\n\n\n<p>IDRs typically evolve rapidly and may have no detectable sequence homology between even closely related species. This creates methodological challenges for understanding IDR function computationally: while highly specific predictions of function can be produced for structured regions using universal resources that assume sequence conservation (like BLAST), these methods are not applicable to IDRs, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0022283621004290\" target=\"_blank\" rel=\"noopener noreferrer\">prompting calls for machine learning methods that can work for IDRs<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. One viable alternative strategy is that <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/elifesciences.org\/articles\/46883\" target=\"_blank\" rel=\"noopener noreferrer\">IDR functions can be predicted by identifying higher-order features of the sequences<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, which may be conserved over evolution even when the sequence is not.<\/p>\n\n\n\n<p>For example, charge and hydrophobicity are critical to mitochondrial targeting IDRs, as these features enable recognition by import mechanisms. However, these features are generally identified on an individual experimental basis, so our knowledge of features important to IDRs is likely not comprehensive. Computational methods that produce hypotheses about novel features (e.g. those in uncharacterized IDRs) are therefore valuable toward the goal of advancing a more universal understanding of IDRs.<\/p>\n\n\n\n<h2 class=\"wp-block-heading is-style-m\" id=\"the-first-systematic-hypothesis-discovery-resource-for-idrs-unbiased-by-prior-knowledge-of-function\">The first systematic hypothesis discovery resource for IDRs unbiased by prior knowledge of function<\/h2>\n\n\n\n<p>To help researchers understand what features in an IDR sequence are relevant to its function, we introduce the&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/journals.plos.org\/ploscompbiol\/article?id=10.1371\/journal.pcbi.1010238\" target=\"_blank\" rel=\"noopener noreferrer\">first systematic feature discovery method for IDRs<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, capable of uncovering features even without any prior knowledge or hypotheses of function. Our method, which we call \u201creverse homology\u201d, is powered by a self-supervised neural network, trained to predict if IDRs evolved from the same common ancestor or not (i.e. if they are homologues). This training task makes the neural network sensitive to evolutionarily conserved features, which tend to be important to function: by applying interpretation techniques, we can visualize the features that our neural network believes are likely to be conserved over evolution in an input sequence, directly generating hypotheses about residues and features that may be important to an IDR\u2019s function.<\/p>\n\n\n\n<div style=\"padding-bottom:32px; padding-top:32px\" class=\"wp-block-msr-immersive-section alignfull row has-background has-blue-background-color has-text-color has-white-color wp-block-msr-immersive-section\">\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wp-block-msr-immersive-section__wrapper\">\n\t\t\t<h2 class=\"wp-block-heading has-text-align-center is-style-s\" id=\"try-out-reverse-homology-on-your-idr-sequences-with-our-web-app\">Try out reverse homology on your IDR sequences with our web app!<\/h2>\n\n\n\n<div style=\"height:14px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-16018d1d wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button is-style-outline is-style-outline--1\"><a data-bi-type=\"button\" class=\"wp-block-button__link has-white-color has-text-color wp-element-button\" href=\"http:\/\/reversehomology.eastus.cloudapp.azure.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Try the demo<\/a><\/div>\n<\/div>\t\t<\/div>\n\t<\/div>\n\n\t<\/div>\n\n\n","protected":false},"excerpt":{"rendered":"<p>Intrinsically disordered regions (IDRs)&nbsp;are widespread in proteins, making up around 40% of the human proteome, and carry out critical functions in signaling, protein-protein interactions, phase separation, and more. Despite their critical role in protein function and their widespread prevalence in the proteome (opens in new tab), the systematic understanding of intrinsically disordered regions (IDRs) remains [&hellip;]<\/p>\n","protected":false},"featured_media":859425,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13556,13553],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-858204","msr-project","type-msr-project","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-research-area-medical-health-genomics","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"","related-publications":[859011],"related-downloads":[],"related-videos":[],"related-groups":[703342],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Alex Lu","user_id":41036,"people_section":"Section name 0","alias":"lualex"}],"msr_research_lab":[],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/858204","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":37,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/858204\/revisions"}],"predecessor-version":[{"id":947613,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/858204\/revisions\/947613"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/859425"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=858204"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=858204"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=858204"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=858204"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=858204"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}