{"id":601884,"date":"2019-08-05T17:47:32","date_gmt":"2019-08-06T00:47:32","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=601884"},"modified":"2019-08-13T10:42:55","modified_gmt":"2019-08-13T17:42:55","slug":"trustworthy-ai-through-user-interface-design","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/trustworthy-ai-through-user-interface-design\/","title":{"rendered":"Trustworthy AI through user interface design"},"content":{"rendered":"<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/yinavehb\/\">By Isaac Naveh-Benjamin, Software Engineer<\/a><\/p>\n<p><span data-contrast=\"auto\">The rise of artificial intelligence has led to a growing problem of\u00a0<\/span><i><span data-contrast=\"auto\">transparency<\/span><\/i><span data-contrast=\"auto\">. It is increasingly difficult for the layperson<\/span><span data-contrast=\"auto\">, or even for the developer of an AI system<\/span><span data-contrast=\"auto\">, to understand how\u00a0<\/span><span data-contrast=\"auto\">such<\/span><span data-contrast=\"auto\">\u00a0systems work. Sometimes, this\u00a0<\/span><span data-contrast=\"auto\">is\u00a0<\/span><span data-contrast=\"auto\">due to the use of proprietary algorithms \u2013 but more often, it\u2019s because modern AI systems\u00a0<\/span><span data-contrast=\"auto\">are<\/span><span data-contrast=\"auto\"> simply too complex <\/span><span data-contrast=\"auto\">for humans to\u00a0<\/span><span data-contrast=\"auto\">understand. Such systems are \u201cblack boxes\u201d whose functioning depends on millions of internal interdependent variables.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">As\u00a0<\/span><span data-contrast=\"auto\">AI<\/span><span data-contrast=\"auto\">\u00a0systems are deployed\u00a0<\/span><span data-contrast=\"auto\">in the real world, questions of\u00a0<\/span><span data-contrast=\"auto\">transparency and\u00a0<\/span><span data-contrast=\"auto\">accountability<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">have become increasingly important<\/span><span data-contrast=\"auto\">. With AI being used in applications like\u00a0<\/span><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/aitechreporter.com\/2019\/02\/14\/ai-used-in-criminal-sentencing\/\"><span data-contrast=\"none\">criminal sentencing<\/span><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><span data-contrast=\"auto\">\u00a0or\u00a0<\/span><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/www.digitaljournal.com\/business\/artificial-intelligence-will-say-whether-you-get-a-bank-loan\/article\/499367\"><span data-contrast=\"none\">loan approval<\/span><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><span data-contrast=\"auto\">, it\u2019s important\u00a0<\/span><span data-contrast=\"auto\">to<\/span><span data-contrast=\"auto\">\u00a0be able to answer questions like \u201c<\/span><span data-contrast=\"auto\">Why did input A result in output B<\/span><span data-contrast=\"auto\">?\u201d or \u201cWhat should I do differently to get a favorable outcome?\u201d<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">When it comes to\u00a0<\/span><span data-contrast=\"auto\">the\u00a0<\/span><span data-contrast=\"auto\">medical\u00a0<\/span><span data-contrast=\"auto\">domain<\/span><span data-contrast=\"auto\">, the stakes are\u00a0<\/span><span data-contrast=\"auto\">even higher<\/span><span data-contrast=\"auto\">. Doctors<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">can\u2019t afford to trust the recommendations of an AI system whose workings\u00a0<\/span><span data-contrast=\"auto\">they don\u2019t understand<\/span><span data-contrast=\"auto\">.\u00a0<\/span><span data-contrast=\"auto\">While they don&#8217;t<\/span><span data-contrast=\"auto\">\u00a0need to understand the\u00a0<\/span><span data-contrast=\"auto\">precise technical<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">details<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">of the algorithms, they do need<\/span><span data-contrast=\"auto\">\u00a0to have an intuitive\u00a0<\/span><span data-contrast=\"auto\">grasp<\/span><span data-contrast=\"auto\">\u00a0of\u00a0<\/span><span data-contrast=\"auto\">how the system works (<\/span><span data-contrast=\"auto\">so that they<\/span><span data-contrast=\"auto\">\u00a0can understand its limitations)<\/span><span data-contrast=\"auto\">. They also\u00a0<\/span><span data-contrast=\"auto\">need an \u201caudit trail\u201d that explains how a machine came to a specific conclusion.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">At EmpowerMD, a Microsoft incubation project, we\u2019re developing a<\/span><span data-contrast=\"auto\">n\u00a0<\/span><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/empowermd\/articles\/project-empowermd-ambient-intelligence-for-the-clinic-2\"><span data-contrast=\"none\">Intelligent Scribe Service<\/span><\/a><span data-contrast=\"auto\">\u00a0that\u00a0<\/span><span data-contrast=\"auto\">listens to doctor-patient interactions and automatically generates a medical note.<\/span><span data-contrast=\"auto\">\u00a0This note is then vetted for accuracy by the doctor, who can edit it as she sees fit. This design is\u00a0<\/span><span data-contrast=\"auto\">known as<\/span><i><span data-contrast=\"auto\">\u00a0human in the\u00a0<\/span><\/i><i><span data-contrast=\"auto\">loop<\/span><\/i><span data-contrast=\"auto\">, and<\/span><span data-contrast=\"auto\">\u00a0is one of the main ways of making AI systems more transparent. By incorporating human feedback into the process, we can be more confident in the\u00a0<\/span><span data-contrast=\"auto\">overall\u00a0<\/span><span data-contrast=\"auto\">integrity of the system.\u00a0<\/span><span data-contrast=\"auto\">(Note that it is also a legal imperative, since by law, medical notes must be reviewed for accuracy and signed by a doctor<\/span><span data-contrast=\"auto\">).<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">In designing the ISS\u2019s user interface, we added several additional features that help make it more transparent. Below, you can see a\u00a0<\/span><span data-contrast=\"auto\">screenshot<\/span><span data-contrast=\"auto\">\u00a0of the Intelligent Scribe, with\u00a0<\/span><span data-contrast=\"auto\">the\u00a0<\/span><span data-contrast=\"auto\">transcript of conversation on the right<\/span><span data-contrast=\"auto\">-hand side<\/span><span data-contrast=\"auto\">, and the\u00a0<\/span><span data-contrast=\"auto\">machine-<\/span><span data-contrast=\"auto\">generated note on the left.<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">Whenever<\/span><span data-contrast=\"auto\">\u00a0the user clicks any part of the medical note, we give a visual indication of\u00a0<\/span><span data-contrast=\"auto\">where in<\/span><span data-contrast=\"auto\">\u00a0the conversation it came<\/span><span data-contrast=\"auto\">\u00a0from by highlighting the corresponding parts of the transcript. This makes it easy for the user to determine the provenance of any part of the medical note.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span style=\"font-size: 0.888em;font-weight: 600\">[Click to enlarge]<\/span><\/p>\n<ul id='gallery-1' class='gallery galleryid-601884 gallery-columns-1 gallery-size-full stripped ms-row fixed-small'><li class=' xs-margin-bottom-sp1 s-margin-bottom-sp2'><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/align-final.png\" data-mfp-src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/align-final.png\" data-caption=\"\" class=\"gallery-item\"><img decoding=\"async\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/align-final.png\" alt=\"\" class=\"db full-width\" \/><\/a><\/li><br style=\"clear: both\" \/>\n\t\t<\/ul>\n\n<p><span data-contrast=\"auto\">Another feature of the ISS is the\u00a0<\/span><i><span data-contrast=\"auto\">layer view<\/span><\/i><span data-contrast=\"auto\">. This feature divides the medical note into\u00a0<\/span><i><span data-contrast=\"auto\">layers<\/span><\/i><span data-contrast=\"auto\">\u00a0based on where that information came from \u2013 whether from the electronic record (EHR), from the clinical conversation, or from the doctor<\/span><span data-contrast=\"auto\">.<\/span><span data-contrast=\"auto\">\u00a0<\/span><span data-contrast=\"auto\">This allows the user to tell, at a glance, which parts of the note are machine-generated, and which came from a human.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span style=\"font-size: 0.888em;font-weight: 600\">[Click to enlarge]<\/span><\/p>\n<ul id='gallery-2' class='gallery galleryid-601884 gallery-columns-1 gallery-size-full stripped ms-row fixed-small'><li class=' xs-margin-bottom-sp1 s-margin-bottom-sp2'><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/layers-final.png\" data-mfp-src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/layers-final.png\" data-caption=\"\" class=\"gallery-item\"><img decoding=\"async\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/08\/layers-final.png\" alt=\"\" class=\"db full-width\" \/><\/a><\/li><br style=\"clear: both\" \/>\n\t\t<\/ul>\n\n<p><span data-contrast=\"auto\">Finally, we use the typography of the note to give clues as to\u00a0<\/span><span data-contrast=\"auto\">the<\/span><span data-contrast=\"auto\">\u00a0origin\u00a0<\/span><span data-contrast=\"auto\">of specific phrases<\/span><span data-contrast=\"auto\">. For example, when a phrase is italicized,\u00a0<\/span><span data-contrast=\"auto\">that indicates that it is\u00a0<\/span><span data-contrast=\"auto\">a near-verbatim quote from the patient (we call this the \u201cpatient\u2019s voice<\/span><span data-contrast=\"auto\">\u201d<\/span><span data-contrast=\"auto\">). And when a section of the note appears grayed out, that indicates that it\u2019s a low-confidence suggestion from our ML engine, and that it requires the doctor\u2019s review.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">The EmpowerMD interface incorporates AI that&#8217;s transparent by design. <\/span><span data-contrast=\"auto\">In our mind,\u00a0<\/span><span data-contrast=\"auto\">that<\/span><span data-contrast=\"auto\">\u00a0has clear advantages over\u00a0<\/span><span data-contrast=\"auto\">other types of explanatory UI, which are often designed separately from, or as an adjunct to the systems they attempt to explain.\u00a0<\/span><span data-contrast=\"auto\">At EmpowerMD, we built <\/span><span data-contrast=\"auto\">transparency into the system right from the start.<\/span><span data-contrast=\"auto\">\u00a0It is integral to the user experience. This, in turn, allows doctors to have more confidence in the system\u2019s suggestions and recommendations.<\/span><span data-ccp-props=\"{\"201341983\":0,\"335559739\":160,\"335559740\":259}\">\u00a0<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>AI trustworthiness can be conveyed via effective user design.<\/p>\n","protected":false},"author":38410,"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":468111,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-601884","msr-blog-post","type-msr-blog-post","status-publish","hentry","msr-locale-en_us"],"msr_assoc_parent":{"id":468111,"type":"project"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/601884","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/38410"}],"version-history":[{"count":36,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/601884\/revisions"}],"predecessor-version":[{"id":602241,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/601884\/revisions\/602241"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=601884"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=601884"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=601884"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=601884"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}