{"id":97,"date":"2012-03-08T09:00:00","date_gmt":"2012-03-08T09:00:00","guid":{"rendered":"https:\/\/blogs.technet.microsoft.com\/inside_microsoft_research\/2012\/03\/08\/teaching-computers-to-speak-in-tongues\/"},"modified":"2016-07-20T07:33:03","modified_gmt":"2016-07-20T14:33:03","slug":"teaching-computers-to-speak-in-tongues","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/teaching-computers-to-speak-in-tongues\/","title":{"rendered":"Teaching Computers to Speak in Tongues"},"content":{"rendered":"<p class=\"posted-by\">Posted by <span class=\"author\">Kevin Schofield<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>Perhaps&nbsp;it&rsquo;s just me, but I wince whenever my American phone or GPS tries to pronounce a French restaurant name or a Spanish street name, or the name of one of my non-American friends. While we&rsquo;ve made much progress in creating text-to-speech (TTS) systems with human-sounding voices in a comforting accent, they haven&rsquo;t fared well in our multilingual world.<\/p>\n<p>Frank Soong and his team from Microsoft Research Asia have been working to solve that problem by &ldquo;cross-training&rdquo; a text-to-speech system&nbsp; so that it can correctly pronounce words from multiple languages even if it was built from the voice samples of someone who only speaks one.<\/p>\n<p>Frank gave a demo of their TTS system during Rick Rashid&rsquo;s keynote speech at Techfest yesterday. You can see it in the video below.<\/p>\n<p><object data=\"data:application\/x-oleobject;base64,QfXq3+HzJEysrJnDBxUISgAJAAASIQAAbBkAAAwAAAB3AGgAaQB0AGUAAAAAAAAAAAAAAAAAAACMAAAAaAB0AHQAcAA6AC8ALwByAGUAcwBlAGEAcgBjAGgALgBtAGkAYwByAG8AcwBvAGYAdAAuAGMAbwBtAC8AYQBwAHAAcwAvAHYAaQBkAGUAbwAvAEMAbABpAGUAbgB0AEIAaQBuAC8ARQBtAGIAZQBkAGQAZQBkAFAAbABhAHkAZQByAC4AeABhAHAAAAA8AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAOgAAAGkAZAA9ADEANgAwADcAMgA1ACwAcwB0AGEAcgB0AD0ANwAxADkALABlAG4AZAA9ADEAMgA3ADAAAAAAAAAAAAAAAP\/\/AAABAAAAAAAAAAAAAAAAAAAAGAAAADMALgAwAC4ANAAwADgAMQA4AC4AMAAAAAoAAAB0AHIAdQBlAAAAAAAAAAAAAAAAAAAA\" width=\"320\" type=\"application\/x-silverlight-2\" height=\"246\"><param name=\"source\" value=\"http:\/\/research.microsoft.com\/apps\/video\/ClientBin\/EmbeddedPlayer.xap\" \/><param name=\"enableHtmlAccess\" value=\"true\" \/><param name=\"initParams\" value=\"id=160725,start=719,end=1270\" \/><param name=\"background\" value=\"white\" \/><param name=\"minRuntimeVersion\" value=\"3.0.40818.0\" \/><param name=\"autoUpgrade\" value=\"true\" \/><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" style=\"text-decoration: none;\" href=\"http:\/\/go.microsoft.com\/fwlink\/?LinkID=149156&v=3.0.40818.0\"><img decoding=\"async\" style=\"border-style: none;\" alt=\"Get Microsoft Silverlight\" src=\"http:\/\/go.microsoft.com\/fwlink\/?LinkId=108181\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/object><\/p>\n<p>&nbsp;<\/p>\n<p>This is another great example of how the next generation of technologies will continue to make interacting with computers more natural and help to more seamlessly blend together physical and virtual elements.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Posted by Kevin Schofield &nbsp; Perhaps&nbsp;it&rsquo;s just me, but I wince whenever my American phone or GPS tries to pronounce a French restaurant name or a Spanish street name, or the name of one of my non-American friends. While we&rsquo;ve made much progress in creating text-to-speech (TTS) systems with human-sounding voices in a comforting accent, [&hellip;]<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[196432,193514],"research-area":[],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-97","post","type-post","status-publish","format-standard","hentry","category-research-blog","tag-microsoft-research-asia","tag-techfest","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"March 8, 2012","formattedExcerpt":"Posted by Kevin Schofield &nbsp; Perhaps&nbsp;it&rsquo;s just me, but I wince whenever my American phone or GPS tries to pronounce a French restaurant name or a Spanish street name, or the name of one of my non-American friends. While we&rsquo;ve made much progress in creating&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/97","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=97"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/97\/revisions"}],"predecessor-version":[{"id":262050,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/97\/revisions\/262050"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=97"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=97"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=97"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=97"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=97"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=97"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=97"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=97"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=97"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=97"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=97"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}