{"id":352877,"date":"2017-01-15T21:00:49","date_gmt":"2017-01-16T05:00:49","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-event&#038;p=352877"},"modified":"2025-08-06T11:58:41","modified_gmt":"2025-08-06T18:58:41","slug":"interspeech-2017-special-session-speech-technologies-for-code-switching-in-multilingual-communities","status":"publish","type":"msr-event","link":"https:\/\/www.microsoft.com\/en-us\/research\/event\/interspeech-2017-special-session-speech-technologies-for-code-switching-in-multilingual-communities\/","title":{"rendered":"Interspeech 2017 Special Session: Speech Technologies for Code-Switching in Multilingual Communities"},"content":{"rendered":"\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www.interspeech2017.org\/\">Interspeech 2017<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> Special Session on<br \/>\n<strong>Speech Technologies for Code-Switching in Multilingual Communities<\/strong><\/p>\n<p><strong>Important Dates<\/strong>:<\/p>\n<p>Paper submission deadline: <strong>14 March 2017<\/strong><br \/>\nFinal PDF upload: 21 March 2017<br \/>\nPaper notification of acceptance: 22 May 2017<br \/>\nCamera-ready paper due: 5 June 2017<br \/>\nInterspeech 2017: 20-24 August 2017<\/p>\n<p><strong>Organizing Committee:<\/strong><br \/>\nKalika Bali, Microsoft Research India<br \/>\nAlan W Black, Carnegie Mellon University<br \/>\nMona Diab, George Washington University<br \/>\nJulia Hirschberg, Columbia University<br \/>\nSunayana Sitaram, Microsoft Research India<br \/>\nThamar Solorio, University of Houston<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>Speech technologies exist for many high resource languages, and attempts are being made to reach the next billion users by building resources and systems for many more languages. In the past, the main focus of the speech community has been in building monolingual systems that are capable of processing speech in a single language. Multilingual communities pose special challenges for the design and development of speech processing systems. One of these challenges is <strong>code-switching<\/strong>, which is the switching of two or more languages at the conversation, utterance and sometimes even word level.<\/p>\n<p>In addition to conversational speech, code-switching is now found in text in social media, instant messaging and blogs in multilingual communities. Monolingual natural language and speech systems fail when they encounter code-switched speech and text. There is also a lack of linguistic data and resources for code-switched speech and text, although one or more of the languages being mixed could be high-resource.<\/p>\n<p>Code-switching provides various interesting challenges to the speech community, such as language modeling for mixed languages, acoustic modeling of mixed language speech, pronunciation modeling and language identification from speech. The special session will include oral presentations and a panel discussion. Please see the Special Session schedule tab for more details. We expect participants from academic and industry spanning a wide variety of language pairs and data sets. We also expect discussions on how to create speech and language resources for code-switching and sharing of data.<\/p>\n<p>Topics of interest for this special session will include but are not limited to: \uf0b7<br \/>\n<li>Speech Recognition of code-switched speech<\/li><br \/>\n<li>Language Modeling for code-switched speech<\/li><br \/>\n<li>Speech Synthesis of code-switched text<\/li><br \/>\n<li>Speech Translation of code-switched languages<\/li><br \/>\n<li>Spoken Dialogue Systems that can handle code-switching<\/li><br \/>\n<li>Speech data and resources for code-switching<\/li><br \/>\n<li>Language Identification from speech<\/li><\/p>\n<p>&nbsp;<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>The special session will be held on 21 August 2017, distributed over two slots. All 9 papers will be presented as oral presentations. In addition, we will also have a panel at the end of the second session in which we will discuss topics such as data and resources for code-switching research. More details about this panel discussion will be available shortly.<\/p>\n<table style=\"border-collapse: collapse;border-spacing: inherit\" width=\"100%\">\n<thead>\n<tr>\n<td style=\"padding: inherit;border: inherit\"><strong>Date<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Time<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Room<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Presentation type<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Paper code<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Paper ID<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Title<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Authors<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">11:20-11:40<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-1<\/td>\n<td style=\"padding: inherit;border: inherit\">301<\/td>\n<td style=\"padding: inherit;border: inherit\">Longitudinal Speaker Clustering and Verification Corpus with Code-Switching Frisian-Dutch Speech<\/td>\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk Van den Heuvel, David Van Leeuwen<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">11:40-12:00<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-2<\/td>\n<td style=\"padding: inherit;border: inherit\">391<\/td>\n<td style=\"padding: inherit;border: inherit\">Exploiting Untranscribed Broadcast Data for Improved Code-Switching Detection<\/td>\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Henk van den Heuvel, David van Leeuwen<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:00-12:20<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-3<\/td>\n<td style=\"padding: inherit;border: inherit\">1198<\/td>\n<td style=\"padding: inherit;border: inherit\">Jee haan, I\u2019d like both, por favor: Elicitation of a Code-Switched Corpus of Hindi-English and Spanish-English Human-Machine Dialog<\/td>\n<td style=\"padding: inherit;border: inherit\">Vikram Ramanarayanan, David Suendermann-Oeft<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:20-12:40<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-4<\/td>\n<td style=\"padding: inherit;border: inherit\">1244<\/td>\n<td style=\"padding: inherit;border: inherit\">On building mixed lingual speech synthesis systems<\/td>\n<td style=\"padding: inherit;border: inherit\">SaiKrishna Rallabandi, Alan W Black<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:40-13:00<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-5<\/td>\n<td style=\"padding: inherit;border: inherit\">1259<\/td>\n<td style=\"padding: inherit;border: inherit\">Speech Synthesis for Mixed-Language Navigation Instructions<\/td>\n<td style=\"padding: inherit;border: inherit\">Khyathi Chandu, Sai Krishna Rallabandi, Sunayana Sitaram, Alan W Black<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">14:30-14:50<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-6<\/td>\n<td style=\"padding: inherit;border: inherit\">1373<\/td>\n<td style=\"padding: inherit;border: inherit\">Addressing Code-Switching in French\/Algerian Arabic Speech<\/td>\n<td style=\"padding: inherit;border: inherit\">Amazouz Djegdjiga, Martine Adda-Decker, Lori Lamel<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">14:50-15:10<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-7<\/td>\n<td style=\"padding: inherit;border: inherit\">1429<\/td>\n<td style=\"padding: inherit;border: inherit\">Metrics for modeling code-switching across corpora<\/td>\n<td style=\"padding: inherit;border: inherit\">Wally Guzman, Joseph Ricard, Jacqueline Serigos, Barbara Bullock, Almeida Jacqueline Toribio<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">15:10-15:30<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-8<\/td>\n<td style=\"padding: inherit;border: inherit\">1437<\/td>\n<td style=\"padding: inherit;border: inherit\">Synthesising isiZulu-English code-switch bigrams using word embeddings<\/td>\n<td style=\"padding: inherit;border: inherit\">Ewald Van der westhuizen, Thomas Niesler<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">15:30-15:50<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-9<\/td>\n<td style=\"padding: inherit;border: inherit\">1663<\/td>\n<td style=\"padding: inherit;border: inherit\">Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching<\/td>\n<td style=\"padding: inherit;border: inherit\">Victor Soto, Julia Hirschberg<\/td>\n<\/tr>\n<\/thead>\n<\/table>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Interspeech 2017 (opens in new tab) Special Session on Speech Technologies for Code-Switching in Multilingual Communities Important Dates: Paper submission deadline: 14 March 2017 Final PDF upload: 21 March 2017 Paper notification of acceptance: 22 May 2017 Camera-ready paper due: 5 June 2017 Interspeech 2017: 20-24 August 2017 Organizing Committee: Kalika Bali, Microsoft Research India [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_startdate":"","msr_enddate":"","msr_location":"","msr_expirationdate":"","msr_event_recording_link":"","msr_event_link":"","msr_event_link_redirect":false,"msr_event_time":"","msr_hide_region":false,"msr_private_event":false,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[13545],"msr-region":[],"msr-event-type":[],"msr-video-type":[],"msr-locale":[268875],"msr-program-audience":[],"msr-post-option":[],"msr-impact-theme":[],"class_list":["post-352877","msr-event","type-msr-event","status-publish","hentry","msr-research-area-human-language-technologies","msr-locale-en_us"],"msr_about":"<!-- wp:msr\/event-details {\"title\":\"Interspeech 2017 Special Session: Speech Technologies for Code-Switching in Multilingual Communities\",\"backgroundColor\":\"grey\"} \/-->\n\n<!-- wp:msr\/content-tabs --><!-- wp:msr\/content-tab {\"title\":\"Special Session Description\"} --><!-- wp:freeform --><p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" target=\"_blank\" href=\"http:\/\/www.interspeech2017.org\/\">Interspeech 2017<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> Special Session on<br \/>\n<strong>Speech Technologies for Code-Switching in Multilingual Communities<\/strong><\/p>\n<p><strong>Important Dates<\/strong>:<\/p>\n<p>Paper submission deadline: <strong>14 March 2017<\/strong><br \/>\nFinal PDF upload: 21 March 2017<br \/>\nPaper notification of acceptance: 22 May 2017<br \/>\nCamera-ready paper due: 5 June 2017<br \/>\nInterspeech 2017: 20-24 August 2017<\/p>\n<p><strong>Organizing Committee:<\/strong><br \/>\nKalika Bali, Microsoft Research India<br \/>\nAlan W Black, Carnegie Mellon University<br \/>\nMona Diab, George Washington University<br \/>\nJulia Hirschberg, Columbia University<br \/>\nSunayana Sitaram, Microsoft Research India<br \/>\nThamar Solorio, University of Houston<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<p>Speech technologies exist for many high resource languages, and attempts are being made to reach the next billion users by building resources and systems for many more languages. In the past, the main focus of the speech community has been in building monolingual systems that are capable of processing speech in a single language. Multilingual communities pose special challenges for the design and development of speech processing systems. One of these challenges is <strong>code-switching<\/strong>, which is the switching of two or more languages at the conversation, utterance and sometimes even word level.<\/p>\n<p>In addition to conversational speech, code-switching is now found in text in social media, instant messaging and blogs in multilingual communities. Monolingual natural language and speech systems fail when they encounter code-switched speech and text. There is also a lack of linguistic data and resources for code-switched speech and text, although one or more of the languages being mixed could be high-resource.<\/p>\n<p>Code-switching provides various interesting challenges to the speech community, such as language modeling for mixed languages, acoustic modeling of mixed language speech, pronunciation modeling and language identification from speech. The special session will include oral presentations and a panel discussion. Please see the Special Session schedule tab for more details. We expect participants from academic and industry spanning a wide variety of language pairs and data sets. We also expect discussions on how to create speech and language resources for code-switching and sharing of data.<\/p>\n<p>Topics of interest for this special session will include but are not limited to: \uf0b7<br \/>\n&lt;li&gt;Speech Recognition of code-switched speech&lt;\/li&gt;<br \/>\n&lt;li&gt;Language Modeling for code-switched speech&lt;\/li&gt;<br \/>\n&lt;li&gt;Speech Synthesis of code-switched text&lt;\/li&gt;<br \/>\n&lt;li&gt;Speech Translation of code-switched languages&lt;\/li&gt;<br \/>\n&lt;li&gt;Spoken Dialogue Systems that can handle code-switching&lt;\/li&gt;<br \/>\n&lt;li&gt;Speech data and resources for code-switching&lt;\/li&gt;<br \/>\n&lt;li&gt;Language Identification from speech&lt;\/li&gt;<\/p>\n<p>&nbsp;<span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- wp:msr\/content-tab {\"title\":\"Special Session Schedule\"} --><!-- wp:freeform --><p>The special session will be held on 21 August 2017, distributed over two slots. All 9 papers will be presented as oral presentations. In addition, we will also have a panel at the end of the second session in which we will discuss topics such as data and resources for code-switching research. More details about this panel discussion will be available shortly.<\/p>\n<table style=\"border-collapse: collapse;border-spacing: inherit\" width=\"100%\">\n<thead>\n<tr>\n<td style=\"padding: inherit;border: inherit\"><strong>Date<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Time<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Room<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Presentation type<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Paper code<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Paper ID<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Title<\/strong><\/td>\n<td style=\"padding: inherit;border: inherit\"><strong>Authors<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">11:20-11:40<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-1<\/td>\n<td style=\"padding: inherit;border: inherit\">301<\/td>\n<td style=\"padding: inherit;border: inherit\">Longitudinal Speaker Clustering and Verification Corpus with Code-Switching Frisian-Dutch Speech<\/td>\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk Van den Heuvel, David Van Leeuwen<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">11:40-12:00<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-2<\/td>\n<td style=\"padding: inherit;border: inherit\">391<\/td>\n<td style=\"padding: inherit;border: inherit\">Exploiting Untranscribed Broadcast Data for Improved Code-Switching Detection<\/td>\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Henk van den Heuvel, David van Leeuwen<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:00-12:20<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-3<\/td>\n<td style=\"padding: inherit;border: inherit\">1198<\/td>\n<td style=\"padding: inherit;border: inherit\">Jee haan, I\u2019d like both, por favor: Elicitation of a Code-Switched Corpus of Hindi-English and Spanish-English Human-Machine Dialog<\/td>\n<td style=\"padding: inherit;border: inherit\">Vikram Ramanarayanan, David Suendermann-Oeft<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:20-12:40<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-4<\/td>\n<td style=\"padding: inherit;border: inherit\">1244<\/td>\n<td style=\"padding: inherit;border: inherit\">On building mixed lingual speech synthesis systems<\/td>\n<td style=\"padding: inherit;border: inherit\">SaiKrishna Rallabandi, Alan W Black<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">12:40-13:00<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-5<\/td>\n<td style=\"padding: inherit;border: inherit\">1259<\/td>\n<td style=\"padding: inherit;border: inherit\">Speech Synthesis for Mixed-Language Navigation Instructions<\/td>\n<td style=\"padding: inherit;border: inherit\">Khyathi Chandu, Sai Krishna Rallabandi, Sunayana Sitaram, Alan W Black<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">14:30-14:50<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-6<\/td>\n<td style=\"padding: inherit;border: inherit\">1373<\/td>\n<td style=\"padding: inherit;border: inherit\">Addressing Code-Switching in French\/Algerian Arabic Speech<\/td>\n<td style=\"padding: inherit;border: inherit\">Amazouz Djegdjiga, Martine Adda-Decker, Lori Lamel<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">14:50-15:10<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-7<\/td>\n<td style=\"padding: inherit;border: inherit\">1429<\/td>\n<td style=\"padding: inherit;border: inherit\">Metrics for modeling code-switching across corpora<\/td>\n<td style=\"padding: inherit;border: inherit\">Wally Guzman, Joseph Ricard, Jacqueline Serigos, Barbara Bullock, Almeida Jacqueline Toribio<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">15:10-15:30<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-8<\/td>\n<td style=\"padding: inherit;border: inherit\">1437<\/td>\n<td style=\"padding: inherit;border: inherit\">Synthesising isiZulu-English code-switch bigrams using word embeddings<\/td>\n<td style=\"padding: inherit;border: inherit\">Ewald Van der westhuizen, Thomas Niesler<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\n<td style=\"padding: inherit;border: inherit\">15:30-15:50<\/td>\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-9<\/td>\n<td style=\"padding: inherit;border: inherit\">1663<\/td>\n<td style=\"padding: inherit;border: inherit\">Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching<\/td>\n<td style=\"padding: inherit;border: inherit\">Victor Soto, Julia Hirschberg<\/td>\n<\/tr>\n<\/thead>\n<\/table>\n<p><span id=\"label-external-link\" class=\"sr-only\" aria-hidden=\"true\">Opens in a new tab<\/span><\/p>\n<!-- \/wp:freeform --><!-- \/wp:msr\/content-tab --><!-- \/wp:msr\/content-tabs -->","tab-content":[{"id":0,"name":"Special Session Description","content":"Speech technologies exist for many high resource languages, and attempts are being made to reach the next billion users by building resources and systems for many more languages. In the past, the main focus of the speech community has been in building monolingual systems that are capable of processing speech in a single language. Multilingual communities pose special challenges for the design and development of speech processing systems. One of these challenges is <strong>code-switching<\/strong>, which is the switching of two or more languages at the conversation, utterance and sometimes even word level.\r\n\r\nIn addition to conversational speech, code-switching is now found in text in social media, instant messaging and blogs in multilingual communities. Monolingual natural language and speech systems fail when they encounter code-switched speech and text. There is also a lack of linguistic data and resources for code-switched speech and text, although one or more of the languages being mixed could be high-resource.\r\n\r\nCode-switching provides various interesting challenges to the speech community, such as language modeling for mixed languages, acoustic modeling of mixed language speech, pronunciation modeling and language identification from speech. The special session will include oral presentations and a panel discussion. Please see the Special Session schedule tab for more details. We expect participants from academic and industry spanning a wide variety of language pairs and data sets. We also expect discussions on how to create speech and language resources for code-switching and sharing of data.\r\n\r\nTopics of interest for this special session will include but are not limited to: \uf0b7\r\n&lt;li&gt;Speech Recognition of code-switched speech&lt;\/li&gt;\r\n&lt;li&gt;Language Modeling for code-switched speech&lt;\/li&gt;\r\n&lt;li&gt;Speech Synthesis of code-switched text&lt;\/li&gt;\r\n&lt;li&gt;Speech Translation of code-switched languages&lt;\/li&gt;\r\n&lt;li&gt;Spoken Dialogue Systems that can handle code-switching&lt;\/li&gt;\r\n&lt;li&gt;Speech data and resources for code-switching&lt;\/li&gt;\r\n&lt;li&gt;Language Identification from speech&lt;\/li&gt;\r\n\r\n&nbsp;"},{"id":1,"name":"Special Session Schedule","content":"The special session will be held on 21 August 2017, distributed over two slots. All 9 papers will be presented as oral presentations. In addition, we will also have a panel at the end of the second session in which we will discuss topics such as data and resources for code-switching research. More details about this panel discussion will be available shortly.\r\n<table style=\"border-collapse: collapse;border-spacing: inherit\" width=\"100%\">\r\n<thead>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Date<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Time<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Room<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Presentation type<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Paper code<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Paper ID<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Title<\/strong><\/td>\r\n<td style=\"padding: inherit;border: inherit\"><strong>Authors<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">11:20-11:40<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-1<\/td>\r\n<td style=\"padding: inherit;border: inherit\">301<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Longitudinal Speaker Clustering and Verification Corpus with Code-Switching Frisian-Dutch Speech<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk Van den Heuvel, David Van Leeuwen<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">11:40-12:00<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-2<\/td>\r\n<td style=\"padding: inherit;border: inherit\">391<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Exploiting Untranscribed Broadcast Data for Improved Code-Switching Detection<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Emre Yilmaz, Henk van den Heuvel, David van Leeuwen<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">12:00-12:20<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-3<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1198<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Jee haan, I\u2019d like both, por favor: Elicitation of a Code-Switched Corpus of Hindi-English and Spanish-English Human-Machine Dialog<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Vikram Ramanarayanan, David Suendermann-Oeft<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">12:20-12:40<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-4<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1244<\/td>\r\n<td style=\"padding: inherit;border: inherit\">On building mixed lingual speech synthesis systems<\/td>\r\n<td style=\"padding: inherit;border: inherit\">SaiKrishna Rallabandi, Alan W Black<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">12:40-13:00<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-5<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1259<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Speech Synthesis for Mixed-Language Navigation Instructions<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Khyathi Chandu, Sai Krishna Rallabandi, Sunayana Sitaram, Alan W Black<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">14:30-14:50<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-6<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1373<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Addressing Code-Switching in French\/Algerian Arabic Speech<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Amazouz Djegdjiga, Martine Adda-Decker, Lori Lamel<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">14:50-15:10<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-7<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1429<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Metrics for modeling code-switching across corpora<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Wally Guzman, Joseph Ricard, Jacqueline Serigos, Barbara Bullock, Almeida Jacqueline Toribio<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">15:10-15:30<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-8<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1437<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Synthesising isiZulu-English code-switch bigrams using word embeddings<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Ewald Van der westhuizen, Thomas Niesler<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"padding: inherit;border: inherit\">2017-08-21<\/td>\r\n<td style=\"padding: inherit;border: inherit\">15:30-15:50<\/td>\r\n<td style=\"padding: inherit;border: inherit\">F11<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Oral<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Mon-SS-1-11-9<\/td>\r\n<td style=\"padding: inherit;border: inherit\">1663<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching<\/td>\r\n<td style=\"padding: inherit;border: inherit\">Victor Soto, Julia Hirschberg<\/td>\r\n<\/tr>\r\n<\/thead>\r\n<\/table>"}],"msr_startdate":"","msr_enddate":"","msr_event_time":"","msr_location":"","msr_event_link":"","msr_event_recording_link":"","msr_startdate_formatted":"","msr_register_text":"Register now","msr_cta_link":"","msr_cta_text":"","msr_cta_bi_name":"","featured_image_thumbnail":null,"event_excerpt":"Speech technologies exist for many high resource languages, and attempts are being made to reach the next billion users by building resources and systems for many more languages. In the past, the main focus of the speech community has been in building monolingual systems that are capable of processing speech in a single language. Multilingual communities pose special challenges for the design and development of speech processing systems. One of these challenges is code-switching, which&hellip;","msr_research_lab":[199562],"related-researchers":[{"type":"user_nicename","display_name":"Kalika Bali","user_id":32477,"people_section":"Group 1","alias":"kalikab"}],"msr_impact_theme":[],"related-academic-programs":[],"related-groups":[144940],"related-projects":[],"related-opportunities":[],"related-publications":[],"related-videos":[],"related-posts":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/352877","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-event"}],"version-history":[{"count":3,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/352877\/revisions"}],"predecessor-version":[{"id":1147197,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event\/352877\/revisions\/1147197"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=352877"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=352877"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=352877"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=352877"},{"taxonomy":"msr-video-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video-type?post=352877"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=352877"},{"taxonomy":"msr-program-audience","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-program-audience?post=352877"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=352877"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=352877"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}