Statistical Semantic Interpretation Modeling for Spoken Language Understanding with Enriched Semantic Features

Asli Celikyilmaz; Dilek Hakkani-Tür; Gokhan Tur

Statistical Semantic Interpretation Modeling for Spoken Language Understanding with Enriched Semantic Features

Asli Celikyilmaz ,
Dilek Hakkani-Tür ,
Gokhan Tur

IEEE Workshop on Spoken Language Technologies | December 2012

Download BibTex

In natural language human-machine statistical dialog systems, semantic interpretation is a key task typically performed following semantic parsing, and aims to extract canonical meaning representations of semantic components. In the literature, usually manually built rules are used for this task, even for implicitly mentioned nonnamed semantic components (like genre of a movie or price range of a restaurant). In this study, we present statistical methods for modeling interpretation, which can also beneﬁt from semantic features extracted from large in-domain knowledge sources. We extract features from user utterances using a semantic parser and additional semantic features from textual sources (online reviews, synopses, etc.) using a novel tree clustering approach, to represent unstructured information that correspond to implicit semantic components related to targeted slots in the user’s utterances. We evaluate our models on a virtual personal assistance system and demonstrate that our interpreter is effective in that it does not only improve the utterance interpretation in spoken dialog systems (reducing the interpretation error rate by 36% relative compared to a language model baseline), but also unveils hidden semantic units that are otherwise nearly impossible to extract from purely manual lexical features that are typically used in utterance interpretation.