Information retrieval and access have become central technologies for managing and leveraging the ongoing explosion of digital content. While effective, current techniques for designing retrieval models are limited by two issues. First, they have restricted representational power, and generally deal with simple settings that estimate the quality of individual results independently of other results. Second, existing methodologies for designing retrieval functions are labor intensive and cannot be efficiently applied to accommodate a growing variety of retrieval domains.
In this talk, I will describe two learning approaches for designing new retrieval models. The first is a structured prediction approach, which considers inter-dependencies between results in order to optimize for more sophisticated objectives such as information diversity. The second is an interactive learning approach, which reduces the efficiency bottleneck of relying on human experts by leveraging data gathered from online user interactions; such data is both cheap to collect as well as naturally representative of user utilities in the target domain.
This is joint work with Thorsten Joachims.