Events

Filter by:

Limit to events where the title matches:
Limit to events where the first date of the event:
Date range
Limit to events where the first date of the event:
Limit to events where the type is one or more of:
Limit to events tagged with one or more of:
Limit to events where the audience is one or more of:
Tuesday, November 12, 2013 12:00 am - Wednesday, November 13, 2013 12:00 am EST (GMT -05:00)

DIVA Workshop

DIVA Workshop

Wednesday, January 29, 2014 12:00 am - 12:00 am EST (GMT -05:00)

Invited Talk

Invited Talk

Wednesday, February 26, 2014 11:00 am - 11:00 am EST (GMT -05:00)

Invited Talk: Modeling Term Associations for Probabilistic Information Retrieval

Modeling Term Associations for Probabilistic Information Retrieval

Abstract:

Traditionally, in many probabilistic retrieval models, query terms are assumed to be independent. Although such models can achieve reasonably good performance, associations can exist among terms from human being.s point of view. There are some recent studies that investigate how to model term associations/dependencies by proximity measures. However, the modeling of term associations theoretically under the probabilistic retrieval framework is still largely unexplored. In this talk, I will introduce a new concept named Cross Term, to model term proximity, with the aim of boosting retrieval performance. With Cross Terms, the association of multiple query terms can be modeled in the same way as a simple unigram term. In particular, an occurrence of a query term is assumed to have an impact on its neighboring text. The degree of the query term impact gradually weakens with increasing distance from the place of occurrence. We use shape functions to characterize such impacts. Based on this assumption, we first propose a bigram CRoss TErm Retrieval (CRTER2) model as the basis model, and then recursively propose a generalized n-gram CRoss TErm Retrieval (CRTERn) model for n query terms where n > 2.
Specifically, a bigram Cross Term occurs when the corresponding query terms appear close to each other, and its impact can be modeled by the intersection of the respective shape functions of the query terms. For n-gram Cross Term, we develop several distance metrics with different properties and employ them in the proposed models for ranking. We also show how to extend the language model using the newly proposed cross terms. Extensive experiments on a number of TREC collections demonstrate the effectiveness of our proposed models.

Biography:

Jimmy Huang is a Professor & Director at the School of Information Technology and the founding director of Information Retrieval & Knowledge Management Research Lab at the York University.