The Faculty of Informatics is pleased to announce a seminar given by Christina Lioma
TITLE: Graph Based Term Weights for Information Retrieval: Text as a Network of Words
SPEAKER: Christina Lioma, Katholieke Universiteit Leuven, Belgium
DATE: Friday, April 3rd, 2009
PLACE: USI Università della Svizzera italiana, room SI-008, Informatics building (Via G. Buffi 13)
The task of an Information Retrieval (IR) system is to retrieve documents from a large repository of data, which are relevant to a user query. This task is addressed by matching documents to queries on a term basis, and by ranking the documents accordingly. A core component of this process is the use of "term weights", which are weights representing how much a term contributes to the meaning of the text where it occurs.
Typical term weights are computed using lexical frequency statistics, i.e. word counts. This talk will present a different type of term weights, namely "graph based term weights". The computation of such weights involves modeling text as a graph, where vertices denote terms, and edges denote co-occurrence and grammatical relations between terms.
Modeling text as a graph is an interesting alternative to modeling text as a bag of words, and allows to compute term weights that contain statistical or linguistic relations as an integral part of their computation. Experimental evaluation confirms the usability of graph based term weights for IR systems.
Christina Lioma is Postdoctoral Fellow in Language Intelligence & Information Retrieval (LIIR) research group of the Katholieke Universiteit Leuven, Belgium. She joined LIIR in December 2007, after completing her PhD thesis at the University of Glasgow Information Retrieval group. For more information please visit: http://www.cs.kuleuven.be/~christin/
HOST: Prof. Fabio Crestani
URL 1: http://www.inf.unisi.ch