Using Search Engines for Robust Cross-Domain Named Entity Recognition

Decanato - Facoltà di scienze informatiche

Data d'inizio: 2 Marzo 2011

Data di fine: 3 Marzo 2011

DATE: Wednesday, March 02nd, 2011
SPEAKER: Massimiliano Ciaramita, Google, Zurigo
PLACE: USI Università della Svizzera italiana, room A32, Red building (Via G. Buffi 13)
TIME: 14.30

We use search engine results to address a particularly difficult domain adaptation problem, the adaptation of named entity recognition (NER) from news text to web queries. The key novelty of the method is that we submit a token with context to a search engine and use similar contexts in the search results as additional context for correctly disambiguating each token. We achieve strong gains in NER performance in-domain and out-of-domain.

Massimiliano Ciaramita is a research scientist at Google Zurich. Previously he has worked as a researcher at Yahoo! Research and the Italian National Research Council. He did his undergradutate studies at the University of Rome "La Sapienza" and obtained ScM and PhD degrees from Brown University. His main research interests involve language uderstanding and its applications to search technologies. He has worked on a wide range of topics in natural language processing and information retrieval, including disambiguation,

acquisition, information extraction, syntactic and semantic parsing, query analysis, computational advertising and question answering

HOST: Prof. Fabio Crestani