Analysis and Enhancement of Wikification for Microblogs with Context Expansion

Taylor Cassidy, Heng Ji, Lev-Arie Ratinov, Arkaitz Zubiaga, Hongzhao Huang

COLING. 2012.

Disambiguation to Wikipedia (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia entries. Most previous work has focused on linking terms in formal texts (e.g. newswire) to Wikipedia. Linking terms in short informal texts (e.g. tweets) is difficult for systems and humans alike as they lack a rich disambiguation context. A critical evaluation of an existing twitter dataset, as well as the D2W task in general, provides intuition that tweet context expansion based on both authorship and TextRank based clustering may enhance the disambiguation context and improve D2W results. Experiments using a state-of-the-art D2W system support this claim.

Download PDF file