Entity Disambiguation with Freebase

Entity disambiguation with a knowledge base becomes increasingly popular in the NLP community. In this paper, we employ Freebase as the knowledge base, which contains significantly more entities than Wikipedia and others. While huge in size, Freebase lacks context for most entities, such as the desc...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Ročník 1; s. 82 - 89
Hlavní autoři:	Zheng, Zhicheng, Si, Xiance, Li, Fangtao, Chang, Edward Y., Zhu, Xiaoyan
Médium:	Konferenční příspěvek
Jazyk:	angličtina
Vydáno:	IEEE 01.12.2012
Témata:	Algorithms Companies Encyclopedias Freebase Hypertext systems Internet Knowledge based systems Name Entity Disambiguation Naturally Disambiguated Online services Taxonomy Type Taxonomy Vectors
ISBN:	9781467360579, 1467360570
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	Entity disambiguation with a knowledge base becomes increasingly popular in the NLP community. In this paper, we employ Freebase as the knowledge base, which contains significantly more entities than Wikipedia and others. While huge in size, Freebase lacks context for most entities, such as the descriptive text and hyperlinks in Wikipedia, which are useful for disambiguation. Instead, we leverage two features of Freebase, namely the naturally disambiguated mention phrases (aka aliases) and the rich taxonomy, to perform disambiguation in an iterative manner. Specifically, we explore both generative and discriminative models for each iteration. Experiments on 2, 430, 707 English sentences and 33, 743 Freebase entities show the effectiveness of the two features, where 90% accuracy can be reached without any labeled data. We also show that discriminative models with proposed split training strategy is robust against over fitting problem, and constantly outperforms the generative ones.
ISBN:	9781467360579 1467360570
DOI:	10.1109/WI-IAT.2012.26