Mining Concepts from Wikipedia for Ontology Construction

Bibliographic Details
Published in:	Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03, vol. 3, pp. 287-290
Main Authors:	Cui, Gaoying; Lu, Qin; Li, Wenjie; Chen, Yirong
Format:	Conference Paper
Language:	English
Published:	Washington, DC, USA: IEEE Computer Society, 15 September 2009; IEEE
Series:	ACM Conferences
Subjects:	Collaboration; Computing methodologies > Artificial intelligence > Knowledge representation and reasoning; Computing methodologies > Artificial intelligence > Knowledge representation and reasoning > Semantic networks; Computing methodologies > Artificial intelligence > Natural language processing; Computing methodologies > Machine learning > Machine learning approaches > Rule learning; Concept; Conferences; Information science; Information systems > Information retrieval; Information systems > Information retrieval > Evaluation of retrieval results; Information systems > Information systems applications > Data mining; Intelligent agent; Intelligent structures; Ontologies; Ontology Construction; Search engines; Statistics; Taxonomy; Wikipedia
ISBN:	0769538010, 9780769538013

Description
Summary:	An ontology is a structured knowledge base of concepts organized by the relations among them. In the corpora used for knowledge extraction, however, concepts are usually mixed with their instances. Concepts and their corresponding instances share similar features and are difficult to distinguish. In this paper, a novel approach is proposed to comprehensively obtain concepts with the help of definition sentences and Category Labels in Wikipedia pages. N-gram statistics and other NLP knowledge are used to help extract appropriate concepts. The proposed method identified nearly 50,000 concepts from about 700,000 Wiki pages. Its precision of 78.5% makes it an effective approach to mining concepts from Wikipedia for ontology construction.
DOI:	10.1109/WI-IAT.2009.284
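
Illustrative note: the summary mentions definition sentences and Category Labels as the two sources of evidence, but this record does not describe the actual rules. The sketch below is therefore not the authors' method. It assumes, purely for illustration, that a definition sentence looks like "X is a Y", that plural category labels are more likely to name concepts than instances, and that a label must recur on several pages to be kept. The names `mine_concepts`, `has_definition`, `is_plural_label`, and the `min_count` threshold are all hypothetical.

```python
import re
from collections import Counter

# Assumed shape of a definition sentence: "... is/are a/an/the ...".
DEFINITION = re.compile(r"\b(?:is|are)\s+(?:a|an|the)\b", re.IGNORECASE)


def has_definition(sentence):
    """Return True if the sentence looks like a definition (assumed pattern)."""
    return DEFINITION.search(sentence) is not None


def is_plural_label(label):
    """Crude plural test on the last word of a category label (assumption)."""
    last = label.split()[-1].lower()
    return last.endswith("s") and not last.endswith("ss")


def mine_concepts(pages, min_count=2):
    """Collect plural category labels that recur on pages with a definition.

    `pages` is a list of dicts with hypothetical keys
    'first_sentence' (str) and 'categories' (list of str).
    """
    counts = Counter()
    for page in pages:
        if not has_definition(page["first_sentence"]):
            continue  # skip pages without a definition-like first sentence
        for label in page["categories"]:
            if is_plural_label(label):
                counts[label.lower()] += 1
    return sorted(label for label, n in counts.items() if n >= min_count)


# Toy input, invented for this example only.
pages = [
    {"first_sentence": "The dog is a domesticated mammal.",
     "categories": ["Domesticated animals", "Mammals"]},
    {"first_sentence": "The cat is a small carnivorous mammal.",
     "categories": ["Domesticated animals", "Mammals"]},
    {"first_sentence": "Lassie is a fictional dog character.",
     "categories": ["Fictional dogs", "Film characters"]},
]

print(mine_concepts(pages))
```

On this toy input, only the labels shared by two pages pass the frequency threshold. A real pipeline would need proper linguistic analysis in place of the suffix test and the single regular expression.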