Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets

With the growing popularity of research in Temporal Information Retrieval (T-IR), a large amount of temporal data is ready to be exploited. The ability to exploit this information can be potentially useful for several tasks. For example, when querying "Football World Cup Germany", it would...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Ročník 1; s. 1 - 8
Hlavní autoři: Campos, Ricardo, Jorge, Alipio Mario, Dias, Gael, Nunes, Celia
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.12.2012
Témata:
ISBN:9781467360579, 1467360570
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:With the growing popularity of research in Temporal Information Retrieval (T-IR), a large amount of temporal data is ready to be exploited. The ability to exploit this information can be potentially useful for several tasks. For example, when querying "Football World Cup Germany", it would be interesting to have two separate clusters {1974,2006} corresponding to each of the two temporal instances. However, clustering of search results by time is a non-trivial task that involves determining the most relevant dates associated to a query. In this paper, we propose a first approach to flat temporal clustering of search results. We rely on a second order co-occurrence similarity measure approach which first identifies top relevant dates. Documents are grouped at the year level, forming the temporal instances of the query. Experimental tests were performed using real-world text queries. We used several measures for evaluating the performance of the system and compared our approach with Carrot Web-snippet clustering engine. Both experiments were complemented with a user survey.
ISBN:9781467360579
1467360570
DOI:10.1109/WI-IAT.2012.158