Semantic Query Routing and Distributed Top-k Query Processing in Peer-to-Peer Networks

Uložené v:
Podrobná bibliografia
Názov: Semantic Query Routing and Distributed Top-k Query Processing in Peer-to-Peer Networks
Autori: Ioannis Chrysakis, Dimitris Plexousakis
Prispievatelia: The Pennsylvania State University CiteSeerX Archives
Zdroj: http://www.ics.forth.gr/ftp/tech-reports/2006/2006.TR387_Distributed_Top-K_Query_Processing_P2P_Networks.pdf.
Rok vydania: 2006
Zbierka: CiteSeerX
Predmety: top-k queries, query processing, query routing, p2p networks, schema-based p2p systems} Technical areas, peer-to-peer systems, databases, semantic web
Popis: Requirements for widely distributed information systems supporting virtual organizations have given rise to a new category of peer-to-peer (p2p) systems called schema-based. In such systems each peer is a database management system in itself, exposing its own schema. In such a setting, a main objective is the efficient search across peer databases by processing each incoming query without overly consuming bandwidth. In this report, we adopt a super-peer-based architecture and suggest a query routing mechanism, upon which we propose a query processing technique for top-k queries. Top-k queries in the context of p2p systems give the opportunity to filter the results and to eliminate network traffic by choosing the k highest ranked results. We introduce HT-p2p and HT-p2p+, two extended versions of the Hybrid Threshold Algorithm adapted to our p2p scenario. For the evaluation of these algorithms we implemented a prototype system upon the JXTA platform. Extensive experiments with different data sets and parameters have shown promising results about the performance of the query processing strategy. ± Acknowledgement: Special thanks to Evi Dagalaki for her contribution to the evaluation process. 1
Druh dokumentu: text
Popis súboru: application/pdf
Jazyk: English
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.199
Dostupnosť: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.199
http://www.ics.forth.gr/ftp/tech-reports/2006/2006.TR387_Distributed_Top-K_Query_Processing_P2P_Networks.pdf
Rights: Metadata may be used without restrictions as long as the oai identifier remains attached to it.
Prístupové číslo: edsbas.34F2EBB1
Databáza: BASE
Popis
Abstrakt:Requirements for widely distributed information systems supporting virtual organizations have given rise to a new category of peer-to-peer (p2p) systems called schema-based. In such systems each peer is a database management system in itself, exposing its own schema. In such a setting, a main objective is the efficient search across peer databases by processing each incoming query without overly consuming bandwidth. In this report, we adopt a super-peer-based architecture and suggest a query routing mechanism, upon which we propose a query processing technique for top-k queries. Top-k queries in the context of p2p systems give the opportunity to filter the results and to eliminate network traffic by choosing the k highest ranked results. We introduce HT-p2p and HT-p2p+, two extended versions of the Hybrid Threshold Algorithm adapted to our p2p scenario. For the evaluation of these algorithms we implemented a prototype system upon the JXTA platform. Extensive experiments with different data sets and parameters have shown promising results about the performance of the query processing strategy. ± Acknowledgement: Special thanks to Evi Dagalaki for her contribution to the evaluation process. 1