Semantic Query Transformations for Increased Parallelization in Distributed Knowledge Graph Query Processing


Bibliographic details
Published in: SC19: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-14
Main authors: Kim, Hyeongsik; Bhattacharyya, Abhisha; Anyanwu, Kemafor
Format: Conference paper
Language: English
Published: ACM, 17 November 2019
ISSN:2167-4337
Description
Summary: Ontologies have become an increasingly popular semantic layer for integrating multiple heterogeneous datasets. However, significant challenges remain in supporting efficient and scalable processing of queries over data linked with ontologies (ontological queries). Ontological query processing requires that explicitly defined query patterns be expanded to capture implicit ones, based on the available ontology inference axioms. In practice, however, for example in the biomedical domain, the complexity of the ontological axioms results in very large query expansions that present-day query processing infrastructure cannot support. In particular, it remains unclear how to parallelize such queries effectively. In this paper, we propose data and query transformations that enable inter-operator parallelism of ontological queries on Hadoop platforms. Our transformation techniques exploit ontological axioms, second-order data types and operator rewritings to eliminate expensive query substructures for increased parallelizability. Comprehensive experiments conducted on benchmark datasets show up to 25X performance improvement over existing approaches.
DOI:10.1145/3295500.3356212
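
Note: The query expansion problem mentioned in the summary can be illustrated with a minimal sketch. It is not the paper's transformation technique; it only shows, under assumed names, how expanding type patterns over subclass axioms multiplies the number of query branches. The toy ontology `SUBCLASSES` and the helpers `descendants` and `expand_query` are hypothetical and do not come from the paper.

```python
from itertools import product

# Hypothetical toy ontology: class -> direct subclasses (subclass axioms).
SUBCLASSES = {
    "Drug": ["Antibiotic", "Analgesic"],
    "Antibiotic": ["Penicillin"],
    "Disease": ["Infection"],
}


def descendants(cls):
    """Return cls followed by every class that is transitively a subclass of it."""
    found = [cls]
    for child in SUBCLASSES.get(cls, []):
        found.extend(descendants(child))
    return found


def expand_query(type_patterns):
    """Expand each (variable, class) type pattern over the subclass hierarchy.

    Returns one conjunctive branch per combination of classes; a query
    engine would evaluate the branches as a union.
    """
    choices = [[(var, c) for c in descendants(cls)] for var, cls in type_patterns]
    return [list(branch) for branch in product(*choices)]


# Two type patterns: ?d is a Drug, ?x is a Disease.
branches = expand_query([("?d", "Drug"), ("?x", "Disease")])
print(len(branches))  # 4 drug classes x 2 disease classes = 8 branches
```

Even in this tiny example, two patterns become eight branches, because the branch count is the product of the hierarchy sizes under each queried class. With the deep hierarchies typical of biomedical ontologies, the same multiplication yields the very large expansions the summary describes.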