Capítulo de livro Acesso aberto Revisado por pares

Distributed RDF Query Answering with Dynamic Data Exchange

2016; Springer Science+Business Media; Linguagem: Inglês

10.1007/978-3-319-46523-4_29

ISSN

1611-3349

Autores

Anthony Potter, Boris Motik, Yavor Nenov, Ian Horrocks,

Tópico(s)

Distributed and Parallel Computing Systems

Resumo

Evaluating joins over RDF data stored in a shared-nothing server cluster is key to processing truly large RDF datasets. To the best of our knowledge, the existing approaches use a variant of the data exchange operator that is inserted into the query plan statically (i.e., at query compile time) to shuffle data between servers. We argue that such approaches often miss opportunities for local computation, and we present a novel solution to distributed query answering that consists of two main components. First, we present a query answering algorithm based on dynamic data exchange, which exploits data locality to maximise the amount of computation on a single server. Second, we present a partitioning algorithm for RDF data based on graph partitioning whose aim is to increase data locality. We have implemented our approach in the RDFox system, and our performance evaluation suggests that our techniques outperform the state of the art by up to an order of magnitude in terms of query evaluation times, network communication, and memory use.

Referência(s)