Pontificia Universidad Católica de Chile Pontificia Universidad Católica de Chile
Arroyuelo D., Gomez-Brandon A., Hogan A., Navarro G., Rojas-Ledesma J. (2023)

Optimizing RPQs over a compact graph representation

Revista : VLDB JOURNAL
Tipo de publicación : ISI Ir a publicación

Abstract

We propose techniques to evaluate regular path queries (RPQs) over labeled graphs (e.g., RDF). We apply a bit-parallel simulation of a Glushkov automaton representing the query over a ring: a compact wavelet-tree-based index of the graph. To the best of our knowledge, our approach is the first to evaluate RPQs over a compact representation of such graphs, where we show the key advantages of using Glushkov automata in this setting. Our scheme obtains optimal time, in terms of alternation complexity, for traversing the product graph. We further introduce various optimizations, such as the ability to process several automaton states and graph nodes/labels simultaneously, and to estimate relevant selectivities. Experiments show that our approach uses 3-5xdocumentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}begin{document}$$times $$end{document} less space, and is over 5xdocumentclass[12pt]{minimal}usepackage{amsmath}usepackage{wasysym}usepackage{amsfonts}usepackage{amssymb}usepackage{amsbsy}usepackage{mathrsfs}usepackage{upgreek}setlength{oddsidemargin}{-69pt}begin{document}$$times $$end{document} faster, on average, than the next best state-of-the-art system for evaluating RPQs.