Compression of RDF dictionaries

Abstract

The use of dictionaries is a common practice among those applications performing on huge RDF datasets. It allows long terms occurring in the RDF triples to be replaced by short IDs which reference them. This decision greatly compacts the dataset and thus mitigates its scalability issues. However, the dictionary size is not negligible and the techniques used for its representation also suffer from scalability limitations. This paper focuses on this scenario by adapting compression techniques for string dictionaries to the case of RDF. We propose a novel technique: Dcomp, which can be tuned to represent the dictionary in compressed space (22–64%) and to perform in a few microseconds (1–50μs).

Publication
In Proc. 27th Annual ACM Symposium on Applied ComputingACM SIGAPP Applied Computing Review
Date
Links