Cloud-Based RDF Data Management That'S Both Powerful And Extensible



Even if there have been some recent improvements in the administration of distributed RDF data, it is still rather difficult to do analysis on large amounts of RDF data using the cloud. Although having a very easy data paradigm, RDF is capable of storing complex graphs that mix information at the instance-level and the schema-level. The distributed operations that are produced as a consequence of sharding this sort of data using standard approaches, such as partitioning the graph using usual min-cut algorithms, are exceedingly inefficient and call for a number of joins to be performed. In this paper, we explore DC, a cloud-optimized distributed RDF data management system that is both effective and scalable. It was created primarily for use in cloud environments. In contrast to more conventional approaches, DC first does a physiological analysis on both the instance data and the schema data before it divides the data. In this paper, we provide an overview of the architecture of DC, covering its fundamental data structures as well as the innovative approaches that we use for the division and distribution of data. In addition to this, we provide a comprehensive analysis of DC, which demonstrates that, for the vast majority of workloads, our system is often twice as fast as the most modern alternatives.


K. Aberer, P. Cudre-Mauroux, M. Hauswirth, and T. van Pelt, “GridVine: Building Internet-scale semantic overlay networks,” in Proc. Int. Semantic Web Conf., 2004, pp. 107–121.

P. Cudr e-Mauroux, S. Agarwal, and K. Aberer, “GridVine: An infrastructure for peer information management,” IEEE Internet Comput., vol. 11, no. 5, pp. 36–44, Sep./Oct. 2007.

M. Wylot, J. Pont, M. Wisniewski, and P. Cudr e-Mauroux. (2011). dipLODocus[RDF]: Short and long-tail RDF analytics for massive webs of data. Proc. 10th Int. Conf. Semantic Web - Vol. Part I, pp. 778–793


M. Wylot, P. Cudre-Mauroux, and P. Groth, “TripleProv: Efficient processing of lineage queries in a native RDF store,” in Proc. 23rd Int. Conf. World Wide Web, 2014, pp. 455–466.

M. Wylot, P. Cudr e-Mauroux, and P. Groth, “Executing provenance-enabled queries over web data,” in Proc. 24th Int. Conf. World Wide Web, 2015, pp. 1275–1285.

B. Haslhofer, E. M. Roochi, B. Schandl, and S. Zander. (2011). Europeana RDF store report. Univ. Vienna, Wien, Austria, Tech. Rep.


Y. Guo, Z. Pan, and J. Heflin, “An evaluation of knowledge base systems for large OWL datasets,” in Proc. Int. Semantic Web Conf., 2004, pp 274–288.

Faye, O. Cure, and Blin, “A survey of RDF storage approaches,” ARIMA J., vol. 15, pp. 11–35, 2012.

B. Liu and B. Hu, “An Evaluation of RDF Storage Systems for Large Data Applications,” in Proc. 1st Int. Conf. Semantics, Known. Grid, Nov. 2005,

Z. Kaoudi and I. Manolescu, “RDF in the clouds: A survey,” VLDB J. Int. J. Very Large Data Bases, vol. 24, no. 1, pp. 67–91, 2015.

Full Text: PDF


  • There are currently no refbacks.

Copyright © 2012 - 2023, All rights reserved.|

Creative Commons License
International Journal of Innovative Technology and Research is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJITR , Permissions beyond the scope of this license may be available at