π€ AI Summary
This study addresses the fragmentation and weak interoperability between Linked Data and labeled property graph (LPG) ecosystems. We propose rdf2pg, the first framework enabling scalable, bidirectional, semantics-preserving mapping between RDF knowledge graphs and semantically equivalent LPGs. Methodologically, we design a multi-backend adapter supporting Virtuoso, Neo4j, and ArcadeDB; implement cross-language query translation among SPARQL, Cypher, and Gremlin; and introduce the polyglot data endpoint paradigm. Our contributions are threefold: (1) the first systematic benchmark for evaluating semantic equivalence in RDFβLPG mappings; (2) a FAIR-compliance-driven mapping validation mechanism ensuring findability, accessibility, interoperability, and reusability; and (3) empirical evaluation on a plant biology knowledge graph demonstrating preserved semantic integrity post-conversion, while quantitatively revealing inherent trade-offs among scalability, expressive power, and standards compliance across the three graph database systems.
π Abstract
Linked Data and labelled property graphs (LPG) are two data management approaches with complementary strengths and weaknesses, making their integration beneficial for sharing datasets and supporting software ecosystems. In this paper, we introduce rdf2pg, an extensible framework for mapping RDF data to semantically equivalent LPG formats and data-bases. Utilising this framework, we perform a comparative analysis of three popular graph databases - Virtuoso, Neo4j, and ArcadeDB - and the well-known graph query languages SPARQL, Cypher, and Gremlin. Our qualitative and quantitative as-sessments underline the strengths and limitations of these graph database technologies. Additionally, we highlight the potential of rdf2pg as a versatile tool for enabling polyglot access to knowledge graphs, aligning with established standards of Linked Data and the Semantic Web.