RDF Reification Benchmark (REF) using the Biomedical Knowledge Repository (BKR)

doi:10.5281/zenodo.4148888

Published October 28, 2020 | Version 0.2

Dataset Open

RDF Reification Benchmark (REF) using the Biomedical Knowledge Repository (BKR)

1. ADAPT Centre, Trinity College Dublin

This resource can be used for benchmarking different RDF modelling solutions for statement-level metadata, namely:

- RDF Reification,

- Singleton Property,

- RDF* (RDF-star).

More details about this resource can be found in the following publication:

Fabrizio Orlandi, Damien Graux, Declan O'Sullivan, "Benchmarking RDF Metadata Representations: Reification, Singleton Property and RDF*", 15th IEEE International Conference on Semantic Computing (ICSC), 2021.

Pre-print available at: http://fabriziorlandi.net/pdf/2021/ICSC2021_REF-Benchmark.pdf

The dataset contains 3 different versions of the Biomedical Knowledge Repository (BKR) knowledge graph, as described in:

Vinh Nguyen, Olivier Bodenreider, Amit Sheth. "Don't Like RDF Reification? Making Statements About Statements Using Singleton Property" WWW 2014, doi: 10.1145/2566486.2567973.

and,

Satya S. Sahoo, Olivier Bodenreider, Pascal Hitzler, Amit Sheth and Krishnaprasad Thirunarayan. "Provenance Context Entity (PaCE): Scalable Provenance Tracking for Scientific RDF Data" in Sci Stat Database Manag. 2010; 6187: 461–470. doi: 10.1007/978-3-642-13818-8_32

The 3 knowledge graphs dumps are packaged as Gzipped RDF files in Turtle (and Turtle*) syntax.

BKR-R-fullKGdump.ttl.gz for the Reification method,

BKR-S-fullKGdump.ttl.gz for the Singleton method,

BKR-star-fullKGdump.ttls.gz for the RDF* (RDF-star) method.

The RDF REiFication Benchmark (REF) includes also a set of SPARQL (and SPARQL*) queries that can be used to compare the performance of different triplestores.

Details about the SPARQL queries, and the queries themselves, are included in the "REF-Benchmark.tar.gz" archive. The queries are named after the dataset they are designed for (BKR-R or BKR-S or BKR-star), plus they include a letter identifying the query set, and a query number.

E.g. the query in the file "BKR-R_F-Q3.rq" is for the BKR-R (standard reification) dataset, it is part of the query set "F" and it is the number 3 of that set "F". Hence, the same query, but translated for the RDF* dataset in SPARQL* syntax, is contained in "BKR-star_F-Q3.rq".

Sets "A" and "B" are derived from the queries introduced by V. Nguyen et al. in: "Don't Like RDF Reification? Making Statements About Statements Using Singleton Property" WWW 2014, doi: 10.1145/2566486.2567973. Set "F" has been designed more with RDF* in mind as part of this benchmark (see [Orlandi et al., ICSC 2021])

Files

Files (2.7 GB)

Name	Size	Download all
BKR-R-fullKGdump.ttl.gz md5:22b2a5aedde486826507f1a2edc69bf4	453.8 MB	Download
BKR-S-fullKGdump.ttl.gz md5:ad6a65e7b84e22f1d9c6199b09d24de8	1.9 GB	Download
BKR-star-fullKGdump.ttls.gz md5:eb91d948b94c3d7607c4f07650433e16	360.8 MB	Download
REF-Benchmark.tar.gz md5:a3ce94159b64bffd7f201d78024a81e2	7.9 kB	Download

Additional details

ELITE-S – Future Leadership in ICT Standards in Europe “ELITE-S” 801522: European Commission
EDGE – Cutting Edge Training - Cutting Edge Technology 713567: European Commission
ADAPT: Centre for Digital Content Platform Research 13/RC/2106: Science Foundation Ireland

	All versions	This version
Views	812	476
Downloads	94	76
Data volume	79.2 GB	63.2 GB

RDF Reification Benchmark (REF) using the Biomedical Knowledge Repository (BKR)

Creators

Description

Files

Files (2.7 GB)

Additional details

Funding