Published October 28, 2020 | Version 0.2
Dataset Open

RDF Reification Benchmark (REF) using the Biomedical Knowledge Repository (BKR)

  • 1. ADAPT Centre, Trinity College Dublin

Description

This resource can be used for benchmarking different RDF modelling solutions for statement-level metadata, namely: 

- RDF Reification,

- Singleton Property,

- RDF* (RDF-star). 
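
To make the difference between the three approaches concrete, here is a minimal sketch that renders one annotated statement in each style as Turtle (or Turtle*) text. The `ex:` prefix, the resource names and the helper functions are hypothetical and chosen only for illustration. The exact vocabulary and layout used in the BKR dumps may differ, for instance in whether the plain triple is asserted alongside its reification.

```python
# Hypothetical prefixes and resource names, for illustration only.
PREFIXES = (
    "@prefix ex:  <http://example.org/> .\n"
    "@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .\n"
)


def reification(s, p, o, meta_p, meta_o, stmt="ex:stmt1"):
    """Standard RDF reification: an rdf:Statement node describes the triple."""
    return (
        f"{stmt} a rdf:Statement ;\n"
        f"    rdf:subject {s} ;\n"
        f"    rdf:predicate {p} ;\n"
        f"    rdf:object {o} ;\n"
        f"    {meta_p} {meta_o} .\n"
    )


def singleton_property(s, p, o, meta_p, meta_o, sp="ex:treats_1"):
    """Singleton Property: a unique property instance carries the metadata."""
    return (
        f"{s} {sp} {o} .\n"
        f"{sp} rdf:singletonPropertyOf {p} ;\n"
        f"    {meta_p} {meta_o} .\n"
    )


def rdf_star(s, p, o, meta_p, meta_o):
    """RDF* (RDF-star): the quoted triple is itself the subject."""
    return f"<< {s} {p} {o} >> {meta_p} {meta_o} .\n"


if __name__ == "__main__":
    args = ("ex:drugA", "ex:treats", "ex:diseaseB", "ex:source", "ex:article42")
    for render in (reification, singleton_property, rdf_star):
        print(f"# {render.__name__}")
        print(PREFIXES + render(*args))
```

The sketch only illustrates why the three dumps differ in size and query shape: reification needs several triples per annotated statement, the Singleton Property approach needs one extra property per statement, and RDF* attaches the metadata directly to the quoted triple.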

 

More details about this resource can be found in the following publication:

Fabrizio Orlandi, Damien Graux, Declan O'Sullivan, "Benchmarking RDF Metadata Representations: Reification, Singleton Property and RDF*", 15th IEEE International Conference on Semantic Computing (ICSC), 2021.

Pre-print available at: http://fabriziorlandi.net/pdf/2021/ICSC2021_REF-Benchmark.pdf

 

The dataset contains 3 different versions of the Biomedical Knowledge Repository (BKR) knowledge graph, as described in:

Vinh Nguyen, Olivier Bodenreider, Amit Sheth. "Don't Like RDF Reification? Making Statements About Statements Using Singleton Property" WWW 2014, doi: 10.1145/2566486.2567973.

and,

Satya S. Sahoo, Olivier Bodenreider, Pascal Hitzler, Amit Sheth and Krishnaprasad Thirunarayan. "Provenance Context Entity (PaCE): Scalable Provenance Tracking for Scientific RDF Data" in Sci Stat Database Manag. 2010; 6187: 461–470. doi: 10.1007/978-3-642-13818-8_32

 

The 3 knowledge graph dumps are packaged as gzipped RDF files in Turtle (and Turtle*) syntax:

BKR-R-fullKGdump.ttl.gz for the Reification method,

BKR-S-fullKGdump.ttl.gz for the Singleton method,

BKR-star-fullKGdump.ttls.gz for the RDF* (RDF-star) method.
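
Since the dumps are large, it can be useful to inspect the first few lines without decompressing a whole file. The following is a minimal sketch using only the Python standard library. The helper name `peek` is hypothetical, and the UTF-8 encoding is an assumption about the dumps.

```python
import gzip
from itertools import islice


def peek(path, n=5):
    """Return the first n lines of a gzipped text file, reading it lazily."""
    # "rt" opens the compressed stream in text mode; UTF-8 is assumed here.
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return [line.rstrip("\n") for line in islice(fh, n)]


# Example usage, assuming the file has been downloaded to the working directory:
# for line in peek("BKR-star-fullKGdump.ttls.gz", n=10):
#     print(line)
```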

 

The RDF REiFication Benchmark (REF) also includes a set of SPARQL (and SPARQL*) queries that can be used to compare the performance of different triplestores.

Details about the SPARQL queries, and the queries themselves, are included in the "REF-Benchmark.tar.gz" archive. Each query file is named after the dataset it is designed for (BKR-R, BKR-S or BKR-star), followed by a letter identifying the query set and a query number.

For example, the file "BKR-R_F-Q3.rq" contains query number 3 of query set "F", written for the BKR-R (standard reification) dataset. The same query, translated into SPARQL* syntax for the RDF* dataset, is contained in "BKR-star_F-Q3.rq".
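
The naming scheme above can be parsed mechanically, for example to pair up equivalent queries across the three datasets. The sketch below is inferred only from the single example file name given here. The regular expression and the helper names `parse_query_name` and `counterpart` are assumptions, and file names in the archive may deviate from this pattern.

```python
import re
from typing import NamedTuple


class QueryName(NamedTuple):
    dataset: str    # "BKR-R", "BKR-S" or "BKR-star"
    query_set: str  # e.g. "A", "B" or "F"
    number: int     # position of the query within its set


# Pattern assumed from the example "BKR-R_F-Q3.rq".
_PATTERN = re.compile(r"^(BKR-(?:R|S|star))_([A-Z])-Q(\d+)\.rq$")


def parse_query_name(filename):
    """Split a query file name into dataset, query set and query number."""
    match = _PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unexpected query file name: {filename!r}")
    return QueryName(match.group(1), match.group(2), int(match.group(3)))


def counterpart(filename, dataset):
    """Return the file name of the same query written for another dataset."""
    parsed = parse_query_name(filename)
    return f"{dataset}_{parsed.query_set}-Q{parsed.number}.rq"


if __name__ == "__main__":
    print(parse_query_name("BKR-R_F-Q3.rq"))
    print(counterpart("BKR-R_F-Q3.rq", "BKR-star"))
```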

Sets "A" and "B" are derived from the queries introduced by V. Nguyen et al. in "Don't Like RDF Reification? Making Statements About Statements Using Singleton Property", WWW 2014, doi: 10.1145/2566486.2567973. Set "F" was designed as part of this benchmark with RDF* primarily in mind (see [Orlandi et al., ICSC 2021]).

 

 

 

Files

Files (2.7 GB)

Checksum                                Size
md5:22b2a5aedde486826507f1a2edc69bf4    453.8 MB
md5:ad6a65e7b84e22f1d9c6199b09d24de8    1.9 GB
md5:eb91d948b94c3d7607c4f07650433e16    360.8 MB
md5:a3ce94159b64bffd7f201d78024a81e2    7.9 kB
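
The MD5 checksums listed above can be used to verify a download. The following is a minimal sketch with Python's standard `hashlib` module. The helper names `md5_of` and `matches` are hypothetical, and because this listing does not state which checksum belongs to which file, the expected value has to be supplied by the user.

```python
import hashlib


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of(path) == expected_hex


# Example usage, assuming the archive has been downloaded and its checksum
# copied from the listing:
# print(matches("REF-Benchmark.tar.gz", "md5:<checksum from the listing>"))
```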

Additional details

Funding

ELITE-S – Future Leadership in ICT Standards in Europe “ELITE-S” 801522
European Commission
EDGE – Cutting Edge Training - Cutting Edge Technology 713567
European Commission
ADAPT: Centre for Digital Content Platform Research 13/RC/2106
Science Foundation Ireland