ABSTRACT
Building an efficient SPARQL query over the large and varied body of biomedical Linked Open Data (LOD) requires users to understand the data schema, which makes such data difficult for biologists to handle. To address this problem, we set out to realise a SPARQL query builder that generates a structurally optimised query by logically analysing the target RDF/OWL data; doing so, however, requires an implementation of the corresponding unifying logic over RDF/OWL data. Bio-SPARQL, an ontology-based smart SPARQL query builder, is an implementation of that logic. It generates structurally optimised queries over ontologically classified, RDF/OWL-based biomedical LOD by logically analysing their semantic graph structure. Bio-SPARQL employs our database, BioLOD, which holds LOD data sets categorised into 744 classes with 7.88 million data items (instances), integrated from various types of public omics databases by human curation, and which provides a set of LOD data files for each class. To aid in writing a query, Bio-SPARQL provides a graphical user interface that suggests possible data path schemas and filters by analysing the corresponding ontological BioLOD data structure. The generated SPARQL query is designed to be run in the user's local environment against the corresponding downloaded BioLOD data files, so that the effect of data updates on query results can be controlled.
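To make the idea of turning a class-to-class data path plus a filter into a SPARQL query concrete, the following is a minimal sketch in Python. It is not Bio-SPARQL's actual implementation or API: the names `Step` and `build_query`, the `example.org` IRIs, and the choice of filtering on `rdfs:label` are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


@dataclass
class Step:
    """One hop in a data path (hypothetical structure, not from the paper)."""
    predicate: str  # IRI of the property to follow
    rdf_class: str  # IRI of the class the target node must belong to


def build_query(start_class: str, steps: List[Step],
                label_filter: Optional[str] = None, limit: int = 100) -> str:
    """Assemble a SELECT query that walks from start_class along steps."""
    # The first node is constrained to the starting class.
    lines = [f"?n0 a <{start_class}> ."]
    # Each step links the previous node to a new, typed node.
    for i, step in enumerate(steps, start=1):
        lines.append(f"?n{i - 1} <{step.predicate}> ?n{i} .")
        lines.append(f"?n{i} a <{step.rdf_class}> .")
    # Optional case-insensitive substring filter on the last node's label.
    if label_filter is not None:
        last = len(steps)
        escaped = label_filter.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f"?n{last} <{RDFS_LABEL}> ?label .")
        lines.append(
            f'FILTER(CONTAINS(LCASE(STR(?label)), LCASE("{escaped}")))'
        )
    variables = " ".join(f"?n{i}" for i in range(len(steps) + 1))
    body = "\n  ".join(lines)
    return f"SELECT DISTINCT {variables}\nWHERE {{\n  {body}\n}}\nLIMIT {limit}"


if __name__ == "__main__":
    # Hypothetical path: Gene --encodes--> Protein, filtered by label text.
    path = [Step("http://example.org/encodes", "http://example.org/Protein")]
    print(build_query("http://example.org/Gene", path,
                      label_filter="kinase", limit=10))
```

The resulting query string could then be run by any local SPARQL engine over downloaded data files, which matches the local-execution setting the abstract describes; the engine itself is left out of the sketch.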
Index Terms
- BioSPARQL: ontology-based smart building of SPARQL queries for biological linked open data