Published November 15, 2018 | Version v1
Poster Open

De-duplicating the OpenAIRE Scholarly Communication Big Graph

  • 1. CNR-ISTI

Description

The OpenAIRE infrastructure populates a scholarly communication big graph interlinking metadata objects of publications, datasets, software, organizations, funders, and projects. To de-duplicate this graph, OpenAIRE has developed GDup, an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup offers the functionality needed to realize a full entity deduplication workflow over a generic input graph, including Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain a disambiguated output graph.
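As a rough illustration of what "identifying and merging duplicates" can look like in general, the sketch below groups records by a blocking key, compares pairs within each block, and merges matches with a union-find structure. It is not GDup's implementation: the record layout, `blocking_key`, `similar`, the 0.9 threshold, and the title-only comparison are all assumptions chosen for brevity.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def blocking_key(record):
    # Hypothetical blocking rule: first three characters of the lowercased title.
    return record["title"].lower().strip()[:3]


def similar(a, b, threshold=0.9):
    # Hypothetical match rule: case-insensitive title similarity above a threshold.
    ratio = SequenceMatcher(None, a["title"].lower(), b["title"].lower()).ratio()
    return ratio >= threshold


def find(parent, x):
    # Union-find lookup with path halving.
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def deduplicate(records):
    """Return sorted groups of record ids presumed to describe the same entity."""
    parent = {r["id"]: r["id"] for r in records}

    # Step 1: blocking, so only records sharing a key are compared.
    blocks = defaultdict(list)
    for r in records:
        blocks[blocking_key(r)].append(r)

    # Step 2: pairwise matching inside each block; matched pairs are unioned.
    for block in blocks.values():
        for a, b in combinations(block, 2):
            if similar(a, b):
                parent[find(parent, a["id"])] = find(parent, b["id"])

    # Step 3: connected components become the duplicate groups.
    groups = defaultdict(list)
    for r in records:
        groups[find(parent, r["id"])].append(r["id"])
    return sorted(sorted(g) for g in groups.values())


# Illustrative records only; not taken from the OpenAIRE graph.
records = [
    {"id": "p1", "title": "De-duplicating the OpenAIRE Scholarly Communication Big Graph"},
    {"id": "p2", "title": "De-duplicating the OpenAIRE scholarly communication big graph"},
    {"id": "p3", "title": "A Survey of Graph Databases"},
]
print(deduplicate(records))  # [['p1', 'p2'], ['p3']]
```

Blocking keeps the number of pairwise comparisons manageable on large inputs, and union-find makes merging transitive: if A matches B and B matches C, all three end up in one group. A real system would compare more fields than the title and would need a policy for building the merged representative record.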

Files

Poster gDup eScience 2018 A0.pdf

Files (459.9 kB)

Name: Poster gDup eScience 2018 A0.pdf
Size: 459.9 kB
md5:bb1273ed7ce298c8c103bf3e6ec8350c

Additional details

Related works

Is identical to
10.1109/eScience.2018.00104 (DOI)

Funding

OpenAIRE-Advance – OpenAIRE Advancing Open Scholarship 777541
European Commission
OpenAIRE2020 – Open Access Infrastructure for Research in Europe 2020 643410
European Commission