An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda

Mazimwe, Allan; Hammouda, Imed; Gidudu, Anthony

doi:10.3390/ijgi8110484

Open AccessArticle

An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda

by

Allan Mazimwe

^1,*,†,

Imed Hammouda

^2,† and

Anthony Gidudu

¹

Department of Geomatics and Land Management, Makerere University, Kampala 00000, Uganda

²

Mediterranean Institute of Technology, South Mediterranean University, Tunis 1053, Tunisia

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

ISPRS Int. J. Geo-Inf. 2019, 8(11), 484; https://doi.org/10.3390/ijgi8110484

Submission received: 10 September 2019 / Revised: 17 October 2019 / Accepted: 22 October 2019 / Published: 26 October 2019

(This article belongs to the Special Issue Open Science in the Geospatial Domain)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

One of the grand challenges of disaster management is for stakeholders to be able to discover, access, integrate and analyze task-appropriate data together with their associated algorithms and work-flows. Even with a growing number of initiatives to publish data in the disaster management sector using open principles, integration and reuse are still difficult due to existing interoperability barriers within datasets. Several frameworks for assessing data interoperability exist but do not generate best practice solutions to existing barriers based on the assessment they use. In this study, we assess interoperability for datasets in the disaster management sector in Uganda and identify generic solutions to interoperability challenges in the context of disaster management. Semi-structured interviews and focus group discussions were used to collect qualitative data from sector stakeholders in Uganda. Data interoperability was measured to provide an understanding of interoperability in the sector. Interoperability maturity is measured using qualitative methods, while data compatibility metrics are computed from identifiers in the RDF-triple model. Results indicate high syntactic and technical interoperability maturity for data in the sector. On the contrary, there exists considerable semantic and legal interoperability barriers that hinder data integration and reuse in the sector. A mapping of the interoperability challenges in the disaster management sector to solutions reveals a potential to reuse established patterns for managing data interoperability. These include; the federated pattern, linked data patterns, broadcast pattern, rights and policy harmonization patterns, dissemination and awareness pattern, ontology design patterns among others. Thus a systematic approach to combining patterns is critical to managing data interoperability barriers among actors in the disaster management ecosystem.

Keywords:

disaster; hazard; FAIR; data interoperability; patterns

1. Introduction

Disasters cause severe losses to communities, the effects of which are felt by the most vulnerable in communities. In order to cope with impacts of hazard events, organizations require resources and capabilities of other stakeholder institutions thus making effective collaboration and inter-operation essential. To make full use of existing data in the disaster management sector, the disaster community needs to adopt mechanisms that ensure resource findability, accessibility, interoperability, reuse and promote the growth of open data initiatives. Even with growing open data initiatives, integration and reuse of existing data by sector stakeholders is hindered by existing interoperability barriers. The barriers in sharing and coordinating information before and after hazard events often leads to a breakdown in prevention, preparedness and response to extreme hazard events [1]. Thus lack of interoperability for disaster/hazard related data is in itself an indicator of lack of resilience. Wilkinson et al. [2,3] discuss guiding principles to make data in repositories Findable, Accessible, Interoperable and Reusable (FAIR) and associated metrics for evaluating compliance. In the disaster domain, [4] have developed a framework for assessing interoperability as well as other domain interoperability frameworks listed by [5] can be reused in the disaster management sector.

While these frameworks provide a way of measuring interoperability, they lack approaches for identifying and development of best practice solutions to interoperability challenges based on interoperability measurement. A combination of interoperability concepts and patterns have been proposed as a way of overcoming the interoperability barriers [6,7]. Patterns provide reusable proven best practice solutions for recurring interoperability problems in a given context [8]. However, there still exists a gap in pattern demand and availability to enable domain experts to construct high quality interoperability solutions without requiring expert knowledge in interoperability.

Therefore, the contribution of this paper is: First, to provide an understanding of interoperability maturity and compatibility between existing data sets in the disaster management sector in Uganda. Second, from interoperability assessments, we identify potentially reusable interoperability patterns—also known as interoperability best practices. In this paper, we define interoperability patterns as typical solutions to commonly reoccurring interoperability problems within in the context of disaster management. These patterns can be systematically combined to manage interoperability problems in the disaster management sector in Uganda. The rest of the paper is structured as follows: Related work is presented in Section 2, materials and methods in Section 3 while results and discussions in Section 4 and Section 5 respectively. Section 6 contains the conclusions and future work.

2. Related Work

Currently, there exists is a strategic shift in disaster risk management with actors placing more attention on proactive disaster preparedness and prevention as opposed to reactive measures [9]. Some of the proactive measures include rigorous risk assessments, enhanced early warning systems, awareness and coordination of actors. Early warning typically constitute multiple actors (such as national sectors, private sector, civil society and communities) playing different yet complementary roles in disaster risk management [10,11]. The success of an enhanced early warning system typically depends on the ability of multiple actors to share data/ information (i.e., data about past events, real-time streams etc.) [12,13].

2.1. Disaster Data Interoperability

Vast amounts of hazard/disaster-related data existing today are typically dispersed geographically and owned by various entities making it difficult to access and reuse for disaster management. As such, Open Data initiatives aiming at increasing the availability of machine-readable data provided under an open license and facilitate data reuse have been advanced [14,15,16]. The FAIR principles defined by [2] provide a guide for ensuring such open disaster related data is findable, accessible, interoperable and reusable (FAIR) by machines/humans to support disaster management functions and so become as valuable as is possible [17]. The concept of interoperability serves to enable creation, exchange and consumption of data with explicit, shared expectation of context and meaning [18]. Different frameworks have been advanced to reconcile several perspectives on the notion of data interoperability [19]. These frameworks include the IMI [20], EIF [21] and LCIM [22] to mention but a few. After reviewing these frameworks, Rezaei et al. [23] categorises data interoperability into several distinct levels. These include: syntactic, semantic, technical and organizational interoperability. To ensure that data providers publish a machine-readable open data in an interoperable way, Barners-lee [24,25] introduces a five-star rating for data that relates to the levels of interoperability defined by Rezaei et al. [23].

2.2. Interoperability Assessment

Leal et al. [5] classify interoperability assessment types used by different frameworks into three categories. The first assessment type is the potentiality assessment that evaluates whether a system is mature enough to overcome existing interoperability barriers. Examples of such an assessment include [26] used in Uganda and [4] in the disaster domain. The second type is the compatibility assessment that evaluates the current state of data to identify potential conflicts that may cause interoperability problems as exemplified in [22,27,28]. The last assessment type evaluates the quality of implementing an interoperable solution at run-time. While these frameworks provide a way to measure interoperability, Leal et al. [5] conclude that they lack approaches for identifying best practices necessary to improve interoperability based on assessment results.

2.3. Interoperability Patterns

To solve interoperability challenges, different authors [6,7,23] propose a combination of interoperability concepts and patterns (also known as information supply chain patterns). The concept of patterns emerges from software engineering [29], data modeling [30] and knowledge engineering [31] among others to interoperability. To overcome technical interoperability barriers, several patterns have been developed as best practice solutions. Examples of application domains where interoperability patterns have been applied to manage technical barriers include the Internet of Things (IoT) [32], enterprise information systems exchange (a catalog of patterns can be found here: http://project-interoperability.github.io/exchange-patterns/) [33] and health domain [7]. In the semantic ecosystem, Gangemi [34] proposes ontology design patterns (ODPs) as common strategies for implementing semantic interoperability [8]. To solve organization interoperability barriers [6,7] identify several patterns. While the prospects for using interoperability patterns to manage interoperability barriers are enormous, there still exists a gap between their demand and availability [35]. Thus, there is a need to develop a critical amount of reusable patterns for managing disaster/ hazard related data interoperability.

3. Materials and Methods

In this study, we use the case of Uganda that experiences several disasters such as famine, floods, earthquakes, conflict, landslides and epidemics. With each hazard, there exists a lead institution (See the list of lead institutions in the Uganda disaster preparedness and management policy that can be downloaded from here: https://reliefweb.int/report/uganda/national-policy-disaster-preparedness-and-management) responsible for monitoring, assessing and reporting risk levels. For the institutions to achieve their mandates, they need to share, integrate and reuse data from other co-opted stakeholders. Through the disaster management sector working group, the Office of the Prime Minister (OPM) coordinates all stakeholders (data publishers and users) to promote integrated monitoring, reporting and actions to minimize hazard impacts. As a matter of fact, the current study investigates interoperability issues that affect the sharing and reuse of data among actors in the disaster management sector. In terms of theoretical scope, we investigate the data interoperability concern wherein two types of data are considered for the study (i.e., raw data and derived data like meteorological data obtained after processing raw sensor data). Therefore, this paper mainly addresses two research questions (RQ).

RQ1: To what extent is disaster/ hazard-related data interoperable in Uganda?
Knowledge of data interoperability conflicts and maturity level are needed to guide the mapping of appropriate solutions to barriers in sharing/reusing data in the disaster management sector.
RQ2: Are there emerging interoperability patterns for managing data interoperability barriers in the disaster sector?
Pattern instances revealed through the mapping of local use case interoperability problems in the sector to generic solutions are critical for stakeholders to implement appropriate solutions with ease.

To answer RQ1, two assessment types (i.e., the maturity and compatibility assessment) are used to explore data interoperability in the disaster sector as shown in Figure 1. For the maturity test, data are collected using semi-structured interviews from seven data points representing disaster sector stakeholders (see Table A3) that generate and reuse data in the sector. To gain more insight into interoperability issues in the disaster sector, data from members of the disaster sector working group was collected through the formulation of four Focused Group Discussions (FGD) constituting a total of 16 data users from 12 stakeholder institutions (see description of institutions in Table A4). The disaster sector working group is an inter-agency disaster risk reduction platform that constitutes the following categories of actors; lead Ministries, UN agencies, NGOs, CSOs, researchers and other relevant stakeholders as defined in the disaster policy. In this study, we endeavor to cover all actors by selecting at least a data point in each category to guarantee good representation. Thus a total of 15 institutions were selected to participate in the interviews and FGD based on purposive sampling. This represents about 24.6% of the total number of sector stakeholders (The derived percentage is based on 61 institutions included in the DRR platform/sector mailing list of 20th July 2018). The compatibility assessment utilizes structured data obtained during prioritization in FGDs. However, disaster sector data is both derived and raw (i.e., requires pre-processing for use as input in assessment models e.g., extracting indicator data from imagery). To avoid researcher bias that could affect the internal validity of the study, we do not pre-process any raw data into derived data but rather use existing data.

Qualitative data collected from interviews and FGD were transcribed into text and later coded in atlas-Ti. The codes were then grouped into code groups and analyzed using content thematic analysis. To indirectly measure the interoperability maturity (i.e., the ability to overcome existing interoperability barriers), metrics in Table 1 were developed and classified according to data interoperability barriers presented by [23].

To generate metrics that are in line with recommendations of the FAIR framework, we combine FAIR metrics [3] with the 5-star rating for data [25]/ linked open data [24] as illustrated in Table A1. The decision of the final metric is based on a generalization of FAIR interoperability metrics and data rating concepts. Where no FAIR interoperability metrics exist, 5-star rating metrics are selected taking into consideration their compliance to findability, accessibility and reusability metrics of the FAIR framework e.g. in metric T1. A numerical measure of maturity is generated by assigning compliance value codes (where 0 means false, 1 for True and 0.5 for partially) based on metrics in Table 1. Finally, the maturity indicator is computed by averaging compliance values.

To measure interoperability compatibility, test datasets (research tools and data used can be found here https://github.com/mazimweal/mazimweal.github.io) are compared using the method in Colpaert et al. [27]. This method is best suited for evaluating semantic inter-linkedness between RDF datasets. However, it also provides a good experimentation on the semantic inter-linkedness and similarity of concepts in tabular datasets using simple RDF syntax.

Consider two datasets A and B, (seen in Figure 2) being assessed for compatibility. Four scenarios regarding sharing of identifiers and concepts are possible (see Figure 2). Below is an explanation of implications for scenarios in Figure 2 on interoperability.

Scenario 1: Does not affect interoperability since neither identifiers nor concepts are shared.
Scenario 2: Denotes true ID matches where identifiers increase interoperability since they represent different concepts that are shared by datasets.
Scenario 3: Two datasets share an identifier for which the concepts do not match. Therefore, it denotes a false ID match that negatively affects interoperability.
Scenario 4: Also negatively affects interoperability since concepts match in the two data sets yet the identifiers do not match.

In this method, a pair of identifiers (IDs) shared by two datasets is evaluated to determine whether they represent the same concept. Therefore, the interoperability calculation will only consider scenarios 2, 3, 4 and exclude 1.

The data (as illustrated in Figure 3) are transformed into the RDF format from which a list of identifiers is generated (see example in Figure 4). We then combine all identifiers from participating datasets into one file (totaling to 149 IDs).

Using Algorithm (Figure 5a), every pair of identifiers in any two datasets is compared to generate a list of questions of the form

< i d 1 > ? u n k n o w n < i d 2 >

(i.e., from 149 IDs, we generate 149 × 148 = 22,052 questions). The list of IDs are then compared for concept similarity by replacing the

? u n k n o w n

predicate with the iiop:SameAs or iiop:notSameAs depending on real-world inputs crowd-sourced from domain experts. The relationship between two IDs is denoted by iiop:SameAs if concepts are the same and iiop:notSameAs (the identifier interoperability vocabulary can be found here; http://semweb.mmlab.be/ns/iiop#) when different in two datasets as shown below.

Prefix ns1: <https://w3id.org/gicentre/onto#>

@prefix iiop: <http://semweb.mmlab.be/ns/iiop#>

<ns1:desinventar/Event>iiop:SameAs <ns1:response/Disaster_type>

<ns1:desinventar/Location>iiop:SameAs <ns1:response/Location>

<ns1:desinventar/Serial>iiop:SameAs <ns1:response/id>

The EYE reasoning server (find reasoning examples here https://n3.restdesc.org/n3/) [36] was then used to reason through the questions thus generating a list of answers (also known as interoperability statements) based on query Figure 5b. EYE is a semantic web reasoning engine that supports Euler paths. Through interoperability with Cwm via NTriple (N3), EYE provides a mechanism to query check, transform and filter information (See query in Figure 5b).

To measure semantic interoperability, the open DesInventar dataset is used as a reference for computing the interoperability ratio in Equation (1). DesInventar (click this link to access the Desinventar dataset: https://www.desinventar.net/DesInventar/profiletab.jsp?countrycode=uga) is an open disaster management information system managed by the mandated disaster management institution (OPM). The system enables collection, documentation and analysis of data about losses caused by disasters associated with natural hazards. Thus it helps to understand disaster trends and their impacts for better prevention, mitigation and preparedness among communities.

I d e n t i f i e r I n t e r o p e r a b i l i t y = \frac{ρ_{i d e n t i f i e r s}}{ρ_{r e a l - w o r l d}}

(1)

Consider a pair of datasets (see Figure 2), the

ρ_{r e a l - w o r l d}

relevance in Equation (1) is equal to the occurrence of consolidated identifiers in a joined dataset. While

ρ_{i d e n t i f i e r}

relevance is calculated similarly as the real-world relevance except that the strings in the identifiers and the respective concepts they represent need to match. In other words, the

ρ_{i d e n t i f i e r}

is the number of triples in a joined dataset with matching true positives (see scenario 2 in Figure 2). Based on expert opinion, the elicited datasets used in the compatibility assessment are rated to determine how easy it is to merge them with the reference dataset. These ratings are compared with interoperability metrics from Equation (1).

To answer RQ2, the resulting challenges from qualitative data analysis are matched with the problem part of pattern templates [37] to generate a list of relevant patterns in the disaster domain as illustrated in Figure 6. The solution space constitutes of generic patterns applied in other contexts other than disaster data interoperability. These were derived from literature and existing catalogs. The decision on the final pattern selected also takes into consideration the results from data interoperability assessment in the sector.

4. Results

4.1. To What Extent Is Disaster/Hazard-Related Data Interoperable among Stakeholders?

This section presents results of interoperability maturity level and compatibility measures for disaster data in Uganda. The results are organized based on the four interoperability levels defined by [23].

4.1.1. Technical Interoperability

Technical interoperability is reached when data/information can be shared and accessed using common standard protocols. For instance, a good indicator of technical interoperability is the ability to access disaster data via Http/URL and download it fully (see metric T1 in Table 2). All interviewed respondents manage web portals with downloadable disaster data/information except for one as shown in Table 2. This implies that technical interoperability can fully be achieved in the disaster sector in Uganda. In addition, sharing of disaster data is also achieved through email and flash-drives.

4.1.2. Syntactic Interoperability

From Table A2, all participating organization have unstructured data (e.g., pdf), structured data (S1) and machine readable data (S2). Data are encoded in different formats depending on intent for data/information use. For instance, the intent of keeping disaster data as spreadsheets best suits analysis whereas shape file format suits mapping. The only open machine readable format used is the CSV, and that no institution uses RDF and XML machine readable formats for sharing data. Interview and FGDs indicated that the respondents had no working knowledge of XML and RDF which are also semantic markup languages signaling a lack of capacity in linked data/semantic technology among data publishers and users (see Section 4.1.4). However, since there existed both structured and machine readable data all of which the serializations are common among all institutions, we can conclude from Table 2 that at 93%, syntactic interoperability for data in the disaster management sector is high.

4.1.3. Organization/Legal Interoperability

A logical indicator for measuring legal interoperability is whether data has an open license such that the machine can understand what the reuse rights are. Generally, disaster sector stakeholders have no licenses attached to data (O1 in Table 2) making it hard to determine the openness of the data available. However, most of this data can freely be accessed, downloaded and reused (O2). In some instances, only derived data are available rather than the raw data e.g., climatic data. Inter-institutional arrangements among users and publishers exist, as illustrated in O3 to enable sharing of disaster related data. The data publishers asserted that they have institutional policies on how data should be accessed. This makes it hard for data users to access the data because each institution has its own policy as illustrated in the quote below.

Respondent 4; “We know data is there but obtaining it is a problem. In most cases you need to use informal means to access it”

To solve these organizational interoperability barriers, there is a need to encourage license definition for data as well as harmonization of policies for maintaining and accessing data in the sector.

4.1.4. Semantic Interoperability

From Table 2, all respondents (data providers) have not used base URI for their data (Sem1) and neither do they have linked vocabularies (Sem2) describing concepts in their dataset. However, from qualitative results, semantic issues still exist while utilizing disaster/hazard-related data as illustrated in the quotes below.

Respondent 1: “The climate domain uses difficult terminology that is often understood differently in other domains or by the common man for example the concepts above average rainfall, late start, dry spell are understood differently for agricultural domain and yet for layman it could also mean something else”

Focus group discussion 2; “Since the desinventar database is populated by different volunteers, we are not sure whether all volunteers will conceptualize events for reporting purposes in the same way. For instance, there is no clear reporting on terms like rains, heavy rains, storms, windstorms, hailstorms.”

The quotes above imply that data users are aware that semantic issues exist when reusing disaster/hazard-related data. From Section 4.1.2, we note that there is a lack of capacity in using semantic markup. Therefore, the absence of implementations to manage semantic interoperability issues in data can be explained by the lack of capacity among data producers. An important indicator of whether data can be semantically integrated is through comparison of identifiers in different data sets to check for concepts that are similar in the real world. Upon experimenting with the RDF model, results in Table 3 generally indicate a high interoperability identifier ratio except for the land cover dataset. Since the land cover dataset does not share identifiers and concepts with any other dataset and vice-versa, it’s interoperability identifier value is zero. Thus has no impact on the interoperability of other datasets (see scenario 1 in Figure 2). Therefore, the results in Table 3 indicate that data in the disaster management sector is highly semantically inter-linked.

However, the semantic issue presented by respondent 1 requires a more expressive language to infer extreme weather events/conditions from rainfall data that can not be captured by simple RDF syntax used in the experiment above. While results in Table 3 suggest that linked data could improve semantic inter-linkedness between sector datasets, a more expressive way for representing explicit but also shared conceptualizations (i.e., through use of formal ontologies in OWL-DL) could be explored. Organizing sector data with ontologies requires an understanding of the complex and abstract ontology modeling notions expressed in OWL-DL. However, disaster domain experts are not often familiar with these constructs and will thus need a steep learning curve [38]. To mitigate the issue of lack of technical capacity in semantic technologies(as expressed in Section 4.1.2), best practice solutions (ontology design patterns and tools) can be adopted to make ontology modeling easy and intuitive enough for domain experts to develop ontologies for organising sector data efficiently and correctly without the help of Knowledge Engineers.

4.2. Are There Emerging Patterns for Managing Data Interoperability in the Disaster Management Sector?

Table 4 presents patterns (a description of revealed patterns is available here: https://github.com/mazimweal/mazimweal.github.io) obtained by mapping interoperability challenges with the problem part of the pattern template in the context of disaster management. Stakeholders in the disaster management sector share both raw and derived data. To help integrate and reuse data, respondents recommended the inclusion of metadata with pre-processing techniques. Also, respondents noted that shared data often has gaps, duplication and erroneous entries. To solve these data issues, respondents recommend best practices that will not only fill sector data gaps by enrichment but also ensure data validation with related datasets from other stakeholders. Linked data is a suitable technology for increasing data interoperability as explored by results in Table 3 not only by uniquely interlinking concepts but also through validation and enrichment of sector data with other sources. Therefore, several linked data patterns (a description of the patterns for modeling publishing and consuming of linked data are available at http://patterns.dataincubator.org/book/) for publishing and consuming linked data emerge. These include the “Follow Your Nose pattern” to find additional relevant data, “Progressive Enrichment pattern” for data quality and “Missing Isn’t Broken pattern” to handle messy and incomplete data.

A technical interoperability barrier identified in the sector is the inability to access data scattered in multiple disaster/hazard related repositories. The federated query pattern provides a solution to enable access to multiple data stores with a single query. Furthermore, disaster/early warning managers expressed need for a solution that can send geolocated alert or notification to stakeholders across multiple areas to communicate details of specific hazard events. The broadcast pattern would be instrumental in solving such a problem.

From Section 4.1.3, there is a difficulty in finding and accessing data from publishers as organizations have different policies and regulations for maintaining/accessing disaster-related data. Best practice solutions from respondents show the need for organizations to advertise their data. The political will of the government to invest in data is critical. Besides, mandated government institutions ought to take lead in the formulation and implementation of policies supporting the growth of open data initiatives and harmonization of data sharing practices. Thus the following patterns emerge for reuse: dissemination and awareness pattern, policy development and harmonization patterns (e.g. rights pattern, private policy pattern, policy override and conflict mediation patterns), coordination and implementation patterns. These patterns are synonymous with pattern instances described in a different context by Nehta [7].

To resolve semantic barriers identified in Section 4.1.4, respondents proposed a unified vocabulary across domains and institutions (see Table 4). Given that institutions maintain data based on different institutional policies as illustrated in Table 2, a bottom-up approach to ontology engineering that does not necessitate policy harmonization is preferable. As such, we identify content ontology design patterns for organizing data in the disaster management sector. Content ODPs allow institutions to align themselves to the interoperability infrastructure/global pattern schema on a gradual basis depending on the organizational capacity (see Section 4.1.4) and policies.

5. Discussion

5.1. Disaster Data Interoperability Evaluation

From the results above, there exists a high level of maturity for technical and syntactic interoperability for data in the sector. Data users can only access machine-readable CSV format (for instance, one can easily download disaster impact data from DesInventar database as CSV) implying that data is limited to the 3rd star of open data rating by BarnesLee [25]. On the contrary, semantic and organization interoperability is still a challenge in the sector. Since the study uses FAIR interoperability metrics for interoperability assessment, the maturity results show that disaster/hazard related data are far from fulfilling FAIR recommendations. In this paper, we explore a way of resolving interoperability issues through a mapping to best practice solutions. Therefore, future work could also focus on holistically exploring the FAIRness of disaster/hazard related data by further evaluating its findability, accessibility and reusability.

The identifier interoperability provides a mechanism to reason about the semantics of concepts in datasets based on the RDF data model. Results from compatibility measurement suggest that using the linked data can improve interoperability. However, users in the disaster management sector often use both structured, unstructured data (like pdf) and raw imagery that can be easily transformed using well-established procedures. A big challenge to this study is obtaining only the structured data available in the disaster sector to enable examination of conflicts in data. To address this problem, we triangulate data sources and interoperability assessment types to gain a bigger picture of data interoperability in the disaster sector. This allows demonstration of the fact that RDF alone is not sufficient to capture complex semantic notions (e.g. inferring extreme weather events/conditions from rainfall data/thresholds). This implies that a more expressive mechanism is required to capture such semantic constraints of data such as OWL-DL. However, there is a lack of technical capacity in semantic markup technologies such as XML/RDF/OWL. Thus, future work can focus on furthering work on intuitive RDF interfaces (such as RDF data generators https://rdforms.org/#!start.md), and ODPs [39] to support managing of data by users/ disaster domain experts.

From a legal perspective, clearly defined license (open/proprietary) increases interoperability for both humans and machine. In Table 2, most disaster-related data are freely available but have no license attached. Thus machines will not explicitly understand data reuse rights even when data are seemingly open. Even with inter-institutional collaborations, data interoperability in the sector will remain a challenge since institutions maintain data based on different policies/regulations. The mandated hazard lead government institutions have a role to develop policies that will open and harmonize data management among stakeholders. Since the role of policy development is outside the purview of a typical user/stakeholder in the disaster sector, there is a need to develop interoperable disaster management solutions that take into consideration changing policies and related constraints.

5.2. Interoperability Patterns

The interoperability assessment (in Section 4.1) together with elicited challenges enable identification of suitable interoperability best practices for managing data in the disaster management sector. Patterns identified can be grouped using different classification systems such as categorization by interoperability barrier (see Table 4), granularity (e.g., federated and broadcast patterns are more coarse than ODPs) etc. Construction of high-quality interoperability solutions by domain experts without expert knowledge entails a systematic effort to combine patterns such as those in Table 4. Figure 7 illustrates the concept of pattern combination in the context of a multi-hazard early warning system. This suggests that a catalog of patterns and tools for combining these patterns must be readily accessible. While patterns identified in Table 4 are specific to interoperability issues in Uganda’s disaster management sector, future work could also focus on the identification of new patterns at the different classification levels in a broader disaster community. The choice of pattern instances used in a composition is largely dependent on organizational policies and business processes. For instance, the selection of content ODPs in Section 4.2 to manage semantic barriers takes into consideration the fact that multi-hazard indicator data is managed based on different organizational policies. Therefore, applying patterns without reflecting on organizational policies and processes can lead to incorrectly modeled interoperability solutions.

At a coarse level(in this study, coarse level patterns are similar to architectural patterns), Figure 7 combines the federated and broadcast patterns to enable query multi-hazard data from different actors in the sector and generate alerts and notifications based on the queries. At a finer scale, willing institutions organise and publish hazard-related linked data using ontology design patterns as illustrated in [40]. Publishing data using ODPs allows for continuous connection to the unified global ontology pattern schema that is implicitly glued together with a foundational ontology. This linked hazard data can further be enriched and validated with related data on the web using linked-data patterns. A federated data query can then be posted onto data sources of various actors in the disaster management sector using interfaces such as that defined by [41]. The reasoning possibilities of ontologies facilitate the deduction of new facts during federated data queries that are critical for early warning.

Nonetheless, federated queries on linked data require that disaster data users and publishers are knowledgeable in the query technology in use e.g., SPARQL or GraphQL-LD. In Table 4, respondents recommended semantic systems with interfaces that hide such complexity. To overcome this challenge, future work could explore the use of natural language by data users/novices to make federated queries to decentralized repositories where semantic issues have been treated with ODPs. The work of [42,43] is useful to guide this conversation in the disaster management sector.

6. Conclusions

The success of disaster risk management efforts depends on the ability of multiple stakeholders to integrate and reuse existing disaster data. In this paper, we evaluate the extent to which data from multiple stakeholders in the disaster management sector is interoperable and identify patterns for reuse while solving interoperability challenges in the domain. Results indicate high technical and syntactic interoperability maturity for data in the sector. On the contrary, the maturity of semantic and legal interoperability is still low. The maturity results show that data in the disaster sector is far from fulfilling FAIR interoperability recommendations. Data users and publishers in the sector have a clear understanding of semantic issues existing in the datasets but lack the technical capacity to implement semantic technologies. Moreover, there also exists gaps in published data that are not easily accessed from multiple repositories. To solve these issues, we identify patterns that will enable disaster domain experts to construct highly interoperable solutions without the help of interoperability experts. Pattern identification takes into consideration the results of interoperability assessments. Pattern reuse requires a systematic effort to combine them to solve existing data interoperability barriers in the sector. The choice of patterns used will typically depend on organizational policies and business processes. In this study, we use a multi-hazard early warning solution to illustrate modular interoperability pattern reuse based on the assessment. The architecture of a multi-hazard early warning system will typically constitute the federated and broadcast patterns at a coarse level and linked data published using ODPs at a finer level. Publishing disaster data as linked data using ODPs is critical for data enrichment, validation and handling of semantic issues while taking into consideration the technical capacity and organizational constraints. Linked data from multiple publishers can, therefore, be integrated using federated queries. Thus future work could focus on the use of natural language for ODP based linked data query answering over federated repositories. This will enable data users to access data without technical competence in semantic technologies. More so, future work could consider identification of more interoperability patterns at different classification levels. For instance, at a finer level, researchers could identify specific ontology design patterns that can be reused to publish data in the disaster management sector. This study assesses interoperability and provides best practices solutions to data interoperability challenges based interoperability metrics of the FAIR framework. Future work could also focus on holistically exploring the FAIRness of disaster data/vocabularies in Uganda not only by further evaluating its Findability, Accessibility, and Reusability but also proposing ways to increase its FAIRness.

Author Contributions

Conceptualization, I.H., and A.M.; methodology, A.M. and I.H.; software, A.M., data curation, A.M. and A.G.; writing—original draft preparation, A.M.; writing—review and editing, I.H., A.G. and A.M.; supervision, I.H. and A.G.

Funding

This work is supported by the Swedish International Development Agency (SIDA) funding to Makerere University, Kampala in partnership with Chalmers and Gothenburg University, Sweden under the BRIGHT project 317.

Acknowledgments

We acknowledge support from Pieter Colpaert with regard to ideas on the FAIR framework and compatibility assessment.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CSOs	Civil Society Organisations
CSV	Comma separated variable format
Cwm	Cwm (pronounced coom) is a general-purpose data processor for the semantic web
DRR	Disaster Risk Reduction
EWS	Early Warning Systems
EIF	European Interoperability Framework
EYE	Euler Yet another proof Engine
FAIR	Findable, Accessible, Interoperable and Reusable
FGDs	Focused Group Discussions
GraphQL-LD	GraphQL Query Language for linked data
HTML	Hypertext Markup Language
HTTP	Hypertext Transfer Protocol
IMI	Information Modeling and Interoperability
LCIM	Level of Conceptual Interoperability Model
MDA	Ministries, Departments and Agencies
NEWS	National Early Warning System
NGOs	Non-Governmental Organisations
NOAA	National Oceanic and Atmospheric Administration
ODPs	Ontology Design Patterns
OSM	Open Street Map
OWL_DL	Web Ontology Language -Description Logic profile
RDF	Resource description framework
SPARQL	Language for querying linked data
UN	United Nations
UNMA	Uganda National Meteorological Authority
URI	Uniform Resource Identifier
USGS	United States Geological Survey (Context: data explorer)
XML	Extensible Markup Language

Appendix A. Interoperability Maturity

Appendix A.1. Interoperability Maturity Metrics

A full list of FAIR Metrics and corresponding descriptions can be found at https://github.com/FAIRMetrics/Metrics/tree/master/MaturityIndicators/Gen1.

Table A1. Defining metrics for interoperability maturity assessment.

Rezaei et al. (2014)	5 Star Data Rating (Berners-Lee 2006)	5 Stars of Linked Data Vocabulary Use (Janowicz et al. 2014)	FAIR Metrics	Metrics Derived for Study
Technical	1st Star:Make your stuff available on the Web (whatever format) under an open license		FM-A1.1: Access Protocol is open, free, and universally implementable	T1: Data is published in any format and accessed via a common protocol such as Http URL
Syntactic	2nd Star: make it available as structured data (e.g., Excel instead of image scan of a table)			S1: Data is available as structured data e.g., excel instead of a scanned image
Syntactic	3rd Star: Make it available in a non-proprietary open format	2nd Star: The information is available as machine-readable explicit axiomatization of the vocabulary. E.g OWL, RDF W3C standards	FM-RI: Meets Community Standards (meta)data meet domain-relevant community standards FM-I1: Use a Knowledge Representation Language-(meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation	S2: Data is available in a machine-readable non-proprietary format e.g., CSV, XML, RDF, OWL, XML, GML etc
Semantic	4th Star: use URIs to denote things, so that people can point at stuff	1st Star: Linked Data without any vocabulary		Sem 1: Use URI to denote concepts to enable unique identification
Semantic	5th Star: link your data to other data to provide context	3rd Star: The vocabulary is linked to other vocabularies 4th Star: Metadata about the vocabulary is available 5th Star: The vocabulary is linked to by other vocabularies (note: this is the reverse of 3rd star)	FM-I1: Use a Knowledge Representation Language: (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation FM-I2: Use FAIR Vocabularies: IRIs representing the vocabularies used for (meta)data FM-I3: Use Qualified References: (meta)data include qualified references to other (meta)data	Sem2: Data is available with vocabularies that are linked via unique and qualified references
Organisation	Ist Star: Make your stuff available on the Web (whatever format) under an open license		FM-R1.1: Accessible Usage License: (meta)data are released with a clear and accessible data usage license	O1: License attached to data O2: Data is accessed freely (but may does not have explicit license but it is openly accessible Observed in interview data)
			FM-F1B: Identifier persistence-Whether there is a policy that describes what the provider will do in the event an identifier scheme becomes deprecated.	O4: Uses a data policy that is binding to all institutions
				O3: Presence of institutional arrangements (From interview data)

Appendix A.2. Summary of Data Used for Measuring Maturity

Table A2. Qualitative measurement of disaster data interoperability.

Respondent	Syntax	Technical	Semantic	Organisation
Respondent 1	Excel/CSV, html, ppt, docx, pdf, shapefile	Email, websites-URL, Database	No URI and No linked data	Data License: not sure Data is open: Partially Policy: institutional Institutional arrangements:yes
Respondent 2	Excel/CSV, shapefile, pdf, html	Email, websites-URL, Flash Drive	No URI and No linked data	Data License: No Data is open: yes Nature of Policy: institutional Institutional arrangements
Respondent 3	html, shapefile, OSM	Website-URL	No URI and No linked data	Data License: yes Data is open: yes Nature of Policy: institutional Institutional arrangements:Yes
Respondent 4	Excel/CSV, html, ppt, docx, pdf, shapefile	Website, URL	No URI and No linked data	Data License: yes Data is open: yes Nature of Policy: institutional Institutional arrangements:Yes
Respondent 5	Excel/CSV, html, ppt, docx, pdf	Database, Website-URL, email	No URI and No linked data	Data License: No Data is open: yes Nature of Policy: institutional Institutional arrangements:Yes
Respondent 6	Excel/CSV, html, ppt, docx, pdf	Database, Website-URL, email	No URI and No linked data	Data License: No Data is open: partially Nature of Policy: institutional Institutional arrangements:Yes
Respondent 7	ppt, docx, pdf, shapefile	Database, flashdrive	No URI and No linked data	Data License: No Data is open: partially Nature of Policy: institutional Institutional arrangements:Yes

Appendix B. Nature of Participants in the Study

Appendix B.1. Summary of Respondents that Participated in Interviews

Table A3. Description of respondents in the interview.

Respondent	Nature of Organization	Data Shared between Stakeholders
Respondent 1	Institution interested in food insecurity scenario analysis and early warning based on relevant drivers.	Collects primary data such as price data and volumes of informal trade but relies on a network of partners to access data on farm gain, Climate and weather data from NOAA and USGS, Vegetation conditions, Rainfall data from UNMA, consumer prices, nutrition data, conflict data
Respondent 2	National statistical body	Provides fundamental data sets like administrative boundary data upon which disaster data is overlayed, consumer prices, vulnerability indicators
Respondent 3	Humanitarian that collects open data and maps it on OSM platform	Generates exposure data and maps post disaster impacts. Also relies on data from other institutions such as administration boundaries etc.
Respondent 4	Humanitarian organization for disaster response and preparedness. Collaborates with lead agencies on Flood modeling and early warning, indigenous knowledge EWS, Forecast based financing (FbF)	Generate rapid assessment reports/response data, relies on data from other institutions for joint Hazard, Vulnerability and Risk assessment. Such data include disaster Impact data, land cover, imagery etc
Respondent 5	National Office mandated to coordinate disaster preparedness, prevention and emergency response	Manages DesInventar impact data, Institution does not primarily collect data but rather relies on data from other institutions. Coordinates early warning efforts of other institutions e.g., weather/climate forecasts, food security, crop and pasture conditions, Multi hazard early warning (based on sector data) etc
Respondent 6	Semi autonomous government authority for weather and climate services	Weather data (Rainfall/ humidity/temperature data) and fore casts (daily, 10 day dekadal, monthly and seasonal forecasts), climatic statistics
Respondent 7	Water Resource monitoring department in charge flood hazards	Provide data on water levels in various rivers and lakes being monitored; flood hazard and sanitation data

Appendix B.2. Description of Institutions that Participated in Focused Group Discussions

Table A4. Nature of organization where focused group discussions (FGD) participants were drawn.

No	Nature of Organization
1	Non governmental organization offering relief services to Refugees in Uganda
2	United Nations agency supporting disaster management in Uganda
3	Office mandated to coordinate disaster management in Uganda
4	Higher learning and research institution in Uganda
5	International Non Governmental Organization that supports humanitarian efforts in Uganda. Also supports data management for community EWS
6	Ministry in charge of health in Uganda (provides human epidemic data)
7	Church based humanitarian organization in Uganda
8	Ministry that provides National data and information on agriculture sector; department involved in food insecurity analysis and early warning
9	Non governmental Organization involved in climate-smart disaster risk reduction and Forecast based financing
10	Civil Society Organization helping reduce Community Vulnerability to Climate Change Induced Disasters
11	Humanitarian Organization spear heading Forecast based Financing in Uganda.

Appendix C. Interoperability Patterns

Technical patterns include the federated and broadcast patterns. Semantic patterns include a list of Linked data patterns, and Ontology design patterns. Organisational patterns include the dissemination pattern, rights pattern among others. All Pattern templates and diagrams can be found here: https://github.com/mazimweal/mazimweal.github.io.

References

Bharosa, N.; Lee, J.; Janssen, M. Challenges and obstacles in sharing and coordinating information during multi-agency disaster response: Propositions from field exercises. Inf. Syst. Front. 2010, 12, 49–65. [Google Scholar] [CrossRef]
Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wilkinson, M.D.; Sansone, S.A.; Schultes, E.; Doorn, P.; da Silva Santos, L.O.B.; Dumontier, M. A design framework and exemplar metrics for FAIRness. Sci. Data 2018, 5, 180118. [Google Scholar] [CrossRef] [PubMed] [Green Version]
da Silva Avanzi, D.; Foggiatto, A.; dos Santos, V.A.; Deschamps, F.; Loures, E.D. A framework for interoperability assessment in crisis management. J. Ind. Inf. Integr. 2017, 5, 26–38. [Google Scholar]
Leal, G.D.; Guédria, W.; Panetto, H. Interoperability assessment: A systematic literature review. Comput. Ind. 2019, 106, 111–132. [Google Scholar] [CrossRef]
Milosevic, Z. Addressing interoperability in E-health: An Australian Approach. In Proceedings of the 10th IEEE International Enterprise Distributed Object Computing Conference Workshops (EDOCW’06’), Hong Kong, China, 16–20 October 2006. [Google Scholar]
NEHTA. Interoperability Framework V2. 2007. Available online: https://www.ghdonline.org/uploads/NEHTA_2007_-_InteroperabilityFramework.pdf (accessed on 24 October 2019).
Fritzsche, D.; Grüninger, M.; Baclawski, K.; Bennett, M.; Berg-Cross, G.; Schneider, T.; Sriram, R.; Underwood, M.; Westerinen, A. Ontology Summit 2016 Communique: Ontologies within semantic interoperability ecosystems. Appl. Ontol. 2017, 12, 91–111. [Google Scholar] [CrossRef] [Green Version]
Field, C.B.; Barros, V.; Stocker, T.F.; Dahe, Q. Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation: Special Report of the Intergovernmental Panel on Climate Change; Cambridge University Press: New York, NY, USA, 2012. [Google Scholar]
Haigh, R.; Amaratunga, D.; Hemachandra, K. A capacity analysis framework for multi-hazard early warning in coastal communities. Procedia Eng. 2018, 212, 1139–1146. [Google Scholar] [CrossRef]
Al Hmoudi, A.; El, R.; Aziz, Z. Integrated elements of early warning systems to enhance disaster resilience in the Arab region. J. Geod. Geomat. Eng. 2015, 2, 73–81. [Google Scholar] [CrossRef]
Rogers, D.; Tsirkunov, V. Implementing Hazard Early Warning Systems. GFDRR WCIDS Report 11-03. 2011. Available online: https://www.preventionweb.net/files/24259_implementingearlywarningsystems1108.pdf (accessed on 17 September 2019).
Samarasundera, E.; Hansell, A.; Leibovici, D.; Horwell, C.J.; Anand, S.; Oppenheimer, C. Geological hazards: From early warning systems to public health toolkits. Health Place 2014, 30, 116–119. [Google Scholar] [CrossRef]
Yao, K. Open Government Data for Disaster Risk Reduction. In Proceedings of the Training Workshop on Knowledge and Policy Gaps in Disaster Risk Reduction and Development Planning, Bangkok, Thailand, 8–9 March 2016; pp. 8–9. [Google Scholar]
Li, G.; Zhao, J.; Murray, V.; Song, C.; Zhang, L. Gap analysis on open data interconnectivity for disaster risk research. Geo-Spat. Inf. Sci. 2019, 22, 45–58. [Google Scholar] [CrossRef] [Green Version]
Kucera, J.; Chlapek, D.; Klímek, J.; Necaskỳ, M. Methodologies and Best Practices for Open Data Publication; DATESO: Jičín, Czech Republic, 2015; pp. 52–64. [Google Scholar]
Mons, B.; Neylon, C.; Velterop, J.; Dumontier, M.; da Silva Santos, L.O.B.; Wilkinson, M.D. Cloudy, increasingly FAIR; revisiting the FAIR Data guiding principles for the European Open Science Cloud. Inf. Serv. Use 2017, 37, 49–56. [Google Scholar] [CrossRef] [Green Version]
Chen, D.; Daclin, N. Framework for enterprise interoperability. In Proceedings of the IFAC Workshop EI2N, Bordeaux, France, 22–24 March 2006; pp. 77–88. [Google Scholar]
Ford, T.C.; Colombi, J.M.; Graham, S.R.; Jacques, D.R. Survey on Interoperability Measurement. In Proceedings of the International Command and Control Research and Technology Symposium (ICCRTS), Newport, RI, USA, 18–21 June 2007. [Google Scholar]
Melnik, S.; Decker, S. A layered approach to information modeling and interoperability on the web. In Proceedings of the ECDL’00 Workshop on the Semantic Web, Lisbon, Portugal, 21 September 2000. [Google Scholar]
Commission, E. European Interoperability Framework—Implementation Strategy. 2017. Available online: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=COM:2017:134:FIN (accessed on 5 September 2019).
Wang, W.; Tolk, A.; Wang, W. The levels of conceptual interoperability model: Applying systems engineering principles to M&S. arXiv 2009, arXiv:0908.0191. [Google Scholar]
Rezaei, R.; Chiew, T.K.; Lee, S.P. A review on E-business Interoperability Frameworks. J. Syst. Softw. 2014, 93, 199–216. [Google Scholar] [CrossRef]
Janowicz, K.; Hitzler, P.; Adams, B.; Kolas, D.; Vardeman, I. Five stars of linked data vocabulary use. Semant. Web 2014, 5, 173–176. [Google Scholar]
Berners-Lee, T. Linked Data. Design Issues for the World Wide Web. World Wide Web Consortium, 2006. Available online: http://www.w3.org/DesignIssues/LinkedData.html? (accessed on 5 September 2019).
Moses, M.; Stevens, T.; Bax, G. GIS Data Interoperability in Uganda. Int. J. Spat. Data Infrastruct. Res. 2012, 7, 488–507. [Google Scholar]
Colpaert, P.; Van Compernolle, M.; De Vocht, L.; Dimou, A.; Vander Sande, M.; Verborgh, R.; Mechant, P.; Mannens, E. Quantifying the interoperability of open government datasets. Computer 2014, 47, 50–56. [Google Scholar] [CrossRef]
Yahia, E.; Lezoche, M.; Aubry, A.; Panetto, H. Semantics enactment for interoperability assessment in Enterprise Information Systems. Annu. Rev. Control 2012, 36, 101–117. [Google Scholar] [CrossRef]
Gamma, E.; Helm, R.; Johnson, R.E.; Vlissides, J. Design Patterns: Elements of Reusable Object-Oriented Software; Addison-Wesley: Reading, MA, USA, 1995. [Google Scholar]
Hay, D.C. Data Model Patterns: A Metadata Map; Elsevier: Amsterdam, The Netherlands, 2010. [Google Scholar]
Clark, P.; Thompson, J.; Porter, B. Knowledge Patterns. In Principles of Knowledge Representation and Reasoning; Springer: Berlin/Heidelberg, Germany, 2000; pp. 591–600. [Google Scholar]
Bröring, A.; Schmid, S.; Schindhelm, C.K.; Khelil, A.; Kabisch, S.; Kramer, D.; Le Phuoc, D.; Mitic, J.; Anicic, D.; Teniente López, E. Enabling IoT ecosystems through platform interoperability. IEEE Softw. 2017, 34, 54–61. [Google Scholar] [CrossRef]
Spalazzese, R.; Inverardi, P. Mediating connector patterns for components interoperability. In European Conference on Software Architecture; Springer: Berlin/Heidelberg, Germany, 2010; pp. 335–343. [Google Scholar]
Gangemi, A.; Presutti, V. Ontology design patterns. In Handbook on Ontologies; Springer: Berlin/Heidelberg, Germany, 2009; pp. 221–243. [Google Scholar]
Blomqvist, E.; Hitzler, P.; Janowicz, K.; Krisnadhi, A.; Narock, T.; Solanki, M. Considerations regarding Ontology Design Patterns. Semant. Web 2016, 7, 1–7. [Google Scholar] [CrossRef]
Verborgh, R.; De Roo, J. Drawing conclusions from linked data on the web: The EYE reasoner. IEEE Softw. 2015, 32, 23–27. [Google Scholar] [CrossRef]
Tešanovic, A. What Is a Pattern; Linköping Univeristy: Linköping, Sweden, 2004. [Google Scholar]
Dimitrova, V.; Denaux, R.; Hart, G.; Dolbear, C.; Holt, I.; Cohn, A.G. Involving domain experts in authoring OWL ontologies. In International Semantic Web Conference; Springer: Berlin/Heidelberg, Germany, 2008; pp. 1–16. [Google Scholar]
Hammar, K. Content Ontology Design Patterns: Qualities, Methods, and Tools; Linköping University Electronic Press: Linköping, Sweden, 2017; Volume 1879. [Google Scholar]
Rodrıguez-Doncel, V.; Krisnadhi, A.A.; Hitzler, P.; Cheatham, M.; Karima, N.; Amini, R. Pattern-based linked data publication: The linked chess dataset case. In Proceedings of the 6th International Workshop on Consuming Linked Data co-located with 14th International Semantic Web Conference (ISWC 2105), Bethlehem, PA, USA, 12 October 2015. [Google Scholar]
Taelman, R.; Van Herwegen, J.; Vander Sande, M.; Verborgh, R. Comunica: A modular SPARQL query engine for the web. In International Semantic Web Conference; Springer: Berlin/Heidelberg, Germany, 2018; pp. 239–255. [Google Scholar]
Ngomo, N. 9th Challenge on Question Answering over Linked Data (QALD-9). Semdeep/NLIWoD@ ISWC. 2018, pp. 58–64. Available online: ceur-ws.org/Vol-2241/paper-06.pdf (accessed on 5 September 2019).
Atzori, M.; Mazzeo, G.M.; Zaniolo, C. QA3: A natural language approach to question answering over RDF data cubes. Semant. Web 2019, 10, 587–604. [Google Scholar] [CrossRef]

Figure 1. General flow of methodology.

Figure 2. Dataset comparison (adopted from Colpaert et al. [27]).

Figure 3. Data extracts.

Figure 4. NTriple (N3) and ID generation from datasets in Figure 3.

Figure 5. Computing identifier interoperability [27].

Figure 6. Problem and solution matching resulting into reusable patterns, i.e., “interoperability patterns”.

Figure 7. Illustrative example of a modular pattern composition in an early warning system.

Table 1. Interoperability maturity metrics.

Technical Interoperability
T1:	Data is published in any format and accessed via a common protocol such as Http URL
Syntactic Interoperability
S1:	Data is available as structured data e.g., shapefile, excel instead of scanned image
S2:	Data is available in a machine readable non proprietary format e.g., CSV, XML, GML, RDF, OWL etc
Semantic Interoperability
Sem1:	Use URI to denote concepts for ensuring unique identification
Sem2:	Data is available with vocabularies that are linked
Organization Interoperability
O1:	Licence attached to data
O2:	Data can be accessed freely
O3:	Have institutional arrangements to share data
O4:	Uses a data policy that is binding to other stakeholders in the sector(one that is not only institutional)

Table 2. Interoperability maturity (see Table 1 and Table A2).

Interoperability Level	Technical	Syntax		Semantic		Organisation
Metric	T1	S1	S2	Sem1	Sem2	O1	O2	O3	O4
Respondent 1	1	1	0	0	0	0	0.5	1	0
Respondent 2	1	1	1	0	0	0	1	1	0
Respondent 3	1	1	1	0	0	1	1	1	0
Respondent 4	1	1	1	0	0	1	1	1	0
Respondent 5	1	1	1	0	0	0	0.5	1	0
Respondent 6	1	1	1	0	0	0	0.5	1	0
Respondent 7	0	1	1	0	0	0	0.5	1	0
Total	6	7	6	0	0	2	5	7	0
Average metric	86%	100%	86%	0%	0%	14%	71%	100%	0%
Maturity metric	86%	93%		0%		46%

Table 3. Interoperability identifier ratio with the DesInventar dataset as reference.

Dataset	Relevance of String Matching IDs	Relevance of Real-World Concepts	Interoperability Identifier	Remark
Landcover	0	0	0%	low
Rainfall data	20	20	100%	high
Emdat_impact data	209	300	69.7%	medium
Risk data	32	32	100%	high
Response data	270	330	81.8%	high
Poverty data	38	38	100%	high

Table 4. Emerging interoperability patterns.

Level	Interoperability Challenges	Recommended Solution	Emerging Patterns
General	1-Missing meta data 2-Missing, incomplete, and erroneous datasets 3-Data is not up to date 4-No/ immature standards 5-Duplication of data collection efforts.	1-Metadata documentation portal 2-Enriching and linking disaster/hazard related data to fill gaps and validation of data 3-Develop standards for sector data	1-Linked data patterns e.g., -Follow Your Nose pattern to find additional relevant data -Progressive Enrichment pattern for data quality -Missing Isn’t Broken pattern to handle messy and incomplete data
Technical	6-Data is not easily retrievable because it is scattered	4-Centralized access to repositories with linked documents	2-Federated queries pattern
Technical	7-Absence of directed alerts, notifications based sharing	8-Geo-tagged alerts, and notifications 9-Embrace changing open architectures	3-Broadcast pattern
Syntax	8-Lack of capacity in semantic mark-ups and technologies	10-Semantic systems that hide their complexity behind an easy to use and intuitive interface	–
Semantic	9-Terminology is not standard across datasets to be integrated	11-Unified vocabulary across domains e.g., glossary. However, the vocabulary should be expressive across different domains	4-Content Ontology design patterns (ODPs)
Organisation	10-Data is not easily accessed and retrieved but it is available in different institutions 11-Limited funding for data collection and management in the sector -	12-Mandated institutions should take lead in policy formulation 13-Develop policies that Open up government data (i.e. embrace Open Government Data (OGD) principles) 14-Government should invest in data management 15-Create awareness and disseminate available disaster resources 16-organizations should add licenses to data	5-Coordination and implementation pattern 6-Dissemination 7-privacy policy pattern 8-Policy harmonization(i.e. policy override and conflict mediation patterns) 9-Rights pattern

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mazimwe, A.; Hammouda, I.; Gidudu, A. An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda. ISPRS Int. J. Geo-Inf. 2019, 8, 484. https://doi.org/10.3390/ijgi8110484

AMA Style

Mazimwe A, Hammouda I, Gidudu A. An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda. ISPRS International Journal of Geo-Information. 2019; 8(11):484. https://doi.org/10.3390/ijgi8110484

Chicago/Turabian Style

Mazimwe, Allan, Imed Hammouda, and Anthony Gidudu. 2019. "An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda" ISPRS International Journal of Geo-Information 8, no. 11: 484. https://doi.org/10.3390/ijgi8110484

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda

Abstract

1. Introduction

2. Related Work

2.1. Disaster Data Interoperability

2.2. Interoperability Assessment

2.3. Interoperability Patterns

3. Materials and Methods

4. Results

4.1. To What Extent Is Disaster/Hazard-Related Data Interoperable among Stakeholders?

4.1.1. Technical Interoperability

4.1.2. Syntactic Interoperability

4.1.3. Organization/Legal Interoperability

4.1.4. Semantic Interoperability

4.2. Are There Emerging Patterns for Managing Data Interoperability in the Disaster Management Sector?

5. Discussion

5.1. Disaster Data Interoperability Evaluation

5.2. Interoperability Patterns

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Interoperability Maturity

Appendix A.1. Interoperability Maturity Metrics

Appendix A.2. Summary of Data Used for Measuring Maturity

Appendix B. Nature of Participants in the Study

Appendix B.1. Summary of Respondents that Participated in Interviews

Appendix B.2. Description of Institutions that Participated in Focused Group Discussions

Appendix C. Interoperability Patterns

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI