Semantic integration in healthcare networks

https://doi.org/10.1016/j.ijmedinf.2006.05.008Get rights and content

Abstract

A seamless support of information flow for increasingly distributed healthcare processes requires to integrate heterogeneous IT systems into a comprehensive distributed information system. Different standards contribute to ease this integration. In a research project focussing on the development of a reference architecture for inter-institutional health information systems, we identified concurring standards currently in use. We therefore categorized these integration standards by distinguishing between technical and semantic integration on the one hand, and data and functional integration on the other hand. In addition, standards for semantic integration are roughly categorized according to their scope. By placing standards into a corresponding matrix a “semantic gap” is revealed, which cannot be covered by standards as it contains volatile medical concepts. As a conclusion, it is recommended to conceptually consider the necessity of system evolution in system architectures and also in future integration standards.

Introduction

Healthcare increasingly changes from isolated treatment episodes towards a continuous treatment process involving multiple healthcare professionals and various institutions. This change motivates comprehensive, inter-institutional IT support in health information systems and imposes new demanding requirements for IT [1]. IT applications should guide data acquisition in a way that data are placed in a meaningful context from the beginning, so that they are ready for reuse in different contexts without the need to manually index or transform the data. To achieve such an IT support, heterogeneous IT systems have to be integrated into a comprehensive distributed information system. Integrating autonomous software components, however, is a difficult task, as individual applications usually are not designed to cooperate. Applications are often based on differing conceptualizations of the application domain. Today powerful integration tools (e.g. application servers, object brokers, different kinds of message-oriented middleware, and workflow management systems [2]) are available to overcome technical and syntactical heterogeneity of autonomous system components. Yet, semantic heterogeneity remains as a major barrier to seamless integration of autonomously developed software components (cf. [3]). Semantic heterogeneity occurs when there is disagreement about the meaning, interpretation or intended use of the same or related data [4]. It occurs in different contexts, like database schema integration, ontology mapping, or integration of different terminologies. The underlying problems are more or less the same, though they are often complex and still poorly understood. Stonebraker characterizes disparate systems as “islands of information” and points out two major factors which aggravate systems integration [5]:

  • 1.

    Each island (i.e. application) will have its own meaning of enterprise objects.

  • 2.

    Each island will have data that overlaps data in other islands. This partial redundancy generates a serious data integrity problem.

Based on this statement, data integration can be led back to a mapping problem (how to map different conceptualizations in a semantically correct way) and a synchronization problem (how to ensure mutual consistency of redundant data which are stored in different databases under the control of autonomous applications). The mapping problem is essentially related to the schema integration problem of database systems, which has been extensively discussed in the database literature in recent years (e.g. [6], [7], [8], [9]). A major perception in data integration research has been that schema integration cannot be automated in general. In [10] it is stated: “The general problem of schema integration is undecidable.” Heiler states that “understanding data and software can never be fully automated” [11]. As a consequence, the process of schema integration always needs a human integrator for certain semantic decisions. Colomb even goes a step further by stating that there are cases where no consistent interpretation of heterogeneous sources is possible (“fundamental semantic heterogeneity”) [12]. In such cases one either has to accept a low degree of data quality, or systems have to be modified to resolve fundamental semantic heterogeneity.

In order to reduce the integration efforts caused by semantic heterogeneity standards for systems integration are needed. Moreover, as medicine is a rapidly evolving domain, concepts for system evolution are needed. Fortunately, there are already far reaching standards that support information interchange in the medical domain. Yet, healthcare software is still far away from plug and play compatibility, and systems integration is typically a difficult process. In a research project in which we focus on the development of a reference architecture for comprehensive information systems in healthcare networks [1], [13], we have identified concurring and semantically overlapping standards. To get an overview of the standards’ characteristics and interrelations, we have arranged them to a system of standards which we find to be helpful for architecture development.

Section snippets

Objectives

In this article we try to clarify how different standards contribute to systems integration by distinguishing different aspects and dimensions of integration. The objective of this approach is to identify and characterize the “semantic gap” which is not covered by current standards, and which is responsible for the high effort for systems integration. The goal of this clarification is to derive recommendations for future system architectures and standards development.

Methods

At a conceptual level, information systems are designed around three layers: presentation, application logic, and resource management [2]. According to this well known abstract model of information systems, we distinguished different aspects of integration: data integration, functional integration and presentation integration:

  • Data integration: we have already characterized semantic heterogeneity as the main cause for high integration efforts. We thereby focused on data integration. The reason

Results

XML and RDF are examples for standard syntactic frameworks supporting data integration [15]. Standards for semantic integration in healthcare are increasingly based on XML in order to improve syntactical compatibility with commonly accepted data processing formats.

Middleware standards typically provide a common infrastructure for interconnecting distributed software components. Such standards are primarily intended to provide programming abstractions, which help a programmer to easily bridge

Discussion and conclusions

Different kinds of standards are necessary to ease systems integration. In particular, both reference ontologies and application frameworks are needed to support semantic integration. Yet, standards should not try to comprehensively model an application domain, because systems must be capable to rapidly adapt to an evolving application domain. If IT systems should bring medical knowledge to the point of care they must be capable of incorporating the results of ongoing consensus processes among

References (55)

  • B. Blobel et al.

    Comparing middleware concepts for advanced healthcare system architectures

    Int. J. Med. Inform.

    (1997)
  • P. Ciccarese et al.

    Architectures and tools for innovative Health Information Systems: the Guide Project

    Int. J. Med. Inform.

    (2005)
  • M. Beyer et al.
  • G. Alonso et al.

    Web Services—Concepts, Architectures and Applications

    (2003)
  • J.T. Pollock

    The Web Services Scandal—How Data Semantics Have Been Overlooked in Integration Solutions

    EAI J.

    (2002)
  • A. Sheth et al.

    Federated database systems for managing distributed, heterogeneous, and autonomous databases

    ACM Comput. Surv.

    (1990)
  • M. Stonebraker

    Integrating islands of information

    EAI J.

    (1999)
  • S. Conrad

    Schemaintegration-Integrationskonflikte, Lösungsansätze, aktuelle Herausforderungen

    Informatik Forschung Entwicklung

    (2002)
  • A. Bouguettaya et al.

    Interconnecting Heterogeneous Information Systems

    (1998)
  • E. Rahm et al.

    A survey of approaches to automatic schema matching

    VLDB J.

    (2001)
  • C. Batini et al.

    A comparative analysis of methodologies for database schema integration

    ACM Comput. Surv.

    (1986)
  • S. Heiler

    Semantic interoperability

    ACM Comput. Surv.

    (1995)
  • R.M. Colomb

    Impact of semantic heterogeneity on federating databases

    Comput. J.

    (1997)
  • R. Lenz et al.

    Informationsintegration in Gesundheitsversorgungsnetzen-Herausforderungen an die Informatik

    Inform. Spekt.

    (2005)
  • B.T. Pille et al.

    Application integration

  • H. Schöning

    XML und Datenbanken—Konzepte und Systeme

    (2003)
  • Cited by (68)

    • Simulation of patient flow in multiple healthcare units using process and data mining techniques for model identification

      2018, Journal of Biomedical Informatics
      Citation Excerpt :

      The area gets rapid growth, with introduction of big data approaches and technologies [78–80] enabling multiple data sources to be considered for integration: EHR, recommendations of various levels (guidelines, standards, laws), population data (census, official reports), omics data, wearable devices, social media, human-generated data (surveys, self-reported data), financial data, pharmaceutical data, and many others. The integrated solutions often consider the issues of semantic integration [81,82], managing patient-centered data collections [53,83], building advanced tools for the analysis of available data [84,85], as well as general model-based [86,87] and workflow-based [88] integration. These approaches are used to identify models and support simulation (see, e.g., ISPOR Task Force Reports [89]) within different approaches (including model identification, calibration, verification, etc.).

    • A national standards-based assessment on functionality of electronic medical records systems used in Kenyan public-Sector health facilities

      2017, International Journal of Medical Informatics
      Citation Excerpt :

      Much of recent investment in eHealth innovations has gone toward implementing stand-alone “first generation” electronic systems, rather than building linkages between systems [14,15]. Achieving interoperability involves addressing the complexities of both technical and semantic integration, attending both to data integration (via syntactic frameworks and semantic ontologies) and functional integration (via middleware and application frameworks) [16]. Foundational elements for successful interoperability include: governance structures to define EMRs architecture and to oversee adoption of international and local data standards; technical expertise to define unique identifier schema and core data sets; financial incentives to adopt standards-based approaches in designing software; structures which support local as well as shared “ownership” of data; policies for security and privacy of data; and trust-building and alignment with medical communities [14,17–19].

    • The interplay between global standards and local practice in nursing

      2013, International Journal of Medical Informatics
    • Development and evaluation of SOA-based AAL services in real-life environments: A case study and lessons learned

      2013, International Journal of Medical Informatics
      Citation Excerpt :

      The use case, feature and actor models represent best practice and could therefore be reused in other system designs and even used in standardization processes. This is inline with the ideas of Lenz, Beyer and Kuhn in [37] where they argue for a separation of domain concepts and system implementation: “in order to cope with domain evolution, modelling of domain concepts should be separated from IT system implementation. IT systems should be implemented by IT experts and medical knowledge should be modelled and maintained by domain experts.”

    View all citing articles on Scopus
    View full text