Semantic integration in healthcare networks

doi:10.1016/j.ijmedinf.2006.05.008

International Journal of Medical Informatics

Volume 76, Issues 2–3, February–March 2007, Pages 201-207

https://doi.org/10.1016/j.ijmedinf.2006.05.008 Get rights and content

Abstract

A seamless support of information flow for increasingly distributed healthcare processes requires to integrate heterogeneous IT systems into a comprehensive distributed information system. Different standards contribute to ease this integration. In a research project focussing on the development of a reference architecture for inter-institutional health information systems, we identified concurring standards currently in use. We therefore categorized these integration standards by distinguishing between technical and semantic integration on the one hand, and data and functional integration on the other hand. In addition, standards for semantic integration are roughly categorized according to their scope. By placing standards into a corresponding matrix a “semantic gap” is revealed, which cannot be covered by standards as it contains volatile medical concepts. As a conclusion, it is recommended to conceptually consider the necessity of system evolution in system architectures and also in future integration standards.

Introduction

Healthcare increasingly changes from isolated treatment episodes towards a continuous treatment process involving multiple healthcare professionals and various institutions. This change motivates comprehensive, inter-institutional IT support in health information systems and imposes new demanding requirements for IT [1]. IT applications should guide data acquisition in a way that data are placed in a meaningful context from the beginning, so that they are ready for reuse in different contexts without the need to manually index or transform the data. To achieve such an IT support, heterogeneous IT systems have to be integrated into a comprehensive distributed information system. Integrating autonomous software components, however, is a difficult task, as individual applications usually are not designed to cooperate. Applications are often based on differing conceptualizations of the application domain. Today powerful integration tools (e.g. application servers, object brokers, different kinds of message-oriented middleware, and workflow management systems [2]) are available to overcome technical and syntactical heterogeneity of autonomous system components. Yet, semantic heterogeneity remains as a major barrier to seamless integration of autonomously developed software components (cf. [3]). Semantic heterogeneity occurs when there is disagreement about the meaning, interpretation or intended use of the same or related data [4]. It occurs in different contexts, like database schema integration, ontology mapping, or integration of different terminologies. The underlying problems are more or less the same, though they are often complex and still poorly understood. Stonebraker characterizes disparate systems as “islands of information” and points out two major factors which aggravate systems integration [5]:

1.
Each island (i.e. application) will have its own meaning of enterprise objects.
2.
Each island will have data that overlaps data in other islands. This partial redundancy generates a serious data integrity problem.

Based on this statement, data integration can be led back to a mapping problem (how to map different conceptualizations in a semantically correct way) and a synchronization problem (how to ensure mutual consistency of redundant data which are stored in different databases under the control of autonomous applications). The mapping problem is essentially related to the schema integration problem of database systems, which has been extensively discussed in the database literature in recent years (e.g. [6], [7], [8], [9]). A major perception in data integration research has been that schema integration cannot be automated in general. In [10] it is stated: “The general problem of schema integration is undecidable.” Heiler states that “understanding data and software can never be fully automated” [11]. As a consequence, the process of schema integration always needs a human integrator for certain semantic decisions. Colomb even goes a step further by stating that there are cases where no consistent interpretation of heterogeneous sources is possible (“fundamental semantic heterogeneity”) [12]. In such cases one either has to accept a low degree of data quality, or systems have to be modified to resolve fundamental semantic heterogeneity.

In order to reduce the integration efforts caused by semantic heterogeneity standards for systems integration are needed. Moreover, as medicine is a rapidly evolving domain, concepts for system evolution are needed. Fortunately, there are already far reaching standards that support information interchange in the medical domain. Yet, healthcare software is still far away from plug and play compatibility, and systems integration is typically a difficult process. In a research project in which we focus on the development of a reference architecture for comprehensive information systems in healthcare networks [1], [13], we have identified concurring and semantically overlapping standards. To get an overview of the standards’ characteristics and interrelations, we have arranged them to a system of standards which we find to be helpful for architecture development.

Section snippets

Objectives

In this article we try to clarify how different standards contribute to systems integration by distinguishing different aspects and dimensions of integration. The objective of this approach is to identify and characterize the “semantic gap” which is not covered by current standards, and which is responsible for the high effort for systems integration. The goal of this clarification is to derive recommendations for future system architectures and standards development.

Methods

At a conceptual level, information systems are designed around three layers: presentation, application logic, and resource management [2]. According to this well known abstract model of information systems, we distinguished different aspects of integration: data integration, functional integration and presentation integration:

•
Data integration: we have already characterized semantic heterogeneity as the main cause for high integration efforts. We thereby focused on data integration. The reason

Results

XML and RDF are examples for standard syntactic frameworks supporting data integration [15]. Standards for semantic integration in healthcare are increasingly based on XML in order to improve syntactical compatibility with commonly accepted data processing formats.

Middleware standards typically provide a common infrastructure for interconnecting distributed software components. Such standards are primarily intended to provide programming abstractions, which help a programmer to easily bridge

Discussion and conclusions

Different kinds of standards are necessary to ease systems integration. In particular, both reference ontologies and application frameworks are needed to support semantic integration. Yet, standards should not try to comprehensively model an application domain, because systems must be capable to rapidly adapt to an evolving application domain. If IT systems should bring medical knowledge to the point of care they must be capable of incorporating the results of ongoing consensus processes among

References (55)

G.W. Beeler
HL7 version 3—an object-oriented methodology for collaborative standards development
Int. J. Med. Inform.
(1998)
P.E. Zanstra et al.
Coding systems and classifications in healthcare: the link to the record
Int. J. Med. Inform.
(1998)
J.F. Coyle et al.
Standards for detailed clinical models as the basis for medical data exchange and decision support
Int. J. Med. Inform.
(2003)
K.U. Heitmann et al.
Discharge and referral data exchange using global standards—the SCIPHOX project in Germany
Int. J. Med. Inform.
(2003)
R. Lenz et al.
Report of conference track. 2. Pathways to open architectures
Int. J. Med. Inform.
(2003)
P.A. de Clercq et al.
Approaches for creating computer-interpretable guidelines that facilitate decision support
Artif. Intell. Med.
(2004)
C.J. McDonald et al.
What is done, what is needed and what is realistic to expect from medical informatics standards
Int. J. Med. Inform.
(1998)
R. Lenz et al.
Towards a continuous evolution and adaptation of information systems in healthcare
Int. J. Med. Inform.
(2004)
R. Heeks
Health information systems: Failure, success and improvisation
Int. J. Med. Inform.
(2006)
J. Mykkanen et al.
A process for specifying integration for multi-tier applications in healthcare
Int. J. Med. Inform.
(2003)

B. Blobel et al.

Comparing middleware concepts for advanced healthcare system architectures

Int. J. Med. Inform.

(1997)

P. Ciccarese et al.

Architectures and tools for innovative Health Information Systems: the Guide Project

Int. J. Med. Inform.

(2005)

M. Beyer et al.

G. Alonso et al.

Web Services—Concepts, Architectures and Applications

(2003)

J.T. Pollock

The Web Services Scandal—How Data Semantics Have Been Overlooked in Integration Solutions

EAI J.

(2002)

A. Sheth et al.

Federated database systems for managing distributed, heterogeneous, and autonomous databases

ACM Comput. Surv.

(1990)

M. Stonebraker

Integrating islands of information

EAI J.

(1999)

S. Conrad

Schemaintegration-Integrationskonflikte, Lösungsansätze, aktuelle Herausforderungen

Informatik Forschung Entwicklung

(2002)

A. Bouguettaya et al.

Interconnecting Heterogeneous Information Systems

(1998)

E. Rahm et al.

A survey of approaches to automatic schema matching

VLDB J.

(2001)

C. Batini et al.

A comparative analysis of methodologies for database schema integration

ACM Comput. Surv.

(1986)

S. Heiler

Semantic interoperability

ACM Comput. Surv.

(1995)

R.M. Colomb

Impact of semantic heterogeneity on federating databases

Comput. J.

(1997)

R. Lenz et al.

Informationsintegration in Gesundheitsversorgungsnetzen-Herausforderungen an die Informatik

Inform. Spekt.

(2005)

B.T. Pille et al.

Application integration

H. Schöning

XML und Datenbanken—Konzepte und Systeme

(2003)

Cited by (68)

Simulation of patient flow in multiple healthcare units using process and data mining techniques for model identification
2018, Journal of Biomedical Informatics
Citation Excerpt :
The area gets rapid growth, with introduction of big data approaches and technologies [78–80] enabling multiple data sources to be considered for integration: EHR, recommendations of various levels (guidelines, standards, laws), population data (census, official reports), omics data, wearable devices, social media, human-generated data (surveys, self-reported data), financial data, pharmaceutical data, and many others. The integrated solutions often consider the issues of semantic integration [81,82], managing patient-centered data collections [53,83], building advanced tools for the analysis of available data [84,85], as well as general model-based [86,87] and workflow-based [88] integration. These approaches are used to identify models and support simulation (see, e.g., ISPOR Task Force Reports [89]) within different approaches (including model identification, calibration, verification, etc.).
An approach to building a hybrid simulation of patient flow is introduced with a combination of data-driven methods for automation of model identification. The approach is described with a conceptual framework and basic methods for combination of different techniques. The implementation of the proposed approach for simulation of the acute coronary syndrome (ACS) was developed and used in an experimental study.
A combination of data, text, process mining techniques, and machine learning approaches for the analysis of electronic health records (EHRs) with discrete-event simulation (DES) and queueing theory for the simulation of patient flow was proposed. The performed analysis of EHRs for ACS patients enabled identification of several classes of clinical pathways (CPs) which were used to implement a more realistic simulation of the patient flow. The developed solution was implemented using Python libraries (SimPy, SciPy, and others).
The proposed approach enables more a realistic and detailed simulation of the patient flow within a group of related departments. An experimental study shows an improved simulation of patient length of stay for ACS patient flow obtained from EHRs in Almazov National Medical Research Centre in Saint Petersburg, Russia.
The proposed approach, methods, and solutions provide a conceptual, methodological, and programming framework for the implementation of a simulation of complex and diverse scenarios within a flow of patients for different purposes: decision making, training, management optimization, and others.
A national standards-based assessment on functionality of electronic medical records systems used in Kenyan public-Sector health facilities
2017, International Journal of Medical Informatics
Citation Excerpt :
Much of recent investment in eHealth innovations has gone toward implementing stand-alone “first generation” electronic systems, rather than building linkages between systems [14,15]. Achieving interoperability involves addressing the complexities of both technical and semantic integration, attending both to data integration (via syntactic frameworks and semantic ontologies) and functional integration (via middleware and application frameworks) [16]. Foundational elements for successful interoperability include: governance structures to define EMRs architecture and to oversee adoption of international and local data standards; technical expertise to define unique identifier schema and core data sets; financial incentives to adopt standards-based approaches in designing software; structures which support local as well as shared “ownership” of data; policies for security and privacy of data; and trust-building and alignment with medical communities [14,17–19].
Variations in the functionality, content and form of electronic medical record systems (EMRs) challenge national roll-out of these systems as part of a national strategy to monitor HIV response. To enforce the EMRs minimum requirements for delivery of quality HIV services, the Kenya Ministry of Health (MoH) developed EMRs standards and guidelines. The standards guided the recommendation of EMRs that met a preset threshold for national roll-out.
Using a standards-based checklist, six review teams formed by the MoH EMRs Technical Working Group rated a total of 17 unique EMRs in 28 heath facilities selected by individual owners for their optimal EMR implementation. EMRs with an aggregate score of ≥60% against checklist criteria were identified by the MoH as suitable for upgrading and rollout to Kenyan public health facilities.
In Kenya, existing EMRs scored highly in health information and reporting (mean score = 71.8%), followed by security, system features, core clinical information, and order entry criteria (mean score = 58.1% − 55.9%), and lowest against clinical decision support (mean score = 17.6%) and interoperability criteria (mean score = 14.3%). Four EMRs met the 60.0% threshold: OpenMRS, IQ-Care, C-PAD and Funsoft. On the basis of the review, the MoH provided EMRs upgrade plans to owners of all the 17 systems reviewed.
The standards-based review in Kenya represents an effort to determine level of conformance to the EMRs standards and prioritize EMRs for enhancement and rollout. The results support concentrated use of resources towards development of the four recommended EMRs. Further review should be conducted to determine the effect of the EMR-specific upgrade plans on the other 13 EMRs that participated in the review exercise.
The interplay between global standards and local practice in nursing
2013, International Journal of Medical Informatics
The paper assesses the extent, form, and transformation of global nursing classifications (NANDA) in a nursing practice during a period of 5 years.
A longitudinal case study was used to trace implementation, adoption and use of nursing classifications as an integral part of an electronic nursing module. A mixed method of data collection was used, including semi-structured interviews, observation and document analysis.
A surprisingly high proportion of nursing diagnoses was consistent with the global standard, in spite of a gradual increase of user-generated concepts. This is elaborated more thoroughly through a co-constructing perspective, emphasizing how the global standard and the practice mutually shaped each other over several years.
Standardization is an iterative process that is performed in close relationship with practice. The mutual interrelation between formal classifications (NANDA) and local practices are co-constructed in a dynamic interplay that evolves over time. In such a process, the use of local classifications and local strategies can be a means to bridge the gap between these two extreme points.
Development and evaluation of SOA-based AAL services in real-life environments: A case study and lessons learned
2013, International Journal of Medical Informatics
Citation Excerpt :
The use case, feature and actor models represent best practice and could therefore be reused in other system designs and even used in standardization processes. This is inline with the ideas of Lenz, Beyer and Kuhn in [37] where they argue for a separation of domain concepts and system implementation: “in order to cope with domain evolution, modelling of domain concepts should be separated from IT system implementation. IT systems should be implemented by IT experts and medical knowledge should be modelled and maintained by domain experts.”
The proper use of ICT services can support seniors in living independently longer. While such services are starting to emerge, current proprietary solutions are often expensive, covering only isolated parts of seniors’ needs, and lack support for sharing information between services and between users. For developers, the challenge is that it is complex and time consuming to develop high quality, interoperable services, and new techniques are needed to simplify the development and reduce the development costs.
This paper provides the complete view of the experiences gained in the MPOWER project with respect to using model-driven development (MDD) techniques for Service Oriented Architecture (SOA) system development in the Ambient Assisted Living (AAL) domain.
To address this challenge, the approach of the European research project MPOWER (2006–2009) was to investigate and record the user needs, define a set of reusable software services based on these needs, and then implement pilot systems using these services. Further, a model-driven toolchain covering key development phases was developed to support software developers through this process. Evaluations were conducted both on the technical artefacts (methodology and tools), and on end user experience from using the pilot systems in trial sites.
The outcome of the work on the user needs is a knowledge base recorded as a Unified Modeling Language (UML) model. This comprehensive model describes actors, use cases, and features derived from these. The model further includes the design of a set of software services, including full trace information back to the features and use cases motivating their design. Based on the model, the services were implemented for use in Service Oriented Architecture (SOA) systems, and are publicly available as open source software. The services were successfully used in the realization of two pilot applications. There is therefore a direct and traceable link from the user needs of the elderly, through the service design knowledge base, to the service and pilot implementations.
The evaluation of the SOA approach on the developers in the project revealed that SOA is useful with respect to job performance and quality. Furthermore, they think SOA is easy to use and support development of AAL applications. An important finding is that the developers clearly report that they intend to use SOA in the future, but not for all type of projects. With respect to using model-driven development in web services design and implementation, the developers reported that it was useful. However, it is important that the code generated from the models is correct if the full potential of MDD should be achieved.
The pilots and their evaluation in the trial sites showed that the services of the platform are sufficient to create suitable systems for end users in the domain.
A SOA platform with a set of reusable domain services is a suitable foundation for more rapid development and tailoring of assisted living systems covering reoccurring needs among elderly users. It is feasible to realize a tool-chain for model-driven development of SOA applications in the AAL domain, and such a tool-chain can be accepted and found useful by software developers.
DEUS: Distributed Electronic Patient File Update System
2023, arXiv
Sharing biomedical data: Strengthening ai development in healthcare
2021, Healthcare (Switzerland)

View all citing articles on Scopus

View full text

Semantic integration in healthcare networks

Abstract

Introduction

Section snippets

Objectives

Methods

Results

Discussion and conclusions

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Artif. Intell. Med.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Int. J. Med. Inform.

Web Services—Concepts, Architectures and Applications

The Web Services Scandal—How Data Semantics Have Been Overlooked in Integration Solutions

EAI J.

Federated database systems for managing distributed, heterogeneous, and autonomous databases

ACM Comput. Surv.

Integrating islands of information

EAI J.

Schemaintegration-Integrationskonflikte, Lösungsansätze, aktuelle Herausforderungen

Informatik Forschung Entwicklung

Interconnecting Heterogeneous Information Systems

A survey of approaches to automatic schema matching

VLDB J.

A comparative analysis of methodologies for database schema integration

ACM Comput. Surv.

Semantic interoperability

ACM Comput. Surv.

Impact of semantic heterogeneity on federating databases

Comput. J.

Informationsintegration in Gesundheitsversorgungsnetzen-Herausforderungen an die Informatik

Inform. Spekt.

Application integration

XML und Datenbanken—Konzepte und Systeme