Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges

Sachdeva, Shelly; Bhalla, Subhash

doi:10.3390/info13020052

Open AccessArticle

Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges

by

Shelly Sachdeva

^1,†

and

Subhash Bhalla

^2,*,†

¹

Department of Computer Science Engineering, National Institute of Technology Delhi, Delhi 110040, India

²

Department of Computer Science, University of Aizu, Fukushima 965-8580, Japan

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Information 2022, 13(2), 52; https://doi.org/10.3390/info13020052

Submission received: 30 November 2021 / Revised: 10 January 2022 / Accepted: 17 January 2022 / Published: 21 January 2022

(This article belongs to the Special Issue Knowledge Graph Technology and Its Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Information sharing across medical institutions is restricted to information exchange between specific partners. The lifelong electronic health records (EHR) structure and content require standardization efforts. The existing standards such as openEHR, HL7, and CEN TC251 EN 13606 (Technical committee on Health Informatics of the European Committee for Standardization) aim to achieve data independence along with semantic interoperability. This study aims to discover knowledge representation to achieve semantic health data exchange. OpenEHR and CEN TC251 EN 13606 use archetype-based technology for semantic interoperability. The HL7 Clinical Document Architecture is on its way to adopting this through HL7 templates. Archetypes are the basis for knowledge-based systems as these are means to define clinical knowledge. The paper examines a set of formalisms for the suitability of describing, representing, and reasoning about archetypes. Each of the information exchange technologies such as XML, Web Ontology Language, Object Constraint Language, and Knowledge Interchange Format is evaluated as a part of the knowledge representation experiment. These examine the representation of Archetypes as described by Archetype Definition Language. The evaluation maintains a clear focus on the syntactic and semantic transformations among different EHR standards.

Keywords:

archetypes; electronic health records; dual-model approach; knowledge representation; EHR; XML; ADL; OWL

1. Introduction

Health care is a continuously evolving domain. New findings of diseases and clinical treatments are continuously being made. It has raised the need for increased information exchange among various medical institutions. Electronic Health Records (EHRs) contain the medical history and treatments of the patients. In the classical approach, the information and knowledge are stored together. Storage of each clinical concept in a single relation led to a huge data model that was difficult to manage and expensive to maintain. Among the existing interoperability approaches for the EHRs, the dual-model approach [1] seems to be most promising. It consists of an information layer and a knowledge layer. The key feature of this approach is the segregation of knowledge (represented as Archetypes [2]). A conceptual idea is virtually transferred through the medium of an intermediate structure of a knowledge graph. This study analyzes its components.

Knowledge graphs are used to capture knowledge in application situations that need large-scale integration, management, and extraction of value from a variety of data sources. Recent studies examine all currently available knowledge graphs (KGs), including their characteristics, approaches, applications, issues, and challenges [3,4].

This paper focuses on using knowledge representation and information interchange technologies for archetype representation.

1.1. Overview Electronic Health Records

The EHRs domain is very complex. It consists of different types of data (textual to multimedia) with new data requirements emerging over time. For example, there are about 300,000 terms at present (defined in SNOMED-CT), including medical tests and procedures created and modified constantly. EHRs have a complex structure based on archetypes. These may include data from about 100–200 parameters, such as temperature, blood pressure, and body mass index. Each of the individual parameters has its specific contents. Each of the individual parameters (concepts) is represented as archetypes. For example, an archetype contains an item, such as ‘data’ (e.g., captured for a heart rate observation). It offers complete knowledge about a clinical context (i.e., attributes of data), ‘state’ (context for interpretation of data), and ‘protocol’ (information regarding the gathering of data), as shown in Appendix A. Various standard organizations are working for interoperable semantic EHRs. For example, Health Level 7 (HL7), European Committee of standardization Technical committee 251, International standard organization (ISO), and openEHR.

It is desirable to have EHR systems that are functionally interoperable systems and semantically interoperable systems. Interoperability can be defined as an ability to communicate data such that the data are sufficient to perform the tasks at the receiving system, the data items associated have the same meaning for the creator of the sending party and the users of the receiving party, also the task performed must be to the satisfaction of the receiving party. To tackle the EHR interoperability problem, many authorized organizations have defined several standards, such as the Health Level 7 (HL7) Clinical Document Architecture (CDA), Continuity of Care Record (CCR), CEN EN 13606 EHRcom, and openEHR. The main objective of all these EHR standards is to structure the data and markup the content of the medical information to be exchanged.

The three levels of interoperability namely: syntactic (data) interoperability, structural interoperability/semantic interpretability, and semantic interoperability are explained in detail [5]. The main mechanisms for interoperability are reference model, archetypes, and domain knowledge governance.

1.2. Archetypes and Semantic Interoperability

The standards for semantic interoperability (such as HL-7 CDA, openEHR, CEN ISO 13606) endorse the two-level modeling approach for storing EHRs [6]. It consists of two layers that propose to segregate information modeling from content (knowledge) modeling. The reference (information) model layer represents the generic structures of components of the healthcare data. The content model on the other hand is used to represent more domain-specific data, which in general have instability due to variability and high rate of change in the usage (e.g., formal description of physical examination or prescription).

In openEHR and CEN13606, the first level is known as the reference model and the second level consists of archetypes. The reference model (RM) defines the basic fundamental structure and represents the generic structures of components of the healthcare data at the storage level (information modeling). At the second layer, the archetype model (AM) constraints the generic structure to encompass logical semantics and thus, provide a standard definition that aids in semantic interoperability. AM provides deliverables in the form of archetypes and templates. An archetype provides the meta-description of structured clinical records as a computable formalism. In HL7, the two levels are Reference Information Model (RIM) and the HL7 templates. The HL7 Clinical Document Architecture templates are the same as the archetype concept.

These standards support compatibility among each other. In the openEHR/ CEN 13606 the only means of achieving interoperability with a generic information model is the archetypes. In fact, ADL archetypes can be defined against any Unified Modeling Language (UML) model, and it is also possible to write the archetypes against the HL7 version 3 Reference Information Model and also the HL7 Clinical Document Architecture. Ref. [7] describes how the clinical statements of two different EHR standards derived from the same RIM can be mapped to each other by using archetypes, Refined Message Information Model derivations, and semantic tools.

An archetype is an agreed formal and interoperable specification of a re-usable clinical data set that underpins an EHR. It captures the maximum possible information about a particular and discrete clinical concept [2]. A conceptual definition of data as archetypes can be developed in terms of constraints on structure, types, values, and behaviors of reference model classes based on the dual-model approach. It consists of the knowledge layer as archetypes and a reference model. An example of a simple archetype is WEIGHT, which can be used in multiple places, wherever is required within an EHR. Semantics in archetypes have dual nature. It consists of structural and terminological components:

1.: The structure of archetype provides support for semantics.
2.: EHR component links form a set of interrelated conceptual, clinical entities. Each entity has a set of terminological bindings associated (specified by links to terms of specific medical terminologies).

If data elements are created and modified using archetypes, the archetypes constrain the configuration of data instances to be valid according to the archetype. These are a paradigm for building semantically enabled software systems, providing data validation, clinical modeling (by domain experts), a basis for querying, and form design. An archetype might define or constrain relationships between data values within a data structure. These are expressed as algorithms, formulae, or rules. Its metadata defines its core concept, purpose, use, evidence, authorship, and versioning. An archetype ensures a maximal dataset. It contains all the relevant information regarding a clinical concept. Once the format of an archetype is agreed upon and published, it is held in a ‘library’ and made available for use in any part of a given application by multiple vendor systems, multiple institutions, and multiple geographical regions. Each group or entity using the same archetype will understand and compute data captured by the same archetype in another clinical environment. Thus, an archetype serves the following key purposes:

1.: It allows domain experts (clinicians) to capture data for their information systems;
2.: It provides runtime validation of data input, thus improving data entry quality; and
3.: It provides a basis for intelligent querying of data [8,9]

1.3. Representing Internal Data in Archetypes

Matching clinical data to codes in controlled terminologies is the first step towards achieving data standardization for safe and accurate data interoperability. Archetypes have the feature to separate the internal model data from formal terminologies. Existing terminologies, taxonomies, and ontologies have been written in many languages. For example, MeSH (Medical Subject Heading) [10] and NCI (National Cancer Institute) [11] have their own proprietary formalisms (now commonly expressed also in XML). The ’term binding’ section of the archetype is used to describe the equivalences between archetype local terms and terms found in external terminologies, such as SNOMED-CT (Systematized Nomenclature of Medicine-Clinical Terms) or Unified Medical Language System (UMLS). The internal data are assigned local names, later bound or mapped to external terminology codes. This feature eliminates the need to make changes to the model whenever the terminology changes. For formal description, the Archetype Definition Language (ADL) uses three other syntaxes, cADL (constraint form of ADL), dADL (data definition form of ADL), and a version of first-order predicate logic (FOPL), to describe constraints on data, which are instances of some information model (e.g., expressed in UML) [12]. Thus, ADL can be used to write archetypes for any domain where formal object model(s) exist, which describe data instances.

1.4. EHRs: Data Modeling

The openEHR architecture [1] includes a design principle called ‘Ontological separation’, which regulates the EHR modeling; see Figure 1. The model consists of two main categories entitled ‘ontologies of information and ’ontologies of reality’. The ‘ontologies of information’ contain the information models of the EHR content, whereas the ‘ontologies of reality’ describe real phenomena with descriptions and classifications.

The ‘ontologies of information’ are divided into:

1.: ‘Domain content models’ (knowledge models) containing formal definitions of the clinical content. These are developed using archetypes, which are designed such that, these can change when new clinical needs arise.
2.: ‘Information representation models’ are implemented in the electronic health care systems software. These are used as a foundation for the domain content models and are designed to be stable regarding model changes. In openEHR, this component is named the reference model (RM).

In simpler terms, if RM is equivalent to the set of letter/digits, then each and every archetype would be a grammar constraining which strings could be expressed for that archetype. Thus, in formal terms,

RM = {Set of classes C1, C2, C3 ...Cn}.

Archetype = {Set of rules for valid combination of classes of RM}.

The ‘ontologies of reality’ contains examples:

1.: Classifications: ICDx(International Classification of Diseases) and ICPC(International Classification for Primary Care).
2.: Process descriptions: Clinical guidelines.
3.: Descriptive terminologies: SNOMED-CT, LONIC.

In Figure 1, the EHR extracts are based on commonly shared archetypes. These are proposed as a means to exchange information between different health care providers [1]. The semantics of the domain content models (e.g., archetypes) are provided by terminology binding. The meaning of nodes in archetypes is given by textual descriptions and references to external terminology systems. These are in the form of term definition and term binding. Representation of archetypes in various possibilities such as XML (Extensive Markup Language), ADL (Archetype Definition Language), and OWL (Web Ontology Language) has been described in the paper. Subsequently, a comparison is drawn amongst these information exchange technologies, and the advantages and disadvantages of each have been analyzed. The main aim of the study is to find the best representation of an archetype. The paper examines a set of formalisms for the suitability of describing, representing, and reasoning about archetypes.

2. Formalisms in Transformation of Archetypes among Different Standards: Background

The prevalent standards such as HL7 Clinical Document Architecture (CDA), openEHR, and CEN TC251 EN 13606 use archetype-based technology for semantically interoperable exchange. ADL archetypes can be written against any UML model, and it would be possible to write archetypes directly against the HL7v3 RIM (Reference Information Model) [13], and also the CDA specification [14] (using a UML expression derived from its XML schema). These standards define the structure and the markup of the clinical content to make EHR exchange interoperable. They all rely on dual-model technology for semantic interoperability. They have different classes in the reference model having abstract semantics. Although the names of the classes are not shared between these standards, their semantics are similar. The RM for all the standards is stable. If alignments are performed between archetypes of different standards, aligning algorithms based on similarity measures will fail, as class names (of RM) are disparate. Dictionary-based approaches will not be of much help, as all names are quite abstract [15]. The various ongoing exchange efforts based on archetypes use different formalisms to represent archetypes. The mapping among different standards makes use of model management and uses OWL transformations and XML representation of archetypes.

For interoperability among EHR standards based on ontologies, the possible approaches are (i) by building common ontology and (ii) by reusing existing ontologies and combining them.

The first approach requires the transformation of ADL archetypes into OWL [16,17]. The ontological information and archetype models have been compared to find similarities and differences among the CEN and openEHR representations. The software tool available based on this approach is Poseacle convertor [18]. This approach involves the use of XML and OWL representations. The latter is obtained by reusing existing ontologies and combining them through ontology mapping. The ARTEMIS [19] project was implemented based on this. The various formalisms such as XML, ADL, and OWL have been used [19].

It has been found that archetypes must be agreed upon before communication. However, it does not seem feasible to expect all professionals of various disciplines to agree on exactly all details of the archetypes associated with the data they would like to exchange. If this approach becomes widely accepted, it is certain that the number of available archetypes will become very large. Although archetypes are annotated with terms from standardized ontologies (terminologies, taxonomies, etc.), there will still be differences at the archetype level and at the terminology level. Due to competing standards, local variations at the archetype level will stem from the specialization of archetypes for specific purposes and research projects. Further, several widely used terminologies could be used to annotate archetypes (e.g., SNOMED-CT, MeSH, NCI). Local ontologies are also used to annotate archetypes; therefore, a sound and general process for matching archetypes are essential. A unified framework based on a fuzzy ontology was proposed to show how to exploit semantic web technologies to support EHR semantic interoperability [20]. Costa et al. introduced ontologies and rules as a means of establishing interoperability amongst heterogeneous health systems (openEHR and HL7) [21]. OmniPHR interoperable model is proposed for assessing the structure of semantic interoperability and database integration from various health standards [22]. OmniPHR uses artificial intelligence, Natural Language Processing, and a standard ontology to achieve interoperability. Knowledge graphs are represented using a variety of methodologies, but machine learning methods are frequently employed to create a low-dimensional representation that can support a wide range of applications [23].

Thus, there is a requirement to identify the role of the suitability of various formalisms for achieving full semantic interoperability through archetypes, which has been addressed in the current research.

3. Knowledge Representation

Ontologies (31 percent), semantic web-related formalisms (26 percent), decision tables and rules (19 percent), logic (14 percent), and probabilistic models (10 percent) are the most general knowledge representation approaches. Knowledge is primarily represented in medical informatics through computer interpretable clinical recommendations (43%), medical domain ontologies (26%), and electronic health care records (22%) [24]. Embedding codes can convey the meaning of the concepts that are represented by an archetype from a commonly recognized terminology at appropriate points in the archetype. Archetypes are the unit of communication between interoperating applications, as they define the minimum context that must be considered for safe communication [15]. Expressivity is a key parameter in choosing or creating a knowledge representation. It is easier and more compact to express a fact or element of knowledge within the semantics and grammar of a more expressive knowledge representation. However, more expressive languages will likely require more complex logic and algorithms to construct equivalent inferences. A highly expressive knowledge representation is also less likely to be complete and consistent. Less expressive KRs may be both complete and consistent [25]. This section describes the knowledge technologies for the representation of archetypes.

3.1. ADL and XML

The ADL approach uses existing UML semantics and existing terminologies and adds convenient syntax for expressing the required constraints. Expressing the semantics of archetypes using XML-based exchange formats leads to the conflation of abstract and concrete representational semantics [12]. ADL syntax is straightforward and powerful. It has allowed mappings to other formalisms to be more correctly defined and understood. Previously, archetypes have been expressed as XML instance documents conforming to W3C XML schemas [26], for example, in the Good Electronic Health Record (GEHR) [27] and openEHR projects. Subsequently, expressing archetype constraints using numerous schema languages for XML (such as XML schema, RELAX NG, and Schematron) has been examined. Because of the issues reported, these languages were abandoned for archetype validation [28,29]. For example, in XML schema, classes in RM were mapped to complex types, and archetypes were mapped to class restrictions. The strict rules (unique particle attribution, complex enumerations, placing regular expression constraint) in using the restriction feature in XML schema did not permit the implementation of archetype constraints.

With ADL parsing tools, it is possible to convert ADL to any number of forms, including various XML formats. XML instances can be generated from the object form of an archetype in memory. XML archetypes are equivalent to serialized instances of the parse tree, i.e., particular ADL archetypes serialized from objects into XML instances. Archetypes connect information structures to formal terminologies. They are path-addressable similar to XML data, using path expressions directly convertible to Xpath expressions. An XML schema corresponding to the ADL object model has been published at openEHR.org [30]. The XML schema corresponding to RM is published at [21].

The XML has a role in the exchange requirement. The EHR_Extract, used for exchange, is expressed using XML [1]. A recent study examines whether W3C XML schema provides a practicable solution for the semantic validation of standard-based EHR documents [28]. The EHR_Extract needs to be validated against the RM and the associated archetypes.

Example of XML/ADL use in openEHR: To accept a report from pathology laboratory for inclusion in the EHR repository of a patient (in the ADL form), an XML form is generated using the archetype. This form is shared with the laboratory for on-site validation of data input. Thus, XML is used as an input and transport medium.

The comparison between ADL and XML is shown below and Appendix B.

1.: Both are machine processable.
2.: ADL is human readable, whereas XML is sometimes unreadable (e.g., XML schema instance, OWL-RDF ontologies).
3.: ADL adheres to object-oriented semantics, particularly for container types, whereas XML schema languages do not follow object-oriented semantics.
4.: For ontological reference, ADL has domain entities/archetypes, and XML has global terms/concepts.
5.: ADL uses attributes, and XML uses attributes and sub-elements to represent object properties.
6.: ADL uses nearly half of space (storage) for tags, XML may have data redundancy in contents.
7.: In terms of efficiency, ADL is a domain-specific language (sufficiently rich to capture and model medical domain) in comparison to XML, which is good for web document modeling with limited ability to represent database contents.

3.2. ADL and OWL

Web Ontology Language (OWL) [31] is a language for the semantic web, which allows:

1.: Offering expressiveness and the possibility of reasoning over the information it describes.
2.: Allows making annotations on classes or properties, and semantic similarity functions are available.
3.: OWL is related to terminologies. For example, SNOMED-CT is currently in the process of adapting its representation to semantic web environments [32]. Having a representation of both the clinical and terminological information in the same formalism would facilitate better clinical knowledge management and would enrich archetypes by adding more information to them.
4.: OWL modeling brings all information concerning a particular term together (e.g., code, definition, bindings, and translations).
5.: There exists several tools to process it.

OWL has an abstract syntax, an extension of XML-based syntax known as the resource description framework (RDF). OWL is general-purpose description logic (DL) and is primarily used to describe “classes” of things in such a way as to support subsumptive inferencing within the ontology, and by extension, on data that are instances of ontology classes. The OWL includes intersection, union, complement of existential quantification, universal quantification, min cardinality, max cardinality, equivalence, and specialization. Thus, it has a decidability and computability nature, offering expressiveness and the possibility of reasoning over the information it describes. OWL version 2 has a property of qualified cardinality restrictions (which makes it possible to capture the occurrences restrictions from ADL), property chain inclusions, and OWL/XML syntax.

ADL provides a rich set of constraints on primitive types, including dates and times; however, ADL has significant drawbacks for achieving the goal of semantic interoperability, such as its syntactic orientation. Consequently, the formalization of the exchange and transformation processes is more than using semantic-oriented models such as ontological ones. In addition to this, syntactic approaches also make important archetype-related tasks, such as comparing and classifying archetypes, difficult. Archetypes in ADL can be represented in OWL. It requires [12]:

1.: Express the relevant reference models in OWL.
2.: Express the relevant terminologies in OWL.
3.: Be able to represent concepts (i.e., constraints) independently of natural language.
4.: Convert the cADL part of an archetype to OWL.

Using OWL-expressed archetypes to validate data (which would require massive amounts of data to be converted to OWL statements) is unlikely to be anywhere near as efficient as doing it with archetypes expressed in ADL or one of its concrete expressions [12]. The UML-like representation is not suitable for performing formal reasoning at the conceptual level; however, OWL offers a great deal of inferencing power of the far wider scope in comparison to specific reasoning.

The object-oriented semantics apply in the UML specification of RM and the corresponding XML schema mapped from it. This needs to be mapped for conversion to OWL. Practically, this is possible, as illustrated in a recent study [16]. The XML-schema class is mapped to OWL class with the same name. The restricted data type definitions in XML schema are mapped to OWL data type. The attributes in XML-schema mapped to OWL property. The instances/objects in XML are equivalent to individuals in OWL.

Reference [33] provides mapping archetypes to OWL. For automatically transforming archetype definitions from ADL to OWL, the archetype ontologies use the ehr2ont framework [34]. The LinkEHR [35] is also a recent project for obtaining the OWL representation of archetypes.

3.3. ADL and Other Formalisms

A comparative analysis of ADL with other formalisms such as the Object Constraint Language (OCL) and Knowledge Interchange Format (KIF) is shown in Section 4.3. The OCL allows constraining all class instances to conform to the specific configuration of instances. In contrast, ADL provides the ability to create numerous archetypes, each describing in detail a concrete configuration of instances of a class. ADL archetypes include invariants (which are expressed in a syntax similar to OCL).

The Resource Description Framework (RDF) is a data model for objects (“resources”) and relations between them provide a simple semantics for this data model [36]. OWL takes the essential fact-stating ability of RDF and the class- and property-structuring capabilities of RDF schema and extends them in essential ways. OWL classes can be specified as logical combinations (intersections, unions, or complements) of other classes, or as enumerations of specified objects, going beyond the capabilities of the RDF schema. OWL can declare classes and organize these classes in a subsumption (“subclass”) hierarchy, as an RDF schema. The significant extension over RDF schema is the ability in OWL to provide restrictions on how properties behave (that are local to a class). A recent study concentrates on the semantic interoperability of diverse EHRs and their standards, proposing the transformation of heterogeneous EHR datasets into XML syntactic models and their translation into a common ontological representation for semantic knowledge acquisition [37].

4. Evaluation in Practice

We ran into many problems concerning alternative representations of the archetype and the inability to express some of the constraints in different knowledge representation languages. We detail these problems as a cautionary tale to others planning to use pre-existing archetypes for semantic interoperability, as a list of issues to consider when describing concepts formally in any language, and as a collection of criteria for evaluating alternative representations.

The three best-suited knowledge formalisms ADL, XML, and OWL for archetype representation have been evaluated for the features mentioned in the paper through the experimental setup KnowledgeRep. The ADL formalism has been compared with XML and OWL to find the suitable role of these formalisms in a working semantic interoperable system.

4.1. KnowledgeRep: Simulation

Various knowledge formalisms used in different EHR standards for obtaining semantic interoperability have been examined in current research with the OpenEHR standard. The underlying Java Reference Implementation [19] of OpenEHR has been used. It has been named as KnowledgeRep. It is an evaluation setup for examining various archetype representations for EHR concepts. Figure 2, shows an application GUI (graphical user interface), the knowledge representations, and the communication classes. The application GUI represents various system interactions between the users and the EHR system. It is responsible for multiple calls to the utility and server-side classes to perform the formatting and transformations of the data and extract relevant data. It comprises the following modules: the form presented to the user, the template mechanism required to create the template corresponding to the EHR system, the data binding module, and the result viewer, which represents the report desired by the user. GWT facilitates all these modules. It introduces the necessary flexibility into the model-driven approach of standard-based EHR system development.

The knowledge representation (archetypes) in ADL, XML, and OWL formalisms and the communication classes are shown in the second part of the diagram. It contains components that are responsible for communication. It also contains logic that connects the GUI with the underlying java reference implementation of OpenEHR. The entire set of knowledge is parsed from ADL/XML/OWL files using the classes of RM and mapped to the GUI (form). The knowledge representation and communication part of the setup comprises the ArchetypeWrapper and DatabaseWrapper classes, which act as the communication nodes. The ArchetypeDAO maps the database and the basic RM data types. The clinical software database is shown (the details are beyond the scope of current research).

4.2. Methodology

Here we describe the analysis procedure applied for each formalism, ADL, XML, and OWL. The archetypes used for the analysis have been downloaded from CKM [38], Appendix C. Each knowledge representation has been analyzed from the perspective of how strong it is to support the client side and the server side of the EHR system. In other words, we have tried to explore knowledge representation from a user’s perspective and the machine’s perspective.

4.2.1. Representation in ADL

The analysis of the ADL was performed as given. A user selects a form to make a data entry, the corresponding .adl file is picked up from the archetype repository, parsed to extract all the node paths of the mandatory fields, and then a form is generated. When the user enters the value of any of the fields in the archetype, they are bonded with the exact path in the .adl file. We see that the ADL formalism is a powerful knowledge representation that can act as a GUI generator. At the same time, when the data are to be persisted, a user inputs to the corresponding archetype are mapped to the RM data types by intermediate classes and stored in the database. Along with the data type and the data value, the path of the node in the adl file and the name of the adl file are also persisted—this aids in the retrieval of the data from the database. Whenever the data are to be represented as a report, the data value is picked, mapped to the node whose path corresponds to it in the database, and then represented in the form corresponding to the archetype. Thus, ADL formalism plays a major role in data persistence in this case. From the above analysis, we can conclude that ADL, on the one hand, provides the object-oriented semantics and, on the other hand, is machine processable; thus, it plays a significant role in both the client side and backend of an EHR system; therefore, it is a powerful knowledge formalism.

However, the parsing of the ADL archetype returns objects according to the archetype object model. The archetype object model (AOM) is the definitive formal representation of archetype semantics. It is independent of syntax. The primary goal of the AOM specification is to tell developers how to build archetype tools and EHR components that utilize archetypes. It can be used to generate the output side of parsers that process archetypes in a linguistic format, such as the openEHR Archetype Definition Language (ADL). The semantics defined in the AOM is used to express the object structures of archetypes. These objects are equivalent to a syntax tree. The archetype object model can be thought of as a model of an in-memory archetype or template, or as a standard syntax tree for any serialized format—not only ADL. An archetype’s canonical abstract syntactic form is ADL, although it may also be parsed from and serialized to XML, JSON, or any other format. Calls to an appropriate AOM construction API from an archetype or template editing tool can also produce the in-memory archetype representation. This model is common to all dual-model-based standards; it will have no information about the particular reference model for which the archetype was built. Thus, the obtained objects cannot be used to perform any semantic activity such as comparison, selection, or classification.

4.2.2. Representation in XML

For experimental analysis, an XML representation is automatically generated by the archetype editor is being used. This XML format conforms to the archetype constraint classes defined in the OpenEHR RM, can be directly imported into the OpenEHR RM to initialize the relevant archetypes. Thus, the XML format is again generated via the archetype editor from a corresponding ADL counterpart. Each piece of data can subsequently be referenced using its XML path. For example, the patient’s first name (Paul) could be referenced as “/record/name/firstname”. The drawback of this formalism can be tracked in its inefficiency to represent the archetype constraints completely. It is a suitable formalism for transforming and exchanging data from one form to another.

4.2.3. Representation in OWL

The semantic web enables greater access not only to content but also to services on the Web [39]. Users and software agents should discover, invoke, compose, and monitor Web resources offering particular services and having specific properties. They should be able to do so with a high degree of automation if desired. Powerful tools should be enabled by service descriptions across the Web service lifecycle. OWL-S (formerly DAML-S) [39] is an ontology of services that makes these functionalities possible.

Since the experiment used the OpenEHR Java Reference Implementation of RM, it has been found that in order to use OWL formalism, the RM (i.e., information model) should be transformed into OWL statements (OWL is found to be at a more abstract level as compared to ADL, XML, and XML schema). It has been analyzed that there exists a method for binding terminology to the EHR system with OWL as a bridging technology, given that the terminology is expressible in OWL.

Recent research [40] describes the ADL-to-OWL translation approach, describes the techniques to map archetypes to formal ontologies and demonstrates how rules can be applied to the resulting representation. It translates definitions expressed in the openEHR ADL to a formal representation expressed using the OWL. The formal representations are then integrated with rules expressed with the Semantic Web Rule Language (SWRL) [41] expressions, providing an approach to apply the SWRL rules to concrete instances of clinical data. Sharing the knowledge expressed in rules is consistent with the philosophy of open sharing, encouraged by archetypes. The approach also allows the reuse of formal knowledge, expressed through ontologies, and extends reuse to propositions of declarative knowledge, such as those encoded in clinical guidelines.

4.3. Findings

To the best of the author’s knowledge, KnowledgeRep has investigated the suitability of various formalisms in the context of archetypes, using the specific semantics of their underlying reference models and hierarchical structure (tree) of archetype definitions. It examines whether currently available archetype languages provide direct support for mapping to formal ontologies and then exploiting reasoning on clinical knowledge, which are critical ingredients of full semantic interoperability.

Through KnowledgeRep, various comparisons presented in Table 1 have been obtained. The conclusions of various results are also shown in Table 2 and Table 3.

From Table 1 we infer that ADL best represents archetypes definitions. The XML schema languages were not able to specify the archetype constraints; therefore, the XML-encoded EHR data and archetype definitions will be read into the EHR system, which will validate the EHR data against the archetype constraints programmatically. XML has a role in the exchange requirement over the web. OWL would not be a suitable representation for storing the EHR data itself. It is designed on a mathematical base specifically for reasoning. Thus, it is suitable for semantic activities such as classification, comparing, reasoning, and transformation.

Table 3 presents a consolidated view of the possibility of transformation of ADL archetypes. Thus, the XML archetypes are serialized instances of the parse tree obtained from ADL parser. Expressing archetypes using OWL/KIF requires the existing information model and relevant terminologies to be converted to OWL/KIF statements. Valuing data using OWL expressed archetypes requires massive amounts of data to be converted to OWL statements; however, OWL plays a significant role in ontology mapping, which permits the exchange of information among health care information systems conforming to different standards. The transformation to OCL is impossible, but the ADL syntax includes assertion language based on OCL.

Although, OWL is not suitable for representing conditional expressions used in representing constraints and rules. Some researchers have proposed ways to represent such richer expressions, e.g., by adopting rule languages (e.g., RuleML) into RDF and OWL representations [39]. RuleML is a family of languages whose modular system of XML schemas permits high-precision Web rule interchange [42]. Recent work [40,43] makes a case for the potential of combining the translated OpenEHR Archetype Definitions with a rule-based approach using Semantic Web Rule Language (SWRL) [41] rules. Hajjamy et al. developed a semi-automatic integration approach for integrating data from traditional information systems to a better ontologies-based system. Different data sources via UML, RDB, and XML are converted into local ontologies (OWL2) in the mentioned approach. Based on semantic, structural, and syntactic similarity measurement, all the local ontologies output then merge into a global ontology [44].

5. Discussions

The ADL has various advantages and a few disadvantages compared to the other languages, as shown in Table 1. A recent study [16] states that difference between OpenEHR RM/ADL and XML/XML schema as:

1.: The OpenEHR RM has a “domain bias” and is, therefore, more extensive and expressive than XML, a minimal generic model to describe hierarchical textual structures. This means that more has to be expressed in an XML schema in order to obtain a document class that is semantically equivalent to an archetype.
2.: The OpenEHR RM and archetype object models are modeled in UML. There is a direct correspondence between an ADL instance and the archetype object model (AOM). These models have an object-oriented bias, which is beneficial when seen as representations for software systems. They may be further equipped with logical artifacts, such as pre-and post-conditions, loop, and class invariants, and finally, operational implementation artifacts.

Therefore, the XML/XML schema models can be considered more abstract than the OpenEHR RM/AOM/ADL models, in the sense that the latter is more “implementation-oriented”.

To standardize the knowledge representation in the clinical domain, a significant contribution comes from languages such as the Resource Description Framework schema or OWL [45]. OWL is even more abstract (omitting as much as possible about internal structures and reducing the description to a set of statements about what is true in the description). This minimalism enables formal reasoner engines to operate on the models, assisting us in discovering irregularities and inconsistencies [16].

To publish pre-existing ontologies (e.g., Unified Medical Language System (UMLS)) on the Semantic Web, there exist several problems concerning alternative interpretations of the semantic network (SN) notation and the inability to express some of the interpretations in OWL. In the study [46], the authors run into several difficulties in this undertaking. Some obstacles were due to ambiguities in the SN notation’s semantics or the notation’s under-specification (e.g., what can be inferred from the absence of edges?). Other problems were the inability to express the SN as OWL axioms that would provide the desired inferences, and the difficulty of making choices between multiple possible representations. The authors encountered obstacles representing the semantics of “links” in the SN, especially in the context of requirements such as

δ

/

ρ

inheritance, inheritance blocking, and polymorphism. Thus, OWL language cannot be used to represent some notions in UMLS.

δ

/

ρ

inheritance is domain and range inheritance. The “is-a” link in SN gives rise to inheritance. Whenever there is a conflict between the placement of types in the SN and the links to be inherited, the SN provides a mechanism to explicitly “block” inheritance. The relationships whose arguments, i.e., domain and range values can be instances of multiple classes are called polymorphic relationships. These requirements led the authors of [46] to investigate the possible interpretations and encodings of a “link” in the SN.

Codes representing the meanings of nodes and constraints on text or terms, bindings to terminologies such as SNOMED or LOINC, are stated in the ontology section of an archetype. The report [15] describes a novel approach to address semantic interoperability in the healthcare domain, taking into account ontology mapping. Archetypes are used to scope the context of the matching process that will allow two independent healthcare providers to interoperate. It structures the matching algorithms at two different levels, terminology and archetype, leveraging the most mature research on ontology matching. The approach identifies relationships between ontological terms that have been embedded in pairs of archetypes as a means of matching these terms. The matched terms can then, in turn, be used to identify similarities between archetypes. The context of the archetype will limit the matching space to allow for more accurate mapping results and, due to the nature of archetypes, ultimately to a very high level of automation. The Simple Knowledge Organization System (SKOS) [47] is a standard data model for sharing and linking knowledge organization systems via the Semantic Web. The possibility of exploring SKOS shortly to bridge the gaps among records that have been created by different representation languages or represented on different semantics of terms may be done. Costa et al. introduced a method of achieving interoperability heterogeneous health systems through ontologies and rules. That approach used OWL features to map out the balance between openEHR records mixed with HL7 [48]. Recent research proposes a semantic ontological framework that might unify several EHR data formats. It takes different data formats as input (ADL, SQL, XML, CDA-HL7) and converts them into an OWL ontology [49].

6. Conclusions

Semantic interoperability is essential to facilitate the computerized support for alerts, workflow management, and evidence-based healthcare across heterogeneous EHR systems. The archetypes have been presented as a key to knowledge representation and information exchange; however, currently available archetype languages do not provide direct support for mapping to formal ontologies and then exploiting reasoning on clinical knowledge, which are critical ingredients of full semantic interoperability. The study presented various technologies for representing archetypes from XML to ADL to OWL through the simulation named as KnowledgeRep. The comparative analysis among the formalisms such as OWL, OCL, and KIF are presented (Table 1).

The semantic interoperable exchange among different standards requires these formalisms as is illustrated by recent research projects (Table 3) in the study.

The KnowledgeRep study has explored the following objectives:

1.: Given an EHR system expressed in OWL ( ${ES}_{OWL}$ ) and a terminology system expressed in OWL ( ${TS}_{OWL}$ ), there exists a method for binding terminology to EHR expressed in OWL ( $T_{OWL}$ ). The EHR system here includes the RM and archetype. This enables both the RM and archetypes to be expressed in OWL representation. That is,
∃ $T_{OWL}$ ( ${ES}_{OWL}$ , ${TS}_{OWL})$ .
2.: It has been found that given an EHR system expressed in UML+ADL ( ${ES}_{UML + ADL}$ ) and a terminology system ( ${TS}_{F}$ ) expressed in formalism F, there exists a method for binding terminology to EHR through the ‘term binding’ (TB) section of archetype ( $T_{TB}$ ). The EHR system here includes the RM (expressed in UML) and archetypes (expressed in ADL representation). The archetypes contain a ‘term binding’ section (Section 1.3). That is,
∃ $T_{TB}$ ( ${ES}_{UML + ADL}$ , ${TS}_{F})$ .
3.: It has also been found that given an EHR system expressed in XML ( ${ES}_{XML}$ ) and a terminology system expressed in XML ( ${TS}_{XML}$ ), there exists a method for binding terminology to EHR. The EHR system here includes the RM (expressed in XML schema) and Archetypes (represented in XML). That is,
∃ $T_{XML}$ ( ${ES}_{XML}$ , ${TS}_{XML})$ .

For coping with the semantics, two things are required.

1.: Establishment of the terminology binding (achieved through archetypes).
2.: Establishment of the mapping between different standardized EHR systems, i.e., data elements of one EHR system transformed to the data elements of another EHR system and vice versa (through various knowledge formalisms, see Table 1).

Thus, the paper concludes that ADL is suitable for domain modeling and constraint representation. XML schema do not provide the support for defining archetypes constraints [29]; however, XML is the global standard for exchange requirements and is also used for bringing the data from a non-archetype-based EHR system into an archetype-based EHR repository (through XSLT transformations). The OWL representation is the best fit for semantic activities, without which the semantic exchange among the three prevalent standards is impossible. The structural components of a knowledge graph have been considered for forming virtual intermediate support for the information exchanges. The current research can contribute to helping the EHR system vendors and developers to choose the appropriate technology for the required purpose, keeping in mind the semantic interoperability.

The study is limited to establishing a comparison between distinct approaches (XML, OWL, OCL, and KIF) to interoperate within the domain-specific application such as healthcare; however, it does not discuss intelligent, semi auto-constructed knowledge graph framework in the e-health context—a subject that can be further explored. Moreover, the research may be extended for various knowledge graph problems [3] such as data insufficiency, explainability, incomplete and incorrect knowledge, inconsistencies, and integration of knowledge.

Author Contributions

S.S. and S.B. conceived, formulated, and verified the formalisms for archetypes. S.S. and S.B. analyzed the results, proofread the manuscript, and corrected the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Parameter Heart_Rate as an Archetype

The structure of an archetype is given in Figure A1.

Figure A1. Parameter Heart_rate as an archetype.

Appendix B. Comparison of ADL and XML

The comparison of ADL and XML is given in Table A1.

Table A1. Comparison of ADL and XML.

Properties	ADL	XML
Machine Processable	Yes	Yes
Human Readable	Yes	Sometimes unreadable (e.g., XML schema instance, OWL-RDF ontologies)
Leaf Data Types	More comprehensive set, including interval of numerics and date/time types	String data; with XML String data; schema option- more comprehensive set
Structure	Universal schema for temporal database (EHRs) (history database)	Semi-structured data (rooted acyclic graph with unique path from root to leaf)
Adhering to object-oriented semantics	Yes, particularly for container types	XML schema languages do not follow object-oriented semantics
Ontological reference	Domain entities/archetypes	Global terms/concepts
Representation of object properties	Uses attributes	Uses attributes and Sub- elements
Space (for storage)	Uses nearly half of space for tags	May have data redundancy in contents
Efficiency	Is a domain specific language (sufficiently rich to capture and model medical domain)	Good for web document modeling with limited ability to represent database contents
Example (Archetype)	ADL representation of Heart_rate (Appendix D)	XML representation of Heart_rate (Appendix D)

Appendix C. List of Archetypes

1.: openEHR-EHR-ITEM_TREE.medication.v1.
2.: openEHR-EHR-OBSERVATION.heart_rate.v1.
3.: openEHR-EHR-OBSERVATION.blood_pressure.v1.
4.: openEHR-EHR-COMPOSITION.report.v1.
5.: openEHR-EHR-OBSERVATION.body_weight.v1.

Appendix D. ADL and XML Representation of the Heart_Rate

The ADL and XML representation of the Heart_rate archetype is given in Table A2.

Table A2. ADL and XML representation of the Heart_rate.

Heart Rate XML Representation	Heart Rate ADL Representation

<?xml version="1.0" encoding="UTF-8"?>	archetype (adl_version=1.4; uid=566c355d-9e8f-473d-a80d-90fcd8d61414)
<archetype xmlns="http://schemas.openehr.org/v1" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema">	openEHR-EHR-OBSERVATION.pulse.v2
.
.	.
.	.
<ontology>	.
<term_definitions language="ar-sy">	ontology
<items code="at0000">	term_definitions = <
<items id="text">*Pulse/Heart beat(en)</items>	["ar-sy"] = <
<items id="description">*The rate and associated attributes for a pulse or heart beat. (en)</items>	items = <["at0000"] = <
</items>	*text = <"Pulse/Heart beat(en)">**
	description = <"*The rate and associated attributes for a pulse or heart beat. (en)">
<items code="at1037">
<items id="text">*Body site(en)</items>	["at1037"] = <
<items id="description">*Body site where the pulse or heart beat were observed.(en)</items>	text = <"*Body site(en)">
</items>	description = <"*Body site where the pulse or heart beat were observed.(en)">
	["at1005"] = <
<items code="at1005">	text = <"*Presence(en)">
<items id="text">*Presence(en)</items>	description = <"*Presence of a pulse or heart beat.(en)">
<items id="description">*Presence of a pulse or heart beat.(en)</items>	comment = <"*It can be implied that the pulse or heart beat is present if Rate >0 /min. (en)">
<items id="comment">*It can be implied that the pulse or heart beat is present if Rate >0 /min. (en)</items>
</items>

<items code="at1013">	["at1013"] = <
<items id="text">*Device(en)</items>	text = <"*Device(en)">
<items id="description">*Details about the device used to measure the pulse rate or heart rate.(en)</items>	description = <"*Details about the device used to measure the pulse rate or heart rate.(en)">

<items code="at0013">	["at0013"] = <
<items id="text">*Position(en)</items>	text = <"*Position(en)">
<items id="description">*The body position of the subject during the observation.(en)</items>	description = <"*The body position of the subject during the observation.(en)">
</items>	> >
.	.
.	.
.	.
</ontology>	>
</archetype>	>

References

Beale, T.; Heard, S. The openEHR Architecture: Architecture Overview. In The openEHR Release 1.0.2; openEHR Foundation, 2008; Available online: https://specifications.openehr.org/releases/BASE/Release-1.0.3/architecture_overview.html (accessed on 19 January 2022).
Beale, T. Archetypes: Constraint-based Domain Models for Future-proof Information Systems. In Proceedings of the 11th OOPSLA Workshop on Behavioural Semantics, Seattle, WA, USA, 4–8 November 2002. [Google Scholar]
Tiwari, S.; Al-Aswadi, F.N.; Gaurav, D. Recent trends in knowledge graphs: Theory and practice. Soft Comput. 2021, 25, 8337–8355. [Google Scholar] [CrossRef]
Hogan, A.; Blomqvist, E.; Cochez, M.; D’amato, C.; De Melo, G.; Gutierrez, C.; Kirrane, S.; Gayo, J.E.L.; Navigli, R.; Neumaier, S.; et al. Knowledge Graphs. ACM Comput. Surv. 2022, 54, 71. [Google Scholar]
Sachdeva, S.; Bhalla, S. Semantic Interoperability in Healthcare information for EHR databases. In LNCS, Proceedings of the 6th International Workshop on Databases in Networked Information Systems Aizu, Wakamatsu, Japan, 29–31 March 2010; Springer: Berlin/Heidelberg, Germany, 2010; Volume 5999, pp. 157–173. ISBN 978-3-642-12037-4. [Google Scholar]
Bird, L.; Goodchild, A.; Tun, Z. Experiences with a two-level modelling approach to electronic health records. J. Res. Pract. Inf. Technol. 2003, 35, 121. [Google Scholar]
Kilic, O.; Dogac, A. Achieving Clinical Statement Interoperability using R-MIM and Archetype-based Semantic Transformations. IEEE Trans. Inf. Technol. Biomed. 2009, 13, 467–477. [Google Scholar] [CrossRef] [PubMed]
Ocean Informatics EHR Service Interface. Available online: https://openehr.atlassian.net/wiki/spaces/spec/pages/4915240/Ocean+Informatics+EHR+Service+Interface#OceanInformaticsEHRServiceInterface-Querying (accessed on 9 January 2022).
Sachdeva, S.; Bhalla, S. Implementing High-Level Query Language Interfaces for Archetype-Based Electronic Health Records Database. In Proceedings of the International Conference on Management of Data (COMAD), Mysore, India, 9–12 December 2009; pp. 235–238. [Google Scholar]
MeSH (Medical Subject Headings). Available online: http://www.ncbi.nlm.nih.gov/mesh (accessed on 5 November 2021).
National Cancer Institute. Available online: http://www.cancer.gov/ (accessed on 9 November 2021).
Beale, T.; Heard, S. The openEHR Archetype Model-Archetype Definition Language ADL 1.4. openEHR Release 1.0.2, Issue Date 12 December 2008. Available online: https://specifications.openehr.org/releases/1.0.2/architecture/am/adl.pdf (accessed on 9 January 2022).
HL7 Reference Information Model. Available online: http://www.hl7.org/v3ballot/html/infrastructure/rim/rim.html (accessed on 6 September 2021).
HL7 Clinical Document Architecture, Release 2. Available online: http://www.hl7.org/v3ballot/html/infrastructure/cda/cda.htm (accessed on 6 September 2021).
Bisbal, J.; Berry, D. Archetype Alignment-A Two-level Driven Semantic Matching Approach to Interoperability in the Clinical Domain. In Proceedings of the 2nd International Conference on Health Informatics, Tsukuba, Japan, 8–10 June 2018; pp. 216–221. [Google Scholar]
Hedayat, R. Semantic Web Technologies in the Quest for Compatible Distributed Health Records. Master’s Thesis, Uppsala University, Uppsala, Sweden, 2010. [Google Scholar]
Maldonado, J.A.; Costa, C.M.; Moner, D.; Menarguez-Tortosa, M.; Bosca, D.; Minarro Gimenez, J.A.; Fernandez-Breis, J.T.; Robles, M. Using the ResearchEHR platform to facilitate the practical application of the EHR standards. J. Biomed. Inform. 2012, 45, 746–762. [Google Scholar] [CrossRef] [PubMed]
Poseacle Convertor. Available online: http://miuras.inf.um.es/~researchehr/ISO13606-OpenEHR-Mappings.html (accessed on 5 September 2021).
Chen, R.; Klein, G. The openEHR Java reference implementation project. Stud. Health Technol. Inform. 2007, 129 Pt 1, 58–62. [Google Scholar] [PubMed]
Adel, E.; El-sappagh, S.; Elmogy, M.; Barakat, S.; Kwak, K.S. A fuzzy ontological infrastructure for semantic interoperability in Distributed Electronic Health Record. Intell. Autom. Soft Comput. 2019, 26, 237–251. [Google Scholar] [CrossRef]
Martinez-Costa, C.; Menarguez-Tortosa, M.; Maldonado, J.A.; Fernandez-Breis, J.T. Semantic Web Technologies for Managing EHR-Related Clinical Knowledge; Semantic Web, Wu, G., Eds.; InTech: London, UK, 2010; ISBN 978-953-7619-54-1. [Google Scholar] [CrossRef] [Green Version]
Roehrs, A.; da Costa, C.A.; da Rosa Righi, R.; Rigo, S.J.; Wichman, M.H. Toward a model for personal health record interoperability. IEEE J. Biomed. Health Inform. 2019, 23, 867–873. [Google Scholar] [CrossRef] [PubMed]
Nicholson, D.N.; Greene, C.S. Constructing knowledge graphs and their biomedical applications. Comput. Struct. Biotechnol. J. 2020, 18, 1414–1428. [Google Scholar] [CrossRef] [PubMed]
Riano, D.; Peleg, M.; Ten Teije, A. Ten years of knowledge representation for health care (2009–2018): Topics, trends, and challenges. Artif. Intell. Med. 2019, 100, 101713. [Google Scholar] [CrossRef] [Green Version]
Blobel, B. Knowledge representation and management enabling intelligent interoperability-principles and standards. Stud. Health TechnolInform. 2013, 186, 3–21. [Google Scholar]
OpenEHR XML-Schemas-Release 1.0.2. Available online: http://www.openehr.org/releases/1.0.2/its/XML-schema/index.html (accessed on 8 December 2021).
The GEHR Project. Available online: https://cordis.europa.eu/project/id/A2014 (accessed on 9 January 2022).
Rinner, C.; Janzek-Hawlat, S.; Sibinovic, S.; Duftschmid, G. Semantic validation of standard based electronic health record documents with W3C XML Schema. Methods Inf. Med. 2010, 49, 271–280. [Google Scholar]
Tun, Z.; Bird, L.J.; Goodchild, A. Validating Electronic Health Records Using Archetypes and XML: Cooperative Research Centre for Enterprise Distributed Systems; University of Queensland: Brisbane, Australia, 2002. [Google Scholar]
OpenEHR Community. Available online: www.openehr.org (accessed on 15 November 2021).
W3C. OWL—The Web Ontology Language. Available online: https://www.w3.org/TR/2004/REC-owl-features-20040210/ (accessed on 9 January 2022).
Sundvall, E.; Qamar, R.; Nyström, M.; Forss, M.; Petersson, H.; Karlsson, D.; Ahlfeldt, H.; Rector, A. Integration of tools for binding archetypes to SNOMED CT. BMC Med. Inform. Decis. Mak. 2008, 8 (Suppl. 1), S7. [Google Scholar] [CrossRef] [Green Version]
Kilic, O.; Bicer, V.; Dogac, A. Mapping Archetypes to OWL. June 2005. Available online: https://www.srdc.com.tr/share/publications/2005/MappingArchetypestoOWLTechnical.pdf (accessed on 10 November 2021).
Archetype Ontologizer. Available online: http://code.google.com/p/ehr2ont/wiki/ehr2ont_Wiki (accessed on 10 November 2021).
LinkEHR Tool. Available online: http://www.linkehr.com/ (accessed on 10 November 2021).
RDF (Resource Description Framework). Available online: https://www.w3.org/RDF/ (accessed on 9 January 2022).
Kiourtis, A.; Mavrogiorgou, A.; Kyriazis, D. Gaining the Semantic Knowledge of Healthcare Data through Syntactic Models Transformations. In Proceedings of the 2017 International Symposium on Computer Science and Intelligent Controls (ISCSIC), Budapest, Hungary, 20–22 October 2017; pp. 102–107. [Google Scholar]
Clinical Knowledge Manager. Available online: http://openehr.org/knowledge/ (accessed on 9 November 2021).
OWL-S: Semantic Markup for Web Services. Available online: http://www.w3.org/Submission/OWL-S/ (accessed on 6 November 2021).
Lezcano, L.; Sicilia, M.A.; Rodríguez-Solano, C. Integrating reasoning and clinical archetypes using OWL ontologies and SWRL rules. J. Biomed. Inform. 2011, 44, 343–353. [Google Scholar] [CrossRef] [PubMed] [Green Version]
SWRL: A Semantic Web Rule Language Combining OWL and RuleML. Available online: http://www.w3.org/Submission/SWRL/ (accessed on 6 November 2021).
Boley, H.; Paschke, A.; Shafiq, O. RuleML 1.0: The Overarching Specification of Web Rules. In Proceedings of the RuleML, Washington, DC, USA, 21–23 October 2010; pp. 162–178. [Google Scholar]
Lezcano, L.; Sicilia, M.A.; Serrano-Balazote, P. Combining OpenEHR Archetype Definitions with SWRL Rules—A Translation Approach. In World Summit on Knowledge Society; Lecture Notes in Computer Science; Springer: Berlin, Germany, 2008. [Google Scholar]
El Hajjamy, O.; Alaoui, L.; Bahaj, M. Integration of heterogeneous classical data sources in an ontological database. In Communications in Computer and Information Science; Springer: Cham, Switzerland, 2018; pp. 417–432. [Google Scholar]
Gatta, R.; Vallati, M.; Cappelli, C.; De Bari, B.; Salvetti, M.; Finardi, S.; Muiesan, M.; Valentini, V.; Castellano, M. Bridging the Gap between Knowledge Representation and Electronic Health Records. In Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies-Volume 5: HEALTHINF, Rome, Italy, 21–23 February 2016; pp. 159–165, ISBN 978-989-758-170-0. [Google Scholar]
Kashyap, V.; Borgida, A. Representing the UMLS Semantic Network Using OWL (Or “What’s in a Semantic Web Link?”). In LNCS2870, Proceedings of the International Semantic Web Conference 2003 (ISWC2003), Sanibel Island, FL, USA, 20–23 October 2003; Springer: Berlin/Heidelberg, Germany, 2003; pp. 1–16. [Google Scholar]
SKOS Simple Knowledge Organization System RDF Schema. Available online: http://www.w3.org/TR/2008/WD-skos-reference-20080829/skos.html (accessed on 6 November 2021).
Da Costa, C.A.; Wichman, M.H.; da Rosa Righi, R.; Yamin, A.C. Ontology-based model for interoperability between openehr and Hl7 health applications. In Proceedings of the International Conference in Health Informatics and Medical Systems (HIMS19), Las Vegas, NV, USA, 29 July–1 August 2019. [Google Scholar]
Adel, E.; El-Sappagh, S.; Barakat, S.; Hu, J.W.; Elmogy, M. An Extended Semantic Interoperability Model for Distributed Electronic Health Record Based on Fuzzy Ontology Semantics. Electronics 2021, 10, 1733. [Google Scholar] [CrossRef]
LinkEHR Toolb, ox. Available online: http://www.linkehr.com/ (accessed on 6 September 2021).
Knowledge Interchange Format. Available online: http://www-ksl.stanford.edu/knowledge-sharing/kif/#other (accessed on 6 September 2021).

Figure 1. Illustration of openEHR’s ontological_structure.

Figure 2. KnowledgeRep: Evaluation setup for various archetype representations for EHR concepts.

Table 1. Comparative analysis of various knowledge representations (in context to archetype).

Features	ADL	XML	OWL	OCL	KIF
Domain Modeling	Archetype Description Language(ADL).	Web document model.	Web-enabled ontologies for building the Semantic Web.	Constraints on object models (not on data) can describe archetypes.	Formal semantics (sharable among software entities).
Reference Model (RM) with object-oriented semantics	ADL syntax adheres to object-oriented reference models (expressed in UML for constraints).	XML and XML schema languages do not follow object-oriented semantics.	Requires explicit expression of a reference model in OWL to represent archetype constraints.	All statements are FOPL statements. It is impossible to express an archetype in a structural way.	Existing information model and terminologies have to be converted to KIF statements to describe archetypes.
Constraint Representation	ADL enables constraints to be expressed in a structural and nested way for archetypes.	Strict rules in XML schema cannot express archetype constraints.	Inconvenient in OWL.	OCL constraint types include function pre- and post-conditions, and class variants.
Path Traceability	ADL has a path syntax based on XPath (open EHR path) to deal with heavily nested structures.	Inbuilt Xpath mechanism.	No inbuilt path mechanism.	The OCL syntax for paths (that traverse associations) is similar to XPath.	No inbuilt path mechanism.
Inbuilt Ontology Section	ADL provides independence from natural language and terminology issues by having a separate ontology per archetype. It contains ’bindings’ and language-specific translations.	No built in syntax.	No built in syntax. It requires the semantics to be represented from first principles.	No built in syntax.	No built in syntax.

Table 2. Suitability of ADL, XML, and OWL.

Various Formalisms	Suitability
ADL	Representation of archetypes
XML	Information interchange
OWL	Semantic activities and transformation among archetypes

Table 3. Summary of transformation of ADL with other formalisms.

	XML	OCL	OWL	KIF
Trans-	Yes	No	Yes	Yes
formation of ADL archetypes	(GEHR Project) [27]		(Poseacle convertor, ARTEMIS project, Archetype Ontologiser, LinkEHR project) [50]	[51]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sachdeva, S.; Bhalla, S. Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges. Information 2022, 13, 52. https://doi.org/10.3390/info13020052

AMA Style

Sachdeva S, Bhalla S. Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges. Information. 2022; 13(2):52. https://doi.org/10.3390/info13020052

Chicago/Turabian Style

Sachdeva, Shelly, and Subhash Bhalla. 2022. "Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges" Information 13, no. 2: 52. https://doi.org/10.3390/info13020052

APA Style

Sachdeva, S., & Bhalla, S. (2022). Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges. Information, 13(2), 52. https://doi.org/10.3390/info13020052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Using Knowledge Graph Structures for Semantic Interoperability in Electronic Health Records Data Exchanges

Abstract

1. Introduction

1.1. Overview Electronic Health Records

1.2. Archetypes and Semantic Interoperability

1.3. Representing Internal Data in Archetypes

1.4. EHRs: Data Modeling

2. Formalisms in Transformation of Archetypes among Different Standards: Background

3. Knowledge Representation

3.1. ADL and XML

3.2. ADL and OWL

3.3. ADL and Other Formalisms

4. Evaluation in Practice

4.1. KnowledgeRep: Simulation

4.2. Methodology

4.2.1. Representation in ADL

4.2.2. Representation in XML

4.2.3. Representation in OWL

4.3. Findings

5. Discussions

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Parameter Heart_Rate as an Archetype

Appendix B. Comparison of ADL and XML

Appendix C. List of Archetypes

Appendix D. ADL and XML Representation of the Heart_Rate

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI