Business to business interoperability: A current review of XML data integration standards

https://doi.org/10.1016/j.csi.2008.12.006Get rights and content

Abstract

Despite the dawn of the XML era, semantic interoperability issues still remain unsolved. As various initiatives trying to address how the underlying business information should be modelled, named and structured are being realised throughout the world, the importance of moving towards a holistic approach in eBusiness magnifies. In this paper, an attempt to clarify between the standards prevailing in the area is performed and the XML Data Standards providing generic XML Schemas are presented. Based on this “XML Data Standards Map”, a multi-faceted classification mechanism is proposed, leading to an extensible taxonomy of standards. A set of facets is analyzed for each standard, allowing for their classification based on their scope, completeness, compatibility with other standards, openness, ability to modify the schemas and maturity, to name a few. Through populating and querying this multi-faceted classification, a common understanding of Data Integration Standards can be ensured and the choice of a standard according to the requirements of each business can be systematically addressed.

Introduction

Data modelling issues have aroused the interest of the research community since the late 1960s when EDI (Electronic Data Interchange) that facilitated the application-to-application exchange of standard business documents between companies, independently of software, hardware, and communication networks, was introduced. With the exponential growth of the Web that opened new opportunities for businesses to transact across all types of boundaries (geographical, national, business category, etc.), early research had focused on providing a lingua franca for B2B e-Commerce, XML, that went beyond HTML to reflect the richness of the data being published. In progress of time e-Business modelling frameworks went through an evolutionary path from monolithic and proprietary standards (e.g. TRADACOMS [32] for the UK retail industry, ANSI ASC X.12 [1] in North America and the United Nations-recommended UN/EDIFACT [33]) towards flexible and standardized XML-based stacks covering the requirements from different industries.

It had been a common belief that creating XML vocabularies was sufficient to achieve data interoperability, yet this assumption goes far beyond reality. XML by itself does not guarantee that XML expressed business information exchanged in the span of business processes across different enterprises will be understood equally well by all systems. This is because the XML syntax only provides for creating markup languages used as metadata, it does not address how the underlying business information must be modelled, named and structured. Semantics come to cover this gap by attaching meaning to data in a structured and technical way that both humans and machines can understand and process.

The difficulty however is that currently most modelling languages focus almost exclusively on the technical aspects of creating the model rather than the semantic aspects necessary for true interoperability. Many industry-specific consortia, like CIDX [47], PIDX [48], OAGi [23], and RosettaNet [49], have indeed solved major technical issues of traditional EDI, but appear insufficient to provide a common understanding of the underlying data and arrange the semantics of the business information. The prevalent “business standards dilemma”, defined as the diversity of standards that address particular data requirements, but are designed on such a different basis that make the choice of a specific standard to be adopted a new challenge, is compounding the problem [15], [29]. For example, trading partners have to deal with several standards at the same time and, since only recently a methodology for standardizing business semantics (ISO 15000-5, commonly known as UN/CEFACT Core Component Technical Specification) has emerged, careful analysis — on the part of the developer to ensure correct understanding and interpretation, mapping and integration between interfaces is required incurring great expense.

As long as the semantic discourse of data and standards exists, a holistic approach in e-Business cannot be achieved [27]. In this context, the present paper is oriented towards analyzing the Data Modelling State of the Art in terms of core technologies, like XML and UN/CEFACT CCTS (Core Components Technical Specification), and international standards and initiatives in Data Modelling, that have produced sets of XML Schemas, like xCBL, eBIS-XML, OAGIS, UBL, XBRL and xCBL. Such standards fall into the same jurisdiction and try to address data integration issues with the adoption of semantically-enabled XML Schemas.

With regard to the fact that generic classifications of standards (e.g. International Classification of Standards [18]) do not serve the exact needs of the selection and evaluation of data modelling standards; they merely list different standards, a conceptual framework that takes the form of a multi-faceted taxonomy has been developed for the systematic evaluation of standards. The first version of the proposed evaluation framework was developed in the context of the EU-funded “GENESIS: Enterprise Application Interoperability via Internet-Integration for SMEs, Governmental Organizations and Intermediaries in the New European Union” Project [12]. Its aim is the research, development and pilot application of the needed methodologies, infrastructure and software components that will allow the typical, usually small and medium, European enterprise to conduct its Business transactions over Internet, by interconnecting its main transactional software applications and systems with those of collaborating enterprises (B2B transactions), governmental bodies (B2G transactions), banking and insurance institutions (BNK transactions) with respect to the EC current legal and regulatory status and the existing one in the new EU, candidate and associate countries. The development of the evaluation framework was due to a practical need faced in the project [11]: there were many potential data modelling standards and available specifications, and the need to evaluate and select those that would serve each integration need (i.e. B2B, B2G and BNK) emerged. In addition, it was not always clear, which transactions could be covered by the proposed models, standards and specifications and it quickly became evident that there was no evaluation model in the bibliography that would consider all the needed aspects.

In this context, a data modelling evaluation framework was developed and used both for a quick overview of several data modelling standards and a more thorough evaluation of a selected core related to the scope of the project. This paper proposes an extended version of the GENESIS-related evaluation framework that provides the means for the systematic analysis of any B2B data standard and for deducting conclusions regarding the most appropriate standard according to the weight each business poses to the criteria — facets. The set of parameters and characteristics of the standards indicatively include scope, completeness, openness, modularity, maturity, configuration support and modelling of messages and aim to provide a thorough understanding of the standard before implementation. Our approach also presents innovative aspects by incorporating for the first time facets, such as integrated management of enterprise and data models, cross-country support, support for rules modelling, workflow capabilities incorporated into the documents and compatibility with other standards, in the standards' evaluation phase.

The remainder of the paper is structured as follows: in the second section related work in literature is reviewed and discussed. An introduction on Data Modelling is given in Section 3. Section 4 proceeds with the core technologies in Data Modelling. The presentation of the XML Data Integration Standards providing generic (not industry-specific) B2B XML Schemas and the taxonomy of the standards follow in 5 XML Data Standards providing generic B2B Schemas, 6 Multi-facet evaluation taxonomy for XML data standards and lead to the evaluation of standards in Section 7. A short summary of our results and further research activities required towards the next generation of enterprise data integration complement this work.

Section snippets

Literature review

In the span of this work related research efforts which reference Business Information and Data Integration and Modelling [2], [3], [4], [6], [7], [10], [17], [19], [20], [21], [26], [28], [30], [31], [38], [45], [46], [50] were examined and reviewed. The main findings upon which our approach builds originate from the following relevant work:

  • In [20], a state of the art in e-Business Frameworks that are suitable for industrial procurement, design, production or distribution and were active in

Primer of the Data Modelling concept

In computer science, Data Modelling is considered as the process of structuring and organizing data. While data analysis is a common term for data modelling, the activity actually has more in common with the ideas and methods of synthesis (inferring general concepts from particular instances) than it does with analysis (identifying component concepts from more general ones). It strives to bring the data structures of interest together into a cohesive, inseparable whole by eliminating

W3C ΧΜL

The Extensible Markup Language (XML) [41] is a W3C-recommended general-purpose markup language for creating special-purpose markup languages, capable of describing many different kinds of data. It is a simplified subset of Standard Generalized Markup Language (SGML — ISO 8879:1986) and by construction, XML documents are conforming SGML documents. Originally designed to meet the challenges of large-scale electronic publishing, the primary purpose of XML is to facilitate the sharing of data

XML Data Standards providing generic B2B Schemas

This section introduces the industry independent XML Data Standards, namely xCBL, eBIS-XML, OAGIS, UBL, XBRL and xCBL, that will populate the proposed evaluation framework. The presentation order for the Standards is based on their alphabetical order while the information provided is primarily based on the specifications and information published in their official web sites until June 2007.

To our knowledge, UN/CEFACT has also released an International e-Invoice (Cross-Industry electronic

Multi-facet evaluation taxonomy for XML data standards

In order to classify the various standards prevailing in business transactions, a Standard Evaluation Record has been constructed that contains all the Taxonomy-related information of a Standard, that characterize it. There are 12 identified “facets” for a standard, some of which can be further analyzed in sub-facets, taking values from restricted lists of values, as shown in Table 1.

The purpose of each facet is to describe specific characteristics of a standard in a methodological and coherent

Populating the multi-facet evaluation taxonomy for XML data standards

The XML Data Standards providing generic (not industry-specific) B2B, Schemas recognised and analysed in previous sections have been inserted in the above classification scheme, with the results presented in the following “XML Data Integration Standards Evaluation Matrix” (Table 2).

Conclusions

Traditionally, data has been designed for specific applications and databases without regard to integration. Achieving B2B integration does not simply mean achieving a technical realization of connectivity between systems — that is readily addressed through the use of existing technical standards and support middleware such as Web services. The biggest remaining challenge for achieving B2B data interoperability is the lack of common understanding at the collaborative business process and data

Acknowledgments

This paper has been created closely to research activities during the EU-funded project GENESIS (Contract Number FP6-027867).

Fenareti Lampathaki has graduated with a M.Sc. Degree from the School of Electrical and Computer Engineering of the National Technical University of Athens (NTUA) and is currently a Ph.D. candidate in the field of Semantic Interoperability at the same university. Her research interests lie on e-Government and e-Business Interoperability, XML Data Management and Integration, Semantic Web Services, Service-oriented Architectures and Government Transformation. During the last years, she has been

References (50)

  • cXML Version 2.0.17, Retrieved June 2007 from...
  • eBIS-XML Suite Version 3.09, Retrieved May 2006 from...
  • Eva Soderstrom

    Standardising the business vocabulary of standards

  • GENESIS Deliverable D3.1: Analysis of the Data Modelling State of the Art, June...
  • GENESIS Project, 2007,...
  • Graeme C. Simsion, Graham C. Witt, Data Modelling Essentials, Third Edition, Morgan Kaufmann Publications, Elsevier,...
  • Gunther Stuhec

    How to solve the business standards dilemma — CCTS key model concepts

  • Gunther Stuhec

    How to solve the business standards dilemma — the context driven business exchange

  • H.K. Klein et al.

    A comparative framework of data modelling paradigms and approaches

    Computer Journal

    (1987)
  • IDA Project: XML-Based Business Frameworks, Deliverable 2.1 Evaluation of XML Frameworks, Version 2.1, May...
  • ISO, International Classification of Standards, 2007,...
  • J.M. Nurmilaakso et al.

    A review of XML-based supply-chain integration

    Production Planning and Control

    (2004)
  • OAGIS Version 9.1, Retrieved May 2007 from...
  • OASIS
  • Robert J. Glushko et al.

    Document engineering for e-business

  • Cited by (68)

    • Information sharing in supply chains – Interoperability in an era of circular economy

      2022, Cleaner Logistics and Supply Chain
      Citation Excerpt :

      The literature reviews conducted thus far in the area of data sharing in SCs have been carried out along two lines. The first category includes those studies that review the technologies and standards used in SC data sharing (Nurmilaakso, 2008; Lampathaki et al., 2009; Chituc, 2017; Chituc, 2019). The latter category involves reviewing the implications of data sharing in SCs with regards to I4.0 (Naseem and Yang, 2021; Spanaki et al., 2018), collaboration framework (Singh et al., 2018), sustainability (Fritz et al., 2017, ?, ?) and those indicating the recent trends and possible future research (Pan et al., 2021; Ralston et al., 2017; Daneshvar Kakhki and Gargeya, 2019; Frederico, 2021).

    • A novel business context-based approach for improved standards-based systems integration—a feasibility study

      2022, Journal of Industrial Information Integration
      Citation Excerpt :

      Consequently, it could not be a basis for streamlined, efficient, and effective DES usage specification development. The Core Component Technical Specification is an ISO-approved meta-model standard for the development and use of DESes [17] and one of the core technologies in data (exchange) modeling [48]. The first key notion of CCTS is CCs, which represent reusable DES building blocks.

    View all citing articles on Scopus

    1. Download : Download full-size image
    Fenareti Lampathaki has graduated with a M.Sc. Degree from the School of Electrical and Computer Engineering of the National Technical University of Athens (NTUA) and is currently a Ph.D. candidate in the field of Semantic Interoperability at the same university. Her research interests lie on e-Government and e-Business Interoperability, XML Data Management and Integration, Semantic Web Services, Service-oriented Architectures and Government Transformation. During the last years, she has been involved on several EU-funded and national research projects with interoperability aspects, including the FP6 GENESIS, the FP7 Greek Interoperability Centre and the Greek e-Government Interoperability Framework.

    1. Download : Download full-size image
    Spiros Mouzakitis is a Ph.D. candidate at the School of Electrical and Computer Engineering at the National Technical University of Athens. He has nine years of industry experience in conception, analysis and implementation of information technology systems. His current research is focused on decision analysis in the field of eBusiness, enterprise interoperability and eGovernment.

    1. Download : Download full-size image
    George Gionis holds a M.Sc. Degree in Electrical and Computer Engineer at National Technical University of Athens and is currently finalizing his Ph.D. in the field of Interoperability Middleware Architectures and in particular in the field of Rule-Driven Enterprise Service Bus. His research interests comprise modeling and execution of business and legal rules, definition and execution of web-service orchestrations, semantic web services as well as mashup architectures and application and social web practices and tools.

    1. Download : Download full-size image
    Yannis Charalabidis is a Computer Engineer (BSc, MSc) and holds a PhD in Complex Information Systems Engineering from the National Technical University of Athens (NTUA), where he is currently heading eBusiness and eGovernment Research. He has been employed for 10 years in the European IT Sector, as a manager in Singular IT Group in Greece, Poland, Bulgaria, Romania and the Netherlands. During the last 20 years, he has been involved as a project manager or key executive in more than 50 projects under the ESPRIT, Brite-EUram, FP6-FP7 programmes and National Initiatives in European Union, Asia and the US. He is a contributing member in several eGovernment and eBusiness standardization committees and initiatives, including CEN/ISSS eGov, ETSI/B2B Interoperability Standardisation, INTEROP-VLAB, NESSI iGOV Working Group, DEMONET Network, European Commission Interoperability Cluster (as an editor of the Enterprise Interoperability Research Roadmap), IDABC/EIF, SEMIC.EU and IFIP WG 8.5. He has published more than 100 papers in scientific books, journals and conferences, and he lectures on eGovernment, eBusiness and Interoperability in the National Technical University of Athens and the University of the Aegean.

    1. Download : Download full-size image
    Dimitris Askounis is an Assistant Professor in the School of Electrical and Computer Engineering of the National Technical University of Athens (NTUA). He has been involved in numerous EU funded IT research projects since 1988 (ESPRIT, BRITE-EURAM, FP5, FP6) in the areas of computer integrated manufacturing, enterprise resource planning, decision support, project management, knowledge management, quality management, business and data modelling, interoperability and e-business (PRAXIS, GENESIS), etc. He has also participated in several other EU-funded projects within the EUROPAID framework in CEEC, NIS and MEDA countries concerning management training, monitoring and evaluation of large projects, energy policy and planning.

    View full text