Elsevier

Data & Knowledge Engineering

Volumes 81–82, November–December 2012, Pages 118-139
Data & Knowledge Engineering

A model driven approach for the development of metadata editors, applicability to the annotation of geographic information resources

https://doi.org/10.1016/j.datak.2012.09.001Get rights and content

Abstract

Metadata are a key element for the development of information infrastructures because they facilitate the semantic description of contents and services. However, the diversity and heterogeneity of metadata standards have become a barrier for the generation of these metadata. Many metadata editors are not useful anymore because they do not support the latest version of metadata standards or the new profiles arisen in the market. Thus, this work proposes a model driven approach for the development of metadata editors, more focused on the generic treatment of metadata models than on the development of specific edition forms for a reduced set of metadata standards. This approach has been tested in the context of Spatial Data Infrastructures for the development of an Open Source tool called CatMDEdit. Additionally, the approach could be also applied to improve the efficiency of any metadata editor using a metamodeling development strategy.

Introduction

Spatial Data Infrastructures (SDI) are special types of information infrastructures consisting of the relevant base collection of technologies, policies and institutional arrangements that facilitate the availability of and access to spatial data. Traditionally, spatial data (also known as geographic information) were the core component of Geographic Information Systems (GIS), which is the term commonly used to refer to the software packages that allow one to capture, store, check, integrate, manipulate, analyze and display them. However, the potential of spatial data as an instrument to facilitate decision-making and resource management in diverse areas (e.g., natural resources, facilities, cadaster or agriculture) of government or private sectors has led to the evolution of GIS into the broader concept of SDI [1].

One of the main elements for the success in the development of SDI or any other type of information infrastructure is the appropriate annotation of resources to be accessed and distributed by means of metadata. Metadata constitute the mechanism to characterize data and services (e.g., descriptions of the content, quality, condition, authorship and any other features) in order to enable other users and applications to make use of such data and services. However, due to the heterogeneity of contents in information infrastructures, it is not possible to consider a unique metadata model or schema.

The diversity of metadata standards has been a critical issue for the development of SDIs. During the last fifteen years, standardization bodies have proposed different metadata standards such as the Content Standard for Digital Geospatial Metadata (CSDGM) [2] or ISO 19115 Geographic Information — Metadata [3]. Additionally, apart from the standards, it is also common to find application profiles and extensions of these standards. The Dublin Core Metadata Initiative [4] defines a Metadata Application Profile as a declaration of the elements (either selected from the standard, or new elements) that an organization or user community employs in their metadata, and how these elements have been customized to a particular application domain. Within the SDI context, there are multiple examples of metadata profiles for the description of remote sensing data [5], [6], environmental data [7], [8], or the customization of general metadata standards such as Dublin Core [9].

In parallel to the definition of this wide range of geographic metadata standards, there has been an increasing need for desktop or web-based metadata editors able to manage these complex standards (hundreds of elements with different data types organized in hierarchical entities) and providing, among other functionalities, an internationalized interface, online help, validation of standard conformance, and serialization to XML or other semi-structured formats. Through different surveys [10], [47], it can be verified that the development of these metadata editors has been highly promoted during the last years: about 50 tools have been published by different organizations and software companies. At the beginning, these tools were developed to support a unique metadata model. However, these first tools based on fixed structures became rapidly out-of-date. In contrast, nowadays software developers are sensitive to this problem of flexibility. Instead of implementing a particular metadata model, most of the current editors enable as well the configuration of the model using some kind of schema language.

From a software engineering perspective, we could say that annotation tools in the SDI context have moved towards a higher level of abstraction: the focus is now on the management of metamodels. According to ISO/IEC 11179‐3 [11], a metamodel provides a mechanism for understanding the precise structure and components of the specified models, which are needed for the successful sharing of the models by users and/or software facilities. Although the available documentation of most editors does not reveal a conscious interest on metamodels, the key component of these tools is the mechanism to describe the different metadata standards that can be used later to customize dynamically the software for editing metadata in conformance to these standards.

The need for flexibility and the consequent increase of model abstraction is leading the development of metadata editors to a stage that, although not directly intended, is close to the software engineering paradigm of Model Driven Engineering (MDE). MDE focuses on models as the primary artifact in the development process, with transformations as the primary operation on models, used to map information from one model to another [12]. However, in order to avoid ad-hoc and un-efficient implementations of this MDE paradigm, it is important to follow a systematic and acknowledged methodology such as the one defined by the Model Driven Architecture (MDA), the OMG instantiation of MDE [13], [19]. Although in the SDI context MDA has been considered for interoperability issues and data access services [14], [15], [16], it has received little attention for the development of metadata editors. The objective of this work is to provide the guidelines and framework to accomplish the development of metadata editors according to MDA in order to facilitate their rapid development, decreasing the development effort and, at the same time, augmenting their efficiency and flexibility. This paper presents the different domain specific languages and transformations required to define and transform the models involved in the development of a metadata editor: the Platform Independent Model (PIM) for the representation of supported metadata schemas; the Platform Specific Models (PSM) for the specification of a set of GUI (Graphical User Interface) edition forms, and the representation of controlled vocabularies; and the transformation of PSM models into code.

The rest of this paper is structured as follows. Section 2 presents our proposal to apply an MDA approach in the development of metadata editors. Following this proposal, Section 3 describes how this proposal has been used in CatMDEDit, an open-source metadata editor, for the support of different metadata standards and profiles. Then, Section 4 describes the related work in the development of annotation tools from a metamodeling perspective, and discusses the benefits from applying an MDA development approach. Finally, Section 5 draws some conclusions and outlines future work.

Section snippets

Overview

As stated by Djurić et al. [17], if we look back to the history of software development, we can see a notable increase in model abstraction. The activity of modeling is now more separated from the specification of the details of the underlying platform. This evolution allows domain experts to focus on defining reusable models of the real world, alleviating them from acquiring the knowledge about specific computer systems. Two examples of this evolution are the methodologies known as Model

Testing the MDA approach in CatMDEdit

CatMDEdit is a Java-based Open Source tool for the semantic annotation of geographical information. Initial versions of this tool were directly implemented around the logical model of an extended version of the CSDGM metadata standard. However, with the increasing requirements for supporting new arising metadata standards and profiles, it was soon acknowledged that this development approach was costly and error prone. Since version 3.7, the development team decided to adopt the MDA approach

Related work

Given the increasing importance of geographic metadata, numerous software packages (dedicated tools or plug-ins in GIS tools) have appeared during the last decade for the creation of metadata. The Wisconsin Land Information Clearinghouse (WiscLINC) [10] and the Federal Geographic Data Committee [47] provide detailed reviews of edition tools based on CSDGM (the old North American standard) and ISO 19115 metadata standards. Without being exhaustive, the first review, last updated in 2006, reports

Conclusions

This paper has presented the guidelines to apply an MDA approach for the development of annotation tools, which can be customized to different metadata standards and profiles with minimum effort. Applying this approach, experts in metadata standards (without any special programming skills) can focus their efforts on the definition of new metadata models using a domain specific language (the one used to define PIM models), whose abstraction level is close to the way of expressing metadata

Javier Nogueras-Iso holds MS and PhD degrees in Computer Science from the University of Zaragoza (Spain). After working for the Economic and Social Committee of the European Communities (Brussels) in 1998, he started his research at the Advanced Information Systems Laboratory of the Computer Science and Systems Engineering Department of the University of Zaragoza. Currently, he is an Associate Professor of Computer Science at that University. Additionally, he completed a postdoctoral stay at

References (62)

  • C. Mayr et al.

    View-based model-driven architecture for enhancing maintainability of data access services

    Data & Knowledge Engineering

    (2011)
  • Developing Spatial Data Infrastructures: the SDI cookbook v.2.0

  • Federal Geographic Data Committee

    Content Standard for Digital Geospatial Metadata, version 2.0

    Document FGDC-STD-001-1998

    (1998)
  • International Organization for Standardization

    Geographic information — metadata

    ISO 19115:2003

    (2003)
  • Dublin Core Metadata Initiative (DCMI) Usage Board

    DCMI metadata terms

  • International Organization for Standardization

    Geographic information — metadata — part 2: extensions for imagery and gridded data

    ISO 19115‐2:2009

    (2009)
  • Federal Geographic Data Committee

    Content Standard for Digital Geospatial Metadata: extensions for remote sensing metadata

    Public review draft

    (2002)
  • W. Swoboda et al.

    The UDK approach: the 4th generation of an Environmental data catalogue introduced in Austria and Germany

  • M. Wilde et al.

    An environmental metadata profile for the EU project MEDIS

  • R. Tolosana-Calasanz et al.

    Semantic interoperability based on Dublin Core hierarchical one-to-one mappings

    International Journal of Metadata, Semantics and Ontologies (IJMS&O)

    (2006)
  • Wisconsin Land Information Clearinghouse

    Metadata tools for geospatial data

  • Information technology — metadata registries (MDR) — part 3: registry metamodel and basic attributes

    ISO/IEC 11179‐3:2003

    (2003)
  • R. Grangel et al.

    Ontology based communications through model driven tools: feasibility of the MDA approach in urban engineering

  • Meta Object Facility (MOF) Specification v1.4

    OMG Document formal/02–04–03

    (April 2002)
  • T. Kutzner et al.

    Critical remarks on the use of conceptual schemas in geospatial data modelling — a schema translation perspective

  • P. Staub et al.

    Semantic interoperability through the definition of conceptual model transformations

    Transactions in GIS

    (2008)
  • D. Djurić et al.

    Ontology Modeling and MDA

    Journal on Object Technology

    (2005)
  • J. Sztipanovits et al.

    Model-integrated computing

    IEEE Computer

    (April 1997)
  • MDA Guide version 1.0.1

    Document Number: OMG/2003–06–01

    (2003)
  • S. Kelly et al.

    Domain-Specific Modeling: Enabling Full Code Generation

    (2008)
  • G. Nordstrom et al.

    Metamodeling — rapid design and evolution of domain-specific modeling environments

  • A. Miles et al.

    SKOS: simple knowledge organisation for the web

    Cataloging and Classification Quarterly

    (2007)
  • D. Steinberg et al.

    EMF: Eclipse Modeling Framework

    (2008)
  • C. Driver et al.

    Managing embedded systems complexity with aspect-oriented model-driven engineering

    ACM Transactions on Embedded Computing Systems

    (2011)
  • F. Jouault et al.

    Transforming models with ATL

  • J. Oldevik

    MOFScript user guide, version 0.9 (MOFScript v 1.4.0)

  • D. Nebert et al.

    OpenGIS — Catalogue Services Specification (version: 2.0.2)

    OGC 07-006r1

    (2007)
  • International Organization for Standardization

    Geographic information — metadata — XML schema implementation

    ISO/TS 19139:2007

    (2007)
  • A. Powell et al.

    DCMI abstract model, Dublin Core Metadata Initiative (DCMI)

  • F. Jouault et al.

    TCS: a DSL for the specification of textual concrete syntaxes in model engineering

  • J. Lacasta et al.

    A Web Ontology Service to facilitate interoperability within a Spatial Data Infrastructure: applicability to discovery

    Data & Knowledge Engineering

    (2007)
  • Cited by (9)

    • Model-driven disaster management

      2013, Information and Management
      Citation Excerpt :

      Structuring information and maintaining it takes time and effort and requires a type of quality control [16] that often relies on the use of metadata [24]. Metadata is often seen as the cornerstone element for the development of information infrastructures because metadata facilitates the semantic description of content and services [24]. The conceptual frameworks of crisis management for knowledge repositories to strengthen the resilient capabilities of organizations and communities to face a crisis have been developed [8,20].

    • A Systematic Review of Metamodelling in Software Engineering

      2021, Studies in Systems, Decision and Control
    • Raising semantics-awareness in geospatial metadata management

      2018, ISPRS International Journal of Geo-Information
    • Creating and managing metadata

      2017, Ecological Informatics: Data Management and Knowledge Discovery: Third Edition
    View all citing articles on Scopus

    Javier Nogueras-Iso holds MS and PhD degrees in Computer Science from the University of Zaragoza (Spain). After working for the Economic and Social Committee of the European Communities (Brussels) in 1998, he started his research at the Advanced Information Systems Laboratory of the Computer Science and Systems Engineering Department of the University of Zaragoza. Currently, he is an Associate Professor of Computer Science at that University. Additionally, he completed a postdoctoral stay at the Institute of Environment and Sustainability of the Joint Research Centre (Ispra, Italy) in 2005. His research interests are focused on Spatial Data Infrastructures and Geographic Information Retrieval. Within this context, he has co-authored numerous publications in books, journals or conference proceedings; and has collaborated in several R + D projects.

    Miguel Ángel Latre holds a MS degree in Computing from the University of Zaragoza (Spain) since October 1999. He is currently a tenured Assistant Professor at the Department of Computer Science and Systems Engineering at the same university. He is working with the Advanced Information Systems Laboratory where he has been involved in several R&D projects related to the software engineering aspects of Geographic Information Systems and its application to the Environmental field. He is now working to complete his PhD degree in Computer Science.

    Rubén Béjar holds MS and PhD degrees in Computing from the University of Zaragoza (Spain). He is currently a tenured Assistant Professor at the Department of Computer Science and Systems Engineering at the same university. He is working with the Advanced Information Systems Laboratory where he has been involved in several R&D projects related to the software engineering aspects of Geographic Information Systems and has several publications in that area.

    Pedro R. Muro-Medrano holds MS and PhD degrees in Industrial Engineering from the University of Zaragoza (Spain). He has worked in the private industry for 2 years and has held different visiting research positions at the Carnegie Mellon University's Robotics Institute (Pittsburgh, PA, USA), the University of Maryland (College Park, MD, USA) and the US National Institutes of Health (Bethesda, MD, USA). He has 25 years of experience with R&D activities in software development and engineering and he is a Professor of Computer Science at the University of Zaragoza. He is a co-author of numerous national and international publications in books, journals and conference proceedings. Currently, he is the head of the Advanced Information Systems Laboratory at the Computer Science and Systems Engineering Department and the Aragón Institute for Engineering Research from the University of Zaragoza. His research interests include Spatial Data Infrastructures, Geographic Information Systems, Environmental Information systems and Remote Sensing.

    F. Javier Zarazaga-Soria holds a MS degree in computer science from the Technical University of Valencia (Spain) and PhD degree in Computer Science from the University of Zaragoza (Spain). He did his MS Thesis at the Middlesex University's Road Safety Engineering Laboratory, London (UK). In September 1994 he began to work at the Advanced Information Systems Laboratory. He has been an Assistant Professor at the Computer Science and Systems Engineering Department of the University of Zaragoza since November 1996 to June 2003. Since June 2003 he has been an Associate Professor at that University. He is a co-author of numerous national and international publications in books, journals and conference proceedings. He has participated in several R&D projects with companies and/or Public Administrations. His research interests include Information Retrieval and Ontologies in the context of Spatial Data Infrastructures.

    This work has been partially supported by the Spanish Government through the project TIN2007-65341, the National Geographic Institute of Spain (IGN), and GeoSpatiumLab S.L.

    View full text