Top

2006 | Book

The Semantic Web - ISWC 2006

5th International Semantic Web Conference, ISWC 2006, Athens, GA, USA, November 5-9, 2006. Proceedings

Editors: Isabel Cruz, Stefan Decker, Dean Allemang, Chris Preist, Daniel Schwabe, Peter Mika, Mike Uschold, Lora M. Aroyo

Publisher: Springer Berlin Heidelberg

Book Series : Lecture Notes in Computer Science

Part of: Springer Professional "Wirtschaft+Technik" , Springer Professional "Technik" , Springer Professional "Wirtschaft"

About this book

“Evolve or perish” – this is the motto for living systems. Judging by this saying, the Web is alive and well: new sites and business ideas are coming online almost daily and are able to attract millions of users often. The more recently coined term “Web 2.0” summarizes many of the new developments, capturing efforts making the Web more interactive (like Ajax), more collaborative (like Wikis), or more relationship oriented (like online social networks), aiming to partially fulfill the original promise of the Web. These new Web developments offer an opportunity and challenge for the Semantic Web: what previously manifested itself mostly in “dry” specifications is now becoming the foundation for information exchange on the Web, creating a shared semantic information space. These and other challenges have been picked up by several hundred computer scientists, developers, vendors, government workers, venture capitalists, students, and users, gathered in Athens, Atlanta, USA, November 5–9, 2006, for the Fifth International Semantic Web Conference (ISWC 2006). Building on previous successful meetings in Sardinia, Sanibel Island, Hiroshima, and Galway, this sixth annual conference demonstrates new research results, technology, and applications that show current incarnations of the Semantic Web. Especially encouraging is the shift towards more applications—whereas the Research Track attracted roughly as many papers as in the previous year, the contributions submitted to the In-Use Track doubled.

Frontmatter

Research Track

Ranking Ontologies with AKTiveRank

Ontology search and reuse is becoming increasingly important as the quest for methods to reduce the cost of constructing such knowledge structures continues. A number of ontology libraries and search engines are coming to existence to facilitate locating and retrieving potentially relevant ontologies. The number of ontologies available for reuse is steadily growing, and so is the need for methods to evaluate and rank existing ontologies in terms of their relevance to the needs of the knowledge engineer. This paper presents AKTiveRank, a prototype system for ranking ontologies based on a number of structural metrics.

Harith Alani, Christopher Brewster, Nigel Shadbolt

Three Semantics for Distributed Systems and Their Relations with Alignment Composition

An ontology alignment explicitly describes the relations holding between two ontologies. A system composed of ontologies and alignments interconnecting them is herein called a distributed system. We give three different semantics of a distributed system, that do not interfere with the semantics of ontologies. Their advantages are compared with respect to allowing consistent merge of ontologies, managing heterogeneity and complying with an alignment composition operation. We show that only the first two variants, which differ from other proposed semantics, can offer a sound composition operation.

Antoine Zimmermann, Jérôme Euzenat

Semantics and Complexity of SPARQL

SPARQL is the W3C candidate recommendation query language for RDF. In this paper we address systematically the formal study of SPARQL, concentrating in its graph pattern facility. We consider for this study simple RDF graphs without special semantics for literals and a simplified version of filters which encompasses all the main issues. We provide a compositional semantics, prove there are normal forms, prove complexity bounds, among others that the evaluation of SPARQL patterns is PSPACE-complete, compare our semantics to an alternative operational semantics, give simple and natural conditions when both semantics coincide and discuss optimization procedures.

Jorge Pérez, Marcelo Arenas, Claudio Gutierrez

Ontology-Driven Automatic Entity Disambiguation in Unstructured Text

Precisely identifying entities in web documents is essential for document indexing, web search and data integration. Entity disambiguation is the challenge of determining the correct entity out of various candidate entities. Our novel method utilizes background knowledge in the form of a populated ontology. Additionally, it does not rely on the existence of any structure in a document or the appearance of data items that can provide strong evidence, such as email addresses, for disambiguating person names. Originality of our method is demonstrated in the way it uses different relationships in a document as well as from the ontology to provide clues in determining the correct entity. We demonstrate the applicability of our method by disambiguating names of researchers appearing in a collection of DBWorld posts using a large scale, real-world ontology extracted from the DBLP bibliography website. The precision and recall measurements provide encouraging results.

Joseph Hassell, Boanerges Aleman-Meza, I. Budak Arpinar

Augmenting Navigation for Collaborative Tagging with Emergent Semantics

We propose an approach that unifies browsing by tags and visual features for intuitive exploration of image databases. In contrast to traditional image retrieval approaches, we utilise tags provided by users on collaborative tagging sites, complemented by simple image analysis and classification. This allows us to find new relations between data elements. We introduce the concept of a navigation map, that describes links between users, tags, and data elements for the example of the collaborative tagging site Flickr. We show that introducing similarity search based on image features yields additional links on this map. These theoretical considerations are supported by examples provided by our system, using data and tags from real Flickr users.

Melanie Aurnhammer, Peter Hanappe, Luc Steels

On the Semantics of Linking and Importing in Modular Ontologies

Modular ontology languages, such as Distributed Description Logics (DDL),

$\mathcal{E}$

-connections and Package-based Description Logics (P-DL) offer two broad classes of approaches to connect multiple ontology modules: the use of

mappings

linkings

between ontology modules e.g., DDL and

$\mathcal{E}$

-connections; and the use of

importing

e.g., P-DL. The major difference between the two approaches is on the usage of “foreign terms” at the syntactic level, and on the local model disjointness at the semantic level. We compare the semantics of linking in DDL and

$\mathcal{E}$

-connections, and importing in P-DL within the Distributed First Order Logics (DFOL) framework. Our investigation shows that the domain disjointness assumption adopted by the linking approach leads to several semantic difficulties. We explore the possibility of avoiding some of these difficulties using the importing approach to linking ontology modules.

Jie Bao, Doina Caragea, Vasant G. Honavar

RS2D: Fast Adaptive Search for Semantic Web Services in Unstructured P2P Networks

In this paper, we present an approach, called RS2D v1, to adaptive probabilistic search for semantic web services in unstructured P2P networks. Each service agent dynamically learns the averaged query-answer behavior of its neighbor peers, and forwards service requests to those with minimal mixed Bayesian risk of doing so in terms of estimated semantic gain and commmunication cost. Experimental evaluation shows that the RS2D search mechanism is robust against changes in the network, and fast with reasonably high precision compared to other existing relevant approaches.

Ulrich Basters, Matthias Klusch

SADIe: Semantic Annotation for Accessibility

Visually impaired users are hindered in their efforts to access the largest repository of electronic information in the world – the World Wide Web (Web). The web is visually-centric with regard to presentation and information order / layout, this can (and does) hinder users who need presentation-agnostic access to information. Transcoding can help to make information more accessible via a restructuring of pages. We describe an approach based on annotation of web pages, encoding semantic information that can then be used by tools in order to manipulate and present web pages in a form that provides easier access to content. Annotations are made directly to style sheet information, allowing the annotation of large numbers of similar pages with little effort.

Sean Bechhofer, Simon Harper, Darren Lunn

Automatic Annotation of Web Services Based on Workflow Definitions

Semantic annotations of web services can facilitate the discovery of services, as well as their composition into workflows. At present, however, the practical utility of such annotations is limited by the small number of service annotations available for general use. Resources for manual annotation are scarce, and therefore some means is required by which services can be automatically (or semi-automatically) annotated. In this paper, we show how information can be inferred about the semantics of operation parameters based on their connections to other (annotated) operation parameters within tried-and-tested workflows. In an open-world context, we can infer only constraints on the semantics of parameters, but these so-called

loose annotations

are still of value in detecting errors within workflows, annotations and ontologies, as well as in simplifying the manual annotation task.

Khalid Belhajjame, Suzanne M. Embury, Norman W. Paton, Robert Stevens, Carole A. Goble

A Constraint-Based Approach to Horizontal Web Service Composition

The task of automatically composing Web services involves two main composition processes, vertical and horizontal composition. Vertical composition consists of defining an appropriate combination of simple processes to perform a composition task. Horizontal composition process consists of determining the most appropriate Web service, from among a set of functionally equivalent ones for each component process. Several recent research efforts have dealt with the Web service composition problem. Nevertheless, most of them tackled only the vertical composition of Web services despite the growing trend towards functionally equivalent Web services. In an attempt to facilitate and streamline the process of horizontal composition of Web services while taking the above limitation into consideration, this work includes two main contributions. The first is a generic formalization of any Web service composition problem based on a constraint optimization problem (COP); this formalization is compatible to any Web service description language. The second contribution is an incremental user-intervention-based protocol to find the optimal composite Web service according to some predefined criteria at run-time. Our goal is

) to deal with many crucial natural features of Web services such as dynamic and distributed environment, uncertain and incomplete Web service information, etc; and

) to allow human user intervention to enhance the solving process. Three approaches are described in this work, a centralized approach, a distributed approach and a multi-agent approach to deal with realistic domains.

Ahlem Ben Hassine, Shigeo Matsubara, Toru Ishida

GINO – A Guided Input Natural Language Ontology Editor

The casual user is typically overwhelmed by the formal logic of the Semantic Web. The gap between the end user and the logic-based scaffolding has to be bridged if the Semantic Web’s capabilities are to be utilized by the general public. This paper proposes that controlled natural languages offer one way to bridge the gap. We introduce GINO, a

guided input natural language ontology editor

that allows users to edit and query ontologies in a language akin to English. It uses a small static grammar, which it dynamically extends with elements from the loaded ontologies. The usability evaluation shows that GINO is well-suited for novice users when editing ontologies. We believe that the use of guided entry overcomes the

habitability problem

, which adversely affects most natural language systems. Additionally, the approach’s dynamic grammar generation allows for easy adaptation to new ontologies.

Abraham Bernstein, Esther Kaufmann

Fresnel: A Browser-Independent Presentation Vocabulary for RDF

Semantic Web browsers and other tools aimed at displaying RDF data to end users are all concerned with the same problem: presenting content primarily intended for machine consumption in a human-readable way. Their solutions differ but in the end address the same two high-level issues, no matter the underlying representation paradigm: specifying (i)

what

information contained in RDF models should be presented (content selection) and (ii)

how

this information should be presented (content formatting and styling). However, each tool currently relies on its own

ad hoc

mechanisms and vocabulary for specifying RDF presentation knowledge, making it difficult to share and reuse such knowledge across applications. Recognizing the general need for presenting RDF content to users and wanting to promote the exchange of presentation knowledge, we designed Fresnel as a browser-independent vocabulary of core RDF display concepts. In this paper we describe Fresnel’s main concepts and present several RDF browsers and visualization tools that have adopted the vocabulary so far.

Emmanuel Pietriga, Christian Bizer, David Karger, Ryan Lee

A Software Engineering Approach to Design and Development of Semantic Web Service Applications

We present a framework for designing and developing Semantic Web Service applications that span over several enterprises by applying techniques, methodologies, and notations offered by Software engineering, Web engineering, and Business Process modeling. In particular, we propose to exploit existing standards for the specification of business processes (e.g., BPMN), for modeling the cross enterprise process, combined with powerful methodologies, tools and notations (e.g., WebML) borrowed from the Web engineering field for designing and developing semantically rich Web applications, with semi-automatic elicitation of semantic descriptions (i.e., WSMO Ontologies, Goals, Web Services and Mediators) from the design of the applications, with huge advantages in terms of efficiency of the design and reduction of the extra work necessary for semantically annotating the information crossing the organization boundaries.

Marco Brambilla, Irene Celino, Stefano Ceri, Dario Cerizza, Emanuele Della Valle, Federico Michele Facca

A Model Driven Approach for Building OWL DL and OWL Full Ontologies

This paper presents an approach for visually modeling OWL DL and OWL Full ontologies based on the well-established visual modeling language UML. We discuss a metamodel for OWL based on the Meta-Object Facility, an associated UML profile as visual syntax, and transformations between both. The work we present supports model-driven development of OWL ontologies and is currently undergoing the standardization process of the Object Management Group. After describing our approach, we present the implementation of our approach and an example, showing how the metamodel and UML profile can be used to improve developing Semantic Web applications.

Saartje Brockmans, Robert M. Colomb, Peter Haase, Elisa F. Kendall, Evan K. Wallace, Chris Welty, Guo Tong Xie

IRS-III: A Broker for Semantic Web Services Based Applications

In this paper we describe IRS-III which takes a semantic broker based approach to creating applications from Semantic Web Services by mediating between a service requester and one or more service providers. Business organisations can view Semantic Web Services as the basic mechanism for integrating data and processes across applications on the Web. This paper extends previous publications on IRS by providing an overall description of our framework from the point of view of application development. More specifically, we describe the IRS-III methodology for building applications using Semantic Web Services and illustrate our approach through a use case on e-government.

Liliana Cabral, John Domingue, Stefania Galizia, Alessio Gugliotta, Vlad Tanasescu, Carlos Pedrinaci, Barry Norton

Provenance Explorer – Customized Provenance Views Using Semantic Inferencing

This paper presents

Provenance Explorer

, a secure provenance visualization tool, designed to dynamically generate customized views of scientific data provenance that depend on the viewer’s requirements and/or access privileges. Using RDF and graph visualizations, it enables scientists to view the data, states and events associated with a scientific workflow in order to understand the scientific methodology and validate the results. Initially the Provenance Explorer presents a simple, coarse-grained view of the scientific process or experiment. However the GUI allows permitted users to expand links between nodes (input states, events and output states) to reveal more fine-grained information about particular sub-events and their inputs and outputs. Access control is implemented using Shibboleth to identify and authenticate users and XACML to define access control policies. The system also provides a platform for publishing scientific results. It enables users to select particular nodes within the visualized workflow and drag-and-drop them into an RDF package for publication or e-learning. The direct relationships between the individual components selected for such packages are inferred by the rule-inference engine.

Kwok Cheung, Jane Hunter

On How to Perform a Gold Standard Based Evaluation of Ontology Learning

In recent years several measures for the gold standard based evaluation of ontology learning were proposed. They can be distinguished by the layers of an ontology (e.g. lexical term layer and concept hierarchy) they evaluate. Judging those measures with a list of criteria we show that there exist some measures sufficient for evaluating the lexical term layer. However, existing measures for the evaluation of concept hierarchies fail to meet basic criteria. This paper presents a new taxonomic measure which overcomes the problems of current approaches.

Klaas Dellschaft, Steffen Staab

Characterizing the Semantic Web on the Web

Semantic Web languages are being used to represent, encode and exchange

semantic

data in many contexts beyond the Web – in databases, multiagent systems, mobile computing, and ad hoc networking environments. The core paradigm, however, remains what we call the

Web aspect

of the Semantic Web – its use by independent and distributed agents who publish and consume data on the World Wide Web. To better understand this central use case, we have harvested and analyzed a collection of Semantic Web documents from an estimated ten million available on the Web. Using a corpus of more than 1.7 million documents comprising over 300 million RDF triples, we describe a number of global metrics, properties and usage patterns. Most of the metrics, such as the size of Semantic Web documents and the use frequency of Semantic Web terms, were found to follow a power law distribution.

Li Ding, Tim Finin

MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data

The goal of the work presented in this paper is to obtain large amounts of semistructured data from the web. Harvesting semistructured data is a prerequisite to enabling large-scale query answering over web sources. We contrast our approach to conventional web crawlers, and describe and evaluate a five-step pipelined architecture to crawl and index data from both the traditional and the Semantic Web.

Andreas Harth, Jürgen Umbrich, Stefan Decker

/facet: A Browser for Heterogeneous Semantic Web Repositories

Facet browsing has become popular as a user friendly interface to data repositories. The Semantic Web raises new challenges due to the heterogeneous character of the data. First, users should be able to select and navigate through facets of resources of any type and to make selections based on properties of other, semantically related, types. Second, where traditional facet browsers require manual configuration of the software, a semantic web browser should be able to handle any RDFS dataset without any additional configuration. Third, hierarchical data on the semantic web is not designed for browsing: complementary techniques, such as search, should be available to overcome this problem. We address these requirements in our browser, /facet. Additionally, the interface allows the inclusion of facet-specific display options that go beyond the hierarchical navigation that characterizes current facet browsing. /facet is a tool for Semantic Web developers as an instant interface to their complete dataset. The automatic facet configuration generated by the system can then be further refined to configure it as a tool for end users. The implementation is based on current Web standards and open source software. The new functionality is motivated using a scenario from the cultural heritage domain.

Michiel Hildebrand, Jacco van Ossenbruggen, Lynda Hardman

Using Ontologies for Extracting Product Features from Web Pages

In this paper, we show how to use ontologies to bootstrap a knowledge acquisition process that extracts product information from tabular data on Web pages. Furthermore, we use logical rules to reason about product specific properties and to derive higher-order knowledge about product features. We will also explain the knowledge acquisition process, covering both ontological and procedural aspects. Finally, we will give an qualitative and quantitative evaluation of our results.

Wolfgang Holzinger, Bernhard Krüpl, Marcus Herzog

Block Matching for Ontologies

Ontology matching is a crucial task to enable interoperation between Web applications using different but related ontologies. Today, most of the ontology matching techniques are targeted to find 1:1 mappings. However, block mappings are in fact more pervasive. In this paper, we discuss the block matching problem and suggest that both the mapping quality and the partitioning quality should be considered in block matching. We propose a novel partitioning-based approach to address the block matching issue. It considers both linguistic and structural characteristics of domain entities based on virtual documents, and uses a hierarchical bisection algorithm for partitioning. We set up two kinds of metrics to evaluate of the quality of block matching. The experimental results demonstrate that our approach is feasible.

Wei Hu, Yuzhong Qu

A Relaxed Approach to RDF Querying

We explore flexible querying of RDF data, with the aim of making it possible to return data satisfying query conditions with varying degrees of exactness, and also to rank the results of a query depending on how “closely” they satisfy the query conditions. We make queries more flexible by logical relaxation of their conditions based on RDFS entailment and RDFS ontologies. We develop a notion of ranking of query answers, and present a query processing algorithm for incrementally computing the relaxed answer of a query. Our approach has application in scenarios where there is a lack of understanding of the ontology underlying the data, or where the data objects have heterogeneous sets of properties or irregular structures.

Carlos A. Hurtado, Alexandra Poulovassilis, Peter T. Wood

Mining Information for Instance Unification

Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for person names. The approach combines the use of citation information (i.e., abstract, initials, titles and co-authorship information) with web mining, in order to gather additional evidence for the instance unification algorithm. The method is evaluated on two datasets – one from the BT digital library and one used in previous work on name disambiguation. The results show that the information mined from the web contributes substantially towards the successful handling of highly ambiguous cases which lowered the performance of previous methods.

Niraj Aswani, Kalina Bontcheva, Hamish Cunningham

The Summary Abox: Cutting Ontologies Down to Size

Reasoning on OWL ontologies is known to be intractable in the worst-case, which is a serious problem because in practice, most OWL ontologies have large Aboxes, i.e., numerous assertions about individuals and their relations. We propose a technique that uses a summary of the ontology (

summary Abox

) to reduce reasoning to a small subset of the original Abox, and prove that our techniques are sound and complete. We demonstrate the scalability of this technique for consistency detection in 4 ontologies, the largest of which has 6.5 million role assertions.

Achille Fokoue, Aaron Kershenbaum, Li Ma, Edith Schonberg, Kavitha Srinivas

Semantic Metadata Generation for Large Scientific Workflows

In recent years, workflows have been increasingly used in scientific applications. This paper presents novel metadata reasoning capabilities that we have developed to support the creation of large workflows. They include 1) use of semantic web technologies in handling metadata constraints on file collections and nested file collections, 2) propagation and validation of metadata constraints from inputs to outputs in a workflow component, and through the links among components in a workflow, and 3) sub-workflows that generate metadata needed for workflow creation. We show how we used these capabilities to support the creation of large executable workflows in an earthquake science application with more than 7,000 jobs, generating metadata for more than 100,000 new files.

Jihie Kim, Yolanda Gil, Varun Ratnakar

Reaching Agreement over Ontology Alignments

When agents communicate, they do not necessarily use the same vocabulary or ontology. For them to interact successfully, they must find correspondences (mappings) between the terms used in their respective ontologies. While many proposals for matching two agent ontologies have been presented in the literature, the resulting alignment may not be satisfactory to both agents, and thus may necessitate additional negotiation to identify a mutually agreeable set of correspondences.

We propose an approach for supporting the creation and exchange of different arguments, that support or reject possible correspondences. Each agent can decide, according to its preferences, whether to accept or refuse a candidate correspondence. The proposed framework considers arguments and propositions that are specific to the matching task and are based on the ontology semantics. This argumentation framework relies on a formal argument manipulation schema and on an encoding of the agents’ preferences between particular kinds of arguments. Whilst the former does not vary between agents, the latter depends on the interests of each agent. Thus, this approach distinguishes clearly between alignment rationales which are valid for all agents and those specific to a particular agent.

Loredana Laera, Valentina Tamma, Jérôme Euzenat, Trevor Bench-Capon, Terry Payne

A Formal Model for Semantic Web Service Composition

Automated composition of Web services or the process of forming new value added Web services is one of the most promising challenges in the semantic Web service research area. Semantics is one of the key elements for the automated composition of Web services because such a process requires rich machine-understandable descriptions of services that can be shared. Semantics enables Web service to describe their capabilities and processes, nevertheless there is still some work to be done. Indeed Web services described at functional level need a formal context to perform the automated composition of Web services. The suggested model (i.e., Causal link matrix) is a necessary starting point to apply problem-solving techniques such as regression-based search for Web service composition. The model supports a semantic context in order to find a correct, complete, consistent and optimal plan as a solution. In this paper an innovative and formal model for an AI planning-oriented composition is presented.

Freddy Lécué, Alain Léger

Evaluating Conjunctive Triple Pattern Queries over Large Structured Overlay Networks

We study the problem of evaluating conjunctive queries composed of triple patterns over RDF data stored in distributed hash tables. Our goal is to develop algorithms that scale to large amounts of RDF data, distribute the query processing load evenly and incur little network traffic. We present and evaluate two novel query processing algorithms with these possibly conflicting goals in mind. We discuss the various tradeoffs that occur in our setting through a detailed experimental evaluation of the proposed algorithms.

Erietta Liarou, Stratos Idreos, Manolis Koubarakis

PowerMap: Mapping the Real Semantic Web on the Fly

Ontology mapping plays an important role in bridging the semantic gap between distributed and heterogeneous data sources. As the Semantic Web slowly becomes real and the amount of online semantic data increases, a new generation of tools is developed that automatically find and integrate this data. Unlike in the case of earlier tools where mapping has been performed at the design time of the tool, these new tools require mapping techniques that can be performed at run time. The contribution of this paper is twofold. First, we investigate the general requirements for run time mapping techniques. Second, we describe our PowerMap mapping algorithm that was designed to be used at run-time by an ontology based question answering tool.

Vanessa Lopez, Marta Sabou, Enrico Motta

Ontology-Driven Information Extraction with OntoSyphon

The Semantic Web’s need for machine understandable content has led researchers to attempt to automatically acquire such content from a number of sources, including the web. To date, such research has focused on “document-driven” systems that individually process a small set of documents, annotating each with respect to a given ontology. This paper introduces OntoSyphon, an alternative that strives to more fully leverage existing ontological content while scaling to extract comparatively shallow content from millions of documents. OntoSyphon operates in an “ontology-driven” manner: taking any ontology as input, OntoSyphon uses the ontology to specify web searches that identify possible semantic instances, relations, and taxonomic information. Redundancy in the web, together with information from the ontology, is then used to automatically verify these candidate instances and relations, enabling OntoSyphon to operate in a fully automated, unsupervised manner. A prototype of OntoSyphon is fully implemented and we present experimental results that demonstrate substantial instance learning in a variety of domains based on independently constructed ontologies. We also introduce new methods for improving instance verification, and demonstrate that they improve upon previously known techniques.

Luke K. McDowell, Michael Cafarella

Ontology Query Answering on Databases

With the fast development of Semantic Web, more and more RDF and OWL ontologies are created and shared. The effective management, such as storage, inference and query, of these ontologies on databases gains increasing attention. This paper addresses ontology query answering on databases by means of Datalog programs. Via epistemic operators, integrity constraints are introduced, and used for conveying semantic aspects of OWL that are not covered by Datalog-style rule languages. We believe such a processing suitable to capture ontologies in the database flavor, while keeping reasoning tractable. Here, we present a logically equivalent knowledge base whose (sound and complete) inference system appears as a Datalog program. As such, SPARQL query answering on OWL ontologies could be solved in databases. Bi-directional strategies, taking advantage of both forward and backward chaining, are then studied to support this kind of customized Datalog programs, returning exactly answers to the query within our logical framework.

Jing Mei, Li Ma, Yue Pan

Formal Model for Ontology Mapping Creation

In a semantic environment data is described by ontologies and heterogeneity problems have to be solved at the ontological level. This means that alignments between ontologies have to be created, most probably during design-time, and used in various run-time processes. Such alignments describe a set of mappings between the source and target ontologies, where the mappings show how instance data from one ontology can be expressed in terms of another ontology. We propose a formal model for mapping creation. Starting from this model we explore how such a model maps onto a design-time graphical tool that can be used in creating alignments between ontologies. We also investigate how such a model helps in expressing the mappings in a logical language, based on the semantic relationships identified using the graphical tool.

Adrian Mocan, Emilia Cimpian, Mick Kerrigan

A Semantic Context-Aware Access Control Framework for Secure Collaborations in Pervasive Computing Environments

Wireless connectivity and widespread diffusion of portable devices offer novel opportunities for users to share resources anywhere and anytime, and to form ad-hoc coalitions. Resource access control is crucial to leverage these ad-hoc collaborations. In pervasive scenarios, however, collaborating entities cannot be predetermined and resource availability frequently varies, even unpredictably, due to user/device mobility, thus complicating resource access control. Access control policies cannot be defined based on entity’s identities/roles, as in traditional access control solutions, or be specified a priori to face any operative run time condition, but require continuous adjustments to adapt to the current situation. To address these issues, this paper advocates the adoption of novel access control policy models that follow two main design guidelines:

context-awareness

to control resource access on the basis of context visibility and to enable dynamic adaptation of policies depending on context changes, and

semantic technologies

for context/policy specification to allow high-level description and reasoning about context and policies. The paper also describes the design of a semantic context-aware policy model that adopts ontologies and rules to express context and context-aware access control policies and supports policy adaptation.

Alessandra Toninelli, Rebecca Montanari, Lalana Kagal, Ora Lassila

Extracting Relations in Social Networks from the Web Using Similarity Between Collective Contexts

Social networks have recently garnered considerable interest. With the intention of utilizing social networks for the Semantic Web, several studies have examined automatic extraction of social networks. However, most methods have addressed extraction of the strength of relations. Our goal is extracting the underlying relations between entities that are embedded in social networks. To this end, we propose a method that automatically extracts labels that describe relations among entities. Fundamentally, the method clusters similar entity pairs according to their collective contexts in Web documents. The descriptive labels for relations are obtained from results of clustering. The proposed method is entirely unsupervised and is easily incorporated into existing social network extraction methods. Our method also contributes to ontology population by elucidating relations between instances in social networks. Our experiments conducted on entities in political social networks achieved clustering with high precision and recall. We extracted appropriate relation labels to represent the entities.

Junichiro Mori, Takumi Tsujishita, Yutaka Matsuo, Mitsuru Ishizuka

Can OWL and Logic Programming Live Together Happily Ever After?

Logic programming (LP) is often seen as a way to overcome several shortcomings of the Web Ontology Language (OWL), such as the inability to model integrity constraints or perform closed-world querying. However, the open-world semantics of OWL seems to be fundamentally incompatible with the closed-world semantics of LP. This has sparked a heated debate in the Semantic Web community, resulting in proposals for alternative ontology languages based entirely on logic programming. To help resolving this debate, we investigate the practical use cases which seem to be addressed by logic programming. In fact, many of these requirements have already been addressed outside the Semantic Web. By drawing inspiration from these existing formalisms, we present a novel logic of

hybrid MKNF knowledge bases

, which seamlessly integrates OWL with LP. We are thus capable of addressing the identified use cases without a radical change in the architecture of the Semantic Web.

Boris Motik, Ian Horrocks, Riccardo Rosati, Ulrike Sattler

Innovation Detection Based on User-Interest Ontology of Blog Community

Recently, the use of blogs has been a remarkable means to publish user interests. In order to find suitable information resources from a large amount of blog entries which are published every day, we need an information filtering technique to automatically transcribe user interests to a user profile in detail. In this paper, we first classify user blog entries into service domain ontologies and extract interest ontologies that express a user’s interests semantically as a hierarchy of classes according to interest weight by a top-down approach. Next, with a bottom-up approach, users modify their interest ontologies to update their interests in more detail. Furthermore, we propose a similarity measurement between ontologies considering the interest weight assigned to each class and instance. Then, we detect innovative blog entries that include concepts that the user has not thought about in the past based on the analysis of approximated ontologies of a user’s interests. We present experimental results that demonstrate the performance of our proposed methods using a large-scale blog entries and music domain ontologies.

Makoto Nakatsuji, Yu Miyoshi, Yoshihiro Otsuka

Modeling Social Attitudes on the Web

This paper argues that in order to allow for the representation, comparison and assessment of possibly controversial or uncertain information on the web, the semantic web effort requires capabilities for the social reasoning about web ontologies and other information acquired from multiple heterogeneous sources. As an approach to this, we propose formal means for the representation of possibly controversial opinions of groups and individuals, and of several other social attitudes regarding information on the web. Doing so, we integrate concepts from distributed artificial intelligence with approaches to web semantics, aiming for a

social semantics

of web content.

Matthias Nickles

A Framework for Ontology Evolution in Collaborative Environments

With the wider use of ontologies in the Semantic Web and as part of production systems, multiple scenarios for ontology maintenance and evolution are emerging. For example, successive ontology versions can be posted on the (Semantic) Web, with users discovering the new versions serendipitously; ontology-development in a collaborative environment can be synchronous or asynchronous; managers of projects may exercise quality control, examining changes from previous baseline versions and accepting or rejecting them before a new baseline is published, and so on. In this paper, we present different scenarios for ontology maintenance and evolution that we have encountered in our own projects and in those of our collaborators. We define several features that categorize these scenarios. For each scenario, we discuss the high-level tasks that an editing environment must support. We then present a unified comprehensive set of tools to support different scenarios in a single framework, allowing users to switch between different modes easily.

Natalya F. Noy, Abhita Chugh, William Liu, Mark A. Musen

Extending Faceted Navigation for RDF Data

Data on the Semantic Web is semi-structured and does not follow one fixed schema. Faceted browsing [23] is a natural technique for navigating such data, partitioning the information space into orthogonal conceptual dimensions. Current faceted interfaces are manually constructed and have limited query expressiveness. We develop an expressive faceted interface for semi-structured data and formally show the improvement over existing interfaces. Secondly, we develop metrics for automatic ranking of facet quality, bypassing the need for manual construction of the interface. We develop a prototype for faceted navigation of arbitrary RDF data. Experimental evaluation shows improved usability over current interfaces.

Eyal Oren, Renaud Delbru, Stefan Decker

Reducing the Inferred Type Statements with Individual Grouping Constructs

A common approach for reasoning is to compute the deductive closure of an ontology using the rules specified and to work on the closure at query time. This approach reduces the run time complexity but increases the space requirements. The main reason of this increase is the type and subclass statements in the ontology. Type statements show a significant percentage in most ontologies. Since subclass is a transitive property, derivation of other statements, in particular type statements relying on it, gives rise to cyclic repetition and an excess of inferred type statements. In brief, a major part of closure computation is deriving the type statements relying on subclass statements. In this paper, we propose a syntactic transformation that is based on novel individual grouping constructs. This transformation reduces the number of inferred type statements relying on subclass relations. Thus, the space requirement of reasoning is reduced without affecting the soundness and the completeness.

Övünç Öztürk, Tuğba Özacar, Murat Osman Ünalır

A Framework for Schema-Driven Relationship Discovery from Unstructured Text

We address the issue of extracting implicit and explicit relationships between entities in biomedical text. We argue that entities seldom occur in text in their simple form and that relationships in text relate the modified, complex forms of entities with each other. We present a rule-based method for (1) extraction of such complex entities and (2) relationships between them and (3) the conversion of such relationships into RDF. Furthermore, we present results that clearly demonstrate the utility of the generated RDF in discovering knowledge from text corpora by means of locating paths composed of the extracted relationships.

Cartic Ramakrishnan, Krys J. Kochut, Amit P. Sheth

Web Service Composition Via Generic Procedures and Customizing User Preferences

We claim that user preferences are a key component of Web service composition – a component that has largely been ignored. In this paper we propose a means of specifying and intergrating user preferences into Web service composition. To this end, we propose a means of performing automated Web service composition by exploiting generic procedures together with rich qualitative user preferences. We exploit the agent programming language Golog to represent our generic procedures and a first-order preference language to represent rich qualitative temporal user preferences. From these we generate Web service compositions that realize the generic procedure, satisfying the user’s hard constraints and optimizing for the user’s preferences. We prove our approach sound and optimal. Our system, GologPref, is implemented and interacting with services on the Web. The language and techniques proposed in this paper can be integrated into a variety of approaches to Web or Grid service composition.

Shirin Sohrabi, Nataliya Prokoshyna, Sheila A. McIlraith

Querying the Semantic Web with Preferences

Ranking is an important concept to avoid empty or overfull and unordered result sets. However, such scoring can only express total orders, which restricts its usefulness when several factors influence result relevance. A more flexible way to express relevance is the notion of preferences. Users state which kind of answers they ‘prefer’ by adding soft constraints to their queries.

Current approaches in the Semantic Web offer only limited facilities for specification of scoring and result ordering. There is no common language element to express and formalize ranking and preferences. We present a comprehensive extension of SPARQL which directly supports the expression of preferences. This includes formal syntax and semantics of preference expressions for SPARQL. Additionally, we report our implementation of preference query processing, which is based on the ARQ query engine.

Wolf Siberski, Jeff Z. Pan, Uwe Thaden

ONTOCOM: A Cost Estimation Model for Ontology Engineering

The technical challenges associated with the development and deployment of ontologies have been subject to a considerable number of research initiatives since the beginning of the nineties. The economical aspects of these processes are, however, still poorly exploited, impeding the dissemination of ontology-driven technologies beyond the boundaries of the academic community. This paper aims at contributing to the alleviation of this situation by proposing

ONTOCOM

(

Onto

logy

odel), a model to predict the costs arising in ontology engineering processes. We introduce a methodology to generate a cost model adapted to a particular ontology development strategy, and an inventory of cost drivers which influence the amount of effort invested in activities performed during an ontology life cycle. We further present the results of the model validation procedure, which covered an expert-driven evaluation and a statistical calibration on 36 data points collected from real-world projects. The validation revealed that ontology engineering processes have a high learning rate, indicating that the building of very large ontologies is feasible from an economic point of view. Moreover, the complexity of ontology evaluation, domain analysis and conceptualization activities proved to have a major impact on the final ontology engineering process duration.

Elena Paslaru Bontas Simperl, Christoph Tempich, York Sure

Tree-Structured Conditional Random Fields for Semantic Annotation

The large volume of web content needs to be annotated by ontologies (called Semantic Annotation), and our empirical study shows that strong dependencies exist across different types of information (it means that identification of one kind of information can be used for identifying the other kind of information). Conditional Random Fields (CRFs) are the state-of-the-art approaches for modeling the dependencies to do better annotation. However, as information on a Web page is not necessarily linearly laid-out, the previous linear-chain CRFs have their limitations in semantic annotation. This paper is concerned with semantic annotation on hierarchically dependent data (

hierarch-ical semantic annotation

). We propose a Tree-structured Conditional Random Field (TCRF) model to better incorporate dependencies across the hierarchic-ally laid-out information. Methods for performing the tasks of model-parameter estimation and annotation in TCRFs have been proposed. Experimental results indicate that the proposed TCRFs for hierarchical semantic annotation can significantly outperform the existing linear-chain CRF model.

Jie Tang, Mingcai Hong, Juanzi Li, Bangyong Liang

Framework for an Automated Comparison of Description Logic Reasoners

OWL is an ontology language developed by the W3C, and although initially developed for the Semantic Web, OWL has rapidly become a de facto standard for ontology development in general. The design of OWL was heavily influenced by research in description logics, and the specification includes a formal semantics. One of the goals of this formal approach was to provide interoperability: different OWL reasoners should provide the same results when processing the same ontologies. In this paper we present a system that allows users: (a) to test and compare OWL reasoners using an extensible library of real-life ontologies; (b) to check the “correctness” of the reasoners by comparing the computed class hierarchy; (c) to compare the performance of the reasoners when performing this task; and (d) to use SQL queries to analyse and present the results in any way they see fit.

Tom Gardiner, Dmitry Tsarkov, Ian Horrocks

Integrating and Querying Parallel Leaf Shape Descriptions

Information integration and retrieval have been important problems for many information systems — it is hard to combine new information with any other piece of related information we already possess, and to make them both available for application queries. Many ontology-based applications are still cautious about integrating and retrieving information from natural language (NL) documents, preferring structured or semi-structured sources. In this paper, we investigate how to use ontologies to facilitate integrating and querying information on parallel leaf shape descriptions from NL documents. Our approach takes advantage of ontologies to precisely represent the semantics in shape description, to integrates parallel descriptions according to their semantic distances, and to answer shape-related species identification queries. From this highly specialised domain, we learn a set of more general methodological rules, which could be useful in other domains.

Shenghui Wang, Jeff Z. Pan

A Survey of the Web Ontology Landscape

We survey nearly 1300 OWL ontologies and RDFS schemas. The collection of statistical data allows us to perform analysis and report some trends. Though most of the documents are syntactically OWL Full, very few stay in OWL Full when they are syntactically patched by adding type triples. We also report the frequency of occurrences of OWL language constructs and the shape of class hierarchies in the ontologies. Finally, we note that of the largest ontologies surveyed here, most do not exceed the description logic expressivity of

$\mathcal{ALC}$

Taowei David Wang, Bijan Parsia, James Hendler

CropCircles: Topology Sensitive Visualization of OWL Class Hierarchies

OWL ontologies present many interesting visualization challenges. Here we present CropCircles, a technique designed to view the class hierarchies in ontologies as trees. We place special emphasis on topology understanding when designing the tool. We drew inspiration from treemaps, but made substantial changes in the representation and layout. Most notably, the spacefillingness of treemap is relaxed in exchange for visual clarity. We outline the problem scape of visualizing ontology hierarchies, note the requirements that go into the design of the tool, and discuss the interface and implementation. Finally, through a controlled experiment involving tasks common to understanding ontologies, we show the benefits of our design.

Taowei David Wang, Bijan Parsia

Towards Knowledge Acquisition from Information Extraction

In our research to use information extraction to help populate the semantic web, we have encountered significant obstacles to interoperability between the technologies. We believe these obstacles to be endemic to the basic paradigms, and not quirks of the specific implementations we have worked with. In particular, we identify five dimensions of interoperability that must be addressed to successfully populate semantic web knowledge bases from information extraction systems that are

suitable for reasoning.

We call the task of transforming IE data into knowledge-bases

knowledge integration

, and briefly present a framework called KITE in which we are exploring these dimensions. Finally, we report on the initial results of an experiment in which the knowledge integration process uses the deeper semantics of OWL ontologies to improve the precision of relation extraction from text.

Chris Welty, J. William Murdock

A Method for Learning Part-Whole Relations

Part-whole relations are important in many domains, but typically receive less attention than subsumption relation. In this paper we describe a method for finding part-whole relations. The method consists of two steps: (i) finding phrase patterns for both explicit and implicit part-whole relations, and (ii) applying these patterns to find part-whole relation instances. We show results of applying this method to a domain of finding sources of carcinogens.

Willem Robert van Hage, Hap Kolb, Guus Schreiber

Semantic Web in Use

OntoWiki – A Tool for Social, Semantic Collaboration

We present OntoWiki, a tool providing support for agile, distributed knowledge engineering scenarios. OntoWiki facilitates the visual presentation of a knowledge base as an information map, with different views on instance data. It enables intuitive authoring of semantic content, with an inline editing mode for editing RDF content, similar to WYSIWYG for text documents. It fosters social collaboration aspects by keeping track of changes, allowing to comment and discuss every single part of a knowledge base, enabling to rate and measure the popularity of content and honoring the activity of users. Ontowiki enhances the browsing and retrieval by offering semantic enhanced search strategies. All these techniques are applied with the ultimate goal of decreasing the entrance barrier for projects and domain experts to collaborate using semantic technologies. In the spirit of the Web 2.0 OntoWiki implements an ”architecture of participation” that allows users to add value to the application as they use it. It is available as open-source software and a demonstration platform can be accessed at http://3ba.se.

Sören Auer, Sebastian Dietzold, Thomas Riechert

Towards a Semantic Web of Relational Databases: A Practical Semantic Toolkit and an In-Use Case from Traditional Chinese Medicine

Integrating relational databases is recently acknowledged as an important vision of the Semantic Web research, however there are not many well-implemented tools and not many applications that are in large-scale real use either. This paper introduces the Dartgrid which is an application development framework together with a set of semantic tools to facilitate the integration of heterogenous relational databases using semantic web technologies. For examples, DartMapping is a visualized mapping tool to help DBA in defining semantic mappings from heterogeneous relational schemas to ontologies. DartQuery is an ontology-based query interface helping user to construct semantic queries, and capable of rewriting SPARQL semantic queries to a set of SQL queries. DartSearch is an ontology-based search engine enabling user to make full-text search over all databases and to navigate across the search results semantically. It is also enriched with a concept ranking mechanism to enable user to find more accurate and reliable results. This toolkit has been used to develop an currently in-use application for China Academy of Traditional Chinese Medicine (CATCM). In this application, over 70 legacy relational databases are semantically interconnected by an ontology with over 70 classes and 800 properties, providing integrated semantic-enriched query, search and navigation services to TCM communities.

Huajun Chen, Yimin Wang, Heng Wang, Yuxin Mao, Jinmin Tang, Cunyin Zhou, Ainin Yin, Zhaohui Wu

Information Integration Via an End-to-End Distributed Semantic Web System

A distributed, end-to-end information integration system that is based on the Semantic Web architecture is of considerable interest to both commercial and government organizations. However, there are a number of challenges that have to be resolved to build such a system given the currently available Semantic Web technologies. We describe here the ISENS prototype system we designed, implemented, and tested (on a small scale) to address this problem. We discuss certain system limitations (some coming from underlying technologies used) and future ISENS development to resolve them and to enable an extended set of capabilities.

Dimitre A. Dimitrov, Jeff Heflin, Abir Qasem, Nanbor Wang

NEWS: Bringing Semantic Web Technologies into News Agencies

In the current Information Society, being informed is a basic necessity. As one of the main news bussiness actors, news agencies are required to provide fresh, relevant, high-quality information to their customers. Dealing with this requirement is not an easy task, but, as partners of the NEWS (News Engine Web Services) project, we believe that the usage of Semantic Web technologies could help news agencies in achieving that objective. In this paper we will describe the aims and main achievements of the NEWS project, that was just completed.

Norberto Fernández, José M. Blázquez, Jesús A. Fisteus, Luis Sánchez, Michael Sintek, Ansgar Bernardi, Manuel Fuentes, Angelo Marrara, Zohar Ben-Asher

Semantically-Enabled Large-Scale Science Data Repositories

Large heterogeneous online repositories of scientific information have the potential to change the way science is done today. In order for this potential to be realized, numerous challenges must be addressed concerning access to and interoperability of the online scientific data. In our work, we are using semantic web technologies to improve access and interoperability by providing a framework for collaboration and a basis for building and distributing advanced data simulation tools. Our initial scientific focus area is the solar terrestrial physics community. In this paper, we will present our work on the Virtual Solar Terrestrial Observatory (VSTO). We will present the emerging trend of the virtual observatory – a virtual integrated evolving scientific data repository – and describe the general use case and our semantically-enabled architecture. We will also present our specific implementation and describe the benefits of the semantic web in this setting. Further, we speculate on the future of the growing adoption of semantic technologies in this important application area of scientific cyberinfrastructure and semantically enabled scientific data repositories.

Peter Fox, Deborah McGuinness, Don Middleton, Luca Cinquini, J. Anthony Darnell, Jose Garcia, Patrick West, James Benedict, Stan Solomon

Construction and Use of Role-Ontology for Task-Based Service Navigation System

We have been developing a task-based service navigation system that offers to the user for his selected services relevant to the task the user wants to perform. We observed that the tasks likely to be performed in a given situation depend on the user’s role such as businessman or father. To further our research, we constructed a role-ontology and utilized it to improve the usability of task-based service navigation. We have enhanced a basic task-model by associating tasks with role-concepts defined in the new role-ontology. We can generate a task-list that is precisely tuned to the user’s current role. In addition, we can generate a personalized task-list from the task-model based on the user’s task selection history. Because services are associated with tasks, our approach makes it much easier to navigate a user to the most appropriate services. In this paper, we describe the construction of our role-ontology and the task-based service navigation system based on the role-ontology.

Yusuke Fukazawa, Takefumi Naganuma, Kunihiro Fujii, Shoji Kurakake

Enabling an Online Community for Sharing Oral Medicine Cases Using Semantic Web Technologies

This paper describes how Semantic Web technologies have been used in an online community for knowledge sharing between clinicians in oral medicine in Sweden. The main purpose of this community is to serve as repository of interesting and difficult cases, and as a support for monthly teleconferences. All information regarding users, meetings, news, and cases is stored in RDF. The community was built using the Struts framework and Jena was used for interacting with RDF.

Marie Gustafsson, Göran Falkman, Fredrik Lindahl, Olof Torgersson

EKOSS: A Knowledge-User Centered Approach to Knowledge Sharing, Discovery, and Integration on the Semantic Web

The scientific enterprise depends on the effective transfer of knowledge from creator to user. Recently the rate of scientific knowledge production is overwhelming the ability for researchers to process it. Semantic web technologies may help to handle this vast amount of scientific knowledge. However, automatic computerized techniques that extract semantics from natural language text for use in matching with the requests of knowledge seekers achieve only mediocre results. Clearly, semantic descriptions of expert knowledge that are constructed by the knowledge creators themselves will be more accurate. We report an approach and software implementation of a knowledge sharing platform based on semantic web technologies, called EKOSS for expert knowledge ontology-based semantic search, that helps knowledge creators construct semantic descriptions of their knowledge. The EKOSS system enables knowledge creators to construct computer-interpretable semantically rich statements describing their knowledge with minimal effort and without any knowledge of semantic web technologies.

Steven Kraines, Weisen Guo, Brian Kemper, Yutaka Nakamura

Ontogator — A Semantic View-Based Search Engine Service for Web Applications

View-based search provides a promising paradigm for formulating complex semantic queries and representing results on the Semantic Web. A challenge for the application of the paradigm is the complexity of providing view-based search services through application programming interfaces (API) and web services. This paper presents a solution on how semantic view-based search can be provided efficiently through an API or as web service to external applications. The approach has been implemented as the open source tool Ontogator, that has been applied successfully in several practical semantic portals on the web.

Eetu Mäkelä, Eero Hyvönen, Samppa Saarela

Explaining Conclusions from Diverse Knowledge Sources

The ubiquitous non-semantic web includes a vast array of unstructured information such as HTML documents. The semantic web provides more structured knowledge such as hand-built ontologies and semantically aware databases. To leverage the full power of both the semantic and non-semantic portions of the web, software systems need to be able to reason over both kinds of information. Systems that use both structured and unstructured information face a significant challenge when trying to convince a user to believe their results: the sources

and

the kinds of reasoning that are applied to the sources are radically different in their nature and their reliability. Our work aims at explaining conclusions derived from a combination of structured and unstructured sources. We present our solution that provides an infrastructure capable of encoding justifications for conclusions in a single format. This integration provides an end-to-end description of the knowledge derivation process including access to text or HTML documents, descriptions of the analytic processes used for extraction, as well as descriptions of the ontologies and many kinds of information manipulation processes, including standard deduction. We produce unified traces of extraction and deduction processes in the Proof Markup Language (PML), an OWL-based formalism for encoding provenance for inferred information. We provide a browser for exploring PML and thus enabling a user to understand how some conclusion was reached.

J. William Murdock, Deborah L. McGuinness, Paulo Pinheiro da Silva, Chris Welty, David Ferrucci

A Mixed Initiative Semantic Web Framework for Process Composition

Semantic Web technologies offer the prospect of significantly reducing the amount of effort required to integrate existing enterprise functionality in support of new composite processes.– whether within a given organization or across multiple ones. A significant body of work in this area has aimed to fully automate this process, while assuming that all functionality has already been encapsulated in the form of semantic web services with rich and accurate annotations. In this article, we argue that this assumption is often unrealistic. Instead, we describe a mixed initiative framework for semantic web service discovery and composition that aims at flexibly interleaving human decision making and automated functionality in environments where annotations may be incomplete and even inconsistent. An initial version of this framework has been implemented in SAP’s Guided Procedures, a key element of SAP’s Enterperise Service Architecture (ESA).

Jinghai Rao, Dimitar Dimitrov, Paul Hofmann, Norman Sadeh

Semantic Desktop 2.0: The Gnowsis Experience

In this paper we present lessons learned from building a Semantic Desktop system, the gnowsis beta. On desktop computers, semantic software has to provide stable services and has to reflect the personal view of the user. Our approach to ontologies, the

Personal Information Model

PIMO allows to create tagging services like del.icio.us on the desktop. A semantic wiki allows further annotations. Continuous evaluations of the system helped to improve it. These results were created in the EPOS research project and are available in the open source projects Aperture, kaukoluwiki, and gnowsis and will be continued in the Nepomuk project. By using these components, other developers can create new desktop applications the web 2.0 way.

Leo Sauermann, Gunnar Aastrand Grimnes, Malte Kiesel, Christiaan Fluit, Heiko Maus, Dominik Heim, Danish Nadeem, Benjamin Horak, Andreas Dengel

Towards Semantic Interoperability in a Clinical Trials Management System

Clinical trials are studies in human patients to evaluate the safety and effectiveness of new therapies. Managing a clinical trial from its inception to completion typically involves multiple disparate applications facilitating activities such as trial design specification, clinical sites management, participants tracking, and trial data analysis. There remains however a strong impetus to integrate these diverse applications – each supporting different but related functions of clinical trial management – at syntactic and semantic levels so as to improve clarity, consistency and correctness in specifying clinical trials, and in acquiring and analyzing clinical data. The situation becomes especially critical with the need to manage multiple clinical trials at various sites, and to facilitate meta-analyses on trials. This paper introduces a knowledge-based framework that we are building to support a suite of clinical trial management applications. Our initiative uses semantic technologies to provide a consistent basis for the applications to interoperate. We are adapting this approach to the Immune Tolerance Network (ITN), an international research consortium developing new therapeutics in immune-mediated disorders.

Ravi D. Shankar, Susana B. Martins, Martin J. O’Connor, David B. Parrish, Amar K. Das

Active Semantic Electronic Medical Record

The healthcare industry is rapidly advancing towards the widespread use of electronic medical records systems to manage the increasingly large amount of patient data and reduce medical errors. In addition to patient data there is a large amount of data describing procedures, treatments, diagnoses, drugs, insurance plans, coverage, formularies and the relationships between these data sets. While practices have benefited from the use of EMRs, infusing these essential programs with rich domain knowledge and rules can greatly enhance their performance and ability to support clinical decisions. Active Semantic Electronic Medical Record (ASEMR) application discussed here uses Semantic Web technologies to reduce medical errors, improve physician efficiency with accurate completion of patient charts, improve patient safety and satisfaction in medical practice, and improve billing due to more accurate coding. This results in practice efficiency and growth by enabling physicians to see more patients with improved care. ASEMR has been deployed and in daily use for managing all patient records at the Athens Heart Center since December 2005. This showcases an application of Semantic Web in health care, especially small clinics.

A. Sheth, S. Agrawal, J. Lathem, N. Oldham, H. Wingate, P. Yadav, K. Gallagher

Semantic Web Challenge

Foafing the Music: Bridging the Semantic Gap in Music Recommendation

In this paper we give an overview of the

Foafing the Music

system. The system uses the

Friend of a Friend

(FOAF) and

RDF Site Summary

(RSS) vocabularies for recommending music to a user, depending on the user’s musical tastes and listening habits. Music information (new album releases, podcast sessions, audio from MP3 blogs, related artists’ news and upcoming gigs) is gathered from thousands of RSS feeds.

The presented system provides music discovery by means of: user profiling (defined in the user’s FOAF description), context based information (extracted from music related RSS feeds) and content based descriptions (extracted from the audio itself), based on a common ontology (OWL DL) that describes the music domain.

The system is available at:

http://foafing-the-music.iua.upf.edu

Òscar Celma

Semantic MediaWiki

Semantic MediaWiki is an extension of

MediaWiki

– a widely used wiki-engine that also powers

Wikipedia

. Its aim is to make semantic technologies available to a broad community by smoothly integrating them with the established usage of MediaWiki. The software is already used on a number of productive installations world-wide, but the main target remains to establish “Semantic Wikipedia” as an early adopter of semantic technologies on the web. Thus usability and scalability are as important as powerful semantic features.

Markus Krötzsch, Denny Vrandečić, Max Völkel

Enabling Semantic Web Communities with DBin: An Overview

In this paper we give an overview of the DBin Semantic Web information manager. Then we describe how it enables users to create and experience the Semantic Web by exchanging RDF knowledge in P2P “topic” channels. Once sufficient information has been collected locally, rich and fast browsing of the Semantic Web becomes possible without generating external traffic or computational load. In this way each client builds and populates a ’personal semantic space’ on which user defined rules, trust metrics and filtering can be freely applied. We also discuss issues such as end user interaction and the social aggregation model induced by this novel application.

Giovanni Tummarello, Christian Morbidoni, Michele Nucci

MultimediaN E-Culture Demonstrator

The main objective of the MultimediaN E-Culture project is to demonstrate how novel semantic-web and presentation technologies can be deployed to provide better indexing and search support within large virtual collections of cultural-heritage resources. The architecture is fully based on open web standards, in particular XML, SVG, RDF/OWL and SPARQL. One basic hypothesis underlying this work is that the use of explicit background knowledge in the form of ontologies/vocabularies/thesauri is in particular useful in information retrieval in knowledge-rich domains.

Guus Schreiber, Alia Amin, Mark van Assem, Victor de Boer, Lynda Hardman, Michiel Hildebrand, Laura Hollink, Zhisheng Huang, Janneke van Kersen, Marco de Niet, Borys Omelayenko, Jacco van Ossenbruggen, Ronny Siebes, Jos Taekema, Jan Wielemaker, Bob Wielinga

A Semantic Web Services GIS Based Emergency Management Application

In an emergency situation, relevant information about involved elements is required. This information ranges from demographic data, weather forecasts and sensor data, available transportation means, presence of helpful agents, land use and cover statistics or values, etc. Moreover, the emergency management process is dynamic as it involves several definite steps, described in standard procedures from which the Emergency Officer (EO) should not depart without good reason. Multiple agencies own the relevant data and possess parts of emergency related knowledge.

Vlad Tanasescu, Alessio Gugliotta, John Domingue, Rob Davies, Leticia Gutiérrez-Villarías, Mary Rowlatt, Marc Richardson, Sandra Stinčić

Doctoral Consortium

Package-Based Description Logics – Preliminary Results

Many representative applications on the semantic web, including collaborative ontology building, partial ontology reuse, selective knowledge hiding and distributed data management, call for modular ontologies,. However, although OWL allows using owl:imports to connect multiple ontologies, its current semantics requires all involved ontologies to have a single global semantics, thus providing only a syntactical solution to modularity. As a result, there is a growing interest in modular ontology languages such as Distributed Description Logics (DDL) [7] and

${\mathcal{E}}$

-connections [8]. However, these proposals are also limited in expressivity and reasoning soundness [2,3].

Jie Bao, Doina Caragea, Vasant G. Honavar

Distributed Policy Management in Semantic Web

With the growth of Internet and the fast development of semantic web technologies, the access and usage of information will become much more easier. However, the security of information gathering for both information suppliers and demanders is a critical issue. Policies determine the ideal behaviors for web concepts. The concepts of policy, policy languages and policy ontologies must be determined, effective tools for policy definition and management must be developed.

Özgü Can, Murat Osman Ünalır

Evaluation of SPARQL Queries Using Relational Databases

Basic storage and querying of RDF data using a relational database can be done in a very simple manner. Such approach can run into trouble when used on large and complex data. This paper presents such data and several sample queries together with analysis of their performance. It also describes two possible ways of improving the performance based on this analysis.

Jiří Dokulil

Dynamic Contextual Regulations in Open Multi-agent Systems

Following software engineering approaches for the Semantic Web (SW) and also Hendler’s vision [3], I believe that the SW will not be a unique large complex Web, but it will be composed, mainly, of several small contextualized domain applications. These domain applications will be, in my opinion, Multi-Agent Systems (MAS) [5]. MAS have emerged as a promising approach for the development of information systems, which are constituted of cooperative goal-oriented problem-solving entities (named agents) [6]. Agent-based computing is rapidly emerging as a powerful technology for the development of distributed and complex information systems.

Carolina Howard Felicíssimo

From Typed-Functional Semantic Web Services to Proofs

Web standards are currently seen as increasingly fragmented between colloquial XML, the Semantic Web, and Web Services. Semantic Web Services, while an immensely productive area of research, has yet to reach in-roads and a large user-base.We propose a minimalist, yet powerful, unifying framework built upon solid computational and philosophical foundations: Web Services are

functions

, ontologies are

types

, and therefore Semantic Web Services are

typed functions

Harry Halpin

Towards a Usable Group Editor for Ontologies

Ontologies represent a shared understanding of some domain of interest. Therefore, tools to develop ontologies have to support this sharing in some way. However, current tools lack support of this important aspect, if they tackle it at all. Beyond this, each interactive system cannot be limited to its utility but must also make sure that this is provided in a usable way. However, current ontology editors mostly make the impression of research prototypes, thus not caring too much about this aspect. These two problems are crucial: If we don’t support collaborative ontology development, produced ontologies will always lack being product of a social process. Also if the tool support lacks usability, the ontology engineering community cannot expect to spread their ideas to a wider non-expert audience. Therefore the PhD thesis in process tries to tackle these problems and to advance the state of the art. It combines these two aspects as they intervene with each other thus making an integrated approach more promising. The improvements will be thoroughly evaluated with regard to both utility and usability.

Jan Henke

Talking to the Semantic Web – Query Interfaces to Ontologies for the Casual User

The Semantic Web presents the vision of a dynamically growing knowledge base that should allow users to draw on and combine distributed information sources specified in languages based on formal logic. Common users, however, were shown to have problems even with the simplest Boolean expressions [4]; the use of the logic formalism underlying the Semantic Web is beyond their understanding. So how can we bridge the gap between the logic-based SemanticWeb and real-world users, who are ill at ease and, oftentimes, unable to use formal logic concepts?

An often proposed solution to address this problem is the use of natural language interfaces (NLIs). Most NLIs, however, only understand some subset of natural language (NL), but often suggest full understanding, which leads to confusing interaction with users [1]. This mismatch between the users’ expectations and the capabilities of a NLI is called the habitability problem [5]. Furthermore, the development of NL tools requires computationally intensive algorithms relying on large amounts of background knowledge making the tools highly domaindependent and inapplicable to new domains or applications [1].

Esther Kaufmann

Changing Ontology Breaks Queries

Updating an ontology that is in use may result in inconsistencies between the ontology and the knowledge base, dependent ontologies and applications/services. Current research concentrates on the creation of ontologies and how to manage ontology changes in terms of mapping ontology versions and keeping consistent with the instances. Very little work investigated controlling the impact on dependent applications/services; which is the aim of the system presented in this paper. The approach we propose is to make use of ontology change logs to analyse incoming RDQL queries and amend them as necessary. Revised queries can then be used to query the ontology and knowledge base as requested by the applications and services. We describe our prototype system and discuss related problems and future directions.

General Terms.

Ontology Management.

Yaozhong Liang, Harith Alani, Nigel Shadbolt

Towards a Global Scale Semantic Web

By transforming the Web from a collection of documents to a collection of semantically rich data sources, the Semantic Web promises an unprecedented benefit of a global knowledge infrastructure. My PhD research will try to help make that happen at a global scale by doing the following proposed work:

1. Design and implement a highly scalable Semantic Web knowledge base system by exploiting modern relational database technologies combined with the state-of-the-art description logics reasoners.

2. Build and empirically verify a framework that handles ontology evolution and the reuse of data on top of the perspective theory [1]. This framework should be able to relieve inconsistency and heterogeneity in a global scale Semantic Web.

3. Systematically evaluate the resulting system’s capability and scalability in processing, integrating and querying data under the real world environment.

Zhengxiang Pan

Schema Mappings for the Web

Current solutions to data integration present many inconvenients. The bottleneck seems to be the impossible automation of the whole process. Human intervention will always be needed at some point, and the problem is to find where and how this intervention can be performed the most efficiently. In traditional mediator approaches the global schema and mappings between the global and local schemas are designed by hand. This is not the way to go if we want to see emerging a ”semantic web”. The collaborative development of one-to-one mappings driven by application needs has much more chance to rapidly create a network of schemas. We propose to build on top on this view, shifting the human intervention from the global schema elaboration to the one-to-one mapping between local schemas. This repartition of efforts associated with publication of the local mappings is the only solution if we want to see the deep web rising up and the semantic web vision becoming true. I propose to contribute to this paradigm at two levels. First, mappings between heterogeneous schemas must be universally understandable, as schema descriptions may be of various natures (XML, relational, Ontologies, Semi structured, ...). An independant language able to model correspondences between two schemas is then needed. This language also serves as an exchange format for matching algorithms as well as graphical mapping tools. A global schema is still necessary in order to provide a unified view over resources. We propose in the following to study how from a network of related schemas can we extract a global schema together with the associated mapping rules.

François Scharffe

Triple Space Computing for Semantic Web Services – A PhD Roadmap

This thesis will address how to enable Triple Space Computing as a communication paradigm for Semantic Web Services. Currently, Semantic Web Services are following a message based communication paradigm. Triple Space Computing is envisioned as communication and coordination paradigm for Semantic Web Services which is an extension of tuple space computing to support RDF and then use it for communication based on the principle of persistent publication and read of data. Web Service Modeling Ontology (WSMO) is our conceptual model for Semantic Web Services. Web Service Execution Environment (WSMX) is one of the reference implementations of the WSMO conceptual model. The paper presents an overview of technical insights about integration of WSMX with Triple Space Computing and proposes that how WSMX can use Triple Space computing for its communication and coordination in terms of dynamic components management, external communication management, resource management and coordination of different interconnected WSMXs.

M. Omair Shafiq

Toward Making Online Biological Data Machine Understandable

Huge amounts of biological data are available online. To obtain needed information, biologists sometimes have to traverse different Web sources and combine their data manually. We introduce a system that can automatically interpret the structures of heterogeneous Web pages, extract useful information from them, and also transform them to machine-understandable pages for the Semantic Web, so that a Semantic Web agent can automatically find the information of interest.

Cui Tao

KeynoteAbstracts

Where the Social Web Meets the Semantic Web

The Semantic Web is an ecosystem of interaction among computer systems. The social web is an ecosystem of conversation among people. Both are enabled by conventions for layered services and data exchange. Both are driven by human-generated content and made scalable by machine-readable data. Yet there is a popular misconception that the two worlds are alternative, opposing ideologies about how the web ought to be. Folksonomy vs. ontology. Practical vs. formalistic. Humans vs. machines. This is nonsense, and it is time to embrace a unified view. I subscribe to the vision of the Semantic Web as a substrate for collective intelligence. The best shot we have of collective intelligence in our lifetimes is large, distributed human-computer systems. The best way to get there is to harness the ”people power” of the Web with the techniques of the Semantic Web. In this presentation I will show several ways that this can be, and is, happening.

Tom Gruber

The Semantic Web: Suppliers and Customers

The notion of the Semantic Web can be coined as a Web of data when bringing database content to the Web or as a Web of enriched human-readable content when encoding the semantics of web-resources in a machine-interpretable form.

It has been clear from the beginning that realizing the Semantic Web vision will require interdisciplinary research. At this the fifth ISWC, it is time to re-examine the extent to which interdisciplinary work has played and can play a role in Semantic Web research, and even how Semantic Web research can contribute to other disciplines. Core Semantic Web research has drawn from various disciplines, such as knowledge representation and formal ontologies, reusing and further developing their techniques in a new context.

However, there are several other disciplines that explore research issues very relevant to the Semantic Web in different guises and to differing extents. As a community, we can benefit by also recognizing and drawing from the research in these different disciplines. On the other hand, Semantic Web research also has much to contribute to these disciplines and communities. For example, the Semantic Web offers scenario that often ask for unprecedented scalability of techniques from other disciplines. Throughout the talk, I will illustrate these points through examples from disciplines such as natural language processing, databases, software engineering and automated reasoning.

The industry also has a major role to play in the realization of the Semantic Web vision. I will therefore additionally examine the added value of Semantic Web technologies for commercial applications and discuss issues that should be addressed for broadening the market for Semantic Web technologies.

Rudi Studer

The Semantic Web and Networked Governance: Promise and Challenges

The virtual state is a metaphor meant to draw attention to the structures and processes of the state that are becoming increasingly aligned with the structures and processes of the semantic web. Semantic Web researchers understand the potential for information sharing, enhanced search, improved collaboration, innovation, and other direct implications of contemporary informatics. Yet many of the broader democratic and governmental implications of increasingly networked governance remain elusive, even in the world of public policy and politics.

Governments, not businesses, remain the major information processing entities in the world. But where do they stand as knowledge managers, bridge builders and creators? As they strive to become not simply information-based but also knowledge-creating organizations, public agencies and institutions face a set of striking challenges. These include threats to privacy, to intellectual property, to identity, and to traditional processes of review and accountability. From the perspective of the organization of government, what are some of the key challenges faced by governments as they seek to become networked? What best practices are emerging globally? And in the networked world that is rapidly emerging and becoming institutionalized, how can public, private and nonprofit sectors learn from one another?

Jane E. Fountain

Backmatter

Title: The Semantic Web - ISWC 2006
Editors: Isabel Cruz
Stefan Decker
Dean Allemang
Chris Preist
Daniel Schwabe
Peter Mika
Mike Uschold
Lora M. Aroyo
Publisher: Springer Berlin Heidelberg
Electronic ISBN: 978-3-540-49055-5
Print ISBN: 978-3-540-49029-6
DOI: https://doi.org/10.1007/11926078

Springer Professional

About this book

Table of Contents

Frontmatter

Research Track

Ranking Ontologies with AKTiveRank

Three Semantics for Distributed Systems and Their Relations with Alignment Composition

Semantics and Complexity of SPARQL

Ontology-Driven Automatic Entity Disambiguation in Unstructured Text

Augmenting Navigation for Collaborative Tagging with Emergent Semantics

On the Semantics of Linking and Importing in Modular Ontologies

RS2D: Fast Adaptive Search for Semantic Web Services in Unstructured P2P Networks

SADIe: Semantic Annotation for Accessibility

Automatic Annotation of Web Services Based on Workflow Definitions

A Constraint-Based Approach to Horizontal Web Service Composition

GINO – A Guided Input Natural Language Ontology Editor

Fresnel: A Browser-Independent Presentation Vocabulary for RDF

A Software Engineering Approach to Design and Development of Semantic Web Service Applications

A Model Driven Approach for Building OWL DL and OWL Full Ontologies

IRS-III: A Broker for Semantic Web Services Based Applications

Provenance Explorer – Customized Provenance Views Using Semantic Inferencing

On How to Perform a Gold Standard Based Evaluation of Ontology Learning

Characterizing the Semantic Web on the Web

MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data

/facet: A Browser for Heterogeneous Semantic Web Repositories

Using Ontologies for Extracting Product Features from Web Pages

Block Matching for Ontologies

A Relaxed Approach to RDF Querying

Mining Information for Instance Unification

The Summary Abox: Cutting Ontologies Down to Size

Semantic Metadata Generation for Large Scientific Workflows

Reaching Agreement over Ontology Alignments

A Formal Model for Semantic Web Service Composition

Evaluating Conjunctive Triple Pattern Queries over Large Structured Overlay Networks

PowerMap: Mapping the Real Semantic Web on the Fly

Ontology-Driven Information Extraction with OntoSyphon

Ontology Query Answering on Databases

Formal Model for Ontology Mapping Creation

A Semantic Context-Aware Access Control Framework for Secure Collaborations in Pervasive Computing Environments

Extracting Relations in Social Networks from the Web Using Similarity Between Collective Contexts

Can OWL and Logic Programming Live Together Happily Ever After?

Innovation Detection Based on User-Interest Ontology of Blog Community

Modeling Social Attitudes on the Web

A Framework for Ontology Evolution in Collaborative Environments

Extending Faceted Navigation for RDF Data

Reducing the Inferred Type Statements with Individual Grouping Constructs

A Framework for Schema-Driven Relationship Discovery from Unstructured Text

Web Service Composition Via Generic Procedures and Customizing User Preferences

Querying the Semantic Web with Preferences

ONTOCOM: A Cost Estimation Model for Ontology Engineering

Tree-Structured Conditional Random Fields for Semantic Annotation

Framework for an Automated Comparison of Description Logic Reasoners

Integrating and Querying Parallel Leaf Shape Descriptions

A Survey of the Web Ontology Landscape

CropCircles: Topology Sensitive Visualization of OWL Class Hierarchies

Towards Knowledge Acquisition from Information Extraction

A Method for Learning Part-Whole Relations

Semantic Web in Use

OntoWiki – A Tool for Social, Semantic Collaboration

Towards a Semantic Web of Relational Databases: A Practical Semantic Toolkit and an In-Use Case from Traditional Chinese Medicine

Information Integration Via an End-to-End Distributed Semantic Web System

NEWS: Bringing Semantic Web Technologies into News Agencies

Semantically-Enabled Large-Scale Science Data Repositories

Construction and Use of Role-Ontology for Task-Based Service Navigation System

Enabling an Online Community for Sharing Oral Medicine Cases Using Semantic Web Technologies

EKOSS: A Knowledge-User Centered Approach to Knowledge Sharing, Discovery, and Integration on the Semantic Web

Ontogator — A Semantic View-Based Search Engine Service for Web Applications

Explaining Conclusions from Diverse Knowledge Sources

A Mixed Initiative Semantic Web Framework for Process Composition

Semantic Desktop 2.0: The Gnowsis Experience

Towards Semantic Interoperability in a Clinical Trials Management System

Active Semantic Electronic Medical Record

Semantic Web Challenge

Foafing the Music: Bridging the Semantic Gap in Music Recommendation

Semantic MediaWiki

Enabling Semantic Web Communities with DBin: An Overview

MultimediaN E-Culture Demonstrator

A Semantic Web Services GIS Based Emergency Management Application

Doctoral Consortium

Package-Based Description Logics – Preliminary Results