skip to main content
10.1145/1317353acmconferencesBook PagePublication PagescikmConference Proceedingsconference-collections
CIMS '07: Proceedings of the ACM first workshop on CyberInfrastructure: information management in eScience
ACM2007 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
CIKM07: Conference on Information and Knowledge Management Lisbon Portugal 9 November 2007
ISBN:
978-1-59593-831-2
Published:
09 November 2007
Sponsors:
Next Conference
Bibliometrics
Skip Abstract Section
Abstract

Increasingly, with the abundance of data, science requires robust, new cyberinfrastructure. Cyberinfrastructure has focussed on the immediate demands of experimental and analytical work using large scale data and processing infrastructure, but the problems of recording, preserving, accessing and reusing the resultant data scientific outputs have largely been overlooked in the immediate gratification that comes from the ability to start undertaking Big Science. In the recent years, escience may have received a lot of attention, but the problem of long term information curation and management that it leaves in its wake has not.

CIMS'07 is a workshop that aims to facilitate discussions and exchange off ideas among computer scientists working on cyberinfrastructure issues for scientific applications. We have accepted a set of exciting papers to be presented and discussed at the workshop. During the workshop, we hope to be able to redefine this new field, identify problems associated with creating computational infrastructure and more importantly on the curation, management, and preservation of the data for scientific applications, learn about exciting directions of research and propose some solutions to some problems that our participants have dealt with before. The workshop will discuss papers dealing with data modeling, data preservation, data and metadata visualization, tool support for scientific applications, and data mining tools for various cyberinfrastructure component.

Skip Table Of Content Section
SESSION: eScience applications
research-article
A volcano erupts: semantically mediated integration of heterogeneous volcanic and atmospheric data

We present a research effort into the application of semantic web methods and technologies to address the challenging problem of integrating heterogeneous volcanic and atmospheric data in support of assessing the atmospheric effects of a volcanic ...

research-article
ChemXSeer: a digital library and data repository for chemical kinetics

In this paper, we describe the ChemXSeer system that hosts data and scholarly articles related to chemical kinetics. Domain scientists have different needs that are not served by general search engines. ChemXSeer enables chemists (and others) to search ...

research-article
Towards a SOA infrastructure for statistically analysing public health data

To respond to the need for interoperable information systems in public health, several proposals based on XML-related technologies are currently available. For instance, the CDA [8] is an architecture developed by the HL7 organization for representing ...

research-article
Management and preservation of research data with iRODS

This paper presents first steps towards implementing a data layer to support a semi-automated preservation management system for research data in the arts and humanities. We suggest to use e-Science technology and grid middleware to implement a ...

SESSION: Models & tools for eSciences
research-article
A conceptual modeling and execution framework for process based scientific applications

In recent years, scientists are dealing more and more with data intensive and complex applications. Many scientific workflow systems emerged which adapt technology and methods stemming from the workflow management area and that should support scientists ...

research-article
Metadata management for federated databases

A federated database consists of several loosely integrated databases, where each database may contain hundreds of tables and thousands of columns,interrelated by complex foreign key relationships. In general, there exists a lot of semistructured data ...

research-article
RDF data exploration and visualization

We present Paged Graph Visualization (PGV), a new semi-autonomous tool for RDF data exploration and visualization. PGV consists of two main components: a) the "PGV explorer" and b) the "RDF pager" module utilizing BRAHMS, our high per-formance main-...

SESSION: Information management and retrieval
research-article
SLOQUE: slot-based query expansion for complex questions

Searching answers to complex questions is a challenging IR task. In this paper, we examine the use of query templates with semantic slots to formulate slot-based queries. These queries have query terms assigned to entity and relationship slots. We ...

research-article
Effective ranked conceptual retrieval

We describe the design and implementation of the SMARTER prototype system for collecting, structuring and searching information derived from full text ebooks available on the World Wide Web [4]. Standard methods of parsing and extended Boolean schemes ...

SESSION: Databases and data mining
research-article
Measuring referential integrity in distributed databases

Distributed relational databases are used by different organizations located at multiple sites that work together on common projects. In this article, we focus on distributed relational databases with incomplete and inconsistent content. We propose to ...

research-article
Perfect hash functions for large dictionaries

We describe a new practical algorithm for finding perfect hash functions with no specification space at all, suitable for key sets ranging in size from small to very large. The method is able to find perfect hash functions for various sizes of key sets ...

Contributors
  • Pennsylvania State University
  • Pennsylvania State University
  • University of Southampton

Recommendations