skip to main content
10.1145/543613acmconferencesBook PagePublication PagespodsConference Proceedingsconference-collections
PODS '02: Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
ACM2002 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
SIGMOD/PODS02: International Conference on Management of Data and Symposium on Principles Database and Systems Madison Wisconsin June 3 - 5, 2002
ISBN:
978-1-58113-507-7
Published:
03 June 2002
Sponsors:

Bibliometrics
Skip Abstract Section
Abstract

This volume contains the proceedings of the Twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2002), held at Madison, Wisconsin on June 3-5, 2002 in conjunction with the 2002 ACM SIGMOD International Conference on Management of Data. It consists of a paper based on the invited talk by Rajeev Motwani, two papers based on the invited tutorials by Maurizio Lenzerini and Dennis Shasha, and 24 contributed papers that were selected by the program committee for presentation at the symposium.The contributed papers were selected from 109 submissions. Most of the papers are "extended abstracts" and are preliminary reports on work in progress. While they have been read by program committee members, they have not been formally refereed. It is expected that much of the research described in these papers will be publihsed in detail in computer science journals.The program committee selected "Monadic Datalog over Trees and the Expressive Power of Languages for Web Information Extraction" by Georg Gottlob and Christoph Koch for the PODS 2002 Best Paper Award and "From Discrepancy to Declustering: Near optimal multidimensional declustering strategies for range queries" by Chung-Min Chen and Christine T. Cheng for the PODS 2002 Best Newcomer Award. Warmest congratulations to the authors of these papers.

Skip Table Of Content Section
SESSION: PODS invited talk
Article
Models and issues in data stream systems

In this overview paper we motivate the need for and research issues arising from a new model of data processing. In this model, data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data ...

SESSION: Research session 1: award winning papers
Article
Monadic datalog and the expressive power of languages for web information extraction

Research on information extraction from Web pages (wrapping) has seen much activity in recent times (particularly systems implementations), but little work has been done on formally studying the expressiveness of the formalisms proposed or on the ...

Article
From discrepancy to declustering: near-optimal multidimensional declustering strategies for range queries

Declustering schemes allocate data blocks among multiple disks to enable parallel retrieval. Given a declustering scheme D, its response time with respect to a query Q, rt(Q), is defined to be the maximum number of disk blocks of the query stored by the ...

SESSION: Invited tutorial 1
Article
Algorithmics and applications of tree and graph searching

Modern search engines answer keyword-based queries extremely efficiently. The impressive speed is due to clever inverted index structures, caching, a domain-independent knowledge of strings, and thousands of machines. Several research efforts have ...

SESSION: Research sessions 2 and 3: information processing on WWW and XML
Article
Validating streaming XML documents

This paper investigates the on-line validation of streaming XML documents with respect to a DTD, under memory constraints. We first consider validation using constant memory, formalized by a finite-state automaton (FSA). We examine two flavors of the ...

Article
Containment and equivalence for an XPath fragment

XPath is a simple language for navigating an XML document and selecting a set of element nodes. XPath expressions are used to query XML data, describe key constraints, express transformations, and reference elements in remote documents. This paper ...

Article
On the power of walking for querying tree-structured data

XSLT is the prime example of an XML query language based on tree-walking. Indeed, stripped down, XSLT is just a tree-walking tree-transducer equipped with registers and look-ahead. Motivated by this connection, we want to pinpoint the computational ...

Article
A normal form for XML documents

This paper takes a first step towards the design and normalization theory for XML documents. We show that, like relational databases, XML documents may contain redundant information, and may be prone to update anomalies. Furthermore, such problems are ...

Article
Distributed computation of web queries using automata

We introduce and investigate a distributed computation model for querying the Web. Web queries are computed by interacting automata running at different nodes in the Web. The automata which we are concerned with can be viewed as register automata ...

SESSION: Research session 4: query processing and optimization I
Article
Conjunctive selection conditions in main memory

We consider the fundamental operation of applying a conjunction of selection conditions to a set of records. With large main memories available cheaply, systems may choose to keep the data entirely in main memory, in order to improve query and/or update ...

Article
Efficient aggregation over objects with extent

We examine the problem of efficiently computing sum/count/avg aggregates over objects with non-zero extent. Recent work on computing multi-dimensional aggregates has concentrated on objects with zero extent (points) on a multi-dimensional grid, or one-...

Article
How to evaluate multiple range-sum queries progressively

Users of decision support system typically submit batches of range-sum queries simultaneously rather than issuing individual, unrelated queries. We propose a wavelet based technique that exploits T/O sharing across a query batch to evaluate the set of ...

SESSION: Research session 5: views and warehousing
Article
On the computation of relational view complements

Views as a means to describe parts of a given data collection play an important role in many database applications. In dynamic environments, where data is updated, not only information provided by views, but also information provided by data sources but ...

Article
On propagation of deletions and annotations through views

We study two classes of view update problems in relational databases. We are given a source database S, a monotone query Q, and the view Q(S) generated by the query. The first problem that we consider is the classical view deletion problem where we wish ...

Article
The view-selection problem has an exponential-time lower bound for conjunctive queries and views

The view-selection problem is to design and materialize a set of views over a database schema, such that the choice of views minimizes the cost of evaluating the selected workload of queries, and the combined size of the materialized views does not ...

SESSION: Research session 6: OLAP and constraints
Article
OLAP dimension constraints

In multidimensional data models intended for online analytic processing (OLAP), data are viewed as points in a multidimensional space. Each dimension has structure, described by a directed graph of categories, a set of members for each category, and a ...

Article
Fast algorithms for hierarchical range histogram construction

Data Warehousing and OLAP applications typically view data an having multiple logical dimensions (e.g., product, location) with natural hierarchies defined on each dimension. OLAP queries usually involve hierarchical selections on some of the dimensions,...

Article
On moving object queries: (extended abstract)

Database applications for moving objects pose new challenges in modeling, querying, and maintenance of objects whose locations are rapidly changing over time. Previous work on modeling and querying spatio-temporal databases and constraint databases ...

SESSION: Research session 7: queries and views
Article
Knowledge compilation = query rewriting + view synthesis

In Knowledge Compilation (KC) an intractable deduction problem KBf is split into two phases: 1) KB is preprocessed, thus obtaining a data structure DKB; 2) the problem is efficiently solved using DKB and f. Our goal is to study KC in the context of ...

Article
Answering queries using views with arithmetic comparisons

We consider the problem of answering queries using views, where queries and views are conjunctive queries with arithmetic comparisons (CQACs) over dense orders. Previous work only considered limited variants of this problem, without giving a complete ...

Article
Characterizing memory requirements for queries over continuous data streams

We consider conjunctive queries with arithmetic comparisons over multiple continuous data streams. We specify an algorithm for determining whether or not a query can be evaluated using a bounded amount of memory for all possible instances of the data ...

SESSION: Invited tutorial 2
Article
Data integration: a theoretical perspective

Data integration is the problem of combining data residing at different sources, and providing the user with a unified view of these data. The problem of designing data integration systems is important in current real world applications, and is ...

SESSION: Research session 8: semistructured data and XML
Article
Lossless regular views

If the only information we have on a certain database is through a set of views, the question arises of whether this is sufficient to answer completely a given query. We say that the set of views is lossless with respect to the query, if, no matter what ...

Article
On verifying consistency of XML specifications

XML specifications often consist of a type definition (typically, a DTD) and a set of integrity constraints. It has been shown previously that such specifications can be inconsistent, and thus it is often desirable to check consistency at compile-time. ...

Article
Labeling dynamic XML trees

We present algorithms to label the nodes of an XML tree which is subject to insertions and deletions of nodes. The labeling is done such that (1) we label each node immediately when it is inserted and this label remains unchanged, and (2) from a pair of ...

SESSION: Research session 9: query processing and optimization II
Article
On the complexity of approximate query optimization

In this work, we study the complexity of the problem of approximate query optimization. We show that, for any δ > 0, the problem of finding a join order sequence whose cost is within a factor 2Θ(log1-δ(K)) of K, where K is the cost of the optimal join ...

Article
Least expected cost query optimization: what can we expect?

A standard assumption in the database query optimization literature is that it suffices to optimize for the "typical" case---that is, the case in which various parameters (e.g., the amount of available memory, the selectivities of predicates, etc.) take ...

Contributors
  • IBM Research
  • Higher Normal School - PSL
  • University of California, Santa Cruz

Recommendations

Acceptance Rates

PODS '02 Paper Acceptance Rate24of109submissions,22%Overall Acceptance Rate642of2,707submissions,24%
YearSubmittedAcceptedRate
PODS '19872933%
PODS '171012929%
PODS '16943133%
PODS '15802531%
PODS '14672233%
PODS '13972425%
PODS '121012626%
PODS '111132522%
PODS '101132724%
PODS '09972627%
PODS '081592818%
PODS '071872815%
PODS '061853519%
PODS '031362720%
PODS '021092422%
PODS '01992626%
PODS '001192622%
PODS '991163228%
PODS '981192824%
PODS '971182319%
PODS '96842226%
PODS '95942527%
PODS '941172824%
PODS '931152623%
Overall2,70764224%