User-oriented smart-cache for the Web: what you seek is what you get!

Authors:
Zoé Lacroix

Institute for Research in Cognitive Science, University of Pennsylvania

Institute for Research in Cognitive Science, University of Pennsylvania
View Profile

,
Arnaud Sahuguet

Computer and Information Science, University of Pennsylvania

Computer and Information Science, University of Pennsylvania
View Profile

,
Raman Chandrasekar

Institute for Research in Cognitive Science & Center for the Advanced Study of India, University of Pennsylvania

Institute for Research in Cognitive Science & Center for the Advanced Study of India, University of Pennsylvania
View Profile

Authors Info & Claims

ACM SIGMOD Record Volume 27 Issue 2June 1998pp 572–574https://doi.org/10.1145/276305.276385

Published:01 June 1998Publication History

ACM SIGMOD Record

Abstract

Standard database approaches to querying information on the Web focus on the source(s) and provide a query language based on a given predefined organization (schema) of the data: this is the source-driven approach. However, can the Web be seen as a standard database? There is no super-user in charge of monitoring the source(s) (the data is constantly updated), there is no homogeneous structure (no common explicit structure thus), the Web itself never stops growing, etc. For these reasons, we believe that the source-driven standard approach is not suitable to the Web.

As an alternative, we propose a user-oriented approach based on the idea that the schema is a posteriori expressed by the user's needs when asking a query. Given a user query, AKIRA (Agentive Knowledge-based Information Retrieval Architecture) [6] extracts a target structure (structure expressed in the query) and uses standard information retrieval and filtering techniques to access potentially relevant documents.

The user-oriented paradigm means that the structure through which the data is viewed does not come from the source but is extracted from the user query. When a user asks a query, the relevant information is retrieved from the Web and stored as is in a cache. Then the information is extracted from the raw data using computational linguistic techniques. The AKIRA cache (smart-cache) represents these extracted layers of meta-information on top of the raw data. The smart-cache is an object-oriented database whose schema is inferred from the user's target structure. It is designed on demand through a library of concepts that can be assembled together to match concepts and meta-concepts required in the user's query. The smart cache can be seen as a view of the Web.

To the best of our knowledge, AKIRA is the only system that uses information retrieval and extraction integrated with database techniques to provide maximum flexibility to the user and offer transparent access to the content of Web documents.

References

1 G. Aro('ena and A. Mendelzon. WebOQL: Restructuring D()cuments, Databases and Webs. In Proceedings of the International Confl~rcnce on Data Engineering, Orlando, February 1998. Google ScholarDigital Library
2 i~. Chandrasekar and B. Srinivas. Using Syntactic In/brmarion in Document Filtering: A Comparative Study ~)t P~rt-ot:speech Tagging and Supertagging. In In Proc~:t:ding.~' of RIA 0'97, Montreal, June 1997.Google Scholar
3 M. Fernandez, D. Florescu, A. Levy, and D. Suciu. A Query Language and Processor for a Web-Site ManageliD{tilt System. hi A CM SIGMOD Workshop on Management of Semistructured Data, Tucson, Arizona, May 1997. Google ScholarDigital Library
4 R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In Proc. of Intl. Co@ on Very Large Data Bases, Delphi, Greece, August 1997. to appear. Google ScholarDigital Library
5 A.K. Joshi and B. Srinivas. Disambiguation of Super Parts of Speech (or Supertags): Almost Parsing. In Proc~e.di'ngs of tile. 17th International Conference on Computational Linguistics (COLING '94), Kyoto, Japan, August 1994. Google ScholarDigital Library
6 Z. Lacroix, A. Sahuguet, and R. Chandrasekar. Informati(m Extraction & Database techniques: a user-oriented a l)proach to query the Web. In 10th Conference on Advanced In}brmation Systems Engineering (CA iSE'98), Pisa, Italy, June 1998. Google ScholarDigital Library
7 A-M. Vercoustre, J. Dell'Oro, and B. Hills. Reuse of hdbrmation through virtual documents, in Proceedings o.f the'. 2''~ Australian Document Computing Symposium, Melbourne, Australia, April 1997.Google Scholar

Index Terms

User-oriented smart-cache for the Web: what you seek is what you get!
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms
      1. Web-based interaction
2. Information systems
  1. World Wide Web
    1. Web applications
    2. Web services

Recommendations

User-oriented smart-cache for the Web: what you seek is what you get!
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data

Standard database approaches to querying information on the Web focus on the source(s) and provide a query language based on a given predefined organization (schema) of the data: this is the source-driven approach. However, can the Web be seen as a ...
Read More
User-oriented evaluation methods for information retrieval: a case study based on conceptual models for query expansion
Exploring artificial intelligence in the new millennium

This chapter discusses evaluation methods based on the use of nondichotomous relevance judgments in information retrieval (IR) experiments. It is argued that evaluation methods should credit IR methods for their ability to retrieve highly relevant ...
Read More
User-oriented evaluation of color descriptors for web image retrieval
ECDL'10: Proceedings of the 14th European conference on Research and advanced technology for digital libraries

This paper proposes a methodology for effectiveness evaluation in content-based image retrieval systems. The methodology is based on the opinion of real users. This paper also presents the results of using this methodology to evaluate color descriptors ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM SIGMOD Record Volume 27, Issue 2
June 1998
595 pages
ISSN:0163-5808
DOI:10.1145/276305
Chairmen:
Laura Haas
IBM Almaden Research Center, San Jose, CA
,
Pamela Drew
Boeing Co.
,
Editor:
Ashutosh Tiwary
Boeing Co.; and Univ. of Washington, Seattle
Issue’s Table of Contents
SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data
June 1998
599 pages
ISBN:0897919955
DOI:10.1145/276304
Chairmen:
Laura Haas
IBM AlmadenResearch Center, San Jose, CA
,
Pamela Drew
Boeing Co.
,
Editors:
Ashutosh Tiwary
Boeing Co.; and Univ. of Washington, Seattle
,
Michael Franklin
Univ. of Maryland, College Park
Copyright © 1998 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 June 1998
Check for updates
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 5
  Total Citations
  View Citations
- 296
  Total Downloads
- Downloads (Last 12 months)23
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

User-oriented smart-cache for the Web: what you seek is what you get!

ACM SIGMOD Record

Abstract

References

Cited By

Index Terms

Recommendations

User-oriented smart-cache for the Web: what you seek is what you get!

User-oriented evaluation methods for information retrieval: a case study based on conceptual models for query expansion

User-oriented evaluation of color descriptors for web image retrieval