skip to main content
10.1145/253168.253189acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free Access

S-tree: a dynamic balanced signature index for office retrieval

Authors Info & Claims
Published:01 September 1986Publication History

ABSTRACT

The signature approach is an access method for partial-match retrieval which meets many requirements of an office environment. Signatures are hash coded binary words derived from objects stored in the data base. They serve as a filter for retrieval in order to discard a large number of nonqualifying objects. In an indexed signature method the signatures of objects stored on a single page are used to form a signature for that page. In this paper we describe a new technique of indexed signatures which combines the dynamic balancing of B-trees with the signature approach. The main problem of appropriate splitting is solved in a heuristic way. Operations are described and a simple performance analysis is given. The analysis and some experimental results indicate a considerable performance gain. Moreover, the new S-tree approach supports a clustering on a signature basis. Further remarks on adaptability complete this work.

References

  1. BAYE72.Bayer, R., McCreight, E.M.: Organia#tion and Maintenance of Large Ordered Indexes, Acta Informatica 1972, pp. 173- 189.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. CHRI84.Christodoulakis, S. et al.: Development of a Multimedia information System for an Office Environment, Proc. VLDB 1984, pp. 261- 271. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. DADA83.Dadam, P., Pistor, P., Schek, H.-J.: A Predicate oriented Locking Approach for Integrated Information Systems, in: Mason, R.E.A. (ed.): Information Processing 83, Proc. IFIP 1983, pp. 763- 768.Google ScholarGoogle Scholar
  4. DADA86.P. Dadam, K. Kuespert, F. Andersen, H. Blanken, R. Erbe, J. Guenauer, V. Lure, P. Pistor, G. Walch: A DBMS Prototype to Support Extended NF2 Relations: An/ntegrated View on Flat Tables and Hierarchies, accepted for SIGMOD 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. DEPP85.Deppisch, U., Obermeit, V., Paul, H.-B., Schek, H.-J., SchoU, M.H., Weikum, G.: The Storage Subsystem of a Data Base Kernel System (in German), Proc. GI Conf. on Database Systems for Office Automation, Engineering and Scientific Applications 1985, pp. 421 -. 440; English version.available as Technical Report DVSI-1985-T1, TU Darmstadt, 1985. "Google ScholarGoogle Scholar
  6. FALO84.Fedoutsos, C., Christodoulakis, S.: Signature Files: An Access Method for Documents and its Analytical Performance Evaluation, TOOIS 1984, pp. 267- 288. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. FALO85a.Faloutsos, C.: Access Methods for Text, Computing Surveys 1985, pp. 49- 74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. FALO85b.Fedoutsos, C.: Signature files: Design and per~ formance comparison of some signature extraction methods, Proc. SIGMOD 1985, pp. 63 - 82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. FALO85c.Faloutsos, C., Christodoulakis, S.: Design of a Signature File Method that Accounts for Non-Unlform Occurence and Query Frequencies, Proc. VLDB 1985, pp. 165- 170.Google ScholarGoogle Scholar
  10. GIBB83.Gibbs, S., Tsichritsis, D.: A Data Modelling Approach for Office Information Systems, TOOIS 1983, pp. 299- 319. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. GUTT84.Guttman, A.: R-T#eee: A dynamic index structure for spatial searching, Proc. SIG- MOD 1984, pp. 47- 57. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. HARR71.Harrison, M.C.: Implementation of the Substring Test by Hashing, CACM 1971, pp. 777 - 779. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. OREN84.Orenstein, J.A., Merrett, T.H.: A Class of Data Structures for Associative Searching, Proc. PODS 1984, pp. 181- 190. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. PFAL80.Pfedts, J.L., Berman, W.J., Cagley, E.M.: Partial-Match Retrieval using Indexed Descriptor Files, CACM 1980, pp. 522- 528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. PRAB83.Prabhakax, T.V., Saha#abuddhe, H.V.: Signature Trees- A Data Structure for Index Organisatlon, Proc. IEEE Conf. on Systems, Man and Cybernetics 1983, pp. 1145- 1147.Google ScholarGoogle Scholar
  16. RABI84.Rabitti, F., Zizka, J.: Evaluation of Acces Methods to Text Documents in Office Systems, in: Rijsbergen, van, C.J. (ed.): Proc. Research and Development in Information Retrieval 1984, pp. 21 - 40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. RABI85.Rabitti, F.: A Model for Multimedia Documents, in: TSIC85a, pp. 227- 250.Google ScholarGoogle Scholar
  18. RIVE76.Rivest, R.L.: Partial-Match Retrieval Algorithms, SIAM Journal of Computing 1976, pp. 19- 50.Google ScholarGoogle ScholarCross RefCross Ref
  19. RIJS79.Rijsbergen, van, C.J.: Information Retrieval, 2rid ed., London: Butherworth 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. ROBE79.Roberts, C.S.: Partial-Match Retrieved via the Method of Superimposed Codes, Proc. IEEE 1979, pp. 1624- 1642.Google ScholarGoogle ScholarCross RefCross Ref
  21. SACK83.Sacks-Davis, R., Ramamohanarao, K.: A twolevel superimposed coding scheme for Partial Match Retrieval, information Systems 1983, pp. 273- 280.Google ScholarGoogle Scholar
  22. SALT78.Salton, G., Wong, A.: Generation and Search of Clustered Files, TODS 1978, pp. 321 - 346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. SALT83a.Salton, G. Fox, E.A., Wu, H.: Extended Boolean Information Retrieval, CACM 1983, pp. 1022- 1036. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. SALT83b.Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval, New York: McGraw-Hill 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. SCHE77.Schek, H.-J.: Tolerating Fussiness in Keywords by Similarity Searches, Kybernetes 1977, pp. 175- 184.Google ScholarGoogle ScholarCross RefCross Ref
  26. SCHE78.Schek, H.-J.: The Reference String Access Method and Partial Match Retrieval, IBM Scientific Center Report TR 77.12.008; partly contained in: The Reference String Indexing Method, in: Bracchi, G., Lockemann, P.C. (eds.): Proc. Information System Methodolog3' 1978, Springer LNCS 65, pp. 432- 459. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. SCHE81.Schek, H.-J.: Methods for the Administration of Textual Data in Data Base Systems, in: Oddy, R.N. et al. (eds.): Information Retrieval Research, London: Butherworth 1981, pp. 218 - 235. Google ScholarGoogle Scholar
  28. SCHE82.Schek, H.-J., Pistor, P.: Data Structures for an Integrated Data Base Management and Information Retrieval System, Proc. VLDB 1982, pp. 197- 207. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. SEMS86.Semsroth, T.: Experimental Studies on S- Trees, Master Thesis, TU Darmstadt, 1986, forthcoming.Google ScholarGoogle Scholar
  30. TSIC85a.Teichritzis, D. (ed.): Office Automation, Berlin, Heidelberg: Springer 1985.Google ScholarGoogle Scholar
  31. TSIC85b.Tsichritzis, D., Christodoulakis, S., Lee, A., Vandenbroek, J.: A Multimedia Filing System, in: TSIC85a, pp. 43- 65.Google ScholarGoogle Scholar
  1. S-tree: a dynamic balanced signature index for office retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '86: Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
      September 1986
      283 pages
      ISBN:0897911873
      DOI:10.1145/253168

      Copyright © 1986 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 September 1986

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader