skip to main content
10.1145/96749.98226acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free Access

Surrogate subsets: a free space management strategy for the index of a text retrieval system

Authors Info & Claims
Published:01 December 1989Publication History

ABSTRACT

This paper presents a new data structure and an associated strategy to be utilized by indexing facilities for text retrieval systems. The paper starts by reviewing some of the goals that may be considered when designing such an index and continues with a small survey of various current strategies. It then presents an indexing strategy referred to as surrogate subsets discussing its appropriateness in the light of the specified goals. Various design issues and implementation details are discussed. Our strategy requires that a surrogate file be divided into a large number of subsets separated by free space which will allow the index to expand when new material is appended to the database. Experimental results report on the utilization of free space when the database is enlarged.

References

  1. BAY72.BAYER, R. AND MCCREIGHT, E., Organization and maintenance of large ordered indexes, Acta Informatica, vol. 1, no. 3, 1972, pp. 173-189. ~Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. BER87.BERRA, P. B., CHUNG, S. M. AND HACHEM, N. I., Computer architecture for a surrogate file to a very large data / knowledge base, IEEE Computer, vol. 20, no. 3, 1987, pp. 25-32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. CHR84.CHRISTODOULAKIS, S. AND FALOUTSOS, C., Design considerations for a message file server, IEEE Trans. Sofn#,are Engineering, Vol. SE-10, No. 2, Mar. 1984, pp. 201-210.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. CHR86.CHRISTODOULAKIS, S. AND FALOUTSOS, C., Design and performance considerations for an optical disk-based, multimedia object server, Computer, Vol. 19, No. 12, Dec. 1986, pp. 45-56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. DEF88.DEFAZIO, S. AND GREENWALD, C., The Mead information retrieval system, IEEE Compcon 88, Feb. 1988, pp. 431.Google ScholarGoogle Scholar
  6. DEF89.DEFAZIO, S., Private communication.Google ScholarGoogle Scholar
  7. FAL84.FALOUTSOS, C. AND CHRISTODOULAKIS, S., Signature f'des: an access method for documents and its analytical performance evaluation, A CM Transactions on Office information Systems, Vol. 2, No. 4, Oct. 1984, pp.267-288. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. FAL85.FALOUTSOS, C., Access methods for text, Computing Surveys, Vol. 17, No. 1, Mar. 1985, pp. 49-74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. FAL87D.FALOUTSOS, C. AND CHAN, R., Fast text access methods for optical disks- designs and performance comparison, UMIACS-TR-87-66, CS-TR- 1958, Dept. of Comp. Sci. and Inst. for Adv. Comp. Studies, Univ. of Maryland, Dec. 1987, 29 pages.Google ScholarGoogle Scholar
  10. FAL87S.FALOUTSOS, C. AND CHRISTOIX)ULAKIS, S., Optimal signature extraction and information loss, ACM Transactions on Database Systems, Vol. 12, No.3, Sept. 1987, pp. 395-428. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. HAS81.HASKIN, R. L., Special purpose processors for text retrieval, Database Engineering, Vol. 4, No. 1, Sept. 1981, pp. 16-29.Google ScholarGoogle Scholar
  12. LAR83.LARSON, P. A., A method for speeding up text retrieval, Proceedings of ACM SIGMOD Conference, May, ACM, New York, 1983, pp. 117-123. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. ROB79.ROBERTS, C. S., Partial-match retrieval via the method of superimposed codes, Proc. IEEE, 67,12, Dec. 1979, 1624-1642.Google ScholarGoogle ScholarCross RefCross Ref
  14. SAL86.S ALTON, G., Another look at automatic text retrieval systems, Communications of the ACM, Vol. 29, No. 7, July 1986, pp. 648-656. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. STA86.STANFILL, C. AND KAHLE, t}., Parallel free-text search on the connection machine system, Communications of the ACM, Vol. 29, No. 12, Dec. 1986, pp. 1229-1239. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. TEO82.TEORY, T. J. AND FRY, J. P., Design of Database Structures, Prentice Hall, Englewood Cliffs, New Jersey, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. TSI83.TSICHRITZIS D. AND CHRISTODOLR.AKiS, S., Message files, ACM Trans. Office Information Systems, Vol. 1, No. 1, Jan. 1983, pp. 88-98. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Surrogate subsets: a free space management strategy for the index of a text retrieval system

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGIR '90: Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
              December 1989
              509 pages
              ISBN:0897914082
              DOI:10.1145/96749

              Copyright © 1989 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 December 1989

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate792of3,983submissions,20%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader